BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 012359
         (465 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  561 bits (1445), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 277/444 (62%), Positives = 339/444 (76%), Gaps = 21/444 (4%)

Query: 24  SSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISS 83
           SS  S+  S S  + NPSQD  Q LN LVS+SL RA H+KNPQT           T + S
Sbjct: 23  SSFISIPLSHSYTNQNPSQDHLQKLNYLVSTSLARAHHLKNPQT-----------TPVFS 71

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPSFIPKLSSSS 142
           HSYGGYSISLSFGTPPQ + F++DTGS  VWFPCT  Y C  CS +S+I  F+PK SSSS
Sbjct: 72  HSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSS 131

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
           +++GC+NPKCSWIH   ++C DC++     S+NC+QICP YL+LYGSG T G+ALSETL+
Sbjct: 132 KIIGCKNPKCSWIHQTDLRCTDCDNN----SRNCSQICPPYLILYGSGTTGGVALSETLH 187

Query: 203 LPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
           L   I+PNFLVGCSV SSRQPAGIAGFGRG +SLPSQL L KFSYCLLSHKFDDT  +SS
Sbjct: 188 LHGLIVPNFLVGCSVFSSRQPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDTQESSS 247

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L+LD+  S SDKKT  L YTP V NP V ++ AFSVYYYV LRRI++GG+ V++ +KYL+
Sbjct: 248 LVLDS-QSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLS 306

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            D+DGNGGTI+DSGTTFT+M+ E FE L++EF+SQ+   +NY RAL  EAL+GL+PCF+V
Sbjct: 307 PDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQV---KNYERALMVEALSGLKPCFNV 363

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD-REASGGPSIILGNF 441
            G K    P+L+LHFKGGA+V LP+ENYFA +G     C TVVTD  E + GP +ILGNF
Sbjct: 364 SGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGNF 423

Query: 442 QMQNYYVEYDLRNQRLGFKQQLCK 465
           QMQN+YVEYDL+N+RLGFK++ CK
Sbjct: 424 QMQNFYVEYDLQNERLGFKKESCK 447


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  539 bits (1389), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 287/459 (62%), Positives = 338/459 (73%), Gaps = 29/459 (6%)

Query: 21  IFPSSITSLTFSLSRFHTN--PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTT 78
           +FP   +S+T  L    TN  P QD YQ LN LV++SL RA H+KNPQT   TTTT    
Sbjct: 1   LFPFISSSITIPLQHPQTNQIPFQDQYQKLNHLVTTSLARARHLKNPQTTPATTTTAP-- 58

Query: 79  TNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS------KIP 132
             + SHSYGGYS+SLSFGTPPQ + FI+DTGS +VWFPCT+HY CK+CS S      +I 
Sbjct: 59  --LFSHSYGGYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQ 116

Query: 133 SFIPKLSSSSRLLGCQNPKCSWIHHESIQC-RDCNDEPLATSKNC-TQICPSYLVLYGSG 190
            FIPK SSSS+LLGC+NPKCSWIHH +I C +DC      + K+C  Q CP Y++ YGSG
Sbjct: 117 PFIPKESSSSKLLGCKNPKCSWIHHSNINCDQDC------SIKSCLNQTCPPYMIFYGSG 170

Query: 191 LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
            T G+ALSETL+L +   PNFLVGCSV SS QPAGIAGFGRG +SLPSQL L KFSYCLL
Sbjct: 171 TTGGVALSETLHLHSLSKPNFLVGCSVFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLL 230

Query: 251 SHKFDD-TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           SH+FDD T ++SSL+LD     SDKKT  L YTPFV NP V  +++FSVYYY+GLRRITV
Sbjct: 231 SHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITV 290

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           GG  V+V +KYL+   DGNGG I+DSGTTFTFMA E FEPL+DEF+ Q+   ++Y R   
Sbjct: 291 GGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQI---KDYRRVKE 347

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR- 428
            E   GLRPCF+V   KT SFPEL+L+FKGGA+V LPVENYFA VG G   CLTVVTD  
Sbjct: 348 IEDAIGLRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVG-GEVACLTVVTDGV 406

Query: 429 ---EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              E  GGP +ILGNFQMQN+YVEYDLRN+RLGFKQ+ C
Sbjct: 407 AGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 275/449 (61%), Positives = 331/449 (73%), Gaps = 25/449 (5%)

Query: 27  TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY 86
           + +T  LS    +P  D Y+NL  LVS+SL RA H+KN        TT T+TT + +HSY
Sbjct: 34  SPITLPLSASKPSPPPDPYRNLRHLVSASLIRARHLKN------PKTTPTSTTPLFTHSY 87

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPS---FIPKLSSSS 142
           G YSI LSFGTPPQ +P I+DTGS LVWFPCT+ Y C+ CS S+  PS   FIPK SSSS
Sbjct: 88  GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147

Query: 143 RLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           ++LGC NPKC WIH   +Q  CRDC  EP  TS NCTQICP YLV YGSG+T GI LSET
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDC--EP--TSPNCTQICPPYLVFYGSGITGGIMLSET 203

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
           L+LP + +PNF+VGCSVLS+ QPAGI+GFGRG  SLPSQL L KFSYCLLS ++DDTT +
Sbjct: 204 LDLPGKGVPNFIVGCSVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTES 263

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           SSL+LD G S S +KT GL+YTPFV NP VA ++AFSVYYY+GLR ITVGG+ V++ +KY
Sbjct: 264 SSLVLD-GESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKY 322

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           L    DG+GGTI+DSGTTFT+M  E+FE +A EF  Q+       RA   E +TGLRPCF
Sbjct: 323 LIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSK----RATEVEGITGLRPCF 378

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD----REASGGPSI 436
           ++ G  T SFPEL L F+GGAE+ LP+ NY A +G    VCLT+VTD    +E SGGP+I
Sbjct: 379 NISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAI 438

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ILGNFQ QN+YVEYDLRN+RLGF+QQ CK
Sbjct: 439 ILGNFQQQNFYVEYDLRNERLGFRQQSCK 467


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 243/461 (52%), Positives = 313/461 (67%), Gaps = 33/461 (7%)

Query: 21  IFPSSIT-SLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTT 79
           I PS+IT  L+ ++++    PS D ++ LN L ++S++RA H+K+P+T  +   T     
Sbjct: 23  ISPSTITIPLSPTITK---RPSSDPWEYLNHLATTSISRAHHLKSPKTNFSLIKTP---- 75

Query: 80  NISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSF 134
            + S SYGGYS+SLS GTP Q +  I+DTGS LVWFPCT+ Y C  C+      +KIP F
Sbjct: 76  -LFSRSYGGYSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKF 134

Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLT 192
           +P+LSSSS+L+GC+NPKC+W+   S+Q  C +CN +    ++NCTQ CP Y++ YG G T
Sbjct: 135 MPRLSSSSKLIGCKNPKCAWVFGSSVQSKCHNCNPQ----AQNCTQACPPYIIQYGLGST 190

Query: 193 EGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
            G+ LSET+N PN+ I +FL GCS+LS+RQP GIAGFGR + SLP QL L KFSYCL+S 
Sbjct: 191 AGLLLSETINFPNKTISDFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGLKKFSYCLVSR 250

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
           +FDD+  +S LILD G S SD KTTGL+YTPF  N +     AF  YYYV LR+I VG  
Sbjct: 251 RFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKT 310

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            V+V + +L    DGNGGTIVDSG+TFTF+   +FE LA EF  QM    NYT A   + 
Sbjct: 311 HVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMA---NYTVATNVQK 367

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA-- 430
           LTGLRPCFD+ GEK+   P+L   FKGGA++ LP+ NYFA V  G  VCLT+V+D  A  
Sbjct: 368 LTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMG-VVCLTIVSDNAAAL 426

Query: 431 -------SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                  S GP+IILGNFQ QN+Y+EYDL N R GFK+Q C
Sbjct: 427 GGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 249/471 (52%), Positives = 308/471 (65%), Gaps = 31/471 (6%)

Query: 6   SALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNP 65
           S L   ++  F+ LS    S   +T  L+ F    S D  Q L  L SSS TRA  IK P
Sbjct: 5   SPLSFFYLLLFSSLSAIAHS-NPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTP 63

Query: 66  QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
           ++ +   +       +S HSYG YS  LSFGTP Q +  I DTGS LVWFPCT+ Y C  
Sbjct: 64  KSNSVFKSP------LSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSE 117

Query: 126 CSSSKI-----PSFIPKLSSSSRLLGCQNPKCSWIHHESI--QCRDCNDEPLATSKNCTQ 178
           CS  KI     P F+PKLSSSS+L+GCQNPKCSWI    +  QCR CN +    ++NCTQ
Sbjct: 118 CSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPK----TENCTQ 173

Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPS 238
            CP+Y+V YGSG T G+ LSETL+ P++ IPNF+VGCS LS  QP+GIAGFGRG  SLPS
Sbjct: 174 TCPAYVVQYGSGSTAGLLLSETLDFPDKXIPNFVVGCSFLSIHQPSGIAGFGRGSESLPS 233

Query: 239 QLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
           Q+ L KF+YCL S KFDD+  +  LILD+    +  K++GLTYTPF  NPSV+  NA+  
Sbjct: 234 QMGLKKFAYCLASRKFDDSPHSGQLILDS----TGVKSSGLTYTPFRQNPSVSN-NAYKE 288

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           YYY+ +R+I VG Q V+V +K+L    DGNGG+I+DSG+TFTFM   + E +A EF  Q+
Sbjct: 289 YYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQL 348

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
               N+TRA   E LTGLRPCFD+  EK+  FPEL   FKGGA+  LP+ NYFA+V    
Sbjct: 349 A---NWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSG 405

Query: 419 AVCLTVVTDR-----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             CLTVVT +        GGPS+ILG FQ QN+YVEYDL NQRLGF+QQ C
Sbjct: 406 VACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  460 bits (1183), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 249/471 (52%), Positives = 308/471 (65%), Gaps = 31/471 (6%)

Query: 6   SALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNP 65
           S L   ++  F+ LS    S   +T  L+ F    S D  Q L  L SSS TRA  IK P
Sbjct: 5   SPLSFFYLLLFSSLSAIAHS-NPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTP 63

Query: 66  QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
           ++ +   +       +S HSYG YS  LSFGTP Q +  I DTGS LVWFPCT+ Y C  
Sbjct: 64  KSNSVFKSP------LSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSE 117

Query: 126 CSSSKI-----PSFIPKLSSSSRLLGCQNPKCSWIHHESI--QCRDCNDEPLATSKNCTQ 178
           CS  KI     P F+PKLSSSS+L+GCQNPKCSWI    +  QCR CN +    ++NCTQ
Sbjct: 118 CSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPK----TENCTQ 173

Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPS 238
            CP+Y+V YGSG T G+ LSETL+ P++ IPNF+VGCS LS  QP+GIAGFGRG  SLPS
Sbjct: 174 TCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPS 233

Query: 239 QLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
           Q+ L KF+YCL S KFDD+  +  LILD+    +  K++GLTYTPF  NPSV+  NA+  
Sbjct: 234 QMGLKKFAYCLASRKFDDSPHSGQLILDS----TGVKSSGLTYTPFRQNPSVSN-NAYKE 288

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           YYY+ +R+I VG Q V+V +K+L    DGNGG+I+DSG+TFTFM   + E +A EF  Q+
Sbjct: 289 YYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQL 348

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
               N+TRA   E LTGLRPCFD+  EK+  FPEL   FKGGA+  LP+ NYFA+V    
Sbjct: 349 A---NWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSG 405

Query: 419 AVCLTVVTDR-----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             CLTVVT +        GGPS+ILG FQ QN+YVEYDL NQRLGF+QQ C
Sbjct: 406 VACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 237/470 (50%), Positives = 319/470 (67%), Gaps = 25/470 (5%)

Query: 8   LCLSFIFFFTLLSIFPSSIT-SLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQ 66
           L LS +      S  P++IT  L+  L + H++ S D + +L    S+SLTRA H+K+  
Sbjct: 15  LLLSLLSHIAFTSSNPNTITLPLSPLLIKPHSSDS-DPFHSLKFAASASLTRAHHLKHRN 73

Query: 67  TKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
             + +  TT         SYGGYSI L+ GTPPQ  PF+LDTGS LVWFPCT+ Y C +C
Sbjct: 74  NNSPSVATTPAY----PKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHC 129

Query: 127 S-----SSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQI 179
           +     ++KIP+FIPK SS+++LLGC+NPKC +I    +Q  C  C  E    S+NC+  
Sbjct: 130 NFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYIFGSDVQFRCPQCKPE----SQNCSLT 185

Query: 180 CPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQ 239
           CP+Y++ YG G T G  L + LN P + +P FLVGCS+LS RQP+GIAGFGRG+ SLPSQ
Sbjct: 186 CPAYIIQYGLGSTAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIAGFGRGQESLPSQ 245

Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
           +NL +FSYCL+SH+FDDT ++S L+L   SS  D KT GL+YTPF +NPS     AF  Y
Sbjct: 246 MNLKRFSYCLVSHRFDDTPQSSDLVLQI-SSTGDTKTNGLSYTPFRSNPST-NNPAFKEY 303

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           YY+ LR++ VGG+ V++ + +L    DGNGGTIVDSG+TFTFM   ++  +A EFV Q+ 
Sbjct: 304 YYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLE 363

Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
           K  NY+RA  AE  +GL PCF++ G KT +FPEL   FKGGA++T P++NYF++VG+   
Sbjct: 364 K--NYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEV 421

Query: 420 VCLTVVTDREA----SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           VCLTVV+D  A    + GP+IILGN+Q QN+Y+EYDL N+R GF  + C+
Sbjct: 422 VCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSCR 471


>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 231/441 (52%), Positives = 304/441 (68%), Gaps = 25/441 (5%)

Query: 36  FHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSF 95
           F  NPS D +Q L+ L S+SLTRA H+K+ +       T++  T + +HSYGGYS+SLSF
Sbjct: 43  FTKNPSSDPWQLLSHLTSASLTRAHHLKHRKN------TSSVNTPLFAHSYGGYSVSLSF 96

Query: 96  GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSRLLGCQNP 150
           GTP Q + F++DTGS LVWFPCT+ Y C  CS      +KIP+FIPKLSSS++++GC NP
Sbjct: 97  GTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNP 156

Query: 151 KCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           KC ++    ++  C  C+      S NCT+ CP+Y + YG G T G+ L E+L    R  
Sbjct: 157 KCGFVMDSEVRTRCPGCDQN----SANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTE 212

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
           P+F+VGCS+LSSRQP+GIAGFGRG +SLP Q+ L KFSYCLLSH+FDD+ ++S + L  G
Sbjct: 213 PDFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVG 272

Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
               D KT GL+YTPF  NP V+  +AF  YYYV LR I VG +RV+V + ++    DGN
Sbjct: 273 PDSKDDKTGGLSYTPFRKNP-VSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGN 331

Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG 388
           GGTIVDSG+TFTFM   +FE +A EF  QM    NYTRA   EAL+GL+PCF++ G  + 
Sbjct: 332 GGTIVDSGSTFTFMEKPVFEAVATEFDRQMA---NYTRAADVEALSGLKPCFNLSGVGSV 388

Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQ 444
           + P L   FKGGA++ LPV NYF++VG+ S +CLT+V++       S GPSIILGN+Q Q
Sbjct: 389 ALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQ 448

Query: 445 NYYVEYDLRNQRLGFKQQLCK 465
           N+Y EYDL N+R GF++Q CK
Sbjct: 449 NFYTEYDLENERFGFRRQRCK 469


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 233/436 (53%), Positives = 298/436 (68%), Gaps = 29/436 (6%)

Query: 41  SQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQ 100
           S++ +  LN L S SL+RA HIK+P+TK +   T      +   SYGGYSISL+FGTPPQ
Sbjct: 49  SKNPWGALNHLASLSLSRAHHIKSPKTKFSLLKTP-----LFPRSYGGYSISLNFGTPPQ 103

Query: 101 IIPFILDTGSHLVWFPCTNHYQCKYC-----SSSKIPSFIPKLSSSSRLLGCQNPKCSWI 155
              F++DTGS LVWFPCT+ Y C  C       + IP+FIPK SSSS L+GC+N KCSW+
Sbjct: 104 TTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWL 163

Query: 156 HHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-IIPNFL 212
               +Q  C++C+     T++NCTQ CP Y++ YG G T G+ LSETL+ P++  IP FL
Sbjct: 164 FGPKVQSKCQECD----PTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDFPHKKTIPGFL 219

Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
           VGCS+ S RQP GIAGFGR   SLPSQL L KFSYCL+SH FDDT  +S L+LD GS   
Sbjct: 220 VGCSLFSIRQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSD 279

Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
           D KT GL+YTPF  NP+ A R+    YYYV LR I +G   V+V +K+L    DGNGGTI
Sbjct: 280 DTKTPGLSYTPFQKNPTAAFRD----YYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTI 335

Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
           VDSGTTFTFM   ++E +A EF  Q+    +YT A   +  TGLRPCF++ GEK+ S PE
Sbjct: 336 VDSGTTFTFMEKPVYELVAKEFEKQVA---HYTVATEVQNQTGLRPCFNISGEKSVSVPE 392

Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQNYYV 448
              HFKGGA++ LP+ NYF+ V  G  +CLT+V+D  +     GGP+IILGN+Q +N++V
Sbjct: 393 FIFHFKGGAKMALPLANYFSFVDSG-VICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHV 451

Query: 449 EYDLRNQRLGFKQQLC 464
           E+DL+N+R GFKQQ C
Sbjct: 452 EFDLKNERFGFKQQNC 467


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 230/460 (50%), Positives = 308/460 (66%), Gaps = 29/460 (6%)

Query: 26  ITSLTFSLSRF-HTNPS-QDSYQNLNSLVSSSLTRALHIK---------NPQTKTTTTTT 74
           ++++   LS F H++ S +D Y +L  L  SS+ RA  +K         +  + TTT + 
Sbjct: 16  VSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASA 75

Query: 75  TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK---- 130
           T   + +S+ SYGGYS+SLSFGTP Q IPF+ DTGS LVW PCT+ Y C  C  S     
Sbjct: 76  TVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPT 135

Query: 131 -IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
            IP FIPK SSSS+++GCQ+PKC +++  ++QCR C+      ++NCT  CP Y++ YG 
Sbjct: 136 LIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCD----PNTRNCTVGCPPYILQYGL 191

Query: 190 GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
           G T G+ ++E L+ P+  +P+F+VGCS++S+RQPAGIAGFGRG  SLPSQ+NL +FS+CL
Sbjct: 192 GSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCL 251

Query: 250 LSHKFDDTTRTSSLILDNGSSH-SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           +S +FDDT  T+ L LD GS H S  KT GLTYTPF  NP+V+ + AF  YYY+ LRRI 
Sbjct: 252 VSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNK-AFLEYYYLNLRRIY 310

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VG + V++ +KYL    +G+GG+IVDSG+TFTFM   +FE +A+EF SQM    NYTR  
Sbjct: 311 VGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM---SNYTREK 367

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
             E  TGL PCF++ G+   + PEL   FKGGA++ LP+ NYF  VG    VCLTVV+D+
Sbjct: 368 DLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDK 427

Query: 429 ----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                   GP+IILG+FQ QNY VEYDL N R GF ++ C
Sbjct: 428 TVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 231/460 (50%), Positives = 309/460 (67%), Gaps = 29/460 (6%)

Query: 26  ITSLTFSLSRF-HTNPS-QDSYQNLNSLVSSSLTRALHIKN-----PQTKTTTTTTTTTT 78
           ++++   LS F H++ S +D Y +L  L  SS+ RA  +K+     P  +  ++T T + 
Sbjct: 16  VSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEEALSSTATASA 75

Query: 79  TNISSH----SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----- 129
           T + SH    SYGGYS+SLSFGTP Q IPF+ DTGS LVWFPCT+ Y C  C+ S     
Sbjct: 76  TVVKSHLSPKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPT 135

Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
           +IP FIPK SSSSR++GCQNPKC ++   ++QCR C+      ++NCT  CP Y++ YG 
Sbjct: 136 QIPRFIPKNSSSSRVIGCQNPKCQFLFGANVQCRGCD----PNTRNCTVPCPPYILQYGL 191

Query: 190 GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
           G T GI +SE L+ P+  +P+F+VGCSV+S+R PAGIAGFGRG  SLPSQ+ L  FS+CL
Sbjct: 192 GSTAGILISEKLDFPDLTVPDFVVGCSVISTRTPAGIAGFGRGPESLPSQMKLKSFSHCL 251

Query: 250 LSHKFDDTTRTSSLILDNGSSH-SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           +S +FDDT  T+ L LD GS H S  KT GL+YTPF  NP+V+   AF  YYY+ LRRI 
Sbjct: 252 VSRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSN-TAFLEYYYLNLRRIY 310

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VG + V++ +K+L    +GNGG+IVDSG+TFTFM   +FE +A+EF +QM    NYTR  
Sbjct: 311 VGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQM---SNYTREK 367

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
             E ++G+ PCF++ G+   + PEL   FKGGA++ LP+ NYF+ VG    VCLTVV+D 
Sbjct: 368 DLEKVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDN 427

Query: 429 EAS----GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +     GP+IILG+FQ QNY VEYDL N R GF ++ C
Sbjct: 428 TVNPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 229/440 (52%), Positives = 302/440 (68%), Gaps = 25/440 (5%)

Query: 36  FHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSF 95
           F  NPS D +Q L+ L S+SLTRA H+K+ +       T++  T + +HSYGGYS+SLSF
Sbjct: 43  FTKNPSSDPWQLLSHLTSASLTRAHHLKHRKN------TSSVNTPLFAHSYGGYSVSLSF 96

Query: 96  GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSRLLGCQNP 150
           GTP Q + F++DTGS LVWFPCT+ Y C  CS      +KIP+FIPKLSSS++++GC NP
Sbjct: 97  GTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNP 156

Query: 151 KCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           KC ++    ++  C  C+      S NCT+ CP+Y + YG G T G+ L E+L    R  
Sbjct: 157 KCGFVMDSEVRTRCPGCDQN----SANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTE 212

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
           P+F+VGCS+LSSRQP+GIAGFGRG +SLP Q+ L KFSYCLLSH+FDD+ ++S + L  G
Sbjct: 213 PDFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVG 272

Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
               D KT GL+YTPF  NP V+  +AF  YYYV LR I VG +RV+  + ++    DGN
Sbjct: 273 PDSKDDKTGGLSYTPFRKNP-VSSNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGN 331

Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG 388
           GGTIVDSG+TFTFM   +FE +A EF  QM    NYTRA   EAL+GL+PCF++ G  + 
Sbjct: 332 GGTIVDSGSTFTFMEKPVFEAVATEFDRQMA---NYTRAADVEALSGLKPCFNLSGVGSV 388

Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQ 444
           + P L   FKGGA++ LPV NYF++VG+ S +CLT+V++       S GPSIILGN+Q Q
Sbjct: 389 ALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQ 448

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
           N+Y EYDL N+R GF++Q C
Sbjct: 449 NFYTEYDLENERFGFRRQRC 468


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score =  443 bits (1139), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 232/470 (49%), Positives = 312/470 (66%), Gaps = 22/470 (4%)

Query: 6   SALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNP 65
           S L + F+F   LL    SS ++    L+ F +    D ++ +N L+S+SL RA H+K P
Sbjct: 52  SFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKTP 111

Query: 66  QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
           Q+K+ T+    +   +   SYG YS+SL+FGTPPQ + FI DTGS LVWFPCT  Y+C  
Sbjct: 112 QSKSNTSIQNVS---LFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSR 168

Query: 126 CS-----SSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQ 178
           CS      + I  F+PKLSSS +++GC+NPKC+WI   +++  CR+CN +    S+ C+ 
Sbjct: 169 CSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSK----SRKCSD 224

Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPS 238
            CP Y + YGSG T GI LSETL+L N+ +P+FLVGCSV+S  QPAGIAGFGRG  SLPS
Sbjct: 225 SCPGYGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPS 284

Query: 239 QLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
           Q+ L +FS+CL+S  FDD+  +S L+LD+GS   + KT    Y PF  NPSV+   AF  
Sbjct: 285 QMRLKRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNA-AFRE 343

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           YYY+ LRRI +GG+ V+  +KYL  D  GNGG I+DSG+TFTF+   +FE +ADE   Q+
Sbjct: 344 YYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQL 403

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPG-EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
           VK   Y RA   EA +GLRPCF++P  E++  FP++ L FKGG +++L  ENY A+V + 
Sbjct: 404 VK---YPRAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDE 460

Query: 418 SAVCLTVVTDRE---ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             VCLT++TD       GGP+IILG FQ QN  VEYDL  QR+GF++Q C
Sbjct: 461 GVVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 234/463 (50%), Positives = 307/463 (66%), Gaps = 23/463 (4%)

Query: 14  FFFTLLSIFPSSI-TSLTFSLSRFHTN-PSQDS--YQNLNSLVSSSLTRALHIKNPQTKT 69
           F   +++ F SS   ++T  LS   TN PS  S  +  L   VS+S+TRA H+KN +   
Sbjct: 13  FLSIIITTFSSSTPNTITLHLSPLFTNHPSSSSHPFHTLKLAVSTSITRAHHLKNHKPNK 72

Query: 70  TTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS- 128
           +  T       +   +YGGYSI L FGTP Q  PF+LDTGS LVW PC++HY C  C+S 
Sbjct: 73  SLETP------VHPKTYGGYSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSF 126

Query: 129 SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG 188
           S  P FIPK SSSS+ +GC NPKC+W+    ++   C  +  A   NC+Q CP+Y V YG
Sbjct: 127 SNTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHCCRQDK-AAFNNCSQTCPAYTVQYG 185

Query: 189 SGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
            G T G  LSE LN P +   +FL+GCSV+S  QPAGIAGFGRG+ SLPSQ+NL +FSYC
Sbjct: 186 LGSTAGFLLSENLNFPTKKYSDFLLGCSVVSVYQPAGIAGFGRGEESLPSQMNLTRFSYC 245

Query: 249 LLSHKFDDT-TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           LLSH+FDD+ T TS+L+L+  SS  D KT G++YTPF+ NP+  +  AF  YYY+ L+RI
Sbjct: 246 LLSHQFDDSATITSNLVLETASSR-DGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRI 304

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
            VG +RVRV  + L  + DG+GG IVDSG+TFTFM   +F+ +A EF  Q+    +YTRA
Sbjct: 305 VVGEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQV----SYTRA 360

Query: 368 LGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
             AE   GL PCF +  G +T SFPEL+  F+GGA++ LPV NYF++VG+G   CLT+V+
Sbjct: 361 REAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVS 420

Query: 427 DREASG----GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           D  A      GP++ILGN+Q QN+YVEYDL N+R GF+ Q C+
Sbjct: 421 DDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSCQ 463


>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
          Length = 454

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 248/449 (55%), Positives = 299/449 (66%), Gaps = 38/449 (8%)

Query: 27  TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY 86
           + +T  LS    +P  D Y+NL  LVS+SL RA H+KN        TT T+TT + +HSY
Sbjct: 34  SPITLPLSASKPSPPPDPYRNLRHLVSASLIRARHLKN------PKTTPTSTTPLFTHSY 87

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPS---FIPKLSSSS 142
           G YSI LSFGTPPQ +P I+DTGS LVWFPCT+ Y C+ CS S+  PS   FIPK SSSS
Sbjct: 88  GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147

Query: 143 RLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           ++LGC NPKC WIH   +Q  CRDC  EP  TS NCTQICP YL                
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDC--EP--TSPNCTQICPPYLNFLRFWDHRRSQFHRR 203

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
           +  P             L       I+GFGRG  SLPSQL L KFSYCLLS ++DDTT +
Sbjct: 204 MLCP-------------LHQSTRREISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTES 250

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           SSL+LD G S S +KT GL+YTPFV NP VA ++AFSVYYY+GLR ITVGG+ V++ +KY
Sbjct: 251 SSLVLD-GESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKY 309

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           L    DG+GGTI+DSGTTFT+M  E+FE +A EF  Q+       RA   E +TGLRPCF
Sbjct: 310 LIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSK----RATEVEGITGLRPCF 365

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD----REASGGPSI 436
           ++ G  T SFPEL L F+GGAE+ LP+ NY A +G    VCLT+VTD    +E SGGP+I
Sbjct: 366 NISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAI 425

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ILGNFQ QN+YVEYDLRN+RLGF+QQ CK
Sbjct: 426 ILGNFQQQNFYVEYDLRNERLGFRQQSCK 454


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score =  442 bits (1137), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 232/436 (53%), Positives = 303/436 (69%), Gaps = 29/436 (6%)

Query: 41  SQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQ 100
           S+  + +LN L S SL+RA HIK+P+T  +   T      +   SYGGYSISL+FGTPPQ
Sbjct: 40  SKKPWGSLNHLASLSLSRAHHIKSPKTNFSLIKTP-----LFPRSYGGYSISLNFGTPPQ 94

Query: 101 IIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSRLLGCQNPKCSWI 155
              F++DTGS LVWFPCT+ Y C  C+      + IP+F+PKLSSSS+L+GC+NP+CS I
Sbjct: 95  TTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMI 154

Query: 156 HHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-IIPNFL 212
               IQ  C++C+    +T++NCTQ CP Y++ YGSG T G+ LSETL+ PN+  IP+FL
Sbjct: 155 FGPEIQSKCQECD----STAQNCTQTCPPYVIQYGSGSTAGLLLSETLDFPNKKTIPDFL 210

Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
           VGCS+ S +QP GIAGFGR   SLPSQL L KFSYCL+SH FDDT  +S L+LD GS   
Sbjct: 211 VGCSIFSIKQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGSG 270

Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
             KT GL++TPF+ NP+ A R+    YYYV LR I +G   V+V +K+L    DGNGGTI
Sbjct: 271 VTKTAGLSHTPFLKNPTTAFRD----YYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTI 326

Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
           VDSGTTFTFM   ++E +A EF  QM    +YT A   + LTGLRPC+++ GEK+ S P+
Sbjct: 327 VDSGTTFTFMENPVYELVAKEFEKQMA---HYTVATEIQNLTGLRPCYNISGEKSLSVPD 383

Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR----EASGGPSIILGNFQMQNYYV 448
           L   FKGGA++ LP+ NYF++V  G  +CLT+V+D        GGP+IILGN+Q +N+YV
Sbjct: 384 LIFQFKGGAKMALPLSNYFSIVDSG-VICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYV 442

Query: 449 EYDLRNQRLGFKQQLC 464
           E+DL N++ GFKQQ C
Sbjct: 443 EFDLENEKFGFKQQSC 458


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 228/450 (50%), Positives = 301/450 (66%), Gaps = 22/450 (4%)

Query: 28  SLTFSLSRFHTNP---SQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSH 84
           S+T  LS   T P     D + ++    SSSLTRA H+K+    + +  TT         
Sbjct: 28  SITLPLSPLLTKPHSSDSDPFHSVKLAASSSLTRAHHLKHRNNNSPSVATTPAYPK---- 83

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLS 139
           SYGGYSI L+ GTPPQ  PF+LDTGS LVWFPCT+HY C +C+      +KIP+FIPK S
Sbjct: 84  SYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNS 143

Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
           S+++LLGC+NPKC ++    ++ R C       S+NC+  CPSY++ YG G T G  L +
Sbjct: 144 STAKLLGCRNPKCGYLFGPDVESR-CPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLD 202

Query: 200 TLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
            LN P + +P FLVGCS+LS RQP+GIAGFGRG+ SLPSQ+NL +FSYCL+SH+FDDT +
Sbjct: 203 NLNFPGKTVPQFLVGCSILSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQ 262

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           +S L+L   SS  D KT GL+YTPF +NPS    + F  YYYV LR++ VGG  V++ +K
Sbjct: 263 SSDLVLQI-SSTGDTKTNGLSYTPFRSNPS--NNSVFREYYYVTLRKLIVGGVDVKIPYK 319

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
           +L    DGNGGTIVDSG+TFTFM   ++  +A EF+ Q+ K   Y+R    EA +GL PC
Sbjct: 320 FLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGK--KYSREENVEAQSGLSPC 377

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPS 435
           F++ G KT SFPE    FKGGA+++ P+ NYF+ VG+   +C TVV+D  A    + GP+
Sbjct: 378 FNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGPA 437

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           IILGN+Q QN+YVEYDL N+R GF  + CK
Sbjct: 438 IILGNYQQQNFYVEYDLENERFGFGPRNCK 467


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 229/460 (49%), Positives = 307/460 (66%), Gaps = 29/460 (6%)

Query: 26  ITSLTFSLSRF-HTNPS-QDSYQNLNSLVSSSLTRALHIK---------NPQTKTTTTTT 74
           ++++   LS F H++ S +D Y +L  L  SS+ RA  +K         +  + TTT + 
Sbjct: 16  VSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASA 75

Query: 75  TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK---- 130
           T   + +S+ SYGGYS+SLSFGTP Q IPF+ DTGS LV  PCT+ Y C  C  S     
Sbjct: 76  TVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPT 135

Query: 131 -IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
            IP FIPK SSSS+++GCQ+PKC +++  ++QCR C+      ++NCT  CP Y++ YG 
Sbjct: 136 LIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCD----PNTRNCTVGCPPYILQYGL 191

Query: 190 GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
           G T G+ ++E L+ P+  +P+F+VGCS++S+RQPAGIAGFGRG  SLPSQ+NL +FS+CL
Sbjct: 192 GSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCL 251

Query: 250 LSHKFDDTTRTSSLILDNGSSH-SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           +S +FDDT  T+ L LD GS H S  KT GLTYTPF  NP+V+ + AF  YYY+ LRRI 
Sbjct: 252 VSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNK-AFLEYYYLNLRRIY 310

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VG + V++ +KYL    +G+GG+IVDSG+TFTFM   +FE +A+EF SQM    NYTR  
Sbjct: 311 VGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM---SNYTREK 367

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
             E  TGL PCF++ G+   + PEL   FKGGA++ LP+ NYF  VG    VCLTVV+D+
Sbjct: 368 DLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDK 427

Query: 429 ----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                   GP+IILG+FQ QNY VEYDL N R GF ++ C
Sbjct: 428 TVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 212/431 (49%), Positives = 285/431 (66%), Gaps = 21/431 (4%)

Query: 45  YQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPF 104
           +  L   VS+S+TRA H+KN    ++  T       +   +YGGYSI L FGTPPQ  PF
Sbjct: 178 FHTLQLAVSTSITRAHHLKNHNNPSSLKTL------VHPKTYGGYSIDLKFGTPPQTFPF 231

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ 161
           +LDTGS LVW PC +HY C  C+S   +  P FIPK S SS+ +GC+NPKC+W+    + 
Sbjct: 232 VLDTGSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSDVT 291

Query: 162 CRDCN--DEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS 219
              C       + + NC+Q CP+Y V YG G T G  LSE LN P + + +FLVGCSV+S
Sbjct: 292 SHCCKLAKAAFSNNNNCSQTCPAYTVQYGLGSTAGFLLSENLNFPAKNVSDFLVGCSVVS 351

Query: 220 SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
             QP GIAGFGRG+ SLP+Q+NL +FSYCLLSH+FD++   S L+++  +S   KKT G+
Sbjct: 352 VYQPGGIAGFGRGEESLPAQMNLTRFSYCLLSHQFDESPENSDLVMEATNSGEGKKTNGV 411

Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
           +YT F+ NPS  ++ AF  YYY+ LR+I VG +RVRV  + L  D +G+GG IVDSG+T 
Sbjct: 412 SYTAFLKNPST-KKPAFGAYYYITLRKIVVGEKRVRVPRRMLEPDVNGDGGFIVDSGSTL 470

Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFK 398
           TFM   +F+ +A+EFV Q+    NYTRA   E   GL PCF +  G +T SFPE++  F+
Sbjct: 471 TFMERPIFDLVAEEFVKQV----NYTRARELEKQFGLSPCFVLAGGAETASFPEMRFEFR 526

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQNYYVEYDLRN 454
           GGA++ LPV NYF+ VG+G   CLT+V+D  A    + GP++ILGN+Q QN+YVE DL N
Sbjct: 527 GGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAVILGNYQQQNFYVECDLEN 586

Query: 455 QRLGFKQQLCK 465
           +R GF+ Q C+
Sbjct: 587 ERFGFRSQSCQ 597


>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
          Length = 445

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 205/398 (51%), Positives = 270/398 (67%), Gaps = 27/398 (6%)

Query: 36  FHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSF 95
           F  NPS D +Q L+ L S+SLTRA H+K+ +       T++  T + +HSYGGYS+SLSF
Sbjct: 59  FTKNPSSDPWQLLSHLTSASLTRAHHLKHRKN------TSSVNTPLFAHSYGGYSVSLSF 112

Query: 96  GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSRLLGCQNP 150
           GTP Q + F++DTGS LVWFPCT+ Y C  CS      +KIP+FIPKLSSS++++GC NP
Sbjct: 113 GTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNP 172

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
           KC ++                 S NCT+ CP+Y + YG G T G+ L E+L    R  P+
Sbjct: 173 KCGFVMDSE------------NSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPD 220

Query: 211 FLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
           F+VGCS+LSSRQP+GIAGFGRG +SLP Q+ L KFSYCLLSH+FDD+ ++S + L  G  
Sbjct: 221 FVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPD 280

Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
             D KT GL+YTPF  NP V+  +AF  YYYV LR I VG +RV+V + ++    DGNGG
Sbjct: 281 SKDDKTGGLSYTPFRKNP-VSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGG 339

Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
           TIVDSG+TFTFM   +FE +A EF  QM    NYTRA   EAL+GL+PCF++ G  + + 
Sbjct: 340 TIVDSGSTFTFMEKPVFEAVATEFDRQMA---NYTRAADVEALSGLKPCFNLSGVGSVAL 396

Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
           P L   FKGGA++ LPV NYF++VG+ S +CLT+V++ 
Sbjct: 397 PSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNE 434


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 203/439 (46%), Positives = 274/439 (62%), Gaps = 36/439 (8%)

Query: 53  SSSLTRALHIK------NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL 106
           ++SL RALH+K      + Q  +    +   T  +  HSYGGY+ + S GTPPQ +P +L
Sbjct: 25  AASLARALHLKRRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLL 84

Query: 107 DTGSHLVWFPCTNHYQCKYCSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
           DTGSHL W PCT+ Y+C+ CSS   S +P F PK SSSSRL+GC+NP C W+H  +    
Sbjct: 85  DTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLAT 144

Query: 164 DCNDEPLAT-SKNC----TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVL 218
            C   P +  + NC    + +CP Y V+YGSG T G+ +++TL  P R +P F++GCS++
Sbjct: 145 KCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLV 204

Query: 219 SSRQ-PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKT 276
           S  Q P+G+AGFGRG  S+P+QL L KFSYCLLS +FDD    S SL+L           
Sbjct: 205 SVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLG-----GTGGG 259

Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSG 336
            G+ Y P V + +  ++  + VYYY+ LR +TVGG+ VR+  +    +  G+GGTIVDSG
Sbjct: 260 EGMQYVPLVKS-AAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSG 318

Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKL 395
           TTFT++ P +F+P+AD           Y R+  AE   GL PCF +P G ++ + PEL  
Sbjct: 319 TTFTYLDPTVFQPVADA--VVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSF 376

Query: 396 HFKGGAEVTLPVENYFAVVGEGS--AVCLTVVTD--------REASGGPSIILGNFQMQN 445
           HF+GGA + LPVENYF V G G+  A+CL VVTD         E S GP+IILG+FQ QN
Sbjct: 377 HFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGS-GPAIILGSFQQQN 435

Query: 446 YYVEYDLRNQRLGFKQQLC 464
           Y VEYDL  +RLGF++Q C
Sbjct: 436 YLVEYDLEKERLGFRRQSC 454


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  368 bits (944), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 198/446 (44%), Positives = 273/446 (61%), Gaps = 36/446 (8%)

Query: 34  SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
           S F  +PS    + L  L ++SL+RA H+K+ +T      +  T  ++S HSYGG+SI L
Sbjct: 38  STFTNSPSTKPLRFLQHLATASLSRAHHLKHGKT------SPLTQISLSPHSYGGHSIPL 91

Query: 94  SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSRLLGCQ 148
           SFGTPPQ + F++DTGSH+VW PCT HY C  CS S     K+P F PKLSSSS++LGC+
Sbjct: 92  SFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKILGCR 151

Query: 149 NPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           NPKC       +   C  CN      SKNC+  CP Y + YG+G + G  L E LN P +
Sbjct: 152 NPKCVNTSSPDVHLGCPPCN----GNSKNCSHACPPYSLQYGTGASSGDFLLENLNFPGK 207

Query: 207 IIPNFLVGC--SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
            I  FLVGC  S +     A +AGFGR   SLP Q+ + KF+YCL SH +DDT  +S LI
Sbjct: 208 TIHEFLVGCTTSAVGEVTSAALAGFGRSMFSLPMQMGVKKFAYCLNSHDYDDTRNSSKLI 267

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           LD    +SD +T GL+Y PF+ NP       F +YYY+G++ I +G + +R+  KYL   
Sbjct: 268 LD----YSDGETKGLSYAPFLKNPP-----DFPIYYYLGVKDIKIGNKLLRIPSKYLAPG 318

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
            DG GG ++DSG  + +M   +F+ + +E   +M K   Y R+L AEA  G+ PC++  G
Sbjct: 319 SDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSK---YRRSLEAEAEIGVTPCYNFTG 375

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR-----EASGGPSIILG 439
           +K+   P+L   F+GGA + +P +NYF ++ E S  C  + TD      E + GPSIILG
Sbjct: 376 QKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDAGTNTLEFTPGPSIILG 435

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
           N Q  +YYVE+DL+N+RLGF+QQ C+
Sbjct: 436 NSQHVDYYVEFDLKNERLGFRQQTCQ 461


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score =  365 bits (936), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 194/419 (46%), Positives = 261/419 (62%), Gaps = 28/419 (6%)

Query: 66  QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
           Q  +    +   T  +  HSYGGY+ + S GTPPQ +P +LDTGSHL W PCT+ Y+C+ 
Sbjct: 76  QKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRN 135

Query: 126 CSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT-SKNC----T 177
           CSS   S +P F PK SSSSRL+GC+NP C W+H  +     C   P +  + NC    +
Sbjct: 136 CSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAAS 195

Query: 178 QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAGFGRGKTSL 236
            +CP Y V+YGSG T G+ +++TL  P R +P F++GCS++S  Q P+G+AGFGRG  S+
Sbjct: 196 NVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVSVHQPPSGLAGFGRGAPSV 255

Query: 237 PSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNA 295
           P+QL L KFSYCLLS +FDD    S SL+L            G+ Y P V + +  ++  
Sbjct: 256 PAQLGLPKFSYCLLSRRFDDNAAVSGSLVLG-----GTGGGEGMQYVPLVKS-AAGDKLP 309

Query: 296 FSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
           + VYYY+ LR +TVGG+ VR+  +    +  G+GGTIVDSGTTFT++ P +F+P+AD   
Sbjct: 310 YGVYYYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADA-- 367

Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
                   Y R+  AE   GL PCF +P G ++ + PEL  HF+GGA + LPVENYF V 
Sbjct: 368 VVAAVGGRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVA 427

Query: 415 GEGS--AVCLTVVTD-------REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           G G+  A+CL VVTD            GP+IILG+FQ QNY VEYDL  +RLGF++Q C
Sbjct: 428 GRGAVEAICLAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 486


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  362 bits (929), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 200/449 (44%), Positives = 275/449 (61%), Gaps = 42/449 (9%)

Query: 30  TFSLSRFHTNPSQ-DSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG 88
           TF LS    +PS  D ++++N    SSL+RA H+K P T T   T           SYGG
Sbjct: 22  TFPLS---ISPSALDKWESINLAALSSLSRARHLKRPPTLTGKVTLPAY-----PRSYGG 73

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCT---NHYQCKYCSSS-----KIPSFIPKLSS 140
           YS+  S GTPPQ +  +LDTGS LVW PCT     Y C+ C+ S     KIP +    SS
Sbjct: 74  YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           + + L C++PKC+W+    + C            + T+ CP Y + YG G T G  +S+ 
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNC------------STTKRCPYYGLEYGLGSTTGQLVSDV 181

Query: 201 LNLP--NRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
           L L   NRI P+FL GCS++S+RQP GIAGFGRG  S+P+QL L KFSYCL+SH+FDDT 
Sbjct: 182 LGLSKLNRI-PDFLFGCSLVSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTP 240

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
           ++  L+L  G  H+D    G+ Y PF  +P+++    +S YYY+ L +I VGG+ V +  
Sbjct: 241 QSGDLVLHRGRRHADAAANGVAYAPFTKSPALS---PYSEYYYISLSKILVGGKDVPIPP 297

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
           +YL   ++G+GG IVDSG+TFTFM   +F+P+A E    M K   Y RA   E  +GL P
Sbjct: 298 RYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTK---YKRAKEIEDSSGLGP 354

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG---GPS 435
           C+++ G+     P+L   FKGGA + LP+ +YF++V +G  VC+TV+TD +  G   GP+
Sbjct: 355 CYNITGQSEVDVPKLTFSFKGGANMDLPLTDYFSLVTDG-VVCMTVLTDPDEPGSTTGPA 413

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           IILGN+Q QN+Y+EYDL+ QR GFK Q C
Sbjct: 414 IILGNYQQQNFYIEYDLKKQRFGFKPQQC 442


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score =  352 bits (904), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 194/481 (40%), Positives = 283/481 (58%), Gaps = 40/481 (8%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLTFSLS-----RFHTNPSQDSYQNLNSLVSSS 55
           MAS+ + L  S    F+ L +  SS  ++  +++      F  NPS +    L  L ++S
Sbjct: 1   MASF-TTLLFSVFTLFSRLVLASSSKNNIPATITIPLTPTFTKNPSTEPLLFLQHLATAS 59

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
           ++R+ H+K+ +       +    T++  HS+GG++I LSFGTPPQ + F++DTGSH+VW 
Sbjct: 60  MSRSHHLKHGKA------SPLIQTSLFPHSHGGHTIPLSFGTPPQKLSFLVDTGSHVVWA 113

Query: 116 PCTNHYQCKYCSSS---KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ--CRDCNDEPL 170
           PCT HY C  CS S   K+P F P+LSSS ++LGC++PKC+      +   C  CN    
Sbjct: 114 PCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGCPRCN---- 169

Query: 171 ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA--GIAG 228
             SK C+  CP Y + YG+G   G  L E L+ P + I  FLVGC+  + R+P+   +AG
Sbjct: 170 GNSKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSADREPSSDALAG 229

Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
           FGR   SLP Q+ + KF+YCL SH +DDT  +  LILD    +SD +T GL+Y PF+ NP
Sbjct: 230 FGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILD----YSDGETQGLSYAPFLKNP 285

Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
                  +  YYY+G++ + +G + +R+  KYLT   D  GG ++DSG  + +M   +F+
Sbjct: 286 P-----DYPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLPVFK 340

Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVE 408
            + +E   QM K   Y R+L AE  +GL PC++  G K+   P+L   F GGA + +P  
Sbjct: 341 IVTNELKKQMSK---YRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGM 397

Query: 409 NYFAVVGEGSAVCLTVVTDR-----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
           NYF +  E S  C  V TD      E + GPSIILGN+Q  ++YVE+DL+N+RLGF+QQ 
Sbjct: 398 NYFLLFSEASLGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQT 457

Query: 464 C 464
           C
Sbjct: 458 C 458


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score =  350 bits (898), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 193/481 (40%), Positives = 282/481 (58%), Gaps = 40/481 (8%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSR-----FHTNPSQDSYQNLNSLVSSS 55
           MAS+ + L  S    F+ L +  SS  ++  +++      F  NPS +    L  L ++S
Sbjct: 1   MASF-TTLLFSVFTLFSHLVLASSSKNNIPATITIPLTPIFTKNPSTEPLLFLQHLATAS 59

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
           ++R+ H+K+ +       +    T++  HSYG ++I LSFGTPPQ + F++DTGSH+VW 
Sbjct: 60  MSRSHHLKHGKA------SPLIQTSLFPHSYGAHTIPLSFGTPPQKLSFLMDTGSHVVWA 113

Query: 116 PCTNHYQCKYCSSS---KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC--RDCNDEPL 170
           PCT HY C  CS S   K+P F P+LSSS ++LGC++PKC+      +      CN    
Sbjct: 114 PCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGXPRCN---- 169

Query: 171 ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA--GIAG 228
             SK C+  CP Y + YG+G   G  L E L+ P + I  FLVGC+  + R+P+   +AG
Sbjct: 170 GNSKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSADREPSSDALAG 229

Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
           FGR   SLP Q+ + KF+YCL SH +DDT  +  LILD    +SD +T GL+Y PF  NP
Sbjct: 230 FGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILD----YSDGETQGLSYAPFXKNP 285

Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
                  + +YYY+G++ + +G + +R+  KYLT   D  GG ++DSG  +++M   +F+
Sbjct: 286 P-----DYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLPVFK 340

Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVE 408
            + +E   QM K   Y R+L  EA TG+ PC++  G K+   P+L   F GGA + +P  
Sbjct: 341 IVTNELKKQMSK---YRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGM 397

Query: 409 NYFAVVGEGSAVCLTVVTDR-----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
           NYF +  E S  C  V TD      E + GPSIILGN+Q  ++YVE+DL+N+RLGF+QQ 
Sbjct: 398 NYFLLFSEASLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQT 457

Query: 464 C 464
           C
Sbjct: 458 C 458


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score =  341 bits (875), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 192/435 (44%), Positives = 261/435 (60%), Gaps = 41/435 (9%)

Query: 58  RALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC 117
           RA H     + +    +   T  +  HSYGGY+ + S GTPPQ +P +LDTGS L W PC
Sbjct: 72  RASHHSQKGSSSGGHKSIPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSQLTWVPC 131

Query: 118 TNHYQCKYCSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIH--HESIQCRDCNDEPLAT 172
           T++Y C+ CSS   + +P F PK SSSSRL+GC+NP C W+H      +CR     P + 
Sbjct: 132 TSNYDCRNCSSPFAAAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCR----APCSR 187

Query: 173 SKNCT---QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAG 228
             NCT    +CP Y V+YGSG T G+ +++TL  P R +  F++GCS++S  Q P+G+AG
Sbjct: 188 GANCTPASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVSGFVLGCSLVSVHQPPSGLAG 247

Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
           FGRG  S+P+QL L KFSYCLLS +FDD    S  ++  G +       G+ Y P V + 
Sbjct: 248 FGRGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSGSLVLGGDN------DGMQYVPLVKS- 300

Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
           +  ++  ++VYYY+ L  +TVGG+ VR+  +    +  G+GG IVDSGTTFT++ P +F+
Sbjct: 301 AAGDKQPYAVYYYLALSGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQ 360

Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPV 407
           P+AD           Y R+   E   GL PCF +P G K+ + PEL LHFKGGA + LP+
Sbjct: 361 PVADA--VVAAVGGRYKRSKDVEEGLGLHPCFALPQGAKSMALPELSLHFKGGAVMQLPL 418

Query: 408 ENYFAVVGEG------------SAVCLTVVTD------REASGGPSIILGNFQMQNYYVE 449
           ENYF V G               A+CL VVTD       +  GGP+IILG+FQ QNY VE
Sbjct: 419 ENYFVVAGRAPVPGAGAGAGAAEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNYLVE 478

Query: 450 YDLRNQRLGFKQQLC 464
           YDL  +RLGF++Q C
Sbjct: 479 YDLEKERLGFRRQPC 493


>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
 gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
          Length = 491

 Score =  337 bits (865), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 197/431 (45%), Positives = 255/431 (59%), Gaps = 39/431 (9%)

Query: 65  PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
           P+++  T    +   ++  HSYGGY+ ++S GTPPQ +P +LDTGSHL W PCT+ YQC+
Sbjct: 65  PRSRQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCR 124

Query: 125 YCSS----SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--- 177
            CSS    S +  F PK SSSSRL+GC+NP C WIH       DC         NCT   
Sbjct: 125 NCSSLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIHSPD-HLSDCRAASSCPGANCTPRN 183

Query: 178 ----QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAGFGRG 232
                +CP YLV+YGSG T G+ +S+TL  P R + NF++GCS+ S  Q P+G+AGFGRG
Sbjct: 184 ANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRG 243

Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
             S+PSQL L KFSYCLLS +FDD    S  LIL    +       G+ Y P     S +
Sbjct: 244 APSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILG--GAGGKDGGVGMQYAPLAR--SAS 299

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
            R  +SVYYY+ L  ITVGG+ V++  +   +     GG IVDSGTTF++    +FEP+A
Sbjct: 300 ARPPYSVYYYLALTAITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVA 358

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
                       Y+R+   E   GL PCF + PG KT   PE+ LHFKGG+ + LPVENY
Sbjct: 359 AA--VVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENY 416

Query: 411 FAVVGE---------GSAVCLTVVTD--------REASGGPSIILGNFQMQNYYVEYDLR 453
           F V G            A+CL VV+D          +SGGP+IILG+FQ QNYY+EYDL 
Sbjct: 417 FVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLE 476

Query: 454 NQRLGFKQQLC 464
            +RLGF++Q C
Sbjct: 477 KERLGFRRQQC 487


>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
          Length = 648

 Score =  337 bits (865), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 197/431 (45%), Positives = 255/431 (59%), Gaps = 39/431 (9%)

Query: 65  PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
           P+++  T    +   ++  HSYGGY+ ++S GTPPQ +P +LDTGSHL W PCT+ YQC+
Sbjct: 65  PRSRQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCR 124

Query: 125 YCSS----SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--- 177
            CSS    S +  F PK SSSSRL+GC+NP C WIH       DC         NCT   
Sbjct: 125 NCSSLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIHSPD-HLSDCRAASSCPGANCTPRN 183

Query: 178 ----QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAGFGRG 232
                +CP YLV+YGSG T G+ +S+TL  P R + NF++GCS+ S  Q P+G+AGFGRG
Sbjct: 184 ANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRG 243

Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
             S+PSQL L KFSYCLLS +FDD    S  LIL    +       G+ Y P     S +
Sbjct: 244 APSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILG--GAGGKDGGVGMQYAPLAR--SAS 299

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
            R  +SVYYY+ L  ITVGG+ V++  +   +     GG IVDSGTTF++    +FEP+A
Sbjct: 300 ARPPYSVYYYLALTAITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVA 358

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
                       Y+R+   E   GL PCF + PG KT   PE+ LHFKGG+ + LPVENY
Sbjct: 359 --AAVVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENY 416

Query: 411 FAVVGE---------GSAVCLTVVTD--------REASGGPSIILGNFQMQNYYVEYDLR 453
           F V G            A+CL VV+D          +SGGP+IILG+FQ QNYY+EYDL 
Sbjct: 417 FVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLE 476

Query: 454 NQRLGFKQQLC 464
            +RLGF++Q C
Sbjct: 477 KERLGFRRQQC 487


>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
          Length = 452

 Score =  328 bits (841), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 178/378 (47%), Positives = 239/378 (63%), Gaps = 30/378 (7%)

Query: 108 TGSHLVWFPCTNHYQCKYCSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           +GSHL W PCT+ Y+C+ CSS   S +P F PK SSSSRL+GC+NP C W+H  +     
Sbjct: 79  SGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATK 138

Query: 165 CNDEPLAT-SKNC----TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS 219
           C   P +  + NC    + +CP Y V+YGSG T G+ +++TL  P R +P F++GCS++S
Sbjct: 139 CRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVS 198

Query: 220 SRQP-AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTT 277
             QP +G+AGFGRG  S+P+QL L KFSYCLLS +FDD    S SL+L            
Sbjct: 199 VHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLG-----GTGGGE 253

Query: 278 GLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGT 337
           G+ Y P V + +  ++  + VYYY+ LR +TVGG+ VR+  +    +  G+GGTIVDSGT
Sbjct: 254 GMQYVPLVKS-AAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGT 312

Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLH 396
           TFT++ P +F+P+AD           Y R+  AE   GL PCF +P G ++ + PEL  H
Sbjct: 313 TFTYLDPTVFQPVADA--VVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFH 370

Query: 397 FKGGAEVTLPVENYFAVVGEGS--AVCLTVVTD--------REASGGPSIILGNFQMQNY 446
           F+GGA + LPVENYF V G G+  A+CL VVTD         E S GP+IILG+FQ QNY
Sbjct: 371 FEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGS-GPAIILGSFQQQNY 429

Query: 447 YVEYDLRNQRLGFKQQLC 464
            VEYDL  +RLGF++Q C
Sbjct: 430 LVEYDLEKERLGFRRQSC 447


>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  328 bits (840), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 203/453 (44%), Positives = 261/453 (57%), Gaps = 36/453 (7%)

Query: 40  PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPP 99
           P+   +  L+ L  +SL RA  ++        ++       +  HSYGGY+ SLS GTPP
Sbjct: 39  PAAAQHHPLSRLARASLARASRLRGHHQGQAASSPVRAA--LYPHSYGGYAFSLSLGTPP 96

Query: 100 QIIPFILDTGSHLVWFPCTNHYQCKYCSSSK--IPSFIPKLSSSSRLLGCQNPKCSWIH- 156
           Q +P +LDTGSHL W PCT++YQC+ CS++    P F PK SSSS L+ C +P C WIH 
Sbjct: 97  QPLPVLLDTGSHLTWVPCTSNYQCQNCSAAAGSFPVFHPKSSSSSLLVSCSSPSCLWIHS 156

Query: 157 --HESIQCRD---CNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP-- 209
             H S   RD   C       S   T +CP YLV+YGSG T G+ +S+TL L  R     
Sbjct: 157 KSHLSDCARDSAPCRPSTANCSATATNVCPPYLVVYGSGSTAGLLVSDTLRLSPRGAASR 216

Query: 210 NFLVGCSVLSSRQ-PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDN 267
           NF VGCS+ S  Q P+G+AGFGRG  S+P+QL ++KFSYCLLS +FDD    S  L+L  
Sbjct: 217 NFAVGCSLASVHQPPSGLAGFGRGAPSVPAQLGVNKFSYCLLSRRFDDDAAISGELVL-- 274

Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT-LDRD 326
           G+S + K    + Y P + N     R  +SVYYY+ L  I VGG+ V +  + L  +   
Sbjct: 275 GASSAGKAKAMMQYAPLLKN--AGARPPYSVYYYLSLTGIAVGGKSVALPARALAPVSGG 332

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GE 385
           G GG I+DSGTTFT++ P +F+P              Y R+   E   GLRPCF +P G 
Sbjct: 333 GGGGAIIDSGTTFTYLDPTVFKP--VAAAMVAAVGGRYNRSKDVEGALGLRPCFALPAGA 390

Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-----AVCLTVVTD---------REAS 431
           +T   PEL LHF GGAE+ LP+ENYF   G  S     A+CL VV+D             
Sbjct: 391 RTMDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVVSDVSSASGGAGVSGG 450

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GGP+IILG+FQ QNY VEYDL   RLGF+QQ C
Sbjct: 451 GGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPC 483


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score =  326 bits (835), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 184/428 (42%), Positives = 257/428 (60%), Gaps = 37/428 (8%)

Query: 51  LVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGS 110
           L S+SL+RA H+K+ +T     T+      +  HSYGG+SISLSFGTPPQ + F++DTGS
Sbjct: 46  LASASLSRAHHLKHGKTNPPVKTS------LFPHSYGGHSISLSFGTPPQKLSFLVDTGS 99

Query: 111 HLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSRLLGCQNPKC--SWIHHESIQCR 163
            +VW PCT  Y C  CS S     K+P F PKLSSSS++L C+NPKC  ++  +  + C 
Sbjct: 100 DVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRNPKCVSTYFPYVHLGCP 159

Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
            CN      SK+C+  CP Y   YG+G + G  L E L  P + I NFL+GC+  ++R+ 
Sbjct: 160 RCN----GNSKHCSYACP-YSTQYGTGASSGYFLLENLKFPRKTIRNFLLGCTTSAAREL 214

Query: 224 A--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTY 281
           +   +AGFGR   SLP Q+ + KF+YCL SH +DDT  +  LILD    + D KT GL+Y
Sbjct: 215 SSDALAGFGRSMFSLPIQMGVKKFAYCLNSHDYDDTRNSGKLILD----YRDGKTKGLSY 270

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT- 340
           TPF+ +P      A + YY++G++ I +G + +R+  KYL    DG  G I+DSG     
Sbjct: 271 TPFLKSPP-----ASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAG 325

Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
           +M   +F+ + +E   QM K   Y R+L AE  TGL PC++  G K+   P L   F+GG
Sbjct: 326 YMTGPVFKIVTNELKKQMSK---YRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGG 382

Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTD----REASGGPSIILGNFQMQNYYVEYDLRNQR 456
           A + +P +NYF +  + S  C  + T+     E +  PSIILGN Q  +YYVEYDL+N R
Sbjct: 383 ANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDR 442

Query: 457 LGFKQQLC 464
            GF++Q C
Sbjct: 443 FGFRRQTC 450


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  323 bits (829), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 192/429 (44%), Positives = 254/429 (59%), Gaps = 42/429 (9%)

Query: 66  QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
             + ++       T +  HSYGGY+ S+S GTPPQ +P +LDTGSHL W PCT+ YQC+ 
Sbjct: 68  HAEPSSQAPAAVRTALYPHSYGGYAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRN 127

Query: 126 CSSSKIPS-----FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT-QI 179
           CSSS         F PK SSSSRL+GC+NP C WIH +S     C     +T  N    +
Sbjct: 128 CSSSPSAMSAMAVFHPKNSSSSRLVGCRNPACRWIHSKSPS--TCG----STGNNGNGDV 181

Query: 180 CPSYLVLYGSGLTEGIALSETLNLPNRIIP-------NFLVGCSVLSSRQ-PAGIAGFGR 231
           CP YLV+YGSG T G+ +S+TL L             NF +GCS++S  Q P+G+AGFGR
Sbjct: 182 CPPYLVVYGSGSTSGLLISDTLRLSPSSSSSAPAPFRNFAIGCSIVSVHQPPSGLAGFGR 241

Query: 232 GKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           G  S+PSQL + KFSYCLLS +FDD +  S  L+L +    + KK T + Y P +NN   
Sbjct: 242 GAPSVPSQLKVPKFSYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNN--A 299

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
           A +  +SVYYY+ L  I+VGG+ V +  +         GG I+DSGTTFT++ P +F+P+
Sbjct: 300 ASKPPYSVYYYLALTGISVGGKPVNLPSRAFV--PSSGGGAIIDSGTTFTYLDPTVFKPV 357

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS--FPELKLHFKGGAEVTLPVE 408
           A    S +     Y R+   E   GLRPCF +P    G+   P+L+L FKGGA + LPVE
Sbjct: 358 AAAMESAV--GGRYNRSRPVEDALGLRPCFALPPGPGGAMELPDLELKFKGGAVMRLPVE 415

Query: 409 NYF-------AVVGEGSAVCLTVVTD------REASGGPSIILGNFQMQNYYVEYDLRNQ 455
           NYF              A+CL VV+D        A+ GP+IILG+FQ QNY++EYDL  +
Sbjct: 416 NYFVAAGPAGGPAAGPVAICLAVVSDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKE 475

Query: 456 RLGFKQQLC 464
           RLGF+QQ C
Sbjct: 476 RLGFRQQPC 484


>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
          Length = 490

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 193/431 (44%), Positives = 251/431 (58%), Gaps = 40/431 (9%)

Query: 65  PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
           P+++  T    +   ++  HSYGGY+ ++S GTPPQ +P +L+TGSHL W P T+ Y   
Sbjct: 65  PRSRQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLETGSHLSWVPSTSSYSAN 124

Query: 125 YCSS----SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--- 177
            CSS    S +  F PK SSSSRL+GC+NP C WIH       DC         NCT   
Sbjct: 125 -CSSLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIHSPD-HLSDCRAASSCPGANCTPRN 182

Query: 178 ----QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAGFGRG 232
                +CP YLV+YGSG T G+ +S+TL  P R + NF++GCS+ S  Q P+G+AGFGRG
Sbjct: 183 ANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRG 242

Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
             S+PSQL L KFSYCLLS +FDD    S  LIL    +       G+ Y P     S +
Sbjct: 243 APSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILG--GAGGKDGGVGMQYAPLAR--SAS 298

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
            R  +SVYYY+ L  ITVGG+ V++  +   +     GG IVDSGTTF++    +FEP+A
Sbjct: 299 ARPPYSVYYYLALTAITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVA 357

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
                       Y+R+   E   GL PCF + PG KT   PE+ LHFKGG+ + LPVENY
Sbjct: 358 AA--VVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENY 415

Query: 411 FAVVGE---------GSAVCLTVVTD--------REASGGPSIILGNFQMQNYYVEYDLR 453
           F V G            A+CL VV+D          +SGGP+IILG+FQ QNYY+EYDL 
Sbjct: 416 FVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLE 475

Query: 454 NQRLGFKQQLC 464
            +RLGF++Q C
Sbjct: 476 KERLGFRRQQC 486


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score =  305 bits (781), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 162/378 (42%), Positives = 232/378 (61%), Gaps = 29/378 (7%)

Query: 106 LDTGSHLVWFPCTNHYQCKYC--SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ-- 161
           +DTGS LVW PCT +Y C  C   S+    F+P++SSS  L+ C +  C  ++  + +  
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60

Query: 162 CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP------NRIIPNFLVGC 215
           C+ C      + KNC++ CP Y + YG G T G+ L+ETLNLP       R I +F VGC
Sbjct: 61  CQSC----AGSLKNCSETCPPYGIQYGRGSTAGLLLTETLNLPLENGEGARAITHFAVGC 116

Query: 216 SVLSSRQPAGIAGFGRGKTSLPSQLN----LDKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
           S++SS+QP+GIAGFGRG  S+PSQL      D+F+YCL SH+FD+  + S ++L + +  
Sbjct: 117 SIVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDKALP 176

Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYLTLDRDGNGG 330
           ++     L YTPF+ N      + + VYYY+GLR +++GG+R++ +  K L  D  GNGG
Sbjct: 177 NNIP---LNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGG 233

Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
           TI+DSGTTFT  + E+F+ +A  F SQ+     Y RA   E  TG+  C+DV G +    
Sbjct: 234 TIIDSGTTFTVFSDEIFKHIAAGFASQI----GYRRAGEVEDKTGMGLCYDVTGLENIVL 289

Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR---EASGGPSIILGNFQMQNYY 447
           PE   HFKGG+++ LPV NYF+      ++CLT+++ R   E   GP++ILGN Q Q++Y
Sbjct: 290 PEFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFY 349

Query: 448 VEYDLRNQRLGFKQQLCK 465
           + YD    RLGF QQ CK
Sbjct: 350 LLYDREKNRLGFTQQTCK 367


>gi|296084856|emb|CBI28265.3| unnamed protein product [Vitis vinifera]
          Length = 446

 Score =  304 bits (778), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 200/445 (44%), Positives = 243/445 (54%), Gaps = 100/445 (22%)

Query: 27  TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY 86
           + +T  LS    +P  D Y+NL  LVS+SL RA H+KN        TT T+TT + +HSY
Sbjct: 34  SPITLPLSASKPSPPPDPYRNLRHLVSASLIRARHLKN------PKTTPTSTTPLFTHSY 87

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPS---FIPKLSSSS 142
           G YSI LSFGTPPQ +P I+DTGS LVWFPCT+ Y C+ CS S+  PS   FIPK SSSS
Sbjct: 88  GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147

Query: 143 RLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           ++LGC NPKC WIH   +Q  CRDC  EP  TS NCTQICP YLV YGSG+T GI LSET
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDC--EP--TSPNCTQICPPYLVFYGSGITGGIMLSET 203

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
           L+LP + +PNF+VGCSVLS+ QPAGI+GFGRG  SLPSQL L KFSYCLLS ++DDTT +
Sbjct: 204 LDLPGKGVPNFIVGCSVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTES 263

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR-ITVGGQRVRVWHK 319
           SSLI +  ++  +K+              V  + A  V    GLR    + G     + +
Sbjct: 264 SSLIFELVAAEFEKQ--------------VQSKRATEVEGITGLRPCFNISGLNTPSFPE 309

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
            LTL   G                 E+  PLA           NY   LG + +  L   
Sbjct: 310 -LTLKFRGGA---------------EMELPLA-----------NYVAFLGGDDVVCLTIV 342

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
            D    K          F GG  + L                                 G
Sbjct: 343 TDGAAGK---------EFSGGPAIIL---------------------------------G 360

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           NFQ QN+YVEYDLRN+RLGF+QQ C
Sbjct: 361 NFQQQNFYVEYDLRNERLGFRQQSC 385


>gi|115461432|ref|NP_001054316.1| Os04g0685200 [Oryza sativa Japonica Group]
 gi|113565887|dbj|BAF16230.1| Os04g0685200, partial [Oryza sativa Japonica Group]
          Length = 330

 Score =  238 bits (606), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 147/328 (44%), Positives = 191/328 (58%), Gaps = 31/328 (9%)

Query: 161 QCRDCNDEPLAT----SKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCS 216
            CR  +  P A     + N   +CP YLV+YGSG T G+ +S+TL  P R + NF++GCS
Sbjct: 6   DCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCS 65

Query: 217 VLSSRQ-PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDK 274
           + S  Q P+G+AGFGRG  S+PSQL L KFSYCLLS +FDD    S  LIL    +    
Sbjct: 66  LASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILG--GAGGKD 123

Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
              G+ Y P     S + R  +SVYYY+ L  ITVGG+ V++  +   +     GG IVD
Sbjct: 124 GGVGMQYAPLAR--SASARPPYSVYYYLALTAITVGGKSVQLPERAF-VAGGAGGGAIVD 180

Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPEL 393
           SGTTF++    +FEP+A            Y+R+   E   GL PCF + PG KT   PE+
Sbjct: 181 SGTTFSYFDRTVFEPVAAA--VVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEM 238

Query: 394 KLHFKGGAEVTLPVENYFAVVGE---------GSAVCLTVVTD--------REASGGPSI 436
            LHFKGG+ + LPVENYF V G            A+CL VV+D          +SGGP+I
Sbjct: 239 SLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAI 298

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ILG+FQ QNYY+EYDL  +RLGF++Q C
Sbjct: 299 ILGSFQQQNYYIEYDLEKERLGFRRQQC 326


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 148/414 (35%), Positives = 205/414 (49%), Gaps = 49/414 (11%)

Query: 89  YSISLSFGT-PPQIIPFILDTGSHLVWFPCTNHYQCKYCS----SSKIPSFIPKLSSSSR 143
           Y++S + G+ PPQ I   +DTGS LVWFPC   ++C  C     ++      P   +SS 
Sbjct: 73  YTLSFNLGSHPPQPISLYMDTGSDLVWFPCAP-FECILCEGKYDTAATGGLSPPNITSSA 131

Query: 144 LLGCQNPKCSWIHHESIQCRD------CNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
            + C++P CS  H  S+   D      C  E + TS   +  CP +   YG G       
Sbjct: 132 SVSCKSPACSAAH-TSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLY 190

Query: 198 SETLNLPNR---IIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYC 248
            ++L++P     ++ NF  GC+  +  +P G+AGFGRG  SLP+QL        ++FSYC
Sbjct: 191 RDSLSMPASSPLVLHNFTFGCAHTALGEPVGVAGFGRGVLSLPAQLASFSPHLGNQFSYC 250

Query: 249 LLSHKFD--DTTRTSSLILDNGSSHSDKKTT------GLTYTPFVNNPSVAERNAFSVYY 300
           L+SH FD     R S LIL   S   +KK           YT  ++NP          +Y
Sbjct: 251 LVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPK------HPYFY 304

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
            VGL  ITVG +++ V      +DR GNGG +VDSGTTFT +   L+E L  EF  +M  
Sbjct: 305 CVGLEGITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRM-- 362

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG--- 417
            R Y RA   E  TGL PC+    +     P + LHF G + V LP  NY+    +G   
Sbjct: 363 GRVYKRATQIEERTGLGPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDG 421

Query: 418 -----SAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                   CL ++   D   SGGP+  LGN+Q Q + V YDL   R+GF ++ C
Sbjct: 422 QKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 146/400 (36%), Positives = 205/400 (51%), Gaps = 37/400 (9%)

Query: 89  YSISLSFGT-PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           Y++S + G+ P Q I   +DTGS LVWFPC   ++C  C   K  +  P   + S  + C
Sbjct: 19  YTLSFNLGSHPSQSITLYMDTGSDLVWFPCAP-FECILCEG-KFNATKPLNITRSHRVSC 76

Query: 148 QNPKCSWIH-----HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
           Q+P CS  H     H+      C  + + TS   +  CP +   YG G        +TL+
Sbjct: 77  QSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFIAHLHRDTLS 136

Query: 203 LPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQL-----NL-DKFSYCLLSHKFDD 256
           +    + NF  GC+  +  +P G+AGFGRG  SLP+QL     NL ++FSYCL+SH FD 
Sbjct: 137 MSQLFLKNFTFGCAHTALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVSHSFDK 196

Query: 257 --TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
               + S LIL +   +S ++     YT  + NP        S +Y VGL  I+VG + +
Sbjct: 197 ERVRKPSPLILGHYDDYSSERVE-FVYTSMLRNPK------HSYFYCVGLTGISVGKRTI 249

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
                   +DR G+GG +VDSGTTFT +   L+  +  EF  ++   R + RA   E  T
Sbjct: 250 LAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRV--GRVHKRASEVEEKT 307

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYFA--VVGEGSAV----CLTVVT- 426
           GL PC+ + G      P +  HF G  + V LP  NYF   + GE  A     CL ++  
Sbjct: 308 GLGPCYFLEG--LVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNG 365

Query: 427 --DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             D E SGGP  ILGN+Q Q + V YDL NQR+GF ++ C
Sbjct: 366 GDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 146/406 (35%), Positives = 200/406 (49%), Gaps = 41/406 (10%)

Query: 89  YSISLSFGTPPQI--IPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-----SFIPKLSSS 141
           Y++SLS G P     +   LDTGS LVWFPC   + C  C     P     S +P     
Sbjct: 88  YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145

Query: 142 SRLLGCQNPKCSWIHHES-----IQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGI 195
           SR + C +P CS  H  +          C  + + T    +  CP     YG G L   +
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205

Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH 252
                    +  + NF   C+  +  +P G+AGFGRG  SLP+QL      +FSYCL++H
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAH 265

Query: 253 KF--DDTTRTSSLILDNGSSHS--DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
            F  D   R+S LIL   +  +      T   YTP ++NP          +Y V L  ++
Sbjct: 266 SFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPK------HPYFYSVALEAVS 319

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VGG+R++   +   +DRDGNGG +VDSGTTFT +  + F  +ADEF   M   R      
Sbjct: 320 VGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAE- 378

Query: 369 GAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAV-CLTV 424
           GAEA TGL PC+   P ++  + P + LHF+G A V LP  NYF      EG +V CL +
Sbjct: 379 GAEAQTGLAPCYHYSPSDR--AVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLML 436

Query: 425 VT------DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +       D E  GGP+  LGNFQ Q + V YD+   R+GF ++ C
Sbjct: 437 MNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score =  211 bits (536), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 146/406 (35%), Positives = 200/406 (49%), Gaps = 41/406 (10%)

Query: 89  YSISLSFGTPPQI--IPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-----SFIPKLSSS 141
           Y++SLS G P     +   LDTGS LVWFPC   + C  C     P     S +P     
Sbjct: 88  YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145

Query: 142 SRLLGCQNPKCSWIHHES-----IQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGI 195
           SR + C +P CS  H  +          C  + + T    +  CP     YG G L   +
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205

Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH 252
                    +  + NF   C+  +  +P G+AGFGRG  SLP+QL      +FSYCL++H
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAH 265

Query: 253 KF--DDTTRTSSLILDNGSSHS--DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
            F  D   R+S LIL   +  +      T   YTP ++NP          +Y V L  ++
Sbjct: 266 SFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPK------HPYFYSVALEAVS 319

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VGG+R++   +   +DRDGNGG +VDSGTTFT +  + F  +ADEF   M   R      
Sbjct: 320 VGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAE- 378

Query: 369 GAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAV-CLTV 424
           GAEA TGL PC+   P ++  + P + LHF+G A V LP  NYF      EG +V CL +
Sbjct: 379 GAEAQTGLAPCYHYSPSDR--AVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLML 436

Query: 425 VT------DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +       D E  GGP+  LGNFQ Q + V YD+   R+GF ++ C
Sbjct: 437 MNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 142/413 (34%), Positives = 202/413 (48%), Gaps = 50/413 (12%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLS-SSSRLLG 146
           G   +L+F    Q +   +DTGS +VWFPC+  ++C  C     P  +  L+ S S L+ 
Sbjct: 91  GTDYTLTFSINSQTLSVYMDTGSDIVWFPCSP-FECILCEGKFEPGTLTPLNVSKSSLIS 149

Query: 147 CQNPKCSWIHH-----ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
           C++  CS  H+     +      C  + + TS      CPS+   YG G     +L   L
Sbjct: 150 CKSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDG-----SLIAKL 204

Query: 202 NLPNRIIP----------NFLVGCSVLSSRQPAGIAGFGRGKTSLPSQL-NL-----DKF 245
           +  N I+P          +F  GC+  +  +P G+AGFG G  SLP+QL NL     ++F
Sbjct: 205 HKHNLIMPSTSNKPFSLKDFTFGCAHSALGEPIGVAGFGFGSLSLPAQLANLSPDLGNQF 264

Query: 246 SYCLLSHKFDDTT--RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
           SYCL+SH FD T     S LIL         + T   YTP ++NP          +Y V 
Sbjct: 265 SYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPK------HPYFYSVS 318

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           +  I+VG  RVR  +  + +DRDGNGG +VDSGTT+T +    +  +A E   ++   R 
Sbjct: 319 MEAISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRV--GRV 376

Query: 364 YTRALGAEALTGLRPCFDVPG---EKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGS- 418
           + RA   E+ TGL PC+ + G   E+ G   P L  HF G   V LP  NYF    +G  
Sbjct: 377 FKRASETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGED 436

Query: 419 ------AVCLTVVT-DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                   CL ++    E+ GGP   LGN+Q Q + V YDL  +R+GF  + C
Sbjct: 437 EKKGRKVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 140/406 (34%), Positives = 205/406 (50%), Gaps = 41/406 (10%)

Query: 89  YSISLSFGT-PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           Y++S + G+ PPQ+I   +DTGS LVWFPC+  ++C  C      +    ++  +  + C
Sbjct: 75  YTLSFNLGSNPPQLITLYMDTGSDLVWFPCSP-FECILCEGKPQTTKPANITKQTHSVSC 133

Query: 148 QNPKCSWIHHESI-----QCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
           Q+P CS  H             C  + + TS   +  CP +   YG G        +TL+
Sbjct: 134 QSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFVANLYQQTLS 193

Query: 203 LPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKFDD 256
           L +  + NF  GC+  +  +P G+AGFGRG  SLP+QL+       ++FSYCL+SH FD 
Sbjct: 194 LSSLHLQNFTFGCAHTALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHSFDG 253

Query: 257 T--TRTSSLIL----DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
               R S LIL    D  +   D ++    YT  ++NP          YY VGL  I+VG
Sbjct: 254 DRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPK------HPYYYCVGLAGISVG 307

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
            + V        +D  GNGG +VDSGTTFT +    +  + +EF  ++  NR + RA   
Sbjct: 308 KRTVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRV--NRFHKRASEI 365

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYF--------AVVGEGSAVC 421
           E  TGL PC+ + G      P LKLHF G  ++V LP +NYF         +  +G   C
Sbjct: 366 ETKTGLGPCYYLNG--LSQIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGC 423

Query: 422 LTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + ++    + E  GGP   LGN+Q Q + V YDL  +R+GF ++ C
Sbjct: 424 MMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 157/495 (31%), Positives = 241/495 (48%), Gaps = 60/495 (12%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTR-ALHIKNPQTK 68
           L FI  F+ +S+  S I  L  + S  +T      + + + L+ S+ +R A   ++   K
Sbjct: 9   LCFILCFSCISVSISEILYLPLTHSLSNTQ-----FTSTHHLLKSTSSRSASRFQHQHQK 63

Query: 69  TTTTTTTTTTTNISSHSYGGYSISLSFGT-PPQIIPFILDTGSHLVWFPCTNHYQCKYC- 126
                    +  +S  S   Y++S +  + PPQ +   LDTGS LVWFPC   ++C  C 
Sbjct: 64  RHLRNRHQVSLPLSPGS--DYTLSFTLNSNPPQHVSLYLDTGSDLVWFPCKP-FECILCE 120

Query: 127 -----SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH-----ESIQCRDCNDEPLATSKNC 176
                +++  P   P+LSS++R + C++  CS  H      +     DC  E + TS   
Sbjct: 121 GKAENTTASTPP--PRLSSTARSVHCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCH 178

Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLP----NRIIPNFLVGCSVLSSRQPAGIAGFGRG 232
           +  CPS+   YG G        +++ LP    +  + NF  GC+  +  +P G+AGFGRG
Sbjct: 179 SFSCPSFYYAYGDGSLVARLYHDSIKLPLATPSLSLHNFTFGCAHTALAEPVGVAGFGRG 238

Query: 233 KTSLPSQLNL------DKFSYCLLSHKF--DDTTRTSSLIL---DNGSSHSDKKTTGLTY 281
             SLP+QL        ++FSYCL+SH F  D     S LIL   D+     +K      Y
Sbjct: 239 VLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVY 298

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
           T  ++NP          +Y VGL  I++G +++        +DR+G+GG +VDSGTTFT 
Sbjct: 299 TSMLDNPK------HPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTM 352

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG- 400
           +   L+  +  EF +++   R Y RA   E  TGL PC+    +   + P L LHF G  
Sbjct: 353 LPASLYNSVVAEFDNRV--GRVYERAKEVEDKTGLGPCYYY--DTVVNIPSLVLHFVGNE 408

Query: 401 AEVTLPVENYF--------AVVGEGSAVCLTVVT---DREASGGPSIILGNFQMQNYYVE 449
           + V LP +NYF         V  +    CL ++    + E +GGP   LGN+Q   + V 
Sbjct: 409 SSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQHGFEVV 468

Query: 450 YDLRNQRLGFKQQLC 464
           YDL  +R+GF ++ C
Sbjct: 469 YDLEQRRVGFARRKC 483


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 146/407 (35%), Positives = 192/407 (47%), Gaps = 40/407 (9%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYC-----SSSKIPS-FIPKLSS 140
           GY I+L+ GTPPQ +   LDTGS L W PC N  + C  C     +  K PS F P  SS
Sbjct: 82  GYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSS 141

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS----KNCTQICPSYLVLYGSG-LTEGI 195
           +S    C +  C  IH        C     + S      C + CPS+   YG G L  GI
Sbjct: 142 TSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGI 201

Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK-FSYCLLSHK 253
              + L    R +P F  GC   + R+P GIAGFGRG  SLPSQL  L+K FS+C L  K
Sbjct: 202 LTRDILKARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFK 261

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ- 312
           F +    SS ++   S+ S   T  L +TP +N P       +   YY+GL  IT+G   
Sbjct: 262 FVNNPNISSPLILGASALSINLTDSLQFTPMLNTP------MYPNSYYIGLESITIGTNI 315

Query: 313 -RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
              +V       D  GNGG +VDSGTT+T     L EP   + ++ +     Y RA   E
Sbjct: 316 TPTQVPLTLRQFDSQGNGGMLVDSGTTYT----HLPEPFYSQLLTTLQSTITYPRATETE 371

Query: 372 ALTGLRPCFDVP----------GEKTGSFPELKLHFKGGAEVTLPVENYFAVV---GEGS 418
           + TG   C+ VP           +    FP +  HF   A + LP  N F  +    +GS
Sbjct: 372 SRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGS 431

Query: 419 AV-CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            V CL      +   GP+ + G+FQ QN  V YDL  +R+GF+   C
Sbjct: 432 VVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score =  197 bits (501), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 140/409 (34%), Positives = 199/409 (48%), Gaps = 46/409 (11%)

Query: 89  YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS-SRLL 145
           Y++S + G   Q  P  L  DTGS LVWFPC   ++C  C     P+  P ++++ S  +
Sbjct: 48  YTLSFNLGPRAQAQPITLYMDTGSDLVWFPCA-PFKCILCEGK--PNASPPVNTTRSVAV 104

Query: 146 GCQNPKCSWIHH-----ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
            C++P CS  H+     +      C  E + TS      CP +   YG G        +T
Sbjct: 105 SCKSPACSAAHNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRDT 164

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKF 254
           L+L +  + NF  GC+  +  +P G+AGFGRG  SLP+QL        ++FSYCL+SH F
Sbjct: 165 LSLSSLFLRNFTFGCAYTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSF 224

Query: 255 DD--TTRTSSLILDNGSSHSDKKTTG-----LTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           D     + S LIL       +++  G       YTP + NP          +Y VGL  I
Sbjct: 225 DSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPK------HPYFYTVGLIGI 278

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           +VG + V        ++  G+GG +VDSGTTFT +    +  + DEF   +   R   RA
Sbjct: 279 SVGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGV--GRVNERA 336

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG-AEVTLPVENYF--------AVVGEGS 418
              E  TGL PC+ +        P L L F GG + V LP +NYF        A  G+  
Sbjct: 337 RKIEEKTGLAPCYYL--NSVAEVPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRR 394

Query: 419 AVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             CL ++    + E SGGP   LGN+Q Q + VEYDL  +R+GF ++ C
Sbjct: 395 VGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 443


>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score =  194 bits (494), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 137/410 (33%), Positives = 195/410 (47%), Gaps = 45/410 (10%)

Query: 89  YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSS-KIPSFIPKLS-SSSRL 144
           Y++S + G   Q  P  L  DTGS LVWFPC   ++C  C      P+  P  + + S  
Sbjct: 70  YTLSFNLGPQAQAQPITLYMDTGSDLVWFPCA-PFKCILCEGKPNEPNASPPTNITQSVA 128

Query: 145 LGCQNPKCSWIHH-----ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
           + C++P CS  H+     +      C  E + TS      CP +   YG G        +
Sbjct: 129 VSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRD 188

Query: 200 TLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHK 253
           TL+L +  + NF  GC+  +  +P G+AGFGRG  SLP+QL        ++FSYCL+SH 
Sbjct: 189 TLSLSSLFLRNFTFGCAHTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHS 248

Query: 254 FDD--TTRTSSLILDNGSSHSDKKTTG----LTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           FD     + S LIL        +K  G      YT  + NP          +Y V L  I
Sbjct: 249 FDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPK------HPYFYTVSLIGI 302

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
            VG + +        ++  G+GG +VDSGTTFT +    +  + DEF  ++   R+  RA
Sbjct: 303 AVGKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRV--GRDNKRA 360

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGS------- 418
              E  TGL PC+ +        P L L F GG  + V LP +NYF    +GS       
Sbjct: 361 RKIEEKTGLAPCYYL--NSVADVPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKR 418

Query: 419 -AVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              CL ++    + + SGGP   LGN+Q Q + VEYDL  +R+GF ++ C
Sbjct: 419 KVGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score =  194 bits (494), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 143/408 (35%), Positives = 193/408 (47%), Gaps = 42/408 (10%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPS------FIPKLSS 140
           GY I+L+ GTPPQ +   +DTGS L W PC N  + C  C+  K  +      F P  SS
Sbjct: 10  GYLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSS 69

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS----KNCTQICPSYLVLYGSG-LTEGI 195
           SS    C +  C+ IH        C     + S      C + CPS+   YG G L  GI
Sbjct: 70  SSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGI 129

Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK-FSYCLLSHK 253
              + L    R +P F  GC   +  +P GIAGFGRG  SLPSQL  L+K FS+C L  K
Sbjct: 130 LTRDILKARTRDVPRFSFGCVTSTYHEPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFK 189

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
           F +    SS ++   S+ S   T  L +TP +N P       +   YY+GL  IT+ G  
Sbjct: 190 FVNNPNISSPLILGASALSINLTDSLQFTPMLNTP------VYPNSYYIGLESITI-GTN 242

Query: 314 VRVWHKYLTL---DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           +      LTL   D  GNGG +VDSGTT+T     L  P   + ++ +     Y RA   
Sbjct: 243 ITPTQVPLTLRQFDSQGNGGMLVDSGTTYT----HLPNPFYSQLLTILQSTITYPRATET 298

Query: 371 EALTGLRPCFDVP----------GEKTGSFPELKLHFKGGAEVTLPVENYFAVV---GEG 417
           E+ TG   C+ VP           +    FP +  +F   A + LP  N F  +    +G
Sbjct: 299 ESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDG 358

Query: 418 SAV-CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           S V CL      + + GP+ + G+FQ QN  V YDL  +R+GF+   C
Sbjct: 359 SVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 406


>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
 gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 141/418 (33%), Positives = 205/418 (49%), Gaps = 51/418 (12%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC-----SSSKIPSFIPKLSSSS 142
           G   +LSF    Q I   LDTGS LVWFPC   ++C  C     ++S   +  PKLS ++
Sbjct: 79  GSDYTLSFTINSQPISLYLDTGSDLVWFPC-QPFECILCEGKAENASLASTPPPKLSKTA 137

Query: 143 RLLGCQNPKCSWIHH-----ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
             + C++  CS +H      +     +C  E +  S      CP +   YG G       
Sbjct: 138 TPVSCKSSACSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYGDGSLIARLY 197

Query: 198 SETLNLP-----NRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFS 246
            +++ LP     N I  NF  GC+  +  +P G+AGFGRG  SLP+QL        ++FS
Sbjct: 198 RDSIRLPLSNQTNLIFNNFTFGCAHTTLAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFS 257

Query: 247 YCLLSHKFD-DTTRTSSLILDNGSSHSDK-------KTTGLTYTPFVNNPSVAERNAFSV 298
           YCL+SH FD D  R  S ++     H +K       K     YT  ++NP    R+ +  
Sbjct: 258 YCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNP----RHPY-- 311

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           +Y VGL  I++G +++        +DR G+GG +VDSGTTFT +   L++ +  EF +++
Sbjct: 312 FYCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRV 371

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYF------ 411
              R   RA   E  TGL PC+        + P + LHF G G+ V LP  NYF      
Sbjct: 372 --GRVNERASVIEENTGLSPCYYF-DNNVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDG 428

Query: 412 --AVVGEGSAVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 +    CL ++    + E SGGP   LGN+Q Q + V YDL N+R+GF ++ C
Sbjct: 429 GHGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQC 486


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 161/500 (32%), Positives = 229/500 (45%), Gaps = 62/500 (12%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRAL 60
           MA  ++    +F+FF  + S+   SI SL              S +N NSL+   LT A 
Sbjct: 1   MAIALNKNITTFLFFLLVNSLVSYSIQSLA-------------SPRNPNSLILG-LTLAS 46

Query: 61  HIKNPQ-TKTTTTTTTTTTTNI------SSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
               P   K +T++    + ++      S     GY ISL+ GTPPQ+I  ++DTGS L 
Sbjct: 47  RASFPTYPKASTSSRKIVSIDVLGAKKPSREVRDGYLISLNIGTPPQVIQVLMDTGSDLT 106

Query: 114 WFPCTN-HYQCKYCSSSK----IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDE 168
           W PC N  + C  C   +    + +F P  SSSS    C +P C  IH        C   
Sbjct: 107 WVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSYRASCASPFCIDIHSSDNPLDTCTVA 166

Query: 169 PLATS----KNCTQICPSYLVLYGS-GLTEGIALSETLNLPN------RIIPNFLVGCSV 217
             + S      C++ CPS+   YG+ G+  GI   +TL +        + IP F  GC  
Sbjct: 167 GCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTRDTLRVNGSSPGVAKEIPKFCFGCVG 226

Query: 218 LSSRQPAGIAGFGRGKTSLPSQLNL--DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
            + R+P GIAGFGRG  S+ SQL      FS+C L+ K+ +    SS ++    + + K 
Sbjct: 227 SAYREPIGIAGFGRGTLSMVSQLGFLQKGFSHCFLAFKYANNPNISSPLVVGDIALTSKD 286

Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG-QRVRVWHKYLTLDRDGNGGTIVD 334
              + +TP +N+P       +  +YYVGL  ITVG      V       D  GNGG  +D
Sbjct: 287 D--MQFTPMLNSP------MYPNFYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMKID 338

Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS----- 389
           SGTT+T     L EP   + +S +    NY R  G E  TG   C+ VP     +     
Sbjct: 339 SGTTYT----HLPEPFYSQVLSILQSTINYPRDTGMEMQTGFDLCYKVPRPNNNTLTSDD 394

Query: 390 -FPELKLHFKGGAEVTLPVENYFAVV---GEGSAV-CLTVVTDREASGGPSIILGNFQMQ 444
             P +  HF     + LP  N+F  V   G  + V CL   +  +   GP+ + G+FQ Q
Sbjct: 395 LLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMFQSTDDGDDGPAGVFGSFQQQ 454

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
           N  V YDL  +R+GF+   C
Sbjct: 455 NVEVVYDLEKERIGFQPMDC 474


>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
 gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
          Length = 504

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 144/414 (34%), Positives = 192/414 (46%), Gaps = 47/414 (11%)

Query: 89  YSISLSFGTPPQIIP--FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS--SSRL 144
           Y++SLS G      P    LDTGS LVWFPC   + C  C     P     L     SR 
Sbjct: 90  YTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPGRSGPLPPPPDSRR 148

Query: 145 LGCQNPKCSWIHHES-----IQCRDCNDEPLAT-SKNCTQICPSYLVLYGSG-----LTE 193
           + C +P CS  H  +          C  E + T S   +  CP     YG G     L  
Sbjct: 149 IPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRR 208

Query: 194 G-IALSETLNLPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYC 248
           G +AL         + + NF   C+  +  +P G+AGFGRG  SLP QL+     +FSYC
Sbjct: 209 GRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYC 268

Query: 249 LLSHKF--DDTTRTSSLILDNG---SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
           L+SH F  D   R S LIL      +  +  +T G  YTP ++NP          +Y V 
Sbjct: 269 LVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHNPK------HPYFYSVA 322

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           L  ++VG  R++   +   +DR GNGG +VDSGTTFT +  E++  +A+ F   M     
Sbjct: 323 LEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGF 382

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---------AVV 414
                 AE  TGL PC+       G  P L LHF+G A V LP  NYF         A  
Sbjct: 383 ARAER-AEEQTGLTPCYRYAASDRG-VPPLALHFRGNATVALPRRNYFMGFKSEDAGAGT 440

Query: 415 GEGSAVCLTVVTDREASG----GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +    CL ++   +ASG    GP+  LGNFQ Q + V YD+   R+GF ++ C
Sbjct: 441 RKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 138/409 (33%), Positives = 190/409 (46%), Gaps = 42/409 (10%)

Query: 89  YSISLSFG--TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-------SFIPKLS 139
           Y++SLS G  +    +   LDTGS LVWFPC   + C  C     P       + +P   
Sbjct: 83  YTLSLSVGPLSTANPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPPGNNNSSNPLPP-P 140

Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCN------DEPLATSKNCTQICPSYLVLYGSG-LT 192
           + SR + C +P CS  H  +     C       D+    S   +  CP     YG G L 
Sbjct: 141 TDSRRIPCASPFCSAAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLV 200

Query: 193 EGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYC 248
             +         +  + NF   C+  +  +P G+AGFGRG  SLP+QL       +FSYC
Sbjct: 201 ARLRRGRVGIAASVAVENFTFACAHTALGEPVGVAGFGRGPLSLPAQLAPAALSGRFSYC 260

Query: 249 LLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           L++H F  D   R S LIL           TG+ YTP ++NP          +Y V L  
Sbjct: 261 LVAHSFRADRPIRPSPLILGRSPGEDPASETGIVYTPLLHNPK------HPYFYSVALEA 314

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           ++VGG R+    +   + R G+GG +VDSGTTFT +  E +  +A+EF   M   R    
Sbjct: 315 VSVGGTRIPARPELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERA 374

Query: 367 ALGAEALTGLRPCF----DVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVV--GEG 417
               +  TGL PC+    D    + GS    P L +HF+G A V LP  NYF      E 
Sbjct: 375 EAAEDQ-TGLAPCYYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEER 433

Query: 418 SAV-CLTVVTDREAS-GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             V CL ++   E   GGP+  LGNFQ Q + V YD+   R+GF ++ C
Sbjct: 434 RRVGCLMLMNGGEDDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
 gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 157/498 (31%), Positives = 232/498 (46%), Gaps = 51/498 (10%)

Query: 3   SYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHI 62
           SY   LC S  F    +S   +    LT SLS+     +    ++ ++   +   R  H 
Sbjct: 4   SYSLLLCFSLCFSHFFISTSQTLFLPLTHSLSKTQFTSTHHLIKSTSTSSITRFRRHHHQ 63

Query: 63  KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ 122
           KN          T     +S     G   +LSF    Q I   LDTGS LVWFPC   ++
Sbjct: 64  KN----------THNHRQVSLPLSPGSDYTLSFTLDSQPIFLYLDTGSDLVWFPC-QPFE 112

Query: 123 CKYC-----SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH-----ESIQCRDCNDEPLAT 172
           C  C     ++S   +  PKLS ++  + C++  CS  H      +     +C  E + T
Sbjct: 113 CILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIET 172

Query: 173 SKNCTQICPSYLVLYGSGLTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPAGIA 227
           S      CP +   YG G        ++++LP     N I+ NF  GC+  +  +P G+A
Sbjct: 173 SDCQKHSCPQFYYAYGDGSLIARLYRDSISLPLSNPTNLIVNNFTFGCAHTALAEPIGVA 232

Query: 228 GFGRGKTSLPSQLNL------DKFSYCLLSHKFD-DTTRTSSLILDNGSSHSDK--KTTG 278
           GFGRG  SLP+QL        ++FSYCL+SH FD D  R  S ++     H +K  +  G
Sbjct: 233 GFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNG 292

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           +    FV   S+ +      +Y VGL  I++G +++        +D +G+GG +VDSGTT
Sbjct: 293 VNKPRFVYT-SMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTT 351

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
           FT +   L+  +  EF +++   R   RA   E  TGL PC+        + P + LHF 
Sbjct: 352 FTMLPASLYGSVVAEFENRV--GRVNERARVIEEDTGLSPCYYF-DNNVVNVPSVVLHFV 408

Query: 399 G-GAEVTLPVENYF--------AVVGEGSAVCLTVVT---DREASGGPSIILGNFQMQNY 446
           G G+ V LP  NYF            +    CL ++    + E SGGP   LGN+Q Q +
Sbjct: 409 GNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGF 468

Query: 447 YVEYDLRNQRLGFKQQLC 464
            V YDL N+R+GF ++ C
Sbjct: 469 EVVYDLENKRVGFARRQC 486


>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
 gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 157/498 (31%), Positives = 232/498 (46%), Gaps = 51/498 (10%)

Query: 3   SYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHI 62
           SY   LC S  F    +S   +    LT SLS+     +    ++ ++   +   R  H 
Sbjct: 4   SYSLLLCFSLCFSHFFISTSQTLFLPLTHSLSKTQFTSTHHLIKSTSTSSITRFRRHHHQ 63

Query: 63  KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ 122
           KN          T     +S     G   +LSF    Q I   LDTGS LVWFPC   ++
Sbjct: 64  KN----------THNHRQVSLPLSPGSDYTLSFTLDSQPIFLYLDTGSDLVWFPC-QPFE 112

Query: 123 CKYC-----SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH-----ESIQCRDCNDEPLAT 172
           C  C     ++S   +  PKLS ++  + C++  CS  H      +     +C  E + T
Sbjct: 113 CILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIET 172

Query: 173 SKNCTQICPSYLVLYGSGLTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPAGIA 227
           S      CP +   YG G        ++++LP     N I+ NF  GC+  +  +P G+A
Sbjct: 173 SDCQKHSCPQFYYAYGDGSLIARLYRDSISLPLSNPTNLIVNNFTFGCAHTALAEPIGVA 232

Query: 228 GFGRGKTSLPSQLNL------DKFSYCLLSHKFD-DTTRTSSLILDNGSSHSDK--KTTG 278
           GFGRG  SLP+QL        ++FSYCL+SH FD D  R  S ++     H +K  +  G
Sbjct: 233 GFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNG 292

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           +    FV   S+ +      +Y VGL  I++G +++        +D +G+GG +VDSGTT
Sbjct: 293 VNKPRFVYT-SMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTT 351

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
           FT +   L+  +  EF +++   R   RA   E  TGL PC+        + P + LHF 
Sbjct: 352 FTMLPASLYGSVVAEFENRV--GRVNERARVIEEDTGLSPCYYF-DNNVVNVPSVVLHFV 408

Query: 399 G-GAEVTLPVENYF--------AVVGEGSAVCLTVVT---DREASGGPSIILGNFQMQNY 446
           G G+ V LP  NYF            +    CL ++    + E SGGP   LGN+Q Q +
Sbjct: 409 GNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGEEAELSGGPGATLGNYQQQGF 468

Query: 447 YVEYDLRNQRLGFKQQLC 464
            V YDL N+R+GF ++ C
Sbjct: 469 EVVYDLENKRVGFARRQC 486


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 136/399 (34%), Positives = 191/399 (47%), Gaps = 53/399 (13%)

Query: 89  YSISLSFGTPPQI--IPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-----SFIPKLSSS 141
           Y++SLS G P     +   LDTGS LVWFPC   + C  C     P     S +P     
Sbjct: 88  YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145

Query: 142 SRLLGCQNPKCSWIHHES-----IQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGI 195
           SR + C +P CS  H  +          C  + + T    +  CP     YG G L   +
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205

Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
                    +  + NF   C+  +  +P G+AGFGRG  SLP+QL          +    
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQL----------APSLS 255

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
            +T  +++    G+S +D       YTP ++NP          +Y V L  ++VGG+R++
Sbjct: 256 GSTDAAAI----GASETD-----FVYTPLLHNPK------HPYFYSVALEAVSVGGKRIQ 300

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
              +   +DRDGNGG +VDSGTTFT +  + F  +ADEF   M   R      GAEA TG
Sbjct: 301 AQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAE-GAEAQTG 359

Query: 376 LRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAV-CLTVVT----- 426
           L PC+   P ++  + P + LHF+G A V LP  NYF      EG +V CL ++      
Sbjct: 360 LAPCYHYSPSDR--AVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNN 417

Query: 427 -DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            D E  GGP+  LGNFQ Q + V YD+   R+GF ++ C
Sbjct: 418 DDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 456


>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
 gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
          Length = 508

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 141/415 (33%), Positives = 187/415 (45%), Gaps = 49/415 (11%)

Query: 89  YSISLSFGTPPQIIP--FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS------ 140
           Y++SLS G      P    LDTGS LVWFPC   + C  C     PS     S+      
Sbjct: 94  YTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPSGGHSSSAPLPLPP 152

Query: 141 --SSRLLGCQNPKCSWIHHESIQCRDC-------NDEPLATSKNCTQICPSYLVLYGSGL 191
              SR + C +P CS  H  +     C        D    + +  +  CP     YG G 
Sbjct: 153 PPDSRRVPCASPLCSAAHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGS 212

Query: 192 TEGIALSETLNLPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSY 247
                    + L   + + NF   C+  +  +P G+AGFGRG  SLP QL      +FSY
Sbjct: 213 LVAHLRRGRVGLGASVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLAPQLSGRFSY 272

Query: 248 CLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
           CL+SH F  D   R S LIL   S  +  +T G  YTP ++NP          +Y V L 
Sbjct: 273 CLVSHSFRADRLIRPSPLILGR-SPDAAAETGGFVYTPLLHNPK------HPYFYSVALE 325

Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
            ++VG  R++   +   +DR GNGG +VDSGTTFT +  E +  +A+ F   M       
Sbjct: 326 AVSVGATRIQARPELARVDRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFAR 385

Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF-------AVVGEG- 417
               AE  TGL PC+       G  P L LHF+G A V LP  NYF          G G 
Sbjct: 386 AER-AEEQTGLTPCYHYAASDRG-VPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGR 443

Query: 418 --SAVCLTVVTDREASG------GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                CL ++   + SG      GP+  LGNFQ Q + V YD+   R+GF ++ C
Sbjct: 444 KDDVGCLMLMNGGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  188 bits (477), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 132/386 (34%), Positives = 180/386 (46%), Gaps = 44/386 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + +S GTP      I+DTGS LVW  C     C  C +   P F P  SS+   L 
Sbjct: 116 GEFLMDMSIGTPALAYAAIVDTGSDLVWTQCK---PCVECFNQSTPVFDPSSSSTYSTLP 172

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +  CS             D P +T  +  + C  Y   YG +  T+G+  +ET  L  
Sbjct: 173 CSSSLCS-------------DLPTSTCTSAAKDC-GYTYTYGDASSTQGVLAAETFTLAK 218

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +P    GC   +      Q AG+ G GRG  SL SQL L KFSYCL S   DDT+++ 
Sbjct: 219 TKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTS--LDDTSKSP 276

Query: 262 SLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            L+    +  +D  +   +  TP + NPS         +YYV L+ +TVG  R+ +    
Sbjct: 277 LLLGSLAAISTDTASAAAIQTTPLIKNPSQPS------FYYVTLKALTVGSTRIPLPGSA 330

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
             +  DG GG IVDSGT+ T++  + + PL   F +QM         +   +  GL  CF
Sbjct: 331 FAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQM------KLPVADGSAVGLDLCF 384

Query: 381 DVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
             P  G      P+L LHF GGA++ LP ENY  +     A+CLTV+  R  S     I+
Sbjct: 385 KAPASGVDDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRGLS-----II 439

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GNFQ QN    YD+    L F    C
Sbjct: 440 GNFQQQNIQFVYDVDKDTLSFAPVQC 465


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 138/408 (33%), Positives = 188/408 (46%), Gaps = 45/408 (11%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           GY ISL+ GTPP++I   +DTGS L W PC N  + C  C+  +    +   S S     
Sbjct: 28  GYLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSS 87

Query: 147 ----CQNPKCSWIHHESIQCRDCNDEPLATSK----NCTQICPSYLVLYGS-GLTEGIAL 197
               C +P CS +H        C     + S      C + CPS+   YG+ G+  G   
Sbjct: 88  LRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLT 147

Query: 198 SETLNLPN------RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL--DKFSYCL 249
            +TL          R +PNF  GC   + R+P GIAGFGRG  SLPSQL      FS+C 
Sbjct: 148 RDTLTTHGSSPSFTREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFSHCF 207

Query: 250 LSHKFDDTTRTSS--LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           L  KF +    SS  +I D   S +D     L +T  + NP       +  YYY+GL  I
Sbjct: 208 LGFKFANNPNISSPLVIGDLAISSNDH----LQFTSLLKNP------MYPNYYYIGLEAI 257

Query: 308 TVG-GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           TVG    ++V       D  GNGG I+DSGTT+T     L  P   + +S +     Y R
Sbjct: 258 TVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYT----HLPGPFYTQLLSMLQSIITYPR 313

Query: 367 ALGAEALTGLRPCFDVP------GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-- 418
           A   EA TG   C+ +P       +     P +  HF     + LP  N+F  +G  S  
Sbjct: 314 AQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNS 373

Query: 419 --AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               CL +    ++  GP+ + G+FQ QN  V YDL  +R+GF+   C
Sbjct: 374 TVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score =  187 bits (476), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 138/408 (33%), Positives = 188/408 (46%), Gaps = 45/408 (11%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           GY ISL+ GTPP++I   +DTGS L W PC N  + C  C+  +    +   S S     
Sbjct: 11  GYLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSS 70

Query: 147 ----CQNPKCSWIHHESIQCRDCNDEPLATSK----NCTQICPSYLVLYGS-GLTEGIAL 197
               C +P CS +H        C     + S      C + CPS+   YG+ G+  G   
Sbjct: 71  LRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLT 130

Query: 198 SETLNLPN------RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL--DKFSYCL 249
            +TL          R +PNF  GC   + R+P GIAGFGRG  SLPSQL      FS+C 
Sbjct: 131 RDTLTTHGSSPSFTREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFSHCF 190

Query: 250 LSHKFDDTTRTSS--LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           L  KF +    SS  +I D   S +D     L +T  + NP       +  YYY+GL  I
Sbjct: 191 LGFKFANNPNISSPLVIGDLAISSNDH----LQFTSLLKNP------MYPNYYYIGLEAI 240

Query: 308 TVG-GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           TVG    ++V       D  GNGG I+DSGTT+T     L  P   + +S +     Y R
Sbjct: 241 TVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYT----HLPGPFYTQLLSMLQSIITYPR 296

Query: 367 ALGAEALTGLRPCFDVP------GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-- 418
           A   EA TG   C+ +P       +     P +  HF     + LP  N+F  +G  S  
Sbjct: 297 AQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNS 356

Query: 419 --AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               CL +    ++  GP+ + G+FQ QN  V YDL  +R+GF+   C
Sbjct: 357 TVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score =  187 bits (475), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 138/410 (33%), Positives = 200/410 (48%), Gaps = 48/410 (11%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS------- 141
           Y++S + G   Q I   +DTGS LVWFPCT  + C  C         PKL+S        
Sbjct: 75  YTLSFNLGPHSQPITLYMDTGSDLVWFPCTP-FNCILCE------LKPKLTSDPSPPTNI 127

Query: 142 --SRLLGCQNPKCSWIHHESIQCRDCNDE--PLAT--SKNCTQI-CPSYLVLYGSGLTEG 194
             S  + C +  CS  H  +     C     PL +  +K+C    CP +   YG G    
Sbjct: 128 SHSTPISCNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLIA 187

Query: 195 IALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYC 248
               +TL+L    + NF  GC+  +  +P G+AGFGRG  SLP+QL        ++FSYC
Sbjct: 188 SLYRDTLSLSTLQLTNFTFGCAHTTFSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYC 247

Query: 249 LLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           L+SH F  +   + S LIL  G  + +K++ G     FV   S+ E    S +Y VGL+ 
Sbjct: 248 LVSHSFRSERIRKPSPLIL--GRYNDEKQSNGDEVVEFVYT-SMLENPKHSYFYTVGLKG 304

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I+VG + V        +++ G+GG +VDSGTTFT +  + +  + + F  +  K+    R
Sbjct: 305 ISVGKKTVPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNR--R 362

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYFAVVGEGS------- 418
           A   E  TGL PC+ +        P + L F G  + V LP +NYF    +G        
Sbjct: 363 APEIEQKTGLSPCYYL--NTAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKE 420

Query: 419 -AVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              CL  +    + E SGGP  +LGN+Q Q + VEYDL  +R+GF ++ C
Sbjct: 421 RVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 142/412 (34%), Positives = 203/412 (49%), Gaps = 50/412 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-KIPSFIPKLSSSSRLL 145
           G Y++S + G+    I   +DTGS LVWFPC+  ++C  C    KI S +PK++++  + 
Sbjct: 74  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSP-FECILCEGKPKIQSPLPKIANNKSVS 132

Query: 146 GCQNPKCSWIHHESIQCRD------CNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
                 CS  H  S+          C  E +  S+  +  CP +   YG G        +
Sbjct: 133 CSAA-ACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRD 191

Query: 200 TLNLPNRI------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSY 247
           +L+LP         + NF  GC+  +  +P G+AGFGRG  S+PSQL        ++FSY
Sbjct: 192 SLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSY 251

Query: 248 CLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
           CL+SH F  D   R S LIL  G  ++ +  T   YT  + NP          +Y VGL 
Sbjct: 252 CLVSHSFAADRVRRPSPLIL--GRYYTGE--TEFIYTSLLENPK------HPYFYSVGLA 301

Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
            I+VG  R+        +D  G+GG +VDSGTTFT +   L+E +  EF ++  K  N  
Sbjct: 302 GISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRA 361

Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYF--------AVVGE 416
           R +  E  TGL PC+    E +   P + LHF G  + V LP +NYF         VVG 
Sbjct: 362 RRI--EENTGLSPCYYY--ENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGR 417

Query: 417 GSAV-CLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              V CL ++    + E +GGP   LGN+Q Q + V YDL   R+GF ++ C
Sbjct: 418 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 469


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 158/487 (32%), Positives = 224/487 (45%), Gaps = 58/487 (11%)

Query: 11  SFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVS--SSLTRALHIKNPQTK 68
           +F+FF  + S+   SI SL                +N NSL+   +  +RA    +P+  
Sbjct: 10  TFLFFLLVNSLLFYSIQSLARP-------------RNPNSLILGLTPASRASLPTHPKAS 56

Query: 69  TTTTTTTTTTTNISS---HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCK 124
           T++    T   ++         GY ISLS GTPPQ+I   +DTGS L W PC N  + C 
Sbjct: 57  TSSRKKLTDVLDMMEPLREVRDGYLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCI 116

Query: 125 YCSSSK----IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS----KNC 176
            C + +    + SF P  SSSS    C +P C  +H        C     + S      C
Sbjct: 117 ECDNYRNNRMMASFSPSHSSSSHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATC 176

Query: 177 TQICPSYLVLYGS-GLTEGIALSETLNLPNR------IIPNFLVGCSVLSSRQPAGIAGF 229
           +  CP +   YG+ G+  G    +TL +  R       IP F  GC   S R+P GIAGF
Sbjct: 177 SWPCPPFAYTYGAGGVVTGTLTRDTLRVHGRNLGVTQEIPRFCFGCVASSYREPIGIAGF 236

Query: 230 GRGKTSLPSQLNLDK--FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
           GRG  SLPSQL   +  FS+C L+ K+ +    SS ++    + + K    + +TP + +
Sbjct: 237 GRGALSLPSQLGFLRKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDD--MQFTPMLKS 294

Query: 288 PSVAERNAFSVYYYVGLRRITVGG-QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
           P       +  YYYVGL  ITVG      V       D  GNGG +VDSGTT+T     L
Sbjct: 295 P------MYPNYYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYT----HL 344

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK----TGS-FPELKLHFKGGA 401
            EP   + +S +    NY RA   E  TG   C+ VP +     TG   P +  HF   A
Sbjct: 345 PEPFYSQVLSVLQSIINYPRATDMEMRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNA 404

Query: 402 EVTLPVENYFAVVGEGS----AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
            + L   ++F  +   S      CL   +  +   GP+ +LG+FQ Q+  V YD+  +R+
Sbjct: 405 SLVLSRGSHFYAMSAPSNSTVVKCLLFQSMDDGDYGPAGVLGSFQQQDVEVVYDMEKERI 464

Query: 458 GFKQQLC 464
           GF+   C
Sbjct: 465 GFRPMDC 471


>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
          Length = 503

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 142/413 (34%), Positives = 191/413 (46%), Gaps = 46/413 (11%)

Query: 89  YSISLSFGTPPQIIP--FILDTGSHLVWFPCTNHYQCKYCSS--SKIPSFIPKLSSSSRL 144
           Y++SLS G      P    LDTGS LVWFPC   + C  C    +            SR 
Sbjct: 90  YTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPGRLGPLPPPPDSRR 148

Query: 145 LGCQNPKCSWIHHES-----IQCRDCNDEPLAT-SKNCTQICPSYLVLYGSG-----LTE 193
           + C +P CS  H  +          C  E + T S   +  CP     YG G     L  
Sbjct: 149 IPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRR 208

Query: 194 G-IALSETLNLPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYC 248
           G +AL         + + NF   C+  +  +P G+AGFGRG  SLP QL+     +FSYC
Sbjct: 209 GRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYC 268

Query: 249 LLSHKF--DDTTRTSSLILDNG--SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
           L+SH F  D   R S LIL      + +  +T G  YTP ++NP          +Y V L
Sbjct: 269 LVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPK------HPYFYSVAL 322

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
             ++VG  R++   +   +DR GNGG +VDSGTTFT +  E++  +A+ F   M      
Sbjct: 323 EAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFA 382

Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---------AVVG 415
                AE  TGL PC+       G  P L LHF+G A V LP  NYF         A   
Sbjct: 383 RAER-AEEQTGLTPCYRYAASDRG-VPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTR 440

Query: 416 EGSAVCLTVVTDREASG----GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +    CL ++   +ASG    GP+  LGNFQ Q + V YD+   R+GF ++ C
Sbjct: 441 KDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 493


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score =  184 bits (467), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 147/424 (34%), Positives = 193/424 (45%), Gaps = 59/424 (13%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYC-------SSSKIPSFIPKLS 139
           GY +SLS GTPPQ++   +DTGS L W PC N  + C+ C       S  ++ +F+P  S
Sbjct: 20  GYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHS 79

Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK----NCTQICPSYLVLYG-SGLTEG 194
           S+S    C +  C  IH        C     + +      C + CPS+   YG SG+  G
Sbjct: 80  STSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTG 139

Query: 195 IALSETL---------NLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDK- 244
               + L         N  N+ IP F  GC   + R+P GIAGFGRG  SLP QL     
Sbjct: 140 SLTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGFSHK 199

Query: 245 -FSYCLLSHKFDDTTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
            FS+C L  KF +    SS LIL N +  S  K   L +TP + +P       +  YYY+
Sbjct: 200 GFSHCFLPFKFSNNPNFSSPLILGNLAISS--KDENLQFTPLLKSP------MYPNYYYI 251

Query: 303 GLRRITVGGQ----RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           GL  IT+G      R  V  K   +D  GNGG ++DSGTT+T     L EPL  + +S +
Sbjct: 252 GLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYT----HLPEPLYSQLISNL 307

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGS-------FPELKLHFKGGAEVTLPVENYF 411
                Y RA   E  TG   C+ VP +   S        P +  HF     V LP  N F
Sbjct: 308 ELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNF 367

Query: 412 ----AVVGEGSAVCL-------TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
               A +      CL           +     GP+ I G+FQ QN  V YDL  +RLGF+
Sbjct: 368 YAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQ 427

Query: 461 QQLC 464
              C
Sbjct: 428 PMDC 431


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 133/406 (32%), Positives = 185/406 (45%), Gaps = 41/406 (10%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLL- 145
           GY ISL+ GTPPQ+I   +DTGS L W PC N  + C  C   +    +   S S     
Sbjct: 11  GYLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFSPSHSSSS 70

Query: 146 ---GCQNPKCSWIHHESIQCRDCNDEPLATS----KNCTQICPSYLVLYGS-GLTEGIAL 197
               C +P C+ IH        C     + S      C + CPS+   YG+ G+  G   
Sbjct: 71  YRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLT 130

Query: 198 SETLNL---PNRI---IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDK--FSYCL 249
            +TL +   P R+   IP F  GC   +  +P GIAGF RG  S PSQL L K  FS+C 
Sbjct: 131 RDTLRVHEGPARVTKDIPKFCFGCVGSTYHEPIGIAGFVRGTLSFPSQLGLLKKGFSHCF 190

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           L+ K+ +    SS ++   ++ S K    + +TP + +P       +  YYY+GL  ITV
Sbjct: 191 LAFKYANNPNISSPLVIGDTALSSKDN--MQFTPMLKSP------MYPNYYYIGLEAITV 242

Query: 310 GG-QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           G      V       D  GNGG ++DSGTT+T     L EP   + +S       Y RA 
Sbjct: 243 GNVSATTVPLNLREFDSQGNGGMLIDSGTTYT----HLPEPFYSQLLSIFKAIITYPRAT 298

Query: 369 GAEALTGLRPCFDVP------GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS---- 418
             E   G   C+ VP       +    FP +  HF       LP  N+F  +   S    
Sbjct: 299 EVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTV 358

Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             CL   +  ++  GP+ + G+FQ QN  + YDL  +R+GF+   C
Sbjct: 359 VKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDC 404


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 137/387 (35%), Positives = 179/387 (46%), Gaps = 47/387 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + +S GTP      I+DTGS LVW  C     C  C     P F P  SS+   + 
Sbjct: 103 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVP 159

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +  CS                L TSK  +     Y   YG S  T+G+  +ET  L  
Sbjct: 160 CSSASCS---------------DLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK 204

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +P  + GC   +      Q AG+ G GRG  SL SQL LDKFSYCL S    D T  S
Sbjct: 205 SKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSL---DDTNNS 261

Query: 262 SLILDN--GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            L+L +  G S +    + +  TP + NPS         +YYV L+ ITVG  R+ +   
Sbjct: 262 PLLLGSLAGISEASAAASSVQTTPLIKNPSQPS------FYYVSLKAITVGSTRISLPSS 315

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              +  DG GG IVDSGT+ T++  + +  L   F +QM        A G+    GL  C
Sbjct: 316 AFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA----LPAADGSG--VGLDLC 369

Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
           F  P  G      P L  HF GGA++ LP ENY  + G   A+CLTV+  R  S     I
Sbjct: 370 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----I 424

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +GNFQ QN+   YD+ +  L F    C
Sbjct: 425 IGNFQQQNFQFVYDVGHDTLSFAPVQC 451


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 137/387 (35%), Positives = 179/387 (46%), Gaps = 47/387 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + +S GTP      I+DTGS LVW  C     C  C     P F P  SS+   + 
Sbjct: 93  GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVP 149

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +  CS                L TSK  +     Y   YG S  T+G+  +ET  L  
Sbjct: 150 CSSASCS---------------DLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK 194

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +P  + GC   +      Q AG+ G GRG  SL SQL LDKFSYCL S    D T  S
Sbjct: 195 SKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSL---DDTNNS 251

Query: 262 SLILDN--GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            L+L +  G S +    + +  TP + NPS         +YYV L+ ITVG  R+ +   
Sbjct: 252 PLLLGSLAGISEASAAASSVQTTPLIKNPSQPS------FYYVSLKAITVGSTRISLPSS 305

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              +  DG GG IVDSGT+ T++  + +  L   F +QM        A G+    GL  C
Sbjct: 306 AFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA----LPAADGSG--VGLDLC 359

Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
           F  P  G      P L  HF GGA++ LP ENY  + G   A+CLTV+  R  S     I
Sbjct: 360 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----I 414

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +GNFQ QN+   YD+ +  L F    C
Sbjct: 415 IGNFQQQNFQFVYDVGHDTLSFAPVQC 441


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 137/387 (35%), Positives = 179/387 (46%), Gaps = 47/387 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + +S GTP      I+DTGS LVW  C     C  C     P F P  SS+   + 
Sbjct: 72  GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVP 128

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +  CS                L TSK  +     Y   YG S  T+G+  +ET  L  
Sbjct: 129 CSSASCS---------------DLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK 173

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +P  + GC   +      Q AG+ G GRG  SL SQL LDKFSYCL S    D T  S
Sbjct: 174 SKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSL---DDTNNS 230

Query: 262 SLILDN--GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            L+L +  G S +    + +  TP + NPS         +YYV L+ ITVG  R+ +   
Sbjct: 231 PLLLGSLAGISEASAAASSVQTTPLIKNPSQPS------FYYVSLKAITVGSTRISLPSS 284

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              +  DG GG IVDSGT+ T++  + +  L   F +QM        A G+    GL  C
Sbjct: 285 AFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA----LPAADGSG--VGLDLC 338

Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
           F  P  G      P L  HF GGA++ LP ENY  + G   A+CLTV+  R  S     I
Sbjct: 339 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----I 393

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +GNFQ QN+   YD+ +  L F    C
Sbjct: 394 IGNFQQQNFQFVYDVGHDTLSFAPVQC 420


>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
 gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
          Length = 439

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 140/423 (33%), Positives = 197/423 (46%), Gaps = 55/423 (13%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYCSSS--KIPSFIPKLSSSSR 143
           GY +SL+ GTPPQ+    LDTGS L W PC  ++ YQC  C SS    P+F+P  S+S+ 
Sbjct: 24  GYLLSLNLGTPPQVFQVYLDTGSDLTWVPCGSSSSYQCLDCGSSVKPTPTFLPSESTSNT 83

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDE----PLATSKNCTQICPSYLVLYGSG-LTEGIALS 198
              C +  C  +H    +   C       P  T   C + CP +   YG G L  G    
Sbjct: 84  RDLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALVLGSLSR 143

Query: 199 ETLNLPNR-------------IIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL--D 243
           +++ L                  P F  GC   S R+P GIAGFGRG  SLPSQL     
Sbjct: 144 DSVTLHGSTHGSGAGAGPLPVAFPGFGFGCVGSSIREPLGIAGFGRGALSLPSQLGFLGK 203

Query: 244 KFSYCLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
            FS+C L  +F  +   TS L++ + +  S     G  +TP + + +      +  +YYV
Sbjct: 204 GFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLTSAT------YPNFYYV 257

Query: 303 GLRRITV----GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL-ADEFVSQ 357
           GL  + +    GG  +        +D  GNGG +VD+GTT+T    +L +P  A    S 
Sbjct: 258 GLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYT----QLPDPFYASVLASL 313

Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKT----GSFPELKLHFKGGAEVTLP-VENYFA 412
           +     Y R+   EA TG   CF VP  +        P + LH  GGA + LP + +Y+ 
Sbjct: 314 ISAAPPYERSRDLEARTGFDLCFKVPCARAPCADDELPPITLHLAGGARLALPKLSSYYP 373

Query: 413 VVG-EGSAVCLTVVTDR---------EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
           V     S V   ++  R          + GGP+ +LG+FQMQN  V YDL   R+GF+ +
Sbjct: 374 VTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYDLAAGRVGFRPR 433

Query: 463 LCK 465
            C 
Sbjct: 434 DCA 436


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 135/379 (35%), Positives = 174/379 (45%), Gaps = 47/379 (12%)

Query: 95  FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
            GTP      I+DTGS LVW  C     C  C     P F P  SS+   + C +  CS 
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVPCSSASCS- 228

Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRIIPNFLV 213
                          L TSK  +     Y   YG S  T+G+  +ET  L    +P  + 
Sbjct: 229 --------------DLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVF 274

Query: 214 GCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN-- 267
           GC   +      Q AG+ G GRG  SL SQL LDKFSYCL S    D T  S L+L +  
Sbjct: 275 GCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSL---DDTNNSPLLLGSLA 331

Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
           G S +    + +  TP + NPS         +YYV L+ ITVG  R+ +      +  DG
Sbjct: 332 GISEASAAASSVQTTPLIKNPSQPS------FYYVSLKAITVGSTRISLPSSAFAVQDDG 385

Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP--GE 385
            GG IVDSGT+ T++  + +  L   F +QM        A G+    GL  CF  P  G 
Sbjct: 386 TGGVIVDSGTSITYLEVQGYRALKKAFAAQMA----LPAADGSG--VGLDLCFRAPAKGV 439

Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQN 445
                P L  HF GGA++ LP ENY  + G   A+CLTV+  R  S     I+GNFQ QN
Sbjct: 440 DQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----IIGNFQQQN 494

Query: 446 YYVEYDLRNQRLGFKQQLC 464
           +   YD+ +  L F    C
Sbjct: 495 FQFVYDVGHDTLSFAPVQC 513


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 147/475 (30%), Positives = 214/475 (45%), Gaps = 56/475 (11%)

Query: 10  LSFIFFFTLLSIFPSSITS-LTFSLSRFHTNPSQD--SYQNLNSLVSSSLTRALHIKNPQ 66
           L++   FTLL    ++ T+ LT      H +  +    ++ L+ +   S  RA  +    
Sbjct: 10  LAYALIFTLLFTAAATPTAGLTMRADLTHVDKGRGFTRWERLSRMAVRSRARAASLYQRG 69

Query: 67  TKTTTTTTTTTTTNISSHSYGGYSISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKY 125
                  T T        S G Y I  + GTP PQ +   +DTGS LVW  CT    C  
Sbjct: 70  GHYGQPVTATAVP-----SSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCT---PCPV 121

Query: 126 CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLV 185
           C     P F P +SS+ R + C +P C          R  +   ++     T  C  YL 
Sbjct: 122 CFDQPFPLFDPSVSSTFRAVACPDPIC----------RPSSGLSVSACALKTFRC-FYLC 170

Query: 186 LYGS-GLTEGIALSETLNL--------PNRIIPNFLVGCS-----VLSSRQPAGIAGFGR 231
            YG   +T G    +T           P   +     GC      V +S + +GIAGFGR
Sbjct: 171 SYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNE-SGIAGFGR 229

Query: 232 GKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSV 290
           G  SLPSQL + +FSYCL SH   ++ +TS++ L    +     ++G    TP +++PS 
Sbjct: 230 GPLSLPSQLRVGRFSYCLTSHDETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPS- 288

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
                F  +YY+ L  ITVG  R+ V      L +DG+GGT++DSGT  T     +FE L
Sbjct: 289 -----FPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQL 343

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVEN 409
            +EFV+Q+   R    +     L     CF  P G K    P+L  H    A++ LP EN
Sbjct: 344 KNEFVAQLPLPRYDNTSEVGNLL-----CFQRPKGGKQVPVPKLIFHL-ASADMDLPREN 397

Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           Y     +   +CL ++   E      +++GNFQ QN ++ YD+ N +L F    C
Sbjct: 398 YIPEDTDSGVMCL-MINGAEVD---MVLIGNFQQQNMHIVYDVENSKLLFASAQC 448


>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
          Length = 429

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 146/417 (35%), Positives = 201/417 (48%), Gaps = 55/417 (13%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYC----SSSKIPSFIPKLSSS 141
           GY +SL+ G PPQ+    LDTGS L W PC   + YQC  C    S+SK         SS
Sbjct: 24  GYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSPSQSS 83

Query: 142 SRLLG-CQNPKCSWIH-----HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEG 194
           S +   C +  C  IH     H+      C   P   S  CT+ CP +   YG G L  G
Sbjct: 84  SNMKELCGSRFCVDIHSSDNSHDPCAAVGCA-IPSFMSGLCTRPCPPFSYTYGGGALVLG 142

Query: 195 IALSETLNLPNRI--------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK- 244
               + + L   I        +P F  GC   S R+P GIAGFG+G  SLPSQL  LDK 
Sbjct: 143 SLAKDIVTLHGSIFGIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILSLPSQLGFLDKG 202

Query: 245 FSYCLLSHKFD-DTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
           FS+C L  +F  +   TSSLI+ D   S  D       +TP +   S+   N    +YY+
Sbjct: 203 FSHCFLGFRFARNPNFTSSLIMGDLALSAKDD----FLFTPMLK--SITNPN----FYYI 252

Query: 303 GLRRITVG-GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
           GL  +++G G  +       ++D +GNGG IVD+GTT+T     L +P     +S +   
Sbjct: 253 GLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYT----HLPDPFYTAILSSLASV 308

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKT----GSFPELKLHFKGGAEVTLPVEN-YFAVVGE 416
             Y R+   E  TG   CF +P   T       P +  HF G  ++TLP ++ Y+AV   
Sbjct: 309 ILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAP 368

Query: 417 GSAVCLTVV----TDRE-----ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ++V +  +     D E     A+ GP  +LG+FQMQN  V YD+   R+GF+ + C
Sbjct: 369 KNSVVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDC 425


>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 441

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 140/426 (32%), Positives = 201/426 (47%), Gaps = 61/426 (14%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH--YQCKYCS-----SSKIPSFIPKLSS 140
           GY +SL+ GTPPQ+    LDTGS L W PC  +  YQC  C      S   P+F    S 
Sbjct: 24  GYLLSLNLGTPPQVFQVYLDTGSDLTWVPCGTNTSYQCLECGNEHSISKPTPAFSLSQSY 83

Query: 141 SSRLLGCQNPKCSWIH-----HESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEG 194
           SS    C +  C  +H     H++     C+  P+  S  CT++CP +   YG   L  G
Sbjct: 84  SSTRDLCGSRFCVDVHSSDNSHDACAAAGCS-IPVFMSGLCTRLCPPFAYTYGGRALVLG 142

Query: 195 IALSETLNLPNRI--------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK- 244
               +T+ L   I         P F  GC   S R+P GIAGFG+GK SLPSQL  LDK 
Sbjct: 143 SLARDTIALHGSIYGISVPIEFPGFCFGCVGSSIREPIGIAGFGKGKLSLPSQLGFLDKG 202

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
           FS+C L   F      +S ++    + S K   G  +TP + + +      +  +YY+GL
Sbjct: 203 FSHCFLGFWFARNPNITSPMVIGDLALSVKD--GFLFTPMLKSLT------YPNFYYIGL 254

Query: 305 RRITVGGQRVRVWHKYLT-LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
             +T+G          L+ +D +GNGG IVD+GTT+T ++    +P     +S +     
Sbjct: 255 EGVTIGDNAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLS----DPFYASVLSSLSSTVP 310

Query: 364 YTRALGAEALTGLRPCFDVPGEKT----GSFPELKLHFKGGAEVTLPVEN-YFAVVGEGS 418
           Y R+   E  TG   C  VP           P + +H  G   + LP E+ Y+AV    +
Sbjct: 311 YNRSYELEIRTGFDLCLKVPCMHAPCNDDELPPITVHLGGDVTLALPKESCYYAVTAPRN 370

Query: 419 AVCLTVV---------------TDRE----ASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
           +V +  +                D E    ++GGP+ +LG+FQMQN  V YDL + R+GF
Sbjct: 371 SVVIKCLLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEVVYDLESGRVGF 430

Query: 460 KQQLCK 465
           + + C 
Sbjct: 431 QPRDCA 436


>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
 gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
 gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
          Length = 432

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 144/420 (34%), Positives = 200/420 (47%), Gaps = 58/420 (13%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYC----SSSKIPSFIPKLSSS 141
           GY +SL+ G PPQ+    LDTGS L W PC   + YQC  C    S+SK         SS
Sbjct: 24  GYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSPSQSS 83

Query: 142 SRLLG-CQNPKCSWIH-----HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEG 194
           S +   C +  C  IH     H+      C   P   S  CT+ CP +   YG G L  G
Sbjct: 84  SNMKELCGSRFCVDIHSSDNSHDPCAAVGCA-IPSFMSDLCTRPCPPFSYTYGGGALVLG 142

Query: 195 IALSETLNLPNRI--------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK- 244
               + + L   I        +P F  GC   S R+P GIAGFG+G  SLPSQL  LDK 
Sbjct: 143 SLAKDIVTLHGSIFGIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILSLPSQLGFLDKG 202

Query: 245 FSYCLLSHKFD-DTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
           FS+C L  +F  +   TSSLI+ D   S  D       +TP +   S+   N    +YY+
Sbjct: 203 FSHCFLGFRFARNPNFTSSLIMGDLALSAKDD----FLFTPMLK--SITNPN----FYYI 252

Query: 303 GLRRITVG-GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
           GL  +++G G  +       ++D +GNGG IVD+GTT+T     L +P     +S +   
Sbjct: 253 GLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYT----HLPDPFYTAILSSLASV 308

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKT----GSFPELKLHFKGGAEVTLPVEN-YFAVVGE 416
             Y R+   E  TG   CF +P   T       P +  HF G  ++TLP ++ Y+AV   
Sbjct: 309 ILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAP 368

Query: 417 GSAVCLTVVTDRE------------ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ++V +  +  +             A+ GP  +LG+FQMQN  V YD+   R+GF+ + C
Sbjct: 369 KNSVVVKCLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDC 428


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 148/452 (32%), Positives = 213/452 (47%), Gaps = 59/452 (13%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQ---TKTTTTTTTTTTTNISSHSYGGY 89
           L+R H +PS  + Q     V  +L R +H  N +     ++  TT +  T IS  + G Y
Sbjct: 32  LTRIHADPSVTASQ----FVRDALRRDMHRHNARQLAASSSNGTTVSAPTQISPTA-GEY 86

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRLL 145
            ++L+ GTPP     I DTGS L+W       QC  CSS       P + P  S++  +L
Sbjct: 87  LMTLAIGTPPVSYQAIADTGSDLIW------TQCAPCSSQCFQQPTPLYNPSSSTTFAVL 140

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
            C     S +   +         P      CT +   Y + YGSG T     SET    +
Sbjct: 141 PCN----SSLSMCAAALAGTTPPP-----GCTCM---YNMTYGSGWTSVYQGSETFTFGS 188

Query: 206 RI------IPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
                   +P    GCS  S    +   +G+ G GRG  SL SQL + KFSYCL    + 
Sbjct: 189 STPANQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVPKFSYCL--TPYQ 246

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           DT  TS+L+L  G S S   T G++ TPFV +PS A     S YYY+ L  I++G   + 
Sbjct: 247 DTNSTSTLLL--GPSASLNDTGGVSSTPFVASPSDAP---MSTYYYLNLTGISLGTTALS 301

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +    L+L  DG GG I+DSGTT T +    ++ +    VS +          G  A TG
Sbjct: 302 IPTTALSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTD----GGSAATG 357

Query: 376 LRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           L  CF++P   +   + P + LHF  GA++ LP ++Y  +  + +  CL +    +  GG
Sbjct: 358 LDLCFELPSSTSAPPTMPSMTLHFD-GADMVLPADSYMML--DSNLWCLAM--QNQTDGG 412

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            S ILGN+Q QN ++ YD+  + L F    C 
Sbjct: 413 VS-ILGNYQQQNMHILYDVGQETLTFAPAKCS 443


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 134/389 (34%), Positives = 178/389 (45%), Gaps = 48/389 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + ++ GTP      I+DTGS LVW  C     C  C     P F P  SS+   + 
Sbjct: 98  GEFLMDVAIGTPALSYAAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVP 154

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-- 203
           C +  CS             D P +T  + ++    Y   YG +  T+G+  SET  L  
Sbjct: 155 CSSALCS-------------DLPTSTCTSASKC--GYTYTYGDASSTQGVLASETFTLGK 199

Query: 204 PNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
             + +P    GC   +      Q AG+ G GRG  SL SQL LDKFSYCL S   DD   
Sbjct: 200 EKKKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS--LDDGDG 257

Query: 260 TSSLILDNGSSHSDKKTTG--LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
            S L+L   ++   +      +  TP V NPS         +YYV L  +TVG  R+ + 
Sbjct: 258 KSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPS------FYYVSLTGLTVGSTRITLP 311

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                +  DG GG IVDSGT+ T++  + +  L   FV+QM          G+E   GL 
Sbjct: 312 ASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMA----LPTVDGSE--IGLD 365

Query: 378 PCFDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
            CF  P  G      P+L LHF GGA++ LP ENY  +     A+CLTV   R  S    
Sbjct: 366 LCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGLS---- 421

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GNFQ QN+   YD+    L F    C
Sbjct: 422 -IIGNFQQQNFQFVYDVAGDTLSFAPVQC 449


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 148/487 (30%), Positives = 213/487 (43%), Gaps = 74/487 (15%)

Query: 16  FTLLSIFPSSI------TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKN----- 64
           F++L I   +I       ++   L+R H +P   + +     V  +L R +H        
Sbjct: 4   FSVLLILACTILASDAAAAVRVGLTRIHADPEVTASE----FVRGALRRDMHRHARFARE 59

Query: 65  ---PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
              P +      T    T     + G Y ++LS GTPP     I DTGS L+W       
Sbjct: 60  QLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPLSYRAIADTGSDLIW------T 113

Query: 122 QCKYCSSSKIPS-----------FIPKLSSSSRLLGCQNP--KCSWIHHESIQCRDCNDE 168
           QC  C  +   +           + P  S++  +L C +P   C+ +   S         
Sbjct: 114 QCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPS--------- 164

Query: 169 PLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL------PNRIIPNFLVGCSVLSSRQ 222
                  C  +   Y   YG+G T G+   ET         P   +PN   GCS  SS  
Sbjct: 165 ---PPPGCACM---YNQTYGTGWTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSND 218

Query: 223 ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
               AG+ G GRG  SL SQL    FSYCL    F D   TS+L+L   ++ + K T  +
Sbjct: 219 WNGSAGLVGLGRGSMSLVSQLGAGAFSYCLT--PFQDANSTSTLLLGPSAAAALKGTGPV 276

Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
             TPFV  PS A     S YYY+ L  I+VG   + +     +L  DG GG I+DSGTT 
Sbjct: 277 RSTPFVAGPSKAP---MSTYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTI 333

Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE-KTGSFPELKLHFK 398
           T +    ++ +     S +V       A G +  TGL  CF +       + P + LHF+
Sbjct: 334 TTLVDSAYQQVRAAVRSLLVT--RLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFE 391

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
           GGA++ LPVENY  ++G G   CL +   R  + G   ++GN+Q QN +V YD+R + L 
Sbjct: 392 GGADMVLPVENYM-ILGSG-VWCLAM---RNQTVGAMSMVGNYQQQNIHVLYDVRKETLS 446

Query: 459 FKQQLCK 465
           F   +C 
Sbjct: 447 FAPAVCS 453


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 143/479 (29%), Positives = 218/479 (45%), Gaps = 66/479 (13%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQT-- 67
           L+ + F  + +   S   S+   L+R H++P   + +     V  +L R +H +  ++  
Sbjct: 11  LAVLVFLVVCATLASGAASVRVGLTRIHSDPDITAPE----FVRDALRRDMHRQQSRSLF 66

Query: 68  ----KTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
                 +  TT +  T     + G Y ++LS GTPP   P I DTGS L+W       QC
Sbjct: 67  GRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPPLSYPAIADTGSDLIW------TQC 120

Query: 124 KYCSSSK-----IPSFIPKLSSSSRLLGCQNP--KCSWIHHESIQCRDCNDEPLATSKNC 176
             CS  +      P + P  S++  +L C +    C+ +               A    C
Sbjct: 121 APCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAGK-----------APPPGC 169

Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLPNRI-----IPNFLVGCSVLSSRQ---PAGIAG 228
             +   Y   YG+G T G+  SET    +       +P    GCS  SS      AG+ G
Sbjct: 170 ACM---YNQTYGTGWTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNGSAGLVG 226

Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
            GRG  SL SQL   +FSYCL    F DT  TS+L+L   ++ +    TG+  TPFV +P
Sbjct: 227 LGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLLGPSAALNG---TGVRSTPFVASP 281

Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
           + A     S YYY+ L  I++G + + +     +L  DG GG I+DSGTT T +    ++
Sbjct: 282 AKAP---MSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTITSLVNAAYQ 338

Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLP 406
                 V   V++     A+     TGL  C+ +P   +   + P + LHF  GA++ LP
Sbjct: 339 Q-----VRAAVQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHFD-GADMVLP 392

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            ++Y  + G G   CL +   R  + G     GN+Q QN ++ YD+RN+ L F    C 
Sbjct: 393 ADSYM-ISGSG-VWCLAM---RNQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAPAKCS 446


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 130/418 (31%), Positives = 197/418 (47%), Gaps = 54/418 (12%)

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHL 112
           L RA+     + +  +  T +  +++ +  + G   + + L+ GTP +    I+DTGS L
Sbjct: 61  LQRAMKRGKLRLQRLSAKTASFESSVEAPVHAGNGEFLMKLAIGTPAETYSAIMDTGSDL 120

Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
           +W  C     CK C     P F PK SSS   L C +  C+ +             P+++
Sbjct: 121 IWTQCK---PCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAAL-------------PISS 164

Query: 173 SKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRIIPNFLVGCSVLSS----RQPAGIA 227
              C+  C  YL  YG    T+G+  +ET    +  +     GC   +      Q AG+ 
Sbjct: 165 ---CSDGC-EYLYSYGDYSSTQGVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLV 220

Query: 228 GFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
           G GRG  SL SQL   KFSYCL S   DD+   SSL++ + ++  +  TT     P + N
Sbjct: 221 GLGRGPLSLISQLGEPKFSYCLTS--MDDSKGISSLLVGSEATMKNAITT-----PLIQN 273

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
           PS         +YY+ L  I+VG   + +     ++  DG+GG I+DSGTT T++    F
Sbjct: 274 PSQPS------FYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAF 327

Query: 348 EPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK-TGSFPELKLHFKGGAEVTLP 406
             L  EF+SQ+  + + + +      TGL  CF +P +  T   P+L  HF+ GA++ LP
Sbjct: 328 AALKKEFISQLKLDVDESGS------TGLDLCFTLPPDASTVDVPQLVFHFE-GADLKLP 380

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ENY         +CLT+ +    S     I GNFQ QN  V +DL  + + F    C
Sbjct: 381 AENYIIADSGLGVICLTMGSSSGMS-----IFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|297740191|emb|CBI30373.3| unnamed protein product [Vitis vinifera]
          Length = 218

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 94/230 (40%), Positives = 135/230 (58%), Gaps = 17/230 (7%)

Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
           + + KF+YCL SH +DDT  +  LILD    + D KT GL+YTPF+ +P      A + Y
Sbjct: 1   MGVKKFAYCLNSHDYDDTRNSGKLILD----YRDGKTKGLSYTPFLKSPP-----ASAFY 51

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT-FMAPELFEPLADEFVSQM 358
           Y++G++ I +G + +R+  KYL    DG  G I+DSG     +M   +F+ + +E   QM
Sbjct: 52  YHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQM 111

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
            K   Y R+L AE  TGL PC++  G K+   P L   F+GGA + +P +NYF +  + S
Sbjct: 112 SK---YRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQES 168

Query: 419 AVCLTVVTDR----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             C  + T+     E +  PSIILGN Q  +YYVEYDL+N R GF++Q C
Sbjct: 169 LACFLMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 218


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 127/390 (32%), Positives = 186/390 (47%), Gaps = 52/390 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + L+ G+PP+    I+DTGS L+W  C     C+ C     P F PK SSS   + 
Sbjct: 364 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCK---PCQQCFDQSTPIFDPKQSSSFYKIS 420

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +  C  +             P +T   C+     YL  YG S  T+G+   ET    +
Sbjct: 421 CSSELCGAL-------------PTST---CSSDGCEYLYTYGDSSSTQGVLAFETFTFGD 464

Query: 206 RI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
                  IP    GC   ++     Q AG+ G GRG  SL SQL   KF+YCL +    D
Sbjct: 465 STEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAI---D 521

Query: 257 TTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
            ++ SSL+L + ++ + K +   +  TP + NPS         +YY+ L+ I+VGG ++ 
Sbjct: 522 DSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS------FYYLSLQGISVGGTQLS 575

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      L  DG+GG I+DSGTT T++    F  L +EF++QM         +      G
Sbjct: 576 IPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM------NLPVDDSGTGG 629

Query: 376 LRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
           L  CF++P G      P+L  HFK GA++ LP ENY     +   +CL + + R  S   
Sbjct: 630 LDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLAIGSSRGMS--- 685

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             I GN Q QN+ V +DL+ + L F    C
Sbjct: 686 --IFGNLQQQNFMVVHDLQEETLSFLPTQC 713


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 143/480 (29%), Positives = 217/480 (45%), Gaps = 60/480 (12%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKN----- 64
           L+ + F  + +   S   S+   L+R H++P   + Q     V  +L R +H +      
Sbjct: 27  LAVLVFLVVCATLASGAASVRVGLTRIHSDPDTTAPQ----FVRDALRRDMHRQRSRSFG 82

Query: 65  -------PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC 117
                   ++   T+TT +  T     + G Y ++L+ GTPP     + DTGS L+W  C
Sbjct: 83  RDRDRELAESDGRTSTTVSARTRKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQC 142

Query: 118 TNHYQC-KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
                C   C     P + P  S++  +L C         + S+          A    C
Sbjct: 143 A---PCGTQCFEQPAPLYNPASSTTFSVLPC---------NSSLSMCAGALAGAAPPPGC 190

Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLPNRI-----IPNFLVGCSVLSSRQ---PAGIAG 228
             +   Y   YG+G T G+  SET    +       +P    GCS  SS      AG+ G
Sbjct: 191 ACM---YYQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVG 247

Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
            GRG  SL SQL   +FSYCL    F DT  TS+L+L   ++ +    TG+  TPFV +P
Sbjct: 248 LGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLLGPSAALNG---TGVRSTPFVASP 302

Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
           +   R   S YYY+ L  I++G + + +     +L  DG GG I+DSGTT T +A   ++
Sbjct: 303 A---RAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQ 359

Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTL 405
            +     SQ+V         G+++ TGL  CF +P   +      P + LHF  GA++ L
Sbjct: 360 QVRAAVKSQLVTTLPTVD--GSDS-TGLDLCFALPAPTSAPPAVLPSMTLHFD-GADMVL 415

Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           P ++Y  + G G   CL +   R  + G     GN+Q QN ++ YD+R + L F    C 
Sbjct: 416 PADSYM-ISGSG-VWCLAM---RNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 470


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 127/390 (32%), Positives = 186/390 (47%), Gaps = 52/390 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + L+ G+PP+    I+DTGS L+W  C     C+ C     P F PK SSS   + 
Sbjct: 109 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCK---PCQQCFDQSTPIFDPKQSSSFYKIS 165

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +  C  +             P +T   C+     YL  YG S  T+G+   ET    +
Sbjct: 166 CSSELCGAL-------------PTST---CSSDGCEYLYTYGDSSSTQGVLAFETFTFGD 209

Query: 206 RI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
                  IP    GC   ++     Q AG+ G GRG  SL SQL   KF+YCL +    D
Sbjct: 210 STEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAI---D 266

Query: 257 TTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
            ++ SSL+L + ++ + K +   +  TP + NPS         +YY+ L+ I+VGG ++ 
Sbjct: 267 DSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS------FYYLSLQGISVGGTQLS 320

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      L  DG+GG I+DSGTT T++    F  L +EF++QM         +      G
Sbjct: 321 IPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM------NLPVDDSGTGG 374

Query: 376 LRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
           L  CF++P G      P+L  HFK GA++ LP ENY     +   +CL + + R  S   
Sbjct: 375 LDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLAIGSSRGMS--- 430

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             I GN Q QN+ V +DL+ + L F    C
Sbjct: 431 --IFGNLQQQNFMVVHDLQEETLSFLPTQC 458


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 133/387 (34%), Positives = 179/387 (46%), Gaps = 49/387 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + +S GTP      I+DTGS LVW  C     C  C +   P F P  SS+   L 
Sbjct: 100 GEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCK---PCVECFNQSTPVFDPSSSSTYAALP 156

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +  CS             D P   S  CT     Y   YG S  T+G+  +ET  L  
Sbjct: 157 CSSTLCS-------------DLP---SSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAK 200

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +P+   GC   +      Q AG+ G GRG  SL SQL L+KFSYCL S   DDT++ S
Sbjct: 201 TKLPDVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTS--LDDTSK-S 257

Query: 262 SLILDNGSS--HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            L+L + ++   S    + +  TP + NPS         +YYV L+ +TVG   + +   
Sbjct: 258 PLLLGSLATISESAAAASSVQTTPLIRNPSQPS------FYYVNLKGLTVGSTHITLPSS 311

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              +  DG GG IVDSGT+ T++  + +  L   F +QM        A G+    GL  C
Sbjct: 312 AFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQM----KLPAADGSG--IGLDTC 365

Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
           F+ P  G      P+L  H   GA++ LP ENY  +     A+CLTV+  R  S     I
Sbjct: 366 FEAPASGVDQVEVPKLVFHLD-GADLDLPAENYMVLDSGSGALCLTVMGSRGLS-----I 419

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +GNFQ QN    YD+    L F    C
Sbjct: 420 IGNFQQQNIQFVYDVGENTLSFAPVQC 446


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 142/449 (31%), Positives = 207/449 (46%), Gaps = 52/449 (11%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHS--YGGYS 90
           L+R H +PS  + Q     V  +L R +H  N +      ++  T +  + +S   G Y 
Sbjct: 36  LTRVHADPSVTASQ----FVRGALRRDMHRHNARKLALAASSGATVSAPTQNSPTAGEYL 91

Query: 91  ISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           ++L+ GTPP     I DTGS L+W    PCT+      C     P + P  S++  +L C
Sbjct: 92  MALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQ-----CFRQPTPLYNPSSSTTFAVLPC 146

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI 207
            +         S+          A    C   C +Y V YGSG T     SET    +  
Sbjct: 147 NS-------SLSVCAAALAGTGTAPPPGCA--C-TYNVTYGSGWTSVFQGSETFTFGSTP 196

Query: 208 -----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
                +P    GCS  SS       +G+ G GRG+ SL SQL + KFSYCL    + DT 
Sbjct: 197 AGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL--TPYQDTN 254

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            TS+L+L  G S S   T G++ TPFV +PS A  N F   YY+ L  I++G   + +  
Sbjct: 255 STSTLLL--GPSASLNGTAGVSSTPFVASPSTAPMNTF---YYLNLTGISLGTTALSIPP 309

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
               L+ DG GG I+DSGTT T +    ++ +    VS +             A TGL  
Sbjct: 310 DAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTT-----DGSAATGLDL 364

Query: 379 CFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
           CF +P   +   + P + LHF  GA++ LP ++Y  +  +    CL +   +  + G   
Sbjct: 365 CFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYM-MSDDSGLWCLAM---QNQTDGEVN 419

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ILGN+Q QN ++ YD+  + L F    C 
Sbjct: 420 ILGNYQQQNMHILYDIGQETLSFAPAKCS 448


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 142/449 (31%), Positives = 207/449 (46%), Gaps = 52/449 (11%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQ--TKTTTTTTTTTTTNISSHSYGGYS 90
           L+R H +PS  + Q     V  +L R +H  N +      ++  T +     S + G Y 
Sbjct: 38  LTRVHADPSVTASQ----FVRGALRRDMHRHNARKLALAASSGATVSAPTQDSPTAGEYL 93

Query: 91  ISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           ++L+ GTPP     I DTGS L+W    PCT+      C     P + P  S++  +L C
Sbjct: 94  MALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQ-----CFRQPTPLYNPSSSTTFAVLPC 148

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP--- 204
            +         S+          A    C   C +Y V YGSG T     SET       
Sbjct: 149 NS-------SLSVCAAALAGTGTAPPPGCA--C-TYNVTYGSGWTSVFQGSETFTFGSTP 198

Query: 205 --NRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
             +  +P    GCS  SS       +G+ G GRG+ SL SQL + KFSYCL    + DT 
Sbjct: 199 AGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL--TPYQDTN 256

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            TS+L+L  G S S   T G++ TPFV +PS A  N F   YY+ L  I++G   + +  
Sbjct: 257 STSTLLL--GPSASLNGTAGVSSTPFVASPSTAPMNTF---YYLNLTGISLGTTALSIPP 311

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +L+ DG GG I+DSGTT T +    ++ +    VS +             A TGL  
Sbjct: 312 DAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTT-----DGSADTGLDL 366

Query: 379 CFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
           CF +P   +   + P + LHF  GA++ LP ++Y  +  +    CL +   +  + G   
Sbjct: 367 CFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYM-MSDDSGLWCLAM---QNQTDGEVN 421

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ILGN+Q QN ++ YD+  + L F    C 
Sbjct: 422 ILGNYQQQNMHILYDIGQETLSFAPAKCS 450


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 150/471 (31%), Positives = 210/471 (44%), Gaps = 70/471 (14%)

Query: 21  IFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSS-----SLTRALH-IKNPQTKT----- 69
           I P+S TS   S  + H  P+ + ++ +   V S      L R  H IK  +++      
Sbjct: 23  IAPTSSTSRKTSFKQQHPCPTTNGFRVMLRHVDSGKNLTKLERVQHGIKRGKSRLQKLNA 82

Query: 70  ---TTTTTTTTTTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
                ++T  +   + +  + G   Y I L+ GTPP   P +LDTGS L+W  C     C
Sbjct: 83  MVLAASSTPDSEDQLEAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCK---PC 139

Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
             C     P F PK SSS   + C +  CS +                 S  C+  C  Y
Sbjct: 140 TRCYKQPTPIFDPKKSSSFSKVSCGSSLCSAL----------------PSSTCSDGC-EY 182

Query: 184 LVLYGS-GLTEGIALSETLNL---PNRI-IPNFLVGCSVLSS----RQPAGIAGFGRGKT 234
           +  YG   +T+G+  +ET       N++ + N   GC   +      Q +G+ G GRG  
Sbjct: 183 VYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPL 242

Query: 235 SLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
           SL SQL   +FSYCL      D T+ S L+L  GS    K    +  TP + NP      
Sbjct: 243 SLVSQLKEQRFSYCLTPI---DDTKESVLLL--GSLGKVKDAKEVVTTPLLKNPLQPS-- 295

Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
               +YY+ L  I+VG  R+ +      +  DGNGG I+DSGTT T++  + +E L  EF
Sbjct: 296 ----FYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEF 351

Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
           +SQ         AL   + TGL  CF +P G      P+L  HFKGG ++ LP ENY   
Sbjct: 352 ISQT------KLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGG-DLELPAENYMIG 404

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                  CL +     AS G S I GN Q QN  V +DL  + + F    C
Sbjct: 405 DSNLGVACLAM----GASSGMS-IFGNVQQQNILVNHDLEKETISFVPTSC 450


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 130/389 (33%), Positives = 176/389 (45%), Gaps = 41/389 (10%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + LS GTP      I+DTGS LVW  C     C  C +   P F P  SS+   L 
Sbjct: 114 GEFLMDLSVGTPALPYAAIVDTGSDLVWTQCK---PCVECFNQTTPVFDPAASSTYAALP 170

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +  C+ +   +           ++S +       Y   YG +  T+G+  +ET  L  
Sbjct: 171 CSSALCADLPTSTCA--------SSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLAR 222

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
           + +P    GC   +      Q AG+ G GRG  SL SQL +D+FSYCL S   DD    S
Sbjct: 223 QKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTS--LDDAAGRS 280

Query: 262 SLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            L+L + +  S    T     TP V NPS         +YYV L  +TVG  R+ +    
Sbjct: 281 PLLLGSAAGISASAATAPAQTTPLVKNPSQPS------FYYVSLTGLTVGSTRLALPSSA 334

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
             +  DG GG IVDSGT+ T++    +  L   FV+ M    +      +E   GL  CF
Sbjct: 335 FAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHM----SLPTVDASE--IGLDLCF 388

Query: 381 DVPGEKTG-----SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
             P            P+L LHF GGA++ LP ENY  +     A+CLTV+  R  S    
Sbjct: 389 QGPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLS---- 444

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GNFQ QN+   YD+    L F    C
Sbjct: 445 -IIGNFQQQNFQFVYDVAGDTLSFAPAEC 472


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 128/395 (32%), Positives = 186/395 (47%), Gaps = 55/395 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y  ++S GTP ++   I DTGS L+W  C     C+ C + K P F P+ SSS   + 
Sbjct: 38  GDYVTTISLGTPAKVFSVIADTGSDLIWIQCK---PCQACFNQKDPIFDPEGSSSYTTMS 94

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN 205
           C +  C  +                  K+C+  C  Y   YG G  T G   SET+ L +
Sbjct: 95  CGDTLCDSLPR----------------KSCSPDC-DYSYGYGDGSGTRGTLSSETVTLTS 137

Query: 206 R-----IIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
                    N   GC  L   S    +G+ G GRG  S  SQL      KFSYCL+  + 
Sbjct: 138 TQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWR- 196

Query: 255 DDTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
           D  ++TS +   D  SSHS  K     +TP ++NP      A   +YYV L+ I++ G+ 
Sbjct: 197 DAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNP------AMESFYYVKLKDISIAGRA 250

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           +R+      +  DG+GG I DSGTT T +    ++ +     S++    ++ +  G+ A 
Sbjct: 251 LRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKI----SFPKIDGSSA- 305

Query: 374 TGLRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDRE 429
            GL  C+DV G K       P +  HF+ GA+  LPVENYF    + G+ VCL +V+   
Sbjct: 306 -GLDLCYDVSGSKASYKMKIPAMVFHFE-GADYQLPVENYFIAANDAGTIVCLAMVSSNM 363

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             G    I GN   QN+ V YD+ + ++G+    C
Sbjct: 364 DIG----IYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 131/379 (34%), Positives = 182/379 (48%), Gaps = 48/379 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + ++L+ GTPP+    I+DTGS L+W  C     C  C     P F PK SSS   L 
Sbjct: 98  GEFLMNLAIGTPPETYSAIMDTGSDLIWTQCK---PCTQCFDQPSPIFDPKKSSSFSKLS 154

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C  +   S                C+  C  YL  YG    T+G   +ET     
Sbjct: 155 CSSQLCKALPQSS----------------CSDSC-EYLYTYGDYSSTQGTMATETFTFGK 197

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             IPN   GC   +      Q +G+ G GRG  SL SQL   KFSYCL S    D T+TS
Sbjct: 198 VSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYCLTSI---DDTKTS 254

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L++ + +S  +  +  +  TP + NP          +YY+ L  I+VGG R+ +     
Sbjct: 255 TLLMGSLAS-VNGTSAAIRTTPLIQNPLQPS------FYYLSLEGISVGGTRLPIKESTF 307

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            L  DG GG I+DSGTT T++    F+ +  EF SQM    + + A      TGL  C++
Sbjct: 308 QLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGA------TGLELCYN 361

Query: 382 VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           +P + +    P+L LHF  GA++ LP ENY         +CL +     +SGG S I GN
Sbjct: 362 LPSDTSELEVPKLVLHFT-GADLELPGENYMIADSSMGVICLAM----GSSGGMS-IFGN 415

Query: 441 FQMQNYYVEYDLRNQRLGF 459
            Q QN +V +DL  + L F
Sbjct: 416 VQQQNMFVSHDLEKETLSF 434


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 140/458 (30%), Positives = 201/458 (43%), Gaps = 50/458 (10%)

Query: 22  FPSSITSLTFSLSRFHTNPSQDSYQNLNSL--VSSSLTRALHIKNPQTKTTTTTTTTT-- 77
            P ++    F LS  H     DS +NL  +  +   + R  H  N           +   
Sbjct: 36  LPKNLPRSGFRLSLRHV----DSGKNLTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPD 91

Query: 78  -TTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS 133
            T NI + ++GG   + + LS G P      I+DTGS L+W  C     C  C     P 
Sbjct: 92  DTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCK---PCTECFDQPTPI 148

Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLT 192
           F P+ SSS   +GC +  C+ +        +CN++  A           YL  YG    T
Sbjct: 149 FDPEKSSSYSKVGCSSGLCNALPRS-----NCNEDKDACE---------YLYTYGDYSST 194

Query: 193 EGIALSETLNLPNR-IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSY 247
            G+  +ET    +   I     GC V +      Q +G+ G GRG  SL SQL   KFSY
Sbjct: 195 RGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSY 254

Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           CL S   +D+  +SSL + + +S    KT            S+        +YY+ L+ I
Sbjct: 255 CLTS--IEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGI 312

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           TVG +R+ V      L  DG GG I+DSGTT T++    F+ L +EF S+M      +  
Sbjct: 313 TVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM------SLP 366

Query: 368 LGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
           +     TGL  CF +P   K  + P++  HFK GA++ LP ENY         +CL + +
Sbjct: 367 VDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAMGS 425

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               S     I GN Q QN+ V +DL  + + F    C
Sbjct: 426 SNGMS-----IFGNVQQQNFNVLHDLEKETVSFVPTEC 458


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 141/458 (30%), Positives = 202/458 (44%), Gaps = 50/458 (10%)

Query: 22  FPSSITSLTFSLSRFHTNPSQDSYQNLNSL--VSSSLTRALHIKNPQTKTTTTTTTTT-- 77
            P ++    F LS  H     DS +NL  +  +   + R  H  N           +   
Sbjct: 37  LPKNLPRSGFRLSLRHV----DSGKNLTKIQKIQRGINRGFHRLNRLGAVAVLAVASNPD 92

Query: 78  -TTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS 133
            T NI + ++GG   + + LS G P      I+DTGS L+W  C     C  C     P 
Sbjct: 93  DTNNIKAPTHGGSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCK---PCTECFDQPTPI 149

Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLT 192
           F P+ SSS   +GC +  C+ +        +CN++      +C      YL  YG    T
Sbjct: 150 FDPEKSSSYSKVGCSSGLCNALPRS-----NCNED----KDSC-----EYLYTYGDYSST 195

Query: 193 EGIALSETLNLPNR-IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSY 247
            G+  +ET    +   I     GC V +      Q +G+ G GRG  SL SQL   KFSY
Sbjct: 196 RGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSY 255

Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           CL S   +D+  +SSL + + +S    KT            S+        +YY+ L+ I
Sbjct: 256 CLTS--IEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGI 313

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           TVG +R+ V      L  DG GG I+DSGTT T++    F+ L +EF S+M      +  
Sbjct: 314 TVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM------SLP 367

Query: 368 LGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
           +     TGL  CF +P   K  + P+L  HFK GA++ LP ENY         +CL + +
Sbjct: 368 VDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFK-GADLELPGENYMVADSSTGVLCLAMGS 426

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               S     I GN Q QN+ V +DL  + + F    C
Sbjct: 427 SNGMS-----IFGNVQQQNFNVLHDLEKETVTFVPTEC 459


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 130/392 (33%), Positives = 182/392 (46%), Gaps = 48/392 (12%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y + +  GTP +    ILDTGS L+W  C     C  C     P F P  SS+ R 
Sbjct: 88  SDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCA---PCLLCVDQPTPYFDPANSSTYRS 144

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-- 201
           LGC  P C+ +++           PL   K C      Y   YG S  T G+  +ET   
Sbjct: 145 LGCSAPACNALYY-----------PLCYQKTCV-----YQYFYGDSASTAGVLANETFTF 188

Query: 202 --NLPNRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
             N     +P    GC  L++   A   G+ GFGRG  SL SQL   +FSYCL S  F  
Sbjct: 189 GTNDTRVTLPRISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTS--FLS 246

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
             R S L     ++ +    + +  TPF+ NP      A    Y++ +  I+VGG R+ +
Sbjct: 247 PVR-SRLYFGAYATLNSTNASTVQSTPFIINP------ALPTMYFLNMTGISVGGNRLPI 299

Query: 317 WHKYLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
               L + D DG GGTI+DSGTT T++A   +  + + FV  +    +    L     + 
Sbjct: 300 DPAVLAINDTDGTGGTIIDSGTTITYLAEPAYYAVREAFVLYL---NSTLPLLDVTETSV 356

Query: 376 LRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           L  CF    P  ++ + P+L LHF  GA+  LP++NY  V      +CL + T  + S  
Sbjct: 357 LDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNYMLVDPSTGGLCLAMATSSDGS-- 413

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
              I+G++Q QN+ V YDL N  L F    C 
Sbjct: 414 ---IIGSYQHQNFNVLYDLENSLLSFVPAPCN 442


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 130/395 (32%), Positives = 187/395 (47%), Gaps = 55/395 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y  ++S GTP ++   I DTGS L+W  C     C+ C + K P F P+ SSS   + 
Sbjct: 38  GDYVTTISLGTPAKVFSVIADTGSDLIWIQCK---PCQACFNQKDPIFDPEGSSSYTTMS 94

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN 205
           C +  C     +S+  + C       S NC      Y   YG G  T G   SET+ L +
Sbjct: 95  CGDTLC-----DSLPRKSC-------SPNC-----DYSYGYGDGSGTRGTLSSETVTLTS 137

Query: 206 R-----IIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
                    N   GC  L   S    +G+ G GRG  S  SQL      KFSYCL+  + 
Sbjct: 138 TQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWR- 196

Query: 255 DDTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
           D  ++TS +   D  SSHS  K     +TP ++NP      A   +YYV L+ I++ G+ 
Sbjct: 197 DAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNP------AMESFYYVKLKDISIAGRA 250

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           +R+      +  DG+GG I DSGTT T +    ++ +     S++    ++    G+ A 
Sbjct: 251 LRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKV----SFPEIDGSSA- 305

Query: 374 TGLRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDRE 429
            GL  C+DV G K       P +  HF+ GA+  LPVENYF    + G+ VCL +V+   
Sbjct: 306 -GLDLCYDVSGSKASYKKKIPAMVFHFE-GADHQLPVENYFIAANDAGTIVCLAMVSSNM 363

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             G    I GN   QN+ V YD+ + ++G+    C
Sbjct: 364 DIG----IYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 131/386 (33%), Positives = 190/386 (49%), Gaps = 53/386 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y I +SFG+PPQ    I+DTGS L+W  C     C+ C+++    F P  SS+   + 
Sbjct: 78  GEYLIDISFGSPPQKASVIVDTGSDLIWTQC---LPCETCNAAASVIFDPVKSSTYDTVS 134

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS-ETLNLPN 205
           C +  CS +  +S                CT  C  Y  +YG G +   ALS ET+ +  
Sbjct: 135 CASNFCSSLPFQS----------------CTTSC-KYDYMYGDGSSTSGALSTETVTVGT 177

Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTR 259
             IPN   GC   ++ S    AGI G G+G  SL SQ   +   KFSYCL+      +T+
Sbjct: 178 GTIPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLG---STK 234

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           TS +++ + ++       G+ YT  + N       A   +YY  L  I+V G+ V     
Sbjct: 235 TSPMLIGDSAAAG-----GVAYTALLTN------TANPTFYYADLTGISVSGKAVTYPVG 283

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
             ++D  G GG I+DSGTT T++    F  L    V+ +     +  A G  +L GL  C
Sbjct: 284 TFSIDASGQGGFILDSGTTLTYLETGAFNAL----VAALKAEVPFPEADG--SLYGLDYC 337

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           F   G    ++P +  HFK GA+  LP EN F  +  G ++CL +     AS G S I+G
Sbjct: 338 FSTAGVANPTYPTMTFHFK-GADYELPPENVFVALDTGGSICLAMA----ASTGFS-IMG 391

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
           N Q QN+ + +DL NQR+GFK+  C+
Sbjct: 392 NIQQQNHLIVHDLVNQRVGFKEANCE 417


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 132/388 (34%), Positives = 178/388 (45%), Gaps = 53/388 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + L+ GTPP   P +LDTGS L+W  C     C  C     P F PK SSS   + 
Sbjct: 106 GEYLMELAIGTPPVSYPAVLDTGSDLIWTQCK---PCTQCYKQPTPIFDPKKSSSFSKVS 162

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-- 203
           C +  CS +                 S  C+  C  Y+  YG   +T+G+  +ET     
Sbjct: 163 CGSSLCSAV----------------PSSTCSDGC-EYVYSYGDYSMTQGVLATETFTFGK 205

Query: 204 -PNRI-IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
             N++ + N   GC   +      Q +G+ G GRG  SL SQL   +FSYCL      D 
Sbjct: 206 SKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEPRFSYCLTPM---DD 262

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           T+ S L+L  GS    K    +  TP + NP          +YY+ L  I+VG  R+ + 
Sbjct: 263 TKESILLL--GSLGKVKDAKEVVTTPLLKNPLQPS------FYYLSLEGISVGDTRLSIE 314

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                +  DGNGG I+DSGTT T++  + FE L  EF+SQ     + T +      TGL 
Sbjct: 315 KSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSS------TGLD 368

Query: 378 PCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
            CF +P G      P++  HFKGG ++ LP ENY          CL +     AS G S 
Sbjct: 369 LCFSLPSGSTQVEIPKIVFHFKGG-DLELPAENYMIGDSNLGVACLAM----GASSGMS- 422

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I GN Q QN  V +DL  + + F    C
Sbjct: 423 IFGNVQQQNILVNHDLEKETISFVPTSC 450


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 130/394 (32%), Positives = 188/394 (47%), Gaps = 46/394 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+    ILDTGS L W  C     C  C     P + PK SSS   + 
Sbjct: 190 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCV---PCIACFEQSGPYYDPKESSSFENIT 246

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
           C +P+C  +          + +P    K+  Q CP Y   YG  S  T   AL   T+NL
Sbjct: 247 CHDPRCKLV---------SSPDPPKPCKDENQTCP-YFYWYGDSSNTTGDFALETFTVNL 296

Query: 204 --PN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
             PN     + + N + GC   +       AG+ G GRG  S  SQL       FSYCL+
Sbjct: 297 TTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSIYGHSFSYCLV 356

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                DT+ +S LI   G          L +T FV      E N+   +YYVG++ I V 
Sbjct: 357 DRN-SDTSVSSKLIF--GEDKELLSHPNLNFTSFVG----GEENSVDTFYYVGIKSIMVD 409

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           G+ +++  +   L ++G GGTI+DSGTT T+ A   +E + + F   M K + Y      
Sbjct: 410 GEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAF---MKKIKGYEL---V 463

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
           E    L+PC++V G +    P+  + F  GA    PVENYF  + E   VCL ++   ++
Sbjct: 464 EGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQI-EPDLVCLAILGTPKS 522

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +     I+GN+Q QN+++ YD++  RLG+    C
Sbjct: 523 ALS---IIGNYQQQNFHILYDMKKSRLGYAPMKC 553


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 126/390 (32%), Positives = 182/390 (46%), Gaps = 49/390 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +   LDTGS L+W  C     C  C    +P F    SS++ LL C+
Sbjct: 35  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCK---PCVSCFDQPLPYFDTSRSSTNALLPCE 91

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN-LPNRI 207
           + +C      ++ C   N           Q C  Y     + +T G+  ++    +    
Sbjct: 92  STQCKLDPTVTV-CVKLNQT--------VQTCAYYTSYGDNSVTIGLLAADKFTFVAGTS 142

Query: 208 IPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT-- 260
           +P    GC      V +S +  GIAGFGRG  SLPSQL +  FS+C        TT T  
Sbjct: 143 LPGVTFGCGLNNTGVFNSNE-TGIAGFGRGPLSLPSQLKVGNFSHCF-------TTITGA 194

Query: 261 --SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
             S+++LD  +         +  TP +     A+  A    YY+ L+ ITVG  R+ V  
Sbjct: 195 IPSTVLLDLPADLFSNGQGAVQTTPLIQ---YAKNEANPTLYYLSLKGITVGSTRLPVPE 251

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
               L  +G GGTI+DSGT+ T + P++++ + DEF +Q+         +     TG   
Sbjct: 252 SAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI------KLPVVPGNATGHYT 304

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGGPS 435
           CF  P +     P+L LHF+ GA + LP ENY   V +    S +CL +    E +    
Sbjct: 305 CFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGDETT---- 359

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            I+GNFQ QN +V YDL+N  L F    C 
Sbjct: 360 -IIGNFQQQNMHVLYDLQNNMLSFVAAQCD 388


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 142/479 (29%), Positives = 214/479 (44%), Gaps = 61/479 (12%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQT-- 67
           L+ + F  + +   S   S+   L+R H++P   + Q     V  +L R +H +  ++  
Sbjct: 27  LAVLVFLVVCATLASGAASVRVGLTRIHSDPDTTAPQ----FVRDALRRDMHRQRSRSFG 82

Query: 68  --------KTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
                   ++   TT +  T     + G Y ++L+ GTPP     + DTGS L+W  C  
Sbjct: 83  RDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQCA- 141

Query: 120 HYQC-KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ 178
              C   C     P + P  S++  +L C         + S+          A    C  
Sbjct: 142 --PCGTQCFEQPAPLYNPASSTTFSVLPC---------NSSLSMCAGALAGAAPPPGCAC 190

Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRI-----IPNFLVGCSVLSSRQ---PAGIAGFG 230
           +   Y   YG+G T G+  SET    +       +P    GCS  SS      AG+ G G
Sbjct: 191 M---YNQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVGLG 247

Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           RG  SL SQL   +FSYCL    F DT  TS+L+L   ++ +    TG+  TPFV +P+ 
Sbjct: 248 RGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLLGPSAALNG---TGVRSTPFVASPA- 301

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
             R   S YYY+ L  I++G + + +     +L  DG GG I+DSGTT T +A   ++  
Sbjct: 302 --RAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQQ- 358

Query: 351 ADEFVSQMVKNRNYT-RALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLP 406
               V   VK+   T   +     TGL  CF +P   +      P + LHF  GA++ LP
Sbjct: 359 ----VRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFD-GADMVLP 413

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            ++Y  + G G   CL +   R  + G     GN+Q QN ++ YD+R + L F    C 
Sbjct: 414 ADSYM-ISGSG-VWCLAM---RNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 467


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 132/386 (34%), Positives = 186/386 (48%), Gaps = 52/386 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + L+ GTPP+    ILDTGS L+W  C     C  C     P F PK SSS   L 
Sbjct: 95  GEFLMKLAIGTPPETYSAILDTGSDLIWTQCK---PCTQCFHQSTPIFDPKKSSSFSKLS 151

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C     E++    CN+        C      YL  YG    T+GI  SETL    
Sbjct: 152 CSSQLC-----EALPQSSCNN-------GC-----EYLYSYGDYSSTQGILASETLTFGK 194

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +PN   GC   +      Q AG+ G GRG  SL SQL   KFSYCL +    D T+TS
Sbjct: 195 ASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCLTTV---DDTKTS 251

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L++ + +S  +  ++ +  TP +++P      A   +YY+ L  I+VG  R+ +     
Sbjct: 252 TLLMGSLAS-VNASSSAIKTTPLIHSP------AHPSFYYLSLEGISVGDTRLPIKKSTF 304

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
           +L  DG+GG I+DSGTT T++    F  +A EF +++         + +   TGL  CF 
Sbjct: 305 SLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI------NLPVDSSGSTGLDVCFT 358

Query: 382 VP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS--AVCLTVVTDREASGGPSIIL 438
           +P G      P+L  HF  GA++ LP ENY  ++G+ S    CL + +    S     I 
Sbjct: 359 LPSGSTNIEVPKLVFHFD-GADLELPAENY--MIGDSSMGVACLAMGSSSGMS-----IF 410

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q QN  V +DL  + L F    C
Sbjct: 411 GNVQQQNMLVLHDLEKETLSFLPTQC 436


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 146/493 (29%), Positives = 224/493 (45%), Gaps = 69/493 (13%)

Query: 6   SALCLSFIFFFTLLSIFPSSITSLTFSLSR--------FHTNPSQD--SYQNLNSLVSSS 55
           S+ C S +  +++L +FP  +  LTFSL+          H +  +    ++ L  +V+ S
Sbjct: 4   SSACNSTMKGWSVLQLFPC-VLLLTFSLAESAALRADLTHVDSGRGFTKHELLRRMVARS 62

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP-PQIIPFILDTGSHLVW 114
             R   +++  +   T  T       S      Y I L  GTP PQ +   LDTGS LVW
Sbjct: 63  KARLASLRS--SACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVW 120

Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
             C     C  C    +P F   +S +   + C +P C   H   +    C     A  +
Sbjct: 121 TQCA----CTVCFDQPVPVFRASVSHTFSRVPCSDPLCG--HAVYLPLSGCA----ARDR 170

Query: 175 NCTQICPSYLVLYG---SGLTEGIALSETLNL--PNRI-----IPNFLVGCSVLS----S 220
           +C          YG     +T G    +T     P+R      +PN   GC +++    +
Sbjct: 171 SC-------FYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFT 223

Query: 221 RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-L 279
              +GIAGFG G  SLPSQL + +FSYC  +    + +R S +IL     + +   TG +
Sbjct: 224 PNQSGIAGFGTGPLSLPSQLKVRRFSYCFTAM---EESRVSPVILGGEPENIEAHATGPI 280

Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
             TPF   P+ A   +   +Y++ LR +TVG  R+        L  DG+GGT +DSGT  
Sbjct: 281 QSTPFAPGPAGAPVGS-QPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAI 339

Query: 340 TFMAPELFEPLADEFVSQ--MVKNRNYTRALGAEALTGLRPCFDVPGEKTG-SFPELKLH 396
           TF    +F  L + FV+Q  +   + YT     + L     CF VP +K   + P+L LH
Sbjct: 340 TFFPQAVFRSLREAFVAQVPLPVAKGYTD---PDNLL----CFSVPAKKKAPAVPKLILH 392

Query: 397 FKGGAEVTLPVENYFAV-----VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
            + GA+  LP ENY         G G  +C+ +++   ++G    I+GNFQ QN ++ YD
Sbjct: 393 LE-GADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNG---TIIGNFQQQNMHIVYD 448

Query: 452 LRNQRLGFKQQLC 464
           L + ++ F    C
Sbjct: 449 LESNKMVFAPARC 461


>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
          Length = 378

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 123/356 (34%), Positives = 168/356 (47%), Gaps = 41/356 (11%)

Query: 142 SRLLGCQNPKCSWIHHES-----IQCRDCNDEPLAT-SKNCTQICPSYLVLYGSG----- 190
           SR + C +P CS  H  +          C  E + T S   +  CP     YG G     
Sbjct: 21  SRRIPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAH 80

Query: 191 LTEG-IALSETLNLPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KF 245
           L  G +AL         + + NF   C+  +  +P G+AGFGRG  SLP QL+     +F
Sbjct: 81  LRRGRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGRF 140

Query: 246 SYCLLSHKF--DDTTRTSSLILDNG--SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
           SYCL+SH F  D   R S LIL      + +  +T G  YTP ++NP          +Y 
Sbjct: 141 SYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPK------HPYFYS 194

Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
           V L  ++VG  R++   +   +DR GNGG +VDSGTTFT +  E++  +A+ F   M   
Sbjct: 195 VALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAA 254

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---------A 412
                   AE  TGL PC+       G  P L LHF+G A V LP  NYF         A
Sbjct: 255 GFARAER-AEEQTGLTPCYRYAASDRG-VPPLALHFRGNATVALPRRNYFMGFKSEDAGA 312

Query: 413 VVGEGSAVCLTVVTDREASG----GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              +    CL ++   +ASG    GP+  LGNFQ Q + V YD+   R+GF ++ C
Sbjct: 313 GTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 368


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 131/393 (33%), Positives = 180/393 (45%), Gaps = 51/393 (12%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
           +  GGY++++S GTP    P + DTGS L+W  C     C  C     P F P  SS+  
Sbjct: 81  NGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCA---PCTKCFQQPAPPFQPASSSTFS 137

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
            L C +  C ++ +     R CN      +  C      Y   YGSG T G   +ETL +
Sbjct: 138 KLPCTSSFCQFLPNS---IRTCN------ATGCV-----YNYKYGSGYTAGYLATETLKV 183

Query: 204 PNRIIPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
            +   P+   GCS  +      +GIAG GRG  SL  QL + +FSYCL S         S
Sbjct: 184 GDASFPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLRSGS---AAGAS 240

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
            ++  + ++ +D     +  TPFVNNP+V        YYYV L  ITVG   + V     
Sbjct: 241 PILFGSLANLTDGN---VQSTPFVNNPAVHPS-----YYYVNLTGITVGETDLPVTTSTF 292

Query: 322 TLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRP 378
              ++G  GGTIVDSGTT T++A + +E +   F+SQ   V   N TR        GL  
Sbjct: 293 GFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTR--------GLDL 344

Query: 379 CFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASG 432
           CF   G   G + P L L F GGAE  +P   YFA V     G  +  CL ++  +    
Sbjct: 345 CFKSTGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMMLPAKGDQ- 401

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            P  ++GN    + ++ YDL      F    C 
Sbjct: 402 -PMSVIGNVMQMDMHLLYDLDGGIFSFSPADCA 433


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 139/488 (28%), Positives = 221/488 (45%), Gaps = 65/488 (13%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRAL 60
           MAS +  L L      TLL     S++ + F L   H + +  SY  L  LV+ ++ R+ 
Sbjct: 1   MASPVLVLAL---VAATLLPASHCSVSGVGFQLKLRHVD-AHGSYTKLE-LVTRAIRRSR 55

Query: 61  HIKNPQTKTTTTTTT--------TTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHL 112
                         T        T    + + S G Y + L+ GTPP     ++DTGS L
Sbjct: 56  ARVAALQAVAAAAATVAPVVDPITAARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDL 115

Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
           +W  C     C  C+    P F P  S++ RL+ C++P C+ + + +   R         
Sbjct: 116 IWTQCA---PCVLCADQPTPYFRPARSATYRLVPCRSPLCAALPYPACFQR--------- 163

Query: 173 SKNCTQICPSYLVLYGS-GLTEGIALSETL-----NLPNRIIPNFLVGCSVLSSRQPA-- 224
                 +C  Y   YG    T G+  SET      N    ++ +   GC  ++S Q A  
Sbjct: 164 -----SVC-VYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINSGQLANS 217

Query: 225 -GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD--NGSSHSDKKTTGLTY 281
            G+ G GRG  SL SQL   +FSYCL S    + +R +  +    NG++ S   +  +  
Sbjct: 218 SGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSP-VQS 276

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
           TP V N       A    Y++ L+ I++G +R+ +      ++ DG GG  +DSGT+ T+
Sbjct: 277 TPLVVN------AALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTW 330

Query: 342 MAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRPCFDVPGEKTGS--FPELKLHF 397
           +  + ++ +  E VS +  +   N T         GL  CF  P   + +   P+++LHF
Sbjct: 331 LQQDAYDAVRHELVSVLRPLPPTNDTE-------IGLETCFPWPPPPSVAVTVPDMELHF 383

Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
            GGA +T+P ENY  + G    +CL ++   +A+     I+GN+Q QN ++ YD+ N  L
Sbjct: 384 DGGANMTVPPENYMLIDGATGFLCLAMIRSGDAT-----IIGNYQQQNMHILYDIANSLL 438

Query: 458 GFKQQLCK 465
            F    C 
Sbjct: 439 SFVPAPCN 446


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 139/488 (28%), Positives = 221/488 (45%), Gaps = 65/488 (13%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRAL 60
           MAS +  L L      TLL     S++ + F L   H + +  SY  L  LV+ ++ R+ 
Sbjct: 1   MASPVLVLAL---VAATLLPASHCSVSGVGFQLKLRHVD-AHGSYTKLE-LVTRAIRRSR 55

Query: 61  HIKNPQTKTTTTTTT--------TTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHL 112
                         T        T    + + S G Y + L+ GTPP     ++DTGS L
Sbjct: 56  ARVAALQAVAAAAATVAPVVDPITAARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDL 115

Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
           +W  C     C  C+    P F P  S++ RL+ C++P C+ + + +   R         
Sbjct: 116 IWTQCA---PCVLCADQPTPYFRPARSATYRLVPCRSPLCAALPYPACFQR--------- 163

Query: 173 SKNCTQICPSYLVLYGS-GLTEGIALSETL-----NLPNRIIPNFLVGCSVLSSRQPA-- 224
                 +C  Y   YG    T G+  SET      N    ++ +   GC  ++S Q A  
Sbjct: 164 -----SVC-VYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINSGQLANS 217

Query: 225 -GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD--NGSSHSDKKTTGLTY 281
            G+ G GRG  SL SQL   +FSYCL S    + +R +  +    NG++ S   +  +  
Sbjct: 218 SGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSP-VQS 276

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
           TP V N       A    Y++ L+ I++G +R+ +      ++ DG GG  +DSGT+ T+
Sbjct: 277 TPLVVN------AALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTW 330

Query: 342 MAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRPCFDVPGEKTGSF--PELKLHF 397
           +  + ++ +  E VS +  +   N T         GL  CF  P   + +   P+++LHF
Sbjct: 331 LQQDAYDAVRRELVSVLRPLPPTNDTE-------IGLETCFPWPPPPSVAVTVPDMELHF 383

Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
            GGA +T+P ENY  + G    +CL ++   +A+     I+GN+Q QN ++ YD+ N  L
Sbjct: 384 DGGANMTVPPENYMLIDGATGFLCLAMIRSGDAT-----IIGNYQQQNMHILYDIANSLL 438

Query: 458 GFKQQLCK 465
            F    C 
Sbjct: 439 SFVPAPCN 446


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 124/384 (32%), Positives = 179/384 (46%), Gaps = 51/384 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + ++L+ GTP +    I+DTGS L+W  C     CK C     P F P+ SSS   L 
Sbjct: 95  GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK---PCKVCFDQPTPIFDPEKSSSFSKLP 151

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C  +             P+++   C+  C  Y   YG    T+G+  +ET    +
Sbjct: 152 CSSDLCVAL-------------PISS---CSDGC-EYRYSYGDHSSTQGVLATETFTFGD 194

Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +     GC       +  Q AG+ G GRG  SL SQL + KFSYCL S   DD+   S
Sbjct: 195 ASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS--IDDSKGIS 252

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L++      S+        TP + NPS   R +F   YY+ L  I+VG   + +     
Sbjct: 253 TLLV-----GSEATVKSAIPTPLIQNPS---RPSF---YYLSLEGISVGDTLLPIEKSTF 301

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
           ++  DG+GG I+DSGTT T++    F  L  EF+SQM  +      + A   T L  CF 
Sbjct: 302 SIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLD------VDASGSTELELCFT 355

Query: 382 VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           +P + +    P+L  HF+ G ++ LP ENY         +CLT+ +    S     I GN
Sbjct: 356 LPPDGSPVEVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSSGMS-----IFGN 409

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
           FQ QN  V +DL  + + F    C
Sbjct: 410 FQQQNIVVLHDLEKETISFAPAQC 433


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 136/459 (29%), Positives = 207/459 (45%), Gaps = 79/459 (17%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY-GGYSI 91
           L+R H +PS  + Q     V ++L R +H  N +    +++  T +  +S  +  G + +
Sbjct: 32  LTRVHADPSVTASQ----FVRAALHRDMHRHNARKLAASSSDGTVSAPVSPTTVPGEFLM 87

Query: 92  SLSFGTPPQIIPF--ILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSRLLGCQ 148
           +L+ GTPP  +PF  I DTGS L+W  C     C + C     P + P  S++   L C 
Sbjct: 88  TLAIGTPP--LPFLAIADTGSDLIWTQCA---PCSRQCFQQPTPLYNPSSSTTFSALPCN 142

Query: 149 N------PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
           +      P C+ +++                           + YGSG T     +ET  
Sbjct: 143 SSLGLCAPACACMYN---------------------------MTYGSGWTYVFQGTETFT 175

Query: 203 LPNRI------IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
             +        +P    GCS  SS       +G+ G GRG  SL SQL   KFSYCL   
Sbjct: 176 FGSSTPADQVRVPGIAFGCSNASSGFNASSASGLVGLGRGSLSLVSQLGAPKFSYCL--T 233

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
            + DT  TS+L+L  G S S   T  ++ TPFV +PS       S+YYY+ L  I++G  
Sbjct: 234 PYQDTNSTSTLLL--GPSASLNDTGVVSSTPFVASPS-------SIYYYLNLTGISLGTT 284

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + +     +L  DG GG I+DSGTT T +    ++ +    +S +             A
Sbjct: 285 ALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAAVLSLVTLPTT-----DGSA 339

Query: 373 LTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV----CLTVVT 426
            TGL  CF++P   +   S P + LHF  GA++ LP +NY   + +  +     CL +  
Sbjct: 340 ATGLDLCFELPSSTSAPPSMPSMTLHFD-GADMVLPADNYMMSLSDPDSDSSLWCLAMQN 398

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             +  G    ILGN+Q QN ++ YD+  + L F    C 
Sbjct: 399 QTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKCS 437


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 124/384 (32%), Positives = 179/384 (46%), Gaps = 51/384 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + ++L+ GTP +    I+DTGS L+W  C     CK C     P F P+ SSS   L 
Sbjct: 95  GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK---PCKVCFDQPTPIFDPEKSSSFSKLP 151

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C  +             P+++   C+  C  Y   YG    T+G+  +ET    +
Sbjct: 152 CSSDLCVAL-------------PISS---CSDGC-EYRYSYGDHSSTQGVLATETFTFGD 194

Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +     GC       +  Q AG+ G GRG  SL SQL + KFSYCL S   DD+   S
Sbjct: 195 ASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS--IDDSKGIS 252

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L++      S+        TP + NPS   R +F   YY+ L  I+VG   + +     
Sbjct: 253 TLLV-----GSEATVKSAIPTPLIQNPS---RPSF---YYLSLEGISVGDTLLPIEKSTF 301

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
           ++  DG+GG I+DSGTT T++    F  L  EF+SQM  + +      A   T L  CF 
Sbjct: 302 SIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVD------ASGSTELELCFT 355

Query: 382 VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           +P + +    P+L  HF+ G ++ LP ENY         +CLT+ +    S     I GN
Sbjct: 356 LPPDGSPVDVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSSGMS-----IFGN 409

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
           FQ QN  V +DL  + + F    C
Sbjct: 410 FQQQNIVVLHDLEKETISFAPAQC 433


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 124/392 (31%), Positives = 175/392 (44%), Gaps = 46/392 (11%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y +S+  GTPP+    ILDTGS L+W  C     C  C     P F P  S S   
Sbjct: 85  SEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCA---PCMLCVDQPTPFFDPAQSPSYAK 141

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-- 201
           L C +P C+ +++           PL     C      Y   YG S  T G+  +ET   
Sbjct: 142 LPCNSPMCNALYY-----------PLCYRNVCV-----YQYFYGDSANTAGVLSNETFTF 185

Query: 202 --NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
             N     +P    GC  L++      +G+ GFGRG  SL SQL   +FSYCL S     
Sbjct: 186 GTNDTRVTVPRIAFGCGNLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPV 245

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            +R         +S S      +  TPF+ NP +         YY+ +  I+VGG+ + +
Sbjct: 246 PSRLYFGAYATLNSTSASTGEPVQSTPFIVNPGLPTM------YYLNMTGISVGGELLPI 299

Query: 317 WHKYLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
                 + D DG GG I+DSG+T T++A   ++ +   F  Q+        +L       
Sbjct: 300 DPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADV---- 355

Query: 376 LRPCF--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           L  CF    P  K  + PEL  HF+ GA + LP+ENY  + G+   +CL +    + S  
Sbjct: 356 LDTCFVWPPPPRKIVTMPELAFHFE-GANMELPLENYMLIDGDTGNLCLAIAASDDGS-- 412

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
              I+G+FQ QN++V YD  N  L F    C 
Sbjct: 413 ---IIGSFQHQNFHVLYDNENSLLSFTPATCN 441


>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 132/408 (32%), Positives = 191/408 (46%), Gaps = 65/408 (15%)

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
           LDTGS LVWFPC   + C  C S  +P   P   SSS      +       H S+   D 
Sbjct: 98  LDTGSDLVWFPC-RPFTCILCESKPLPPSPPPTLSSSATTVSCSSPSCSAAHSSLPSSD- 155

Query: 166 NDEPLATSKNC-------------TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
               L    NC             +  CP +   YG G       S++L+LP+  + NF 
Sbjct: 156 ----LCAISNCPLDYIETGDCNTSSYPCPPFYYAYGDGSLVAKLFSDSLSLPSVSVANFT 211

Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKFDD--TTRTSSLI 264
            GC+  +  +P G+AGFGRG+ SLP+QL++      + FSYCL+SH FD     R S LI
Sbjct: 212 FGCAHTTLAEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLI 271

Query: 265 LDNGSSHSDKKTTG----------------LTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           L       +K+                     +T  + NP          +Y V L+ I+
Sbjct: 272 LGRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPK------HPYFYSVSLQGIS 325

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           +G + +        +D++G GG +VDSGTTFT +  + +  + +EF S++   R + RA 
Sbjct: 326 IGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRV--GRVHERAD 383

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYFAVVGEG--------SA 419
             E  +G+ PC+ +   +T   P L LHF G G+ VTLP  NYF    +G          
Sbjct: 384 RVEPSSGMSPCYYL--NQTVKVPALVLHFAGNGSTVTLPRRNYFYEFMDGGDGKEEKRKV 441

Query: 420 VCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            CL ++    + E  GG   ILGN+Q Q + V YDL N+R+GF ++ C
Sbjct: 442 GCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKC 489


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 132/399 (33%), Positives = 187/399 (46%), Gaps = 50/399 (12%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLS 139
           S + G Y ++L+ GTPP     I DTGS L+W    PCT+      C     P + P  S
Sbjct: 26  SPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQ-----CFRQPTPLYNPSSS 80

Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
           ++  +L C +         S+          A    C   C +Y V YGSG T     SE
Sbjct: 81  TTFAVLPCNS-------SLSVCAAALAGTGTAPPPGCA--C-TYNVTYGSGWTSVFQGSE 130

Query: 200 TLNLPNRI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
           T    +       +P    GCS  SS       +G+ G GRG+ SL SQL + KFSYCL 
Sbjct: 131 TFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL- 189

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
              + DT  TS+L+L  G S S   T G++ TPFV +PS A  N F   YY+ L  I++G
Sbjct: 190 -TPYQDTNSTSTLLL--GPSASLNGTAGVSSTPFVASPSTAPMNTF---YYLNLTGISLG 243

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
              + +     +L+ DG GG I+DSGTT T +    ++ +    VS +            
Sbjct: 244 TTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTT-----DG 298

Query: 371 EALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--VT 426
            A TGL  CF +P   +   + P + LHF  GA++ LP ++Y  +  +    CL +   T
Sbjct: 299 SADTGLDLCFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYM-MSDDSGLWCLAMQNQT 356

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           D E +     ILGN+Q QN ++ YD+  + L F    C 
Sbjct: 357 DGEVN-----ILGNYQQQNMHILYDIGQETLSFAPAKCS 390


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 129/395 (32%), Positives = 184/395 (46%), Gaps = 47/395 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+    ILDTGS L W  C     C  C     P + PK SSS + + 
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCV---PCYACFEQNGPYYDPKDSSSFKNIT 249

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
           C +P+C  +          + +P    K  TQ CP Y   YG  S  T   AL   T+NL
Sbjct: 250 CHDPRCQLV---------SSPDPPQPCKGETQSCP-YFYWYGDSSNTTGDFALETFTVNL 299

Query: 204 PN-------RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
                    +I+ N + GC   +       AG+ G GRG  S  +QL       FSYCL+
Sbjct: 300 TTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLV 359

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                +++ +S LI   G          L +T FV      + N    +YYV ++ I VG
Sbjct: 360 DRN-SNSSVSSKLIF--GEDKELLSHPNLNFTSFVG----GKENPVDTFYYVLIKSIMVG 412

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           G+ +++  +   L   G GGTI+DSGTT T+ A   +E + + F   M K + +      
Sbjct: 413 GEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAF---MRKIKGFPL---V 466

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
           E    L+PC++V G +    PE  + F  GA    PVENYF  +     VCL ++ T R 
Sbjct: 467 ETFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRS 526

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           A      I+GN+Q QN+++ YDL+  RLG+    C
Sbjct: 527 ALS----IIGNYQQQNFHILYDLKKSRLGYAPMKC 557


>gi|297740193|emb|CBI30375.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 119/205 (58%), Gaps = 24/205 (11%)

Query: 34  SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
           S F +  S +    L  L S+SL+RA H+K+       TT+     ++  HSYGG++I L
Sbjct: 66  STFTSKLSTEPRVFLQHLASASLSRAHHLKH------GTTSPLVKASLFPHSYGGHTIPL 119

Query: 94  SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS---KIPSFIPKLSSSSRLLGCQNP 150
           SFGTPPQ + F++DTGSH+VW PCT HY C  CS S   K+P F PKLSSS ++L C+NP
Sbjct: 120 SFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPKLSSSYKILECRNP 179

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
           KC      S+ C  CN      SKNC+  CP Y + YG+G   G  L E LN P + I  
Sbjct: 180 KC------SLGCPRCN----GNSKNCSHACPQYSLQYGTGSASGFFLLENLNFPGKTIHK 229

Query: 211 FLVGCSVLSSRQP-----AGIAGFG 230
           FLVGC+  ++ +P     AG   FG
Sbjct: 230 FLVGCTTSAAHEPTSDALAGFVDFG 254


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 124/385 (32%), Positives = 178/385 (46%), Gaps = 47/385 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +   LDTGS LVW  C     C  C +  +P +    SS+  L  C 
Sbjct: 91  YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ---PCAVCFNQSLPYYDASRSSTFALPSCD 147

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN-LPNR 206
           + +C              D  +    N T    +Y   YG    T G    ET++ +   
Sbjct: 148 STQCKL------------DPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGA 195

Query: 207 IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
            +P  + GC +    +      GIAGFGRG  SLPSQL +  FS+C  +       + S+
Sbjct: 196 SVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTA---VSGRKPST 252

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           ++ D  +         +  TP + NP      A   +YY+ L+ ITVG  R+ V      
Sbjct: 253 VLFDLPADLYKNGRGTVQTTPLIKNP------AHPTFYYLSLKGITVGSTRLPVPESAFA 306

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
           L ++G GGTI+DSGT FT + P ++  + DEF + +         +     TG   CF  
Sbjct: 307 L-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV------KLPVVPSNETGPLLCFSA 359

Query: 383 PG-EKTGSFPELKLHFKGGAEVTLPVENYF--AVVGEGSAVCLTVVTDREASGGPSIILG 439
           P   K    P+L LHF+ GA + LP ENY   A  G   ++CL ++       G   I+G
Sbjct: 360 PPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICLAIIE------GEMTIIG 412

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           NFQ QN +V YDL+N +L F +  C
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKC 437


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 126/388 (32%), Positives = 178/388 (45%), Gaps = 34/388 (8%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y++++S GTPP   P I+DTGS+L+W  C    +C +   +  P   P  SS+   L 
Sbjct: 89  GAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRC-FPRPTPAPVLQPARSSTFSRLP 147

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C    C ++   S + R C         N T  C +Y   YGSG T G   +ETL + + 
Sbjct: 148 CNGSFCQYLPTSS-RPRTC---------NATAAC-AYNYTYGSGYTAGYLATETLTVGDG 196

Query: 207 IIPNFLVGCSVLSS-RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLIL 265
             P    GCS  +     +GI G GRG  SL SQL + +FSYCL S   D     S ++ 
Sbjct: 197 TFPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGG--ASPILF 254

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
            + +  +++    +  TP + NP +      S +YYV L  I V    + V        +
Sbjct: 255 GSLAKLTERSV--VQSTPLLKNPYLQR----STHYYVNLTGIAVDSTELPVTGSTFGFTQ 308

Query: 326 DG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP- 383
            G  GGTIVDSGTT T++A + +  +   F SQM      T A GA     L  C+    
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAP--YDLDLCYKPSA 366

Query: 384 --GEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPSI 436
             G K    P L L F GGA+  +PV+NYFA V     G  +  CL V+   +    P  
Sbjct: 367 GGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PIS 424

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN    + ++ YD+      F    C
Sbjct: 425 IIGNLMQMDMHLLYDIDGGMFSFAPADC 452


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 126/385 (32%), Positives = 180/385 (46%), Gaps = 47/385 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +   LDTGS LVW  C     C  C +  +P +    SS+  L  C 
Sbjct: 35  YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ---PCAVCFNQSLPYYDASRSSTFALPSC- 90

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN-LPNR 206
                    +S QC+   D  +    N T    +Y   YG    T G    ET++ +   
Sbjct: 91  ---------DSTQCK--LDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGA 139

Query: 207 IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
            +P  + GC +    +      GIAGFGRG  SLPSQL +  FS+C  +       + S+
Sbjct: 140 SVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTA---VSGRKPST 196

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           ++ D  +         +  TP + NP      A   +YY+ L+ ITVG  R+ V      
Sbjct: 197 VLFDLPADLYKNGRGTVQTTPLIKNP------AHPTFYYLSLKGITVGSTRLPVPESAFA 250

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
           L ++G GGTI+DSGT FT + P ++  + DEF + +         +     TG   CF  
Sbjct: 251 L-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV------KLPVVPSNETGPLLCFSA 303

Query: 383 PG-EKTGSFPELKLHFKGGAEVTLPVENYF--AVVGEGSAVCLTVVTDREASGGPSIILG 439
           P   K    P+L LHF+ GA + LP ENY   A  G   ++CL ++       G   I+G
Sbjct: 304 PPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICLAIIE------GEMTIIG 356

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           NFQ QN +V YDL+N +L F +  C
Sbjct: 357 NFQQQNMHVLYDLKNSKLSFVRAKC 381


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 130/388 (33%), Positives = 178/388 (45%), Gaps = 34/388 (8%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y++++S GTPP   P I+DTGS+L+W  C    +C +   +  P   P  SS+   L 
Sbjct: 89  GAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRC-FPRPTPAPVLQPARSSTFSRLP 147

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C    C ++   S + R C         N T  C +Y   YGSG T G   +ETL + + 
Sbjct: 148 CNGSFCQYLPTSS-RPRTC---------NATAAC-AYNYTYGSGYTAGYLATETLTVGDG 196

Query: 207 IIPNFLVGCSVLSS-RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLIL 265
             P    GCS  +     +GI G GRG  SL SQL + +FSYCL S   D     +S IL
Sbjct: 197 TFPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADG---GASPIL 253

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
             GS     + + +  TP + NP +      S +YYV L  I V    + V        +
Sbjct: 254 -FGSLAKLTEGSVVQSTPLLKNPYLQR----STHYYVNLTGIAVDSTELPVTGSTFGFTQ 308

Query: 326 DG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP- 383
            G  GGTIVDSGTT T++A + +  +   F SQM      T A GA     L  C+    
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAP--YDLDLCYKPSA 366

Query: 384 --GEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPSI 436
             G K    P L L F GGA+  +PV+NYFA V     G  +  CL V+   +    P  
Sbjct: 367 GGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PIS 424

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN    + ++ YD+      F    C
Sbjct: 425 IIGNLMQMDMHLLYDIDGGMFSFAPADC 452


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 123/381 (32%), Positives = 173/381 (45%), Gaps = 38/381 (9%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           + LS G P      I+DTGS L+W  C     C  C     P F P+ SSS   +GC + 
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQCK---PCTECFDQPTPIFDPEKSSSYSKVGCSSG 57

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR-II 208
            C+ +        +CN++  A           YL  YG    T G+  +ET    +   I
Sbjct: 58  LCNALPR-----SNCNEDKDACE---------YLYTYGDYSSTRGLLATETFTFEDENSI 103

Query: 209 PNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
                GC V +      Q +G+ G GRG  SL SQL   KFSYCL S   +D+  +SSL 
Sbjct: 104 SGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTS--IEDSEASSSLF 161

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           + + +S    KT            S+        +YY+ L+ ITVG +R+ V      L 
Sbjct: 162 IGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELA 221

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP- 383
            DG GG I+DSGTT T++    F+ L +EF S+M      +  +     TGL  CF +P 
Sbjct: 222 EDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM------SLPVDDSGSTGLDLCFKLPD 275

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
             K  + P++  HFK GA++ LP ENY         +CL + +    S     I GN Q 
Sbjct: 276 AAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAMGSSNGMS-----IFGNVQQ 329

Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
           QN+ V +DL  + + F    C
Sbjct: 330 QNFNVLHDLEKETVSFVPTEC 350


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 123/385 (31%), Positives = 178/385 (46%), Gaps = 47/385 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +   LDTGS LVW  C     C  C +  +P +    SS+  L  C 
Sbjct: 91  YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQ---PCAVCFNQSLPYYDASRSSTFALPSCD 147

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN-LPNR 206
           + +C              D  +    N T    ++   YG    T G    ET++ +   
Sbjct: 148 STQCKL------------DPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVAGA 195

Query: 207 IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
            +P  + GC +    +      GIAGFGRG  SLPSQL +  FS+C  +       + S+
Sbjct: 196 SVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTA---VSGRKPST 252

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           ++ D  +         +  TP + NP      A   +YY+ L+ ITVG  R+ V      
Sbjct: 253 VLFDLPADLYKNGRGTVQTTPLIKNP------AHPTFYYLSLKGITVGSTRLPVPESAFA 306

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
           L ++G GGTI+DSGT FT + P ++  + DEF + +         +     TG   CF  
Sbjct: 307 L-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV------KLPVVPSNETGPLLCFSA 359

Query: 383 PG-EKTGSFPELKLHFKGGAEVTLPVENYF--AVVGEGSAVCLTVVTDREASGGPSIILG 439
           P   K    P+L LHF+ GA + LP ENY   A  G   ++CL ++       G   I+G
Sbjct: 360 PPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICLAIIE------GEMTIIG 412

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           NFQ QN +V YDL+N +L F +  C
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKC 437


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 127/390 (32%), Positives = 178/390 (45%), Gaps = 47/390 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +  ILDTGS LVW  C     C  C S  +    P  SS+  +L C 
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVWTQCR---PCPVCFSRALGPLDPSNSSTFDVLPCS 471

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP--- 204
           +P C     +++    C            Q C  Y+  Y  G +T G   +ET       
Sbjct: 472 SPVC-----DNLTWSSCGKHNWGN-----QTC-VYVYAYADGSITTGHLDAETFTFAAAD 520

Query: 205 ---NRIIPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
                 +P+   GC + +    +    GIAGFGRG  SLPSQL +D FS+C  +      
Sbjct: 521 GTGQATVPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFTAIT---G 577

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           +  SS++L   ++        +  TP V N S          YY+ L+ ITVG  R+ + 
Sbjct: 578 SEPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLR------AYYLSLKGITVGSTRLPIP 631

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                L +DG GGTI+DSGT  T +  + ++ + D F +Q+   R       + +L+ L 
Sbjct: 632 ESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQV---RLPVDNATSSSLSRLC 688

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY---FAVVGEGSAVCLTVVTDREASGGP 434
             F VP       P+L LHF+ GA + LP ENY   F   G GS  CL +      +G  
Sbjct: 689 FSFSVPRRAKPDVPKLVLHFE-GATLDLPRENYMFEFEDAG-GSVTCLAI-----NAGDD 741

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             I+GN+Q QN +V YDL    L F    C
Sbjct: 742 LTIIGNYQQQNLHVLYDLVRNMLSFVPAQC 771


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 121/392 (30%), Positives = 176/392 (44%), Gaps = 49/392 (12%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           S  YG + + +  GTPPQ    I+DTGS L W        C+ C     P F P  SS+ 
Sbjct: 19  SAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWI---QSEPCRACFEQADPIFDPSKSSTY 75

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C +  C+    + +  + C     + + NC      Y   YG G +T G    ET+
Sbjct: 76  NKIACSSSACA----DLLGTQTC-----SAAANCI-----YAYGYGDGSVTRGYFSKETI 121

Query: 202 NLPNRIIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHK 253
              +        G SV            GI G G+G  S+PSQL     +KFSYCL+   
Sbjct: 122 TATDTAGEEVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDW- 180

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
               + TS++   + +  S +    + YTP V N           YYY+ ++ I+VGG  
Sbjct: 181 LSAGSETSTMYFGDAAVPSGE----VQYTPIVPNAD------HPTYYYIAVQGISVGGSL 230

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + +      +D  G+GGTI+DSGTT T++  E+F  L   + SQ V+    T A      
Sbjct: 231 LDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQ-VRYPTTTSA------ 283

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           TGL  CF+  G  +  FP + +H   G  + LP  N F  + E + +CL   +  +    
Sbjct: 284 TGLDLCFNTRGTGSPVFPAMTIHLD-GVHLELPTANTFISL-ETNIICLAFASALDF--- 338

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           P  I GN Q QN+ + YDL N R+GF    C 
Sbjct: 339 PIAIFGNIQQQNFDIVYDLDNMRIGFAPADCA 370


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 125/391 (31%), Positives = 176/391 (45%), Gaps = 50/391 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +   LDTGS L+W  C     C  C    +P F P  SS+  L  C 
Sbjct: 35  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 91

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
           +  C     + +    C       ++ C      Y   YG   +T G    +        
Sbjct: 92  STLC-----QGLPVASCGSPKFWPNQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG 141

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT- 260
             +P    GC + ++        GIAGFGRG  SLPSQL +  FS+C        TT T 
Sbjct: 142 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCF-------TTITG 194

Query: 261 ---SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
              S+++LD  +         +  TP +     A+  A    YY+ L+ ITVG  R+ V 
Sbjct: 195 AIPSTVLLDLPADLFSNGQGAVQTTPLIQ---YAKNEANPTLYYLSLKGITVGSTRLPVP 251

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                L  +G GGTI+DSGT+ T + P++++ + DEF +Q+        A      TG  
Sbjct: 252 ESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNA------TGHY 304

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGGP 434
            CF  P +     P+L LHF+ GA + LP ENY   V +    S +CL +    E +   
Sbjct: 305 TCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGDETT--- 360

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             I+GNFQ QN +V YDL+N  L F    C 
Sbjct: 361 --IIGNFQQQNMHVLYDLQNNMLSFVAAQCD 389


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 133/397 (33%), Positives = 186/397 (46%), Gaps = 51/397 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+    ILDTGS L W  C     C  C     P + PK SSS + +G
Sbjct: 190 GEYFMDVFIGTPPRHFSLILDTGSDLNWIQCV---PCYDCFVQNGPYYDPKESSSFKNIG 246

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
           C +P+C  +          + +P    K   Q CP Y   YG  S  T   AL   T+NL
Sbjct: 247 CHDPRCHLV---------SSPDPPQPCKAENQTCP-YFYWYGDSSNTTGDFALETFTVNL 296

Query: 204 PN-------RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
            +       + + N + GC   +       AG+ G GRG  S  SQL       FSYCL+
Sbjct: 297 TSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 356

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV--AERNAFSVYYYVGLRRIT 308
                DT  +S LI        DK    L   P VN  S+   + N    +YYV ++ I 
Sbjct: 357 DRN-SDTNVSSKLIFG-----EDKD---LLNHPEVNFTSLVAGKENPVDTFYYVQIKSIM 407

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VGG+ +++  +   L  +G GGTIVDSGTT ++ A   +E + D FV + VK     +  
Sbjct: 408 VGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKK-VKGYPVIKDF 466

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TD 427
                  L PC++V G +    PE ++ F+ GA    PVENYF  +     VCL ++ T 
Sbjct: 467 PI-----LDPCYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTP 521

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           R A      I+GN+Q QN+++ YD +  RLG+    C
Sbjct: 522 RSALS----IIGNYQQQNFHILYDTKKSRLGYAPMKC 554


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 144/451 (31%), Positives = 204/451 (45%), Gaps = 51/451 (11%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALH--------IKNPQTKTTTTTTTTTTTNISSH 84
           L+R H+ P   + Q     V  +L R +H        + +  + ++   T +  T     
Sbjct: 32  LTRIHSEPGVTASQ----FVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLP 87

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSR 143
           + G Y ++L+ GTPPQ  P I DTGS LVW  C     C + C     P + P  S + R
Sbjct: 88  NGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCA---PCGERCFKQPSPLYNPSSSPTFR 144

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
           +L C     S ++  + + R     P      C   C  Y   YG+G T G+  SET   
Sbjct: 145 VLPCS----SALNLCAAEARLAGATP---PPGCA--C-RYNQTYGTGWTSGLQGSETFTF 194

Query: 204 PNRI-----IPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
            +       +P    GCS  SS    G A   G GRG  SL SQL    FSYCL    F 
Sbjct: 195 GSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL--TPFQ 252

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           DT   S+L+L   ++ +    TG+  TPFV +PS   +   S YYY+ L  I+VG   + 
Sbjct: 253 DTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPS---KPPMSTYYYLNLTGISVGAAALP 309

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      L  DG GG I+DSGTT T +    +     + V   V++            TG
Sbjct: 310 IPPGAFALRADGTGGLIIDSGTTITSLVDAAY-----KRVRAAVRSLVKLPVTDGSNATG 364

Query: 376 LRPCFDVPGEKT--GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           L  CF +P       + P + LHF GGA++ LPVENY  +  +G   CL +   R  + G
Sbjct: 365 LDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMIL--DGGMWCLAM---RSQTDG 419

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               LGN+Q QN ++ YD++ + L F    C
Sbjct: 420 ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 144/451 (31%), Positives = 204/451 (45%), Gaps = 51/451 (11%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALH--------IKNPQTKTTTTTTTTTTTNISSH 84
           L+R H+ P   + Q     V  +L R +H        + +  + ++   T +  T     
Sbjct: 32  LTRIHSEPGVTASQ----FVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLP 87

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSR 143
           + G Y ++L+ GTPPQ  P I DTGS LVW  C     C + C     P + P  S + R
Sbjct: 88  NGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCA---PCGERCFKQPSPLYNPSSSPTFR 144

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
           +L C     S ++  + + R     P      C   C  Y   YG+G T G+  SET   
Sbjct: 145 VLPCS----SALNLCAAEARLAGATP---PPGCA--C-RYNQTYGTGWTSGLQGSETFTF 194

Query: 204 PNRI-----IPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
            +       +P    GCS  SS    G A   G GRG  SL SQL    FSYCL    F 
Sbjct: 195 GSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL--TPFQ 252

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           DT   S+L+L   ++ +    TG+  TPFV +PS   +   S YYY+ L  I+VG   + 
Sbjct: 253 DTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPS---KPPMSTYYYLNLTGISVGPAALP 309

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      L  DG GG I+DSGTT T +    +     + V   V++            TG
Sbjct: 310 IPPGAFALRADGTGGLIIDSGTTITSLVDAAY-----KRVRAAVRSLVKLPVTDGSNATG 364

Query: 376 LRPCFDVPGEKT--GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           L  CF +P       + P + LHF GGA++ LPVENY  +  +G   CL +   R  + G
Sbjct: 365 LDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMIL--DGGMWCLAM---RSQTDG 419

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               LGN+Q QN ++ YD++ + L F    C
Sbjct: 420 ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 132/388 (34%), Positives = 178/388 (45%), Gaps = 56/388 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + L+ GTPP+    I+DTGS L+W  C     C  C     P F PK SSS   L 
Sbjct: 95  GEFLMKLAIGTPPETYSAIMDTGSDLIWTQCK---PCTQCFDQPTPIFDPKKSSSFSKLS 151

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C  +             P +T   C+  C  YL  YG    T+G+  SETL    
Sbjct: 152 CSSKLCEAL-------------PQST---CSDGC-EYLYGYGDYSSTQGMLASETLTFGK 194

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +P    GC   +      Q +G+ G GRG  SL SQL   KFSYCL S    D T+ S
Sbjct: 195 VSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTSV---DDTKAS 251

Query: 262 SLILDNGSSHSDKKT-TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           +L++  GS  S K + + +  TP + N      +A   +YY+ L  I+VG   + +    
Sbjct: 252 TLLM--GSLASVKASDSEIKTTPLIQN------SAQPSFYYLSLEGISVGDTSLPIKKST 303

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLR 377
            +L  DG+GG I+DSGTT T++    F+ +A EF SQ+   V N            TGL 
Sbjct: 304 FSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGS---------TGLE 354

Query: 378 PCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
            CF +P   T    P+L  HF  GA++ LP ENY          CL + +    S     
Sbjct: 355 VCFTLPSGSTDIEVPKLVFHFD-GADLELPAENYMIADASMGVACLAMGSSSGMS----- 408

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I GN Q QN  V +DL  + L F    C
Sbjct: 409 IFGNIQQQNMLVLHDLEKETLSFLPTQC 436


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  154 bits (390), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 144/451 (31%), Positives = 204/451 (45%), Gaps = 51/451 (11%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALH--------IKNPQTKTTTTTTTTTTTNISSH 84
           L+R H+ P   + Q     V  +L R +H        + +  + ++   T +  T     
Sbjct: 37  LTRIHSEPGVTASQ----FVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLP 92

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSR 143
           + G Y ++L+ GTPPQ  P I DTGS LVW  C     C + C     P + P  S + R
Sbjct: 93  NGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCA---PCGERCFKQPSPLYNPSSSPTFR 149

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
           +L C     S ++  + + R     P      C   C  Y   YG+G T G+  SET   
Sbjct: 150 VLPCS----SALNLCAAEARLAGATP---PPGCA--C-RYNQTYGTGWTSGLQGSETFTF 199

Query: 204 PNRI-----IPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
            +       +P    GCS  SS    G A   G GRG  SL SQL    FSYCL    F 
Sbjct: 200 GSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL--TPFQ 257

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           DT   S+L+L   ++ +    TG+  TPFV +PS   +   S YYY+ L  I+VG   + 
Sbjct: 258 DTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPS---KPPMSTYYYLNLTGISVGPAALP 314

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      L  DG GG I+DSGTT T +    +     + V   V++            TG
Sbjct: 315 IPPGAFALRADGTGGLIIDSGTTITSLVDAAY-----KRVRAAVRSLVKLPVTDGSNATG 369

Query: 376 LRPCFDVPGEKT--GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           L  CF +P       + P + LHF GGA++ LPVENY  +  +G   CL +   R  + G
Sbjct: 370 LDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMIL--DGGMWCLAM---RSQTDG 424

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               LGN+Q QN ++ YD++ + L F    C
Sbjct: 425 ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 455


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 123/388 (31%), Positives = 176/388 (45%), Gaps = 50/388 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +   LDTGS L+W  C     C  C    +P F P  SS+  L  C 
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 138

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
           +  C     + +    C       ++ C      Y   YG   +T G    +        
Sbjct: 139 STLC-----QGLPVASCGSPKFWPNQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG 188

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +P    GC + ++        GIAGFGRG  SLPSQL +  FS+C  +    +  + S
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPS 245

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +++LD  +         +  TP + NP      A   +YY+ L+ ITVG  R+ V     
Sbjct: 246 TVLLDLPADLYKSGRGAVQSTPLIQNP------ANPTFYYLSLKGITVGSTRLPVPESEF 299

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLRP 378
           TL ++G GGTI+DSGT  T +   ++  + D F +Q+   V + N T             
Sbjct: 300 TL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--------- 349

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAV-CLTVVTDREASGGPSI 436
           C   P       P+L LHF+ GA + LP ENY F V   GS++ CL ++      GG   
Sbjct: 350 CLSAPLRAKPYVPKLVLHFE-GATMDLPRENYVFEVEDAGSSILCLAII-----EGGEVT 403

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +GNFQ QN +V YDL+N +L F    C
Sbjct: 404 TIGNFQQQNMHVLYDLQNSKLSFVPAQC 431


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 124/395 (31%), Positives = 185/395 (46%), Gaps = 46/395 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y I +  G+PP+    ILDTGS L W  C     C  C     P + PK S S R + 
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCV---PCFDCFEQNGPYYDPKDSISFRNIT 250

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
           C +P+C  +          + +P    K  TQ CP Y   YG  S  T   AL   T+NL
Sbjct: 251 CNDPRCQLV---------SSPDPPRPCKFETQSCP-YFYWYGDSSNTTGDFALETFTVNL 300

Query: 204 PN--------RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCL 249
            +        R + N + GC   +       AG+ G GRG  S  SQL       FSYCL
Sbjct: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           +     DT+ +S LI   G          L +T  +      + N    +YY+ ++ I V
Sbjct: 361 VDRD-SDTSVSSKLIF--GEDKDLLTHPELNFTSLI----AGKENPVDTFYYLQIKSIFV 413

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           GG+++++  +   L  DG GGTI+DSGTT ++ +   +  + + F+ ++   + Y     
Sbjct: 414 GGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKV---KGYKL--- 467

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
            E    L PC++V G    +FPE  + F  GA    PVENYF  + +   VCL ++   +
Sbjct: 468 VEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPK 527

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ++     I+GN+Q QN+++ YD +N RLG+    C
Sbjct: 528 SALS---IIGNYQQQNFHILYDTKNSRLGYAPMRC 559


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 131/399 (32%), Positives = 169/399 (42%), Gaps = 51/399 (12%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
           +S G Y+++LS GTPP     + DTGS L+W  C     C  C++   P F P  SS+  
Sbjct: 85  NSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCA---PCTECAARPAPPFQPASSSTFS 141

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
            L C +  C ++    + C          +  C    P     YG G T G   +ETL++
Sbjct: 142 KLPCASSLCQFLTSPYLTCN---------ATGCVYYYP-----YGMGFTAGYLATETLHV 187

Query: 204 PNRIIPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
                P    GCS  +      +GI G GR   SL SQ+ + +FSYCL S    D     
Sbjct: 188 GGASFPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRS----DADAGD 243

Query: 262 SLILDNGSSHSDKKTTG--LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           S IL      S  K TG  +  TP + NP +      S YYYV L  ITVG   + V   
Sbjct: 244 SPILFG----SLAKVTGGNVQSTPLLENPEMPS----SSYYYVNLTGITVGATDLPVTST 295

Query: 320 YLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
                R       GGTIVDSGTT T++  E +  +   F+SQM      T   G     G
Sbjct: 296 TFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTR--FG 353

Query: 376 LRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTD 427
              CFD      GS    P L L F GGAE  +   +Y  VV     G  +  CL V+  
Sbjct: 354 FDLCFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPA 413

Query: 428 REASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            E     SI I+GN    + +V YDL      F    C 
Sbjct: 414 SEKL---SISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 449


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 124/395 (31%), Positives = 185/395 (46%), Gaps = 46/395 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y I +  G+PP+    ILDTGS L W  C     C  C     P + PK S S R + 
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCV---PCFDCFEQNGPYYDPKDSISFRNIT 250

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
           C +P+C  +          + +P    K  TQ CP Y   YG  S  T   AL   T+NL
Sbjct: 251 CNDPRCQLV---------SSPDPPRPCKFETQSCP-YFYWYGDSSNTTGDFALETFTVNL 300

Query: 204 PN--------RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCL 249
            +        R + N + GC   +       AG+ G GRG  S  SQL       FSYCL
Sbjct: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           +     DT+ +S LI   G          L +T  +      + N    +YY+ ++ I V
Sbjct: 361 VDRD-SDTSVSSKLIF--GEDKDLLTHPELNFTSLI----AGKENPVDTFYYLQIKSIFV 413

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           GG+++++  +   L  DG GGTI+DSGTT ++ +   +  + + F+ ++   + Y     
Sbjct: 414 GGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKV---KGYKL--- 467

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
            E    L PC++V G    +FPE  + F  GA    PVENYF  + +   VCL ++   +
Sbjct: 468 VEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPK 527

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ++     I+GN+Q QN+++ YD +N RLG+    C
Sbjct: 528 SALS---IIGNYQQQNFHILYDTKNSRLGYAPMRC 559


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 130/391 (33%), Positives = 178/391 (45%), Gaps = 52/391 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           GGY++++S GTP      + DTGS L+W  C     C  C     P F P  SS+   L 
Sbjct: 84  GGYNMNISVGTPLLTFSVVADTGSDLIWTQCA---PCTKCFQQPAPPFQPASSSTFSKLP 140

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +  C ++ +     R CN      +  C      Y   YGSG T G   +ETL + + 
Sbjct: 141 CTSSFCQFLPNS---IRTCN------ATGCV-----YNYKYGSGYTAGYLATETLKVGDA 186

Query: 207 IIPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
             P+   GCS  +      +GIAG GRG  SL  QL + +FSYCL S         S ++
Sbjct: 187 SFPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLRSGS---AAGASPIL 243

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
             + ++ +D     +  TPFVNNP+V        YYYV L  ITVG   + V        
Sbjct: 244 FGSLANLTDGN---VQSTPFVNNPAVHPS-----YYYVNLTGITVGETDLPVTTSTFGFT 295

Query: 325 RDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRPCFD 381
           ++G  GGTIVDSGTT T++A + +E +   F+SQ   V   N TR        GL  CF 
Sbjct: 296 QNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTR--------GLDLCFK 347

Query: 382 VPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGP 434
             G   G  + P L L F GGAE  +P   YFA V     G  +  CL ++  +     P
Sbjct: 348 STGGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMMLPAKGDQ--P 403

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             ++GN    + ++ YDL      F    C 
Sbjct: 404 MSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 434


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 125/385 (32%), Positives = 182/385 (47%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++LS GTP Q    I+DTGS L+W  C     C  C +   P F P+ SSS   L 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ---PCTQCFNQSTPIFNPQGSSSFSTLP 149

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C     +++Q   C       S N  Q    Y   YG G  T+G   +ETL   +
Sbjct: 150 CSSQLC-----QALQSPTC-------SNNSCQ----YTYGYGDGSETQGSMGTETLTFGS 193

Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             IPN   GC            AG+ G GRG  SLPSQL++ KFSYC+       ++ +S
Sbjct: 194 VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG---SSTSS 250

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L+L    S ++  T G   +P   N ++ E +    +YY+ L  ++VG   + +     
Sbjct: 251 TLLL---GSLANSVTAG---SP---NTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVF 301

Query: 322 TLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
            L+  +G GG I+DSGTT T+ A   ++ +   F+SQM    N +   G+ +  G   CF
Sbjct: 302 KLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQM----NLSVVNGSSS--GFDLCF 355

Query: 381 DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
            +P +++    P   +HF GG ++ LP ENYF     G  +CL + +  +       I G
Sbjct: 356 QMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNG-LICLAMGSSSQGMS----IFG 409

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q QN  V YD  N  + F    C
Sbjct: 410 NIQQQNLLVVYDTGNSVVSFLFAQC 434


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 128/396 (32%), Positives = 186/396 (46%), Gaps = 47/396 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+    ILDTGS L W  C   Y C + + +    + PK S+S + + 
Sbjct: 160 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEA---FYDPKTSASFKNIT 216

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
           C +P+CS I          + EP    K+  Q CP Y   YG  S  T   A+   T+NL
Sbjct: 217 CNDPRCSLI---------SSPEPPVQCKSDNQSCP-YFYWYGDRSNTTGDFAVETFTVNL 266

Query: 204 -------PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
                      + N + GC   +       +G+ G GRG  S  SQL       FSYCL+
Sbjct: 267 TTTEGRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 326

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                DT  +S LI   G        T L +T FVN     + N+   +YY+ ++ I VG
Sbjct: 327 DRN-SDTNVSSKLIF--GEDKDLLNHTNLNFTSFVN----GKENSVETFYYIQIKSILVG 379

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           G+ + +  +   +  DG GGTI+DSGTT ++ A   +E + ++F  +M +N    R    
Sbjct: 380 GEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPV 439

Query: 371 EALTGLRPCFDVPG--EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
                L PCF+V G  E     PEL + F  GA    P EN F  + E   VCL ++   
Sbjct: 440 -----LDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSE-DLVCLAILGTP 493

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +++     I+GN+Q QN+++ YD +  RLGF    C
Sbjct: 494 KSTFS---IIGNYQQQNFHILYDTKMSRLGFTPTKC 526


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 126/385 (32%), Positives = 180/385 (46%), Gaps = 49/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y I ++ GTP   +  I+DTGS LVW  C     C  CS+S I       + S  L  
Sbjct: 40  GEYLIQMAIGTPALSLSAIMDTGSDLVWTKCN---PCTDCSTSSIYDPSSSSTYSKVL-- 94

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           CQ+  C     +      CN++      +C  + P     YG    T GI   ET ++ +
Sbjct: 95  CQSSLC-----QPPSIFSCNND-----GDCEYVYP-----YGDRSSTSGILSDETFSISS 139

Query: 206 RIIPNFLVGCSVLSS--RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
           + +PN   GC   +    +  G+ GFGRG  SL SQL     +KFSYCL+S    D+++T
Sbjct: 140 QSLPNITFGCGHDNQGFDKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRT--DSSKT 197

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           S L + N +S    + T +  TP V + S         +YY+ L  I+VGGQ + +    
Sbjct: 198 SPLFIGNTAS---LEATTVGSTPLVQSSSTN-------HYYLSLEGISVGGQSLAIPTGT 247

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
             +  DG+GG I+DSGTT TF+    ++ + +  VS +    N  +A G      L  CF
Sbjct: 248 FDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSI----NLPQADGQ-----LDLCF 298

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           +  G     FP +  HFK GA+  +P ENY         VCL ++    ++ G   I GN
Sbjct: 299 NQQGSSNPGFPSMTFHFK-GADYDVPKENYLFPDSTSDIVCLAMMP-TNSNLGNMAIFGN 356

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLCK 465
            Q QNY + YD  N  L F    C 
Sbjct: 357 VQQQNYQILYDNENNVLSFAPTACD 381


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 122/388 (31%), Positives = 175/388 (45%), Gaps = 50/388 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +   LDTGS L+W  C     C  C    +P F P  SS+  L  C 
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 138

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
           +  C     + +    C       ++ C      Y   YG   +T G    +        
Sbjct: 139 STLC-----QGLPVASCGSPKFWPNQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG 188

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +P    GC + ++        GIAGFGRG  SLPSQL +  FS+C  +    +  + S
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPS 245

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +++LD  +         +  TP + NP      A   +YY+ L+ ITVG  R+ V     
Sbjct: 246 TVLLDLPADLYKSGRGAVQSTPLIQNP------ANPTFYYLSLKGITVGSTRLPVPESEF 299

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLRP 378
            L ++G GGTI+DSGT  T +   ++  + D F +Q+   V + N T             
Sbjct: 300 AL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--------- 349

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAV-CLTVVTDREASGGPSI 436
           C   P       P+L LHF+ GA + LP ENY F V   GS++ CL ++      GG   
Sbjct: 350 CLSAPLRAKPYVPKLVLHFE-GATMDLPRENYVFEVEDAGSSILCLAII-----EGGEVT 403

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +GNFQ QN +V YDL+N +L F    C
Sbjct: 404 TIGNFQQQNMHVLYDLQNSKLSFVPAQC 431


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 134/429 (31%), Positives = 187/429 (43%), Gaps = 53/429 (12%)

Query: 57  TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
           TR LH  + + K      +   +  SS S G Y + L  G PPQ +  I DTGS LVW  
Sbjct: 52  TRRLHFLSLRRKPVPFVKSPVVSGASSGS-GQYFVDLRIGQPPQSLLLIADTGSDLVWVK 110

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
           C+    C + S + +  F P+ SS+     C +P C  +       R CN   + ++   
Sbjct: 111 CSACRNCSHHSPATV--FFPRHSSTFSPAHCYDPVCRLVPKPGRAPR-CNHTRIHST--- 164

Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPA------ 224
              CP Y   Y  G LT G+   ET +L         + +   GC    S Q        
Sbjct: 165 ---CP-YEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFN 220

Query: 225 ---GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
              G+ G GRG  S  SQL     +KFSYCL+ +       TS LI+ +G     K    
Sbjct: 221 GANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP-TSYLIIGDGGDAVSK---- 275

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           L +TP + NP          +YYV L+ + V G ++R+      +D  GNGGT++DSGTT
Sbjct: 276 LFFTPLLTNPLSP------TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTT 329

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT-GLRPCFDVPG--EKTGSFPELKL 395
             F+A   +  L    V Q +K  N      A+ LT G   C +V G  +     P LK 
Sbjct: 330 LAFLADPAYR-LVIAAVKQRIKLPN------ADELTPGFDLCVNVSGVTKPEKILPRLKF 382

Query: 396 HFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQ 455
            F GGA    P  NYF +  E    CL + +     G    ++GN   Q +  E+D    
Sbjct: 383 EFSGGAVFVPPPRNYF-IETEEQIQCLAIQSVDPKVG--FSVIGNLMQQGFLFEFDRDRS 439

Query: 456 RLGFKQQLC 464
           RLGF ++ C
Sbjct: 440 RLGFSRRGC 448


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 123/385 (31%), Positives = 181/385 (47%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++LS GTP Q    I+DTGS L+W  C     C  C +   P F P+ SSS   L 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ---PCTQCFNQSTPIFNPQGSSSFSTLP 149

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C     +++Q   C       S N  Q    Y   YG G  T+G   +ETL   +
Sbjct: 150 CSSQLC-----QALQSPTC-------SNNSCQ----YTYGYGDGSETQGSMGTETLTFGS 193

Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             IPN   GC            AG+ G GRG  SLPSQL++ KFSYC+       ++ +S
Sbjct: 194 VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG---SSNSS 250

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L+L    S ++  T G   +P   N ++ + +    +YY+ L  ++VG   + +     
Sbjct: 251 TLLL---GSLANSVTAG---SP---NTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVF 301

Query: 322 TLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
            L+  +G GG I+DSGTT T+     ++ +   F+SQM    N +   G+ +  G   CF
Sbjct: 302 KLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQM----NLSVVNGSSS--GFDLCF 355

Query: 381 DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
            +P +++    P   +HF GG ++ LP ENYF     G  +CL + +  +       I G
Sbjct: 356 QMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNG-LICLAMGSSSQGMS----IFG 409

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q QN  V YD  N  + F    C
Sbjct: 410 NIQQQNLLVVYDTGNSVVSFLSAQC 434


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 127/390 (32%), Positives = 183/390 (46%), Gaps = 49/390 (12%)

Query: 95  FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
            G PPQ    I+DTGS+L+W  C+   Q   C S  +  + P  S ++R + C +  C+ 
Sbjct: 77  IGDPPQQAEAIIDTGSNLIWTQCST-CQPAGCFSQNLSFYDPSRSRTARPVACNDTACA- 134

Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-PNRIIPNFLV 213
           +  E+   RD        +K C     + L  YG+G+  G+  +E     P     +   
Sbjct: 135 LGSETRCARD--------NKAC-----AVLTAYGAGVIGGVLGTEAFTFQPQSENVSLAF 181

Query: 214 GCSVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
           GC   +   P      +GI G GRG  SL SQL  +KFSYCL  + F  +T TS L +  
Sbjct: 182 GCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCLTPY-FSQSTNTSRLFVGA 240

Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
            +  S       T  PF+ NP V   + FS +YY+ L  ITVG  ++ V      L +  
Sbjct: 241 SAGLSSGGAPA-TSVPFLKNPDV---DPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVA 296

Query: 328 NG---GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
            G   GT++DSG+ FT +    ++ L DE V Q+  +     A GAE   GL  C  V  
Sbjct: 297 TGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPA-GAE---GLDLCAAVAH 352

Query: 385 EKTGSF-PELKLHF-KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP-------- 434
              G   P L LHF  GG +V +P ENY+  V + +A C+ V +    SGGP        
Sbjct: 353 GDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTA-CMVVFS----SGGPNSTLPMNE 407

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + I+GN+  Q+ ++ YDL    L F+   C
Sbjct: 408 TTIIGNYMQQDMHLLYDLEKGMLSFQPADC 437


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  151 bits (381), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 131/460 (28%), Positives = 199/460 (43%), Gaps = 53/460 (11%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKT 69
           L ++FF  +L  +P +  +L   LS           + L  +V  S  RA ++  P +  
Sbjct: 14  LPYLFFLAILFAWPVTSATLRAHLSHVDDGRGFTKRELLRRMVVRSRARAANL-CPYSGA 72

Query: 70  TTTTTTTTTTNISSHSYGGYSISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSS 128
           T    T      ++     Y I LS G P  Q +   LDTGS +VW  C     C  C +
Sbjct: 73  TARPATAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE---PCAECFT 129

Query: 129 SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG 188
             +P F    S++ R + C +P C+  H E                 C     +Y+  YG
Sbjct: 130 QPLPRFDTAASNTVRSVACSDPLCN-AHSE---------------HGCFLHGCTYVSGYG 173

Query: 189 SG-LTEGIALSETLNLPNR------IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLP 237
            G L+ G  L ++    +        +P+   GC + ++    +   GIAGFGRG  SLP
Sbjct: 174 DGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLP 233

Query: 238 SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
           SQL + +FSYC  + +F+   ++S + L          T  +  TPFV +      N+  
Sbjct: 234 SQLKVRQFSYCFTT-RFE--AKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNS-- 288

Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
            +Y +  + +TVG  R+ V      +  DG+G T +DSGT  T     +F  L   F++Q
Sbjct: 289 -HYVLSFKGVTVGKTRLPVPE----IKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQ 343

Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
                N T             CF   G+KT + P+L  H + GA+  LP ENY     E 
Sbjct: 344 AALPVNKTADED-------DICFSWDGKKTAAMPKLVFHLE-GADWDLPRENYVTEDRES 395

Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
             VC+ V T  +       ++GNFQ QN ++ YDL   +L
Sbjct: 396 GQVCVAVSTSGQMD---RTLIGNFQQQNTHIVYDLAAGKL 432


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  150 bits (380), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 119/381 (31%), Positives = 172/381 (45%), Gaps = 39/381 (10%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + ++ GTPP  +  +LDTGS L+W  C     C+ C     P + P  S++   + C+
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQC--DAPCRRCFPQPAPLYAPARSATYANVSCR 149

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
           +P C  +     +C   +         C     +Y   YG G  T+G+  +ET  L  + 
Sbjct: 150 SPMCQALQSPWSRCSPPD-------TGC-----AYYFSYGDGTSTDGVLATETFTLGSDT 197

Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
            +     GC   ++ S+   +G+ G GRG  SL SQL + +FSYC       + T  S L
Sbjct: 198 AVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPF---NATAASPL 254

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
            L + +  S    T    TPFV +PS   R   S YYY+ L  ITVG   + +      L
Sbjct: 255 FLGSSARLSSAAKT----TPFVPSPSGGARRR-SSYYYLSLEGITVGDTLLPIDPAVFRL 309

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
              G+GG I+DSGTTFT +    F  LA    S++         L + A  GL  CF   
Sbjct: 310 TPMGDGGVIIDSGTTFTALEERAFVALARALASRV------RLPLASGAHLGLSLCFAAA 363

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
             +    P L LHF  GA++ L  E+Y          CL +V+ R  S     +LG+ Q 
Sbjct: 364 SPEAVEVPRLVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARGMS-----VLGSMQQ 417

Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
           QN ++ YDL    L F+   C
Sbjct: 418 QNTHILYDLERGILSFEPAKC 438


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 139/452 (30%), Positives = 201/452 (44%), Gaps = 63/452 (13%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTT---TTTTTTNISSHSYGGY 89
           L+R H+NP   + +     V  +L R +H     T+   ++   T    T     + G Y
Sbjct: 33  LTRIHSNPDVSATE----FVRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEY 88

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS----FIPKLSSSSRLL 145
            ++L+ GTPP   P I DTGS L+W       QC  C S         + P  S++  +L
Sbjct: 89  IMTLAIGTPPLSYPAIADTGSDLIW------TQCAPCGSQCFKQAGQPYNPSSSTTFGVL 142

Query: 146 GCQNP--KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
            C +    C+ +   S                C+ +   Y   YG+G T GI   ET   
Sbjct: 143 PCNSSVSMCAALAGPS------------PPPGCSCM---YNQTYGTGWTAGIQSVETFTF 187

Query: 204 -----PNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
                    +P    GCS  SS      AG+ G GRG  SL SQL    FSYCL    F 
Sbjct: 188 GSTPADQTRVPGIAFGCSNASSDDWNGSAGLVGLGRGSMSLVSQLGAGMFSYCL--TPFQ 245

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           D   TS+L+L   ++ +    TG+  TPFV +PS A     S YYY+ L  I++G   + 
Sbjct: 246 DANSTSTLLLGPSAALNG---TGVLTTPFVASPSKAP---MSTYYYLNLTGISIGTTALS 299

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      L  DG GG I+DSGTT T +    ++ +     S +        A G+++ TG
Sbjct: 300 IPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQVRAAIESLV----TLPVADGSDS-TG 354

Query: 376 LRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           L  CF +  E +   S P +  HF  GA++ LPV+NY  ++G G   CL +   R  + G
Sbjct: 355 LDLCFALTSETSTPPSMPSMTFHFD-GADMVLPVDNYM-ILGSG-VWCLAM---RNQTVG 408

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                GN+Q QN ++ YD+  + L F    C 
Sbjct: 409 AMSTFGNYQQQNVHLLYDIHEETLSFAPAKCS 440


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 126/395 (31%), Positives = 176/395 (44%), Gaps = 47/395 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+    ILDTGS L W  C     C  C     P + PK SSS R +G
Sbjct: 88  GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCV---PCHDCFEQNGPYYDPKESSSFRNIG 144

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSET--LNL 203
           C +P+C  +           D PL   K   Q CP Y   YG S  T G   +ET  +NL
Sbjct: 145 CHDPRCHLVSSP--------DPPLPC-KAENQTCP-YFYWYGDSSNTTGDFATETFTVNL 194

Query: 204 PN-------RIIPNFLVGCSVLSSRQPAGIAGFGRGKT---SLPSQLNL---DKFSYCLL 250
            +       + + N + GC   +     G +G         S  SQL       FSYCL+
Sbjct: 195 TSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 254

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                DT  +S LI   G          L +T  V      + N    +YYV ++ I VG
Sbjct: 255 DRN-SDTNVSSKLIF--GEDKDLLNHPELNFTTLVG----GKENPVDTFYYVQIKSIMVG 307

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           G+ + +      +  DG GGTIVDSGTT ++     ++ + D FV ++   + Y      
Sbjct: 308 GEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKV---KGYPI---V 361

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
           +    L PC++V G +    P+  + F  GA    PVENYF  +     VCL ++ T R 
Sbjct: 362 QDFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRS 421

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           A      I+GN+Q QN++V YD +  RLG+    C
Sbjct: 422 ALS----IIGNYQQQNFHVLYDTKKSRLGYAPMNC 452


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 119/381 (31%), Positives = 172/381 (45%), Gaps = 39/381 (10%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + ++ GTPP  +  +LDTGS L+W  C     C+ C     P + P  S++   + C+
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQC--DAPCRRCFPQPAPLYAPARSATYANVSCR 149

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
           +P C  +     +C   +         C     +Y   YG G  T+G+  +ET  L  + 
Sbjct: 150 SPMCQALQSPWSRCSPPD-------TGC-----AYYFSYGDGTSTDGVLATETFTLGSDT 197

Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
            +     GC   ++ S+   +G+ G GRG  SL SQL + +FSYC       + T  S L
Sbjct: 198 AVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPF---NATAASPL 254

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
            L + +  S    T    TPFV +PS   R   S YYY+ L  ITVG   + +      L
Sbjct: 255 FLGSSARLSSAAKT----TPFVPSPSGGARRR-SSYYYLSLEGITVGDTLLPIDPAVFRL 309

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
              G+GG I+DSGTTFT +    F  LA    S++         L + A  GL  CF   
Sbjct: 310 TPMGDGGVIIDSGTTFTALEESAFVALARALASRV------RLPLASGAHLGLSLCFAAA 363

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
             +    P L LHF  GA++ L  E+Y          CL +V+ R  S     +LG+ Q 
Sbjct: 364 SPEAVEVPRLVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARGMS-----VLGSMQQ 417

Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
           QN ++ YDL    L F+   C
Sbjct: 418 QNTHILYDLERGILSFEPAKC 438


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 132/429 (30%), Positives = 184/429 (42%), Gaps = 53/429 (12%)

Query: 57  TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
           TR LH  + + K      +   +  +S S G Y + L  G PPQ +  I DTGS LVW  
Sbjct: 53  TRRLHFLSLRRKPIPFVKSPVVSGAASGS-GQYFVDLRIGQPPQSLLLIADTGSDLVWVK 111

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
           C+    C + S + +  F P+ SS+     C +P C  +          +  P+      
Sbjct: 112 CSACRNCSHHSPATV--FFPRHSSTFSPAHCYDPVCRLVPKP-------DRAPICNHTRI 162

Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPA------ 224
              C  Y   Y  G LT G+   ET +L         + +   GC    S Q        
Sbjct: 163 HSTC-HYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFN 221

Query: 225 ---GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
              G+ G GRG  S  SQL     +KFSYCL+ +       TS LI+ NG     K    
Sbjct: 222 GANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP-TSYLIIGNGGDGISK---- 276

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           L +TP + NP          +YYV L+ + V G ++R+      +D  GNGGT+VDSGTT
Sbjct: 277 LFFTPLLTNPLSP------TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTT 330

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT-GLRPCFDVPG--EKTGSFPELKL 395
             F+A    EP    + S +   R   +   A+ALT G   C +V G  +     P LK 
Sbjct: 331 LAFLA----EP---AYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKF 383

Query: 396 HFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQ 455
            F GGA    P  NYF +  E    CL + +     G    ++GN   Q +  E+D    
Sbjct: 384 EFSGGAVFVPPPRNYF-IETEEQIQCLAIQSVDPKVG--FSVIGNLMQQGFLFEFDRDRS 440

Query: 456 RLGFKQQLC 464
           RLGF ++ C
Sbjct: 441 RLGFSRRGC 449


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 121/385 (31%), Positives = 176/385 (45%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++LS GTP Q    I+DTGS L+W  C     C  C +   P F P+ SSS   L 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ---PCTQCFNQSTPIFNPQGSSSFSTLP 149

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  +                +S  C+     Y   YG G  T+G   +ETL   +
Sbjct: 150 CSSQLCQAL----------------SSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS 193

Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             IPN   GC            AG+ G GRG  SLPSQL++ KFSYC+       ++  S
Sbjct: 194 VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG---SSTPS 250

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L+L    S ++  T G   +P   N ++ + +    +YY+ L  ++VG  R+ +     
Sbjct: 251 NLLL---GSLANSVTAG---SP---NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAF 301

Query: 322 TLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
            L+  +G GG I+DSGTT T+     ++ +  EF+SQ+    N     G+ +  G   CF
Sbjct: 302 ALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI----NLPVVNGSSS--GFDLCF 355

Query: 381 DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
             P + +    P   +HF GG ++ LP ENYF     G  +CL + +  +       I G
Sbjct: 356 QTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNG-LICLAMGSSSQGMS----IFG 409

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q QN  V YD  N  + F    C
Sbjct: 410 NIQQQNMLVVYDTGNSVVSFASAQC 434


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 135/415 (32%), Positives = 194/415 (46%), Gaps = 47/415 (11%)

Query: 65  PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
           P+         T  + ++  S G Y + L  GTPP+    I+DTGS L W  C     C 
Sbjct: 129 PRRALAERIVATVESGVAVGS-GEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCA---PCL 184

Query: 125 YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYL 184
            C   + P F P  S S R + C +P+C  +   +         P A  +  +  CP Y 
Sbjct: 185 DCFEQRGPVFDPAASLSYRNVTCGDPRCGLVAPPT--------APRACRRPHSDPCP-YY 235

Query: 185 VLYG--SGLTEGIALSE-TLNL----PNRIIPNFLVGCSVLSSR----QPAGIAGFGRGK 233
             YG  S  T  +AL   T+NL     +R + + + GC   S+R      AG+ G GRG 
Sbjct: 236 YWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCG-HSNRGLFHGAAGLLGLGRGA 294

Query: 234 TSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
            S  SQL       FSYCL+ H    ++  S ++   G   +      L YT      + 
Sbjct: 295 LSFASQLRAVYGHAFSYCLVDHG---SSVGSKIVF--GDDDALLGHPRLNYT----AFAP 345

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
           +   A   +YYV L+ + VGG+++ +      + +DG+GGTI+DSGTT ++ A   +E +
Sbjct: 346 SAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVI 405

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY 410
              FV +M K       L A+    L PC++V G +    PE  L F  GA    P ENY
Sbjct: 406 RRAFVERMDK----AYPLVAD-FPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENY 460

Query: 411 FAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           F  +     +CL V+ T R A      I+GNFQ QN++V YDL+N RLGF  + C
Sbjct: 461 FVRLDPDGIMCLAVLGTPRSAMS----IIGNFQQQNFHVLYDLQNNRLGFAPRRC 511


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 190/404 (47%), Gaps = 62/404 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L  GTP   +  I+DTGS + W  C     CK C  +  P F P+ SSS   L C 
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCV---PCKDCVPALRPPFNPRHSSSFFKLPCA 194

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL--NLPN 205
           +  C+ ++           +P  +    T +   + + YG G L+ G+   ET+  N PN
Sbjct: 195 SSTCTNVYQ--------GVKPFCSPSGRTCL---FSIQYGDGSLSSGLLAMETIAGNTPN 243

Query: 206 ------RIIPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLN---LDKFSYCL--- 249
                   + N  +GC+ +         +G+ G  R   S PSQL+     KFS+C    
Sbjct: 244 FGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDK 303

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           ++H       +S L+       SD  +  L YTP V NP+V   +A   YYYVGL  I+V
Sbjct: 304 IAH-----LNSSGLVF---FGESDIISPYLRYTPLVQNPAVP--SASLDYYYVGLVGISV 353

Query: 310 GGQRVRVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
              R+ + HK   +D+  G+GGTI+DSGT FT++    F+ +  EF+++       +   
Sbjct: 354 DESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLART------SHLA 407

Query: 369 GAEALTGLRPCFDV----PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG---EGSAVC 421
             +  +G  PC+++       ++   P + LHF+GG +V LP  +    V    E + +C
Sbjct: 408 KVDDNSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLC 467

Query: 422 LTVVTDREASGG-PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           L      + SG  P  I+GN+Q QN +VEYDL   RLG     C
Sbjct: 468 LAF----QMSGDIPFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 507


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 123/403 (30%), Positives = 189/403 (46%), Gaps = 60/403 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L  GTP   +  I+DTGS + W  C     CK C  +  P F P+ SSS   L C 
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCV---PCKDCVPALRPPFNPRHSSSFFKLPCA 195

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL--NLPN 205
           +  C+ ++           +P  +    T +   + + YG G L+ G+   ET+  N PN
Sbjct: 196 SSTCTNVYQ--------GVKPFCSPSGRTCL---FSIQYGDGSLSSGLLAMETIAGNTPN 244

Query: 206 ------RIIPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLN---LDKFSYCL--- 249
                   + N  +GC+ +         +G+ G  R   S PSQL+     KFS+C    
Sbjct: 245 FGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDK 304

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           ++H       +S L+       SD  +  L YTP V NP+V   +A   YYYVGL  I+V
Sbjct: 305 IAH-----LNSSGLVF---FGESDIISPYLRYTPLVQNPAVP--SASLDYYYVGLVGISV 354

Query: 310 GGQRVRVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
              R+ + HK   +D+  G+GGTI+DSGT FT++    F+ +  EF+++       +   
Sbjct: 355 DESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLART------SHLA 408

Query: 369 GAEALTGLRPCFDV----PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG---EGSAVC 421
             +  +G  PC+++       ++   P + LHF+GG +V LP  +    V    E + +C
Sbjct: 409 KVDDNSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLC 468

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           L  +   +    P  I+GN+Q QN +VEYDL   RLG     C
Sbjct: 469 LAFLMSGDI---PFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 508


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 135/415 (32%), Positives = 194/415 (46%), Gaps = 47/415 (11%)

Query: 65  PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
           P+         T  + ++  S G Y + L  GTPP+    I+DTGS L W  C     C 
Sbjct: 129 PRRALAERIVATVESGVAVGS-GEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCA---PCL 184

Query: 125 YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYL 184
            C   + P F P  S S R + C +P+C  +   +         P A  +  +  CP Y 
Sbjct: 185 DCFEQRGPVFDPATSLSYRNVTCGDPRCGLVAPPT--------APRACRRPHSDPCP-YY 235

Query: 185 VLYG--SGLTEGIALSE-TLNL----PNRIIPNFLVGCSVLSSR----QPAGIAGFGRGK 233
             YG  S  T  +AL   T+NL     +R + + + GC   S+R      AG+ G GRG 
Sbjct: 236 YWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCG-HSNRGLFHGAAGLLGLGRGA 294

Query: 234 TSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
            S  SQL       FSYCL+ H    ++  S ++   G   +      L YT      + 
Sbjct: 295 LSFASQLRAVYGHAFSYCLVDHG---SSVGSKIVF--GDDDALLGHPRLNYT----AFAP 345

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
           +   A   +YYV L+ + VGG+++ +      + +DG+GGTI+DSGTT ++ A   +E +
Sbjct: 346 SAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVI 405

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY 410
              FV +M K       L A+    L PC++V G +    PE  L F  GA    P ENY
Sbjct: 406 RRAFVERMDK----AYPLVAD-FPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENY 460

Query: 411 FAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           F  +     +CL V+ T R A      I+GNFQ QN++V YDL+N RLGF  + C
Sbjct: 461 FVRLDPDGIMCLAVLGTPRSAMS----IIGNFQQQNFHVLYDLQNNRLGFAPRRC 511


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 133/406 (32%), Positives = 186/406 (45%), Gaps = 58/406 (14%)

Query: 79  TNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI------- 131
             +S  S  G+S+++  GTPPQ    I+DTGS L+W       QCK  SS+ +       
Sbjct: 81  VRLSPLSDQGHSLTVGIGTPPQPRKLIVDTGSDLIW------TQCKLSSSTAVAARHGSP 134

Query: 132 PSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN-CTQICPSYLVLYGSG 190
           P + P  SS+   L C +  C          ++C      TSKN C      Y  +YGS 
Sbjct: 135 PVYDPGESSTFAFLPCSDRLC---QEGQFSFKNC------TSKNRCV-----YEDVYGSA 180

Query: 191 LTEGIALSETLNLPNRIIPNFLVG--CSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKF 245
              G+  SET     R   +  +G  C  LS+       GI G      SL +QL + +F
Sbjct: 181 AAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRF 240

Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGL 304
           SYCL    F D  +TS L+    +  S  KTT  +  T  V+NP        +VYYYV L
Sbjct: 241 SYCLT--PFADK-KTSPLLFGAMADLSRHKTTRPIQTTAIVSNP------VKTVYYYVPL 291

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
             I++G +R+ V    L +  DG GGTIVDSG+T  ++    FE +  E V  +V+    
Sbjct: 292 VGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVA 350

Query: 365 TRALGAEALTGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENYFAVVGEGS 418
            R +    L     CF +P     +       P L LHF GGA + LP +NYF     G 
Sbjct: 351 NRTVEDYEL-----CFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAG- 404

Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +CL V    + SG    I+GN Q QN +V +D+++ +  F    C
Sbjct: 405 LMCLAVGKTTDGSG--VSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 448


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 135/427 (31%), Positives = 189/427 (44%), Gaps = 50/427 (11%)

Query: 54  SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
           S + R     +P+   +     T  + ++  S G Y I +  GTPP+    I+DTGS L 
Sbjct: 115 SGVARMPASSSPRRALSERMVATVESGVAVGS-GEYLIDVYVGTPPRRFRMIMDTGSDLN 173

Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
           W  C     C  C   + P F P  SSS R + C + +C  +           + P A  
Sbjct: 174 WLQCA---PCLDCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPP--------EAPRACR 222

Query: 174 KNCTQICPSYLVLYG--SGLTEGIAL-SETLNL----PNRIIPNFLVGCSVLSS---RQP 223
           +     CP Y   YG  S  T  +AL S T+NL     +R +   + GC   +       
Sbjct: 223 RPAEDSCP-YYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGA 281

Query: 224 AGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
           AG+ G GRG  S  SQL       FSYCL+ H  D  ++        G  +       L 
Sbjct: 282 AGLLGLGRGPLSFASQLRAVYGHTFSYCLVEHGSDAGSKVVF-----GEDYLVLAHPQLK 336

Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
           YT F    S A+      +YYV L+ + VGG  + +      + +DG+GGTI+DSGTT +
Sbjct: 337 YTAFAPTSSPAD-----TFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSGTTLS 391

Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
           +     ++ +   FV  M  +R Y           L PC++V G +    PEL L F  G
Sbjct: 392 YFVEPAYQVIRQAFVDLM--SRLYPL---IPDFPVLNPCYNVSGVERPEVPELSLLFADG 446

Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEYDLRNQRL 457
           A    P ENYF  +     +CL V       G P     I+GNFQ QN++V YDL+N RL
Sbjct: 447 AVWDFPAENYFVRLDPDGIMCLAV------RGTPRTGMSIIGNFQQQNFHVVYDLQNNRL 500

Query: 458 GFKQQLC 464
           GF  + C
Sbjct: 501 GFAPRRC 507


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 116/392 (29%), Positives = 168/392 (42%), Gaps = 45/392 (11%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + ++ GTPPQ +  ILDTGS L W  C     C  C    +P F P  S +  +L C 
Sbjct: 85  YLVHMAIGTPPQPVQLILDTGSDLTWTQCA---PCVSCFRQSLPRFNPSRSMTFSVLPC- 140

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI- 207
                    +   CRD              IC          +T G   S+T +  +   
Sbjct: 141 ---------DLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADH 191

Query: 208 ------IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
                 +P+   GC + ++        GIAGF RG  S+P+QL +D FSYC  +    + 
Sbjct: 192 AIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEP 251

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           +     +  N  S +     G+  +  +     ++  A    YY+ L+ +TVG  R+ + 
Sbjct: 252 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKA----YYISLKGVTVGTTRLPIP 307

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGL 376
                L  DG GGTIVDSGT  T +   ++  + D FV+Q  +   N T +L        
Sbjct: 308 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS------- 360

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV---CLTVVTDREASGG 433
           + CF VP       P L LHF+ GA + LP ENY   + E   +   CL +    + S  
Sbjct: 361 QLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLS-- 417

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
              ++GNFQ QN +V YDL N  L F    C 
Sbjct: 418 ---VIGNFQQQNMHVLYDLANDMLSFVPARCN 446


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 119/384 (30%), Positives = 171/384 (44%), Gaps = 48/384 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++++ GTP      I+DTGS L+W  C     C  C S   P F P+ SSS   L 
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE---PCTQCFSQPTPIFNPQDSSSFSTLP 150

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C++  C  +  E+    +C                 Y   YG G  T+G   +ET     
Sbjct: 151 CESQYCQDLPSETCNNNECQ----------------YTYGYGDGSTTQGYMATETFTFET 194

Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +PN   GC            AG+ G G G  SLPSQL + +FSYC+ S+    ++  S
Sbjct: 195 SSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYG---SSSPS 251

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L L + +S   + +   T      NP+         YYY+ L+ ITVGG  + +     
Sbjct: 252 TLALGSAASGVPEGSPSTTLIHSSLNPT---------YYYITLQGITVGGDNLGIPSSTF 302

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            L  DG GG I+DSGTT T++  + +  +A  F  Q+    N       E+ +GL  CF 
Sbjct: 303 QLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI----NLPTV--DESSSGLSTCFQ 356

Query: 382 VPGE-KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
            P +  T   PE+ + F GG  + L  +N      EG  +CL + +  +   G S I GN
Sbjct: 357 QPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEG-VICLAMGSSSQL--GIS-IFGN 411

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q Q   V YDL+N  + F    C
Sbjct: 412 IQQQETQVLYDLQNLAVSFVPTQC 435


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 120/433 (27%), Positives = 193/433 (44%), Gaps = 51/433 (11%)

Query: 46  QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNI-SSHSYGGYSISLSFGTPPQIIPF 104
           Q L+  ++ S  R   +++            T   +  + S G Y + L+ GTPP     
Sbjct: 45  QLLSRAIARSKARVAALQSAAVSPAPVADPITAARVLVTASSGEYLVDLAIGTPPLYYTA 104

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           I+DTGS L+W  C     C  C++   P F  K S++ R L C++ +C+ +         
Sbjct: 105 IMDTGSDLIWTQCA---PCLLCAAQPTPYFDVKRSATYRALPCRSSRCAAL--------- 152

Query: 165 CNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-----PNRIIPNFLVGCSVL 218
                  +S +C +    Y   YG +  T G+  +ET              N   GC  L
Sbjct: 153 -------SSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCGSL 205

Query: 219 SSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
           ++ + A   G+ GFGRG  SL SQL   +FSYCL S+     +R    +  N +S +   
Sbjct: 206 NAGELANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSPTPSRLYFGVFANLNSTNTSS 265

Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
            + +  TPFV NP++         Y++ ++ I++G +R+ +      ++ DG GG I+DS
Sbjct: 266 GSPVQSTPFVINPALPNM------YFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDS 319

Query: 336 GTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGLRPCFDVPGEK--TGSFPE 392
           GT+ T++  + +E +     S + +   N T         GL  CF  P     T + P+
Sbjct: 320 GTSITWLQQDAYEAVRRGLASTIPLPAMNDTD-------IGLDTCFQWPPPPNVTVTVPD 372

Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDL 452
              HF  GA +TLP ENY  +      +CL +      +     I+GN+Q QN ++ YD+
Sbjct: 373 FVFHFD-GANMTLPPENYMLIASTTGYLCLAMAPTSVGT-----IIGNYQQQNLHLLYDI 426

Query: 453 RNQRLGFKQQLCK 465
            N  L F    C 
Sbjct: 427 ANSFLSFVPAPCD 439


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 126/394 (31%), Positives = 177/394 (44%), Gaps = 45/394 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y I +  GTPP+    ILDTGS L W  C     C  C     P + P  SSS R +G
Sbjct: 179 GEYFIDVFVGTPPKHFSLILDTGSDLNWIQCV---PCYECFEQNGPHYDPGQSSSYRNIG 235

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
           C + +C  +          + +P    K   Q CP Y   YG  S  T   AL   T+NL
Sbjct: 236 CHDSRCHLV---------SSPDPPQPCKAENQTCP-YYYWYGDSSNTTGDFALETFTVNL 285

Query: 204 ------PN-RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
                 P  R + N + GC   +       AG+ G GRG  S  SQL       FSYCL+
Sbjct: 286 TMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 345

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                D   +S LI   G          L +T  V      + N    +YYV ++ I VG
Sbjct: 346 DRN-SDANVSSKLIF--GEDKDLLSHPELNFTTLV----AGKENPVDTFYYVQIKSIVVG 398

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           G+ V +  +   +  DG+GGTI+DSGTT ++ A   ++ + + F   M K + Y      
Sbjct: 399 GEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAF---MAKVKGYPV---V 452

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
           +    L PC++V G +    P+  + F  GA    PVENYF  +     VCL ++    +
Sbjct: 453 KDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPS 512

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +     I+GN+Q QN+++ YD +  RLGF    C
Sbjct: 513 ALS---IIGNYQQQNFHILYDTKKSRLGFAPTKC 543


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 129/397 (32%), Positives = 175/397 (44%), Gaps = 58/397 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + +S GTPP+ +   LDTGS LVW  C     C        P   P  SS+   L C 
Sbjct: 90  YLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDC--FEQGAAPVLDPAASSTHAALPCD 147

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETL------ 201
            P C  +   S   R   D      ++C      Y+  YG   LT G   +++       
Sbjct: 148 APLCRALPFTSCGGRSWGD------RSCV-----YVYHYGDRSLTVGQLATDSFTFGGDD 196

Query: 202 NLPNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
           N           GC  ++         GIAGFGRG+ SLPSQLN+  FSYC  S  FD  
Sbjct: 197 NAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTS-MFD-- 253

Query: 258 TRTSSLILDNGSS------HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
           T++SS++    ++      H    T  +  T  + NPS          Y+V LR I+VGG
Sbjct: 254 TKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPS------LYFVPLRGISVGG 307

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
            RV V    L         TI+DSG + T +  +++E +  EFVSQ+           A 
Sbjct: 308 ARVAVPESRL------RSSTIIDSGASITTLPEDVYEAVKAEFVSQV------GLPAAAA 355

Query: 372 ALTGLRPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
               L  CF +P     +  + P L LH  GGA+  LP  NY  V  + +A  L VV D 
Sbjct: 356 GSAALDLCFALPVAALWRRPAVPALTLHLDGGADWELPRGNY--VFEDYAARVLCVVLD- 412

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            A+ G  +++GN+Q QN +V YDL N  L F    C 
Sbjct: 413 -AAAGEQVVIGNYQQQNTHVVYDLENDVLSFAPARCD 448


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 126/447 (28%), Positives = 195/447 (43%), Gaps = 50/447 (11%)

Query: 31  FSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
             L+      S    Q L+  ++ S  R   +++           T    + + S G Y 
Sbjct: 31  LKLTHVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYL 90

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           + L+ GTPP     I+DTGS L+W  C     C  C+    P F  K S++ R L C++ 
Sbjct: 91  VDLAIGTPPLYYTAIMDTGSDLIWTQCA---PCLLCADQPTPYFDVKKSATYRALPCRSS 147

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-----NLP 204
           +C+ +                +S +C +    Y   YG +  T G+  +ET      N  
Sbjct: 148 RCASL----------------SSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANST 191

Query: 205 NRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
                N   GC  L++   A   G+ GFGRG  SL SQL   +FSYCL S+     +R  
Sbjct: 192 KVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLY 251

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
             +  N SS +    + +  TPFV NP++         Y++ L+ I++G + + +     
Sbjct: 252 FGVYANLSSTNTSSGSPVQSTPFVINPALPNM------YFLSLKAISLGTKLLPIDPLVF 305

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGLRPCF 380
            ++ DG GG I+DSGT+ T++  + +E +    VS + +   N T         GL  CF
Sbjct: 306 AINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTD-------IGLDTCF 358

Query: 381 DVPGEK--TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
             P     T + P+L  HF   A +TL  ENY  +      +CL +     A  G   I+
Sbjct: 359 QWPPPPNVTVTVPDLVFHFD-SANMTLLPENYMLIASTTGYLCLVM-----APTGVGTII 412

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           GN+Q QN ++ YD+ N  L F    C 
Sbjct: 413 GNYQQQNLHLLYDIGNSFLSFVPAPCD 439


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 168/389 (43%), Gaps = 39/389 (10%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + ++ GTPPQ +  ILDTGS L W  C     C  C    +P F P  S +  +L C 
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCA---PCVSCFRQSLPRFNPSRSMTFSVLPC- 166

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI- 207
                    +   CRD              IC          +T G   S+T +  +   
Sbjct: 167 ---------DLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADH 217

Query: 208 ------IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
                 +P+   GC + ++        GIAGF RG  S+P+QL +D FSYC  +    + 
Sbjct: 218 AIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEP 277

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           +     +  N  S +     G+  +  +     ++  A    YY+ L+ +TVG  R+ + 
Sbjct: 278 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKA----YYISLKGVTVGTTRLPIP 333

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGL 376
                L  DG GGTIVDSGT  T +   ++  + D FV+Q  +   N T +L        
Sbjct: 334 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS------- 386

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
           + CF VP       P L LHF+ GA + LP ENY   + E   + LT +     +G    
Sbjct: 387 QLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAIN--AGEDLS 443

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ++GNFQ QN +V YDL N  L F    C 
Sbjct: 444 VIGNFQQQNMHVLYDLANDMLSFVPARCN 472


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 168/389 (43%), Gaps = 39/389 (10%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + ++ GTPPQ +  ILDTGS L W  C     C  C    +P F P  S +  +L C 
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCA---PCVSCFRQSLPRFNPSRSMTFSVLPC- 166

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI- 207
                    +   CRD              IC          +T G   S+T +  +   
Sbjct: 167 ---------DLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADH 217

Query: 208 ------IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
                 +P+   GC + ++        GIAGF RG  S+P+QL +D FSYC  +    + 
Sbjct: 218 AIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEP 277

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           +     +  N  S +     G+  +  +     ++  A    YY+ L+ +TVG  R+ + 
Sbjct: 278 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKA----YYISLKGVTVGTTRLPIP 333

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGL 376
                L  DG GGTIVDSGT  T +   ++  + D FV+Q  +   N T +L        
Sbjct: 334 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS------- 386

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
           + CF VP       P L LHF+ GA + LP ENY   + E   + LT +     +G    
Sbjct: 387 QLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAIN--AGEDLS 443

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ++GNFQ QN +V YDL N  L F    C 
Sbjct: 444 VIGNFQQQNMHVLYDLANDMLSFVPARCN 472


>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 131/408 (32%), Positives = 189/408 (46%), Gaps = 65/408 (15%)

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
           LDTGS LVWFPC   + C  C S  +P   P   SSS      +       H S+   D 
Sbjct: 100 LDTGSDLVWFPC-RPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSAAHSSLPSSD- 157

Query: 166 NDEPLATSKNC-------------TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
               L    NC             +  CP +   YG G       S++L+LP+  + NF 
Sbjct: 158 ----LCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFT 213

Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKFDD--TTRTSSLI 264
            GC+  +  +P G+AGFGRG+ SLP+QL +      + FSYCL+SH FD     R S LI
Sbjct: 214 FGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLI 273

Query: 265 LDNGSSHSDKKT----------------TGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           L       +K+                     +T  + NP          +Y V L+ I+
Sbjct: 274 LGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPK------HPYFYSVSLQGIS 327

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           +G + +        +D++G GG +VDSGTTFT +  + +  + +EF S++   R + RA 
Sbjct: 328 IGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRV--GRVHERAD 385

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGG-AEVTLPVENYFAVVGEG--------SA 419
             E  +G+ PC+ +   +T   P L LHF G  + VTLP  NYF    +G          
Sbjct: 386 RVEPSSGMSPCYYL--NQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKI 443

Query: 420 VCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            CL ++    + E  GG   ILGN+Q Q + V YDL N+R+GF ++ C
Sbjct: 444 GCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKC 491


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 138/473 (29%), Positives = 210/473 (44%), Gaps = 48/473 (10%)

Query: 10  LSFIFFFTLLSIFPS----------SITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRA 59
           + F+FFF L SI  S          + ++  FSLS   T+ S  +   L  ++ +SL   
Sbjct: 10  MHFLFFFLLSSIHLSVQLNHTTTTTNNSTSLFSLSFPLTSLSLSTNTALKMMLRNSLIAN 69

Query: 60  LHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
            +  N Q K+  ++       +S        + L  GTPPQ+ P +LDTGS L W     
Sbjct: 70  TNNNNTQLKSPPSSPYNY--KLSFKYSMALIVDLPIGTPPQVQPMVLDTGSQLSWI---- 123

Query: 120 HYQCKYCSSSKIP---SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
             QC   + +K P   SF P LSS+   L C +P C              D  L TS + 
Sbjct: 124 --QCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK---------PRIPDFTLPTSCDQ 172

Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNLPNRII-PNFLVGCSVLSSRQPAGIAGFGRGKT 234
            ++C  Y   Y  G   EG  + E       +  P  ++GC+   S  P GI G  RG+ 
Sbjct: 173 NRLC-HYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCAT-ESTDPRGILGMNRGRL 230

Query: 235 SLPSQLNLDKFSYCLLSH-KFDDTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAE 292
           S  SQ  + KFSYC+ +       T T S  L  N +S++ +    LT+      P    
Sbjct: 231 SFASQSKITKFSYCVPTRVTRPGYTPTGSFYLGHNPNSNTFRYIEMLTFARSQRMP---- 286

Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
            N   + Y V L+ I +GG+++ +       D  G+G T++DSG+ FT++  E ++ +  
Sbjct: 287 -NLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLVNEAYDKVRA 345

Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHFKGGAEVTLPVENYF 411
           E V  +          G  A      CFD    + G    ++   F+ G ++ +P E   
Sbjct: 346 EVVRAVGPRMKKGYVYGGVADM----CFDGNAIEIGRLIGDMVFEFEKGVQIVVPKERVL 401

Query: 412 AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           A V EG   C+ +  + +  G  S I+GNF  QN +VE+DL N+R+GF    C
Sbjct: 402 ATV-EGGVHCIGIA-NSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADC 452


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 117/392 (29%), Positives = 176/392 (44%), Gaps = 47/392 (11%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y + +  G+PP+    ++DTGS L+W  C     C  C     P F P  S+S   
Sbjct: 84  SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCA---PCLLCVEQPTPYFEPAKSTSYAS 140

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-- 201
           L C +  C+ ++            PL     C      Y   YG S  + G+  +ET   
Sbjct: 141 LPCSSAMCNALY-----------SPLCFQNACV-----YQAFYGDSASSAGVLANETFTF 184

Query: 202 --NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
             N     +P    GC  +++      +G+ GFGRG  SL SQL   +FSYCL S     
Sbjct: 185 GTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPA 244

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           T+R         +S +   +  +  TPF+ NP      A    Y++ +  I+V G  + +
Sbjct: 245 TSRLYFGAYATLNSTNTSSSGPVQSTPFIVNP------ALPTMYFLNMTGISVAGDLLPI 298

Query: 317 WHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
                 ++  DG GG I+DSGTT TF+A   +  +   FV+ +       RA    + T 
Sbjct: 299 DPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWV----GLPRANATPSDT- 353

Query: 376 LRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
              CF    P  +  + PE+ LHF  GA++ LP+ENY  + G    +CL ++   + S  
Sbjct: 354 FDTCFKWPPPPRRMVTLPEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSDDGS-- 410

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
              I+G+FQ QN+++ YDL N  L F    C 
Sbjct: 411 ---IIGSFQHQNFHMLYDLENSLLSFVPAPCN 439


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 125/393 (31%), Positives = 179/393 (45%), Gaps = 58/393 (14%)

Query: 89  YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           Y + L+ GTPP  +PF+   DTGS L W  C     CK C     P +   +SSS   + 
Sbjct: 93  YLMELAIGTPP--VPFVALADTGSDLTWTQCQ---PCKLCFPQDTPIYDTAVSSSFSPVP 147

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP--SYLVLYGSGL-TEGIALSETLNL 203
           C +  C                P+ +S+NCT       Y   YG G  + G+  +ETL  
Sbjct: 148 CASATC---------------LPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTF 192

Query: 204 PNR---IIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
           P      +     GC V +   S    G  G GRG  SL +QL + KFSYCL    F +T
Sbjct: 193 PGAPGVSVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCL--TDFFNT 250

Query: 258 TRTSSLILDNGSSHSDKKT-TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           +  S ++    +  +   T   +  TP V +P V        +YYV L  I++G  R+ +
Sbjct: 251 SLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVP------TWYYVSLEGISLGDARLPI 304

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
            +    L  DG+GG IVDSGTTFTF+    F  + D     + +      +L +      
Sbjct: 305 PNGTFDLRDDGSGGMIVDSGTTFTFLVESAFRVVVDHVAGVLRQPVVNASSLDS------ 358

Query: 377 RPCFDVP-GEKT-GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
            PCF    GE+   + P++ LHF GGA++ L  +NY +   E S+ CL +      +G P
Sbjct: 359 -PCFPAATGEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNI------AGSP 411

Query: 435 SI---ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           S    ILGNFQ QN  + +D+   +L F    C
Sbjct: 412 SADVSILGNFQQQNIQMLFDITVGQLSFMPTDC 444


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 132/398 (33%), Positives = 185/398 (46%), Gaps = 61/398 (15%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y + +  GTP +    ILDTGS L+W  C     C  C     P F P  S++ R 
Sbjct: 86  SDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCA---PCLLCVDQPTPYFDPARSATYRS 142

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL 203
           LGC +P C+ +++           PL   K C      Y   YG S  T G+  +ET   
Sbjct: 143 LGCASPACNALYY-----------PLCYQKVCV-----YQYFYGDSASTAGVLANETFTF 186

Query: 204 ---PNRI-IPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
                R+ +P    GC  L++   A   G+ GFGRG  SL SQL   +FSYCL S     
Sbjct: 187 GTNETRVSLPGISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPV 246

Query: 257 TTRT-----SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
            +R      ++L   N SS   + T      PFV NP      A    Y++ +  I+VGG
Sbjct: 247 PSRLYFGVYATLNSTNASSEPVQST------PFVVNP------ALPTMYFLNMTGISVGG 294

Query: 312 QRVRVWHKYLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALG 369
             + +      + D DG GGTI+DSGTT T++A   ++ +   F SQ+ +   N T A  
Sbjct: 295 YLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDA-- 352

Query: 370 AEALTGLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAV-VGEGSAVCLTVVT 426
               + L  CF    P  ++ + P+L LHF  GA+  LP++NY  V    G  +CL +  
Sbjct: 353 ----SVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNYMLVDPSTGGGLCLAM-- 405

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              AS     I+G++Q QN+ V YDL N  + F    C
Sbjct: 406 ---ASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 116/392 (29%), Positives = 174/392 (44%), Gaps = 47/392 (11%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y + +  G+PP+    ++DTGS L+W  C     C  C     P F P  S+S   
Sbjct: 81  SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCA---PCLLCVEQPTPYFEPAKSTSYAS 137

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-- 201
           L C +  C+ ++            PL     C      Y   YG S  + G+  +ET   
Sbjct: 138 LPCSSAMCNALY-----------SPLCFQNACV-----YQAFYGDSASSAGVLANETFTF 181

Query: 202 --NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
             N     +P    GC  +++      +G+ GFGRG  SL SQL   +FSYCL S     
Sbjct: 182 GTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPA 241

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           T+R         +S +   +  +  TPF+ NP      A    Y++ +  I+V G  + +
Sbjct: 242 TSRLYFGAYATLNSTNTSSSGPVQSTPFIVNP------ALPTMYFLNMTGISVAGDLLPI 295

Query: 317 WHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
                 ++  DG GG I+DSGTT TF+A   +  +   FV+ +   R       A     
Sbjct: 296 DPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRA-----NATPSDT 350

Query: 376 LRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
              CF    P  +  + PE+ LHF  GA++ LP+ENY  + G    +CL ++   + S  
Sbjct: 351 FDTCFKWPPPPRRMVTLPEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSDDGS-- 407

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
              I+G+FQ QN+++ YDL N  L F    C 
Sbjct: 408 ---IIGSFQHQNFHMLYDLENSLLSFVPAPCN 436


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 123/384 (32%), Positives = 176/384 (45%), Gaps = 49/384 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++++ GTP   +  I+DTGS L+W  C     C  C S   P F P+ SSS   L 
Sbjct: 94  GEYLMNVAIGTPASSLSAIMDTGSDLIWTQCE---PCTQCFSQPTPIFNPQDSSSFSTLP 150

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN 205
           C++             + C D P   S++C   C  Y   YG G  T+G   +ET     
Sbjct: 151 CES-------------QYCQDLP---SESCYNDC-QYTYGYGDGSSTQGYMATETFTFET 193

Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +PN   GC            AG+ G G G  SLPSQL + +FSYC+ S     ++  S
Sbjct: 194 SSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSSG---SSSPS 250

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L L + +S   + +   T      NP+         YYY+ L+ ITVGG  + +     
Sbjct: 251 TLALGSAASGVPEGSPSTTLIHSSLNPT---------YYYITLQGITVGGDNLGIPSSTF 301

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            L  DG GG I+DSGTT T++  + +  +A  F  Q+    N +     E+ +GL  CF 
Sbjct: 302 QLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI----NLSPV--DESSSGLSTCFQ 355

Query: 382 VPGE-KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           +P +  T   PE+ + F GG  + L  EN      EG  +CL + +  +   G S I GN
Sbjct: 356 LPSDGSTVQVPEISMQFDGGV-LNLGEENVLISPAEG-VICLAMGSSSQQ--GIS-IFGN 410

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q Q   V YDL+N  + F    C
Sbjct: 411 IQQQETQVLYDLQNLAVSFVPTQC 434


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 126/398 (31%), Positives = 184/398 (46%), Gaps = 51/398 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+    ILDTGS L W  C   Y C + +      + PK S+S + + 
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGM---FYDPKTSASFKNIT 214

Query: 147 CQNPKCSWIHHES--IQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TL 201
           C +P+CS I      +QC   N           Q CP Y   YG  S  T   A+   T+
Sbjct: 215 CNDPRCSLISSPDPPVQCESDN-----------QSCP-YFYWYGDRSNTTGDFAVETFTV 262

Query: 202 NL-------PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYC 248
           NL           + N + GC   +       +G+ G GRG  S  SQL       FSYC
Sbjct: 263 NLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYC 322

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           L+     +T  +S LI   G        T L +T FVN     + N+   +YY+ ++ I 
Sbjct: 323 LVDRN-SNTNVSSKLIF--GEDKDLLNHTNLNFTSFVN----GKENSVETFYYIQIKSIL 375

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VGG+ + +  +   +  DG+GGTI+DSGTT ++ A   +E + ++F  +M +N    R  
Sbjct: 376 VGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDF 435

Query: 369 GAEALTGLRPCFDVPG--EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
                  L PCF+V G  E     PEL + F  G     P EN F  + E   VCL ++ 
Sbjct: 436 PV-----LDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSE-DLVCLAILG 489

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +++     I+GN+Q QN+++ YD +  RLGF    C
Sbjct: 490 TPKSTFS---IIGNYQQQNFHILYDTKRSRLGFTPTKC 524


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 132/398 (33%), Positives = 185/398 (46%), Gaps = 61/398 (15%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y + +  GTP +    ILDTGS L+W  C     C  C     P F P  S++ R 
Sbjct: 86  SDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCA---PCLLCVDQPTPYFDPARSATYRS 142

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL 203
           LGC +P C+ +++           PL   K C      Y   YG S  T G+  +ET   
Sbjct: 143 LGCASPACNALYY-----------PLCYQKVCV-----YQYFYGDSASTAGVLANETFTF 186

Query: 204 ---PNRI-IPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
                R+ +P    GC  L++   A   G+ GFGRG  SL SQL   +FSYCL S     
Sbjct: 187 GTNETRVSLPGISFGCGNLNAGLLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPV 246

Query: 257 TTRT-----SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
            +R      ++L   N SS   + T      PFV NP      A    Y++ +  I+VGG
Sbjct: 247 PSRLYFGVYATLNSTNASSEPVQST------PFVVNP------ALPTMYFLNMTGISVGG 294

Query: 312 QRVRVWHKYLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALG 369
             + +      + D DG GGTI+DSGTT T++A   ++ +   F SQ+ +   N T A  
Sbjct: 295 YLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDA-- 352

Query: 370 AEALTGLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAV-VGEGSAVCLTVVT 426
               + L  CF    P  ++ + P+L LHF  GA+  LP++NY  V    G  +CL +  
Sbjct: 353 ----SVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNYMLVDPSTGGGLCLAM-- 405

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              AS     I+G++Q QN+ V YDL N  + F    C
Sbjct: 406 ---ASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 124/405 (30%), Positives = 190/405 (46%), Gaps = 58/405 (14%)

Query: 86  YGGYSISLS---FGTPPQIIPFILDTGSHLVWFPCTNHYQCK-YCSSSKIPSFIPKLSSS 141
           +GG S  ++    G PPQ    I+DTGS+L+W  C+   +C+  C    +P + P  S +
Sbjct: 65  WGGQSQYIAEYLIGDPPQRAEAIIDTGSNLIWTQCS---RCRPTCFRQNLPYYDPSRSRA 121

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
           +R +GC +  C+       QC       L+ +K C     + +  YG+G   G   +E L
Sbjct: 122 ARAVGCNDAACAL--GSETQC-------LSDNKTC-----AVVTGYGAGNIAGTLATENL 167

Query: 202 NLPNRIIPNFLVGCSVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
              +  + + + GC V++   P      +GI G GRGK SLPSQL   +FSYCL  + F+
Sbjct: 168 TFQSETV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDTRFSYCLTPY-FE 225

Query: 256 DTTRTSSLI------LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           DT   S ++      L NGS+ S    T +T  PFV +PS    + FS +YY+ L  IT 
Sbjct: 226 DTIEPSHMVVGASAGLINGSASS----TPVTTVPFVRSPS---DDPFSTFYYLPLTGITA 278

Query: 310 GGQRVRVWHKYLTLDRDGNG---GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           G  ++ V      L +   G   GT +DSG   T +    ++ L  E   Q+       +
Sbjct: 279 GKVKLAVPSAAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQL--GAALVQ 336

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGA----EVTLPVENYFAVVGEGSAVCL 422
            L     TG   C  +  +     P L LHF GG+    ++ +P  NY+A V   +A C+
Sbjct: 337 PLAGT--TGFDLCVALK-DAERLVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATA-CM 392

Query: 423 TVVTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            V +  +    P   + ++GN+  QN +V YDL    L F+   C
Sbjct: 393 VVFSSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLSFQPADC 437


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 118/394 (29%), Positives = 178/394 (45%), Gaps = 44/394 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y I +  GTPP+ +  ILDTGS L W  C     C  C     P + P  SSS R + 
Sbjct: 168 GEYFIDMFVGTPPKHVWLILDTGSDLSWIQCD---PCYDCFEQNGPHYNPNESSSYRNIS 224

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY--GSGLTEGIALSE-TLNL 203
           C +P+C  +          + +PL   K   Q CP Y   Y  GS  T   AL   T+NL
Sbjct: 225 CYDPRCQLV---------SSPDPLQHCKTENQTCP-YFYDYADGSNTTGDFALETFTVNL 274

Query: 204 --PN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
             PN     + + + + GC   +        G+ G GRG  S PSQL       FSYCL 
Sbjct: 275 TWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCL- 333

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
           +  F +T+ +S LI   G          L +T  +      E      +YY+ ++ I VG
Sbjct: 334 TDLFSNTSVSSKLIF--GEDKELLNHHNLNFTKLL----AGEETPDDTFYYLQIKSIVVG 387

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           G+ + +  K      +G GGTI+DSG+T TF     ++ + + F  ++       + + A
Sbjct: 388 GEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIK-----LQQIAA 442

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
           +    + PC++V G      P+  +HF  GA    P ENYF        +CL ++  +  
Sbjct: 443 DDFI-MSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAIL--KTP 499

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +     I+GN   QN+++ YD++  RLG+  + C
Sbjct: 500 NHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 533


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 122/389 (31%), Positives = 170/389 (43%), Gaps = 78/389 (20%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +   LDTGS L+W  C     C  C    +P F P  SS+  L  C 
Sbjct: 89  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 145

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +  C  +   S+   D                       G+G +               +
Sbjct: 146 STLCQGLPVASLPRSD------------------KFTFVGAGAS---------------V 172

Query: 209 PNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT---- 260
           P    GC + ++        GIAGFGRG  SLPSQL +  FS+C        TT T    
Sbjct: 173 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCF-------TTITGAIP 225

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           S+++LD  +         +  TP + NP      A   +YY+ L+ ITVG  R+ V    
Sbjct: 226 STVLLDLPADLFSNGQGAVQTTPLIQNP------ANPTFYYLSLKGITVGSTRLPVPESE 279

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLR 377
             L ++G GGTI+DSGT  T +   ++  + D F +Q+   V + N T            
Sbjct: 280 FAL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF-------- 330

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAV-CLTVVTDREASGGPS 435
            C   P       P+L LHF+ GA + LP ENY F V   GS++ CL ++      GG  
Sbjct: 331 -CLSAPLRAKPYVPKLVLHFE-GATMDLPRENYVFEVEDAGSSILCLAII-----EGGEV 383

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +GNFQ QN +V YDL+N +L F    C
Sbjct: 384 TTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 125/390 (32%), Positives = 176/390 (45%), Gaps = 60/390 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++  FGTP +    I+DTGS + W  C     C  C S   P F P+ SSS + L 
Sbjct: 136 GNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCK---PCSDCYSQVDPIFEPQQSSSYKHLS 192

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C+                L T  +C      Y + YG G  ++G    ETL L +
Sbjct: 193 CLSSACT---------------ELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGS 237

Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
              P+F  GC   ++   +  AG+ G GR   S PSQ       +FSYCL    F  +T 
Sbjct: 238 DSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCL--PDFVSSTS 295

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           T S  +  GS  +       T+ P V+N +      +  +Y+VGL  I+VGG+R+ +   
Sbjct: 296 TGSFSVGQGSIPATA-----TFVPLVSNSN------YPSFYFVGLNGISVGGERLSIPPA 344

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGLRP 378
            L     G GGTIVDSGT  T + P+ ++ L   F       R+ TR L  A+  + L  
Sbjct: 345 VL-----GRGGTIVDSGTVITRLVPQAYDALKTSF-------RSKTRNLPSAKPFSILDT 392

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSI- 436
           C+D+        P +  HF+  A+V +  V   F +  +GS VCL       AS   SI 
Sbjct: 393 CYDLSSYSQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQVCLAF-----ASASQSIS 447

Query: 437 --ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             I+GNFQ Q   V +D    R+GF    C
Sbjct: 448 TNIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 127/400 (31%), Positives = 169/400 (42%), Gaps = 52/400 (13%)

Query: 81  ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPK 137
           ++ +  G Y + LS GTPP   P I+DTGS L W    PCT       C +   P + P 
Sbjct: 88  LAENGAGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTA-----CFAQPTPLYDPA 142

Query: 138 LSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
            SS+   L C +P C  +       R CN      +  C      Y   Y  G T G   
Sbjct: 143 RSSTFSKLPCASPLCQALPSAF---RACN------ATGCV-----YDYRYAVGFTAGYLA 188

Query: 198 SETLNL--------PNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFS 246
           ++TL +         +        GCS  +       +GI G GR   SL SQ+ + +FS
Sbjct: 189 ADTLAIGDGDGDGDASSSFAGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGVGRFS 248

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           YCL S   D     S ++    ++ +  K   +  T  + NP  A R A   YYYV L  
Sbjct: 249 YCLRS---DADAGASPILFGALANVTGDK---VQSTALLRNPVAARRRA--PYYYVNLTG 300

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I VG   + V          G GG IVDSGTTFT++A   +  L   F+SQ       TR
Sbjct: 301 IAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAG--LLTR 358

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVV 425
             GA+    L  CF+  G      P L   F GGAE  +P ++YF  V EG  V CL V+
Sbjct: 359 VSGAQFDFDL--CFEA-GAADTPVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVL 415

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             R  S     ++GN    + +V YDL      F    C 
Sbjct: 416 PTRGVS-----VIGNVMQMDLHVLYDLDGATFSFAPADCA 450


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 120/394 (30%), Positives = 183/394 (46%), Gaps = 46/394 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+    ILDTGS L W  C     C  C     P + PK SSS R + 
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCV---PCIACFEQSGPYYDPKDSSSFRNIS 249

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG--LTEGIALSE-TLNL 203
           C +P+C  +          + +P    K   Q CP Y   YG G   T   AL   T+NL
Sbjct: 250 CHDPRCQLV---------SSPDPPNPCKAENQSCP-YFYWYGDGSNTTGDFALETFTVNL 299

Query: 204 --PN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
             PN     + + N + GC   +       AG+ G G+G  S  SQ+       FSYCL+
Sbjct: 300 TTPNGKSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLV 359

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                + + +S LI   G          L +T F       +  +   +YYV +  + V 
Sbjct: 360 DRN-SNASVSSKLIF--GEDKELLSHPNLNFTSFGG----GKDGSVDTFYYVQINSVMVD 412

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
            + +++  +   L  +G GGTI+DSGTT T+ A   +E + + FV ++   + Y      
Sbjct: 413 DEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKI---KGYEL---V 466

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
           E L  L+PC++V G +    P+  + F  GA    PVENYF  + +   VCL ++ +  +
Sbjct: 467 EGLPPLKPCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQI-DPDVVCLAILGNPRS 525

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +     I+GN+Q QN+++ YD++  RLG+    C
Sbjct: 526 ALS---IIGNYQQQNFHILYDMKKSRLGYAPMKC 556


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 129/425 (30%), Positives = 185/425 (43%), Gaps = 62/425 (14%)

Query: 64  NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
           NP  K+   +  +T +       G Y + +  GTPPQ +  + DTGS LVW  C+    C
Sbjct: 70  NPTLKSPLISGASTGS-------GQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNC 122

Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
            +   S   +F+P+ SSS     C +P C  + H       CN   L +       C  +
Sbjct: 123 SHHPPSS--AFLPRHSSSFSPFHCFDPHCRLLPHAPHHL--CNHTRLHSP------C-RF 171

Query: 184 LVLYGSG-LTEGIALSETLNLPN-----------------RIIPNFLVGCSVLSSRQPAG 225
           L  Y  G L+ G    ET  L +                 RI    + G     +R   G
Sbjct: 172 LYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGAR---G 228

Query: 226 IAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKT--TGLT 280
           + G GRG  S  SQL     +KFSYCL+ +        +S ++  G  HS   T  T ++
Sbjct: 229 VMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPP--TSFLMIGGGLHSLPLTNATKIS 286

Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
           YTP   NP          +YY+ +  IT+ G ++ +      +D  GNGGT+VDSGTT T
Sbjct: 287 YTPLQINPLSP------TFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLT 340

Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE-KTGSFPELKLHFKG 399
           ++    +E +    V + VK  N      AE   G   C +  GE +  S P L+    G
Sbjct: 341 YLTKTAYEEVLKS-VRRRVKLPN-----AAELTPGFDLCVNASGESRRPSLPRLRFRLGG 394

Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
           GA    P  NYF    EG  +CL  +   E+  G S+I GN   Q + +E+D    RLGF
Sbjct: 395 GAVFAPPPRNYFLETEEG-VMCL-AIRAVESGNGFSVI-GNLMQQGFLLEFDKEESRLGF 451

Query: 460 KQQLC 464
            ++ C
Sbjct: 452 TRRGC 456


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 122/397 (30%), Positives = 177/397 (44%), Gaps = 62/397 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y      GTPPQ +   +D  +   W PC+    C   +SS  PSF P  SS+ R + C 
Sbjct: 100 YVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASS--PSFDPTQSSTYRPVRCG 157

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN--- 205
            P+C+ +   +  C      P     +C     ++ + Y S     +   + L+L +   
Sbjct: 158 APQCAQVPPATPSC------PAGPGASC-----AFNLSYASSTLHAVLGQDALSLSDSNG 206

Query: 206 RIIPN--FLVGCSVL-----SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
             +P+  +  GC  +      S  P G+ GFGRG  S  SQ        FSYCL S+K  
Sbjct: 207 AAVPDDHYTFGCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSS 266

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           + + T  L    G +   ++   +  TP ++NP    R +    YYV +  + V G+ V 
Sbjct: 267 NFSGTLRL----GPAGQPRR---IKTTPLLSNP---HRPSL---YYVAMVGVRVNGKAVP 313

Query: 316 VWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
           +    L LD   G GGTIVD+GT FT ++P  +  L + F       R    A  A AL 
Sbjct: 314 IPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAF-------RRGVSAPAAPALG 366

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
           G   C+ V G K  S P +   F GGA VTLP EN       G   CL +      + GP
Sbjct: 367 GFDTCYYVNGTK--SVPAVAFVFAGGARVTLPEENVVISSTSGGVACLAM------AAGP 418

Query: 435 SI-------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           S        +L + Q QN+ V +D+ N R+GF ++LC
Sbjct: 419 SDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELC 455


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  144 bits (364), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 128/394 (32%), Positives = 174/394 (44%), Gaps = 57/394 (14%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           S   G Y   L  GTPP+ +  +LDTGS +VW  C+    C+ C S   P F P  S S 
Sbjct: 104 SQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCS---PCRKCYSQSDPIFNPYKSKSF 160

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C +P C  +       R            C      Y V YG G  T G   +ETL
Sbjct: 161 AGIPCSSPLCRRLDSSGCSTR---------RHTCL-----YQVSYGDGSFTTGDFATETL 206

Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNL---DKFSYCLLS 251
                 I    +GC         G+        G GRG+ S PSQ  +    KFSYCL+ 
Sbjct: 207 TFRGNKIAKVALGC----GHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVD 262

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
                +++ SS++  + +     +     +TP + NP +        +YYVGL  I+VGG
Sbjct: 263 RS--ASSKPSSMVFGDAAISRLAR-----FTPLIRNPKL------DTFYYVGLIGISVGG 309

Query: 312 QRVR-VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
            RVR V      LD  GNGG I+DSGT+ T +    +  L D F    V  R+  R  G 
Sbjct: 310 VRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAF---RVGARHLKR--GP 364

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
           E  +    C+D+ G+ +   P + LHF+ GA++ LP  NY   V E  + C         
Sbjct: 365 E-FSLFDTCYDLSGQSSVKVPTVVLHFR-GADMALPATNYLIPVDENGSFCFAFA---GT 419

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             G SII GN Q Q + V YDL   R+GF  + C
Sbjct: 420 ISGLSII-GNIQQQGFRVVYDLAGSRIGFAPRGC 452


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 121/396 (30%), Positives = 194/396 (48%), Gaps = 52/396 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G PP+    I+DTGS L W  C     CK C     P F P  S+S +++ 
Sbjct: 85  GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCK---PCKACFDQSGPVFDPSQSTSFKIIP 141

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-- 203
           C    C  + H+  +CRD       +SK   + C  Y   YG S  T G    E+L++  
Sbjct: 142 CNAAACDLVVHD--ECRD------NSSKTSPKTC-KYFYWYGDSSRTSGDLALESLSVSL 192

Query: 204 ---PNRI-IPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
              P+ + I + ++GC   +    +   G+ G G+G  S PSQL        FSYCL+  
Sbjct: 193 SDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVD- 251

Query: 253 KFDDTTRTSSLILDNG---SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           + ++ + +S++    G   S H D+    + +TPFV        N+   +YY+G++ I +
Sbjct: 252 RTNNLSVSSAISFGAGFALSRHFDQ----MKFTPFVRT-----NNSVETFYYLGIQGIKI 302

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
             + + +  +   +  +G+GGTI+DSGTT T++  + +  +   F++++    +Y R   
Sbjct: 303 DQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI----SYPR--- 355

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDR 428
           A+    L  C++  G     FP L + F+ GAE+ LP ENYF       A  CL ++   
Sbjct: 356 ADPFDILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAIL--- 412

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             + G SII GNFQ QN +  YD+++ RLGF    C
Sbjct: 413 -PTDGMSII-GNFQQQNIHFLYDVQHARLGFANTDC 446


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 123/392 (31%), Positives = 180/392 (45%), Gaps = 43/392 (10%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCT----NHYQCKYCSSSKIPSFIPKLSSSSR 143
           G+S+++  GTPPQ    I+DTGS L+W  C+            S  + P + P+ SSS  
Sbjct: 83  GHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFA 142

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE--TL 201
            L C +  C         C        A +  C      Y  LYGS    G+  SE  T 
Sbjct: 143 YLPCSDRLCQEGQFSYKNC--------ARNNRCM-----YDELYGSAEAGGVLASETFTF 189

Query: 202 NLPNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
            +  ++      GC  LS+      +G+ G   G  SL SQL++ +FSYCL         
Sbjct: 190 GVNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYCLTPFA---ER 246

Query: 259 RTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           +TS L+    +     +TTG +  T  + NP++      + YYYV L  +++G +R+ V 
Sbjct: 247 KTSPLLFGAMADLRRYRTTGTVQTTSILRNPAME-----TAYYYVPLVGLSLGTKRLDVP 301

Query: 318 HKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTG 375
              L + + DG+GGTIVDSG+T +++    F  +    V  + +   N T     E    
Sbjct: 302 ATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTD----EDYDD 357

Query: 376 LRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
              CF +P     +    P L LHF GGA +TLP +NYF     G  +CL V T  +  G
Sbjct: 358 YELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAG-LMCLAVGTSPDGFG 416

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               I+GN Q QN +V +D+RNQ+  F    C
Sbjct: 417 --VSIIGNVQQQNMHVLFDVRNQKFSFAPTKC 446


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 131/426 (30%), Positives = 196/426 (46%), Gaps = 55/426 (12%)

Query: 64  NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
           +P+   +     T  + ++  S G Y + +  GTPP+    I+DTGS L W  C     C
Sbjct: 127 SPRRALSERMVATVESGVAVGS-GEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCA---PC 182

Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ-------CRDCNDEPLATSKNC 176
             C   + P F P  SSS R + C + +C  +             CR   ++P       
Sbjct: 183 LDCFEQRGPVFDPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDP------- 235

Query: 177 TQICPSYLVLYG--SGLTEGIAL-SETLNL----PNRIIPNFLVGCSVLSS---RQPAGI 226
              CP Y   YG  S  T  +AL S T+NL     +R +   + GC   +       AG+
Sbjct: 236 ---CP-YYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGL 291

Query: 227 AGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTP 283
            G GRG  S  SQL       FSYCL+ H  D  ++   +  ++  + +      L YT 
Sbjct: 292 LGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVGSKV--VFGEDDDALALAAHPQLKYTA 349

Query: 284 FVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMA 343
           F    + +  +    +YYV L+ + VGG+ + +      + +DG+GGTI+DSGTT ++  
Sbjct: 350 FAP--ASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFV 407

Query: 344 PELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEV 403
              ++ +   F+ +M  +R+Y           L PC++V G +    PEL L F  GA  
Sbjct: 408 EPAYQVIRHAFMDRM--SRSYPL---VPEFPVLSPCYNVSGVERPEVPELSLLFADGAVW 462

Query: 404 TLPVENYFAVVGE--GSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEYDLRNQRLG 458
             P ENYF  +    GS +CL V+      G P     I+GNFQ QN++V YDL+N RLG
Sbjct: 463 DFPAENYFIRLDPDGGSIMCLAVL------GTPRTGMSIIGNFQQQNFHVVYDLQNNRLG 516

Query: 459 FKQQLC 464
           F  + C
Sbjct: 517 FAPRRC 522


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 121/396 (30%), Positives = 180/396 (45%), Gaps = 50/396 (12%)

Query: 81  ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
           IS +++ G+S+++  GTPPQ    ILD GS L+W  C+        +    P F    SS
Sbjct: 99  ISPYAHQGHSLTVGVGTPPQPSKVILDLGSDLLWTQCS---LVGPTAKQLEPVFDAARSS 155

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           S  +L C +  C              +    T+K CT    +Y   YG     G+  +ET
Sbjct: 156 SFSVLPCDSKLC--------------EAGTFTNKTCTDRKCAYENDYGIMTATGVLATET 201

Query: 201 LNLPNR--IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
                   +  N   GC  L++    + +GI G   G  S+  QL + KFSYCL    F 
Sbjct: 202 FTFGAHHGVSANLTFGCGKLANGTIAEASGILGLSPGPLSMLKQLAITKFSYCLTP--FA 259

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYT-PFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
           D  +TS ++    +     KTTG   T P + NP         +YYYV +  ++VG +R+
Sbjct: 260 DR-KTSPVMFGAMADLGKYKTTGKVQTIPLLKNP------VEDIYYYVPMVGMSVGSKRL 312

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD---EFVSQMVKNRNYTRALGAE 371
            V  + L +  DG GGT++DS TT  ++    F  L     E +   V NR+        
Sbjct: 313 DVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRS-------- 364

Query: 372 ALTGLRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
            +     CF++P     +    P L LHF G AE++LP +NYF     G  +CL V+   
Sbjct: 365 -VDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQEPSPG-MMCLAVM-QA 421

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              G P++I GN Q QN +V YD+ N++  +    C
Sbjct: 422 PFEGAPNVI-GNVQQQNMHVLYDVGNRKFSYAPTKC 456


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 119/394 (30%), Positives = 182/394 (46%), Gaps = 46/394 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+    ILDTGS L W  C     C  C     P + PK SSS R + 
Sbjct: 195 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCV---PCIACFEQSGPYYDPKDSSSFRNIS 251

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG--LTEGIALSE-TLNL 203
           C +P+C  +            +P    K   Q CP Y   YG G   T   AL   T+NL
Sbjct: 252 CHDPRCQLVSAP---------DPPKPCKAENQSCP-YFYWYGDGSNTTGDFALETFTVNL 301

Query: 204 --PN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
             PN     + + N + GC   +       AG+ G G+G  S  SQ+       FSYCL+
Sbjct: 302 TTPNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLV 361

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                + + +S LI   G          L +T F       +  +   +YYV ++ + V 
Sbjct: 362 DRN-SNASVSSKLIF--GEDKELLSHPNLNFTSFGG----GKDGSVDTFYYVQIKSVMVD 414

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
            + +++  +   L  +G GGTI+DSGTT T+ A   +E + + FV ++   + Y      
Sbjct: 415 DEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKI---KGYQL---V 468

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
           E L  L+PC++V G +    P+  + F   A    PVENYF  + +   VCL ++ +  +
Sbjct: 469 EGLPPLKPCYNVSGIEKMELPDFGILFADEAVWNFPVENYFIWI-DPEVVCLAILGNPRS 527

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +     I+GN+Q QN+++ YD++  RLG+    C
Sbjct: 528 ALS---IIGNYQQQNFHILYDMKKSRLGYAPMKC 558


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 117/370 (31%), Positives = 172/370 (46%), Gaps = 49/370 (13%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           I+DTGS L W  C     CK C + + P F P  S S R + C +P C  +   +     
Sbjct: 149 IVDTGSDLSWVQCQ---PCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGV 205

Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSR- 221
           C   P + +         Y+V YG G  T G   +E L+L N   + NF+ GC   +   
Sbjct: 206 CGSNPPSCN---------YVVNYGDGSYTRGELGTEHLDLGNSTAVNNFIFGCGRNNQGL 256

Query: 222 --QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKT 276
               +G+ G GR   SL SQ +      FSYCL      +T  + SL++  G+S   K T
Sbjct: 257 FGGASGLVGLGRSSLSLISQTSAMFGGVFSYCL---PITETEASGSLVM-GGNSSVYKNT 312

Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSG 336
           T ++YT  + NP +        +Y++ L  ITVG   V+           G  G ++DSG
Sbjct: 313 TPISYTRMIPNPQLP-------FYFLNLTGITVGSVAVQA-------PSFGKDGMMIDSG 358

Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLH 396
           T  T + P +++ L DEFV Q      ++    A A   L  CF++ G +    P +K+H
Sbjct: 359 TVITRLPPSIYQALKDEFVKQ------FSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMH 412

Query: 397 FKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYVEYDLRN 454
           F+G AE+ + V   F  V  + S VCL + +   E   G   I+GN+Q +N  V YD + 
Sbjct: 413 FEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVG---IIGNYQQKNQRVIYDTKG 469

Query: 455 QRLGFKQQLC 464
             LGF  + C
Sbjct: 470 SMLGFAAEAC 479


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 126/386 (32%), Positives = 173/386 (44%), Gaps = 49/386 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTPP+ +  +LDTGS +VW  C     CK C +   P F P+ S S   + 
Sbjct: 124 GEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCA---PCKRCYAQSDPVFDPRKSRSFASIA 180

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C++P C       +    CN +         Q C  Y V YG G  T G   +ETL    
Sbjct: 181 CRSPLC-----HRLDSPGCNTQ--------KQTC-MYQVSYGDGSFTFGDFSTETLTFRR 226

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
             +    +GC   +       AG+ G GRG+ S PSQ       KFSYCL+      +++
Sbjct: 227 TRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRS--ASSK 284

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWH 318
            SS++  + +     +     +TP V+NP +        +YYV L  I+VGG RV  +  
Sbjct: 285 PSSMVFGDSAVSRTAR-----FTPLVSNPKL------DTFYYVELLGISVGGTRVPGITA 333

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
               LD+ GNGG I+DSGT+ T +    +    D F        N  R   A   +    
Sbjct: 334 SLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAF---RAGASNLKR---APQFSLFDT 387

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CFD+ G+     P + LHF+ GA+V+LP  NY   V      CL         GG SII 
Sbjct: 388 CFDLSGKTEVKVPTVVLHFR-GADVSLPASNYLIPVDTSGNFCLAFA---GTMGGLSII- 442

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q Q + V YDL   R+GF    C
Sbjct: 443 GNIQQQGFRVVYDLAGSRVGFAPHGC 468


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 121/396 (30%), Positives = 194/396 (48%), Gaps = 52/396 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G PP+    I+DTGS L W  C     CK C     P F P  S+S +++ 
Sbjct: 169 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCK---PCKACFDQSGPVFDPSQSTSFKIIP 225

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-- 203
           C    C  + H+  +CRD       +SK   + C  Y   YG S  T G    E+L++  
Sbjct: 226 CNAAACDLVVHD--ECRD------NSSKTSPKTC-KYFYWYGDSSRTSGDLALESLSVSL 276

Query: 204 ---PNRI-IPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
              P+ + I + ++GC   +    +   G+ G G+G  S PSQL        FSYCL+  
Sbjct: 277 SDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVD- 335

Query: 253 KFDDTTRTSSLILDNG---SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           + ++ + +S++    G   S H D+    + +TPFV        N+   +YY+G++ I +
Sbjct: 336 RTNNLSVSSAISFGAGFALSRHFDQ----MRFTPFVRT-----NNSVETFYYLGIQGIKI 386

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
             + + +  +   +  +G+GGTI+DSGTT T++  + +  +   F++++    +Y R   
Sbjct: 387 DQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI----SYPR--- 439

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDR 428
           A+    L  C++  G     FP L + F+ GAE+ LP ENYF       A  CL ++   
Sbjct: 440 ADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAIL--- 496

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             + G SII GNFQ QN +  YD+++ RLGF    C
Sbjct: 497 -PTDGMSII-GNFQQQNIHFLYDVQHARLGFANTDC 530


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 126/388 (32%), Positives = 178/388 (45%), Gaps = 51/388 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I L FGTPPQ    +LDTGS++ W PC     C  CSS + P F P  SS+   L C 
Sbjct: 124 YIIKLGFGTPPQSFYTVLDTGSNIAWIPCN---PCSGCSSKQQP-FEPSKSSTYNYLTCA 179

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRI 207
           + +C  +       R C       S NC     S    YG     + I  SETL++ ++ 
Sbjct: 180 SQQCQLL-------RVCTKSD--NSVNC-----SLTQRYGDQSEVDEILSSETLSVGSQQ 225

Query: 208 IPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRT 260
           + NF+ GCS     L  R P+ + GFGR   S  SQ   L    FSYCL S     +  T
Sbjct: 226 VENFVFGCSNAARGLIQRTPS-LVGFGRNPLSFVSQTATLYDSTFSYCLPS--LFSSAFT 282

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            SL+L            GL +TP ++N      + +  +YYVGL  I+VG + V +    
Sbjct: 283 GSLLL----GKEALSAQGLKFTPLLSN------SRYPSFYYVGLNGISVGEELVSIPAGT 332

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           L+LD     GTI+DSGT  T +    +  + D F SQ+    N T A   +       C+
Sbjct: 333 LSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQL---SNLTMASPTDLFD---TCY 386

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGSAVCLTVVTDREASGGPSII-- 437
           + P      FP + LHF    ++TLP++N  +    +GS +CL         GG  ++  
Sbjct: 387 NRPSGDV-EFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAF--GLPPGGGDDVLST 443

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            GN+Q Q   + +D+   RLG   + C 
Sbjct: 444 FGNYQQQKLRIVHDVAESRLGIASENCD 471


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 129/411 (31%), Positives = 175/411 (42%), Gaps = 49/411 (11%)

Query: 67  TKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
           T+ T +       +  +   G Y   +  GTP      +LDTGS +VW  C     C+ C
Sbjct: 120 TRRTGSGVVAPVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCA---PCRRC 176

Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
                  F P+ S S   +GC  P C  +       R          K C      Y V 
Sbjct: 177 YDQSGQVFDPRRSRSYGAVGCSAPLCRRLDSGGCDLR---------RKACL-----YQVA 222

Query: 187 YGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLN 241
           YG G +T G   +ETL       +    +GC   +       AG+ G GRG  S P+Q++
Sbjct: 223 YGDGSVTAGDFATETLTFAGGARVARIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQIS 282

Query: 242 LD---KFSYCLLSH--KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
                 FSYCL+      +  + +S++   +G+  S   T   ++TP V NP +      
Sbjct: 283 RRYGRSFSYCLVDRTSSANPASHSSTVTFGSGAVGS---TVAASFTPMVKNPRM------ 333

Query: 297 SVYYYVGLRRITVGGQRVR-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
             +YYV L  I+VGG RV  V    L LD   G GG IVDSGT+ T +A   +  L D F
Sbjct: 334 ETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAF 393

Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
            +     R     L     +    C+D+ G K    P + +HF GGAE  LP ENY   V
Sbjct: 394 RAAAAGLR-----LSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPV 448

Query: 415 GEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 C     TD    GG SII GN Q Q + V +D   QR+GF  + C
Sbjct: 449 DSKGTFCFAFAGTD----GGVSII-GNIQQQGFRVVFDGDGQRVGFVPKGC 494


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 127/390 (32%), Positives = 170/390 (43%), Gaps = 57/390 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   L  GTPP+    +LDTGS ++W  C     C  C     P F P  SS+ R + 
Sbjct: 151 GEYFTRLGVGTPPRYTYMVLDTGSDIMWIQC---LPCAKCYGQTDPLFNPAASSTYRKVP 207

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P C  +      CR+         + C      Y V YG G  T G   +ETL    
Sbjct: 208 CATPLCKKLDISG--CRN--------KRYC-----EYQVSYGDGSFTVGDFSTETLTFRG 252

Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
           ++I    +GC         G+        G GRG  S PSQ       +FSYCL+     
Sbjct: 253 QVIRRVALGCG----HDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDRSAS 308

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
            T   SSLI   G +   K      +TP ++NP +        +YYV L  I+VGG+R+ 
Sbjct: 309 GTA--SSLIF--GKAAIPKSAI---FTPLLSNPKL------DTFYYVELVGISVGGRRLT 355

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            +      +D  GNGG I+DSGT+ T +    +  + D F    V   N   A G    +
Sbjct: 356 SIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAF---RVGTGNLKSAGG---FS 409

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
               C+D+ G KT   P L  HF+GGA ++LP  NY   V   +  C     +   +GG 
Sbjct: 410 LFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGN---TGGL 466

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           SII GN Q Q Y V +D    R+GFK   C
Sbjct: 467 SII-GNIQQQGYRVVFDSLANRVGFKAGSC 495


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 127/433 (29%), Positives = 196/433 (45%), Gaps = 37/433 (8%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIK-NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGT 97
           NP+ DS      L S  L+ A  +  NP+ +T +++++    +   +S     ++L  GT
Sbjct: 38  NPTTDSLSLSFPLTSLPLSTAKPLNTNPKLRTLSSSSSYNIKSSFKYSMA-LVVTLPIGT 96

Query: 98  PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
           PPQ    +LDTGS L W  C N        +    SF P LSSS  +L C +P C     
Sbjct: 97  PPQPQQMVLDTGSQLSWIQCHNK-------TPPTASFDPSLSSSFYVLPCTHPLCK---- 145

Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGC 215
                    D  L T+ +  ++C  Y   Y  G   EG  + E L   P++  P  ++GC
Sbjct: 146 -----PRVPDFTLPTTCDQNRLC-HYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGC 199

Query: 216 SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR--TSSLIL-DNGSSHS 272
           S   SR   GI G   G+ S P Q  + KFSYC+ + +  +     T S  L +N +S  
Sbjct: 200 SS-ESRDARGILGMNLGRLSFPFQAKVTKFSYCVPTRQPANNNNFPTGSFYLGNNPNSAR 258

Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
            +  + LT+      P     N   + Y V ++ I +GG+++ +       +  G+G T+
Sbjct: 259 FRYVSMLTFPQSQRMP-----NLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTM 313

Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS-FP 391
           VDSG+ FTF+    ++ + +E +  +          G  A      CFD    + G    
Sbjct: 314 VDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADM----CFDGNAMEIGRLLG 369

Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
           ++   F+ G E+ +P E   A VG G  V    +   E  G  S I+GNF  QN +VE+D
Sbjct: 370 DVAFEFEKGVEIVVPKERVLADVGGG--VHCVGIGRSERLGAASNIIGNFHQQNLWVEFD 427

Query: 452 LRNQRLGFKQQLC 464
           L N+R+GF    C
Sbjct: 428 LANRRIGFGVADC 440


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 130/394 (32%), Positives = 184/394 (46%), Gaps = 50/394 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+    I+DTGS L W  C     C  C   + P F P  S+S R + 
Sbjct: 148 GEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCA---PCLDCFDQRGPVFDPMASTSYRNVT 204

Query: 147 CQNPKCSWIHHESI--QCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TL 201
           C + +C  +   +    CR    +P          CP Y   YG  S  T  +AL   T+
Sbjct: 205 CGDTRCGLVSPPAAPRTCRSSRSDP----------CP-YYYWYGDQSNTTGDLALEAFTV 253

Query: 202 NL---PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSH 252
           NL    +R +   ++GC   +       AG+ G GRG  S  SQL       FSYCL+ H
Sbjct: 254 NLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDH 313

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
               +   S ++   G  +       L YT F   PS AE    + +YYV L+ I VGG+
Sbjct: 314 G---SAVGSKIVF--GDDNVLLSHPQLNYTAFA--PSAAE----NTFYYVQLKGILVGGE 362

Query: 313 RVRVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
            + +      + + DG+GGTI+DSGTT ++     ++ +   FV +M K       L A+
Sbjct: 363 MLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDK----AYPLIAD 418

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREA 430
               L PC++V G +    PE  L F  GA    P ENYF  +     +CL V+ T R A
Sbjct: 419 -FPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSA 477

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 I+GN+Q QN++V YDL + RLGF  + C
Sbjct: 478 MS----IIGNYQQQNFHVLYDLHHNRLGFAPRRC 507


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 116/384 (30%), Positives = 169/384 (44%), Gaps = 37/384 (9%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +    GTPPQ    I+D+GS L+W  C     C  C +   P + P  SS+   + 
Sbjct: 63  GQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCA---PCLQCYAQDTPLYAPSNSSTFNPVP 119

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +P+C  I            E      +    C        + L++G+   E+  + + 
Sbjct: 120 CLSPECLLIPAT---------EGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDV 170

Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
            I     GC   +  S     G+ G G+G  S  SQ+     +KF+YCL+++  D T+ +
Sbjct: 171 RIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNY-LDPTSVS 229

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           S LI  +           L +TP V+N     RN     YYV + ++ VGG+ + + H  
Sbjct: 230 SWLIFGD---ELISTIHDLQFTPIVSN----SRNP--TLYYVQIEKVMVGGESLPISHSA 280

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
            +LD  GNGG+I DSGTT T+  P    P     ++   KN  Y R   A ++ GL  C 
Sbjct: 281 WSLDFLGNGGSIFDSGTTVTYWLP----PAYRNILAAFDKNVRYPR---AASVQGLDLCV 333

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           DV G    SFP   +   GGA       NYF  V   +  CL +     + GG + I GN
Sbjct: 334 DVTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAP-NVQCLAMAGLPSSVGGFNTI-GN 391

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
              QN+ V+YD    R+GF    C
Sbjct: 392 LLQQNFLVQYDREENRIGFAPAKC 415


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 118/382 (30%), Positives = 170/382 (44%), Gaps = 48/382 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G P + +  +LDTGS + W  CT    C  C     P F P  SSS   L 
Sbjct: 146 GEYFTRVGIGKPAREVYMVLDTGSDVNWLQCT---PCADCYHQTEPIFEPSSSSSYEPLS 202

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P+C+ +  E  +CR           N T +   Y V YG G  T G   +ETL + +
Sbjct: 203 CDTPQCNAL--EVSECR-----------NATCL---YEVSYGDGSYTVGDFATETLTIGS 246

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
            ++ N  VGC   +       AG+ G G G  +LPSQLN   FSYCL+    D     S+
Sbjct: 247 TLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSD-----SA 301

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
             +D G+S S          P + N      +    +YY+GL  I+VGG+ +++      
Sbjct: 302 STVDFGTSLSPDAVVA----PLLRN------HQLDTFYYLGLTGISVGGELLQIPQSSFE 351

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
           +D  G+GG I+DSGT  T +  E++  L D FV   +   +  +A G   +     C+++
Sbjct: 352 MDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTL---DLEKAAG---VAMFDTCYNL 405

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
             + T   P +  HF GG  + LP +NY   V      CL        +     I+GN Q
Sbjct: 406 SAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFA----PTASSLAIIGNVQ 461

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            Q   V +DL N  +GF    C
Sbjct: 462 QQGTRVTFDLANSLIGFSSNKC 483


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 173/388 (44%), Gaps = 45/388 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +    GTPPQ    I+D+GS L+W  C+    C+ C +   P ++P  SS+   + 
Sbjct: 62  GQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCS---PCRQCYAQDSPLYVPSNSSTFSPVP 118

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP---SYLVLYG-SGLTEGIALSETLN 202
           C +  C  I             P      C    P   +Y  LY  +  ++G+   E+  
Sbjct: 119 CLSSDCLLI-------------PATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESAT 165

Query: 203 LPNRIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
           +    I     GC   +  S     G+ G G+G  S  SQ+     +KF+YCL+++  D 
Sbjct: 166 VDGVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNY-LDP 224

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           T+ +SSLI  +           + YTP V+NP           YYV + ++TVGG+ + +
Sbjct: 225 TSVSSSLIFGD---ELISTIHDMQYTPIVSNPKSP------TLYYVQIEKVTVGGKSLPI 275

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                 +D  GNGG+I DSGTT T+  P  +  +   F S +    +Y R   AE++ GL
Sbjct: 276 SDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGV----HYPR---AESVQGL 328

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C ++ G    SFP   + F  GA      ENYF  V   +  CL +       GG + 
Sbjct: 329 DLCVELTGVDQPSFPSFTIEFDDGAVFQPEAENYFVDVAP-NVRCLAMAGLASPLGGFNT 387

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I GN   QN++V+YD     +GF    C
Sbjct: 388 I-GNLLQQNFFVQYDREENLIGFAPAKC 414


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 115/400 (28%), Positives = 178/400 (44%), Gaps = 50/400 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTPP+ +  ILDTGS L W  C   Y C   + S    + PK SS+ R + 
Sbjct: 169 GEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSH---YYPKDSSTYRNIS 225

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET----L 201
           C +P+C  +          + +PL   K   Q CP Y   Y  G  T G   SET    L
Sbjct: 226 CYDPRCQLV---------SSSDPLQHCKAENQTCP-YFYDYADGSNTTGDFASETFTVNL 275

Query: 202 NLPN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
             PN     + + + + GC   +       +G+ G GRG  S PSQ+       FSYC L
Sbjct: 276 TWPNGKEKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYC-L 334

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
           +  F +T+ +S LI   G          L +T  +      E      +YY+ ++ I VG
Sbjct: 335 TDLFSNTSVSSKLIF--GEDKELLNNHNLNFTTLL----AGEETPDETFYYLQIKSIMVG 388

Query: 311 GQRVRV----WH-KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
           G+ + +    WH        D  GGTI+DSG+T TF     ++ + + F  ++       
Sbjct: 389 GEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIK-----L 443

Query: 366 RALGAEALTGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
           + + A+    + PC++V G       P+  +HF  G     P ENYF        +CL +
Sbjct: 444 QQIAADDFV-MSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAI 502

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +  +  +     I+GN   QN+++ YD++  RLG+  + C
Sbjct: 503 M--KTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 540


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 130/395 (32%), Positives = 179/395 (45%), Gaps = 50/395 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + +  GTPP+    I+DTGS L W  C     C  C   + P F P  SSS R L C 
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQCA---PCLDCFEQRGPVFDPAASSSYRNLTCG 202

Query: 149 NPKCSWIHHESIQ----CRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIAL-SETL 201
           +P+C  +          CR   ++P          CP Y   YG  S  T  +AL S T+
Sbjct: 203 DPRCGHVAPPEAPAPRACRRPGEDP----------CP-YYYWYGDQSNSTGDLALESFTV 251

Query: 202 NL----PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLL 250
           NL     +  +   + GC   +       AG+ G GRG  S  SQL        FSYCL+
Sbjct: 252 NLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLV 311

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
            H  D     S ++     + +      L YT F    S A+      +YYV L  + VG
Sbjct: 312 DHGSD---VASKVVFGEDDALALAAHPRLKYTAFAPASSPAD-----TFYYVRLTGVLVG 363

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           G+ + +          G+GGTI+DSGTT ++     ++ +   F+ +M  + +Y      
Sbjct: 364 GELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRM--SGSYPPVPDF 421

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
             L+   PC++V G +    PEL L F  GA    P ENYF  +     +CL V+ T R 
Sbjct: 422 PVLS---PCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRT 478

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              G SII GNFQ QN++V YDL N RLGF  + C
Sbjct: 479 ---GMSII-GNFQQQNFHVAYDLHNNRLGFAPRRC 509


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 126/395 (31%), Positives = 169/395 (42%), Gaps = 49/395 (12%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           +   G Y   +  GTP      +LDTGS +VW  C     C+ C       F P+ S S 
Sbjct: 134 AQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCA---PCRRCYEQSGQVFDPRRSRSY 190

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             +GC  P C  +       R            C      Y V YG G +T G   +ETL
Sbjct: 191 NAVGCAAPLCRRLDSGGCDLR---------RSACL-----YQVAYGDGSVTAGDFATETL 236

Query: 202 NLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH-- 252
                  +    +GC   +       AG+ G GRG  S P+Q++      FSYCL+    
Sbjct: 237 TFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTS 296

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
             +  +R+S++   +G+  S   T   ++TP V NP +        +YYV L  I+VGG 
Sbjct: 297 SANTASRSSTVTFGSGAVGS---TVASSFTPMVKNPRM------ETFYYVQLIGISVGGA 347

Query: 313 RV-RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           RV  V +  L LD   G GG IVDSGT+ T +A   +  L D F       R     L  
Sbjct: 348 RVPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLR-----LSP 402

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
              +    C+D+ G K    P + +HF GGAE  LP ENY   V      C     TD  
Sbjct: 403 GGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTD-- 460

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             GG SII GN Q Q + V +D   QR+ F  + C
Sbjct: 461 --GGVSII-GNIQQQGFRVVFDGDGQRVAFTPKGC 492


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 122/388 (31%), Positives = 170/388 (43%), Gaps = 53/388 (13%)

Query: 89  YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           Y + L+ G PP  +PF+   DTGS L W  C     CK C     P + P  SS+   L 
Sbjct: 71  YLMELAIGKPP--VPFVALADTGSDLTWTQCQ---PCKLCFPQDTPVYDPSASSTFSPLP 125

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCT--QICPSYLVLYGSGL-TEGIALSETLNL 203
           C +  C  I                 S+NCT   +C  Y   YG G  + GI  +ETL L
Sbjct: 126 CSSATCLPIW----------------SRNCTPSSLC-RYRYAYGDGAYSAGILGTETLTL 168

Query: 204 PNRIIP----NFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
                P        GC   +   S    G  G GRG  SL +QL + KFSYCL    F +
Sbjct: 169 GPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCL--TDFFN 226

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           +   S  +L   +  +   +T +  TP + +P    R      Y+V L+ I++G  R+ +
Sbjct: 227 SALDSPFLLGTLAELAPGPST-VQSTPLLQSPQNPSR------YFVSLQGISLGDVRLPI 279

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
            +    L  DG GG IVDSGTTFT +A   F  +       + +      +L A      
Sbjct: 280 PNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDA------ 333

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
            PCF  P  +    P+L LHF GGA++ L  +NY +   E S+ CL +      S   + 
Sbjct: 334 -PCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPES---TS 389

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +LGNFQ QN  + +D    +L F    C
Sbjct: 390 VLGNFQQQNIQMLFDTTVGQLSFLPTDC 417


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 126/391 (32%), Positives = 181/391 (46%), Gaps = 56/391 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++L+ G+PPQ    I+DTGS L W  C     C+ C     P F P  S S R   
Sbjct: 37  GEYLMTLTLGSPPQSFDVIVDTGSDLNWVQC---LPCRVCYQQPGPKFDPSKSRSFRKAA 93

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C           + +  PL   K C      Y   YG    T G    ET++L N
Sbjct: 94  CTDNLC-----------NVSALPL---KACAANVCQYQYTYGDQSNTNGDLAFETISLNN 139

Query: 206 ----RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
               + +PNF  GC   ++ +    AG+ G G+G  SL SQL+    +KFSYCL+S    
Sbjct: 140 GAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSL--- 196

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           ++   S L   + ++ ++     + YT  V N           YYYV L  I VGGQ + 
Sbjct: 197 NSLSASPLTFGSIAAAAN-----IQYTSIVVNAR------HPTYYYVQLNSIEVGGQPLN 245

Query: 316 VWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
           +      +D+  G GGTI+DSGTT T +    +  +   + S +    NY R  G+    
Sbjct: 246 LAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFV----NYPRLDGSA--Y 299

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVTDREASGG 433
           GL  CF++ G    S P++   F+ GA+  +  EN F +V    + +CL +      S G
Sbjct: 300 GLDLCFNIAGVSNPSVPDMVFKFQ-GADFQMRGENLFVLVDTSATTLCLAM----GGSQG 354

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            SII GN Q QN+ V YDL  +++GF    C
Sbjct: 355 FSII-GNIQQQNHLVVYDLEAKKIGFATADC 384


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 117/383 (30%), Positives = 166/383 (43%), Gaps = 48/383 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G P + +  +LDTGS + W  C     C  C +   P + P +S+S   +G
Sbjct: 161 GEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQ---PCADCYAQSDPVYDPSVSTSYATVG 217

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +P+C          RD +    A  +N T  C  Y V YG G  T G   +ETL L +
Sbjct: 218 CDSPRC----------RDLD---AAACRNSTGSC-LYEVAYGDGSYTVGDFATETLTLGD 263

Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
              + N  +GC   +       AG+   G G  S PSQ++   FSYCL+     D    S
Sbjct: 264 SAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLV-----DRDSPS 318

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           S  L  G S     T  L  +P  N            +YYV L  I+VGG+ + +     
Sbjct: 319 SSTLQFGDSEQPAVTAPLIRSPRTNT-----------FYYVALSGISVGGEALSIPSSAF 367

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            +D  G+GG IVDSGT  T +    +  L + FV      ++  RA G   ++    C+D
Sbjct: 368 AMDDAGSGGVIVDSGTAVTRLQSGAYGALREAFVQ---GTQSLPRASG---VSLFDTCYD 421

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
           + G  +   P + L F+GG E+ LP +NY   V      CL        + GP  I+GN 
Sbjct: 422 LAGRSSVQVPAVALWFEGGGELKLPAKNYLIPVDAAGTYCLAFA----GTSGPVSIIGNV 477

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q Q   V +D     +GF    C
Sbjct: 478 QQQGVRVSFDTAKNTVGFTADKC 500


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 173/389 (44%), Gaps = 51/389 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +    GTPPQ++  +LDT +  VW PC+    C  CS++         S+ S +  
Sbjct: 102 GNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG---CSGCSNASTSFNTNSSSTYSTV-S 157

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL-SETLNLPN 205
           C   +C+      + C   + +P   S N +         YG   +   +L  +TL L  
Sbjct: 158 CSTAQCT--QARGLTCPSSSPQPSVCSFNQS---------YGGDSSFSASLVQDTLTLAP 206

Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDD 256
            +IPNF  GC   +  +S  P G+ G GRG  SL SQ   L    FSYCL S +   F  
Sbjct: 207 DVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSG 266

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           + +   L           +   + YTP + NP    R +    YYV L  ++VG  +V V
Sbjct: 267 SLKLGLL----------GQPKSIRYTPLLRNP---RRPSL---YYVNLTGVSVGSVQVPV 310

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
              YLT D +   GTI+DSGT  T  A  ++E + DEF  Q+  N +    LGA      
Sbjct: 311 DPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV--NVSSFSTLGA-----F 363

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF    E     P++ LH     ++ LP+EN       G+  CL++   R+ +     
Sbjct: 364 DTCFSADNENVA--PKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 420

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ++ N Q QN  + +D+ N R+G   + C 
Sbjct: 421 VIANLQQQNLRILFDVPNSRIGIAPEPCN 449


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 179/392 (45%), Gaps = 55/392 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y  ++  GTP ++   I+DTGS L W  C+    C  C S     FIP  S+S   L 
Sbjct: 1   GEYLATVRLGTPERVFSVIVDTGSDLTWVQCS---PCGTCYSQNDSLFIPNTSTSFTKLA 57

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL---- 201
           C    C+ + +           P+     C      Y   YG G L+ G  + +T+    
Sbjct: 58  CGTELCNGLPY-----------PMCNQTTCV-----YWYSYGDGSLSTGDFVYDTITMDG 101

Query: 202 -NLPNRIIPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
            N   + +PNF  GC   +    AG   I G G+G  S PSQL      KFSYCL+    
Sbjct: 102 INGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDW-L 160

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
              T+TS L+  + +  +     G+ Y   + NP V        YYYV L  I+VGG+ +
Sbjct: 161 APPTQTSPLLFGDAAVPT---FPGVKYISLLTNPKVP------TYYYVKLNGISVGGKLL 211

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEAL 373
            +      +D  G  GTI DSGTT T +A E+ +    E ++ M     +Y R   ++  
Sbjct: 212 NISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQ----EVLAAMNASTMDYPRK--SDDS 265

Query: 374 TGLRPCFDVPGE-KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
           +GL  C     E +  + P +  HF+GG ++ LP  NYF  +    + C ++V+  + + 
Sbjct: 266 SGLDLCLGGFAEGQLPTVPSMTFHFEGG-DMELPPSNYFIFLESSQSYCFSMVSSPDVT- 323

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               I+G+ Q QN+ V YD   +++GF  + C
Sbjct: 324 ----IIGSIQQQNFQVYYDTVGRKIGFVPKSC 351


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 119/384 (30%), Positives = 168/384 (43%), Gaps = 34/384 (8%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  + GTPP  +  +LDTGS L+W  C     C+ C     P + P  S +   + C 
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQC--DAPCRRCFPQPAPLYAPARSVTYANVSCG 157

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
           +  C  +   S++         +        C +Y   YG G  T+G+  +ET       
Sbjct: 158 SRLCDAL--PSLRPSSRCSASASAPAPERGGC-TYYYSYGDGSSTDGVLATETFTFGAGT 214

Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
            + +   GC   ++  +   +G+ G GRG  SL SQL + KFSYC     F+DTT +S L
Sbjct: 215 TVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCFT--PFNDTTTSSPL 272

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
            L + +S S    +    TPFV +PS   R   S YYY+ L  ITVG   + +      L
Sbjct: 273 FLGSSASLSPAAKS----TPFVPSPSGPRR---SSYYYLSLEGITVGDTLLPIDPAVFRL 325

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
              G GG I+DSGTTFT +    F       V            L + A  GL  CF  P
Sbjct: 326 TASGRGGLIIDSGTTFTALEERAFV------VLARAVAARVALPLASGAHLGLSVCFAAP 379

Query: 384 ---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
              G +    P L LHF  GA++ LP  +           CL +V+ R  S     +LG+
Sbjct: 380 QGRGPEAVDVPRLVLHFD-GADMELPRSSAVVEDRVAGVACLGIVSARGMS-----VLGS 433

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q QN +V YD+    L F+   C
Sbjct: 434 MQQQNMHVRYDVGRDVLSFEPANC 457


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 125/398 (31%), Positives = 180/398 (45%), Gaps = 50/398 (12%)

Query: 75  TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
           +++ T+  +   G Y   L  GTPP+ +  +LDTGS +VW  C     C+ C S   P F
Sbjct: 133 SSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCA---PCRKCYSQTDPVF 189

Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTE 193
            PK S S   + C++P C  +  +S  C            N  Q C  Y V YG G  T 
Sbjct: 190 DPKKSGSFSSISCRSPLC--LRLDSPGC------------NSRQSC-LYQVAYGDGSFTF 234

Query: 194 GIALSETLNLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSY 247
           G   +ETL      +P   +GC   +       AG+ G GRG+ S P+Q  L    KFSY
Sbjct: 235 GEFSTETLTFRGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSY 294

Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           CL+      +++ SS++   G S   +      +TP + NP +        +YY+ L  I
Sbjct: 295 CLVDRS--ASSKPSSVVF--GQSAVSRTA---VFTPLITNPKL------DTFYYLELTGI 341

Query: 308 TVGGQRVR-VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           +VGG RV  +      LD  GNGG I+DSGT+ T +    +  L D F +     +    
Sbjct: 342 SVGGARVAGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKR--- 398

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
              A   +    CFD+ G+     P + +HF+ GA+V+LP  NY   V      C     
Sbjct: 399 ---APDYSLFDTCFDLSGKTEVKVPTVVMHFR-GADVSLPATNYLIPVDTNGVFCFAFAG 454

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 G SII GN Q Q + V +D+   R+GF  + C
Sbjct: 455 TMS---GLSII-GNIQQQGFRVVFDVAASRIGFAARGC 488


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 130/390 (33%), Positives = 174/390 (44%), Gaps = 57/390 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   L  GTP + +  +LDTGS +VW  C     C  C S   P F P  S S   + 
Sbjct: 143 GEYFTRLGVGTPARYVYMVLDTGSDIVWIQCA---PCIKCYSQTDPVFDPTKSRSFANIP 199

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +P C  +           D P  ++K   QIC  Y V YG G  T G   +ETL    
Sbjct: 200 CGSPLCRRL-----------DYPGCSTKK--QIC-LYQVSYGDGSFTVGEFSTETLTFRG 245

Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
             +   ++GC         G+        G GRG+ S PSQ+      KFSYCL      
Sbjct: 246 TRVGRVVLGC----GHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRS-- 299

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
            ++R SS++   G S   + T    +TP ++NP +        +YYV L  I+VGG RV 
Sbjct: 300 ASSRPSSIVF--GDSAISRTT---RFTPLLSNPKL------DTFYYVELLGISVGGTRVS 348

Query: 316 -VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            +      LD  GNGG I+DSGT+ T +    +  L D F   +V   N  R   A   +
Sbjct: 349 GISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAF---LVGASNLKR---APEFS 402

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
               CFD+ G+     P + LHF+ GA+V LP  NY   V    + C         + G 
Sbjct: 403 LFDTCFDLSGKTEVKVPTVVLHFR-GADVPLPASNYLIPVDNSGSFCFAFA---GTASGL 458

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           SII GN Q Q + V YDL   R+GF  + C
Sbjct: 459 SII-GNIQQQGFRVVYDLATSRVGFAPRGC 487


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 124/397 (31%), Positives = 172/397 (43%), Gaps = 64/397 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK---IPSFIPKLSSSSRLL 145
           +++++S GTPPQ    ILDTGS L+W       QCK   + +    P + P  SSS    
Sbjct: 89  HTLTVSIGTPPQPRTLILDTGSDLIW------TQCKLFDTRQHREKPLYDPAKSSSFAAA 142

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
            C               R C      T KNC++    Y   YGS  T+G   SET     
Sbjct: 143 PCDG-------------RLCETGSFNT-KNCSRNKCIYTYNYGSATTKGELASETFTFGE 188

Query: 206 --RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
             R+  +   GC  L+S      +GI G    + SL SQL + +FSYCL    F D   T
Sbjct: 189 HRRVSVSLDFGCGKLTSGSLPGASGILGISPDRLSLVSQLQIPRFSYCLT--PFLDRNTT 246

Query: 261 SSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           S +     +  S  +TTG +  T  V NP     +  + YYYV L  I+VG +R+ V   
Sbjct: 247 SHIFFGAMADLSKYRTTGPIQTTSLVTNP-----DGSNYYYYVPLIGISVGTKRLNVPVS 301

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM------VKNRNYTRALGAEAL 373
              + RDG+GGT VDSG T   +   + E L +  V  +        +  Y   L     
Sbjct: 302 SFAIGRDGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYEL----- 356

Query: 374 TGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
                CF +P    G+       P L  HF GGA + L  ++Y   V  G  +CL +   
Sbjct: 357 -----CFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSAGR-MCLVI--- 407

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +SG    I+GN+Q QN +V +D+ N    F    C
Sbjct: 408 --SSGARGAIIGNYQQQNMHVLFDVENHEFSFAPTQC 442


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 132/444 (29%), Positives = 189/444 (42%), Gaps = 60/444 (13%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALH------IKNPQTKTTTTTTTTTTTNISSHSY 86
           LSR H +  +      NSL ++ L  AL       +K  +T+      +T  T+ +S   
Sbjct: 105 LSRLHRDTVR-----FNSL-TARLQLALEDISKSDLKPLETEIKPEDLSTPVTSGTSQGS 158

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G P +    +LDTGS + W  C     C  C     P F P  SS+   + 
Sbjct: 159 GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ---PCTDCYQQTDPIFDPTASSTYAPVT 215

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           CQ+ +CS +   S  CR         S  C      Y V YG G  T G   +E+++  N
Sbjct: 216 CQSQQCSSLEMSS--CR---------SGQCL-----YQVNYGDGSYTFGDFATESVSFGN 259

Query: 206 R-IIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
              + N  +GC   +       AG+ G G G  SL +QL    FSYCL++    D+  +S
Sbjct: 260 SGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNR---DSAGSS 316

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L  ++     D  T      P + N  +        +YYVGL  ++VGGQ V +     
Sbjct: 317 TLDFNSAQLGVDSVTA-----PLMKNRKI------DTFYYVGLSGMSVGGQMVSIPESTF 365

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            LD  GNGG IVD GT  T +  + + PL D FV +M +N   T A+          C+D
Sbjct: 366 RLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFV-RMTQNLKLTSAVAL-----FDTCYD 419

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
           + G+ +   P +  HF  G    LP  NY   V      C        +      I+GN 
Sbjct: 420 LSGQASVRVPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLS----IIGNV 475

Query: 442 QMQNYYVEYDLRNQRLGFKQQLCK 465
           Q Q   V +DL N R+GF    C+
Sbjct: 476 QQQGTRVTFDLANNRMGFSPNKCQ 499


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 173/389 (44%), Gaps = 51/389 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +    GTPPQ++  +LDT +  VW PC+    C  CS++         S+ S +  
Sbjct: 28  GNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG---CSGCSNASTSFNTNSSSTYSTV-S 83

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL-SETLNLPN 205
           C   +C+      + C   + +P   S N +         YG   +   +L  +TL L  
Sbjct: 84  CSTAQCT--QARGLTCPSSSPQPSVCSFNQS---------YGGDSSFSASLVQDTLTLAP 132

Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDD 256
            +IPNF  GC   +  +S  P G+ G GRG  SL SQ   L    FSYCL S +   F  
Sbjct: 133 DVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSG 192

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           + +   L           +   + YTP + NP    R +    YYV L  ++VG  +V V
Sbjct: 193 SLKLGLL----------GQPKSIRYTPLLRNP---RRPSL---YYVNLTGVSVGSVQVPV 236

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
              YLT D +   GTI+DSGT  T  A  ++E + DEF  Q+  N +    LGA      
Sbjct: 237 DPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV--NVSSFSTLGA-----F 289

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF    E     P++ LH     ++ LP+EN       G+  CL++   R+ +     
Sbjct: 290 DTCFSADNENVA--PKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 346

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ++ N Q QN  + +D+ N R+G   + C 
Sbjct: 347 VIANLQQQNLRILFDVPNSRIGIAPEPCN 375


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 121/394 (30%), Positives = 175/394 (44%), Gaps = 58/394 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y + LS GTPPQ+IP ++DTGS LVW  C N   C +C         F    SSS + 
Sbjct: 3   GEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDN---CDHCDLDHHGETIFFSDASSSYKK 59

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
           L C +  CS +    I  R            C + C  Y   YG G  T G   S+ ++ 
Sbjct: 60  LPCNSTHCSGMSSAGIGPR------------CEETC-KYKYEYGDGSRTSGDVGSDRISF 106

Query: 204 PNR--------IIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCL 249
            +             FL GC+           G+ G G+   SL  QL      KFSYCL
Sbjct: 107 RSHGAGEDHRSFFDGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCL 166

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           +S+   D+  ++   L  GSS        L     V+ P +   +     YYV L+ IT+
Sbjct: 167 VSY---DSPPSAKSFLFLGSS------AALRGHDVVSTPILHGDHLDQTLYYVDLQSITI 217

Query: 310 GGQRVRVWHKYLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
           GG  V V+ K    +          T++DSGTT+T + P ++E +      Q++      
Sbjct: 218 GGVPVVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVI-----L 272

Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
             LG  A  GL  CF+  G+ +  FP +  +F    ++ LP EN F V      VCL++ 
Sbjct: 273 PTLGNSA--GLDLCFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSR-DVVCLSM- 328

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
              ++SGG   I+GN Q QN+++ YDL   ++ F
Sbjct: 329 ---DSSGGDLSIIGNMQQQNFHILYDLVASQISF 359


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 122/394 (30%), Positives = 174/394 (44%), Gaps = 58/394 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y + LS GTPPQ+IP ++DTGS LVW  C N   C +C         F    SSS + 
Sbjct: 3   GEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDN---CDHCDLDHHGETIFFSDASSSYKK 59

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
           L C +  CS +    I  R            C + C  Y   YG G  T G   S+ ++ 
Sbjct: 60  LPCNSTHCSGMSSAGIGPR------------CEETC-KYKYEYGDGSRTSGDVGSDRISF 106

Query: 204 PNR--------IIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCL 249
            +             FL GC            G+ G G+   SL  QL      KFSYCL
Sbjct: 107 RSHGAGEDHRSFFDGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCL 166

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           +S+   D+  ++   L  GSS        L     V+ P +   +     YYV L+ ITV
Sbjct: 167 VSY---DSPPSAKSFLFLGSS------AALRGHDVVSTPILHGDHLDQTLYYVDLQSITV 217

Query: 310 GGQRVRVWHKYLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
           GG  V V+ K    +          T++DSGTT+T + P ++E +      Q++      
Sbjct: 218 GGVPVVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVI-----L 272

Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
             LG  A  GL  CF+  G+ +  FP +  +F    ++ LP EN F V      VCL++ 
Sbjct: 273 PTLGNSA--GLDLCFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSR-DVVCLSM- 328

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
              ++SGG   I+GN Q QN+++ YDL   ++ F
Sbjct: 329 ---DSSGGDLSIIGNMQQQNFHILYDLVASQISF 359


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 134/443 (30%), Positives = 194/443 (43%), Gaps = 67/443 (15%)

Query: 36  FHTNPSQDS--YQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
           FH    +D+   + L+SL ++S       +N      TT  +++  +  +   G Y   +
Sbjct: 81  FHLRLQRDAIRVKKLSSLGATS-------RNLSKPGGTTGFSSSVISGLAQGSGEYFTRI 133

Query: 94  SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
             GTPP+ +  +LDTGS +VW  C     CK C S   P F P  S S   + C+ P C 
Sbjct: 134 GVGTPPKYVYMVLDTGSDIVWLQCA---PCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCR 190

Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFL 212
            +  ES  C            N  Q C  Y V YG G  T G  ++ETL      +    
Sbjct: 191 RL--ESPGC------------NQRQTC-LYQVSYGDGSYTTGEFVTETLTFRRTKVEQVA 235

Query: 213 VGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
           +GC         G+        G GRG  S PSQ       KFSYCL+      +++ SS
Sbjct: 236 LGC----GHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRS--ASSKPSS 289

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYL 321
           ++  N +     +     +TP + NP +        +YYV L  I+VGG  V  +   + 
Sbjct: 290 VVFGNSAVSRTAR-----FTPLLTNPRL------DTFYYVELLGISVGGTPVSGITASHF 338

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            LDR GNGG I+D GT+ T +    +  L D F +     ++      A   +    C+D
Sbjct: 339 KLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKS------APEFSLFDTCYD 392

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
           + G+ T   P + LHF+ GA+V+LP  NY   V      C         + G SII GN 
Sbjct: 393 LSGKTTVKVPTVVLHFR-GADVSLPASNYLIPVDGSGRFCFAFA---GTTSGLSII-GNI 447

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q Q + V YDL + R+GF  + C
Sbjct: 448 QQQGFRVVYDLASSRVGFSPRGC 470


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 118/431 (27%), Positives = 181/431 (41%), Gaps = 67/431 (15%)

Query: 68  KTTTTTTTTTTTNISSHSY---GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ-- 122
           K  TTT+    + + S ++   G Y +S++FGTPPQ +  I DTGS L+W  C+      
Sbjct: 29  KLATTTSFWAESPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPP 88

Query: 123 --CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
             C   + S+ P+F+   S++  ++ C   +C  +             P      C+   
Sbjct: 89  AFCPKKACSRRPAFVASKSATLSVVPCSAAQCLLV-----------PAPRGHGPACSPAA 137

Query: 181 P---SYLVLYGSG-LTEGIALSETLNLPN-----RIIPNFLVGCSVL----SSRQPAGIA 227
           P    Y   Y  G  T G    +T  + N       +     GC       S     G+ 
Sbjct: 138 PVPCGYAYDYADGSSTTGFLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVI 197

Query: 228 GFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
           G G+G+ S P+Q   L    FSYCLL  +     R+SS +         ++     YTP 
Sbjct: 198 GLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLG----RPERRAAFAYTPL 253

Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
           V+NP          +YYVG+  I VG + + V      +D  GNGGT++DSG+T T++  
Sbjct: 254 VSNPLA------PTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRL 307

Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-----PGEKTGSFPELKLHFKG 399
             +  L   F + +   R  +    A    GL  C++V          G FP L + F  
Sbjct: 308 GAYLHLVSAFAASVHLPRIPSS---ATFFQGLELCYNVSSSSSSAPANGGFPRLTIDFAQ 364

Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI------ILGNFQMQNYYVEYDLR 453
           G  + LP  NY   V +    CL +         P++      +LGN   Q Y+VE+D  
Sbjct: 365 GLSLELPTGNYLVDVAD-DVKCLAIR--------PTLSPFAFNVLGNLMQQGYHVEFDRA 415

Query: 454 NQRLGFKQQLC 464
           + R+GF +  C
Sbjct: 416 SARIGFARTEC 426


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 169/387 (43%), Gaps = 49/387 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G+P + +  +LDTGS + W  C     C  C +   P F P LSSS   + 
Sbjct: 194 GEYFSRIGIGSPARQLYMVLDTGSDVTWLQCA---PCADCYAQSDPLFDPALSSSYATVP 250

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP- 204
           C +P C      ++    C++     + +C      Y V YG G  T G   +ETL L  
Sbjct: 251 CDSPHC-----RALDASACHNNAANGNSSCV-----YEVAYGDGSYTVGDFATETLTLGG 300

Query: 205 --NRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
             +  + +  +GC   +       AG+   G G  S PSQ++  +FSYCL+     D   
Sbjct: 301 DGSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLV-----DRDS 355

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWH 318
            S+  L  G+S S   T  L  +P  N            +YYV L  I+VGG+ +  +  
Sbjct: 356 PSASTLQFGASDSSTVTAPLMRSPRSN-----------TFYYVALNGISVGGETLSDIPP 404

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGLR 377
               +D  G+GG IVDSGT  T +    +  L D FV         T+AL  A  ++   
Sbjct: 405 AAFAMDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRG-------TQALPRASGVSLFD 457

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            C+D+ G  +   P + L F+GG E+ LP +NY   V      CL       A+GG   I
Sbjct: 458 TCYDLAGRSSVQVPAVSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFA----ATGGAVSI 513

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +GN Q Q   V +D     +GF    C
Sbjct: 514 VGNVQQQGIRVSFDTAKNTVGFSPNKC 540


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 127/412 (30%), Positives = 180/412 (43%), Gaps = 66/412 (16%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           SS   G Y + L  GTP +  P I+DTGS L W  C         SS   P +    SSS
Sbjct: 52  SSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSS 111

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS---YLVLYGS-GLTEGIAL 197
            R + C + +C ++             P     +C+   PS   Y   Y     T GI  
Sbjct: 112 YREIPCTDDECQFL-------------PAPIGSSCSITSPSPCDYTYGYSDQSRTTGILA 158

Query: 198 SETLNLPNRI---------------IPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPS 238
            ET+++ +R                I N  +GCS  S        +G+ G G+G  SL +
Sbjct: 159 YETISMKSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLAT 218

Query: 239 QLNLDK----FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
           Q         FSYCL+ +    +  +S L++  G +H  K    L +TP V NP      
Sbjct: 219 QTRHTALGGIFSYCLVDY-LRGSNASSFLVM--GRTHWRK----LAHTPIVRNP------ 265

Query: 295 AFSVYYYVGLRRITVGGQRVR-VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
           A   +YYV +  + V G+ V  +      +D DGN GTI DSGTT ++    L EP   +
Sbjct: 266 AAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSY----LREPAYSK 321

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
            +  +  +    RA   E   G   C++V   + G  P+L + F+GGA + LP  NY  +
Sbjct: 322 VLGALNASIYLPRA--QEIPEGFELCYNVTRMEKG-MPKLGVEFQGGAVMELPWNNYMVL 378

Query: 414 VGEG-SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           V E    V L  VT    S     ILGN   Q++++EYDL   R+GFK   C
Sbjct: 379 VAENVQCVALQKVTTTNGSN----ILGNLLQQDHHIEYDLAKARIGFKWSPC 426


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 115/373 (30%), Positives = 173/373 (46%), Gaps = 49/373 (13%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH-HESIQCR 163
           I+DT S L W  C     C  C   + P F P  S S   + C +  C  +     +  +
Sbjct: 127 IVDTASELTWVQCE---PCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQ 183

Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ 222
            C+D+P A S         Y + Y  G  + G+   + L+L    I  F+ GC   S++ 
Sbjct: 184 ACDDQPAACS---------YTLSYRDGSYSRGVLAHDRLSLAGEDIQGFVFGCGT-SNQG 233

Query: 223 P----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLIL-DNGSSHSD 273
           P    +G+ G GR + SL SQ  +D+F    SYCL      ++  + SL+L D+ S +  
Sbjct: 234 PFGGTSGLMGLGRSQLSLISQ-TMDQFGGVFSYCLPPK---ESGSSGSLVLGDDASVY-- 287

Query: 274 KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIV 333
           + +T + YT  V++P          +Y   L  ITVGG+ V    +       G G  IV
Sbjct: 288 RNSTPIVYTAMVSDPLQGP------FYLANLTGITVGGEDV----QSPGFSAGGGGKAIV 337

Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
           DSGT  T + P ++  +  EFVSQ+ +         A   + L  CFD+ G +    P L
Sbjct: 338 DSGTIITSLVPSVYAAVRAEFVSQLAEYPQ------AAPFSILDTCFDLTGLREVQVPSL 391

Query: 394 KLHFKGGAEVTLPVENYFAVV-GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDL 452
           KL F GGAEV +  +    VV G+ S VCL + + +     P  I+GN+Q +N  V +D 
Sbjct: 392 KLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTP--IIGNYQQKNLRVIFDT 449

Query: 453 RNQRLGFKQQLCK 465
              ++GF Q+ C 
Sbjct: 450 VGSQIGFAQETCD 462


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 120/395 (30%), Positives = 169/395 (42%), Gaps = 48/395 (12%)

Query: 75  TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
           +T  T+ +S   G Y   +  G P +    +LDTGS + W  C     C  C     P F
Sbjct: 6   STPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ---PCTDCYQQTDPIF 62

Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTE 193
            P  SS+   + CQ+ +CS +   S  CR         S  C      Y V YG G  T 
Sbjct: 63  DPTASSTYAPVTCQSQQCSSLEMSS--CR---------SGQCL-----YQVNYGDGSYTF 106

Query: 194 GIALSETLNLPNR-IIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
           G   +E+++  N   + N  +GC   +       AG+ G G G  SL +QL    FSYCL
Sbjct: 107 GDFATESVSFGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL 166

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           ++    D+  +S+L  ++     D  T      P + N  +        +YYVGL  ++V
Sbjct: 167 VNR---DSAGSSTLDFNSAQLGVDSVTA-----PLMKNRKI------DTFYYVGLSGMSV 212

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           GGQ V +      LD  GNGG IVD GT  T +  + + PL D FV +M +N   T A+ 
Sbjct: 213 GGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFV-RMTQNLKLTSAVA 271

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
                    C+D+ G+ +   P +  HF  G    LP  NY   V      C        
Sbjct: 272 L-----FDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTS 326

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +      I+GN Q Q   V +DL N R+GF    C
Sbjct: 327 SLS----IIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 126/438 (28%), Positives = 184/438 (42%), Gaps = 64/438 (14%)

Query: 48  LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSH--SYGGYSISLSFGTP-PQIIPF 104
           L  +V  S  RA     P +++ T    T      SH   Y  Y I    GTP PQ +  
Sbjct: 50  LRRMVLRSRARAAKQLCP-SRSGTPVRVTAPVASGSHVVGYTEYLIHFGIGTPRPQQVAL 108

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
            +DTGS +VW  C     C  C +  +P F    S +   + C +P C  +   +     
Sbjct: 109 EVDTGSDVVWTQCR---PCFDCFTQPLPRFDTSASDTVHGVLCTDPICRALRPHACFLGG 165

Query: 165 CNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNR-----IIPNFLVGCSVL 218
           C                +Y V YG + +T G    ++     +      +P+ + GC   
Sbjct: 166 C----------------TYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQY 209

Query: 219 SS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD- 273
           ++        GIAGFGRG  SLP QL +  FSYC     F     + S  +  G + +D 
Sbjct: 210 NTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYC-----FTTIFESKSTPVFLGGAPADG 264

Query: 274 ---KKTTGLTYTPFV-NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
                T  +  TPF+ N+P          YYY+ L+ ITVG  R+ V      +  DG+G
Sbjct: 265 LRAHATGPILSTPFLPNHPE---------YYYLSLKGITVGKTRLAVPESAFVVKADGSG 315

Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF---DVPGEK 386
           GTI+DSGT  T     +F  L + FV+Q+          G   L     CF    VP   
Sbjct: 316 GTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQ----CFSTESVPDAS 371

Query: 387 TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNY 446
               P++ LH + GA+  LP ENY A   +   +C+ V+    A      ++GNFQ QN 
Sbjct: 372 KVPVPKMTLHLE-GADWELPRENYMAEYPDSDQLCVVVL----AGDDDRTMIGNFQQQNM 426

Query: 447 YVEYDLRNQRLGFKQQLC 464
           ++ +DL   +L  +   C
Sbjct: 427 HIVHDLAGNKLVIEPAQC 444


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 113/383 (29%), Positives = 169/383 (44%), Gaps = 48/383 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y      GTP Q +   +D  +   W PC+    C  C++S  PSF P  SS+ R + C 
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCS---ACAGCAASS-PSFSPTQSSTYRTVPCG 157

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P+C+ +   S         P     +C      + + Y +   + +   ++L L N ++
Sbjct: 158 SPQCAQVPSPSC--------PAGVGSSC-----GFNLTYAASTFQAVLGQDSLALENNVV 204

Query: 209 PNFLVGC-SVLS--SRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSS 262
            ++  GC  V+S  S  P G+ GFGRG  S  SQ        FSYCL +++  + + T  
Sbjct: 205 VSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGT-- 262

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L         KTT L Y P  + PS+         YYV +  I VG + V+V    L 
Sbjct: 263 LKLGPIGQPKRIKTTPLLYNP--HRPSL---------YYVNMIGIRVGSKVVQVPQSALA 311

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI+D+GT FT +A  ++  + D F       R   R   A  L G   C++V
Sbjct: 312 FNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAF-------RGRVRTPVAPPLGGFDTCYNV 364

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNF 441
               T S P +   F G   VTLP EN       G   CL +          ++ +L + 
Sbjct: 365 ----TVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASM 420

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q QN  V +D+ N R+GF ++LC
Sbjct: 421 QQQNQRVLFDVANGRVGFSRELC 443


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 120/393 (30%), Positives = 174/393 (44%), Gaps = 43/393 (10%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ILDTGS L W  C     C  C       + PK S+S + + 
Sbjct: 153 GEYFMDVLVGSPPKHFSLILDTGSDLNWIQC---LPCHDCFQQNGAFYDPKASASYKNIT 209

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--LNLP 204
           C +P+C+ +            +P    K+  Q CP Y     S  T G    ET  +NL 
Sbjct: 210 CNDPRCNLVSPP---------DPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLT 260

Query: 205 NR-------IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLS 251
                     + N + GC   +       AG+ G GRG  S  SQL       FSYCL+ 
Sbjct: 261 TSGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVD 320

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
               DT  +S LI   G          L +T FV      + N    +YYV ++ I V G
Sbjct: 321 RN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFV----ARKENLVDTFYYVQIKSIIVAG 373

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           + + +  +   +  DG GGTI+DSGTT ++ A    EP A EF+   +  +   +     
Sbjct: 374 EVLNIPEETWNISSDGAGGTIIDSGTTLSYFA----EP-AYEFIKNKIAEKAKGKYPVYR 428

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
               L PCF+V G  +   PEL + F  GA    P EN F  + E   VCL ++   +++
Sbjct: 429 DFPILDPCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAILGTPKSA 487

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                I+GN+Q QN+++ YD +  RLG+    C
Sbjct: 488 FS---IIGNYQQQNFHILYDTKRSRLGYAPTKC 517


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 123/396 (31%), Positives = 169/396 (42%), Gaps = 59/396 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +  +LDTGS L+W  C     C  C S   P F P  S+S   + C 
Sbjct: 96  YVVDLAIGTPPQPVSALLDTGSDLIWTQCA---PCASCLSQPDPLFAPGQSASYEPMRCA 152

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-- 205
              CS I H S +  D           CT     Y   YG G +T G+  +E     +  
Sbjct: 153 GTLCSDILHHSCERPD----------TCT-----YRYNYGDGTMTVGVYATERFTFASSG 197

Query: 206 ------RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
                   +P    GC   +V S    +GI GFGR   SL SQL++ +FSYCL S+    
Sbjct: 198 GGGLTTTTVP-LGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYA--- 253

Query: 257 TTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           + R S+L+  + S       TG +  TP + +P          +YYV    +TVG +R+R
Sbjct: 254 SRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQ------NPTFYYVHFTGLTVGARRLR 307

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      L  DG+GG IVDSGT  T +   +   +   F  Q+        A G     G
Sbjct: 308 IPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQL----RLPFANGGNPEDG 363

Query: 376 LRPCFDVPGEKTGS-------FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
           +  CF VP     S        P + LHF+ GA++ LP  NY         +CL +    
Sbjct: 364 V--CFLVPAAWRRSSSTSQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLAD-- 418

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             SG     +GN   Q+  V YDL  + L      C
Sbjct: 419 --SGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 113/383 (29%), Positives = 169/383 (44%), Gaps = 48/383 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y      GTP Q +   +D  +   W PC+    C  C++S  PSF P  SS+ R + C 
Sbjct: 83  YIARAGLGTPAQTLLVAIDPSNDAAWVPCS---ACAGCAASS-PSFSPTQSSTYRTVPCG 138

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P+C+ +   S         P     +C      + + Y +   + +   ++L L N ++
Sbjct: 139 SPQCAQVPSPSC--------PAGVGSSC-----GFNLTYAASTFQAVLGQDSLALENNVV 185

Query: 209 PNFLVGC-SVLS--SRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSS 262
            ++  GC  V+S  S  P G+ GFGRG  S  SQ        FSYCL +++  + + T  
Sbjct: 186 VSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGT-- 243

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L         KTT L Y P  + PS+         YYV +  I VG + V+V    L 
Sbjct: 244 LKLGPIGQPKRIKTTPLLYNP--HRPSL---------YYVNMIGIRVGSKVVQVPQSALA 292

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI+D+GT FT +A  ++  + D F       R   R   A  L G   C++V
Sbjct: 293 FNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAF-------RGRVRTPVAPPLGGFDTCYNV 345

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNF 441
               T S P +   F G   VTLP EN       G   CL +          ++ +L + 
Sbjct: 346 ----TVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASM 401

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q QN  V +D+ N R+GF ++LC
Sbjct: 402 QQQNQRVLFDVANGRVGFSRELC 424


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 129/446 (28%), Positives = 187/446 (41%), Gaps = 46/446 (10%)

Query: 36  FHTNPSQDSYQ-----NLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG--- 87
           F  NPSQ +        L SL  S+ T +  +   Q  +TT           S++Y    
Sbjct: 10  FSINPSQQTNSLSLSFPLTSLSLSNDTTSKMLYTSQLFSTTKKPNNPQNKTPSYNYKFSF 69

Query: 88  GYS----ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
            YS    I+L  GTPPQ  P +LDTGS L W  C       +       SF P LSS+  
Sbjct: 70  KYSMALIINLPIGTPPQTQPMVLDTGSQLSWIQC-------HKKQPPTASFDPSLSSTFS 122

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
           +L C +P C              D  L TS +  ++C  Y   Y  G   EG  + E   
Sbjct: 123 ILPCTHPLCK---------PRIPDFTLPTSCDQNRLC-HYSYFYADGTYAEGNLVREKFT 172

Query: 203 LPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT-TRT 260
               +  P  ++GC+   S  P GI G   G+ S   Q  + KFSYC+   +     T T
Sbjct: 173 FSRSVSTPPLILGCAT-ESTDPRGILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFTPT 231

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            S  L N  S    K  G+  +     P     N   + Y + +  I + G+++ +    
Sbjct: 232 GSFYLGNNPSSKGFKYVGMMTSSRQRMP-----NFDPLAYTIPMVGIRIAGKKLNISPAV 286

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
              D  G+G T++DSG+ FT++  E ++ +  + V  +          G  A      CF
Sbjct: 287 FRADAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADM----CF 342

Query: 381 D-VPGEKTGSF-PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           D V   + G    E+   F+ G EV +P E   A VG G  V    +   +  G  S I+
Sbjct: 343 DSVKAVEIGRLIGEMVFEFERGVEVVIPKERVLADVGGG--VHCVGIGSSDKLGAASNII 400

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GNF  QN +VE+DL  +R+GF +  C
Sbjct: 401 GNFHQQNLWVEFDLVRRRVGFGKADC 426


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 117/431 (27%), Positives = 182/431 (42%), Gaps = 67/431 (15%)

Query: 68  KTTTTTTTTTTTNISSHSY---GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ-- 122
           K  T T+    + + S ++   G Y +S++FGTPPQ +  I DTGS L+W  C+      
Sbjct: 30  KLATITSFWAESPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPP 89

Query: 123 --CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
             C   + S+ P+F+   S++  ++ C   +C  +             P     +C+   
Sbjct: 90  AFCPKKACSRRPAFVASKSATLSVVPCSAAQCLLV-----------PAPRGHGPSCSPAA 138

Query: 181 P---SYLVLYGSG-LTEGIALSETLNLPN-----RIIPNFLVGCSVL----SSRQPAGIA 227
           P    Y   Y  G  T G    +T  + N       +     GC       S     G+ 
Sbjct: 139 PVPCGYAYDYADGSSTTGFLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVI 198

Query: 228 GFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
           G G+G+ S P+Q   L    FSYCLL  +     R+SS +         ++     YTP 
Sbjct: 199 GLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLG----RPERRAAFAYTPL 254

Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
           V+NP          +YYVG+  I VG + + V      +D  GNGGT++DSG+T T++  
Sbjct: 255 VSNPLA------PTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRL 308

Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT-----GSFPELKLHFKG 399
             +  L   F + +   R  +    A    GL  C++V    +     G FP L + F  
Sbjct: 309 GAYLHLVSAFAASVHLPRIPSS---ATFFQGLELCYNVSSSSSLAPANGGFPRLTIDFAQ 365

Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI------ILGNFQMQNYYVEYDLR 453
           G  + LP  NY   V +    CL +         P++      +LGN   Q Y+VE+D  
Sbjct: 366 GLSLELPTGNYLVDVAD-DVKCLAIR--------PTLSPFAFNVLGNLMQQGYHVEFDRA 416

Query: 454 NQRLGFKQQLC 464
           + R+GF +  C
Sbjct: 417 SARIGFARTEC 427


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 115/393 (29%), Positives = 170/393 (43%), Gaps = 54/393 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP + +  ++DTGS + W  C     C  C   K   F P  SSS ++L 
Sbjct: 14  GEYFAVVGVGTPRRDMYLVVDTGSDITWLQCA---PCTNCYKQKDALFNPSSSSSFKVLD 70

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-------LTEGIALSE 199
           C +  C  ++ + + C          S  C      Y   YG G       +T+ + L +
Sbjct: 71  CSSSLC--LNLDVMGC---------LSNKCL-----YQADYGDGSFTMGELVTDNVVLDD 114

Query: 200 TLNLPNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHK 253
                  ++ N  +GC   +       AGI G GRG  S P+ L+    + FSYCL   +
Sbjct: 115 AFGPGQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRE 174

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
            D   +++ +  D    H+   T  + + P + NP VA       YYYV +  I+VGG  
Sbjct: 175 SDPNHKSTLVFGDAAIPHT--ATGSVKFIPQLRNPRVA------TYYYVQITGISVGGNL 226

Query: 314 V-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           +  +      LD  GNGGTI DSGTT T +    +  + D F       R  T  L + A
Sbjct: 227 LTNIPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAF-------RAATMHLTSAA 279

Query: 373 -LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
                  C+D  G  + S P +  HF+G  ++ LP  NY   V   +  C        AS
Sbjct: 280 DFKIFDTCYDFTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFA----AS 335

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            GPS+I GN Q Q++ V YD  ++++G     C
Sbjct: 336 MGPSVI-GNVQQQSFRVIYDNVHKQIGLLPDQC 367


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 125/386 (32%), Positives = 174/386 (45%), Gaps = 50/386 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTPP+ +  +LDTGS +VW  C     CK C S   P F P  S S   + 
Sbjct: 40  GEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCA---PCKNCYSQTDPVFNPVKSGSFAKVL 96

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C+ P C  +  ES  C            N  Q C  Y V YG G  T G  ++ETL    
Sbjct: 97  CRTPLCRRL--ESPGC------------NQRQTC-LYQVSYGDGSYTTGEFVTETLTFRR 141

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
             +    +GC   +       AG+ G GRG  S PSQ       KFSYCL+      +++
Sbjct: 142 TKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDR--SASSK 199

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWH 318
            SS++  N +     +     +TP + NP +        +YYV L  I+VGG  V  +  
Sbjct: 200 PSSVVFGNSAVSRTAR-----FTPLLTNPRL------DTFYYVELLGISVGGTPVSGITA 248

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
            +  LDR GNGG I+D GT+ T +    +  L D F +     ++      A   +    
Sbjct: 249 SHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKS------APEFSLFDT 302

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+D+ G+ T   P + LHF+ GA+V+LP  NY   V      C         + G SII 
Sbjct: 303 CYDLSGKTTVKVPTVVLHFR-GADVSLPASNYLIPVDGSGRFCFAFA---GTTSGLSII- 357

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q Q + V YDL + R+GF  + C
Sbjct: 358 GNIQQQGFRVVYDLASSRVGFSPRGC 383


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 124/393 (31%), Positives = 167/393 (42%), Gaps = 44/393 (11%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           +   G Y   +  GTP      +LDTGS +VW  C     C+ C     P F P+ SSS 
Sbjct: 134 AQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCA---PCRRCYDQSGPVFDPRRSSSY 190

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C  P C  +       R          + C      Y V YG G +T G   +ETL
Sbjct: 191 GAVDCAAPLCRRLDSGGCDLR---------RRACL-----YQVAYGDGSVTAGDFATETL 236

Query: 202 NLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
                  +    +GC   +       AG+ G GRG  S P+Q++      FSYCL+    
Sbjct: 237 TFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTS 296

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             ++  +S    +  +      +  ++TP V NP +        +YYV L  I+VGG RV
Sbjct: 297 SSSSGAASRSRSSTVTFGPPSASAASFTPMVRNPRM------ETFYYVQLVGISVGGARV 350

Query: 315 -RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
             V    L LD   G GG IVDSGT+ T +A   +  L D F +     R     L    
Sbjct: 351 PGVAESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLR-----LSPGG 405

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREAS 431
            +    C+D+ G K    P + +HF GGAE  LP ENY   V      C     TD    
Sbjct: 406 FSLFDTCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD---- 461

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GG SII GN Q Q + V +D   QR+GF  + C
Sbjct: 462 GGVSII-GNIQQQGFRVVFDGDGQRVGFAPKGC 493


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 126/393 (32%), Positives = 184/393 (46%), Gaps = 61/393 (15%)

Query: 89  YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           Y + L+ GTPP  +PF+   DTGS L W  C     CK C     P + P  SS+   + 
Sbjct: 66  YLMELAIGTPP--VPFVALADTGSDLTWTQCQ---PCKLCFPQDTPVYDPSASSTFSPVP 120

Query: 147 CQNPKC--SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
           C +  C  +W      + R+C++     S  C      Y+  Y  G  + GI  +ETL +
Sbjct: 121 CSSATCLPTW------RSRNCSNP----SSPC-----RYIYSYSDGAYSVGILGTETLTI 165

Query: 204 PNRI------IPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
            + +      + +   GC   +   S    G  G GRG  SL +QL + KFSYCL    F
Sbjct: 166 GSSVPGQTVSVGSVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCL--TDF 223

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
            ++T  S   L   +  +    T +  TP + +P    R      Y+V L+ I++G  R+
Sbjct: 224 FNSTMDSPFFLGTLAELAPGPGT-VQSTPLLQSPLNPSR------YFVNLQGISLGDVRL 276

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            + +    L  DGNGG +VDSGTTFT +A   F  + D  V+Q++        + A +L 
Sbjct: 277 PIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDR-VAQLLGQ----PPVNASSLD 331

Query: 375 GLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
              PCF  P GE     P+L LHF GGA++ L  +NY +   + S+ CL +V      G 
Sbjct: 332 --SPCFPSPDGEPF--MPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCLNIV------GS 381

Query: 434 PSII--LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           PS    LGNFQ QN  + +D+   +L F    C
Sbjct: 382 PSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDC 414


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 170/404 (42%), Gaps = 51/404 (12%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPSFIPKLSS 140
           +S   G Y +SL  GTPPQ +  + DTGS L+W  C+    C+ CS  S   +F  + S+
Sbjct: 79  ASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCS---PCRNCSHRSPGSAFFARHST 135

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           +   + C +P+C  + H       CN   L +       C        S  T G    E 
Sbjct: 136 TYSAIHCYSPQCQLVPHP--HPNPCNRTRLHSP------CRYQYTYADSSTTTGFFSKEA 187

Query: 201 LNLPN-----------------RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL- 242
           L L                   RI    L G S   ++   G+ G GR   S  SQL   
Sbjct: 188 LTLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQ---GVMGLGRAPISFSSQLGRR 244

Query: 243 --DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
              KFSYCL+ +       TS L +    + +  K   +++TP + NP          +Y
Sbjct: 245 FGSKFSYCLMDYTLSPPP-TSFLTIGGAQNVAVSKKGIMSFTPLLINPLSP------TFY 297

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
           Y+ ++ + V G ++ +     ++D  GNGGTI+DSGTT TF+     EP   E +    K
Sbjct: 298 YIAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFIT----EPAYTEILKAFKK 353

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
                    AE   G   C +V G    + P +  +  GG+  + P  NYF   G+    
Sbjct: 354 RVKLPSP--AEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGD-QIK 410

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           CL V    +  G    +LGN   Q + +E+D    RLGF ++ C
Sbjct: 411 CLAVQPVSQDGG--FSVLGNLMQQGFLLEFDRDKSRLGFTRRGC 452


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 123/394 (31%), Positives = 179/394 (45%), Gaps = 60/394 (15%)

Query: 89  YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           Y + L+ GTPP  +PF+   DTGS L W  C     CK C     P + P  SS+   + 
Sbjct: 77  YLMELAIGTPP--VPFVALADTGSDLTWTQCQ---PCKLCFPQDTPVYDPSASSTFSPVP 131

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-----SGLTEGIALSETL 201
           C +  C                P+  S+NC+   PS L  YG        + GI  +ETL
Sbjct: 132 CSSATC---------------LPVLRSRNCST--PSSLCRYGYSYSDGAYSAGILGTETL 174

Query: 202 NLPNRI------IPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
            L + +      + +   GC   +   S    G  G GRG  SL +QL + KFSYCL   
Sbjct: 175 TLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCL--T 232

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
            F ++T  S  +L   +  +      +  TP + +P    R      Y V L+ IT+G  
Sbjct: 233 DFFNSTLDSPFLLGTLAELA-PGPGAVQSTPLLQSPLNPSR------YVVSLQGITLGDV 285

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+ + +K   L  +  GG +VDSGTTF+ +    F  + D  V+Q++        + A +
Sbjct: 286 RLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDH-VAQVLGQ----PPVNASS 340

Query: 373 LTGLRPCFDVP-GEKTGSF-PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
           L    PCF  P GE+   F P+L LHF GGA++ L  +NY +   E S+ CL +V     
Sbjct: 341 LD--SPCFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTST 398

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 +LGNFQ QN  + +D+   +L F    C
Sbjct: 399 WS----MLGNFQQQNIQMLFDMTVGQLSFLPTDC 428


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 129/389 (33%), Positives = 168/389 (43%), Gaps = 70/389 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++ S GTPPQ +  + DTGS L+W  C     C  C     PS+ P  SSS   L 
Sbjct: 80  GAYDMTFSIGTPPQELSALADTGSDLIWAKCG---ACTRCVPQGSPSYYPNKSSSFSKLP 136

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALSETL 201
           C    CS +   S QC        A    C      Y   YG        T+G   SET 
Sbjct: 137 CSGSLCSDL--PSSQCS-------AGGAEC-----DYKYSYGLASDPHHYTQGYLGSETF 182

Query: 202 NLPNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
            L +  +P    GC+ +        +G+ G GRG  SL SQLN+  FSYCL S    D  
Sbjct: 183 TLGSDAVPGIGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTS----DAA 238

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY-VGLRRITVGGQRVRVW 317
           +TS L+  +G+        G+  TP +           S YYY V L  I++G       
Sbjct: 239 KTSPLLFGSGA----LTGAGVQSTPLLRT---------STYYYTVNLESISIGAA----- 280

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
               T    G+ G I DSGTT  F+A   +  LA E V  + +  N T A G +   G  
Sbjct: 281 ----TTAGTGSSGIIFDSGTTVAFLAEPAYT-LAKEAV--LSQTTNLTMASGRD---GYE 330

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI- 436
            CF   G     FP + LHF GG ++ LP ENYF  V + S  C  V         PS+ 
Sbjct: 331 VCFQTSGAV---FPSMVLHFDGG-DMDLPTENYFGAV-DDSVSCWIVQKS------PSLS 379

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           I+GN    NY++ YD+    L F+   C 
Sbjct: 380 IVGNIMQMNYHIRYDVEKSMLSFQPANCD 408


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 128/393 (32%), Positives = 166/393 (42%), Gaps = 44/393 (11%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           +   G Y   +  GTP      +LDTGS +VW  C     C+ C       F P+ S S 
Sbjct: 141 AQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCA---PCRRCYDQSGQMFDPRASHSY 197

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C  P C  +       R          K C      Y V YG G +T G   +ETL
Sbjct: 198 GAVDCAAPLCRRLDSGGCDLR---------RKACL-----YQVAYGDGSVTAGDFATETL 243

Query: 202 NLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
              +   +P   +GC   +       AG+ G GRG  S PSQ++      FSYCL+    
Sbjct: 244 TFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTS 303

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
              + TS        S +   +   ++TP V NP +        +YYV L  I+VGG RV
Sbjct: 304 SSASATSRSSTVTFGSGAVGPSAAASFTPMVKNPRM------ETFYYVQLMGISVGGARV 357

Query: 315 -RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
             V    L LD   G GG IVDSGT+ T +A   +  L D F +     R     L    
Sbjct: 358 PGVAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLR-----LSPGG 412

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREAS 431
            +    C+D+ G K    P + +HF GGAE  LP ENY   V      C     TD    
Sbjct: 413 FSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD---- 468

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GG SII GN Q Q + V +D   QRLGF  + C
Sbjct: 469 GGVSII-GNIQQQGFRVVFDGDGQRLGFVPKGC 500


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 127/404 (31%), Positives = 178/404 (44%), Gaps = 67/404 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPP+ +   LDTGS LVW  C     C+ C    +P   P  SS+   L C 
Sbjct: 92  YLVHLAVGTPPRPVALTLDTGSDLVWTQCA---PCRDCFHQGLPLLDPAASSTYAALPCG 148

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETL------ 201
            P+C  +   S     C     ++  N  + C +Y+  YG   +T G   ++        
Sbjct: 149 APRCRALPFTS-----CGGGGRSSWGNGNRSC-AYIYHYGDKSVTVGEIATDRFTFGGDN 202

Query: 202 -----NLPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLS 251
                 LP R       GC      V  S +  GIAGFGRG+ SLPSQLN+  FSYC  S
Sbjct: 203 GDGDSRLPTR---RLTFGCGHFNKGVFQSNE-TGIAGFGRGRWSLPSQLNVTTFSYCFTS 258

Query: 252 HKFDDTTRTSSLILDNGS-------SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
             F+     SSL+   G+       SH+   +  +  TP + NPS          Y++ L
Sbjct: 259 -MFES---KSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPS------LYFLSL 308

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
           + I+VG  R+ V    L         TI+DSG + T +   ++E +  EF +Q+      
Sbjct: 309 KGISVGKTRLAVPEAKLR-------STIIDSGASITTLPEAVYEAVKAEFAAQV-----G 356

Query: 365 TRALGAEALTGLRPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
               G    + L  CF +P     +    P L LH   GA+  LP  NY  V  + +A  
Sbjct: 357 LPPTGVVEGSALDLCFALPVTALWRRPPVPSLTLHLD-GADWELPRGNY--VFEDLAARV 413

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           + VV D  A+ G   ++GNFQ QN +V YDL N  L F    C 
Sbjct: 414 MCVVLD--AAPGDQTVIGNFQQQNTHVVYDLENDWLSFAPARCD 455


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 113/369 (30%), Positives = 175/369 (47%), Gaps = 48/369 (13%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           I+DTGS L W  C     C  C + + P F P  S S R + C +  C  +   +     
Sbjct: 80  IVDTGSDLSWVQCQ---PCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGV 136

Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR-- 221
           C   P      C     +Y+V YG G  T G    E LNL N  + NF+ GC   +    
Sbjct: 137 CGSNP----PTC-----NYVVNYGDGSYTSGEVGMEHLNLGNTTVNNFIFGCGRKNQGLF 187

Query: 222 -QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTT 277
              +G+ G GR   SL SQ++      FSYCL +    +   + SL++  G+S   K TT
Sbjct: 188 GGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTT---EAEASGSLVM-GGNSSVYKNTT 243

Query: 278 GLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGT 337
            ++YT  ++NP +        +Y++ L  ITVGG  V V       DR      I+DSGT
Sbjct: 244 PISYTRMIHNPLLP-------FYFLNLTGITVGG--VEVQAPSFGKDR-----MIIDSGT 289

Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
             + + P +++ L  EFV Q      ++    A +   L  CF++ G +    P++K++F
Sbjct: 290 VISRLPPSIYQALKAEFVKQ------FSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYF 343

Query: 398 KGGAEVTLPVEN-YFAVVGEGSAVCLTVVT-DREASGGPSIILGNFQMQNYYVEYDLRNQ 455
           +G AE+ + V   +++V  + S VCL + +   E   G   I+GN+Q +N  + YD +  
Sbjct: 344 EGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVG---IIGNYQQKNQRIIYDTKGS 400

Query: 456 RLGFKQQLC 464
            LGF ++ C
Sbjct: 401 MLGFAEEAC 409


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 119/384 (30%), Positives = 167/384 (43%), Gaps = 55/384 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  + GTP Q +   LDT +   W PC+    C  CSSS +  F P  SSSSR L C+
Sbjct: 88  YIVRANIGTPAQAMLVALDTSNDAAWIPCSG---CVGCSSSVL--FDPSKSSSSRTLQCE 142

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P          QC+   +     SK+C      + + YG    E     +TL L   +I
Sbjct: 143 AP----------QCKQAPNPSCTVSKSC-----GFNMTYGGSAIEAYLTQDTLTLATDVI 187

Query: 209 PNFLVGC--SVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
           PN+  GC      +  PA G+ G GRG  SL SQ   L    FSYCL + K  + + +  
Sbjct: 188 PNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLR 247

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L   N       +   +  TP + NP    R++    YYV L  I VG + V +    L 
Sbjct: 248 LGPKN-------QPIRIKTTPLLKNP---RRSSL---YYVNLVGIRVGNKIVDIPTSALA 294

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            D     GTI DSGT +T +    +  + +EF  + VKN N      A +L G   C+  
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAMRNEF-RRRVKNAN------ATSLGGFDTCY-- 345

Query: 383 PGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
               +GS  FP +   F  G  VTLP +N       G+  CL +            ++ +
Sbjct: 346 ----SGSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIAS 400

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q QN+ V  D+ N RLG  ++ C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 117/389 (30%), Positives = 174/389 (44%), Gaps = 52/389 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +    GTPPQ++  +LDT +  VW PC+    C  CS++         S+ S +  
Sbjct: 103 GNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSG---CSGCSNASTSFNTNSSSTYSTV-S 158

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL-SETLNLPN 205
           C   +C+      + C     +P         IC S+   YG   +    L  +TL L  
Sbjct: 159 CSTTQCT--QARGLTCPSSTPQP--------SIC-SFNQSYGGDSSFSANLVQDTLTLSP 207

Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDD 256
            +IPNF  GC   +  +S  P G+ G GRG  SL SQ   L    FSYCL S +   F  
Sbjct: 208 DVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSG 267

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           + +   L           +   + YTP + NP    R +    YYV L  ++VG  +V V
Sbjct: 268 SLKLGLL----------GQPKSIRYTPLLRNP---RRPSL---YYVNLTGVSVGSVQVPV 311

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
              YLT D +   GTI+DSGT  T  A  ++E + DEF  Q+  N +++  LGA      
Sbjct: 312 DPVYLTFDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV--NGSFS-TLGA-----F 363

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF    E     P++ LH     ++ LP+EN       G+  CL++   R+ +     
Sbjct: 364 DTCFSADNENVT--PKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 420

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ++ N Q QN  + +D+ N R+G   + C 
Sbjct: 421 VIANLQQQNLRILFDVPNSRIGIAPEPCN 449


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 124/390 (31%), Positives = 170/390 (43%), Gaps = 49/390 (12%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           S   G Y   L  GTP + +  +LDTGS +VW  C     C+ C S   P F P+ S + 
Sbjct: 136 SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA---PCRRCYSQSDPIFDPRKSKTY 192

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C +P C  +       R          K C      Y V YG G  T G   +ETL
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTR---------RKTCL-----YQVSYGDGSFTVGDFSTETL 238

Query: 202 NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
                 +    +GC   +       AG+ G G+GK S P Q       KFSYCL+     
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRS-- 296

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
            +++ SS++  N +     +     +TP ++NP +        +YYVGL  I+VGG RV 
Sbjct: 297 ASSKPSSVVFGNAAVSRIAR-----FTPLLSNPKL------DTFYYVGLLGISVGGTRVP 345

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            V      LD+ GNGG I+DSGT+ T +    +  + D F    V  +   R   A   +
Sbjct: 346 GVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF---RVGAKTLKR---APDFS 399

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
               CFD+        P + LHF+ GA+V+LP  NY   V      C          GG 
Sbjct: 400 LFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFAFA---GTMGGL 455

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           SII GN Q Q + V YDL + R+GF    C
Sbjct: 456 SII-GNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 121/386 (31%), Positives = 176/386 (45%), Gaps = 53/386 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y I +S+G PPQ    I+DTGS L W  C     CK C  +    F P  S+S + LG
Sbjct: 88  GEYLIDISYGNPPQKSTAIVDTGSDLNWVQC---LPCKSCYETLSAKFDPSKSASYKTLG 144

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS-ETLNLPN 205
           C +               C D P    ++C   C  Y  +YG G +   ALS + + +  
Sbjct: 145 CGS-------------NFCQDLPF---QSCAASC-QYDYMYGDGSSTSGALSTDDVTIGT 187

Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTR 259
             IPN   GC   ++ +     G+ G G+G  SL SQL      KFSYCL+      +T+
Sbjct: 188 GKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLG---STK 244

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           TS L + + +        G+ YTP + N      N +  +YY  L+ I+V G+ V     
Sbjct: 245 TSPLYIGDST-----LAGGVAYTPMLTN------NNYPTFYYAELQGISVEGKAVNYPAN 293

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              +   G GG I+DSGTT T++  + F P+    V+ +     Y  A G  +  GL  C
Sbjct: 294 TFDIAATGRGGLILDSGTTLTYLDVDAFNPM----VAALKAALPYPEADG--SFYGLEYC 347

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           F   G    ++P +  HF  GA+V L  +N F  +      CL + +    S     I G
Sbjct: 348 FSTAGVANPTYPTVVFHFN-GADVALAPDNTFIALDFEGTTCLAMASSTGFS-----IFG 401

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
           N Q  N+ + +DL N+R+GFK   C+
Sbjct: 402 NIQQLNHVIVHDLVNKRIGFKSANCE 427


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 115/383 (30%), Positives = 166/383 (43%), Gaps = 60/383 (15%)

Query: 95  FGTPPQIIPFILDTGSHLVWFPCTNHYQ-CKYCSSSKIPSFIP-KLSSSSRLLGCQNPKC 152
            GTPP  +   L+ G+ L+W    NH      C     P F P   S       C +PK 
Sbjct: 1   MGTPPNPVKLKLENGNELIW----NHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSPKF 56

Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
            W +   +      D+ + T      +        G+G +               +P   
Sbjct: 57  -WPNQTCVYTYSYGDKSVTTGF----LEVDKFTFVGAGAS---------------VPGVA 96

Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT----SSLI 264
            GC + ++        GIAGFGRG  SLPSQL +  FS+C        TT T    S+++
Sbjct: 97  FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCF-------TTITGAIPSTVL 149

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           LD  +         +  TP +     A+  A    YY+ L+ ITVG  R+ V      L 
Sbjct: 150 LDLPADLFSNGQGAVQTTPLI---QYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL- 205

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
            +G GGTI+DSGT+ T + P++++ + DEF +Q+         +     TG   CF  P 
Sbjct: 206 TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI------KLPVVPGNATGHYTCFSAPS 259

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGGPSIILGNF 441
           +     P+L LHF+ GA + LP ENY   V +    S +CL +    E +     I+GNF
Sbjct: 260 QAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGDETT-----IIGNF 313

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q QN +V YDL+N  L F    C
Sbjct: 314 QQQNMHVLYDLQNNMLSFVAAQC 336


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 128/412 (31%), Positives = 177/412 (42%), Gaps = 63/412 (15%)

Query: 75  TTTTTNISSHSYGG--YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP 132
           TT  T +S    G   Y + L+ GTPPQ +  +LDTGS L+W  C     C  C +   P
Sbjct: 86  TTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCA---PCASCLAQPDP 142

Query: 133 SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-L 191
            F P  S+S   + C    CS I H   +  D           CT     Y   YG G +
Sbjct: 143 LFAPGESASYEPMRCAGQLCSDILHHGCEMPD----------TCT-----YRYNYGDGTM 187

Query: 192 TEGIALSETLNLP----NRIIPNFL-VGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLD 243
           T G+  +E         +R++   L  GC   +V S    +GI GFGR   SL SQL++ 
Sbjct: 188 TMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIR 247

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPF---VNNPSVAERNAFSVY 299
           +FSYCL S+    + R S+L+  + S       TG +  TP    + NP+         +
Sbjct: 248 RFSYCLTSYG---SGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPT---------F 295

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           YYV L  +TVG +R+R+      L  DG+GG IVDSGT  T +   +   +   F  Q+ 
Sbjct: 296 YYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQL- 354

Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGS-------FPELKLHFKGGAEVTLPVENYFA 412
                  A G     G+  CF VP     S        P +  HF+  A++ LP  NY  
Sbjct: 355 ---RLPFANGGNPEDGV--CFLVPAAWRRSSSTSQVPVPRMVFHFQ-DADLDLPRRNYVL 408

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                  +CL +      SG     +GN   Q+  V YDL  + L F    C
Sbjct: 409 DDHRKGRLCLLLAD----SGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 124/393 (31%), Positives = 167/393 (42%), Gaps = 53/393 (13%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           S   G Y + +S GTPPQ    I+DTGS L W  C     C  C     P FIP  SSS 
Sbjct: 2   SAGSGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCA---PCARCFEQPDPLFIPLASSSY 58

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETL 201
               C +  C  +   +   R+           CT     Y   YG G  T G    ET+
Sbjct: 59  SNASCTDSLCDALPRPTCSMRN----------TCT-----YSYSYGDGSNTRGDFAFETV 103

Query: 202 NLPNRIIPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
            L    +     GC        AG   + G G+G  SLPSQLN      FSYCL+     
Sbjct: 104 TLNGSTLARIGFGCGHNQEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQS-- 161

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
            TT T S I    ++ + + +    +TP + N    E N    YYYVG+  I+VG +RV 
Sbjct: 162 -TTGTFSPITFGNAAENSRAS----FTPLLQN----EDNP--SYYYVGVESISVGNRRVP 210

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
                  +D +G GG I+DSGTT T+     F P+  E   Q+    +Y  A       G
Sbjct: 211 TPPSAFRIDANGVGGVILDSGTTITYWRLAAFIPILAELRRQI----SYPEA--DPTPYG 264

Query: 376 LRPCFDVPGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASG 432
           L  C+D+      S   P + +H     +  +PV N + +V   G  VC  + T  + S 
Sbjct: 265 LNLCYDISSVSASSLTLPSMTVHLT-NVDFEIPVSNLWVLVDNFGETVCTAMSTSDQFS- 322

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
               I+GN Q QN  +  D+ N R+GF    C 
Sbjct: 323 ----IIGNVQQQNNLIVTDVANSRVGFLATDCS 351


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 122/396 (30%), Positives = 169/396 (42%), Gaps = 57/396 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I L+ GTPPQ +  +LDTGS L+W  C     C  C +   P F P  SSS   + C 
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQCA---PCASCLAQPDPLFAPAASSSYVPMRCS 159

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPNRI 207
              C+ I H S Q  D           CT     Y   YG G T  G+  +E     +  
Sbjct: 160 GQLCNDILHHSCQRPD----------TCT-----YRYNYGDGTTTLGVYATERFTFASSS 204

Query: 208 IPNFLV----GCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
                V    GC  +   S    +GI GFGR   SL SQL++ +FSYCL  +    +TR 
Sbjct: 205 GEKLSVPLGFGCGTMNVGSLNNGSGIVGFGRDPLSLVSQLSIRRFSYCLTPYT---STRK 261

Query: 261 SSLI---LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           S+L+   L +G    D   TG      V    + +      +YYV    +TVG +R+R+ 
Sbjct: 262 STLMFGSLSDGVFEGDDAATGQ-----VQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIP 316

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                L  DG+GG IVDSGT  T     +   +   F +Q+     +T +   +   G+ 
Sbjct: 317 LSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQL--RLPFTSSSSPD--DGV- 371

Query: 378 PCFDVPGEKTG---------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
            CF  P    G         S P +  HF+ GA++ LP  NY        ++C+ +    
Sbjct: 372 -CFATPMAAGGRRASAATVVSVPRMAFHFQ-GADLELPRRNYVLDDPRRGSLCILLAD-- 427

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             SG     +GNF  Q+  V YDL  + L F    C
Sbjct: 428 --SGDSGATIGNFVQQDMRVLYDLEAETLSFAPAQC 461


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 144/487 (29%), Positives = 200/487 (41%), Gaps = 112/487 (22%)

Query: 13  IFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTT 72
           +     ++I+     +L   LS          ++ L  +   S  RA H+ + Q ++   
Sbjct: 8   VLMLLAVTIYSCDSANLRLQLSHVDAGRGLTHWELLRRMAQRSKARATHLLSAQDQSGRG 67

Query: 73  TTTTTTTNISSHSYG----GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS 128
            + +   N  ++  G     Y + L+ GTPPQ +   LDTGS + W       QCK C +
Sbjct: 68  RSASAPVNPGAYDDGFPFTEYLVHLAAGTPPQEVQLTLDTGSDITW------TQCKRCPA 121

Query: 129 SK-----IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
           S      +P F P  SSS   L C +P C      +  C   ND   ATS+ C     +Y
Sbjct: 122 SACFNQTLPLFDPSASSSFASLPCSSPAC----ETTPPCGGGND---ATSRPC-----NY 169

Query: 184 LVLYGSG-LTEGIALSETLNLPN-------RIIPNFLVGCS-----VLSSRQPAGIAGFG 230
            + YG G ++ G    E     +         +P  + GC      V +S +  GIAGFG
Sbjct: 170 SISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRGVFTSNE-TGIAGFG 228

Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           RG  SLPSQL +  FS+C        TT T S           K +  L   P V  PS 
Sbjct: 229 RGSLSLPSQLKVGNFSHCF-------TTITGS-----------KTSAVLLGLPGVAPPSA 270

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
           +          +G RR   G  R R      +  R  N      SGT+ T + P  +  +
Sbjct: 271 SP---------LGRRR---GSYRCR------STPRSSN------SGTSITSLPPRTYRAV 306

Query: 351 ADEFVSQM---VKNRNYTRALGAEALTGLRPCFDVP--GEKTGSFPELKLHFKGGAEVTL 405
            +EF +Q+   V   N T             CF  P  G K    P + LHF+ GA + L
Sbjct: 307 REEFAAQVKLPVVPGNATDPF---------TCFSAPLRGPKP-DVPTMALHFE-GATMRL 355

Query: 406 PVENY-FAVVGEGSA------VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
           P ENY F VV +  A      +CL V+   E      IILGN Q QN +V YDL+N +L 
Sbjct: 356 PQENYVFEVVDDDDAGNSSRIICLAVIEGGE------IILGNIQQQNMHVLYDLQNSKLS 409

Query: 459 FKQQLCK 465
           F    C 
Sbjct: 410 FVPAQCD 416


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 115/385 (29%), Positives = 161/385 (41%), Gaps = 52/385 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP     ++D+GS ++W  C     C  C +   P F P  S++   + 
Sbjct: 125 GEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK---PCLECYAQADPLFDPATSATFSAVP 181

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C      +++   C D     S  C      Y V YG G  T+G    ETL L  
Sbjct: 182 CGSAVC-----RTLRTSGCGD-----SGGC-----DYEVSYGDGSYTKGALALETLTLGG 226

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
             +    +GC   +       AG+ G G G  SL  QL       FSYCL S        
Sbjct: 227 TAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRG------ 280

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
             SL+L      S+    G  + P V NP          +YYVGL  I VG +R+ +   
Sbjct: 281 AGSLVL----GRSEAVPEGAVWVPLVRNPQAPS------FYYVGLSGIGVGDERLPLQED 330

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L  DG GG ++D+GT  T +  E +  L D FV+ +       RA G   L     C
Sbjct: 331 LFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAV---GALPRAPGVSLLD---TC 384

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F G A +TLP  N    V +G   CL       +S GPS ILG
Sbjct: 385 YDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEV-DGGIYCLAFA---PSSSGPS-ILG 439

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   +  D  N  +GF    C
Sbjct: 440 NIQQEGIQITVDSANGYIGFGPTTC 464


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 119/397 (29%), Positives = 168/397 (42%), Gaps = 54/397 (13%)

Query: 89  YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           Y + L+ GTPP  +PF+   DTGS L W  C     CK C     P +    S+S   + 
Sbjct: 95  YLMELAIGTPP--VPFVALADTGSDLTWTQCK---PCKLCFPQDTPIYDTAASASFSPVP 149

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-- 203
           C +  C  I   S  C      P             Y   Y  G  + G+  +ETL    
Sbjct: 150 CASATCLPIWRSSRNCTATTTSPC-----------RYRYAYDDGAYSAGVLGTETLTFAG 198

Query: 204 -------PNRIIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHK 253
                  P   +     GC V +   S    G  G GRG  SL +QL + KFSYCL    
Sbjct: 199 SSPGAPGPGVSVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCL--TD 256

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTG---LTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
           F +T+  S ++  + +  +   T G   +  TP V  P    R      YYV L  I++G
Sbjct: 257 FFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSR------YYVSLEGISLG 310

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
             R+ + +    L  DG+GG IVDSGT FT +    F  + +      V N+    A   
Sbjct: 311 DARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAG--VLNQPVVNASSL 368

Query: 371 EALTGLRPCFDVPG--EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
           ++     PCF      ++    P++ LHF GGA++ L  +NY +   E S+ CL +    
Sbjct: 369 DS-----PCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAP 423

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            A G    ILGNFQ QN  + +D+   +L F    C 
Sbjct: 424 SAYGS---ILGNFQQQNIQMLFDITVGQLSFVPTDCS 457


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 120/424 (28%), Positives = 185/424 (43%), Gaps = 68/424 (16%)

Query: 60  LHIKNPQTKTTTTTTTTTTTNI------SSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
           L +K+ + K +  ++TT   N       ++H  GGY++++  GTP +    + DTGS L 
Sbjct: 97  LRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTGSDLT 156

Query: 114 WFPCTNHYQCKYCSSSKIP----SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEP 169
           W       QC+ CS    P     F P  S+S + L C +  C  I  ES Q        
Sbjct: 157 W------TQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQ-------G 203

Query: 170 LATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-PNRIIPNFLVGCSVLSSRQ---PAG 225
            ++S +C      Y V YG+G T G   +ETL + P+ +  NF++GC   +  +    AG
Sbjct: 204 CSSSNSCL-----YGVKYGTGYTVGFLATETLTITPSDVFENFVIGCGERNGGRFSGTAG 258

Query: 226 IAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYT 282
           + G GR   +LPSQ +    + FSYCL +     ++ T  L    G S + K      +T
Sbjct: 259 LLGLGRSPVALPSQTSSTYKNLFSYCLPA----SSSSTGHLSFGGGVSQAAK------FT 308

Query: 283 PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFM 342
           P  +             Y + +  I+VGG+++ +             GTI+DSGTT T++
Sbjct: 309 PITSK--------IPELYGLDVSGISVGGRKLPIDPSVFR-----TAGTIIDSGTTLTYL 355

Query: 343 APELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGG 400
                  L+  F   M    NYT   G    +GL+PC+D         + P++ + F+GG
Sbjct: 356 PSTAHSALSSAFQEMMT---NYTLTKGT---SGLQPCYDFSKHANDNITIPQISIFFEGG 409

Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
            EV +     F        VCL    +   +     I GN Q + Y V YD+    +GF 
Sbjct: 410 VEVDIDDSGIFIAANGLEEVCLAFKDNGNDT--DVAIFGNVQQKTYEVVYDVAKGMVGFA 467

Query: 461 QQLC 464
              C
Sbjct: 468 PGGC 471


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 127/397 (31%), Positives = 180/397 (45%), Gaps = 61/397 (15%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI-------PSFIPKLSS 140
           G+S+++    P ++I   +DTGS L+W       QCK  SS+         P + P  SS
Sbjct: 15  GHSLTVGIVQPRKLI---VDTGSDLIW------TQCKLSSSTAAAARHGSPPVYDPGESS 65

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN-CTQICPSYLVLYGSGLTEGIALSE 199
           +   L C +  C          ++C      TSKN C      Y  +YGS    G+  SE
Sbjct: 66  TFAFLPCSDRLC---QEGQFSFKNC------TSKNRCV-----YEDVYGSAAAVGVLASE 111

Query: 200 TLNLPNRIIPNFLVG--CSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
           T     R   +  +G  C  LS+       GI G      SL +QL + +FSYCL    F
Sbjct: 112 TFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLT--PF 169

Query: 255 DDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
            D  +TS L+    +  S  KTT  +  T  V+NP        +VYYYV L  I++G +R
Sbjct: 170 ADK-KTSPLLFGAMADLSRHKTTRPIQTTAIVSNP------VETVYYYVPLVGISLGHKR 222

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + V    L +  DG GGTIVDSG+T  ++    FE +  E V  +V+     R +    L
Sbjct: 223 LAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVANRTVEDYEL 281

Query: 374 TGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
                CF +P     +       P L LHF GGA + LP +NYF     G  +CL V   
Sbjct: 282 -----CFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAG-LMCLAVGKT 335

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + SG    I+GN Q QN +V +D+++ +  F    C
Sbjct: 336 TDGSG--VSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 127/461 (27%), Positives = 196/461 (42%), Gaps = 44/461 (9%)

Query: 24  SSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISS 83
           S+ T+    L   H  P     Q+L+S +             Q    T++ +   +  SS
Sbjct: 19  STSTTEYLKLPLLHKTPFPTPLQSLSSDLQRLSLLHHSHHRHQNHRRTSSKSPLMSGASS 78

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSS 141
            S G Y +S+  G+PPQ +  + DTGS L W  C+    CK   S   P  +F+ + S++
Sbjct: 79  GS-GQYFVSIRLGSPPQTLLLVADTGSDLTWVRCS---ACKTNCSIHPPGSTFLARHSTT 134

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
                C +  C  +   +     CN   L ++         Y  +Y  G  T G    ET
Sbjct: 135 FSPTHCFSSLCQLVPQPNPN--PCNHTRLHSTCR-------YEYVYSDGSKTSGFFSKET 185

Query: 201 LNL-----PNRIIPNFLVGCSVLSS---------RQPAGIAGFGRGKTSLPSQLNLD--- 243
             L         + +   GC   +S            +G+ G GRG  S  SQL      
Sbjct: 186 TTLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGR 245

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
            FSYCLL +       +  +I D  S+  D K+  +++TP + NP          +YY+ 
Sbjct: 246 SFSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSM-MSFTPLLINPEAP------TFYYIS 298

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           ++ + V G ++ +     +LD  GNGGT++DSGTT TF+    +  +   F  + VK  +
Sbjct: 299 IKGVFVDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKRE-VKLPS 357

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
            T   GA   +G   C +V G     FP L L   G +  + P  NYF  + EG   CL 
Sbjct: 358 PTPG-GASTRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEG-IKCL- 414

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +   EA  G   ++GN   Q + +E+D    RLGF ++ C
Sbjct: 415 AIQPVEAESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 136/440 (30%), Positives = 191/440 (43%), Gaps = 60/440 (13%)

Query: 43  DSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQII 102
           D+     +L  S+  R      P+   +     T  + +   S G Y + +  GTPP+  
Sbjct: 106 DTMHRRAALSGSAAAR--RDSAPRRALSERVVATVESGVPVGS-GEYLVDVYLGTPPRRF 162

Query: 103 PFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWI----HHE 158
             I+DTGS L W  C     C  C     P F P  S S R + C + +C  +       
Sbjct: 163 RMIMDTGSDLNWLQCA---PCLDCFEQSGPIFDPAASISYRNVTCGDDRCRLVSPPAESA 219

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNLPN---RIIPNFL 212
             +CR    +P          CP Y   YG  S  T  +AL   T+NL     R +    
Sbjct: 220 PRECRRPRSDP----------CP-YYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVA 268

Query: 213 VGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSHKFDDTTRTSSLIL 265
            GC   +       AG+ G GRG  S  SQL        FSYCL+ H    +   S +I 
Sbjct: 269 FGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHG---SAAGSKIIF 325

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
                H D     L   P +N  + A       +YY+ L+ I VGG+ V +    L+   
Sbjct: 326 ----GHDD----ALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLS--- 374

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
              GGTI+DSGTT ++     ++ +   F+ +M  + +Y   LG   L+   PC++V G 
Sbjct: 375 --AGGTIIDSGTTLSYFPEPAYQAIRQAFIDRM--SPSYPLILGFPVLS---PCYNVSGA 427

Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQ 444
           +    PEL L F  GA    P ENYF  +     +CL V+ T R    G SII GN+Q Q
Sbjct: 428 EKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRS---GMSII-GNYQQQ 483

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
           N++V YDL + RLGF  + C
Sbjct: 484 NFHVLYDLEHNRLGFAPRRC 503


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 119/391 (30%), Positives = 171/391 (43%), Gaps = 53/391 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTP + +   LDTGS LVW  C     C+ C    +P   P  SS+   L C 
Sbjct: 84  YLVRLAVGTPRRPVALTLDTGSDLVWTQCA---PCRDCFDQDLPVLDPAASSTYAALPCG 140

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR- 206
             +C  +   S     C    L   ++C      Y   YG   LT G   ++     +  
Sbjct: 141 AARCRALPFTS-----CGVRTLGNHRSCI-----YAYHYGDKSLTVGEIATDRFTFGDSG 190

Query: 207 ------IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
                        GC  L+         GIAGFGRG+ SLPSQLN+  FSYC  S  F+ 
Sbjct: 191 GSGESLHTRRLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTS-MFES 249

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            +   +L     + +S   +  +  TP + NPS          Y++ L+ I+VG  R+ V
Sbjct: 250 KSSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPS------LYFLSLKGISVGKTRLPV 303

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                         TI+DSG + T +  E++E +  EF +Q+    +     G E  + L
Sbjct: 304 PETKFR-------STIIDSGASITTLPEEVYEAVKAEFAAQVGLPPS-----GVEG-SAL 350

Query: 377 RPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
             CF +P     +  + P L LH + GA+  LP  NY  V  +  A  + +V D  A+ G
Sbjct: 351 DLCFALPVTALWRRPAVPSLTLHLE-GADWELPRSNY--VFEDLGARVMCIVLD--AAPG 405

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              ++GNFQ QN +V YDL N RL F    C
Sbjct: 406 EQTVIGNFQQQNTHVVYDLENDRLSFAPARC 436


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 118/405 (29%), Positives = 180/405 (44%), Gaps = 54/405 (13%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
             Y  +S+ L  G+  + +  I+DTGS  V   C          S   P F P  S S R
Sbjct: 95  EDYALFSMQLGIGSLQKNLSAIIDTGSEAVLVQC---------GSRSRPVFDPAASQSYR 145

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEP-LATSKNCTQICPSYLVLYGSG------LTEGIA 196
            + C +  C  +     Q  + + +P + +S  CT     Y + YG         ++ + 
Sbjct: 146 QVPCISQLCLAVQQ---QTSNGSSQPCVNSSATCT-----YSLSYGDSRNSTGDFSQDVI 197

Query: 197 LSETLNLPNRIIP--NFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKF 245
              + N   + +   +   GC+      L      GI GF RG  SLPSQL       KF
Sbjct: 198 FLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKF 257

Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
           SYC  S  +         + D+G S S      + YTP ++NP    R   S  YYVGL 
Sbjct: 258 SYCFPSQPWQPRATGVIFLGDSGLSKSK-----VGYTPLLDNPVTPAR---SQLYYVGLT 309

Query: 306 RITVGGQRVRVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
            I+V G+ + +      LD   G+GGT++DSGTTFT +  + +    + F +    NR+ 
Sbjct: 310 SISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAA---SNRSG 366

Query: 365 TRA-LGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV---GEGSA 419
            R  +GA A  G   C+++  G      PE++L  +    + L  E+ F  V   G    
Sbjct: 367 LRKKVGAAA--GFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVT 424

Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           VCL +++ +++  G   +LGN+Q  NY VEYD    R+GF++  C
Sbjct: 425 VCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 469


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 131/419 (31%), Positives = 188/419 (44%), Gaps = 50/419 (11%)

Query: 54  SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
           +SL  A+   N +T+      +++ T+  +   G Y   L  GTP + +  +LDTGS +V
Sbjct: 113 TSLAAAVGSTN-RTRARGPGFSSSVTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVV 171

Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
           W  C     CK C S   P F P  S S   + C +P C  +           D P  ++
Sbjct: 172 WIQCA---PCKKCYSQTDPVFNPTKSRSFANIPCGSPLCRRL-----------DSPGCST 217

Query: 174 KNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR---QPAGIAGF 229
           K    IC  Y V YG G  T G   +ETL      +    +GC   +       AG+ G 
Sbjct: 218 KK--HIC-LYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDNEGLFIGAAGLLGL 274

Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
           GRG+ S PSQ+      KFSYCL+      +++ S ++  + +     +     +TP V+
Sbjct: 275 GRGRLSFPSQIGRRFSRKFSYCLVDRS--ASSKPSYMVFGDSAISRTAR-----FTPLVS 327

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
           NP +        +YYV L  ++VGG RV  +      LD  GNGG I+DSGT+ T +   
Sbjct: 328 NPKL------DTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRP 381

Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
            +  L D F    V   N  R   A   +    CFD+ G+     P + LHF+ GA+V+L
Sbjct: 382 AYVALRDAF---RVGASNLKR---APEFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSL 434

Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           P  NY   V    + C           G SI+ GN Q Q + V YDL   R+GF  + C
Sbjct: 435 PASNYLIPVDNSGSFCFAFAGTMS---GLSIV-GNIQQQGFRVVYDLAASRVGFAPRGC 489


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 119/384 (30%), Positives = 168/384 (43%), Gaps = 55/384 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  + GTP Q +   LDT +   W PC+    C  CSSS +  F P  SSSSR L C+
Sbjct: 88  YIVRANIGTPAQPMLVALDTSNDAAWIPCSG---CVGCSSSVL--FDPSKSSSSRTLQCE 142

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P          QC+   +     SK+C      + + YG    E     +TL L + +I
Sbjct: 143 AP----------QCKQAPNPSCTVSKSC-----GFNMTYGGSTIEAYLTQDTLTLASDVI 187

Query: 209 PNFLVGC--SVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
           PN+  GC      +  PA G+ G GRG  SL SQ   L    FSYCL + K  + + +  
Sbjct: 188 PNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLR 247

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L   N       +   +  TP + NP    R++    YYV L  I VG + V +    L 
Sbjct: 248 LGPKN-------QPIRIKTTPLLKNP---RRSSL---YYVNLVGIRVGNKIVDIPTSALA 294

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            D     GTI DSGT +T +    +  + +EF  + VKN N      A +L G   C+  
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEF-RRRVKNAN------ATSLGGFDTCY-- 345

Query: 383 PGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
               +GS  FP +   F  G  VTLP +N       G+  CL +            ++ +
Sbjct: 346 ----SGSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIAS 400

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q QN+ V  D+ N RLG  ++ C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 125/399 (31%), Positives = 180/399 (45%), Gaps = 60/399 (15%)

Query: 95  FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
            G PPQ    I+DTGS+L+W  C+   +   C    +  + P  S +++ + C +  C  
Sbjct: 90  IGDPPQQAAAIIDTGSNLIWTQCST-CRANGCFGQDLTFYDPSRSRTAKPVACNDTAC-L 147

Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL------PNRII 208
           +  E+   RD         K C     + L  YG+G   G   +E           N + 
Sbjct: 148 LGSETRCARD--------GKAC-----AVLTAYGAGAIGGFLGTEVFTFGHGQSSENNV- 193

Query: 209 PNFLVGCSVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
            +   GC   S   P      +GI G GRGK SLPSQL  +KFSYCL  + F D   TS+
Sbjct: 194 -SLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNKFSYCLTPY-FSDAANTST 251

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L +   +  S       T  PF+ NP   + + F  +YY+ L  ITVG  ++ V      
Sbjct: 252 LFVGASAGLSGGGAPA-TSVPFLKNP---DDDPFDSFYYLPLTGITVGTAKLDVPAAAFD 307

Query: 323 LDRDGN---GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
           L        GGT++DSG+ FT +    ++ L DE V Q+  +     A GAE   GL  C
Sbjct: 308 LREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPA-GAE---GLDLC 363

Query: 380 FD--VPGEKTGSFPELKLHF----KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
                PG+     P L LHF     GG +V +P ENY+  V + +A C+ V +    SGG
Sbjct: 364 VGGVAPGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTA-CMVVFS----SGG 418

Query: 434 P--------SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           P        + I+GN+  Q+ ++ YDL    L F+   C
Sbjct: 419 PNSTLPLNETTIIGNYMQQDMHLLYDLGQGVLSFQPADC 457


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 127/395 (32%), Positives = 177/395 (44%), Gaps = 64/395 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + ++ GTPP  +  I DTGS LVW  C+++      S   +  F P  S++  LL CQ
Sbjct: 100 YLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAV-VFHPSRSTTYSLLSCQ 158

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
           +  C  +   S    D + E       C      Y   YG G  T G+  +ET +     
Sbjct: 159 SAACQALSQASC---DADSE-------C-----QYQYAYGDGSRTIGVLSTETFSFAAAG 203

Query: 208 --------IPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLSH 252
                   +P    GCS  S  S +  G+ G G G  SL SQL        +FSYCL+  
Sbjct: 204 GGGEGQVRVPRVSFGCSTGSAGSFRSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVP- 262

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
            +     +S+L     +  SD    G   TP V  PS  +      YY V L  + V GQ
Sbjct: 263 PYAAANSSSTLSFGARAVVSDP---GAASTPLV--PSEVDS-----YYTVALESVAVAGQ 312

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            V   +         +   IVDSGTT TF+ P L  PL    V+++ +     RA   E 
Sbjct: 313 DVASAN---------SSRIIVDSGTTLTFLDPALLRPL----VAELERRIRLPRAQPPEQ 359

Query: 373 LTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
           L  L+ C+DV G+        P++ L F GGA VTL  EN F+++ EG+ +CL +V   E
Sbjct: 360 L--LQLCYDVQGKSQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEEGT-LCLVLVPVSE 416

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +   P  ILGN   QN++V YDL  + + F    C
Sbjct: 417 SQ--PVSILGNIAQQNFHVGYDLDARTVTFAAVDC 449


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 119/384 (30%), Positives = 168/384 (43%), Gaps = 55/384 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  + GTP Q +   LDT +   W PC+    C  CSSS +  F P  SSSSR L C+
Sbjct: 88  YIVRANIGTPAQPMLVALDTSNDAAWIPCSG---CVGCSSSVL--FDPSKSSSSRTLQCE 142

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P          QC+   +     SK+C      + + YG    E     +TL L + +I
Sbjct: 143 AP----------QCKQAPNPSCTVSKSC-----GFNMTYGGSTIEAYLTQDTLTLASDVI 187

Query: 209 PNFLVGC--SVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
           PN+  GC      +  PA G+ G GRG  SL SQ   L    FSYCL + K  + + +  
Sbjct: 188 PNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLR 247

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L   N       +   +  TP + NP    R++    YYV L  I VG + V +    L 
Sbjct: 248 LGPKN-------QPIRIKTTPLLKNP---RRSSL---YYVNLVGIRVGNKIVDIPTSALA 294

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            D     GTI DSGT +T +    +  + +EF  + VKN N      A +L G   C+  
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEF-RRRVKNAN------ATSLGGFDTCY-- 345

Query: 383 PGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
               +GS  FP +   F  G  VTLP +N       G+  CL +            ++ +
Sbjct: 346 ----SGSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIAS 400

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q QN+ V  D+ N RLG  ++ C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 119/393 (30%), Positives = 173/393 (44%), Gaps = 43/393 (10%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ILDTGS L W  C     C  C       + PK S+S + + 
Sbjct: 168 GEYFMDVLVGSPPKHFSLILDTGSDLNWIQC---LPCYDCFQQNGAFYDPKASASYKNIT 224

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--LNLP 204
           C + +C+ +          + +P    K+  Q CP Y     S  T G    ET  +NL 
Sbjct: 225 CNDQRCNLVS---------SPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLT 275

Query: 205 NR-------IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLS 251
                     + N + GC   +       AG+ G GRG  S  SQL       FSYCL+ 
Sbjct: 276 TNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVD 335

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
               DT  +S LI   G          L +T FV      + N    +YYV ++ I V G
Sbjct: 336 RN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFV----AGKENLVDTFYYVQIKSILVAG 388

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           + + +  +   +  DG GGTI+DSGTT ++ A    EP A EF+   +  +   +     
Sbjct: 389 EVLNIPEETWNISSDGAGGTIIDSGTTLSYFA----EP-AYEFIKNKIAEKAKGKYPVYR 443

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
               L PCF+V G      PEL + F  GA    P EN F  + E   VCL ++   +++
Sbjct: 444 DFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSA 502

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                I+GN+Q QN+++ YD +  RLG+    C
Sbjct: 503 FS---IIGNYQQQNFHILYDTKRSRLGYAPTKC 532


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 119/388 (30%), Positives = 166/388 (42%), Gaps = 60/388 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +S+  GTP +    I DTGS L W  C     C  C   + P F P LSS+   + 
Sbjct: 147 GNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK---PCADCYEQQDPLFDPSLSSTYAAVA 203

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-P 204
           C  P+C     + +    C+ +       C      Y V YG    T+G  + +TL L  
Sbjct: 204 CGAPEC-----QELDASGCSSD-----SRCR-----YEVQYGDQSQTDGNLVRDTLTLSA 248

Query: 205 NRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
           +  +P F+ GC   ++    Q  G+ G GR K SLPSQ        F+YCL S       
Sbjct: 249 SDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS------- 301

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA-ERNAFSVYYYVGLRRITVGGQRVRVW 317
                      S S +    L   P  N    A    A   +YY+ L  I VGG+ +R+ 
Sbjct: 302 -----------SSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIP 350

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                        T++DSGT  T + P  + PL   F   M + +       A AL+ L 
Sbjct: 351 ATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKK------APALSILD 400

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI- 436
            C+D  G +T   P ++L F GGA V+L        V + S  CL    + + S   SI 
Sbjct: 401 TCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVL-YVSKVSQACLAFAPNADDS---SIA 456

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ILGN Q + + V YD+ NQR+GF  + C
Sbjct: 457 ILGNTQQKTFAVAYDVANQRIGFGAKGC 484


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 125/390 (32%), Positives = 171/390 (43%), Gaps = 49/390 (12%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           S   G Y   L  GTPP+ +  +LDTGS +VW  C     C  C S     F P  S S 
Sbjct: 124 SQGSGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCK---PCTKCYSQTDQIFDPSKSKSF 180

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C +P C  +           D P  + KN   +C  Y V YG G  T G   +ETL
Sbjct: 181 AGIPCYSPLCRRL-----------DSPGCSLKN--NLC-QYQVSYGDGSFTFGDFSTETL 226

Query: 202 NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
                 +P   +GC   +       AG+ G GRG  S P+Q      +KFSYCL      
Sbjct: 227 TFRRAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRT-- 284

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
            + + SS++  + +     +     +TP V NP +        +YYV L  I+VGG  VR
Sbjct: 285 ASAKPSSIVFGDSAVSRTAR-----FTPLVKNPKL------DTFYYVELLGISVGGAPVR 333

Query: 316 -VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            +   +  LD  GNGG I+DSGT+ T +    +  L D F    V   +  R   A   +
Sbjct: 334 GISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAF---RVGASHLKR---APEFS 387

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
               C+D+ G      P + LHF+ GA+V+LP  NY   V    + C           G 
Sbjct: 388 LFDTCYDLSGLSEVKVPTVVLHFR-GADVSLPAANYLVPVDNSGSFCFAFA---GTMSGL 443

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           SII GN Q Q + V +DL   R+GF  + C
Sbjct: 444 SII-GNIQQQGFRVVFDLAGSRVGFAPRGC 472


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 119/388 (30%), Positives = 166/388 (42%), Gaps = 60/388 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +S+  GTP +    I DTGS L W  C     C  C   + P F P LSS+   + 
Sbjct: 147 GNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK---PCADCYEQQDPLFDPSLSSTYAAVA 203

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-P 204
           C  P+C     + +    C+ +       C      Y V YG    T+G  + +TL L  
Sbjct: 204 CGAPEC-----QELDASGCSSD-----SRCR-----YEVQYGDQSQTDGNLVRDTLTLSA 248

Query: 205 NRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
           +  +P F+ GC   ++    Q  G+ G GR K SLPSQ        F+YCL S       
Sbjct: 249 SDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS------- 301

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA-ERNAFSVYYYVGLRRITVGGQRVRVW 317
                      S S +    L   P  N    A    A   +YY+ L  I VGG+ +R+ 
Sbjct: 302 -----------SSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIP 350

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                        T++DSGT  T + P  + PL   F   M + +       A AL+ L 
Sbjct: 351 ATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKK------APALSILD 400

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI- 436
            C+D  G +T   P ++L F GGA V+L        V + S  CL    + + S   SI 
Sbjct: 401 TCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVL-YVSKVSQACLAFAPNADDS---SIA 456

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ILGN Q + + V YD+ NQR+GF  + C
Sbjct: 457 ILGNTQQKTFAVTYDVANQRIGFGAKGC 484


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 120/385 (31%), Positives = 170/385 (44%), Gaps = 42/385 (10%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +SL  GTPPQ    ILDTGS L W  C      K   SS    F P LSSS  +L C +P
Sbjct: 84  VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSS---VFDPSLSSSFSVLPCNHP 140

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP-NRII 208
            C              D  L TS +  ++C  Y   Y  G L EG  + E +    ++  
Sbjct: 141 LCK---------PRIPDFTLPTSCDQNRLC-HYSYFYADGTLAEGNLVREKITFSRSQST 190

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT-TRTSSLIL-D 266
           P  ++GC+  SS    GI G   G+ S  SQ  L KFSYC+ + +     T T S  L +
Sbjct: 191 PPLILGCAEESS-DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGE 249

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
           N +S   +    LT++     P     N   + Y V ++ I +G Q++ +       D  
Sbjct: 250 NPNSGGFRYINLLTFSQSQRMP-----NLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPS 304

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL------RPCF 380
           G G T++DSG+ FT++  E +  + +E V          R +GA    G         CF
Sbjct: 305 GAGQTMIDSGSEFTYLVDEAYNKVREEVV----------RLVGARLKKGYVYGGVSDMCF 354

Query: 381 DVPGEKTGSF-PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +    + G     +   F  G E+ +  E   A VG G  V    +   E  G  S I+G
Sbjct: 355 NGNAIEIGRLIGNMVFEFDKGVEIVVEKERVLADVGGG--VHCVGIGRSEMLGAASNIIG 412

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           NF  QN +VE+DL N+R+GF +  C
Sbjct: 413 NFHQQNIWVEFDLANRRVGFGKADC 437


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 126/412 (30%), Positives = 179/412 (43%), Gaps = 66/412 (16%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           SS   G Y + L  GTP +  P I+DTGS L W  C         SS   P +    SSS
Sbjct: 20  SSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSS 79

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS---YLVLYGS-GLTEGIAL 197
            R + C + +C ++             P     +C+   PS   Y   Y     T GI  
Sbjct: 80  YREIPCTDDECLFL-------------PAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILA 126

Query: 198 SETLNLPNRI---------------IPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPS 238
            ET+++ +R                I N  +GCS  S        +G+ G G+G  SL +
Sbjct: 127 YETISMKSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLAT 186

Query: 239 QLNLDK----FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
           Q         FSYCL+ +    +  +S L++  G +   K    L +TP V NP      
Sbjct: 187 QTRHTALGGIFSYCLVDY-LRGSNASSFLVM--GRTRWRK----LAHTPIVRNP------ 233

Query: 295 AFSVYYYVGLRRITVGGQRVR-VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
           A   +YYV +  + V G+ V  +      +D DGN GTI DSGTT ++    L EP   +
Sbjct: 234 AAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSY----LREPAYSK 289

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
            +  +  +    RA   E   G   C++V   + G  P+L + F+GGA + LP  NY  +
Sbjct: 290 VLGALNASIYLPRA--QEIPEGFELCYNVTRMEKG-MPKLGVEFQGGAVMELPWNNYMVL 346

Query: 414 VGEG-SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           V E    V L  VT    S     ILGN   Q++++EYDL   R+GFK   C
Sbjct: 347 VAENVQCVALQKVTTTNGSN----ILGNLLQQDHHIEYDLAKARIGFKWSPC 394


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 131/404 (32%), Positives = 181/404 (44%), Gaps = 64/404 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y++ +  G+PP+    I+DTGS LVW  C     C  C S   P + P  SS+     
Sbjct: 2   GAYTMEIELGSPPKKFNAIVDTGSDLVWIQCK---PCSQCYSQSDPIYDPSASST----- 53

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-- 203
                CS    +S+    C+    +++K C      Y   YG S  T+G    ETL L  
Sbjct: 54  FAKTSCSTSSCQSLPASGCS----SSAKTCI-----YGYQYGDSSSTQGDFALETLTLRS 104

Query: 204 ---PNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
               ++  PNF  GC  L+S      AGI G G+GK SL +QL     +KFSYCL+    
Sbjct: 105 SGGSSKAFPNFQFGCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFD- 163

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR- 313
           DD+++TS LI   GSS S    +G   TP + N      +  S YY+VGL  I+VGG++ 
Sbjct: 164 DDSSKTSPLIF--GSSAS--TGSGAISTPIIPN------SGRSTYYFVGLEGISVGGKQL 213

Query: 314 -----------VRVWHKYLTLDRDGN-GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
                      VR   K      + N GGTI DSGTT T +   ++  +   F S +   
Sbjct: 214 SLATRAIDFLSVRSKKKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSV--- 270

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV- 420
              +      + +G   C+DV   K   FP L L FK G + + P +NYF +V     V 
Sbjct: 271 ---SLPTVDASSSGFDLCYDVSKSKNFKFPALTLAFK-GTKFSPPQKNYFVIVDTAETVA 326

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           CL +            I+GN   QNY+V YD     +      C
Sbjct: 327 CLAMGGSGSLG---LGIIGNLMQQNYHVVYDRGTSTISMSPAQC 367


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 121/416 (29%), Positives = 172/416 (41%), Gaps = 79/416 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + LS GTPP+ +   LDTGS LVW  C     C       IP   P  SS+   + C 
Sbjct: 94  YLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNC--FDQGAIPVLDPAASSTHAAVRCD 151

Query: 149 NPKC-------------SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEG 194
            P C             SW     +      D+ +   K       S    +G G   +G
Sbjct: 152 APVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGK-----LASDRFTFGPGDNADG 206

Query: 195 IALSETLNLPNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
             +SE              GC   +         GIAGFGRG+ SLPSQL +  FSYC  
Sbjct: 207 GGVSER---------RLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFT 257

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           S  F+ T+   +L    G + ++   TG +  TP + +PS          Y++ L+ ITV
Sbjct: 258 S-MFESTSSLVTL----GVAPAELHLTGQVQSTPLLRDPSQPS------LYFLSLKAITV 306

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           G  R+ +  +   L        I+DSG + T +  +++E +  EFV+Q+         L 
Sbjct: 307 GATRIPIPERRQRLR---EASAIIDSGASITTLPEDVYEAVKAEFVAQV--------GLP 355

Query: 370 AEALTG--LRPCFDVPGEKTGS-----------------FPELKLHFKGGAEVTLPVENY 410
             A+ G  L  CF +P                        P L  H  GGA+  LP ENY
Sbjct: 356 VSAVEGSALDLCFALPSAAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENY 415

Query: 411 FAVVGEGSAVCLTVVTDREASGGP-SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             V  +  A  + +V D    GG  ++++GN+Q QN +V YDL N  L F    C+
Sbjct: 416 --VFEDYGARVMCLVLDAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARCE 469


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 115/383 (30%), Positives = 164/383 (42%), Gaps = 48/383 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G+P + +  +LDTGS + W  C     C  C     P F P LS+S   + 
Sbjct: 161 GEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSTSYASVA 217

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C NP+C             +D   A  +N T  C  Y V YG G  T G   +ETL L +
Sbjct: 218 CDNPRC-------------HDLDAAACRNSTGAC-LYEVAYGDGSYTVGDFATETLTLGD 263

Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
              + +  +GC   +       AG+   G G  S PSQ++   FSYCL+     D+  +S
Sbjct: 264 SAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSS 320

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L   + +   D + T     P + +P        S +YYVGL  I+VGGQ + +     
Sbjct: 321 TLQFGDAA---DAEVT----APLIRSPRT------STFYYVGLSGISVGGQILSIPPSAF 367

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            +D  G GG IVDSGT  T +    +  L D FV      ++  R  G   ++    C+D
Sbjct: 368 AMDGTGAGGVIVDSGTAVTRLQSSAYAALRDAFVR---GTQSLPRTSG---VSLFDTCYD 421

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
           +    +   P + L F GG E+ LP +NY   V      CL       A      I+GN 
Sbjct: 422 LSDRTSVEVPAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGNV 477

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q Q   V +D     +GF    C
Sbjct: 478 QQQGTRVSFDTAKSTVGFTSNKC 500


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 124/391 (31%), Positives = 170/391 (43%), Gaps = 51/391 (13%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           S   G Y   L  GTP + +  +LDTGS +VW  C     C+ C S   P F P+ S + 
Sbjct: 136 SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA---PCRRCYSQSDPIFDPRKSKTY 192

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C +P C  +       R          K C      Y V YG G  T G   +ETL
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTR---------RKTCL-----YQVSYGDGSFTVGDFSTETL 238

Query: 202 NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
                 +    +GC   +       AG+ G G+GK S P Q       KFSYCL+     
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRS-- 296

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
            +++ SS++  N +     +     +TP ++NP +        +YYV L  I+VGG RV 
Sbjct: 297 ASSKPSSVVFGNAAVSRIAR-----FTPLLSNPKL------DTFYYVELLGISVGGTRVP 345

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG-AEAL 373
            V      LD+ GNGG I+DSGT+ T +    +  + D F       R   +AL  A   
Sbjct: 346 GVAASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF-------RVGAKALKRAPDF 398

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           +    CFD+        P + LHF+ GA+V+LP  NY   V      C          GG
Sbjct: 399 SLFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFAFAG---TMGG 454

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            SII GN Q Q + V YDL + R+GF    C
Sbjct: 455 LSII-GNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 120/390 (30%), Positives = 171/390 (43%), Gaps = 57/390 (14%)

Query: 89  YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           Y + L+ GTPP  +PFI   DTGS L W  C     CK C     P +    SSS   L 
Sbjct: 83  YLMELAIGTPP--VPFIALADTGSDLTWTQCK---PCKLCFGQDTPIYDTTTSSSFSPLP 137

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +  C                P+ +S+  T   PS    Y     +G    E   +   
Sbjct: 138 CSSATC---------------LPIWSSRCST---PSATCRYRYAYDDGAYSPECAGIS-- 177

Query: 207 IIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
            +     GC V +   S    G  G GRG  SL +QL + KFSYCL    F +T+ +S +
Sbjct: 178 -VGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLT--DFFNTSLSSPV 234

Query: 264 ILDNGSSHSDKKTTG----LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
              + +  +    +     +  TP V +P    R      YYV L  I++G  R+ + + 
Sbjct: 235 FFGSLAELAASSASADAAVVQSTPLVQSPYNPSR------YYVSLEGISLGDARLPIPNG 288

Query: 320 YLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              L D DG+GG IVDSGT FT +    F  + D     + +       + A +L   RP
Sbjct: 289 TFDLNDDDGSGGMIVDSGTIFTILVETGFRVVVDHVAGVLGQ-----PVVNASSLD--RP 341

Query: 379 CFDVPG---EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
           CF  P    ++    P++ LHF GGA++ L  +NY +   E S+ CL +V    ASG   
Sbjct: 342 CFPAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGS-- 399

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            +LGNFQ QN  + +D+   +L F    C 
Sbjct: 400 -VLGNFQQQNIQMLFDITVGQLSFMPTDCS 428


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 122/442 (27%), Positives = 183/442 (41%), Gaps = 59/442 (13%)

Query: 33  LSRFHTNPSQ-DSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSI 91
           LSR H + S+  +      L+ + ++++  +K  QT+      +T  ++ +S   G Y  
Sbjct: 103 LSRLHRDSSRVQAITTRLQLILNGVSKS-DLKPLQTEIQPQDLSTPVSSGTSQGSGEYFT 161

Query: 92  SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
            +  G P +    +LDTGS + W  C     C  C     P F P  SSS   L C + +
Sbjct: 162 RVGVGNPAKSYYMVLDTGSDINWIQCQ---PCSDCYQQSDPIFTPAASSSYSPLTCDSQQ 218

Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNR-IIP 209
           C+ +   S +   C                 Y V YG G  T G  ++ET++      + 
Sbjct: 219 CNSLQMSSCRNGQCR----------------YQVNYGDGSFTFGDFVTETMSFGGSGTVN 262

Query: 210 NFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
           +  +GC         G+        G G G  SL SQL    FSYCL++    D+  +S+
Sbjct: 263 SIALGCG----HDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYCLVNR---DSAASST 315

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L   N +   D     L            + +    +YYVGL  ++VGG+ +R+  +   
Sbjct: 316 LDF-NSAPVGDSVIAPLL-----------KSSKIDTFYYVGLSGMSVGGELLRIPQEVFK 363

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
           LD  G+GG IVD GT  T +  E +  L D FVS       + R+    AL     C+D+
Sbjct: 364 LDDSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSM----SRHLRSTSGVAL--FDTCYDL 417

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
            G+ +   P +  HF GG    LP  NY   V      C        +      I+GN Q
Sbjct: 418 SGQSSVKVPTVSFHFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLS----IIGNVQ 473

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            Q   V +DL N R+GF    C
Sbjct: 474 QQGTRVSFDLANNRVGFSTNKC 495


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 124/390 (31%), Positives = 181/390 (46%), Gaps = 46/390 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y++S   GTP   +    DTGS L+W  C     C  CS    PS+ P  SSS+  + 
Sbjct: 90  GDYAMSFGIGTPATGLSGEADTGSDLIWTKCG---ACARCSPRGSPSYYPTSSSSAAFVA 146

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALSETL 201
           C +  C  +      C +      + S NC     SY   YG+       TEGI ++ET 
Sbjct: 147 CGDRTCGELPRP--LCSNVAGG-GSGSGNC-----SYHYAYGNARDTHHYTEGILMTETF 198

Query: 202 NLPN--RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
              +     P    GC++ S       +G+ G GRGK SL +QLN++ F Y L S    D
Sbjct: 199 TFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS----D 254

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            +  S +   + +  +         TP + NP V +      +YYVGL  I+VGG+ V++
Sbjct: 255 LSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLP----FYYVGLTGISVGGKLVQI 310

Query: 317 WHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
                + DR  G GG I DSGTT T +    +  + DE +SQM   +    A   + +  
Sbjct: 311 PSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLI-- 368

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAVCLTVVTDREAS 431
              CF   G  T +FP + LHF GGA++ L  ENY   +    GE +A C +VV   +A 
Sbjct: 369 ---CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGE-TARCWSVVKSSQA- 422

Query: 432 GGPSIILGNFQMQNYYVEYDLR-NQRLGFK 460
                I+GN    +++V +DL  N R+ F+
Sbjct: 423 ---LTIIGNIMQMDFHVVFDLSGNARMLFQ 449


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 124/390 (31%), Positives = 181/390 (46%), Gaps = 46/390 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y++S   GTP   +    DTGS L+W  C     C  CS    PS+ P  SSS+  + 
Sbjct: 90  GDYAMSFGIGTPATGLSGEADTGSDLIWTKCG---ACARCSPRGSPSYYPTSSSSAAFVA 146

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALSETL 201
           C +  C  +      C +      + S NC     SY   YG+       TEGI ++ET 
Sbjct: 147 CGDRTCGELPRP--LCSNVAGG-GSGSGNC-----SYHYAYGNARDTHHYTEGILMTETF 198

Query: 202 NLPN--RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
              +     P    GC++ S       +G+ G GRGK SL +QLN++ F Y L S    D
Sbjct: 199 TFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS----D 254

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            +  S +   + +  +         TP + NP V +      +YYVGL  I+VGG+ V++
Sbjct: 255 LSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLP----FYYVGLTGISVGGKLVQI 310

Query: 317 WHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
                + DR  G GG I DSGTT T +    +  + DE +SQM   +    A   + +  
Sbjct: 311 PSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLI-- 368

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAVCLTVVTDREAS 431
              CF   G  T +FP + LHF GGA++ L  ENY   +    GE +A C +VV   +A 
Sbjct: 369 ---CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGE-TARCWSVVKSSQA- 422

Query: 432 GGPSIILGNFQMQNYYVEYDLR-NQRLGFK 460
                I+GN    +++V +DL  N R+ F+
Sbjct: 423 ---LTIIGNIMQMDFHVVFDLSGNARMLFQ 449


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 113/379 (29%), Positives = 169/379 (44%), Gaps = 29/379 (7%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +SL  GTPPQ    +LDTGS L W  C  H +          SF P LSSS  +L C +P
Sbjct: 82  VSLPIGTPPQTQQMVLDTGSQLSWIQC--HKKSVPKKPPPTTSFDPSLSSSFSVLPCNHP 139

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
            C              D  L T+ +  ++C  Y   Y  G   EG  + E +   + +  
Sbjct: 140 LCK---------PRIPDFTLPTTCDQNRLC-HYSYFYADGTYAEGSLVREKITFSSSQST 189

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD-DTTRTSSLIL-D 266
           P  ++GC+  S+ +  GI G   G+ S  SQ  + KFSYC+ + +     + T S  L +
Sbjct: 190 PPLILGCAEASTDE-KGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYLGN 248

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
           N +S   +    LT+TP   +P     N   + Y + ++ I +G  R+ +       D  
Sbjct: 249 NPNSGRFQYINLLTFTPSQRSP-----NLDPLAYTIPMQGIRMGNARLNISATLFRPDPS 303

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGE 385
           G G TI+DSG+ FT++  E +  + +E V  +          G  +      CFD  P E
Sbjct: 304 GAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDM----CFDGNPME 359

Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQN 445
                  +   F+ G E+ +      A VG G   C+ +    E  G  S I+GNF  QN
Sbjct: 360 IGRLIGNMVFEFEKGVEIVIDKWRVLADVG-GGVHCIGI-GRSEMLGAASNIIGNFHQQN 417

Query: 446 YYVEYDLRNQRLGFKQQLC 464
            +VEYDL N+R+G  +  C
Sbjct: 418 LWVEYDLANRRIGLGKADC 436


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 123/390 (31%), Positives = 169/390 (43%), Gaps = 49/390 (12%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           S   G Y   L  GTP + +  +LDTGS +VW  C     C+ C S   P F P+ S + 
Sbjct: 136 SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA---PCRRCYSQSDPIFDPRKSKTY 192

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C +P C  +       R          K C      Y V YG G  T G   +ETL
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTR---------RKTCL-----YQVSYGDGSFTVGDFSTETL 238

Query: 202 NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
                 +    +GC   +       AG+ G G+GK S P Q       KFSYCL+     
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRS-- 296

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
            +++ SS++  N +     +     +TP ++NP +        +YYVGL  I+VGG RV 
Sbjct: 297 ASSKPSSVVFGNAAVSRIAR-----FTPLLSNPKL------DTFYYVGLLGISVGGTRVP 345

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            V      LD+ GNGG I+DSGT+ T +    +  + D F    V  +   R   A   +
Sbjct: 346 GVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF---RVGAKTLKR---APNFS 399

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
               CFD+        P + LHF+  A+V+LP  NY   V      C          GG 
Sbjct: 400 LFDTCFDLSNMNEVKVPTVVLHFR-RADVSLPATNYLIPVDTNGKFCFAFA---GTMGGL 455

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           SII GN Q Q + V YDL + R+GF    C
Sbjct: 456 SII-GNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 114/386 (29%), Positives = 168/386 (43%), Gaps = 54/386 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +S+  GTP + +  + DTGS L W  CT    C  C   K P F P  SS+   + 
Sbjct: 144 GNYVVSMGLGTPARDMTVVFDTGSDLSWVQCT---PCSDCYEQKDPLFDPARSSTYSAVP 200

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-P 204
           C +P+C     + +  R C+ +     K C      Y V+YG    T+G    +TL L  
Sbjct: 201 CASPEC-----QGLDSRSCSRD-----KKC-----RYEVVYGDQSQTDGALARDTLTLTQ 245

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
           + ++P F+ GC    +    +  G+ G GR K SL SQ        FSYCL S     + 
Sbjct: 246 SDVLPGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPS-----SP 300

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
             +  +   G + ++ + T +             R+    +YYV L  + V G+ VRV  
Sbjct: 301 SAAGYLSLGGPAPANARFTAME-----------TRHDSPSFYYVRLVGVKVAGRTVRVSP 349

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +       GT++DSGT  T + P ++  L   F   M +   Y R   A AL+ L  
Sbjct: 350 IVFS-----AAGTVIDSGTVITRLPPRVYAALRSAFARSMGRY-GYKR---APALSILDT 400

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+D  G  T   P + L F GGA V L        V + S  CL    + +  G  + I+
Sbjct: 401 CYDFTGHTTVRIPSVALVFAGGAAVGLDFSGVL-YVAKVSQACLAFAPNGD--GADAGII 457

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q +   V YD+  Q++GF    C
Sbjct: 458 GNTQQKTLAVVYDVARQKIGFGANGC 483


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 118/409 (28%), Positives = 182/409 (44%), Gaps = 65/409 (15%)

Query: 68  KTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
           +T+ ++      N+   S  G Y I + FGTP Q +  ++DTGS + W PC    QC+ C
Sbjct: 93  RTSRSSKEDANANVPVRSGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCK---QCQGC 149

Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC--TQICPSYL 184
            S+  P F P  SSS +   C +  C  I                 S NC     C  + 
Sbjct: 150 HSTA-PIFDPAKSSSYKPFACDSQPCQEI-----------------SGNCGGNSKC-QFE 190

Query: 185 VLYGSGL-TEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-------PAGIAGFGRGKTSL 236
           VLYG G   +G   S+ + L ++ +PNF  GC+   S              G     T  
Sbjct: 191 VLYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTYSSPGLMGLGGGSLSLLTQA 250

Query: 237 P-SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNA 295
           P ++L    FSYCL S     +T + SL+L   ++ S   ++ L +T  + +PS      
Sbjct: 251 PTAELFGGTFSYCLPSS----STSSGSLVLGKEAAVS---SSSLKFTTLIKDPS------ 297

Query: 296 FSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
           F  +Y+V L+ I+VG  R+ V    +       GGTI+DSGTT T++ P  ++ L D F 
Sbjct: 298 FPTFYFVTLKAISVGNTRISVPATNIA----SGGGTIIDSGTTITYLVPSAYKDLRDAFR 353

Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG 415
            Q+        +L    +  +  C+D+        P + LH     ++ LP EN   +  
Sbjct: 354 QQL-------SSLQPTPVEDMDTCYDLSSSSV-DVPTITLHLDRNVDLVLPKENIL-ITQ 404

Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E    CL   +    S     I+GN Q QN+ + +D+ N ++GF Q+ C
Sbjct: 405 ESGLSCLAFSSTDSRS-----IIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 114/383 (29%), Positives = 164/383 (42%), Gaps = 48/383 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G+P + +  +LDTGS + W  C     C  C     P F P LS+S   + 
Sbjct: 165 GEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSTSYASVA 221

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C NP+C             +D   A  +N T  C  Y V YG G  T G   +ETL L +
Sbjct: 222 CDNPRC-------------HDLDAAACRNSTGAC-LYEVAYGDGSYTVGDFATETLTLGD 267

Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
              + +  +GC   +       AG+   G G  S PSQ++   FSYCL+     D+  +S
Sbjct: 268 SAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSS 324

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L   + +   D + T     P + +P        S +YYVGL  ++VGGQ + +     
Sbjct: 325 TLQFGDAA---DAEVTA----PLIRSPRT------STFYYVGLSGLSVGGQILSIPPSAF 371

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            +D  G GG IVDSGT  T +    +  L D FV      ++  R  G   ++    C+D
Sbjct: 372 AMDSTGAGGVIVDSGTAVTRLQSSAYAALRDAFVR---GTQSLPRTSG---VSLFDTCYD 425

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
           +    +   P + L F GG E+ LP +NY   V      CL       A      I+GN 
Sbjct: 426 LSDRTSVEVPAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGNV 481

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q Q   V +D     +GF    C
Sbjct: 482 QQQGTRVSFDTAKSTVGFTTNKC 504


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 125/436 (28%), Positives = 189/436 (43%), Gaps = 52/436 (11%)

Query: 38  TNPSQDSYQN-LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFG 96
           T P  +S+ N +  + S    R  ++ +   + T      +   +   + G Y + +  G
Sbjct: 45  TAPKSESWMNTVIDMASKDPARIRYLSSLTAQKTVAAPIASGQQV--LNVGNYVVRVQLG 102

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
           TP Q +  +LDT +   W PC+    C  CSS+   +F  + SS+   L C  P+C+   
Sbjct: 103 TPGQTMYMVLDTSNDAAWAPCSG---CIGCSSTT--TFSAQNSSTFATLDCSKPECT--Q 155

Query: 157 HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETLNLPNRIIPNFLVGC 215
              + C      P   + +C      +   YG   T     + ++L+L   +IPNF  GC
Sbjct: 156 ARGLSC------PTTGNVDCL-----FNQTYGGDSTFSATLVQDSLHLGPNVIPNFSFGC 204

Query: 216 ---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
              +  SS  P G+ G GRG  SL SQ   L    FSYCL S  F     + SL L    
Sbjct: 205 ISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPS--FKSYYFSGSLKLGPVG 262

Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
                +TT L + P  + PS+         YYV L  I+VG   V +  + L  D +   
Sbjct: 263 QPKAIRTTPLLHNP--HRPSL---------YYVNLTGISVGRVLVPISPELLAFDPNTGA 311

Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS 389
           GTI+DSGT  T   P ++  + DEF  Q+  + +    LGA        CF    E   S
Sbjct: 312 GTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFS---PLGA-----FDTCFATNNEV--S 361

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
            P + LH   G ++ LP+EN       GS  CL +            ++ N Q QN+ + 
Sbjct: 362 APAITLHLS-GLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRIL 420

Query: 450 YDLRNQRLGFKQQLCK 465
           +D+ N +LG  ++LC 
Sbjct: 421 FDINNSKLGIARELCN 436


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 117/382 (30%), Positives = 168/382 (43%), Gaps = 37/382 (9%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           ISL  GTPPQ    +LDTGS L W  C   ++ K     K  SF P LSSS   L C +P
Sbjct: 74  ISLPIGTPPQAQQMVLDTGSQLSWIQC---HRKKLPPKPKT-SFDPSLSSSFSTLPCSHP 129

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
            C              D  L TS +  ++C  Y   Y  G   EG  + E +   N  I 
Sbjct: 130 LCK---------PRIPDFTLPTSCDSNRLC-HYSYFYADGTFAEGNLVKEKITFSNTEIT 179

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL--LSHKFDDTTRTSSLILD 266
           P  ++GC+  SS    GI G  RG+ S  SQ  + KFSYC+   S++   T   S  + D
Sbjct: 180 PPLILGCATESSDD-RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGD 238

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
           N +SH  K  + LT+      P     N   + Y V +  I  G +++ +       D  
Sbjct: 239 NPNSHGFKYVSLLTFPESQRMP-----NLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAG 293

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK 386
           G+G T+VDSG+ FT +    ++ +  E ++++ +        G  A      CFD     
Sbjct: 294 GSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADM----CFD---GN 346

Query: 387 TGSFP----ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
               P    +L   F  G E+ +P E     VG G  +    +      G  S I+GN  
Sbjct: 347 VAMIPRLIGDLVFVFTRGVEILVPKERVLVNVGGG--IHCVGIGRSSMLGAASNIIGNVH 404

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN +VE+D+ N+R+GF +  C
Sbjct: 405 QQNLWVEFDVTNRRVGFAKADC 426


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 120/412 (29%), Positives = 195/412 (47%), Gaps = 35/412 (8%)

Query: 64  NPQTKTT---TTTTTTTTTNISSHSYGGYS----ISLSFGTPPQIIPFILDTGSHLVWFP 116
           N +TKT    TT +++++++I+  S   YS    ++L  GTPPQ+   +LDTGS L W  
Sbjct: 50  NSKTKTNQQFTTLSSSSSSSINVKSSFKYSMALVVTLPIGTPPQLQQMVLDTGSQLSWIQ 109

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
           C N    +        SF P LSSS  +L C +P C              D  L T  + 
Sbjct: 110 CHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHPLCK---------PRVPDFSLPTDCDA 160

Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVLSSRQPAGIAGFGRGKT 234
             +C  Y   Y  G   EG  + E +   P++  P  ++GC+   S    GI G   G+ 
Sbjct: 161 NSLC-HYSYFYADGTYAEGNLVREKIAFSPSQTTPPIILGCAT-QSDDARGILGMNLGRL 218

Query: 235 SLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
             PSQ  + KFSYC+ + +    +  S  + +N +S S +    LT+      P     N
Sbjct: 219 GFPSQAKITKFSYCVPTKQAQPAS-GSFYLGNNPASSSFRYVNLLTFGQSQRMP-----N 272

Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
              + Y + L+ I++GG+++ +       +  G+G T++DSG+ FT++  E +  + +E 
Sbjct: 273 LDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMIDSGSEFTYLVDEAYNVIREEL 332

Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHFKGGAEVTLPVENYFAV 413
           V ++          G  A      CFD    + G    ++   F+ G ++ +P E   A 
Sbjct: 333 VKKVGPKIKKGYMYGGVADI----CFDGDAIEIGRLVGDMVFEFEKGVQIVIPKERVLAT 388

Query: 414 VGEGSAVCLTV-VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           V +G   CL +  ++R  +GG   I+GNF  QN +VE+DL N+R+GF +  C
Sbjct: 389 V-DGGVHCLGMGRSERLGAGGN--IIGNFHQQNLWVEFDLANRRVGFGEADC 437


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 166/382 (43%), Gaps = 48/382 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G P + +  +LDTGS + W  CT    C  C     P F P  SSS   L 
Sbjct: 149 GEYFTRVGIGNPAREVYMVLDTGSDVNWLQCT---PCADCYHQTEPIFEPSSSSSYEPLS 205

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P+C+ +  E  +CR           N T +   Y V YG G  T G   +ETL + +
Sbjct: 206 CDTPQCNAL--EVSECR-----------NATCL---YEVSYGDGSYTVGDFATETLTIGS 249

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
            ++ N  VGC   +       AG+ G G G  +LPSQLN   FSYCL+    D     S+
Sbjct: 250 TLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSD-----SA 304

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
             ++ G+S            P + N      +    +YY+GL  I+VGG+ +++      
Sbjct: 305 STVEFGTSLPPDAVVA----PLLRN------HQLDTFYYLGLTGISVGGELLQIPQSSFE 354

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
           +D  G+GG I+DSGT  T +   ++  L D F+      +  +    A  +     C+++
Sbjct: 355 MDESGSGGIIIDSGTAVTRLQTGIYNSLRDSFL------KGTSDLEKAAGVAMFDTCYNL 408

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
             + T   P +  HF GG  + LP +NY   V      CL        +     I+GN Q
Sbjct: 409 SAKTTIEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFA----PTASSLAIIGNVQ 464

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            Q   V +DL N  +GF    C
Sbjct: 465 QQGTRVTFDLANSLIGFSSNKC 486


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 110/372 (29%), Positives = 168/372 (45%), Gaps = 50/372 (13%)

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
           +DTGS L+W  C     C  C+    P F  K S++ R L C++ +C+ +          
Sbjct: 1   MDTGSDLIWTQCA---PCLLCADQPTPYFDVKKSATYRALPCRSSRCASL---------- 47

Query: 166 NDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-----NLPNRIIPNFLVGCSVLS 219
                 +S +C +    Y   YG +  T G+  +ET      N       N   GC  L+
Sbjct: 48  ------SSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLN 101

Query: 220 SRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKT 276
           +   A   G+ GFGRG  SL SQL   +FSYCL S+     +R    +  N SS +    
Sbjct: 102 AGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSG 161

Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSG 336
           + +  TPFV NP++         Y++ L+ I++G + + +      ++ DG GG I+DSG
Sbjct: 162 SPVQSTPFVINPALPNM------YFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSG 215

Query: 337 TTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGLRPCFDVPGEK--TGSFPEL 393
           T+ T++  + +E +    VS + +   N T         GL  CF  P     T + P+L
Sbjct: 216 TSITWLQQDAYEAVRRGLVSAIPLPAMNDTD-------IGLDTCFQWPPPPNVTVTVPDL 268

Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
             HF   A +TL  ENY  +      +CL +     A  G   I+GN+Q QN ++ YD+ 
Sbjct: 269 VFHFD-SANMTLLPENYMLIASTTGYLCLVM-----APTGVGTIIGNYQQQNLHLLYDIG 322

Query: 454 NQRLGFKQQLCK 465
           N  L F    C 
Sbjct: 323 NSFLSFVPAPCD 334


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 118/437 (27%), Positives = 180/437 (41%), Gaps = 47/437 (10%)

Query: 40  PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPP 99
           PS    +++ +L      R L +    +    +T  ++    S  S   Y +    G+P 
Sbjct: 32  PSSSPLESIIALAREDDARLLFL----SSKAASTGVSSAPVASGQSPPSYVVRAGLGSPA 87

Query: 100 QIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
           Q I   LDT +   W  C+    C  C SS    F P  S+S   L C +  C+ +  + 
Sbjct: 88  QPILLALDTSADATWAHCS---PCGTCPSSGS-LFAPANSTSYAPLPCSSTMCTVLQGQP 143

Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS 219
              +D  D     S     +C ++   +     +    S+ L+L    IPN+  GC    
Sbjct: 144 CPAQDPYD-----SSAPLPMC-AFTKPFADASFQASLASDWLHLGKDAIPNYAFGCVSAV 197

Query: 220 SRQPA-----GIAGFGRGKTSLPSQL-NL--DKFSYCLLSHK---FDDTTRTSSLILDNG 268
           S   A     G+ G GRG  +L SQ+ N+    FSYCL S+K   F  + R  +      
Sbjct: 198 SGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAA----- 252

Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
                 +  G+ YTP + NP+       S  YYV +  ++VG   V+V       D    
Sbjct: 253 -----GQPRGVRYTPMLKNPN------RSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATG 301

Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG 388
            GT+VDSGT  T   P ++  L +EF   +     YT +LGA        CF+      G
Sbjct: 302 AGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYT-SLGA-----FDTCFNTDEVAAG 355

Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYV 448
             P + +H  GG ++ LP+EN           CL +    +       +L N Q QN  V
Sbjct: 356 VAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRV 415

Query: 449 EYDLRNQRLGFKQQLCK 465
            +D+ N R+GF ++ C 
Sbjct: 416 VFDVANSRVGFARESCN 432


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 159/351 (45%), Gaps = 63/351 (17%)

Query: 138 LSSSSRLLGCQNPKC--------SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
           +SS+ + + C +P C        S    E+ QC                    YL  YG 
Sbjct: 1   MSSTFKAVACPDPICRPSSGVSVSACAMENFQCF-------------------YLCSYGD 41

Query: 190 -GLTEGIALSETLNL--PNRI---IPNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQ 239
             +T G    +T     PN +   +     GC    + L     +GIAGFGRG  SLPSQ
Sbjct: 42  RSITAGHIFKDTFTFMSPNGVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQ 101

Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD---KKTTG-LTYTPFVNNPSVAERNA 295
           L + +FSYCL       T   SS+++       D     TTG    TP + NP +     
Sbjct: 102 LKVGRFSYCLTLV----TESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIP---- 153

Query: 296 FSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
              +YY+ L  ITVG  R+        L +DG+GGT++DSGT+ T +   +FE L +E V
Sbjct: 154 --TFYYLSLEGITVGKTRLPFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELV 211

Query: 356 SQMVKNR-NYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
           +Q    R + T  +G       R CF  P G K    P+L LH   GA++ LP +NYF  
Sbjct: 212 AQFPLPRYDNTPEVGD------RLCFRRPKGGKQVPVPKLILHL-AGADMDLPRDNYFVE 264

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +   +CL +    + +    +++GNFQ QN +V YD+ N +L F    C
Sbjct: 265 EPDSGVMCLQINGAEDTT---MVLIGNFQQQNMHVVYDVENNKLLFAPAQC 312


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 116/392 (29%), Positives = 162/392 (41%), Gaps = 58/392 (14%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           +S   G Y   +  G PP  +  +LDTGS + W  C     C  C     P F P  S+S
Sbjct: 144 ASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCA---PCAECYEQTDPXFEPTSSAS 200

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              L C+  +C     +S+   +C        +N T +   Y V YG G  T G  ++ET
Sbjct: 201 FTSLSCETEQC-----KSLDVSEC--------RNGTCL---YEVSYGDGSYTVGDFVTET 244

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
           + L +  + N  +GC         G+        G G G  S PSQLN   FSYCL+   
Sbjct: 245 VTLGSTSLGNIAIGCG----HNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRD 300

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
            D T+      LD  S  +    T     P   NP++        ++Y+GL  ++VGG  
Sbjct: 301 SDSTS-----TLDFNSPITPDAVTA----PLHRNPNL------DTFFYLGLTGMSVGGAV 345

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + +      +  DGNGG IVDSGT  T +   ++  L D FV      +       A  +
Sbjct: 346 LPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQT------ARGV 399

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASG 432
                C+D+  +     P +  HF  G E+ LP +NY   V      C     TD   S 
Sbjct: 400 ALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLS- 458

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               ILGN Q Q   V +DL N  +GF    C
Sbjct: 459 ----ILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 119/392 (30%), Positives = 172/392 (43%), Gaps = 55/392 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y  ++  GTP ++   I+DTGS L W  C+    C  C S     F+P  S+S   L 
Sbjct: 11  GEYLATVRLGTPERVFSVIVDTGSDLTWVQCS---PCGKCYSQNDALFLPNTSTSFTKLA 67

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL---- 201
           C +  C             N  P      C Q    Y   YG G LT G  + +T+    
Sbjct: 68  CGSALC-------------NGLPFPM---CNQTTCVYWYSYGDGSLTTGDFVYDTITMDG 111

Query: 202 -NLPNRIIPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
            N   + +PNF  GC   +    AG   I G G+G  S  SQL      KFSYCL+    
Sbjct: 112 INGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDW-L 170

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
              T+TS L+  + +         + Y P + NP V        YYYV L  I+VG   +
Sbjct: 171 APPTQTSPLLFGDAAV---PILPDVKYLPILANPKVP------TYYYVKLNGISVGDNLL 221

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF-EPLADEFVSQMVKNRNYTRALGAEAL 373
            +      +D  G  GTI DSGTT T +A   + E LA    S M     Y+R +  + +
Sbjct: 222 NISSTVFDIDSVGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMA----YSRKI--DDI 275

Query: 374 TGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
           + L  C    P ++  + P +  HF+GG ++ LP  NYF  +    + C  + +  + + 
Sbjct: 276 SRLDLCLSGFPKDQLPTVPAMTFHFEGG-DMVLPPSNYFIYLESSQSYCFAMTSSPDVN- 333

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               I+G+ Q QN+ V YD   ++LGF  + C
Sbjct: 334 ----IIGSVQQQNFQVYYDTAGRKLGFVPKDC 361


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 122/394 (30%), Positives = 166/394 (42%), Gaps = 51/394 (12%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           S   G Y + L  GTP   +  +LDTGS +VW  C+    CK C +     F PK S + 
Sbjct: 129 SQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCS---PCKACYNQTDAIFDPKKSKTF 185

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C +  C  +   S    +C       SK C      Y V YG G  TEG   +ETL
Sbjct: 186 ATVPCGSRLCRRLDDSS----ECVTR---RSKTCL-----YQVSYGDGSFTEGDFSTETL 233

Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLN---LDKFSYCLLS 251
                 + +  +GC         G+        G GRG  S PSQ       KFSYCL+ 
Sbjct: 234 TFHGARVDHVPLGC----GHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVD 289

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
                ++      +  G++   K +    +TP + NP +        +YY+ L  I+VGG
Sbjct: 290 RTSSGSSSKPPSTIVFGNAAVPKTS---VFTPLLTNPKL------DTFYYLQLLGISVGG 340

Query: 312 QRV-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
            RV  V      LD  GNGG I+DSGT+ T +    +  L D F     K +       A
Sbjct: 341 SRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKR------A 394

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
            + +    CFD+ G  T   P +  HF GG EV+LP  NY   V      C         
Sbjct: 395 PSYSLFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFA----G 449

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + G   I+GN Q Q + V YDL   R+GF  + C
Sbjct: 450 TMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 114/392 (29%), Positives = 160/392 (40%), Gaps = 58/392 (14%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           +S   G Y   +  G PP     ILDTGS + W  C     C  C     P F P  S+S
Sbjct: 142 TSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCA---PCADCYQQADPIFEPASSAS 198

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              L C   +C  +  +  +CR  ND  L            Y V YG G  T G  ++ET
Sbjct: 199 FSTLSCNTRQCRSL--DVSECR--NDTCL------------YEVSYGDGSYTVGDFVTET 242

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
           + L +  + N  +GC         G+        G G G  S PSQ+N   FSYCL+   
Sbjct: 243 ITLGSAPVDNVAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDR- 297

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
             D+   S+L  ++            T  P   +  +   +    +YYVGL  ++VGG+ 
Sbjct: 298 --DSESASTLEFNS------------TLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGEL 343

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEA 372
           V +      +D  GNGG IVDSGT  T +  +++  L D FV +       TR L     
Sbjct: 344 VSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVKR-------TRDLPSTNG 396

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
           +     C+D+  +     P +  HF  G E+ LP +NY   +      C        +  
Sbjct: 397 IALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLS 456

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               I+GN Q Q   V YDL N  +GF    C
Sbjct: 457 ----IIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 128/419 (30%), Positives = 186/419 (44%), Gaps = 49/419 (11%)

Query: 64  NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
           +P+   +     T  + ++  S G Y + +  GTPP+    I+DTGS L W  C     C
Sbjct: 127 SPRRALSERMVATVESGVAVGS-GEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCA---PC 182

Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
             C     P F P  SSS R + C + +C  +           + P A  +     CP Y
Sbjct: 183 LDCFDQVGPVFDPAASSSYRNVTCGDQRCGLVAPP--------EPPRACRRPGEDSCP-Y 233

Query: 184 LVLYG--SGLTEGIAL-SETLNL----PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGK 233
              YG  S  T  +AL S T+NL     +R + + + GC   +       AG+ G GRG 
Sbjct: 234 YYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGP 293

Query: 234 TSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
            S  SQL       FSYCL+ H  D  ++   +  ++ +         L YT F    S 
Sbjct: 294 LSFASQLRAVYGHTFSYCLVDHGSDVASKV--VFGEDDALALAAAHPQLNYTAFAPASSP 351

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRV----WHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
           A+      +YYV L+ + VGG+ + +    W         G   TI+DSGTT ++     
Sbjct: 352 AD-----TFYYVKLKGVLVGGELLNISSDTWGVGEGEGGSGG--TIIDSGTTLSYFVEPA 404

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
           ++ +   F+ +M   R+Y        L+   PC++V G      PEL L F  GA    P
Sbjct: 405 YQVIRQAFIDRM--GRSYPLIPDFPVLS---PCYNVSGVDRPEVPELSLLFADGAVWDFP 459

Query: 407 VENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ENYF  +     +CL V+ T R        I+GNFQ QN++V YDL+N RLGF  + C
Sbjct: 460 AENYFIRLDPDGIMCLAVLGTPRTGMS----IIGNFQQQNFHVVYDLKNNRLGFAPRRC 514


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 117/382 (30%), Positives = 168/382 (43%), Gaps = 37/382 (9%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           ISL  GTPPQ    +LDTGS L W  C   ++ K     K  SF P LSSS   L C +P
Sbjct: 74  ISLPIGTPPQAQQMVLDTGSQLSWIQC---HRKKLPPKPKT-SFDPSLSSSFSTLPCSHP 129

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
            C              D  L TS +  ++C  Y   Y  G   EG  + E +   N  I 
Sbjct: 130 LCK---------PRIPDFTLPTSCDSNRLC-HYSYFYADGTFAEGNLVKEKITFSNTEIT 179

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL--LSHKFDDTTRTSSLILD 266
           P  ++GC+  SS    GI G  RG+ S  SQ  + KFSYC+   S++   T   S  + D
Sbjct: 180 PPLILGCATESSDD-RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGD 238

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
           N +SH  K  + LT+      P     N   + Y V +  I  G +++ +       D  
Sbjct: 239 NPNSHGFKYVSLLTFPESQRMP-----NLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAG 293

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK 386
           G+G T+VDSG+ FT +    ++ +  E ++++ +        G  A      CFD     
Sbjct: 294 GSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADM----CFD---GN 346

Query: 387 TGSFP----ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
               P    +L   F  G E+ +P E     VG G  +    +      G  S I+GN  
Sbjct: 347 VAMIPRLIGDLVFVFTRGVEIFVPKERVLVNVGGG--IHCVGIGRSSMLGAASNIIGNVH 404

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN +VE+D+ N+R+GF +  C
Sbjct: 405 QQNLWVEFDVTNRRVGFAKADC 426


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 117/384 (30%), Positives = 165/384 (42%), Gaps = 55/384 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  + GTP Q +   LDT +   W PC+    C  C+SS +  F P  SSSSR L C 
Sbjct: 91  YIVRANIGTPAQPMLVALDTSNDAAWVPCSG---CVGCASSVL--FDPSKSSSSRNLQCD 145

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P          QC+   +      K+C      + + YG    E     +TL L N +I
Sbjct: 146 AP----------QCKQAPNPTCTAGKSC-----GFNMTYGGSTIEASLTQDTLTLANDVI 190

Query: 209 PNFLVGC--SVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
            ++  GC      +  PA G+ G GRG  SL SQ   L +  FSYCL + K  + + +  
Sbjct: 191 KSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGSLR 250

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L    G  +   +   +  TP + NP        S  YYV L  I VG + V +    L 
Sbjct: 251 L----GPKYQPVR---IKTTPLLKNPR------RSSLYYVNLVGIRVGNKIVDIPTSALA 297

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            D     GTI DSGT FT +    +  + +EF  + +KN N      A +L G   C+  
Sbjct: 298 FDASTGAGTIFDSGTVFTRLVEPAYVAVRNEF-RRRIKNAN------ATSLGGFDTCY-- 348

Query: 383 PGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
               +GS  +P +   F  G  VTLP +N       GS  CL +            ++ +
Sbjct: 349 ----SGSVVYPSVTFMF-AGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIAS 403

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q QN+ V  DL N RLG  ++ C
Sbjct: 404 MQQQNHRVLIDLPNSRLGISRETC 427


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 122/389 (31%), Positives = 170/389 (43%), Gaps = 64/389 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           GGY++++S GTP      + DTGS L+W  C     C  C     P F P  SS+   L 
Sbjct: 84  GGYNMNISVGTPLLTFSVVADTGSDLIWTQCA---PCTKCFQQPAPPFQPASSSTFSKLP 140

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +  C ++ +     R CN      +  C      Y   YGSG T G   +ETL + + 
Sbjct: 141 CTSSFCQFLPNS---IRTCN------ATGCV-----YNYKYGSGYTAGYLATETLKVGDA 186

Query: 207 IIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
             P+   GCS  +        G G+        L + +FSYCL S         S ++  
Sbjct: 187 SFPSVAFGCSTEN--------GLGQ------LDLGVGRFSYCLRSGS---AAGASPILFG 229

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
           + ++ +D     +  TPFVNNP+V        YYYV L  ITVG   + V        ++
Sbjct: 230 SLANLTDGN---VQSTPFVNNPAVHPS-----YYYVNLTGITVGETDLPVTTSTFGFTQN 281

Query: 327 G-NGGTIVDSGTTFTFMAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRPCFDVP 383
           G  GGTIVDSGTT T++A + +E +   F+SQ   V   N TR        GL  CF   
Sbjct: 282 GLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTR--------GLDLCFKST 333

Query: 384 GEKTG--SFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPSI 436
           G   G  + P L L F GGAE  +P   YFA V     G  +  CL ++  +     P  
Sbjct: 334 GGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMMLPAKGDQ--PMS 389

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ++GN    + ++ YDL      F    C 
Sbjct: 390 VIGNVMQMDMHLLYDLDGGIFSFAPADCA 418


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 116/398 (29%), Positives = 179/398 (44%), Gaps = 54/398 (13%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           + L  G+  + +  I+DTGS  V            C S   P F P  S S R + C + 
Sbjct: 1   MQLGIGSLQKNLSAIIDTGSEAVLV---------QCGSRSRPVFDPAASQSYRQVPCISQ 51

Query: 151 KCSWIHHESIQCRDCNDEP-LATSKNCTQICPSYLVLYGSG------LTEGIALSETLNL 203
            C  +     Q  + + +P + +S  CT     Y + YG         ++ +    + N 
Sbjct: 52  LCLAVQQ---QTSNGSSQPCVNSSAACT-----YSLSYGDSRNSTGDFSQDVIFLNSTNS 103

Query: 204 PNRIIP--NFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
            ++ +   +   GC+      L      GI GF RG  SLPSQL       KFSYC  S 
Sbjct: 104 SSQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQ 163

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
            +         + D+G S S      ++YTP ++NP    R   S  YYVGL  I+V G+
Sbjct: 164 PWQPRATGVIFLGDSGLSKSK-----VSYTPLLDNPVTPAR---SQLYYVGLTSISVDGK 215

Query: 313 RVRVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA-LGA 370
            + +      LD   G+GGT++DSGTTFT +  + +    + F +    NR+  R  +GA
Sbjct: 216 TLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAA---SNRSGLRKKVGA 272

Query: 371 EALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV---GEGSAVCLTVVT 426
            A  G   C+++  G      PE++L  +    + L  E+ F  V   G    VCL +++
Sbjct: 273 AA--GFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILS 330

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +++  G   +LGN+Q  NY VEYD    R+GF++  C
Sbjct: 331 SQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 368


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 131/437 (29%), Positives = 198/437 (45%), Gaps = 72/437 (16%)

Query: 46  QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY---GGYSISLSFGTPPQII 102
           + + +LV+ S  R   +   +  +++ ++   TT++ S  +   GGY + +S GTP +  
Sbjct: 10  EAIRALVAKSHARVRWMA-ARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRF 68

Query: 103 PFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
             I DTGS LVW    PCT       CS   I  F P+ SS+ R + C +  C+ +    
Sbjct: 69  RAIADTGSDLVWVQSEPCTG------CSGGTI--FDPRQSSTFREMDCSSQLCAELPGSC 120

Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-----PNRIIPNFLVG 214
                   EP   S  C     SY   YGSG TEG    +T++L      ++  P+F VG
Sbjct: 121 --------EP--GSSTC-----SYSYEYGSGETEGEFARDTISLGTTSDGSQKFPSFAVG 165

Query: 215 CSVLSSRQPA--GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGS 269
           C +++S      G+ G G+G  SL SQL+     KFSYCL+    +  + +S L+    +
Sbjct: 166 CGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLV--DINSQSESSPLLFGPSA 223

Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
           +           TP    PS    + +  YY + +  I V GQ +              G
Sbjct: 224 ALHGTGIQSTKITP----PS----DTYPTYYLLTVNGIAVAGQTM-----------GSPG 264

Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS 389
            TI+DSGTT T++   ++       +S+M       R  G+    GL  C+D    +   
Sbjct: 265 TTIIDSGTTLTYVPSGVY----GRVLSRMESMVTLPRVDGSS--MGLDLCYDRSSNRNYK 318

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGNFQMQNYYV 448
           FP L +    GA +T P  NYF VV + G  VCL + +   ASG P  I+GN   Q Y++
Sbjct: 319 FPALTIRLA-GATMTPPSSNYFLVVDDSGDTVCLAMGS---ASGLPVSIIGNVMQQGYHI 374

Query: 449 EYDLRNQRLGFKQQLCK 465
            YD  +  L F Q  C+
Sbjct: 375 LYDRGSSELSFVQAKCE 391


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 118/380 (31%), Positives = 179/380 (47%), Gaps = 45/380 (11%)

Query: 93  LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
           ++ G   Q +  I+DTGS L W  C     C  C S + P F P  SSS   L C +  C
Sbjct: 135 VTIGLGNQNMTVIIDTGSDLTWVQCD---PCMSCYSQQGPVFNPSNSSSYNSLLCNSSTC 191

Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNF 211
                +++Q    N E  A   N    C ++ V YG G  T+G    E L+     + NF
Sbjct: 192 -----QNLQFTTGNTE--ACESNNPSSC-NHTVSYGDGSFTDGELGVEHLSFGGISVSNF 243

Query: 212 LVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLIL 265
           + GC   +       +GI G GR   S+ SQ N      FSYCL +    D+  + SL++
Sbjct: 244 VFGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPT---TDSGASGSLVI 300

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
            N SS   K  T + YT  V+NP ++       +Y + L  I VGG  ++        D 
Sbjct: 301 GNESSLF-KNLTPIAYTSMVSNPQLSN------FYVLNLTGIDVGGVAIQ--------DT 345

Query: 326 D-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
             GNGG ++DSGT  T +AP L+  L  EF+ Q      ++    A AL+ L  CF++ G
Sbjct: 346 SFGNGGILIDSGTVITRLAPSLYNALKAEFLKQ------FSGYPIAPALSILDTCFNLTG 399

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
            +  S P L +HF+   ++ +       +  +GS VCL + +  + +     I+GN+Q +
Sbjct: 400 IEEVSIPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDEN--DMAIIGNYQQR 457

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
           N  V YD +  ++GF ++ C
Sbjct: 458 NQRVIYDAKQSKIGFAREDC 477


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 116/393 (29%), Positives = 160/393 (40%), Gaps = 59/393 (15%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           +S   G Y   +  G+PP+ +  ++DTGS + W  C     C  C     P F P  SSS
Sbjct: 148 ASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCA---PCADCYQQADPIFEPSFSSS 204

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              L C+  +C  +  +  +CR  ND  L            Y V YG G  T G   +ET
Sbjct: 205 YAPLTCETHQCKSL--DVSECR--NDSCL------------YEVSYGDGSYTVGDFATET 248

Query: 201 LNLPNRI-IPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSH 252
           + L     + N  +GC         G+        G G G  S PSQ+N   FSYCL++ 
Sbjct: 249 ITLDGSASLNNVAIGCG----HDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNR 304

Query: 253 KFDDTTRTSSLILDNG-SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
              DT   S+L  ++   SHS          P + N      N    +YY+G+  I VGG
Sbjct: 305 ---DTDSASTLEFNSPIPSHS-------VTAPLLRN------NQLDTFYYLGMTGIGVGG 348

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           Q + +      +D  GNGG IVDSGT  T +  +++  L D FV      R         
Sbjct: 349 QMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSDVYNSLRDSFV------RGTQHLPSTS 402

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
            +     C+D+    +   P +  HF  G  + LP +NY   V      C        A 
Sbjct: 403 GVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSAL 462

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                I+GN Q Q   V YDL N  +GF    C
Sbjct: 463 S----IIGNVQQQGTRVSYDLSNSLVGFSPNGC 491


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 121/394 (30%), Positives = 165/394 (41%), Gaps = 51/394 (12%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           S   G Y + L  GTP   +  +LDTGS +VW  C+    CK C +     F PK S + 
Sbjct: 132 SQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCS---PCKACYNQSDVIFDPKKSKTF 188

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C +  C  +        D ++     SK C      Y V YG G  TEG   +ETL
Sbjct: 189 ATVPCGSRLCRRLD-------DSSECVTRRSKTCL-----YQVSYGDGSFTEGDFSTETL 236

Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNL---DKFSYCLLS 251
                 + +  +GC         G+        G GRG  S PSQ       KFSYCL+ 
Sbjct: 237 TFHGARVDHVPLGC----GHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVD 292

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
                ++      +  G+    K +    +TP + NP +        +YY+ L  I+VGG
Sbjct: 293 RTSSGSSSKPPSTIVFGNDAVPKTS---VFTPLLTNPKL------DTFYYLQLLGISVGG 343

Query: 312 QRV-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
            RV  V      LD  GNGG I+DSGT+ T +    +  L D F     K +       A
Sbjct: 344 SRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKR------A 397

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
            + +    CFD+ G  T   P +  HF GG EV+LP  NY   V      C         
Sbjct: 398 PSYSLFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFA----G 452

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + G   I+GN Q Q + V YDL   R+GF  + C
Sbjct: 453 TMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 120/394 (30%), Positives = 180/394 (45%), Gaps = 52/394 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + L  GTP + +  ++DTGS L W  C     CK C     P F P+ SSS + + 
Sbjct: 127 GEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQ---PCKSCYKQADPIFDPRNSSSFQRIP 183

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSE--TLNL 203
           C +P C     ++++   C+    ATS+     C SY V YG G  + G   S+  TL  
Sbjct: 184 CLSPLC-----KALEIHSCSGSRGATSR-----C-SYQVAYGDGSFSVGDFSSDLFTLGT 232

Query: 204 PNRIIPNFLVGCSV---LSSRQPAGIAGFGRGKTSLPSQL--------NLDKFSYCLLSH 252
            ++ + +   GC           AG+ G G GK S PSQ+          + FSYCL+  
Sbjct: 233 GSKAM-SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 291

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
               T  +SSLI    +  S   T  L+  P + NP +        +YY  +  ++VGG 
Sbjct: 292 SNPMTRSSSSLIFGAAAIPS---TAALS--PLLKNPKL------DTFYYAAMIGVSVGGA 340

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAE 371
           ++ +  K L L + G+GG I+DSGT+ T     ++  + D F       RN T  L  A 
Sbjct: 341 QLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAF-------RNATTNLPSAP 393

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
             +    C++  G+ +   P L LHF+ GA++ LP  NY   +    + CL         
Sbjct: 394 RYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMEL 453

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           G    I+GN Q Q++ + +DL+   L F  Q CK
Sbjct: 454 G----IIGNIQQQSFRIGFDLQKSHLAFAPQQCK 483


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 116/392 (29%), Positives = 162/392 (41%), Gaps = 58/392 (14%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           +S   G Y   +  G PP  +  +LDTGS + W  C     C  C     P F P  S+S
Sbjct: 144 ASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCA---PCAECYEQTDPIFEPTSSAS 200

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              L C+  +C     +S+   +C        +N T +   Y V YG G  T G  ++ET
Sbjct: 201 FTSLSCETEQC-----KSLDVSEC--------RNGTCL---YEVSYGDGSYTVGDFVTET 244

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
           + L +  + N  +GC         G+        G G G  S PSQLN   FSYCL+   
Sbjct: 245 VTLGSTSLGNIAIGCG----HNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRD 300

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
            D T+      LD  S  +    T     P   NP++        ++Y+GL  ++VGG  
Sbjct: 301 SDSTS-----TLDFNSPITPDAVTA----PLHRNPNL------DTFFYLGLTGMSVGGAV 345

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + +      +  DGNGG IVDSGT  T +   ++  L D FV      +       A  +
Sbjct: 346 LPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQT------ARGV 399

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASG 432
                C+D+  +     P +  HF  G E+ LP +NY   V      C     TD   S 
Sbjct: 400 ALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLS- 458

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               ILGN Q Q   V +DL N  +GF    C
Sbjct: 459 ----ILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 136/446 (30%), Positives = 191/446 (42%), Gaps = 52/446 (11%)

Query: 28  SLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG 87
           SL+FSL+         S  + NSL SSSL      +NP TKTT+          SS  Y 
Sbjct: 28  SLSFSLTSIPL-----SSHSKNSLFSSSLASQFK-QNPNTKTTSYNYR------SSFKYS 75

Query: 88  -GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
               +SL  GTPPQ    +LDTGS L W       QCK    +   +F P LSSS  +L 
Sbjct: 76  MALIVSLPIGTPPQTQQMVLDTGSQLSWI------QCKVPPKTPPTAFDPLLSSSFSVLP 129

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C              D  L TS +  ++C  Y   Y  G   EG  + E     +
Sbjct: 130 CNHSLCK---------PRVPDYTLPTSCDQNRLC-HYSYFYADGTYAEGNLVREKFTFSS 179

Query: 206 -RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD-TTRTSSL 263
            +  P  ++GC+  SS    GI G   G+ S  S   + KFSYC+   +    ++ T S 
Sbjct: 180 SQTTPPLILGCATDSSDT-QGILGMNLGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGSF 238

Query: 264 ILD-NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
            L  N SS   K    +TY      P     N   + Y + +  I + G+++ +      
Sbjct: 239 YLGPNPSSAGFKYVNLMTYRQSQRMP-----NLDPLAYTLPMLGIRINGKKLNISTSAFR 293

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD- 381
            D  G G T++DSGT FTF+  E +  + +E V             G      L  CFD 
Sbjct: 294 ADPSGAGQTLIDSGTWFTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGS----LDMCFDG 349

Query: 382 ---VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
              V G   G+   +   F+ G E+ +  E   A VG G   CL  +   +  G  S I+
Sbjct: 350 DAMVIGRMIGN---MAFEFENGVEIVVEREKMLADVG-GGVQCLG-IGRSDLLGVASNII 404

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GNF  Q+ +VE+DL  +R+GF +  C
Sbjct: 405 GNFHQQDLWVEFDLVGRRVGFGRTDC 430


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 118/373 (31%), Positives = 174/373 (46%), Gaps = 45/373 (12%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           I+DT S L W  C     C+ C   + P F P  S S   + C +P C  +  +      
Sbjct: 157 IVDTASELTWVQCA---PCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAG 213

Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
               P    +     C SY + Y  G  + G+   + L+L   +I  F+ GC   +   P
Sbjct: 214 AGAPPCDAGRPAA--C-SYALSYRDGSYSRGVLAHDRLSLAGEVIDGFVFGCGTSNQGPP 270

Query: 224 ----AGIAGFGRGKTSLPSQLNLDKF----SYCL-LSHKFDDTTRTSSLIL-DNGSSHSD 273
               +G+ G GR + SL SQ  +D+F    SYCL LS + D    + SL+L D+ S++  
Sbjct: 271 FGGTSGLMGLGRSQLSLVSQ-TVDQFGGVFSYCLPLSRESD---ASGSLVLGDDPSAY-- 324

Query: 274 KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIV 333
           + +T + YT  V+N     +  F   Y V L  ITVGGQ V             +   IV
Sbjct: 325 RNSTPVVYTSMVSNSDPLLQGPF---YLVNLTGITVGGQEVE--------STGFSARAIV 373

Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
           DSGT  T + P ++  +  EF+SQ+ +   Y +A G    + L  CF++ G K    P L
Sbjct: 374 DSGTVITSLVPSVYNAVRAEFMSQLAE---YPQAPG---FSILDTCFNMTGLKEVQVPSL 427

Query: 394 KLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
            L F GGAEV +      YF V  + S VCL V + +  S   + I+GN+Q +N  V +D
Sbjct: 428 TLVFDGGAEVEVDSGGVLYF-VSSDSSQVCLAVASLK--SEDETSIIGNYQQKNLRVVFD 484

Query: 452 LRNQRLGFKQQLC 464
               ++GF Q+ C
Sbjct: 485 TSASQVGFAQETC 497


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 114/386 (29%), Positives = 180/386 (46%), Gaps = 50/386 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +++  G+  Q +  I+DTGS L W  C     C+ C +   P F P  S S + + C 
Sbjct: 122 YIVTMGLGS--QNMSVIVDTGSDLTWVQCE---PCRSCYNQNGPLFKPSTSPSYQPILCN 176

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
           +  C     +S++   C  +P +TS  C      Y+V YG G  T G    E L      
Sbjct: 177 STTC-----QSLELGACGSDP-STSATC-----DYVVNYGDGSYTSGELGIEKLGFGGIS 225

Query: 208 IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTS 261
           + NF+ GC   +       +G+ G GR + S+ SQ N      FSYCL S   D    + 
Sbjct: 226 VSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSYCLPST--DQAGASG 283

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           SL++ N  S   K  T + YT  + N  ++       +Y + L  I VGG  + V     
Sbjct: 284 SLVMGN-QSGVFKNVTPIAYTRMLPNLQLSN------FYILNLTGIDVGGVSLHVQASSF 336

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
                GNGG I+DSGT  + +AP +++ L  +F+ Q      ++    A   + L  CF+
Sbjct: 337 -----GNGGVILDSGTVISRLAPSVYKALKAKFLEQ------FSGFPSAPGFSILDTCFN 385

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTV--VTDREASGGPSIIL 438
           + G    + P + ++F+G AE+ +     F +V E  S VCL +  ++D    G    I+
Sbjct: 386 LTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMG----II 441

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN+Q +N  V YD +  ++GF ++ C
Sbjct: 442 GNYQQRNQRVLYDAKLSQVGFAKEPC 467


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 171/382 (44%), Gaps = 47/382 (12%)

Query: 96  GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS-- 153
           G PPQ    ++DTGS L+W  CT   + K C    +P F    S S   + CQ+  C+  
Sbjct: 93  GDPPQRAEALIDTGSSLIWTQCTACLR-KVCVRQDLPYFNASSSGSFAPVPCQDKACAGN 151

Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLV 213
           ++H              A    CT     + V YG+G   G   ++     +        
Sbjct: 152 YLHF------------CALDGTCT-----FRVTYGAGGIIGFLGTDAFTFQSGGA-TLAF 193

Query: 214 GCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
           GC   +           +G+ G GRG+ SL SQ    +FSYCL  + F +   +S L + 
Sbjct: 194 GCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGAKRFSYCLTPY-FHNNGASSHLFVG 252

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR- 325
             +S S      ++   FV +P   +   +S +YY+ L  ITVG  ++ +      L   
Sbjct: 253 AAASLSGGGGAVMSMA-FVESP---KDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEV 308

Query: 326 -DG--NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +G   GG I+DSG+ FT +  + +EPL  E   Q+  N +     G E   G+  C   
Sbjct: 309 EEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQL--NGSLVPPPG-EDDGGMALCV-A 364

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
            G+     P L LHF GGA++ LP ENY+A + E S  C+ +V     S     I+GNFQ
Sbjct: 365 RGDLDRVVPTLVLHFSGGADMALPPENYWAPL-EKSTACMAIVRGYLQS-----IIGNFQ 418

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN ++ +D+   RL F+   C
Sbjct: 419 QQNMHILFDVGGGRLSFQNADC 440


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 128/431 (29%), Positives = 178/431 (41%), Gaps = 54/431 (12%)

Query: 46  QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFI 105
           ++L SL + S  R +  + P++    +    +     S   G Y + L  GTP   +  +
Sbjct: 96  ESLTSLAAVSAGRNVTKRPPRSAGGFSGVVISGL---SQGSGEYFMRLGVGTPATNMYMV 152

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
           LDTGS +VW  C+    CK C +   P F P  S +   + C +  C  +   S    +C
Sbjct: 153 LDTGSDVVWLQCS---PCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRLDDSS----EC 205

Query: 166 NDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA 224
                  SK C      Y V YG G  T G   +ETL      + +  +GC         
Sbjct: 206 VSR---RSKACL-----YQVSYGDGSFTVGDFSTETLTFHGARVDHVALGC----GHDNE 253

Query: 225 GI-------AGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
           G+        G GRG  S PSQ       KFSYCL+      ++      +  G+    K
Sbjct: 254 GLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPK 313

Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTLDRDGNGGTIV 333
                 +TP + NP +        +YY+ L  I+VGG RV  V      LD  GNGG I+
Sbjct: 314 TA---VFTPLLTNPKL------DTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVII 364

Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
           DSGT+ T +    +  L D F          TR   A + +    CFD+ G  T   P +
Sbjct: 365 DSGTSVTRLTQSAYVALRDAF------RLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTV 418

Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
             HF GG EV+LP  NY   V      C         + G   I+GN Q Q + V YDL 
Sbjct: 419 VFHFTGG-EVSLPASNYLIPVNNQGRFCFAFA----GTMGSLSIIGNIQQQGFRVAYDLV 473

Query: 454 NQRLGFKQQLC 464
             R+GF  + C
Sbjct: 474 GSRVGFLSRAC 484


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 117/412 (28%), Positives = 171/412 (41%), Gaps = 55/412 (13%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
           + N  T+  T   TT   + +S   G Y   +  GTP + +  +LDTGS + W  C    
Sbjct: 135 VYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCE--- 191

Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
            C  C     P F P  SS+ + L C  P+CS +  E+  CR         S  C     
Sbjct: 192 PCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLL--ETSACR---------SNKCL---- 236

Query: 182 SYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRG 232
            Y V YG G  T G   ++T+   N   I N  +GC         G+        G G G
Sbjct: 237 -YQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCG----HDNEGLFTGAAGLLGLGGG 291

Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
             S+ +Q+    FSYCL+     D+ ++SSL       +S +   G    P + N  +  
Sbjct: 292 VLSITNQMKATSFSYCLVDR---DSGKSSSLDF-----NSVQLGGGDATAPLLRNKKI-- 341

Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
                 +YYVGL   +VGG++V +      +D  G+GG I+D GT  T +  + +  L D
Sbjct: 342 ----DTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRD 397

Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
            F+   V  +      G+ +++    C+D     T   P +  HF GG  + LP +NY  
Sbjct: 398 AFLKLTVNLKK-----GSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLI 452

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            V +    C         S   SII GN Q Q   + YDL    +G     C
Sbjct: 453 PVDDSGTFCFAFA---PTSSSLSII-GNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 122/389 (31%), Positives = 166/389 (42%), Gaps = 63/389 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++ S GTPPQ +  + DTGS L+W  C     CK C+     S+ P  SSS   L 
Sbjct: 79  GAYDMTFSMGTPPQTLSALADTGSDLIWAKCG---ACKRCAPRGSASYYPTKSSSFSKLP 135

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLAT---SKNCTQICPSYLVLYG-----SGLTEGIALS 198
           C           S  CR    + LAT   ++    +C SY   YG        T+G   S
Sbjct: 136 C----------SSALCRTLESQSLATCGGTRARGAVC-SYRYSYGLSSNPHHYTQGYMGS 184

Query: 199 ETLNLPNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
           ET  L +  +     GC+ +        +G+ G GRGK SL  QL +  FSYCL S    
Sbjct: 185 ETFTLGSDAVQGIGFGCTTMSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLTS---- 240

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           D + +S L+   G+        G+  TP VN  +       S +Y V L  I++G  +  
Sbjct: 241 DPSTSSPLLFGAGA----LTGPGVQSTPLVNLKT-------STFYTVNLDSISIGAAKT- 288

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
                      G  G I DSGTT TF+A   +       +SQ     N TR  G +   G
Sbjct: 289 --------PGTGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTT---NLTRVPGTD---G 334

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
              CF   G     FP + LHF GG ++ L  ENYF  V +  +  L   +  E S    
Sbjct: 335 YEVCFQTSGGAV--FPSMVLHFDGG-DMALKTENYFGAVNDSVSCWLVQKSPSEMS---- 387

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN    +Y++ YDL    L F+   C
Sbjct: 388 -IVGNIMQMDYHIRYDLDKSVLSFQPTNC 415


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 117/412 (28%), Positives = 171/412 (41%), Gaps = 55/412 (13%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
           + N  T+  T   TT   + +S   G Y   +  GTP + +  +LDTGS + W  C    
Sbjct: 135 VYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE--- 191

Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
            C  C     P F P  SS+ + L C  P+CS +  E+  CR         S  C     
Sbjct: 192 PCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLL--ETSACR---------SNKCL---- 236

Query: 182 SYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRG 232
            Y V YG G  T G   ++T+   N   I N  +GC         G+        G G G
Sbjct: 237 -YQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCG----HDNEGLFTGAAGLLGLGGG 291

Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
             S+ +Q+    FSYCL+     D+ ++SSL       +S +   G    P + N  +  
Sbjct: 292 VLSITNQMKATSFSYCLVDR---DSGKSSSLDF-----NSVQLGGGDATAPLLRNKKI-- 341

Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
                 +YYVGL   +VGG++V +      +D  G+GG I+D GT  T +  + +  L D
Sbjct: 342 ----DTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRD 397

Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
            F+   V  +      G+ +++    C+D     T   P +  HF GG  + LP +NY  
Sbjct: 398 AFLKLTVNLKK-----GSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLI 452

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            V +    C         S   SII GN Q Q   + YDL    +G     C
Sbjct: 453 PVDDSGTFCFAFA---PTSSSLSII-GNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 115/390 (29%), Positives = 171/390 (43%), Gaps = 41/390 (10%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP      ++DTGS LVW  C+    C+ C + +   F P+ SS+ R + 
Sbjct: 84  GEYFALVGVGTPSTKAMLVIDTGSDLVWLQCS---PCRRCYAQRGQVFDPRRSSTYRRVP 140

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN 205
           C +P+C  +           D   A    C      Y+V YG G +  G   ++ L   N
Sbjct: 141 CSSPQCRALRFPGC------DSGGAAGGGCR-----YMVAYGDGSSSTGDLATDKLAFAN 189

Query: 206 RI-IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
              + N  +GC   +       AG+ G GRGK S+ +Q+       F YCL   +   +T
Sbjct: 190 DTYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCL-GDRTSRST 248

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW- 317
           R+S L+            T L   P    PS+         YYV +   +VGG+RV  + 
Sbjct: 249 RSSYLVFGRTPEPPSTAFTALLSNP--RRPSL---------YYVDMAGFSVGGERVTGFS 297

Query: 318 HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
           +  L LD   G GG +VDSGT  +  A + +  L D F ++           G  ++   
Sbjct: 298 NASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRR-LAGEHSV--F 354

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV--GEGSAVCLTVVTDREASGGP 434
             C+D+ G    S P + LHF GGA++ LP ENYF  V  G   A         EA+   
Sbjct: 355 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 414

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             ++GN Q Q + V +D+  +R+GF  + C
Sbjct: 415 LSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 114/389 (29%), Positives = 166/389 (42%), Gaps = 54/389 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+P ++   ++DTGS + W  C+    CK C       F P+ SSS R L 
Sbjct: 12  GEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCS---PCKSCYKQNDAVFDPRASSSFRRLS 68

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P+C  +  ++           +T   C      Y V YG G  T G   S++ ++  
Sbjct: 69  CSTPQCKLLDVKACA---------STDNRCL-----YQVSYGDGSFTVGDLASDSFSVSR 114

Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
                 + GC         G+        G G GK S PSQL+  KFSYCL+S   D+  
Sbjct: 115 GRTSPVVFGCG----HDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSR--DNGV 168

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
           R SS +L   S+     +    YT  + NP +        +YY GL  I++GG  + +  
Sbjct: 169 RASSALLFGDSAL--PTSASFAYTQLLKNPKL------DTFYYAGLSGISIGGTLLSIPS 220

Query: 319 KYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGL 376
               L    G GG I+DSGT+ T +    +  + D F       R+ T+ L  A   +  
Sbjct: 221 TAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAF-------RSATQKLPRAADFSLF 273

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPS 435
             C+D     + + P +  HF+GGA V LP  NY   V      C     T  + S    
Sbjct: 274 DTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS---- 329

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q Q   V  DL + R+GF  + C
Sbjct: 330 -IIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 125/395 (31%), Positives = 167/395 (42%), Gaps = 53/395 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP      +LDTGS +VW  C     C+ C     P F P+ SSS   +G
Sbjct: 127 GEYFTKIGVGTPATQALMVLDTGSDVVWVQCA---PCRRCYEQSGPVFDPRRSSSYGAVG 183

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C    C  +       R            C      Y V YG G +T G  ++ETL    
Sbjct: 184 CGAALCRRLDSGGCDLR---------RGACM-----YQVAYGDGSVTAGDFVTETLTFAG 229

Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH------ 252
              +    +GC   +       AG+ G GRG  S P+Q++      FSYCL+        
Sbjct: 230 GARVARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAG 289

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
               + R+S++    GS  +   +    +TP V NP +        +YYV L  I+VGG 
Sbjct: 290 AAPGSHRSSTVSFGAGSVGASSAS----FTPMVRNPRM------ETFYYVQLVGISVGGA 339

Query: 313 RV-RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           RV  V    L LD   G GG IVDSGT+ T +A   +  L D F +           L  
Sbjct: 340 RVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLR----LSP 395

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
              +    C+D+ G +    P + +HF GGAE  LP ENY   V      C     TD  
Sbjct: 396 GGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD-- 453

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             GG SII GN Q Q + V +D   QR+GF  + C
Sbjct: 454 --GGVSII-GNIQQQGFRVVFDGDGQRVGFAPKGC 485


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 113/388 (29%), Positives = 160/388 (41%), Gaps = 49/388 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP     ++D+GS ++W  C     C  C +   P F P  S++   + 
Sbjct: 123 GEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK---PCLECYAQADPLFDPASSATFSAVS 179

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C      +++   C D     S  C      Y V YG G  T+G    ETL L  
Sbjct: 180 CGSAIC-----RTLRTSGCGD-----SGGC-----EYEVSYGDGSYTKGTLALETLTLGG 224

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDT-- 257
             +    +GC   +       AG+ G G G  SL  QL       FSYCL S     +  
Sbjct: 225 TAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGA 284

Query: 258 -TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
                SL+L      S+    G  + P V NP          +YYVG+  I VG +R+ +
Sbjct: 285 ADAAGSLVL----GRSEAVPEGAVWVPLVRNPQAPS------FYYVGVSGIGVGDERLPL 334

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                 L  DG GG ++D+GT  T +  E +  L D FV  +       RA G   L   
Sbjct: 335 QDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAV---GALPRAPGVSLLD-- 389

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C+D+ G  +   P +  +F G A +TLP  N    V +G   CL       +S G S 
Sbjct: 390 -TCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEV-DGGIYCLAFA---PSSSGLS- 443

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ILGN Q +   +  D  N  +GF    C
Sbjct: 444 ILGNIQQEGIQITVDSANGYIGFGPATC 471


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 129/394 (32%), Positives = 180/394 (45%), Gaps = 64/394 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNH-YQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           Y + ++ GTPP  +  I DTGS LVW  C++         +     F P  SS+   L C
Sbjct: 103 YLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLSC 162

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNR 206
           Q+  C  +   S    D + E       C      Y   YG G  T G+  +ET +  + 
Sbjct: 163 QSNACQALSQASC---DADSE-------C-----QYQYSYGDGSRTIGVLSTETFSFVDG 207

Query: 207 ------IIPNFLVGCSVLSSR--QPAGIAGFGRGKTSLPSQL----NLD-KFSYCLLSHK 253
                  +P    GCS  S+   +  G+ G G G  SL SQL    ++D K SYCL+   
Sbjct: 208 GGKGQVRVPRVNFGCSTASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSY 267

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
             D   +S+L   N  S +     G   TP V  PS  +      YY V L  + VGGQ 
Sbjct: 268 --DANSSSTL---NFGSRAVVSEPGAASTPLV--PSDVDS-----YYTVALESVAVGGQE 315

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           V      +          IVDSGTT TF+ P L  PL    V+++ +     R    E L
Sbjct: 316 VATHDSRI----------IVDSGTTLTFLDPALLGPL----VTELERRIKLQRVQPPEQL 361

Query: 374 TGLRPCFDVPGE-KTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
             L+ C+DV G+ +T +F  P++ L F GGA VTL  EN F+++ EG+ +CL +V   E+
Sbjct: 362 --LQLCYDVQGKSETDNFGIPDVTLRFGGGAAVTLRPENTFSLLQEGT-LCLVLVPVSES 418

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              P  ILGN   QN++V YDL  + + F    C
Sbjct: 419 Q--PVSILGNIAQQNFHVGYDLDARTVTFAAADC 450


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 115/412 (27%), Positives = 170/412 (41%), Gaps = 55/412 (13%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
           + N  T+      TT   +  S   G Y   +  GTP + +  +LDTGS + W  C    
Sbjct: 135 VNNEDTRYQPEALTTPVVSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE--- 191

Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
            C  C     P F P  SS+ + L C  P+CS +  E+  CR         S  C     
Sbjct: 192 PCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSLL--ETSACR---------SNKCL---- 236

Query: 182 SYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRG 232
            Y V YG G  T G   ++T+   N   I +  +GC         G+        G G G
Sbjct: 237 -YQVSYGDGSFTVGELATDTVTFGNSGKINDVALGCG----HDNEGLFTGAAGLLGLGGG 291

Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
             S+ +Q+    FSYCL+     D+ ++SSL       +S +  +G    P + N  +  
Sbjct: 292 ALSITNQMKATSFSYCLVDR---DSGKSSSLDF-----NSVQLGSGDATAPLLRNQKI-- 341

Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
                 +YYVGL   +VGGQ+V +      +D  G+GG I+D GT  T +  + +  L D
Sbjct: 342 ----DTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRD 397

Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
            F+      +      G  +++    C+D     +   P +  HF GG  + LP +NY  
Sbjct: 398 AFLKLTTNLKK-----GTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLDLPAKNYLI 452

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            V +    C         S   SII GN Q Q   + YDL N+ +G     C
Sbjct: 453 PVDDNGTFCFAFA---PTSSSLSII-GNVQQQGTRITYDLANKIIGLSGNKC 500


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 135/479 (28%), Positives = 198/479 (41%), Gaps = 68/479 (14%)

Query: 9   CLSFIFFFTLL-----SIFPSSITSLTFSLSRF--------HTNPSQDSYQN-LNSLVSS 54
           C +  F F LL     ++ P +  S T  LS             P Q+S+ N + ++ S 
Sbjct: 6   CAATFFLFALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASK 65

Query: 55  SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
              R  ++     + TT         +       Y + +  GTP Q +  +LDT +   W
Sbjct: 66  DPERLKYLSTLADQKTTAVPIAPGQQV--LKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123

Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
            PC+    C  CSS+   +F+P  S++   L C   +CS +   S         P   S 
Sbjct: 124 VPCSG---CTGCSST---TFLPNASTTLGSLDCSGAQCSQVRGFSC--------PATGSS 169

Query: 175 NCTQICPSYLVLYG--SGLTEGIALSETLNLPNRIIPNFLVGC-SVLS--SRQPAGIAGF 229
            C      +   YG  S LT  + + + + L N +IP F  GC + +S  S  P G+ G 
Sbjct: 170 ACL-----FNQSYGGDSSLTATL-VQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGL 223

Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
           GRG  SL SQ        FSYCL S  F     + SL L         +TT L   P  +
Sbjct: 224 GRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSLKLGPVGQPKSIRTTPLLRNP--H 279

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
            PS+         YYV L  ++VG  +V +  + L  D +   GTI+DSGT  T     +
Sbjct: 280 RPSL---------YYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPV 330

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
           +  + DEF  Q+        +LGA        CF    E     P + LHF+G   + LP
Sbjct: 331 YFAIRDEFRKQV---NGPISSLGA-----FDTCFAATNEAEA--PAITLHFEG-LNLVLP 379

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           +EN       GS  CL++            ++ N Q QN  + +D  N RLG  ++LC 
Sbjct: 380 MENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELCN 438


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 118/399 (29%), Positives = 170/399 (42%), Gaps = 66/399 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ I  +LDTGS L+W  C     C  C     P F P++SSS   + C 
Sbjct: 98  YVLDLAVGTPPQPITALLDTGSDLIWTQCDT---CTACLRQPDPLFSPRMSSSYEPMRCA 154

Query: 149 NPKCSWI-HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN- 205
              C  I HH  ++   C                +Y   YG G T  G   +E     + 
Sbjct: 155 GQLCGDILHHSCVRPDTC----------------TYRYSYGDGTTTLGYYATERFTFASS 198

Query: 206 ----RIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
               + +P    GC  +   S    +GI GFGR   SL SQL++ +FSYCL  +    ++
Sbjct: 199 SGETQSVP-LGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCLTPYA---SS 254

Query: 259 RTSSLILDNGSSHS--DKKTTGLTYTPFVN---NPSVAERNAFSVYYYVGLRRITVGGQR 313
           R S+L   + +     D  T  +  TP +    NP+         +YYV    +TVG +R
Sbjct: 255 RKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPT---------FYYVAFTGVTVGARR 305

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           +R+      L  DG+GG I+DSGT  T     +   +   F SQ+        A G+   
Sbjct: 306 LRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQL----RLPFANGSSPD 361

Query: 374 TGLRPCFDVPG--------EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
            G+  CF  P          +  + P +  HF+ GA++ LP ENY         +C+ + 
Sbjct: 362 DGV--CFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLL- 417

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                SG     +GNF  Q+  V YDL  + L F    C
Sbjct: 418 ---GDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|2245012|emb|CAB10432.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268406|emb|CAB78698.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1046

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 125/406 (30%), Positives = 179/406 (44%), Gaps = 80/406 (19%)

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
           LDTGS LVWFPC   + C  C S  +P   P   SSS      +       H S+   D 
Sbjct: 130 LDTGSDLVWFPC-RPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSAAHSSLPSSD- 187

Query: 166 NDEPLATSKNC-------------TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
               L    NC             +  CP +   YG G       S++L+LP+  + NF 
Sbjct: 188 ----LCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFT 243

Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKFDD--TTRTSSLI 264
            GC+  +  +P G+AGFGRG+ SLP+QL +      + FSYCL+SH FD     R S LI
Sbjct: 244 FGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLI 303

Query: 265 LDNGSSHSDKKT----------------TGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           L       +K+                     +T  + NP          +Y V L+ I+
Sbjct: 304 LGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPK------HPYFYSVSLQGIS 357

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           +G + +        +D++G GG +VDSGTTFT +  + +  + +EF S++   R + RA 
Sbjct: 358 IGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRV--GRVHERAD 415

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGG-AEVTLPVENYFAVVGEG--------SA 419
             E  +                  L LHF G  + VTLP  NYF    +G          
Sbjct: 416 RVEPSSA-----------------LVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKI 458

Query: 420 VCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
            CL ++    + E  GG   ILGN+Q Q + V YDL N+R+GF ++
Sbjct: 459 GCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKR 504


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 87/258 (33%), Positives = 130/258 (50%), Gaps = 29/258 (11%)

Query: 208 IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
           +P    GC + ++        GIAGFGRG  SLPSQL +  FS+C  +    +  + S++
Sbjct: 91  VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKQSTV 147

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
           +LD  +         +  TP + N      +A   +YY+ L+ ITVG  R+ V      L
Sbjct: 148 LLDLPADLYKNGRGAVQSTPLIQN------SANPTFYYLSLKGITVGSTRLPVPESAFAL 201

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
             +G GGTI+DSGT+ T + P++++ + DEF +Q+         +     TG   CF  P
Sbjct: 202 -TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI------KLPVVPGNATGPYTCFSAP 254

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGGPSIILGN 440
            +     P+L LHF+ GA + LP ENY   V +    S +CL +    E +     I+GN
Sbjct: 255 SQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGDETT-----IIGN 308

Query: 441 FQMQNYYVEYDLRNQRLG 458
           FQ QN +V YDL+N   G
Sbjct: 309 FQQQNMHVLYDLQNMHRG 326


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 115/396 (29%), Positives = 176/396 (44%), Gaps = 52/396 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   ++ G PP     ++DTGS L+W  C     C++C     P + P+ SS+ R + 
Sbjct: 86  GEYFAVINVGDPPTRALVVIDTGSDLIWLQCV---PCRHCYRQVTPLYDPRSSSTHRRIP 142

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +P+C     + ++   C+    A +  C      Y+V+YG G  + G   ++ L  P+
Sbjct: 143 CASPRC----RDVLRYPGCD----ARTGGCV-----YMVVYGDGSASSGDLATDRLVFPD 189

Query: 206 RI-IPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
              + N  +GC   +V      AG+ G GRG+ S P+QL       FSYCL     D  +
Sbjct: 190 DTHVHNVTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCL----GDRLS 245

Query: 259 RTSSLILDNGSSH----SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
           R       NGSS+       +     +TP   NP           YYV +   +VGG+RV
Sbjct: 246 RAQ-----NGSSYLVFGRTPEPPSTAFTPLRTNPRRPS------LYYVDMVGFSVGGERV 294

Query: 315 RVW-HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
             + +  L L+   G GG +VDSGT  +  A + +  + D F S         R L A  
Sbjct: 295 TGFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAA-GTMRKL-ATK 352

Query: 373 LTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
            +    C+D+ G    +     P + LHF GGA++ LP  NY   V  G       +  +
Sbjct: 353 FSVFDACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQ 412

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            A  G + +LGN Q Q + + +D+   R+GF    C
Sbjct: 413 AADDGLN-VLGNVQQQGFGLVFDVERGRIGFTPNGC 447


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 119/393 (30%), Positives = 179/393 (45%), Gaps = 52/393 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + L  GTP + +  ++DTGS L W  C     CK C     P F P+ SSS + + 
Sbjct: 52  GEYFVRLGLGTPARSLFMVVDTGSDLPWLQCQ---PCKSCYKQADPIFDPRNSSSFQRIP 108

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSE--TLNL 203
           C +P C     ++++   C+    ATS+     C SY V YG G  + G   S+  TL  
Sbjct: 109 CLSPLC-----KALEVHSCSGSRGATSR-----C-SYQVAYGDGSFSVGDFSSDLFTLGT 157

Query: 204 PNRIIPNFLVGCSV---LSSRQPAGIAGFGRGKTSLPSQL--------NLDKFSYCLLSH 252
            ++ + +   GC           AG+ G G GK S PSQ+          + FSYCL+  
Sbjct: 158 GSKAM-SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 216

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
               T  +SSLI    +  S   T  L+  P + NP +        +YY  +  ++VGG 
Sbjct: 217 SNPMTRSSSSLIFGVAAIPS---TAALS--PLLKNPKL------DTFYYAAMIGVSVGGA 265

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAE 371
           ++ +  K L L + G+GG I+DSGT+ T     ++  + D F       RN T  L  A 
Sbjct: 266 QLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAF-------RNATINLPSAP 318

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
             +    C++  G+ +   P L LHF+ GA++ LP  NY   +    + CL         
Sbjct: 319 RYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMEL 378

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           G    I+GN Q Q++ + +DL+   L F  Q C
Sbjct: 379 G----IIGNIQQQSFRIGFDLQKSHLAFAPQQC 407


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 131/437 (29%), Positives = 197/437 (45%), Gaps = 72/437 (16%)

Query: 46  QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY---GGYSISLSFGTPPQII 102
           + +  LV+ S  R   +   +  +++ ++   TT++ S  +   GGY + +S GTP +  
Sbjct: 10  EAIRGLVAKSHARVRWMA-ARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRF 68

Query: 103 PFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
             I DTGS LVW    PCT       CS   I  F P+ SS+ R + C +  C+ +    
Sbjct: 69  RAIADTGSDLVWVQSEPCTG------CSGGTI--FDPRQSSTFREMDCSSQLCTELPGSC 120

Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-----PNRIIPNFLVG 214
                   EP   S  C     SY   YGSG TEG    +T++L      ++  P+F VG
Sbjct: 121 --------EP--GSSAC-----SYSYEYGSGETEGEFARDTISLGTTSGGSQKFPSFAVG 165

Query: 215 CSVLSSRQPA--GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGS 269
           C +++S      G+ G G+G  SL SQL+     KFSYCL+    +  + +S L+    +
Sbjct: 166 CGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLV--DINSQSESSPLLFGPSA 223

Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
           +           TP    PS    + +  YY + +  I V GQ         T+   G  
Sbjct: 224 ALHGTGIQSTKITP----PS----DTYPTYYLLTVNGIAVAGQ---------TMGSPGT- 265

Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS 389
            TI+DSGTT T++   ++       +S+M       R  G+    GL  C+D    +   
Sbjct: 266 -TIIDSGTTLTYVPSGVY----GRVLSRMESMVTLPRVDGSS--MGLDLCYDRSSNRNYK 318

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGNFQMQNYYV 448
           FP L +    GA +T P  NYF VV + G  VCL + +   A G P  I+GN   Q Y++
Sbjct: 319 FPALTIRLA-GATMTPPSSNYFLVVDDSGDTVCLAMGS---AGGLPVSIIGNVMQQGYHI 374

Query: 449 EYDLRNQRLGFKQQLCK 465
            YD  +  L F Q  C+
Sbjct: 375 LYDRGSSELSFVQAKCE 391


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 118/398 (29%), Positives = 175/398 (43%), Gaps = 58/398 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G PP     ++DTGS L+W  C     C+ C     P + P+ S + R + 
Sbjct: 90  GEYFAVIGVGDPPTHALVVIDTGSDLIWLQC---LPCRRCYRQVTPLYDPRNSKTHRRIP 146

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +P+C  +    ++   C+    A +  C      Y+V+YG G  + G   ++TL LP+
Sbjct: 147 CASPQCRGV----LRYPGCD----ARTGGCV-----YMVVYGDGSASSGDLATDTLVLPD 193

Query: 206 RI-IPNFLVGC-----SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
              + N  +GC      +L+S   AG+ G GRG+ S P+QL       FSYCL       
Sbjct: 194 DTRVHNVTLGCGHDNEGLLAS--AAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRM--S 249

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
             R SS  L  G +     T    +TP   NP           YYV +   +VGG+RV  
Sbjct: 250 RARNSSSYLVFGRTPELPST---AFTPLRTNPRRPS------LYYVDMVGFSVGGERVAG 300

Query: 317 W-HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ-----MVKNRNYTRALG 369
           + +  L L+   G GG +VDSGT  +    + +  + D FVS      M + RN      
Sbjct: 301 FSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRN------ 354

Query: 370 AEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
               +    C+DV G   G+    P + LHF   A++ LP  NY   V  G       + 
Sbjct: 355 --KFSVFDTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLG 412

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + A  G + +LGN Q Q + V +D+   R+GF    C
Sbjct: 413 LQAADDGLN-VLGNVQQQGFGVVFDVERGRIGFTPNGC 449


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 118/399 (29%), Positives = 170/399 (42%), Gaps = 66/399 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ I  +LDTGS L+W  C     C  C     P F P++SSS   + C 
Sbjct: 98  YVLDLAVGTPPQPITALLDTGSDLIWTQCDT---CTACLRQPDPLFSPRMSSSYEPMRCA 154

Query: 149 NPKCSWI-HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN- 205
              C  I HH  ++   C                +Y   YG G T  G   +E     + 
Sbjct: 155 GQLCGDILHHSCVRPDTC----------------TYRYSYGDGTTTLGYYATERFTFASS 198

Query: 206 ----RIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
               + +P    GC  +   S    +GI GFGR   SL SQL++ +FSYCL  +    ++
Sbjct: 199 SGETQSVP-LGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCLTPYA---SS 254

Query: 259 RTSSLILDNGSSHS--DKKTTGLTYTPFVN---NPSVAERNAFSVYYYVGLRRITVGGQR 313
           R S+L   + +     D  T  +  TP +    NP+         +YYV    +TVG +R
Sbjct: 255 RKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPT---------FYYVAFTGVTVGARR 305

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           +R+      L  DG+GG I+DSGT  T     +   +   F SQ+        A G+   
Sbjct: 306 LRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQL----RLPFANGSSPD 361

Query: 374 TGLRPCFDVPG--------EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
            G+  CF  P          +  + P +  HF+ GA++ LP ENY         +C+ + 
Sbjct: 362 DGV--CFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLL- 417

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                SG     +GNF  Q+  V YDL  + L F    C
Sbjct: 418 ---GDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 114/389 (29%), Positives = 165/389 (42%), Gaps = 54/389 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+P ++   ++DTGS + W  C+    CK C       F P+ SSS R L 
Sbjct: 12  GEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCS---PCKSCYKQNDAVFDPRASSSFRRLS 68

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P+C  +  ++           +T   C      Y V YG G  T G   S++  +  
Sbjct: 69  CSTPQCKLLDVKACA---------STDNRCL-----YQVSYGDGSFTVGDLASDSFLVSR 114

Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
                 + GC         G+        G G GK S PSQL+  KFSYCL+S   D+  
Sbjct: 115 GRTSPVVFGCG----HDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSR--DNGV 168

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
           R SS +L   S+     +    YT  + NP +        +YY GL  I++GG  + +  
Sbjct: 169 RASSALLFGDSAL--PTSASFAYTQLLKNPKL------DTFYYAGLSGISIGGTLLSIPS 220

Query: 319 KYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGL 376
               L    G GG I+DSGT+ T +    +  + D F       R+ T+ L  A   +  
Sbjct: 221 TAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAF-------RSATQKLPRAADFSLF 273

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPS 435
             C+D     + + P +  HF+GGA V LP  NY   V      C     T  + S    
Sbjct: 274 DTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS---- 329

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q Q   V  DL + R+GF  + C
Sbjct: 330 -IIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|118484651|gb|ABK94196.1| unknown [Populus trichocarpa]
          Length = 125

 Score =  127 bits (320), Expect = 8e-27,   Method: Composition-based stats.
 Identities = 62/127 (48%), Positives = 87/127 (68%), Gaps = 8/127 (6%)

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGA 401
           M   ++E +A EF  Q+    +YT A   +  TGLRPCF++ GEK+ S PE   HFKGGA
Sbjct: 1   MEKPVYELVAKEFEKQVA---HYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGA 57

Query: 402 EVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQNYYVEYDLRNQRL 457
           ++ LP+ NYF+ V  G  +CLT+V+D  +     GGP+IILGN+Q +N++VE+DL+N+R 
Sbjct: 58  KMALPLANYFSFVDSG-VICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERF 116

Query: 458 GFKQQLC 464
           GFKQQ C
Sbjct: 117 GFKQQNC 123


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 113/384 (29%), Positives = 159/384 (41%), Gaps = 47/384 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G+P + +  +LDTGS + W  C     C  C     P F P LS+S   + 
Sbjct: 167 GEYFSRVGIGSPARELYMVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSASYAAVS 223

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +P+C          RD +    A  +N T  C  Y V YG G  T G   +ETL L +
Sbjct: 224 CDSPRC----------RDLD---TAACRNATGAC-LYEVAYGDGSYTVGDFATETLTLGD 269

Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
              + N  +GC   +       AG+   G G  S PSQ++   FSYCL+     D+   S
Sbjct: 270 STPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAAS 326

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L      + +D  T      P V +P          +YYV L  I+VGGQ + +     
Sbjct: 327 TLQFGADGAEADTVTA-----PLVRSPRTG------TFYYVALSGISVGGQALSIPSSAF 375

Query: 322 TLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
            +D   G+GG IVDSGT  T +    +  L D FV      R          ++    C+
Sbjct: 376 AMDATSGSGGVIVDSGTAVTRLQSSAYAALRDAFV------RGTPSLPRTSGVSLFDTCY 429

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           D+    +   P + L F+GG  + LP +NY   V      CL       A      I+GN
Sbjct: 430 DLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGN 485

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q Q   V +D     +GF    C
Sbjct: 486 VQQQGTRVSFDTAKGVVGFTPNKC 509


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 114/390 (29%), Positives = 170/390 (43%), Gaps = 41/390 (10%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP      ++DTGS LVW  C+    C+ C + +   F P+ SS+ R + 
Sbjct: 84  GEYFALVGVGTPSTKAMLVIDTGSDLVWLQCS---PCRRCYAQRGQVFDPRRSSTYRRVP 140

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN 205
           C +P+C  +           D   A    C      Y+V YG G +  G   ++ L   N
Sbjct: 141 CSSPQCRALRFPGC------DSGGAAGGGCR-----YMVAYGDGSSSTGELATDKLAFAN 189

Query: 206 RI-IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
              + N  +GC   +       AG+ G  RGK S+ +Q+       F YCL   +   +T
Sbjct: 190 DTYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCL-GDRTSRST 248

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW- 317
           R+S L+            T L   P    PS+         YYV +   +VGG+RV  + 
Sbjct: 249 RSSYLVFGRTPEPPSTAFTALLSNP--RRPSL---------YYVDMAGFSVGGERVTGFS 297

Query: 318 HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
           +  L LD   G GG +VDSGT  +  A + +  L D F ++           G  ++   
Sbjct: 298 NASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRR-LAGEHSV--F 354

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV--GEGSAVCLTVVTDREASGGP 434
             C+D+ G    S P + LHF GGA++ LP ENYF  V  G   A         EA+   
Sbjct: 355 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 414

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             ++GN Q Q + V +D+  +R+GF  + C
Sbjct: 415 LSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 173/384 (45%), Gaps = 37/384 (9%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           S+  S G+P   +  I+DTGS L W  C     C  C + + P F P  S++   + C  
Sbjct: 149 SLGGSSGSPAANLTVIVDTGSDLTWVQCK---PCSACYAQRDPLFDPAGSATYAAVRCNA 205

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRII 208
             C+    +S++          ++   ++ C  Y + YG G  + G+  ++T+ L    +
Sbjct: 206 SACA----DSLRAATGTPGSCGSTGAGSEKC-YYALAYGDGSFSRGVLATDTVALGGASL 260

Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTS 261
             F+ GC  LS+R      AG+ G GR + SL SQ        FSYCL +    D + + 
Sbjct: 261 GGFVFGCG-LSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSL 319

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           SL   + ++ S + TT + YT  + +P      A   +Y++ +    VGG         L
Sbjct: 320 SLGGGDDAASSYRNTTPVAYTRMIADP------AQPPFYFLNVTGAAVGG-------TAL 366

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
                G    ++DSGT  T +AP ++  +  EF+ Q      Y  A G    + L  C+D
Sbjct: 367 AAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQF-GAAGYPAAPG---FSILDTCYD 422

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           + G      P L L  +GGA+VT+      F V  +GS VCL + +       P  I+GN
Sbjct: 423 LTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETP--IIGN 480

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
           +Q +N  V YD    RLGF  + C
Sbjct: 481 YQQKNKRVVYDTLGSRLGFADEDC 504


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 117/409 (28%), Positives = 181/409 (44%), Gaps = 65/409 (15%)

Query: 68  KTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
           +T+ ++      N+   S  G Y I + FGTP Q +  ++DTGS + W PC    QC+ C
Sbjct: 93  RTSRSSKQDANANVPVRSGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCK---QCQGC 149

Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC--TQICPSYL 184
            S+  P F P  SSS +   C +  C  I                 S NC     C  + 
Sbjct: 150 HSTA-PIFDPAKSSSYKPFACDSQPCQEI-----------------SGNCGGNSKC-QFE 190

Query: 185 VLYGSGL-TEGIALSETLNLPNRIIPNFLVGC--SVLSSRQPA-----GIAGFGRGKTSL 236
           V YG G   +G   S+ + L ++ +PNF  GC  S+     P+        G     T  
Sbjct: 191 VSYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTSPSPGLMGLGGGSLSLLTQA 250

Query: 237 P-SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNA 295
           P ++L    FSYCL S     +T + SL+L   ++ S   ++ L +T  + +PS+     
Sbjct: 251 PTAELFGGTFSYCLPSS----STSSGSLVLGKEAAVS---SSSLKFTTLIKDPSIP---- 299

Query: 296 FSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
              +Y+V L+ I+VG  R+ V    +       GGTI+DSGTT T + P  +  L D F 
Sbjct: 300 --TFYFVTLKAISVGNTRISVPGTNIA----SGGGTIIDSGTTITHLVPSAYTALRDAFR 353

Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG 415
            Q+        +L    +  +  C+D+        P + LH     ++ LP EN   +  
Sbjct: 354 QQL-------SSLQPTPVEDMDTCYDLSSSSV-DVPTITLHLDRNVDLVLPKENIL-ITQ 404

Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E    CL   +    S     I+GN Q QN+ + +D+ N ++GF Q+ C
Sbjct: 405 ESGLACLAFSSTDSRS-----IIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 125/424 (29%), Positives = 185/424 (43%), Gaps = 57/424 (13%)

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
           LTR+   ++ QTK  +        +  S   G Y I +S GTPP+ +  ++DTGS ++W 
Sbjct: 26  LTRS-RSRDRQTKVPSQDFQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMDTGSDILWL 84

Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
            C     C Y  S  I  F P  SS+   LGC   +C  +   + Q   C          
Sbjct: 85  QCAPCVNC-YHQSDAI--FDPYKSSTYSTLGCSTRQCLNLDIGTCQANKC---------- 131

Query: 176 CTQICPSYLVLYGSGL-------TEGIALSETLNLPNRIIPNFLVGCSVLSSR---QPAG 225
                  Y V YG G        T+ ++L+ T  +   ++    +GC   +       AG
Sbjct: 132 ------LYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAAG 185

Query: 226 IAGFGRGKTSLPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYT 282
           + G G+G  S P+Q+   N  +FSYCL   +  D+T  SSL+       +     G  +T
Sbjct: 186 LLGLGKGPLSFPNQVDPQNGGRFSYCLTDRE-TDSTEGSSLVF----GEAAVPPAGARFT 240

Query: 283 PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFM 342
           P  +N  V        +YY+ +  I+VGG  + +      LD  GNGG I+DSGT+ T +
Sbjct: 241 PQDSNMRVP------TFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRL 294

Query: 343 APELFEPLADEFVSQMVKNRNYTRALGAEA-LTGLRPCFDVPGEKTGSFPELKLHFKGGA 401
               +  L D F       R  T  L   A  +    C+D+ G  +   P + LHF+GG 
Sbjct: 295 QNAAYASLRDAF-------RAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGT 347

Query: 402 EVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
           ++ LP  NY   V   +  CL        + GPSII GN Q Q + V YD  + ++GF  
Sbjct: 348 DLKLPASNYLIPVDNSNTFCLAFA----GTTGPSII-GNIQQQGFRVIYDNLHNQVGFVP 402

Query: 462 QLCK 465
             C 
Sbjct: 403 SQCN 406


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 161/389 (41%), Gaps = 41/389 (10%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    G+P Q +   LDT +   W  C+    C  C SS +  F P  SSS   L C 
Sbjct: 81  YVVRAGLGSPSQQLLLALDTSADATWAHCS---PCGTCPSSSL--FAPANSSSYASLPCS 135

Query: 149 NPKCSWIHHESIQCRDCNDE---PLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
           +  C     ++        +   P AT   C    P     + + L      S+TL L  
Sbjct: 136 SSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALA-----SDTLRLGK 190

Query: 206 RIIPNFLVGCSVLSSRQPA------GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDD 256
             IPN+  GC V S   P       G+ G GRG  +L SQ   L    FSYCL S++   
Sbjct: 191 DAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYR--S 247

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
              + SL L  G          + YTP + NP    R++    YYV +  ++VG   V+V
Sbjct: 248 YYFSGSLRLGAGGGQPRS----VRYTPMLRNP---HRSSL---YYVNVTGLSVGRAWVKV 297

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                  D     GT+VDSGT  T     ++  L +EF  Q+     YT +LGA      
Sbjct: 298 PAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----F 351

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF+      G  P + +H  GG ++ LP+EN           CL +    +       
Sbjct: 352 DTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVN 411

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ++ N Q QN  V +D+ N R+GF ++ C 
Sbjct: 412 VIANLQQQNIRVVFDVANSRIGFAKESCN 440


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 89/267 (33%), Positives = 132/267 (49%), Gaps = 32/267 (11%)

Query: 208 IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
           +P    GC + ++        GIAGFGRG  SLPSQL +  FS+C  +    +  + S++
Sbjct: 243 VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKQSTV 299

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
           +LD  +         +  TP + N      +A    YY+ L+ ITVG  R+ V      L
Sbjct: 300 LLDLLADLYKNGRGAVQSTPLIQN------SANPTLYYLSLKGITVGSTRLPVPESAFAL 353

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
             +G GGTI+DSGT+ T + P++++ + DEF +Q+         +     TG   CF  P
Sbjct: 354 -TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI------KLPVVPGNATGPYTCFSAP 406

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTV--VTDREASGGPSIIL 438
            +     P+L LHF+ GA + LP ENY   V +    S +CL +  + D  A+      +
Sbjct: 407 SQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSMICLAINELGDERAT------I 459

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           GNFQ QN +V YDL+N  L F    C 
Sbjct: 460 GNFQQQNMHVLYDLQNNMLSFVAAQCD 486



 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 75/148 (50%), Gaps = 16/148 (10%)

Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
           G   ITVG  R+ V      L  +G GGTI+DSGT+ T + P++++ + DEF +Q+    
Sbjct: 38  GRPGITVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---- 92

Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSA 419
                +     TG   CF  P +     P+L LHF+ GA + LP ENY   V +    S 
Sbjct: 93  --KLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSI 149

Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYY 447
           +CL +       G  + I+GNFQ QN +
Sbjct: 150 ICLAI-----NKGDETTIIGNFQQQNMH 172


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 115/384 (29%), Positives = 164/384 (42%), Gaps = 47/384 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G+P + +  +LDTGS + W  C     C  C     P F P LS+S   + 
Sbjct: 164 GEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSASYAAVS 220

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C          +S +CRD +    A  +N T  C  Y V YG G  T G   +ETL L +
Sbjct: 221 C----------DSQRCRDLD---TAACRNATGAC-LYEVAYGDGSYTVGDFATETLTLGD 266

Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
              + N  +GC   +       AG+   G G  S PSQ++   FSYCL+     D+   S
Sbjct: 267 STPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAAS 323

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +L   +G++ +     G    P V +P        S +YYV L  I+VGGQ + +     
Sbjct: 324 TLQFGDGAAEA-----GTVTAPLVRSPRT------STFYYVALSGISVGGQPLSIPASAF 372

Query: 322 TLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
            +D   G+GG IVDSGT  T +    +  L D FV Q   +   T       ++    C+
Sbjct: 373 AMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFV-QGAPSLPRT-----SGVSLFDTCY 426

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           D+    +   P + L F+GG  + LP +NY   V      CL       A      I+GN
Sbjct: 427 DLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGN 482

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q Q   V +D     +GF    C
Sbjct: 483 VQQQGTRVSFDTARGAVGFTPNKC 506


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 161/389 (41%), Gaps = 41/389 (10%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    G+P Q +   LDT +   W  C+    C  C SS +  F P  SSS   L C 
Sbjct: 79  YVVRAGLGSPSQQLLLALDTSADATWAHCS---PCGTCPSSSL--FAPANSSSYASLPCS 133

Query: 149 NPKCSWIHHESIQCRDCNDE---PLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
           +  C     ++        +   P AT   C    P     + + L      S+TL L  
Sbjct: 134 SSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALA-----SDTLRLGK 188

Query: 206 RIIPNFLVGCSVLSSRQPA------GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDD 256
             IPN+  GC V S   P       G+ G GRG  +L SQ   L    FSYCL S++   
Sbjct: 189 DAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYR--S 245

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
              + SL L  G          + YTP + NP    R++    YYV +  ++VG   V+V
Sbjct: 246 YYFSGSLRLGAGGGQPRS----VRYTPMLRNP---HRSSL---YYVNVTGLSVGHAWVKV 295

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                  D     GT+VDSGT  T     ++  L +EF  Q+     YT +LGA      
Sbjct: 296 PAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----F 349

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF+      G  P + +H  GG ++ LP+EN           CL +    +       
Sbjct: 350 DTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVN 409

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ++ N Q QN  V +D+ N R+GF ++ C 
Sbjct: 410 VIANLQQQNIRVVFDVANSRVGFAKESCN 438


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 127/444 (28%), Positives = 186/444 (41%), Gaps = 75/444 (16%)

Query: 38  TNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG---YSISLS 94
           T  + +S++ L+ L S    R+  +  PQ+ + +  +   T  +     GG   Y +  S
Sbjct: 50  TQAALESHRRLSFLAS----RSSQVDKPQSSSASQLSNNDTDTVPLRMDGGGGAYDMEFS 105

Query: 95  FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
            GTPPQ +  + DTGS L+W  C         +     S+ P  SS+   L C +  C+ 
Sbjct: 106 IGTPPQKLTALADTGSDLIWTKCDAG---GGAAWGGSSSYHPNASSTFTRLPCSDRLCAA 162

Query: 155 IHHESI-QCRDCNDEPLATSKNCTQICPSYLVLYGSG----LTEGIALSETLNLPNRIIP 209
           +   S+ +C        A    C      Y   YG G     T+G   SET  L    +P
Sbjct: 163 LRSYSLARCA-------AGGAEC-----DYKYAYGLGDDPDFTQGFLGSETFTLGGDAVP 210

Query: 210 NFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
               GC+        + AG+ G GRG  SL SQL+   F YCL +    D ++ S L+  
Sbjct: 211 GVGFGCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLTA----DASKASPLLFG 266

Query: 267 NGSSHSDK----KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
             ++ +      ++TGL               A + +Y V LR IT+G           T
Sbjct: 267 ALATMTGAGAGVQSTGLL--------------ASTTFYAVNLRSITIGSAT--------T 304

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
               G GG + DSGTT T++A   +      F+SQ       T     E   G   C++ 
Sbjct: 305 AGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQT------TSLTPVEGRYGFEACYEK 358

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNF 441
           P +     P + LHF GGA++ LPV NY   V +G  VC  V         PS+ I+GN 
Sbjct: 359 P-DSARLIPAMVLHFDGGADMALPVANYVVEVDDG-VVCWVVQRS------PSLSIIGNI 410

Query: 442 QMQNYYVEYDLRNQRLGFKQQLCK 465
              NY V +D+R   L F+   C 
Sbjct: 411 MQMNYLVLHDVRKSVLSFQPANCD 434


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 113/385 (29%), Positives = 159/385 (41%), Gaps = 46/385 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP     ++D+GS ++W  C     C+ C +   P F P  SSS   + 
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR---PCEQCYAQTDPLFDPAASSSFSGVS 184

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  +                 +  C      Y V YG G  T+G    ETL L  
Sbjct: 185 CGSAICRTLSGTGCGGG-------GDAGKC-----DYSVTYGDGSYTKGELALETLTLGG 232

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
             +    +GC   +S      AG+ G G G  SL  QL       FSYCL S        
Sbjct: 233 TAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRG---AGG 289

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
             SL+L      ++    G  + P V N      N  S +YYVGL  I VGG+R+ +   
Sbjct: 290 AGSLVL----GRTEAVPVGAVWVPLVRN------NQASSFYYVGLTGIGVGGERLPLQDS 339

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L  DG GG ++D+GT  T +  E +  L   F   M           + A++ L  C
Sbjct: 340 LFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPR------SPAVSLLDTC 393

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F  GA +TLP  N    VG G+  CL       +S G S ILG
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVG-GAVFCLAFA---PSSSGIS-ILG 448

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   +  D  N  +GF    C
Sbjct: 449 NIQQEGIQITVDSANGYVGFGPNTC 473


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 113/385 (29%), Positives = 159/385 (41%), Gaps = 46/385 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP     ++D+GS ++W  C     C+ C +   P F P  SSS   + 
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR---PCEQCYAQTDPLFDPAASSSFSGVS 184

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  +                 +  C      Y V YG G  T+G    ETL L  
Sbjct: 185 CGSAICRTLSGTGCGGG-------GDAGKC-----DYSVTYGDGSYTKGELALETLTLGG 232

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
             +    +GC   +S      AG+ G G G  SL  QL       FSYCL S        
Sbjct: 233 TAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRG---AGG 289

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
             SL+L      ++    G  + P V N      N  S +YYVGL  I VGG+R+ +   
Sbjct: 290 AGSLVL----GRTEAVPVGAVWVPLVRN------NQASSFYYVGLTGIGVGGERLPLQDG 339

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L  DG GG ++D+GT  T +  E +  L   F   M           + A++ L  C
Sbjct: 340 LFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPR------SPAVSLLDTC 393

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F  GA +TLP  N    VG G+  CL       +S G S ILG
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVG-GAVFCLAFA---PSSSGIS-ILG 448

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   +  D  N  +GF    C
Sbjct: 449 NIQQEGIQITVDSANGYVGFGPNTC 473


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 121/411 (29%), Positives = 183/411 (44%), Gaps = 61/411 (14%)

Query: 78  TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS---- 133
           ++ +S H     ++SL+ G+PPQ +  +LDTGS L W  C            K P+    
Sbjct: 45  SSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHC-----------KKAPNLHSV 93

Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE 193
           F P  SSS   + C +P C        + RD +   +  S +  ++C + +    +   E
Sbjct: 94  FDPLRSSSYSPIPCTSPTC------RTRTRDFS---IPVSCDKKKLCHAIISYADASSIE 144

Query: 194 GIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFS 246
           G   S+T ++ N  IP  + GC  S  SS      +  G+ G  RG  S  +Q+ L KFS
Sbjct: 145 GNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFS 204

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           YC+          +S ++L   SS S  K   L YTP V   S        V Y V L  
Sbjct: 205 YCISGQD------SSGILLFGESSFSWLK--ALKYTPLVQI-STPLPYFDRVAYTVQLEG 255

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVK 360
           I V    +++       D  G G T+VDSGT FTF+   ++  L +EFV Q      +++
Sbjct: 256 IKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLE 315

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYF-----AV 413
           + N+    GA  L     C+ VP  +      P + L F+ GAE+++  E         +
Sbjct: 316 DPNFVFQ-GAMDL-----CYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVI 368

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            G  S  C T   + E  G  S I+G+   QN ++E+DL   R+GF +  C
Sbjct: 369 RGSDSVYCFT-FGNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 418


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 119/392 (30%), Positives = 166/392 (42%), Gaps = 51/392 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   ++ GTP       LDT S L W  C     C+ C     P F P+ S+S R + 
Sbjct: 136 GEYIAKIAVGTPGVEALLALDTASDLTWLQCQ---PCRRCYPQSGPVFDPRHSTSYREM- 191

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
                       S    DC     +   +  +    Y V YG G  T G  + ETL    
Sbjct: 192 ------------SFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAG 239

Query: 206 RI-IPNFLVGCS----VLSSRQPAGIAGFGRGKTSLPSQLNLDK-FSYCLLSHKFDDTTR 259
            + +P   +GC      L     AGI G GRG  S P+Q++ +  FSYCL+       + 
Sbjct: 240 GVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLSGPGSL 299

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWH 318
           +S+L    G+  +      +++TP V N ++        +YYV L  I+VGG RV  V  
Sbjct: 300 SSTLTFGAGAVDTSPP---VSFTPTVLNLNM------PTFYYVRLTGISVGGVRVPGVTE 350

Query: 319 KYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG-- 375
           + L LD   G GG IVDSGT  T +A   +    D F +  V        LG  ++ G  
Sbjct: 351 RDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVD-------LGQVSIGGPS 403

Query: 376 --LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
                C+ V G      P + +HF G  EV L  +NY   V     VC        A+G 
Sbjct: 404 GFFDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFA----ATGD 459

Query: 434 PSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            S+ I+GN Q Q + + YD+   R+GF    C
Sbjct: 460 HSVSIIGNIQQQGFRIVYDI-GGRVGFAPNSC 490


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 121/411 (29%), Positives = 183/411 (44%), Gaps = 61/411 (14%)

Query: 78  TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS---- 133
           ++ +S H     ++SL+ G+PPQ +  +LDTGS L W  C            K P+    
Sbjct: 52  SSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHC-----------KKAPNLHSV 100

Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE 193
           F P  SSS   + C +P C        + RD +   +  S +  ++C + +    +   E
Sbjct: 101 FDPLRSSSYSPIPCTSPTC------RTRTRDFS---IPVSCDKKKLCHAIISYADASSIE 151

Query: 194 GIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFS 246
           G   S+T ++ N  IP  + GC  S  SS      +  G+ G  RG  S  +Q+ L KFS
Sbjct: 152 GNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFS 211

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           YC+          +S ++L   SS S  K   L YTP V   S        V Y V L  
Sbjct: 212 YCISGQD------SSGILLFGESSFSWLK--ALKYTPLVQI-STPLPYFDRVAYTVQLEG 262

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVK 360
           I V    +++       D  G G T+VDSGT FTF+   ++  L +EFV Q      +++
Sbjct: 263 IKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLE 322

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYF-----AV 413
           + N+    GA  L     C+ VP  +      P + L F+ GAE+++  E         +
Sbjct: 323 DPNFVFQ-GAMDL-----CYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVI 375

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            G  S  C T   + E  G  S I+G+   QN ++E+DL   R+GF +  C
Sbjct: 376 RGSDSVYCFT-FGNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 425


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 165/380 (43%), Gaps = 43/380 (11%)

Query: 94  SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
           S G+P   +  I+DTGS L W  C     C  C + + P F P  S++   + C    C+
Sbjct: 195 SSGSPAANLTVIVDTGSDLTWVQCK---PCSACYAQRDPLFDPAGSATYAAVRCNASACA 251

Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFL 212
                    +     P  +     + C  Y + YG G  + G+  ++T+ L    +  F+
Sbjct: 252 ------ASLKAATGTP-GSCGGGNERC-YYALAYGDGSFSRGVLATDTVALGGASLDGFV 303

Query: 213 VGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLIL 265
            GC  LS+R      AG+ G GR + SL SQ  L     FSYCL +    D + + SL  
Sbjct: 304 FGCG-LSNRGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSL-- 360

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
             G + S + TT + YT  + +P      A   +Y++ +    VGG         L    
Sbjct: 361 -GGDASSYRNTTPVAYTRMIADP------AQPPFYFLNVTGAAVGG-------TALAAQG 406

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
            G    ++DSGT  T +AP ++  +  EF  Q       T    A   + L  C+D+ G 
Sbjct: 407 LGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPT----APGFSILDTCYDLTGH 462

Query: 386 KTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
                P L L  +GGAEVT+      F V  +GS VCL + +       P  I+GN+Q +
Sbjct: 463 DEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTP--IIGNYQQK 520

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
           N  V YD    RLGF  + C
Sbjct: 521 NKRVVYDTVGSRLGFADEDC 540


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 132/428 (30%), Positives = 180/428 (42%), Gaps = 72/428 (16%)

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQ--------IIPFILD 107
           +T+A    +P+  T  T   T+         G Y   ++ GTP +        + P   D
Sbjct: 101 ITKAATPADPENGTVVTGAPTS---------GEYIAKITVGTPYENDSSFEALLSP---D 148

Query: 108 TGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCND 167
            GS + W  C   ++C +      P +    SSS+  +GC  P C               
Sbjct: 149 MGSDVTWLQCMPCFRCYH---QPGPVYNRLKSSSASDVGCYAPAC--------------- 190

Query: 168 EPLATSKNCTQI---CPSYLVLYGSGLTE-GIALSETLNLPNRI-IPNFLVGCSV----L 218
             L +S  C Q    C  Y V YG G +  G    ETL  P  + +P   +GC      L
Sbjct: 191 RALGSSGGCVQFLNEC-QYKVEYGDGSSSAGDFGVETLTFPPGVRVPGVAIGCGSDNQGL 249

Query: 219 SSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
                AGI G GRG  S PSQ+       FSYCL         R+S+L   +G+S +   
Sbjct: 250 FPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQG--TGGRSSTLTFGSGASATTTT 307

Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYLTLD-RDGNGGTIV 333
           TT  ++TP + N      +    +YYVGL  I+VGG RVR V    L LD   G+GG IV
Sbjct: 308 TTPPSFTPMLTN------SRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIV 361

Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD-VPGEKTGSFPE 392
           DSGT  T ++   +    D F    VK   +    G  A      C+  V G      P 
Sbjct: 362 DSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAF--FDTCYSSVRGRVMKKVPA 419

Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSA-VCLTVV--TDREASGGPSIILGNFQMQNYYVE 449
           + +HF GG EV LP +NY   V      +C       DR  S     I+GN Q+Q + V 
Sbjct: 420 VSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVS-----IIGNIQLQGFRVV 474

Query: 450 YDLRNQRL 457
           YD+  QR+
Sbjct: 475 YDVDGQRV 482


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 117/420 (27%), Positives = 176/420 (41%), Gaps = 53/420 (12%)

Query: 54  SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
           SSLT  L   NP  +    T   +     S   G Y +SL  GTPP+ +  + DTGS ++
Sbjct: 49  SSLTNPLKNTNPFLQQDFETPLRSGL---SDGSGEYFVSLGVGTPPRTVNMVADTGSDVL 105

Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
           W  C     C+ C     P F P  SS+ + + C +  C  +     +   C        
Sbjct: 106 WLQC---LPCQSCYGQTDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQC-------- 154

Query: 174 KNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGF 229
                    Y V YG G  T G   +ETL+  +  + +  +GC   +       AG+ G 
Sbjct: 155 --------LYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGL 206

Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
           G+G  S PSQ+       FSYCL +    ++T +  LI  N +  S+ +     +T  + 
Sbjct: 207 GKGLLSFPSQVGQLYGSVFSYCLPTR---ESTGSVPLIFGNQAVASNAQ-----FTTLLT 258

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPE 345
           NP +        +YYV +  I VGG  V +    L+LD   GNGG I+DSGT  T +   
Sbjct: 259 NPKL------DTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTRLVTS 312

Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
            + P+ D F + M  +   T        +    C+D+ G  +   P +   F GGA + L
Sbjct: 313 AYNPMRDAFRAGMPSDAKMT-----SGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMAL 367

Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           P +N    V      CL    + E       I+GN Q Q++ + +D    R+G     C 
Sbjct: 368 PAQNIMVPVDNSGTYCLAFAPNSENFS----IIGNIQQQSFRMSFDSTGNRVGIGANQCN 423


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 162/389 (41%), Gaps = 56/389 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTP Q++  +LDT     W PC +   C  CSS   P+F P  SS+   L 
Sbjct: 97  GNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCAD---CAGCSS---PTFSPNTSSTYASLQ 150

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE-TLNLPN 205
           C  P+C+ +    + C      P   +  C      +   YG   +    LS+ +L L  
Sbjct: 151 CSVPQCTQV--RGLSC------PTTGTAACF-----FNQTYGGDSSFSAMLSQDSLGLAV 197

Query: 206 RIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDD 256
             +P++  GC      S+  P G+ G GRG  SL SQ   L    FSYC  S K   F  
Sbjct: 198 DTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSG 257

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           + R   L           +   +  TP + NP    R      YYV L  ++VG   V V
Sbjct: 258 SLRLGPL----------GQPKNIRTTPLLRNP---HRPTL---YYVNLTGVSVGRVLVPV 301

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
             + L  D +   GTI+DSGT  T     ++  + DEF  Q+   +     +GA      
Sbjct: 302 APELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQV---KGPFATIGA-----F 353

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF    E     P +  HF G  ++ LP+EN       GS  CL +            
Sbjct: 354 DTCFAATNEDIA--PPVTFHFTG-MDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLN 410

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           ++ N Q QN  + +D+ N RLG  ++LC 
Sbjct: 411 VIANLQQQNLRIMFDVTNSRLGIARELCN 439


>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 452

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 109/404 (26%), Positives = 168/404 (41%), Gaps = 56/404 (13%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++ ++ GTPPQ +  +LDTGS L W  C          +S   S+ P        + C +
Sbjct: 64  TVPVAVGTPPQNVTMVLDTGSELSWLLCNGSRHDAPFDASASSSYAP--------VPCSS 115

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
           P C+W+       RD    P   S  C ++  SY     +   +G+  ++T  L +  +P
Sbjct: 116 PACTWLG------RDLPVRPFCDSSAC-RVSLSY---ADASSADGLLAADTFLLGSSPMP 165

Query: 210 NFLVGC-------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
             L GC       +  S   P G+ G  RG  S  +Q    +F+YC+ + +         
Sbjct: 166 A-LFGCITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTATRRFAYCIAAGQ------GPG 218

Query: 263 LILDNGSSHSDKKTT----GLTYTPFVNNPSVAERNAF--SVYYYVGLRRITVGGQRVRV 316
           ++L  G+      T+     L YTP V    +++   +     Y V L  I VG   + +
Sbjct: 219 ILLLGGNDTETPLTSPPQQQLNYTPLVE---ISQPLPYFDRAAYTVQLEGIRVGSALLAI 275

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
               LT D  G G T+VDSGT FTF+ P+ +  L  EF +Q+ ++ +   A   E     
Sbjct: 276 PKHLLTPDHTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVF 335

Query: 377 RPCFDV----------PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVC 421
           +  FD                G  PE+ L  +G   V    E     V     GEG  V 
Sbjct: 336 QGAFDACFRGTEARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVW 395

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                  + +G  + ++G+   Q+ +VEYDLRN RLGF    C 
Sbjct: 396 CLTFGSSDMAGVSAYVIGHHHQQDVWVEYDLRNARLGFAAARCA 439


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 117/420 (27%), Positives = 176/420 (41%), Gaps = 53/420 (12%)

Query: 54  SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
           SSLT  L   NP  +    T   +     S   G Y +SL  GTPP+ +  + DTGS ++
Sbjct: 49  SSLTNPLKNTNPFLQQDFETPLRSGL---SDGSGEYFVSLGVGTPPRTVNMVADTGSDVL 105

Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
           W  C     C+ C     P F P  SS+ + + C +  C  +     +   C        
Sbjct: 106 WLQC---LPCQSCYGQTDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQC-------- 154

Query: 174 KNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGF 229
                    Y V YG G  T G   +ETL+  +  + +  +GC   +       AG+ G 
Sbjct: 155 --------LYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGL 206

Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
           G+G  S PSQ+       FSYCL +    ++T +  LI  N +  S+ +     +T  + 
Sbjct: 207 GKGLLSFPSQVGQLYGSVFSYCLPTR---ESTGSVPLIFGNQAVASNAQ-----FTTLLT 258

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPE 345
           NP +        +YYV +  I VGG  V +    L+LD   GNGG I+DSGT  T +   
Sbjct: 259 NPKL------DTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTRLVTS 312

Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
            + P+ D F + M  +   T        +    C+D+ G  +   P +   F GGA + L
Sbjct: 313 AYNPMRDAFRAGMPSDAKMT-----SGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMAL 367

Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           P +N    V      CL    + E       I+GN Q Q++ + +D    R+G     C 
Sbjct: 368 PAQNIMVPVDNSGTYCLAFAPNSENFS----IIGNIQQQSFRMSFDSTGNRVGIGANQCN 423


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 112/395 (28%), Positives = 163/395 (41%), Gaps = 80/395 (20%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y  +++ G+PP+    ++DTGS L W       +C  CS        P  SS+   L 
Sbjct: 1   GVYYSTITLGSPPKDFSLVMDTGSDLTWV------RCDPCS--------PDCSSTFDRLA 46

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
                     ++++ C D                  Y   YG G  T+G    +TL +  
Sbjct: 47  SNT-------YKALTCAD-----------------DYSYGYGDGSFTQGDLSVDTLKMAG 82

Query: 206 RI------IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHK 253
                    P F+ GC  L         GI     G  S PSQ+     +KFSYCLL   
Sbjct: 83  AASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQT 142

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTG----LTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
             ++ + S ++    +    +  +G    L YTP   +         S+YY V L  I+V
Sbjct: 143 AQNSLKKSPMVFGEAAVELKEPGSGKLQELQYTPIGES---------SIYYTVRLDGISV 193

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           G QR+ +        +D    TI DSGTT T + P + + +     S MV    +     
Sbjct: 194 GNQRLDLSPSAFLNGQDKP--TIFDSGTTLTMLPPGVCDSIKQSLAS-MVSGAEFV---- 246

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
             A+ GL  CF VP       P++  HF GGA+      NY  V+  GS  CL  V   E
Sbjct: 247 --AIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNY--VIDLGSLQCLIFVPTNE 302

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            S     I GN Q Q+++V +D+ N+R+GFK+  C
Sbjct: 303 VS-----IFGNLQQQDFFVLHDMDNRRIGFKETDC 332


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 111/392 (28%), Positives = 160/392 (40%), Gaps = 58/392 (14%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           +S   G Y   +  G P   +  +LDTGS + W  C     C  C     P F P  S+S
Sbjct: 137 TSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCA---PCADCYHQADPIFEPASSTS 193

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              L C   +C     +S+   +C        +N T +   Y V YG G  T G  ++ET
Sbjct: 194 YSPLSCDTKQC-----QSLDVSEC--------RNNTCL---YEVSYGDGSYTVGDFVTET 237

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
           + L +  + N  +GC         G+        G G GK S PSQ+N   FSYCL+   
Sbjct: 238 ITLGSASVDNVAIGCG----HNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRD 293

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
            D     S+  L+  S+      T     P + N           +YYVG+  ++VGG+ 
Sbjct: 294 SD-----SASTLEFNSALLPHAITA----PLLRN------RELDTFYYVGMTGLSVGGEL 338

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + +      +D  GNGG I+DSGT  T +    +  L D FV         T+ L   + 
Sbjct: 339 LSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKG-------TKDLPVTSE 391

Query: 374 TGL-RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
             L   C+D+  + +   P +  H  GG  + LP  NY   V      C        A  
Sbjct: 392 VALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALS 451

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               I+GN Q Q   V +DL N  +GF+ + C
Sbjct: 452 ----IIGNVQQQGTRVGFDLANSLVGFEPRQC 479


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 122/386 (31%), Positives = 171/386 (44%), Gaps = 49/386 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP + +  +LDTGS +VW  C     C+ C +   P F P  S +   + 
Sbjct: 127 GEYFTRIGVGTPARYVYMVLDTGSDVVWLQCA---PCRKCYTQADPVFDPTKSRTYAGIP 183

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P C  +           D P   +KN  ++C  Y V YG G  T G   +ETL    
Sbjct: 184 CGAPLCRRL-----------DSPGCNNKN--KVC-QYQVSYGDGSFTFGDFSTETLTFRR 229

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
             +    +GC   +       AG+ G GRG+ S P Q       KFSYCL+      + +
Sbjct: 230 TRVTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRS--ASAK 287

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            SS++  + +     +     +TP + NP +        +YY+ L  I+VGG  VR    
Sbjct: 288 PSSVVFGDSAVSRTAR-----FTPLIKNPKL------DTFYYLELLGISVGGSPVRGLSA 336

Query: 320 YL-TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
            L  LD  GNGG I+DSGT+ T +    +  L D F    V   +  RA  AE  +    
Sbjct: 337 SLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAF---RVGASHLKRA--AE-FSLFDT 390

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CFD+ G      P + LHF+ GA+V+LP  NY   V    + C           G SII 
Sbjct: 391 CFDLSGLTEVKVPTVVLHFR-GADVSLPATNYLIPVDNSGSFCFAFAGTMS---GLSII- 445

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q Q + V +DL   R+GF  + C
Sbjct: 446 GNIQQQGFRVSFDLAGSRVGFAPRGC 471


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 135/474 (28%), Positives = 208/474 (43%), Gaps = 76/474 (16%)

Query: 13  IFFFTLLSI--FPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALHI 62
           + FF+L  I  F  S+ + +FS    H +        P+Q+ +Q++ +    S+ RA   
Sbjct: 9   LLFFSLCFIISFSHSLRN-SFSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRA--- 64

Query: 63  KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ 122
            N   K + + T  +T  ++    G Y ++ S GTPP  +  ++DTGS +VW  C     
Sbjct: 65  -NRLFKDSLSNTPESTVYVNG---GEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCK---P 117

Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS 182
           C+ C     P F P  SSS + + C +  C  + + S     CN +           C  
Sbjct: 118 CEQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVRYTS-----CNKQ---------NSCEY 163

Query: 183 YLVLYGSGLTEGIALSETLNLPNRI-----IPNFLVGCSV----LSSRQPAGIAGFGRGK 233
            +       ++G    ETL L +        P  ++GC      +   + +GI G G G 
Sbjct: 164 TINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGP 223

Query: 234 TSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
            SL +QL      KFSYCLL     D+ +TS L   + +  S     G+  TPFV     
Sbjct: 224 VSLTTQLKSSIGGKFSYCLLP-LLVDSNKTSKLNFGDAAVVSGD---GVVSTPFVKKDPQ 279

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
           A       +YY+ L   +VG +R+    ++  LD    G  I+DSGTT T +   ++  L
Sbjct: 280 A-------FYYLTLEAFSVGNKRI----EFEVLDDSEEGNIILDSGTTLTLLPSHVYTNL 328

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY 410
            +  V+Q+VK     R      L  L  C+ +  ++   FP +  HFK GA++ L   + 
Sbjct: 329 -ESAVAQLVK---LDRVDDPNQLLNL--CYSITSDQY-DFPIITAHFK-GADIKLNPIST 380

Query: 411 FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           FA V +G  VCL   + +    GP  I GN    N  V YDL+   + FK   C
Sbjct: 381 FAHVADG-VVCLAFTSSQT---GP--IFGNLAQLNLLVGYDLQQNIVSFKPSDC 428


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 122/397 (30%), Positives = 166/397 (41%), Gaps = 62/397 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +  +LDTGS L+W  C     C  C     P F P  SSS   + C 
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQCA---PCASCLPQPDPIFSPGASSSYEPMRCA 160

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETLNLP--- 204
              C+ I H S Q  D           CT     Y   YG G T  G+  +E        
Sbjct: 161 GELCNDILHHSCQRPD----------TCT-----YRYSYGDGTTTRGVYATERFTFSSSS 205

Query: 205 -----NRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
                 ++      GC  +   S    +GI GFGR   SL SQL + +FSYCL  +    
Sbjct: 206 SGGETTKLSAPLGFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCLTPYA--- 262

Query: 257 TTRTSSLILDN--GSSHSDKKTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGG 311
           + R S+L+  +  G  + D  T  +  T  +    NP+         +YYV    +TVG 
Sbjct: 263 SGRKSTLLFGSLRGGVY-DAATATVQTTRLLRSRQNPT---------FYYVPFTGVTVGA 312

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFT-FMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           +R+R+      L  DG+GG IVDSGT  T F AP L E +   F SQ+        + G 
Sbjct: 313 RRLRIPISAFALRPDGSGGAIVDSGTALTLFPAPVLAE-VVRAFRSQLRLPFAANGSSGP 371

Query: 371 EALTGLRPCFDVPGEKT---GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
           +   G+  CF     +       P +  H + GA++ LP  NY         +CL +   
Sbjct: 372 D--DGV--CFAAAASRVPRPAVVPRMVFHLQ-GADLDLPRRNYVLDDQRKGNLCLLLAD- 425

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              SG     +GNF  Q+  V YDL    L F    C
Sbjct: 426 ---SGDSGTTIGNFVQQDMRVLYDLEADTLSFAPAQC 459


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 122/394 (30%), Positives = 173/394 (43%), Gaps = 54/394 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP      +LDTGS +VW  C     C++C +     F P+ S S   + 
Sbjct: 126 GEYFAQVGVGTPATTALMVLDTGSDVVWLQCA---PCRHCYAQSGRVFDPRRSRSYAAVD 182

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P C  +  +S  C    +  L            Y V YG G +T G   SETL    
Sbjct: 183 CVAPICRRL--DSAGCDRRRNSCL------------YQVAYGDGSVTAGDFASETLTFAR 228

Query: 206 RI-IPNFLVGCSVLSSRQPAGIAG-----FGRGKTSLPSQLNLD---KFSYCLL---SHK 253
              +    +GC      +   IA       GRG+ S PSQ+       FSYCL+   S  
Sbjct: 229 GARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSV 286

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
              +TR+S++      + +     G ++TP   NP +A       +YYV L   +VGG R
Sbjct: 287 RPSSTRSSTVTF---GAGAVAAAAGASFTPMGRNPRMA------TFYYVHLLGFSVGGAR 337

Query: 314 VR-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           V+ V    L L+   G GG I+DSGT+ T +A  ++E + D F +  V  R     +   
Sbjct: 338 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLR-----VSPG 392

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREA 430
             +    C+++ G +    P + +H  GGA V LP ENY   V      C  +  TD   
Sbjct: 393 GFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD--- 449

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            GG SII GN Q Q + V +D   QR+GF  + C
Sbjct: 450 -GGVSII-GNIQQQGFRVVFDGDAQRVGFVPKSC 481


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 113/386 (29%), Positives = 166/386 (43%), Gaps = 51/386 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G P +    +LDTGS + W  C     C  C     P + P LSSS +L+G
Sbjct: 143 GEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCE---PCSDCYQQSDPIYNPALSSSYKLVG 199

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           CQ   C     + +    C       S+N + +   Y V YG G  T+G   +ETL L  
Sbjct: 200 CQANLC-----QQLDVSGC-------SRNGSCL---YQVSYGDGSYTQGNFATETLTLGG 244

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQL---NLDKFSYCLLSHKFDDTTR 259
             + N  +GC   +       AG+ G G G  S PSQL   N   FSYCL+     D+  
Sbjct: 245 APLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDR---DSES 301

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           +S+L     +  +     G    P + N      +    +YYV L  I+VGG+ + +   
Sbjct: 302 SSTLQFGRAAVPN-----GAVLAPMLKN------SRLDTFYYVSLSGISVGGKMLSISDS 350

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGLRP 378
              +D  GNGG IVDSGT  T +    ++ L D F       R  T+ L   + ++    
Sbjct: 351 VFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAF-------RAGTKNLPSTDGVSLFDT 403

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+D+  +++   P +  HF GG  ++LP +NY   V      C        +      I+
Sbjct: 404 CYDLSSKESVDVPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLS----IV 459

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q Q   V +D  N ++GF    C
Sbjct: 460 GNIQQQGIRVSFDRANNQVGFAVNKC 485


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 121/405 (29%), Positives = 179/405 (44%), Gaps = 54/405 (13%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS------FI 135
           + +  G YS++   GTP Q    + DTGS L W  C  H + + CS+ K         F 
Sbjct: 5   ADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFH 64

Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC-TQICP-SYLVLYGSGLTE 193
             LSSS + + C    C       I+  D     L +  NC T + P  Y   Y  G T 
Sbjct: 65  ANLSSSFKTIPCLTDMC------KIELMD-----LFSLTNCPTPLTPCGYDYRYSDGSTA 113

Query: 194 -GIALSETLNLPNR-----IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD 243
            G   +ET+ +  +      + N L+GCS      S +   G+ G G  K S   +    
Sbjct: 114 LGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEK 173

Query: 244 ---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK-TTGLTYTPFVNNPSVAERNAFSVY 299
              KFSYCL+ H    + +  S  L  GSS S +     +TYT  V    +   N+F   
Sbjct: 174 FGGKFSYCLVDHL---SHKNVSNYLTFGSSRSKEALLNNMTYTELV----LGMVNSF--- 223

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           Y V +  I++GG  +++  +    D  G GGTI+DSG++ TF+    ++P+       ++
Sbjct: 224 YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLL 281

Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
           K R     +G      L  CF+  G +    P L  HF  GAE   PV++Y     +G  
Sbjct: 282 KFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG-V 335

Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            CL  V+   A  G S++ GN   QN+  E+DL  ++LGF    C
Sbjct: 336 RCLGFVS--VAWPGTSVV-GNIMQQNHLWEFDLGLKKLGFAPSSC 377


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 170/371 (45%), Gaps = 51/371 (13%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           I+DTGS L W  C     C+ C + + P F P  S S + + C +  C  + + +     
Sbjct: 81  IVDTGSDLTWVQCQ---PCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSLQYATGNLGV 137

Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR-- 221
           C           T  C +Y+V YG G  T G    E LNL    + NF+ GC   +    
Sbjct: 138 CGSN--------TPTC-NYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIFGCGRNNKGLF 188

Query: 222 -QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTT 277
              +G+ G G+   SL SQ +      FSYCL +   D    + SLIL  G+S   K TT
Sbjct: 189 GGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAAD---ASGSLIL-GGNSSVYKNTT 244

Query: 278 GLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGT 337
            ++YT  + NP +        +Y++ L  I++GG  ++  +   +       G ++DSGT
Sbjct: 245 PISYTRMIANPQLP------TFYFLNLTGISIGGVALQAPNYRQS-------GILIDSGT 291

Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
             T + P ++  L  EF+ Q      ++    A   + L  CF++ G      P +++ F
Sbjct: 292 VITRLPPPVYRDLKAEFLKQ------FSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQF 345

Query: 398 KGGAEVTLPVENYFAVVG-EGSAVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLR 453
           +G AE+T+ V   F  V  + S VCL + +   D E       I+GN+Q +N  V Y+ +
Sbjct: 346 EGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIP-----IIGNYQQRNQRVIYNTK 400

Query: 454 NQRLGFKQQLC 464
             +LGF  + C
Sbjct: 401 ESKLGFAAEAC 411


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 116/391 (29%), Positives = 178/391 (45%), Gaps = 52/391 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y  S+  GTPP     ++DTGS +VW  C     C +C     P + P+ SS+     
Sbjct: 97  GEYFASVGVGTPPTPALLVIDTGSDVVWLQCK---PCVHCYRQLSPLYDPRGSSTYAQTP 153

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C  P+C          + C+     T+  C      Y ++YG +  T G   ++ L   N
Sbjct: 154 CSPPQCR-------NPQTCD----GTTGGC-----GYRIVYGDASSTSGNLATDRLVFSN 197

Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTT 258
              + N  +GC   +       AG+ G  RG  S  +Q+       F+YCL      D T
Sbjct: 198 DTSVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCL-----GDRT 252

Query: 259 RT--SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           R+  SS  L  G +  +  ++   +TP  +NP    R +    YYV +   +VGG+ V  
Sbjct: 253 RSGSSSSYLVFGRTAPEPPSS--VFTPLRSNP---RRPSL---YYVDMVGFSVGGEPVTG 304

Query: 317 W-HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
           + +  L+LD   G GG +VDSGT+ T  A + +  L D F ++  K     R +G   ++
Sbjct: 305 FSNASLSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVG--MRKVG-RGIS 361

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
               C+D+ G      P + LHF GGA+V LP ENY      G   C  +    EA+G  
Sbjct: 362 VFDACYDLRGVAVADAPGVVLHFAGGADVALPPENYLVPEESGRYHCFAL----EAAGHD 417

Query: 435 SI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + ++GN   Q + V +D+ N+R+GF+   C
Sbjct: 418 GLSVIGNVLQQRFRVVFDVENERVGFEPNGC 448


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 111/399 (27%), Positives = 163/399 (40%), Gaps = 55/399 (13%)

Query: 75  TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
           TT   + +S   G Y   +  GTP + +  +LDTGS + W  C     C  C     P F
Sbjct: 150 TTPVVSGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQC---LPCSECYQQSDPIF 206

Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTE 193
            P  SS+ + L C +PKC+ +   +  CR         S  C      Y V YG G  T 
Sbjct: 207 DPTSSSTFKSLTCSDPKCASLDVSA--CR---------SNKCL-----YQVSYGDGSFTV 250

Query: 194 GIALSETLNL-PNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKF 245
           G   ++T+    +  + +  +GC         G+        G G G  S+ +Q+    F
Sbjct: 251 GNYATDTVTFGESGKVNDVALGCG----HDNEGLFTGAAGLLGLGGGALSMTNQIKAKSF 306

Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
           SYCL+     D+ ++SSL       +S +   G    P + N      +    +YYVGL 
Sbjct: 307 SYCLVDR---DSAKSSSLDF-----NSVQIGAGDATAPLLRN------SKMDTFYYVGLS 352

Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
             +VGGQ+V +      +D  G GG I+D GT  T +  + +  L D FV      +   
Sbjct: 353 GFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKK-- 410

Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
              G   ++    C+D     T   P +  HF GG  + LP +NY   + +    C    
Sbjct: 411 ---GTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFA 467

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                S   SII GN Q Q   + YDL N  +G     C
Sbjct: 468 ---PTSSSLSII-GNVQQQGTRITYDLANNLIGLSANKC 502


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 122/394 (30%), Positives = 173/394 (43%), Gaps = 54/394 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP      +LDTGS +VW  C     C++C +     F P+ S S   + 
Sbjct: 120 GEYFAQVGVGTPATTALMVLDTGSDVVWLQCA---PCRHCYAQSGRVFDPRRSRSYAAVD 176

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P C  +  +S  C    +  L            Y V YG G +T G   SETL    
Sbjct: 177 CVAPICRRL--DSAGCDRRRNSCL------------YQVAYGDGSVTAGDFASETLTFAR 222

Query: 206 RI-IPNFLVGCSVLSSRQPAGIAG-----FGRGKTSLPSQLNLD---KFSYCLL---SHK 253
              +    +GC      +   IA       GRG+ S PSQ+       FSYCL+   S  
Sbjct: 223 GARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSV 280

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
              +TR+S++      + +     G ++TP   NP +A       +YYV L   +VGG R
Sbjct: 281 RPSSTRSSTVTF---GAGAVAAAAGASFTPMGRNPRMA------TFYYVHLLGFSVGGAR 331

Query: 314 VR-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           V+ V    L L+   G GG I+DSGT+ T +A  ++E + D F +  V  R     +   
Sbjct: 332 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLR-----VSPG 386

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREA 430
             +    C+++ G +    P + +H  GGA V LP ENY   V      C  +  TD   
Sbjct: 387 GFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD--- 443

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            GG SII GN Q Q + V +D   QR+GF  + C
Sbjct: 444 -GGVSII-GNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 176/384 (45%), Gaps = 47/384 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +++  G+    +  I+DTGS L W  C     C  C + + P F P  SSS + + C 
Sbjct: 65  YIVTMGLGSTNMTV--IIDTGSDLTWVQCE---PCMSCYNQQGPIFKPSTSSSYQSVSCN 119

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
           +  C  +   +     C   P      C     +Y+V YG G  T G    E L+     
Sbjct: 120 SSTCQSLQFATGNTGACGSNP----STC-----NYVVNYGDGSYTNGELGVEQLSFGGVS 170

Query: 208 IPNFLVGCSVLSSRQPAGIAGF---GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTS 261
           + +F+ GC   +     G++G    GR   SL SQ N      FSYCL +    ++  + 
Sbjct: 171 VSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT---ESGASG 227

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           SL++ N SS   K  T +TYT  + NP ++       +Y + L  I V G  ++V     
Sbjct: 228 SLVMGNESSVF-KNVTPITYTRMLPNPQLSN------FYILNLTGIDVDGVALQV----- 275

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
                GNGG ++DSGT  T +   +++ L   F+ Q      +T    A   + L  CF+
Sbjct: 276 --PSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQ------FTGFPSAPGFSILDTCFN 327

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGN 440
           + G    S P + +HF+G AE+ +     F VV E  S VCL + +  +A    + I+GN
Sbjct: 328 LTGYDEVSIPTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAY--DTAIIGN 385

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
           +Q +N  V YD +  ++GF ++ C
Sbjct: 386 YQQRNQRVIYDTKQSKVGFAEESC 409


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 121/405 (29%), Positives = 179/405 (44%), Gaps = 54/405 (13%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS------FI 135
           + +  G YS++   GTP Q    + DTGS L W  C  H + + CS+ K         F 
Sbjct: 76  ADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFH 135

Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC-TQICP-SYLVLYGSGLTE 193
             LSSS + + C    C       I+  D     L +  NC T + P  Y   Y  G T 
Sbjct: 136 ANLSSSFKTIPCLTDMC------KIELMD-----LFSLTNCPTPLTPCGYDYRYSDGSTA 184

Query: 194 -GIALSETLNLPNR-----IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD 243
            G   +ET+ +  +      + N L+GCS      S +   G+ G G  K S   +    
Sbjct: 185 LGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEK 244

Query: 244 ---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK-TTGLTYTPFVNNPSVAERNAFSVY 299
              KFSYCL+ H    + +  S  L  GSS S +     +TYT  V    +   N+F   
Sbjct: 245 FGGKFSYCLVDHL---SHKNVSNYLTFGSSRSKEALLNNMTYTELV----LGMVNSF--- 294

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           Y V +  I++GG  +++  +    D  G GGTI+DSG++ TF+    ++P+       ++
Sbjct: 295 YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLL 352

Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
           K R     +G      L  CF+  G +    P L  HF  GAE   PV++Y     +G  
Sbjct: 353 KFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG-V 406

Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            CL  V+   A  G S++ GN   QN+  E+DL  ++LGF    C
Sbjct: 407 RCLGFVS--VAWPGTSVV-GNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 110/374 (29%), Positives = 172/374 (45%), Gaps = 48/374 (12%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH-HESIQCR 163
           ++DT S L W  C     C+ C   + P F P  S S   + C +  C  +    +    
Sbjct: 134 VVDTASELTWVQCQ---PCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGTS 190

Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ 222
            C D+      N  Q   SY + Y  G  + G+   + L L  + I  F+ GC   +   
Sbjct: 191 PCADD------NEQQPACSYALSYRDGSYSRGVLARDKLRLAGQDIEGFVFGCGTSNQGA 244

Query: 223 P----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
           P    +G+ G GR   SL SQ  +D+F    SYCL      ++  + SL+L + SS + +
Sbjct: 245 PFGGTSGLMGLGRSHVSLVSQ-TMDQFGGVFSYCL---PMRESGSSGSLVLGDDSS-AYR 299

Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV-WHKYLTLDRDGNGGTIV 333
            +T + YT  V++    +      +Y++ L  ITVGGQ V   W           G  I+
Sbjct: 300 NSTPIVYTAMVSDSGPLQ----GPFYFLNLTGITVGGQEVESPWFS--------AGRVII 347

Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
           DSGT  T + P ++  +  EF+SQ+ +   Y +   A A + L  CF++ G K    P L
Sbjct: 348 DSGTIITTLVPSVYNAVRAEFLSQLAE---YPQ---APAFSILDTCFNLTGLKEVQVPSL 401

Query: 394 KLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
           K  F+G  EV +  +   YF V  + S VCL + + +  S   + I+GN+Q +N  V +D
Sbjct: 402 KFVFEGSVEVEVDSKGVLYF-VSSDASQVCLALASLK--SEYDTSIIGNYQQKNLRVIFD 458

Query: 452 LRNQRLGFKQQLCK 465
               ++GF Q+ C 
Sbjct: 459 TLGSQIGFAQETCD 472


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 161/382 (42%), Gaps = 51/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +   FGTPPQ +   LDT S   W PC+    C  CS+SK   F P  S+S R + C 
Sbjct: 97  YIVKAKFGTPPQTLLLALDTSSDAAWIPCSG---CVGCSTSK--PFAPIKSTSFRNVSCG 151

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P C  + + +     C                ++   YGS       + +TL L    I
Sbjct: 152 SPHCKQVPNPTCGGSAC----------------AFNFTYGSSSIAASVVQDTLTLATDPI 195

Query: 209 PNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQLNLDK--FSYCLLSHKFDDTTRTSS 262
           P +  GC    +  S+ Q   +       + L    NL K  FSYCL S  F     + S
Sbjct: 196 PGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPS--FKSINFSGS 253

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L  G  +  K+   + YTP + NP    R++    YYV L  I VG + V +    L 
Sbjct: 254 LRL--GPVYQPKR---IKYTPLLRNP---RRSSL---YYVNLVAIKVGRKIVDIPPAALA 302

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +A  ++  + +EF       R     L    L G   C++V
Sbjct: 303 FNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEF------RRRVGPKLPVTTLGGFDTCYNV 356

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F  G  VTLP +N       GS  CL +    +       ++ N Q
Sbjct: 357 PIV----VPTITFLF-SGMNVTLPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 411

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ V +D+ N R+G  ++LC
Sbjct: 412 QQNHRVLFDVPNSRIGIARELC 433


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 122/435 (28%), Positives = 181/435 (41%), Gaps = 55/435 (12%)

Query: 40  PSQDSYQN-LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           P Q+S+ N + ++ S    R  ++     + TT         +       Y + +  GTP
Sbjct: 50  PKQESWVNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQV--LKIANYVVRVKLGTP 107

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
            Q +  +LDT +   W PC+       C+     +F+P  S++   L C   +CS +   
Sbjct: 108 GQQMFMVLDTSNDAAWVPCSG------CTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGF 161

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSETLNLPNRIIPNFLVGC- 215
           S         P   S  C      +   YG  S LT  + + + + L N +IP F  GC 
Sbjct: 162 SC--------PATGSSACL-----FNQSYGGDSSLTATL-VQDAITLANDVIPGFTFGCI 207

Query: 216 SVLS--SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSS 270
           + +S  S  P G+ G GRG  SL SQ        FSYCL S  F     + SL L     
Sbjct: 208 NAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSLKLGPVGQ 265

Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
               +TT L   P  + PS+         YYV L  ++VG  +V +  + L  D +   G
Sbjct: 266 PKSIRTTPLLRNP--HRPSL---------YYVNLTGVSVGRIKVPIPSEQLVFDPNTGAG 314

Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
           TI+DSGT  T     ++  + DEF  Q+        +LGA        CF    E     
Sbjct: 315 TIIDSGTVITRFVQPVYFAIRDEFRKQV---NGPISSLGA-----FDTCFAATNEAEA-- 364

Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEY 450
           P + LHF+G   + LP+EN       GS  CL++            ++ N Q QN  + +
Sbjct: 365 PAITLHFEG-LNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMF 423

Query: 451 DLRNQRLGFKQQLCK 465
           D  N RLG  ++LC 
Sbjct: 424 DTTNSRLGIARELCN 438


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 109/384 (28%), Positives = 176/384 (45%), Gaps = 45/384 (11%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +++  G+    +  I+DTGS L W  C     C  C + + P F P  SSS + + C 
Sbjct: 65  YIVTMGLGSKNMTV--IIDTGSDLTWVQCE---PCMSCYNQQGPIFKPSTSSSYQSVSCN 119

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
           +  C  +   +     C     +T         +Y+V YG G  T G    E L+     
Sbjct: 120 SSTCQSLQFATGNTGACGSSNPSTC--------NYVVNYGDGSYTNGELGVEALSFGGVS 171

Query: 208 IPNFLVGCSVLSSRQPAGIAGF---GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTS 261
           + +F+ GC   +     G++G    GR   SL SQ N      FSYCL +    +   + 
Sbjct: 172 VSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT---EAGSSG 228

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           SL++ N SS   K    +TYT  ++NP ++       +Y + L  I VGG  ++    + 
Sbjct: 229 SLVMGNESSVF-KNANPITYTRMLSNPQLSN------FYILNLTGIDVGGVALKAPLSF- 280

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
                GNGG ++DSGT  T +   +++ L  EF+      + +T    A   + L  CF+
Sbjct: 281 -----GNGGILIDSGTVITRLPSSVYKALKAEFL------KKFTGFPSAPGFSILDTCFN 329

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGN 440
           + G    S P + L F+G A++ +     F VV E  S VCL + +  +A    + I+GN
Sbjct: 330 LTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAY--DTAIIGN 387

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
           +Q +N  V YD +  ++GF ++ C
Sbjct: 388 YQQRNQRVIYDTKQSKVGFAEEPC 411


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 122/407 (29%), Positives = 171/407 (42%), Gaps = 48/407 (11%)

Query: 67  TKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
           T  TT   +   + IS  S G Y + +  G+PP     ++D+GS ++W  C     C  C
Sbjct: 112 TTMTTEVGSEVVSGISEGS-GEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCR---PCAEC 167

Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
                P F P  S+S   + C +  C  +   S  C D        S  C      Y V 
Sbjct: 168 YQQADPLFDPAASASFTAVPCDSGVCRTLPGGSSGCAD--------SGAC-----RYQVS 214

Query: 187 YGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQL- 240
           YG G  T+G+   ETL   +   +    +GC   +       AG+ G G G  SL  QL 
Sbjct: 215 YGDGSYTQGVLAMETLTFGDSTPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLG 274

Query: 241 --NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
                 FSYCL S   D      SL+        D    G  + P + N   A++ +F  
Sbjct: 275 GAAGGAFSYCLASRGAD--AGAGSLVF----GRDDAMPVGAVWVPLLRN---AQQPSF-- 323

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
            YYVGL  + VGG+R+ +      L  DG GG ++D+GT  T + P+ +  L D F S +
Sbjct: 324 -YYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTI 382

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF-KGGAEVTLPVENYFAVVGEG 417
               +  RA G   L     C+D+ G  +   P + L+F + GA +TLP  N    +G G
Sbjct: 383 --GGDLPRAPGVSLLD---TCYDLSGYASVRVPTVALYFGRDGAALTLPARNLLVEMG-G 436

Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              CL       AS     ILGN Q Q   +  D  N  +GF    C
Sbjct: 437 GVYCLAFA----ASASGLSILGNIQQQGIQITVDSANGYVGFGPSTC 479


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 121/394 (30%), Positives = 173/394 (43%), Gaps = 54/394 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP      +LDTGS +VW  C     C++C +     F P+ S S   + 
Sbjct: 120 GEYFAQVGVGTPATTALMVLDTGSDVVWLQCA---PCRHCYAQSGRVFDPRRSRSYAAVD 176

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P C  +  +S  C    +  L            Y V YG G +T G   SETL    
Sbjct: 177 CVAPICRRL--DSAGCDRRRNSCL------------YQVAYGDGSVTAGDFASETLTFAR 222

Query: 206 RI-IPNFLVGCSVLSSRQPAGIAG-----FGRGKTSLPSQLNLD---KFSYCLL---SHK 253
              +    +GC      +   IA       GRG+ S P+Q+       FSYCL+   S  
Sbjct: 223 GARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSV 280

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
              +TR+S++      + +     G ++TP   NP +A       +YYV L   +VGG R
Sbjct: 281 RPSSTRSSTVTF---GAGAVAAAAGASFTPMGRNPRMA------TFYYVHLLGFSVGGAR 331

Query: 314 VR-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           V+ V    L L+   G GG I+DSGT+ T +A  ++E + D F +  V  R     +   
Sbjct: 332 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLR-----VSPG 386

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREA 430
             +    C+++ G +    P + +H  GGA V LP ENY   V      C  +  TD   
Sbjct: 387 GFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD--- 443

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            GG SII GN Q Q + V +D   QR+GF  + C
Sbjct: 444 -GGVSII-GNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 116/395 (29%), Positives = 173/395 (43%), Gaps = 60/395 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y I +S GTPP+ +  ++DTGS ++W  C     C  C       F P  SS+   LG
Sbjct: 35  GEYFIRVSVGTPPRGMYLVMDTGSDILWLQCA---PCVSCYHQCDEVFDPYKSSTYSTLG 91

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-------LTEGIALSE 199
           C + +C  +           D        C      Y V YG G        T+ ++L+ 
Sbjct: 92  CNSRQCLNL-----------DVGGCVGNKCL-----YQVDYGDGSFSTGEFATDAVSLNS 135

Query: 200 TLNLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHK 253
           T      ++    +GC   +       AG+ G G+G  S P+Q+N +   +FSYCL    
Sbjct: 136 TSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRD 195

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
            D T R SSLI  + +        G+ +TP  +N  V      S +YY+ +  I+VGG  
Sbjct: 196 TDSTER-SSLIFGDAA----VPPAGVRFTPQASNLRV------STFYYLKMTGISVGGSI 244

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF---VSQMVKNRNYTRALGA 370
           + +      LD  GNGG I+DSGT+ T +    +  L + F    S +V    ++     
Sbjct: 245 LTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSL---- 300

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
                   C+++    +   P + LHF+GGA++ LP  NY   V   S  CL        
Sbjct: 301 -----FDTCYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFA----G 351

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           + GPSII GN Q Q + V YD  + ++GF    C 
Sbjct: 352 TTGPSII-GNIQQQGFRVIYDNLHNQVGFVPSQCD 385


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 108/392 (27%), Positives = 172/392 (43%), Gaps = 54/392 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  S   G+PPQ    ++DTGS L+W  C      K C+   +P +   LS SS  +   
Sbjct: 86  YIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYY--NLSQSSTFVPV- 142

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-----SYLVLYGSGLTEGIALSETLNL 203
                           C D+    + N   +C      +++  YG+G   G   +E+   
Sbjct: 143 ---------------PCADKAGFCAANGVHLCGLDGSCTFIASYGAGRVIGSLGTESFAF 187

Query: 204 PNRIIPNFLVGCSVLSS------RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
            +    +   GC  L+          +G+ G GRG+ SL SQ+   +FSYCL  + F  +
Sbjct: 188 ESGTT-SLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPY-FHSS 245

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
             +S L +   +S      +     PFV +P   +   +S +YY+ L  ITVG  R+   
Sbjct: 246 GASSHLFVGASASLGGGGAS----MPFVKSP---KDYPYSTFYYLPLEGITVGKTRLPAV 298

Query: 318 HKYLTLDRD-----GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           +      R        GG I+D+G+  T +A   +E L +E  +Q+         + A  
Sbjct: 299 NSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNG----SLVPAPE 354

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
            +GL  C    G +    P L  HF GGA++ +P  +Y+A V + +A C+ ++      G
Sbjct: 355 DSGLELCVAREGFQK-VVPALVFHFGGGADMAVPAASYWAPV-DKAAACMMIL-----EG 407

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           G   I+GNFQ Q+ ++ YDLR  R  F+   C
Sbjct: 408 GYDSIIGNFQQQDMHLLYDLRRGRFSFQTADC 439


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 118/401 (29%), Positives = 166/401 (41%), Gaps = 55/401 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS-FIPKLSSSSRLL 145
           G Y + L  GTPPQ +  + DTGS LVW  C+    C+ C+     S F+ + S++    
Sbjct: 87  GQYFVDLRLGTPPQKLLLVADTGSDLVWVKCS---ACRNCTRHTPGSAFLARHSTTFSPN 143

Query: 146 GCQNPKCSWI----HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
            C +  C  +    HH       CN   L +       C  Y   YG G  T G    ET
Sbjct: 144 HCYDSACQLVPLPKHHR------CNHARLHSP------C-RYEYSYGDGSKTSGFFSKET 190

Query: 201 LNL-----PNRIIPNFLVGC---------SVLSSRQPAGIAGFGRGKTSLPSQLNL---D 243
             L         +     GC         S  S     G+ G GRG  SL SQL     +
Sbjct: 191 TTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGN 250

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
           KFSYCL+ H    +  TS L++ +  +        + +TP   NP          +YY+G
Sbjct: 251 KFSYCLMDHDISPSP-TSYLLIGSTQNDVAPGKRRMRFTPLHINPLSP------TFYYIG 303

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           +  ++V G ++ +      LD  GNGGTIVDSGTT TF+     EP   + ++  V  R 
Sbjct: 304 IESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLP----EPAYLQILT--VIKRR 357

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
                 AE   G   C +V   +    P+L     G +  + P  NYF    E    CL 
Sbjct: 358 VRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDE-DVKCLA 416

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +      SG    ++GN   Q + +E+D    RLGF +  C
Sbjct: 417 LQAVMTPSG--FSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 120/391 (30%), Positives = 167/391 (42%), Gaps = 62/391 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 178 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKL--FDPARSSTYANIS 235

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS      +  R C      +  NC      Y V YG G  + G    +TL L +
Sbjct: 236 CAAPACS-----DLDTRGC------SGGNCL-----YGVQYGDGSYSIGFFAMDTLTLSS 279

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
              +  F  GC   +     + AG+ G GRGKTSLP Q   DK    F++CL +      
Sbjct: 280 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 334

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             + +  LD G          LT TP +  N P+         +YYVG+  I VGGQ + 
Sbjct: 335 --SGTGYLDFGPGSPAAAGARLT-TPMLTDNGPT---------FYYVGMTGIRVGGQLLS 382

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +     T       GTIVDSGT  T + P  +  L   F S M   R Y +   A A++ 
Sbjct: 383 IPQSVFT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAMAA-RGYKK---APAVSL 433

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGG 433
           L  C+D  G    + P + L F+GGA + +      Y A V   S VCL    + +  GG
Sbjct: 434 LDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASV---SQVCLGFAANED--GG 488

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              I+GN Q++ + V YD+  + +GF    C
Sbjct: 489 DVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 120/396 (30%), Positives = 180/396 (45%), Gaps = 70/396 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK----YCSSSKIPSFIPKLSSSS 142
           G Y + L  G+PP+    ILDTGS L W       QCK    YC S   P F P  S++ 
Sbjct: 118 GNYYLKLGLGSPPKYYTMILDTGSSLSWL------QCKPCVVYCHSQVDPLFEPSASNTY 171

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL 201
           R L C + +CS +   ++      ++PL T+     +C  Y   YG +  + G    + L
Sbjct: 172 RPLYCSSSECSLLKAATL------NDPLCTASG---VC-VYTASYGDASYSMGYLSRDLL 221

Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
            L P++ +P+F  GC   +     + AGI G  R K S+ +QL+      FSYCL     
Sbjct: 222 TLTPSQTLPSFTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCL----- 276

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTY--TPFV---NNPSVAERNAFSVYYYVGLRRITV 309
              T TSS     G   S  K +  +Y  TP +    NPS+         Y++ L  ITV
Sbjct: 277 --PTSTSS----GGGFLSIGKISPSSYKFTPMIRNSQNPSL---------YFLRLAAITV 321

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
            G+ V V      +       TI+DSGT  T +   ++  L + FV  M  +R Y +   
Sbjct: 322 AGRPVGVAAAGYQVP------TIIDSGTVVTRLPISIYAALREAFVKIM--SRRYEQ--- 370

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
           A A + L  CF    +     PE+++ F+GGA+++L   N      +G A CL   +  +
Sbjct: 371 APAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEADKGIA-CLAFASSNQ 429

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            +     I+GN Q Q Y + YD+   ++GF    C+
Sbjct: 430 IA-----IIGNHQQQTYNIAYDVSASKIGFAPGGCR 460


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 161/378 (42%), Gaps = 53/378 (14%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
            +LDTGS +VW  C     C+ C     P F P+ SSS   +GC    C  +       R
Sbjct: 1   MVLDTGSDVVWVQCA---PCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLR 57

Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLSSR 221
                       C      Y V YG G +T G  ++ETL       +    +GC   +  
Sbjct: 58  ---------RGACM-----YQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEG 103

Query: 222 ---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH------KFDDTTRTSSLILDNGS 269
                AG+ G GRG  S P+Q++      FSYCL+            + R+S++    GS
Sbjct: 104 LFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGS 163

Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTLD-RDG 327
             +   +    +TP V NP +        +YYV L  I+VGG RV  V    L LD   G
Sbjct: 164 VGASSAS----FTPMVRNPRM------ETFYYVQLVGISVGGARVPGVAESDLRLDPSTG 213

Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT 387
            GG IVDSGT+ T +A   +  L D F +           L     +    C+D+ G + 
Sbjct: 214 RGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLR----LSPGGFSLFDTCYDLGGRRV 269

Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNY 446
              P + +HF GGAE  LP ENY   V      C     TD    GG SII GN Q Q +
Sbjct: 270 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD----GGVSII-GNIQQQGF 324

Query: 447 YVEYDLRNQRLGFKQQLC 464
            V +D   QR+GF  + C
Sbjct: 325 RVVFDGDGQRVGFAPKGC 342


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/381 (27%), Positives = 161/381 (42%), Gaps = 43/381 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
           G Y +S S GTPPQ++  +LD  S  VW  C+    C     +++  P F   LSS+ R 
Sbjct: 95  GMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIRE 154

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETL 201
           + C N  C  +  ++      +D P             Y  +YG G    T G+   +  
Sbjct: 155 VRCANRGCQRLVPQTCS---ADDSPCG-----------YSYVYGGGAANTTAGLLAVDAF 200

Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
                     + GC+V +     G+ G GRG+ S  SQL + +FSY L     DD     
Sbjct: 201 AFATVRADGVIFGCAVATEGDIGGVIGLGRGELSPVSQLQIGRFSYYLAP---DDAVDVG 257

Query: 262 SLI--LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           S I  LD+    + +          V+ P VA R + S+ YYV L  I V G+ + +   
Sbjct: 258 SFILFLDDAKPRTSRA---------VSTPLVASRASRSL-YYVELAGIRVDGEDLAIPRG 307

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L  DG+GG ++      TF+     +  A + V Q + ++   RA     L GL  C
Sbjct: 308 TFDLQADGSGGVVLSITIPVTFL-----DAGAYKVVRQAMASKIELRAADGSEL-GLDLC 361

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +      T   P + L F GGA + L + NYF +       CLT++      G    +LG
Sbjct: 362 YTSESLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGS---LLG 418

Query: 440 NFQMQNYYVEYDLRNQRLGFK 460
           +      ++ YD+   RL F+
Sbjct: 419 SLIQVGTHMIYDISGSRLVFE 439


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 115/387 (29%), Positives = 162/387 (41%), Gaps = 51/387 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + ++ GTP      ILDTGS L W  C     C  C     P + P  SS+   + 
Sbjct: 113 GEFLMKMAIGTPSLSFSAILDTGSDLTWTQCK---PCTDCYPQPTPIYDPSQSSTYSKVP 169

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C  +   S    +C                 YL  YG    T+GI   E+  L +
Sbjct: 170 CSSSMCQALPMYSCSGANCE----------------YLYSYGDQSSTQGILSYESFTLTS 213

Query: 206 RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLP----SQLNL---DKFSYCLLSHKFDDTT 258
           + +P+   GC   +        G   G    P    SQL     +KFSYCL+S   D  +
Sbjct: 214 QSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSIT-DSPS 272

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
           +TS L +         KT  L      + P V  R+    +YY+ L  I+VGGQ + +  
Sbjct: 273 KTSPLFI--------GKTASLNAKTVSSTPLVQSRSR-PTFYYLSLEGISVGGQLLDIAD 323

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
               L  DG GG I+DSGTT T++    ++ +    +S +    N  +  G+    GL  
Sbjct: 324 GTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSI----NLPQVDGSN--IGLDL 377

Query: 379 CFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
           CF+   G  T  FP +  HF+ GA+  LP ENY      G A CL ++     S     I
Sbjct: 378 CFEPQSGSSTSHFPTITFHFE-GADFNLPKENYIYTDSSGIA-CLAMLPSNGMS-----I 430

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            GN Q QNY + YD     L F   +C
Sbjct: 431 FGNIQQQNYQILYDNERNVLSFAPTVC 457


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 115/402 (28%), Positives = 168/402 (41%), Gaps = 74/402 (18%)

Query: 79  TNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKL 138
           T +S  + G Y  S++ G+PP+    ++DTGS L W       +C  CS        P  
Sbjct: 114 TPVSFTNGGVYYSSITLGSPPKDFSLVMDTGSDLTW------VRCDPCS--------PDC 159

Query: 139 SSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS 198
           SS+   L           ++++ C D  D  L          P  L L+      G +L 
Sbjct: 160 SSTFDRLASNT-------YKALTCAD--DLRL----------PVLLRLWRRLFHSGRSLR 200

Query: 199 ETLNLPNRI------IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFS 246
           +TL +           P F+ GC  L         GI     G  S PSQ+     +KFS
Sbjct: 201 DTLKMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFS 260

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG----LTYTPFVNNPSVAERNAFSVYYYV 302
           YCLL     ++ + S ++    +    +  +G    L YTP   +         S+YY V
Sbjct: 261 YCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPIGES---------SIYYTV 311

Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
            L  I+VG QR+ +        +D    TI DSGTT T +   + + +     S MV   
Sbjct: 312 RLDGISVGNQRLDLSPSTFLNGQDKP--TIFDSGTTLTMLPSGVCDSIKQSLAS-MVSGA 368

Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
            +       A+ GL  CF VP       P++  HF GGA+      NY  V+  GS  CL
Sbjct: 369 EFV------AIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNY--VIDLGSLQCL 420

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             V   E S     I GN Q Q+++V +D+ N+R+GFK+  C
Sbjct: 421 IFVPTNEVS-----IFGNLQQQDFFVLHDMDNRRIGFKETDC 457


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 113/405 (27%), Positives = 175/405 (43%), Gaps = 59/405 (14%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYCSSSKIPSFIPKLSSS 141
           H     ++SL+ GTPPQ +  +LDTGS L W  C  T  +Q          +F P  SSS
Sbjct: 80  HHNVSLTVSLTVGTPPQNVSMVLDTGSELSWLRCNKTQTFQT---------TFDPNRSSS 130

Query: 142 SRLLGCQNPKCSWIHHESIQCRD-CNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
              + C           S+ C D   D P+  S +  Q+C + L    +  +EG   S+T
Sbjct: 131 YSPVPC----------SSLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDT 180

Query: 201 LNLPNRIIPNFLVGC-------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHK 253
             + N  +P  + GC       +     +  G+ G  RG  S  SQ++  KFSYC+    
Sbjct: 181 FYIGNSDMPGTIFGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPKFSYCISDSD 240

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
           F      S ++L   ++ S      L YTP +   S        V Y V L  I V  + 
Sbjct: 241 F------SGVLLLGDANFS--WLMPLNYTPLIQI-STPLPYFDRVAYTVQLEGIKVSSKL 291

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNYTRA 367
           + +       D  G G T+VDSGT FTF+   ++  L +EF++Q      ++++ NY   
Sbjct: 292 LPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQ 351

Query: 368 LGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAV 420
            G +       C+ VP  +T     P + L F+ GAE+ +  +         V G  S  
Sbjct: 352 GGMDL------CYRVPLSQTSLPWLPTVSLMFR-GAEMKVSGDRLLYRVPGEVRGSDSVY 404

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           C T   + +     + ++G+   QN ++E+DL   R+GF Q  C 
Sbjct: 405 CFT-FGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQCD 448


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 107/381 (28%), Positives = 162/381 (42%), Gaps = 43/381 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
           G Y +S S GTPPQ++  +LD  S  VW  C+    C     +++  P F   LSS+ R 
Sbjct: 95  GMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIRE 154

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETL 201
           + C N  C  +  ++      +D P             Y  +YG G    T G+   +  
Sbjct: 155 VRCANRGCQRLVPQTCSA---DDSPCG-----------YSYVYGGGAANTTAGLLAVDAF 200

Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
                     + GC+V +     G+ G GRG+ SL SQL + +FSY L     DD     
Sbjct: 201 AFATVRADGVIFGCAVATEGDIGGVIGLGRGELSLVSQLQIGRFSYYLAP---DDAVDVG 257

Query: 262 SLI--LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           S I  LD+    + +          V+ P VA R + S+ YYV L  I V G+ + +   
Sbjct: 258 SFILFLDDAKPRTSRA---------VSTPLVANRASRSL-YYVELAGIRVDGEDLAIPRG 307

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L  DG+GG ++      TF+     +  A + V Q + ++   RA     L GL  C
Sbjct: 308 TFDLQADGSGGVVLSITIPVTFL-----DAGAYKVVRQAMASKIGLRAADGSEL-GLDLC 361

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +      T   P + L F GGA + L + NYF +       CLT++      G    +LG
Sbjct: 362 YTSESLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGS---LLG 418

Query: 440 NFQMQNYYVEYDLRNQRLGFK 460
           +      ++ YD+   RL F+
Sbjct: 419 SLIQVGTHMIYDISGSRLVFE 439


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 117/413 (28%), Positives = 179/413 (43%), Gaps = 58/413 (14%)

Query: 72  TTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI 131
           T T T +  +S H     ++SL+ G+PPQ +  +LDTGS L W  C            K+
Sbjct: 43  TQTQTPSRKLSFHHNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHC-----------KKL 91

Query: 132 P----SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY 187
           P    +F P LSSS     C +  C      + + RD    P +   N  ++C   +   
Sbjct: 92  PNLNSTFNPLLSSSYTPTPCNSSIC------TTRTRDLT-IPASCDPN-NKLCHVIVSYA 143

Query: 188 GSGLTEGIALSETLNLPNRIIPNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQ 239
            +   EG   +ET +L     P  L GC         +    +  G+ G  RG  SL +Q
Sbjct: 144 DASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQ 203

Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
           ++L KFSYC+             L+L +G+       + L YTP V   + +      V 
Sbjct: 204 MSLPKFSYCISGED-----ALGVLLLGDGT----DAPSPLQYTPLV-TATTSSPYFNRVA 253

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM- 358
           Y V L  I V  + +++       D  G G T+VDSGT FTF+   ++  L DEF+ Q  
Sbjct: 254 YTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTK 313

Query: 359 -----VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
                +++ N+    GA  L     C+  P     + P + L F  GAE+ +  E     
Sbjct: 314 GVLTRIEDPNFVFE-GAMDL-----CYHAPA-SFAAVPAVTLVFS-GAEMRVSGERLLYR 365

Query: 414 VGEGS--AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           V +GS    C T   + +  G  + ++G+   QN ++E+DL   R+GF Q  C
Sbjct: 366 VSKGSDWVYCFT-FGNSDLLGIEAYVIGHHHQQNVWMEFDLLKSRVGFTQTTC 417


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 111/367 (30%), Positives = 156/367 (42%), Gaps = 47/367 (12%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
            +LDTGS + W  C     C  C     P F P LS+S   + C          +S +CR
Sbjct: 1   MVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSASYAAVSC----------DSQRCR 47

Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLSSR 221
           D +    A  +N T  C  Y V YG G  T G   +ETL L +   + N  +GC   +  
Sbjct: 48  DLD---TAACRNATGAC-LYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEG 103

Query: 222 ---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
                AG+   G G  S PSQ++   FSYCL+     D+   S+L   +G++ +     G
Sbjct: 104 LFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAASTLQFGDGAAEA-----G 155

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR-DGNGGTIVDSGT 337
               P V +P        S +YYV L  I+VGGQ + +      +D   G+GG IVDSGT
Sbjct: 156 TVTAPLVRSPRT------STFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGT 209

Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
             T +    +  L D FV Q   +   T       ++    C+D+    +   P + L F
Sbjct: 210 AVTRLQSAAYAALRDAFV-QGAPSLPRT-----SGVSLFDTCYDLSDRTSVEVPAVSLRF 263

Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
           +GG  + LP +NY   V      CL       A      I+GN Q Q   V +D     +
Sbjct: 264 EGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGNVQQQGTRVSFDTARGAV 319

Query: 458 GFKQQLC 464
           GF    C
Sbjct: 320 GFTPNKC 326


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 115/390 (29%), Positives = 168/390 (43%), Gaps = 61/390 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y      GTP Q +   +D  +   W PC           ++ PSF P  SS+ R + C 
Sbjct: 107 YVARARLGTPAQALLVAIDPSNDAAWVPCAACA-----GCARAPSFDPTRSSTYRPVRCG 161

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI- 207
            P+CS     S         P     +C     ++ + Y +   + +   + L L + + 
Sbjct: 162 APQCSQAPAPSC--------PGGLGSSC-----AFNLSYAASTFQALLGQDALALHDDVD 208

Query: 208 -IPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
            +  +  GC  +    S  P G+ GFGRG  S PSQ        FSYCL S+K  + + T
Sbjct: 209 AVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFSGT 268

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
             L    G +   K+   +  TP ++NP    R +    YYV +  I VGG+ V V    
Sbjct: 269 LRL----GPAGQPKR---IKTTPLLSNP---HRPSL---YYVNMVGIRVGGRPVPVPASA 315

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           L  D     GTIVD+GT FT ++  ++  + D F       R+  RA  A  L G   C+
Sbjct: 316 LAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVF-------RSRVRAPVAGPLGGFDTCY 368

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI---- 436
           +V    T S P +   F G   VTLP EN       G   CL +     A+G P      
Sbjct: 369 NV----TISVPTVTFSFDGRVSVTLPEENVVIRSSSGGIACLAM-----AAGPPDGVDAA 419

Query: 437 --ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +L + Q QN+ V +D+ N R+GF ++LC
Sbjct: 420 LNVLASMQQQNHRVLFDVANGRVGFSRELC 449


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 120/400 (30%), Positives = 162/400 (40%), Gaps = 59/400 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   ++ GTP       LDT S L W  C     C+ C     P F P+ S+S   + 
Sbjct: 132 GEYMAKIAVGTPAVQALLALDTASDLTWLQCQ---PCRRCYPQSGPVFDPRHSTSYGEMN 188

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALSETL 201
              P C  +                 +K  T I   Y V YG G      + G  + ETL
Sbjct: 189 YDAPDCQALGRSG----------GGDAKRGTCI---YTVQYGDGHGSTSTSVGDLVEETL 235

Query: 202 NLPNRIIPNFL-VGCS----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
                +   +L +GC      L     AGI G GRG+ S+P Q+        FSYCL+  
Sbjct: 236 TFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDF 295

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
                + +S+L    G+          T  P    P+V  +N    +YYV L  ++VGG 
Sbjct: 296 ISGPGSPSSTLTFGAGAVD--------TSPPASFTPTVLNQN-MPTFYYVRLIGVSVGGV 346

Query: 313 RV-RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           RV  V  + L LD   G GG I+DSGTT T +A          +V+     R    +LG 
Sbjct: 347 RVPGVTERDLQLDPYTGRGGVILDSGTTVTRLA-------RPAYVAFRDAFRAAATSLGQ 399

Query: 371 EALTG----LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV- 425
            +  G       C+ V G      P + +HF GG EV+L  +NY   V     VC     
Sbjct: 400 VSTGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAG 459

Query: 426 -TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             DR  S     ++GN   Q + V YDL  QR+GF    C
Sbjct: 460 TGDRSVS-----VIGNILQQGFRVVYDLAGQRVGFAPNNC 494


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 118/413 (28%), Positives = 169/413 (40%), Gaps = 63/413 (15%)

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
            I +PQ  +T  T+ T      S   G Y + +  G P +    ++DTGS + W  C   
Sbjct: 138 EILHPQDFSTPVTSGT------SQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCK-- 189

Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
             C  C     P F P  SSS   LGCQ P+C  +  +   CR  ND  L          
Sbjct: 190 -PCDDCYQQVDPIFDPASSSSFSRLGCQTPQCRNL--DVFACR--NDSCL---------- 234

Query: 181 PSYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGR 231
             Y V YG G  T G   +ET++  N   +    +GC         G+        G G 
Sbjct: 235 --YQVSYGDGSYTVGDFATETVSFGNSGSVDKVAIGCG----HDNEGLFVGAAGLIGLGG 288

Query: 232 GKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
           G  SL SQ+    FSYCL++    D+  +S+L   N +  SD  T      P   N  V 
Sbjct: 289 GPLSLTSQIKASSFSYCLVNR---DSVDSSTLEF-NSAKPSDSVTA-----PIFKNSKV- 338

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
                  +YYVG+  ++VGG+++ +      +D  G GG IVD GT  T +  + +  L 
Sbjct: 339 -----DTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQTQAYNALR 393

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
           D FV ++ K+   T             C+++    +   P +   F GG  + LP  NY 
Sbjct: 394 DTFV-KLTKDLPSTSGFAL-----FDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPPSNYL 447

Query: 412 AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             V      CL       +      I+GN Q Q   V YDL N ++ F  + C
Sbjct: 448 IPVDSAGTFCLAFAPTTASLS----IIGNVQQQGTRVTYDLANSQVSFSSRKC 496


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 120/431 (27%), Positives = 188/431 (43%), Gaps = 57/431 (13%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTK- 68
           LS   F  LL I P    S+  +   F             SL+ ++ +R L +   +++ 
Sbjct: 15  LSLPVFAVLLLISPVVAVSIGDADVGFRA-----------SLIRTAESRNLSLAAERSRR 63

Query: 69  --TTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
             +  T+ T T   ++    GG Y +  S G PP +I   +DTGS L+W  C+    C  
Sbjct: 64  RLSVYTSGTGTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCS---PCNG 120

Query: 126 CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLV 185
           C+    P + P  S SS  L C +  C  +    I    C+D+P         +C  Y  
Sbjct: 121 CNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQCSDDP--------PLC-GYHY 171

Query: 186 LYG-SG--LTEGIALSETLNLPNRIIPN---FLVGCSVLSSR--QPAGIAGFGRGKTSLP 237
            YG SG   T+G+  +ET    +  + N   F    ++  S+    AG+ G GRG  SL 
Sbjct: 172 AYGHSGDHSTQGVLGTETFTFGDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLV 231

Query: 238 SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
           SQL   +F+YCL +          S IL    +  D     ++ TP V NP   +R+   
Sbjct: 232 SQLGAGRFAYCLAADP-----NVYSTILFGSLAALDTSAGDVSSTPLVTNPK-PDRD--- 282

Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
            +YYV L+ I+VGG R+ +      ++ DG+GG   DSG   T +    ++ +     S+
Sbjct: 283 THYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSE 342

Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKT-GSFPELKLHFKGGAEVTLPVENYFAVVGE 416
           + +       LG +A  G   CF    ++     P L LHF  GA+++L   NY     +
Sbjct: 343 IQR-------LGYDA--GDDTCFVAANQQAVAQMPPLVLHFDDGADMSLNGRNYLKTSTK 393

Query: 417 GSA---VCLTV 424
           G +   VC+ +
Sbjct: 394 GPSEVLVCMAI 404


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 160/382 (41%), Gaps = 51/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +   FGTPPQ +   LDT S   W PC+    C  CS+SK   F P  S+S R + C 
Sbjct: 97  YIVKAKFGTPPQTLLLALDTSSDAAWIPCSG---CVGCSTSK--PFAPIKSTSFRNVSCG 151

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P C  + + +     C                ++   YGS       + +TL L    I
Sbjct: 152 SPHCKQVPNPTCGGSAC----------------AFNFTYGSSSIAASVVQDTLTLAADPI 195

Query: 209 PNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQLNLDK--FSYCLLSHKFDDTTRTSS 262
           P +  GC    +  S+ Q   +       + L    NL K  FSYCL S  F     + S
Sbjct: 196 PGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPS--FKSINFSGS 253

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L  G  +  K+   + YTP + NP    R++    YYV L  I VG + V +    L 
Sbjct: 254 LRL--GPVYQPKR---IKYTPLLRNP---RRSSL---YYVNLVAIKVGRKIVDIPPAALA 302

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +A  ++  + +EF       R     L    L G   C++V
Sbjct: 303 FNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEF------RRRVGPKLPVTTLGGFDTCYNV 356

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F  G  V LP +N       GS  CL +    +       ++ N Q
Sbjct: 357 PIV----VPTITFLF-SGMNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 411

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ V +D+ N R+G  ++LC
Sbjct: 412 QQNHRVLFDVPNSRIGIARELC 433


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 120/405 (29%), Positives = 178/405 (43%), Gaps = 54/405 (13%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS------FI 135
           + +  G Y ++   GTP Q    + DTGS L W  C  H + + CS+ K         F 
Sbjct: 76  ADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFH 135

Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC-TQICP-SYLVLYGSGLTE 193
             LSSS + + C    C       I+  D     L +  NC T + P  Y   Y  G T 
Sbjct: 136 ANLSSSFKTIPCLTDMC------KIELMD-----LFSLTNCPTPLTPCGYDYRYSDGSTA 184

Query: 194 -GIALSETLNLPNR-----IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD 243
            G   +ET+ +  +      + N L+GCS      S +   G+ G G  K S   +    
Sbjct: 185 LGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEK 244

Query: 244 ---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK-TTGLTYTPFVNNPSVAERNAFSVY 299
              KFSYCL+ H    + +  S  L  GSS S +     +TYT  V    +   N+F   
Sbjct: 245 FGGKFSYCLVDHL---SHKNVSNYLTFGSSRSKEALLNNMTYTELV----LGMVNSF--- 294

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           Y V +  I++GG  +++  +    D  G GGTI+DSG++ TF+    ++P+       ++
Sbjct: 295 YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLL 352

Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
           K R     +G      L  CF+  G +    P L  HF  GAE   PV++Y     +G  
Sbjct: 353 KFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG-V 406

Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            CL  V+   A  G S++ GN   QN+  E+DL  ++LGF    C
Sbjct: 407 RCLGFVS--VAWPGTSVV-GNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 122/388 (31%), Positives = 169/388 (43%), Gaps = 52/388 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++  FGTP +    I+DTGS L W  C     C  C S     F PK SSS + L 
Sbjct: 135 GNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCK---PCADCYSQVDAIFEPKQSSSYKTLP 191

Query: 147 CQNPKCS-WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP 204
           C +  C+  I  ES      N  P      C      Y + YG G  ++G    ETL L 
Sbjct: 192 CLSATCTELITSES------NPTP------CLLGGCVYEINYGDGSSSQGDFSQETLTLG 239

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
           +    NF  GC   ++   +  +G+ G G+   S PSQ       +F+YCL    F  +T
Sbjct: 240 SDSFQNFAFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCL--PDFGSST 297

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            T S  +  GS  +        +TP V+N        +  +Y+VGL  I+VGG R+ +  
Sbjct: 298 STGSFSVGKGSIPASA-----VFTPLVSN------FMYPTFYFVGLNGISVGGDRLSIPP 346

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGLR 377
             L     G G TIVDSGT  T + P+ +  L   F       R+ TR L  A+  + L 
Sbjct: 347 AVL-----GRGSTIVDSGTVITRLLPQAYNALKTSF-------RSKTRDLPSAKPFSILD 394

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
            C+D+        P +  HF+  A+V +  V     V   GS VCL   +  +  G    
Sbjct: 395 TCYDLSRHSQVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDG--FN 452

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GNFQ Q   V +D    R+GF    C
Sbjct: 453 IIGNFQQQRMRVAFDTGAGRIGFASGSC 480


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  122 bits (305), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 112/383 (29%), Positives = 171/383 (44%), Gaps = 38/383 (9%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +SL  GTPPQ    ILDTGS L W  C      K   S+    F P LSSS  +L C +P
Sbjct: 79  VSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPST---VFDPSLSSSFSVLPCNHP 135

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP-NRII 208
            C              D  L TS +  ++C  Y   Y  G L EG  + E +    ++  
Sbjct: 136 LCK---------PRIPDFTLPTSCDLNRLC-HYSYFYADGTLAEGNLVREKITFSTSQST 185

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT-TRTSSLIL-D 266
           P  ++GC+  +S    GI G   G+ S  SQ  + KFSYC+ + +     T T S  L +
Sbjct: 186 PPLILGCAEDASDD-KGILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTGSFYLGE 244

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
           N +S   +  + LT++     P     N   + + V L+ I +G +++ +       D  
Sbjct: 245 NPNSAGFQYISLLTFSQSQRMP-----NLDPLAHTVALQGIRIGNKKLNIPVSAFRADPS 299

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVS----QMVKNRNYTRALGAEALTGLRPCFDV 382
           G G +++DSG+ FT++    +  + +E V     ++ K   Y+   G   +     CFD 
Sbjct: 300 GAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYS---GVSDM-----CFDG 351

Query: 383 PGEKTGSF-PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
              + G     +   F  G E+ +      A VG G  V    +   E  G  S I+GNF
Sbjct: 352 NAMEIGRLIGNMVFEFDKGVEIVIEKGRVLADVGGG--VHCVGIGRSEMLGAASNIIGNF 409

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
             QN +VE+D+ N+R+GF +  C
Sbjct: 410 HQQNLWVEFDIANRRVGFGKADC 432


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 110/372 (29%), Positives = 172/372 (46%), Gaps = 52/372 (13%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           I+DT S L W  C     C  C   + P F P  S S  +L C +  C     +++Q   
Sbjct: 141 IVDTASELTWVQCA---PCASCHDQQGPLFDPASSPSYAVLPCNSSSC-----DALQVAT 192

Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
            +           Q   SY + Y  G  ++G+   + L+L   +I  F+ GC   S++ P
Sbjct: 193 GSAAGACGGGE--QPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT-SNQGP 249

Query: 224 ----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
               +G+ G GR + SL SQ  +D+F    SYCL      ++  + SL+L + +S   + 
Sbjct: 250 FGGTSGLMGLGRSQLSLISQ-TMDQFGGVFSYCL---PLKESESSGSLVLGDDTSVY-RN 304

Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
           +T + YT  V++P          +Y+V L  IT+GGQ V              G  IVDS
Sbjct: 305 STPIVYTTMVSDPVQGP------FYFVNLTGITIGGQEVE----------SSAGKVIVDS 348

Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKL 395
           GT  T + P ++  +  EF+SQ  +   Y +A G    + L  CF++ G +    P LK 
Sbjct: 349 GTIITSLVPSVYNAVKAEFLSQFAE---YPQAPG---FSILDTCFNLTGFREVQIPSLKF 402

Query: 396 HFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
            F+G  EV +      YF V  + S VCL + + +  S   + I+GN+Q +N  V +D  
Sbjct: 403 VFEGNVEVEVDSSGVLYF-VSSDSSQVCLALASLK--SEYETSIIGNYQQKNLRVIFDTL 459

Query: 454 NQRLGFKQQLCK 465
             ++GF Q+ C 
Sbjct: 460 GSQIGFAQETCD 471


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 165/382 (43%), Gaps = 52/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTPPQ +   +DT +   W PCT    C  C+S+    F P+ S++ + + C 
Sbjct: 78  YIVRAKIGTPPQTLLLAMDTSNDAAWIPCT---ACDGCASTL---FAPEKSTTFKNVSCA 131

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P+C  + +       CN                + + YGS       + +T+ L    +
Sbjct: 132 APECKQVPNPGCGVSSCN----------------FNLTYGSSSIAANLVQDTITLATDPV 175

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
           P++  GC   +  +S  P G+ G GRG  SL SQ   L    FSYCL S  F     + S
Sbjct: 176 PSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGS 233

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L  G     K+   + YTP + NP    R++    YYV L  I VG + V +    L 
Sbjct: 234 LRL--GPVAQPKR---IKYTPLLKNP---RRSSL---YYVNLEAIRVGRKVVDIPPAALA 282

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +   ++  + DEF       R     L   +L G   C++V
Sbjct: 283 FNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEF------RRRVGPKLTVTSLGGFDTCYNV 336

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F G   VTLP +N       GS  CL +    +       ++ N Q
Sbjct: 337 PI----VVPTITFIFTG-MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 391

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ V YD+ N R+G  ++LC
Sbjct: 392 QQNHRVLYDVPNSRVGVARELC 413


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 122/414 (29%), Positives = 179/414 (43%), Gaps = 66/414 (15%)

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
           HI +     +      T +  +  S  G+S+++    P ++I   +DTGS L+W      
Sbjct: 15  HIISHSRNVSAALVVRTPSRRTDGSDQGHSLTVGIVQPRKLI---VDTGSDLIW------ 65

Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
            QCK  SS+         ++++R            H      R       A ++ CT   
Sbjct: 66  TQCKLSSST---------AAAAR------------HGSPPLSRTAPARTGAFTRTCTASA 104

Query: 181 PSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLP 237
            +  VL     T G   + +L L       F  GC  LS+       GI G      SL 
Sbjct: 105 AAVGVLASETFTFGARRAVSLRL------GF--GCGALSAGSLIGATGILGLSPESLSLI 156

Query: 238 SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAF 296
           +QL + +FSYCL    F D  +TS L+    +  S  KTT  +  T  V+NP        
Sbjct: 157 TQLKIQRFSYCLT--PFADK-KTSPLLFGAMADLSRHKTTRPIQTTAIVSNP------VE 207

Query: 297 SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS 356
           +VYYYV L  I++G +R+ V    L +  DG GGTIVDSG+T  ++    FE +  E V 
Sbjct: 208 TVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVM 266

Query: 357 QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENY 410
            +V+     R +    L     CF +P     +       P L LHF GGA + LP +NY
Sbjct: 267 DVVRLPVANRTVEDYEL-----CFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNY 321

Query: 411 FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           F     G  +CL V    + SG    I+GN Q QN +V +D+++ +  F    C
Sbjct: 322 FQEPRAG-LMCLAVGKTTDGSG--VSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 110/372 (29%), Positives = 172/372 (46%), Gaps = 52/372 (13%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           I+DT S L W  C     C  C   + P F P  S S  +L C +  C     +++Q   
Sbjct: 140 IVDTASELTWVQCA---PCASCHDQQGPLFDPASSPSYAVLPCNSSSC-----DALQVAT 191

Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
            +           Q   SY + Y  G  ++G+   + L+L   +I  F+ GC   S++ P
Sbjct: 192 GSAAGACGGGE--QPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT-SNQGP 248

Query: 224 ----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
               +G+ G GR + SL SQ  +D+F    SYCL      ++  + SL+L + +S   + 
Sbjct: 249 FGGTSGLMGLGRSQLSLISQ-TMDQFGGVFSYCL---PLKESESSGSLVLGDDTSVY-RN 303

Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
           +T + YT  V++P          +Y+V L  IT+GGQ V              G  IVDS
Sbjct: 304 STPIVYTTMVSDPVQGP------FYFVNLTGITIGGQEVE----------SSAGKVIVDS 347

Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKL 395
           GT  T + P ++  +  EF+SQ  +   Y +A G    + L  CF++ G +    P LK 
Sbjct: 348 GTIITSLVPSVYNAVKAEFLSQFAE---YPQAPG---FSILDTCFNLTGFREVQIPSLKF 401

Query: 396 HFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
            F+G  EV +      YF V  + S VCL + + +  S   + I+GN+Q +N  V +D  
Sbjct: 402 VFEGNVEVEVDSSGVLYF-VSSDSSQVCLALASLK--SEYETSIIGNYQQKNLRVIFDTL 458

Query: 454 NQRLGFKQQLCK 465
             ++GF Q+ C 
Sbjct: 459 GSQIGFAQETCD 470


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 111/387 (28%), Positives = 169/387 (43%), Gaps = 56/387 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
           G Y   +  GTP +    ++DTGS L W  C+    C+  C     P F PK SSS   +
Sbjct: 115 GNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCS---PCRVSCHRQSGPVFDPKTSSSYAAV 171

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
            C +P+C  +   ++    C+          + +C  Y   YG S  + G    +T++  
Sbjct: 172 SCSSPQCDGLSTATLNPAVCSP---------SNVC-IYQASYGDSSFSVGYLSKDTVSFG 221

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
              +PNF  GC   +     + AG+ G  R K SL  QL       FSYCL S       
Sbjct: 222 ANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPS------- 274

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV-W 317
            +SS  L  GS +      G +YTP V+N            Y++ L  +TV G+ + V  
Sbjct: 275 TSSSGYLSIGSYNPG----GYSYTPMVSN------TLDDSLYFISLSGMTVAGKPLAVSS 324

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
            +Y +L       TI+DSGT  T +   ++  L+    + M   +  T+   A ++  L 
Sbjct: 325 SEYTSLP------TIIDSGTVITRLPTSVYTALSKAVAAAM---KGSTKRAAAYSI--LD 373

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            CF+    K  + P + + F GGA + L   N    V +G+  CL     R A+     I
Sbjct: 374 TCFEGQASKLRAVPAVSMAFSGGATLKLSAGNLLVDV-DGATTCLAFAPARSAA-----I 427

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +GN Q Q + V YD+++ R+GF    C
Sbjct: 428 IGNTQQQTFSVVYDVKSNRIGFAAAGC 454


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 107/392 (27%), Positives = 173/392 (44%), Gaps = 48/392 (12%)

Query: 95  FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
            GTPP+ +  ++DT S L W   T+   C  CS +K+P F P LSSS     C +  C  
Sbjct: 5   IGTPPREVLLLVDTASELTWVQGTS---CTNCSPTKVPPFNPGLSSSFISEPCTSSVCLG 61

Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN-----RII 208
                 Q   CN     ++ +C     S+ V Y  G    G+   E  +L +       +
Sbjct: 62  RSKLGFQSA-CNR----STGSC-----SFQVAYLDGSEAYGVIAREIFSLQSWDGAASTL 111

Query: 209 PNFLVGCSVLSSRQP----AGIAGFGRGKTSLPSQLNL-------DKFSYCLLSHKFDDT 257
            + + GC+    ++P    +G  G  RG  S P+Q+         D+FSYC   ++ +  
Sbjct: 112 GDVIFGCASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCF-PNRAEHL 170

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
             +  +I  +    S        Y      P +A   +   +YYVGL+ I+VGG+ + + 
Sbjct: 171 NSSGVIIFGD----SGIPAHHFQYLSLEQEPPIA---SIVDFYYVGLQGISVGGELLHIP 223

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                +DR GNGGT  DSGTT +F+       L + F  +++   +  R  G++    L 
Sbjct: 224 RSAFKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVL---HLNRTSGSDFTKEL- 279

Query: 378 PCFDVPG--EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV---CLTVVTDREASG 432
            C+DV     +  + P + LHFK   ++ L   + +  +     V   CL  V     + 
Sbjct: 280 -CYDVAAGDARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQ 338

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           G   ++GN+Q Q+Y +E+DL   R+GF    C
Sbjct: 339 GGVNVIGNYQQQDYLIEHDLERSRIGFAPANC 370


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 119/386 (30%), Positives = 168/386 (43%), Gaps = 49/386 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP + +  +LDTGS +VW  C     C+ C +     F P  S +   + 
Sbjct: 116 GEYFTRIGVGTPARYVYMVLDTGSDVVWLQCA---PCRKCYTQTDHVFDPTKSRTYAGIP 172

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P C  +           D P  ++KN  ++C  Y V YG G  T G   +ETL    
Sbjct: 173 CGAPLCRRL-----------DSPGCSNKN--KVC-QYQVSYGDGSFTFGDFSTETLTFRR 218

Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
             +    +GC   +       AG+ G GRG+ S P Q       KFSYCL+      + +
Sbjct: 219 NRVTRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRS--ASAK 276

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            SS+I  + +           +TP + NP +        +YY+ L  I+VGG  VR    
Sbjct: 277 PSSVIFGDSAVSRTAH-----FTPLIKNPKL------DTFYYLELLGISVGGAPVRGLSA 325

Query: 320 YL-TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
            L  LD  GNGG I+DSGT+ T +    +  L D F    +   +  R   A   +    
Sbjct: 326 SLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAF---RIGASHLKR---APEFSLFDT 379

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CFD+ G      P + LHF+ GA+V+LP  NY   V    + C           G SII 
Sbjct: 380 CFDLSGLTEVKVPTVVLHFR-GADVSLPATNYLIPVDNSGSFCFAFAG---TMSGLSII- 434

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q Q + + YDL   R+GF  + C
Sbjct: 435 GNIQQQGFRISYDLTGSRVGFAPRGC 460


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 112/390 (28%), Positives = 172/390 (44%), Gaps = 38/390 (9%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++SL+ GTPPQ +  ++DTGS L W  C         SSS   +F P  SSS   + C +
Sbjct: 74  TVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQN----SSSSSSTFNPVWSSSYSPIPCSS 129

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
             C      + Q RD    P+  S +  Q C + L    +  +EG   ++T  + +  IP
Sbjct: 130 STC------TDQTRDF---PIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP 180

Query: 210 NFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
           N + GC  S+ SS      +  G+ G  RG  S  SQ+   KFSYC+  + F      S 
Sbjct: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDF------SG 234

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L+L   ++ S      L YTP +   S        V Y V L  I V  + + +      
Sbjct: 235 LLLLGDANFS--WLAPLNYTPLIEM-STPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFE 291

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            D  G G T+VDSGT FTF+    +  L D F+++   +              +  C+ V
Sbjct: 292 PDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRV 351

Query: 383 PGEKTG--SFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPS 435
           P  +T     P + L F+ GAE+T+  +     V     G  S  C T   + +  G  +
Sbjct: 352 PTNQTRLPPLPSVTLVFR-GAEMTVTGDRILYRVPGERRGNDSIHCFT-FGNSDLLGVEA 409

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            ++G+   QN ++E+DL+  R+G  +  C 
Sbjct: 410 FVIGHLHQQNVWMEFDLKKSRIGLAEIRCD 439


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 118/403 (29%), Positives = 174/403 (43%), Gaps = 46/403 (11%)

Query: 78  TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPK 137
           +  +S H     ++SL+ G+PPQ +  +LDTGS L W  C         S +    F P 
Sbjct: 29  SNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKK-------SPNLTSVFNPL 81

Query: 138 LSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
            SSS   + C +P C              D P   + +  ++C + +    +   EG   
Sbjct: 82  SSSSYSPIPCSSPVCR---------TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLA 132

Query: 198 SETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
           S+   + +  +P  L GC  S  SS      +  G+ G  RG  S  +QL L KFSYC+ 
Sbjct: 133 SDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI- 191

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                 + R SS +L  G SH       LTYTP V   S        V Y V L  I VG
Sbjct: 192 ------SGRDSSGVLLFGDSHL-SWLGNLTYTPLVQI-STPLPYFDRVAYTVQLDGIRVG 243

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
            + + +       D  G G T+VDSGT FTF+   ++  L +EF+ Q    +     LG 
Sbjct: 244 NKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQ---TKGVLAPLGD 300

Query: 371 EALT---GLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVC 421
                   +  C+ VP G K    P + L F+ GAE+ +  E     V     G+    C
Sbjct: 301 PNFVFQGAMDLCYRVPAGGKLPELPAVSLMFR-GAEMVVGGEVLLYKVPGMMKGKEWVYC 359

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           LT   + +  G  + ++G+   QN ++E+DL   R+GF +  C
Sbjct: 360 LT-FGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 401


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 120/399 (30%), Positives = 168/399 (42%), Gaps = 61/399 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +  S GTP Q    I+DTGS L +  C     C  C     P + P  SS+   + 
Sbjct: 32  GQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCA---PCDLCYEQDGPLYQPSNSSTFTPVP 88

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP--------SYLVLYG-SGLTEGIAL 197
           C + +C  I             P      C+   P        SY   YG +  T G+  
Sbjct: 89  CDSAECLLI-------------PAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFA 135

Query: 198 SETLNLPNRIIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYC 248
            ET  +    + +   GC      S +S+    G+ G G+G  S  SQ      +KF+YC
Sbjct: 136 YETATVGGIRVNHVAFGCGNRNQGSFVSA---GGVLGLGQGALSFTSQAGYAFENKFAYC 192

Query: 249 LLSHKFDDTTRTSSLIL--DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           L S+    T+  SSLI   D  S+  D + T L   P   NPSV         YYV + R
Sbjct: 193 LTSY-LSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPL--NPSV---------YYVQIVR 240

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I  GG+ + +      +D  GNGGTI DSGTT T+ +P+ +       ++   K+  Y R
Sbjct: 241 ICFGGETLLIPDSAWKIDSVGNGGTIFDSGTTVTYWSPQAYA----RIIAAFEKSVPYPR 296

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
           A    +  GL  C +V G     +P   + F  GA       NYF  V   +  CL ++ 
Sbjct: 297 A--PPSPQGLPLCVNVSGIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSP-NIDCLAML- 352

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             E+S     ++GN   QNY V+YD    R+GF    C 
Sbjct: 353 --ESSSDGFNVIGNIIQQNYLVQYDREEHRIGFAHANCD 389


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 120/388 (30%), Positives = 173/388 (44%), Gaps = 66/388 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRL 144
           Y I++ FGTP +    I DTGS++ W       QCK C  S    + P F P LSS+ R 
Sbjct: 16  YVITVGFGTPKKNQTVIFDTGSNVNWI------QCKPCVVSCYPQQEPLFDPTLSSTYRN 69

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
           + C +  C+ +                +S+ C+     Y V YG G  T G   +ET  L
Sbjct: 70  ISCTSAACTGL----------------SSRGCSGSTCVYGVTYGDGSSTVGFLATETFTL 113

Query: 204 -PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
               +  NF+ GC   +       AG+ G GR   SL SQL     + FSYCL S     
Sbjct: 114 AAGNVFNNFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPS----- 168

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            T +++  L+ G   +  +T G  YT  + N            Y++ L  I+VGG R+ +
Sbjct: 169 -TSSATGYLNIG---NPLRTPG--YTAMLTNSRAPT------LYFIDLIGISVGGTRLAL 216

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                      + GTI+DSGT  T + P  +  L   F + M +   YTRA  A     L
Sbjct: 217 SSTVFQ-----SVGTIIDSGTVITRLPPTAYGALRTAFRAAMTQ---YTRAAAASI---L 265

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C+D     T +FP +KLH+  G +VT+P    F V+   S VCL    + +++     
Sbjct: 266 DTCYDFSRTTTVTFPTIKLHYT-GLDVTIPGAGVFYVI-SSSQVCLAFAGNSDST--QIG 321

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q +   V YD   +R+GF    C
Sbjct: 322 IIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 118/412 (28%), Positives = 175/412 (42%), Gaps = 61/412 (14%)

Query: 64  NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
           +P    +++T +   T+  + S G Y +++  GTP      + DTGS   W       QC
Sbjct: 138 HPGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWV------QC 191

Query: 124 K----YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQI 179
           +     C   K P F P  SS+   + C +  C+ +        D N         CT  
Sbjct: 192 RPCVVKCYKQKEPLFDPAKSSTYANVSCTDSACADL--------DTN--------GCTGG 235

Query: 180 CPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTS 235
              Y V YG G  T G    +TL + +  I  F  GC   ++    + AG+ G GRGKTS
Sbjct: 236 HCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTS 295

Query: 236 LPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
           L  Q        F+YCL +     TT T  L    GS+ ++ + T          P + +
Sbjct: 296 LTVQAYNKYGGAFAYCLPAL----TTGTGYLDFGPGSAGNNARLT----------PMLTD 341

Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
           +     +YYVG+  I VGGQ+V V     +       GT+VDSGT  T +    +  L+ 
Sbjct: 342 KG--QTFYYVGMTGIRVGGQQVPVAESVFS-----TAGTLVDSGTVITRLPATAYTALSS 394

Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
            F   M+  R Y +A G      L  C+D  G      P + L F+GGA + + V     
Sbjct: 395 AFDKVMLA-RGYKKAPGYSI---LDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVY 450

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + E + VCL   ++ +       I+GN Q + Y V YDL  + +GF    C
Sbjct: 451 AISE-AQVCLAFASNGDDES--VAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 118/412 (28%), Positives = 175/412 (42%), Gaps = 61/412 (14%)

Query: 64  NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
           +P    +++T +   T+  + S G Y +++  GTP      + DTGS   W       QC
Sbjct: 138 HPGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWV------QC 191

Query: 124 K----YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQI 179
           +     C   K P F P  SS+   + C +  C+ +        D N         CT  
Sbjct: 192 RPCVVKCYKQKGPLFDPAKSSTYANVSCTDSACADL--------DTN--------GCTGG 235

Query: 180 CPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTS 235
              Y V YG G  T G    +TL + +  I  F  GC   ++    + AG+ G GRGKTS
Sbjct: 236 HCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTS 295

Query: 236 LPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
           L  Q        F+YCL +     TT T  L    GS+ ++ + T          P + +
Sbjct: 296 LTVQAYNKYGGAFAYCLPAL----TTGTGYLDFGPGSAGNNARLT----------PMLTD 341

Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
           +     +YYVG+  I VGGQ+V V     +       GT+VDSGT  T +    +  L+ 
Sbjct: 342 KG--QTFYYVGMTGIRVGGQQVPVAESVFS-----TAGTLVDSGTVITRLPATAYTALSS 394

Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
            F   M+  R Y +A G      L  C+D  G      P + L F+GGA + + V     
Sbjct: 395 AFDKVMLA-RGYKKAPGYSI---LDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVY 450

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + E + VCL   ++ +       I+GN Q + Y V YDL  + +GF    C
Sbjct: 451 AISE-AQVCLAFASNGDDE--SVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 167/381 (43%), Gaps = 29/381 (7%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +SL  GTP Q    +LDTGS L W  C +  + K        SF P LSSS   L C +P
Sbjct: 82  LSLPIGTPSQSQELVLDTGSQLSWIQC-HPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 140

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
            C              D  L TS +  ++C  Y   Y  G   EG  + E     N +  
Sbjct: 141 LCK---------PRIPDFTLPTSCDSNRLC-HYSYFYADGTFAEGNLVKEKFTFSNSQTT 190

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL--LSHKFDDTTRTSSLILD 266
           P  ++GC+  S+ +  GI G   G+ S  SQ  + KFSYC+   S++    +  S  + D
Sbjct: 191 PPLILGCAKESTDE-KGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGD 249

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
           N +S   K  + LT+      P     N   + Y V L+ I +G +R+ +       D  
Sbjct: 250 NPNSRGFKYVSLLTFPQSQRMP-----NLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAG 304

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK 386
           G+G T+VDSG+ FT +    ++ + +E V  +          G+ A      CFD  G  
Sbjct: 305 GSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADM----CFD--GNH 358

Query: 387 TGSFPEL--KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
           +     L   L F+ G  V + VE    +V  G  +    +      G  S I+GN   Q
Sbjct: 359 SMEIGRLIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQ 418

Query: 445 NYYVEYDLRNQRLGFKQQLCK 465
           N +VE+D+ N+R+GF +  C+
Sbjct: 419 NLWVEFDVTNRRVGFSKAECR 439


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 100/332 (30%), Positives = 145/332 (43%), Gaps = 43/332 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPPQ +   LDTGS L+W  C     C  C    +P F P  SS+  L  C 
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 138

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
           +  C     + +    C       ++ C      Y   YG   +T G    +        
Sbjct: 139 STLC-----QGLPVASCGSPKFWPNQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG 188

Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             +P    GC + ++        GIAGFGRG  SLPSQL +  FS+C  +    +  + S
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPS 245

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +++LD  +         +  TP + NP      A   +YY+ L+ ITVG  R+ V     
Sbjct: 246 TVLLDLPADLYKSGRGAVQSTPLIQNP------ANPTFYYLSLKGITVGSTRLPVPESEF 299

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLRP 378
            L ++G GGTI+DSGT  T +   ++  + D F +Q+   V + N T             
Sbjct: 300 AL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--------- 349

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENY 410
           C   P       P+L LHF+ GA + LP ENY
Sbjct: 350 CLSAPLRAKPYVPKLVLHFE-GATMDLPRENY 380


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 120/418 (28%), Positives = 178/418 (42%), Gaps = 51/418 (12%)

Query: 58  RALHIKNPQT--KTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
           R+   KNP    K   +  T  + + S+   G Y +++  GTP + + FI DTGS L W 
Sbjct: 105 RSRLAKNPADGGKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWT 164

Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
            C      +YC   + P F P  S+S   + C +P C  +   +       + P  ++  
Sbjct: 165 QC--EPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGT------GNSPSCSAST 216

Query: 176 CTQICPSYLVLYGS-GLTEGIALSETLNLPN-RIIPNFLVGCSVLSSR---QPAGIAGFG 230
           C      Y + YG    + G    + L L +  +  NFL GC   +       AG+ G G
Sbjct: 217 CV-----YGIQYGDQSYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLG 271

Query: 231 RGKTSLPSQLNLDK---FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
           R   SL SQ        FSYCL S      T +S+  L  GS     K    T       
Sbjct: 272 RNALSLVSQTAQKYGKLFSYCLPS------TSSSTGYLTFGSGGGTSKAVKFT------- 318

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
           PS+      S +Y++ L  I+VGG+++       +       GTI+DSGT  + + P  +
Sbjct: 319 PSLVNSQGPS-FYFLNLIAISVGGRKLSTSASVFS-----TAGTIIDSGTVISRLPPTAY 372

Query: 348 EPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPV 407
             L   F  QM K   Y +A  A     L  C+D     T   P++ L+F  GAE+ L  
Sbjct: 373 SDLRASFQQQMSK---YPKAAPASI---LDTCYDFSQYDTVDVPKINLYFSDGAEMDLDP 426

Query: 408 ENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
              F ++   S VCL    + +A+     ILGN Q + + V YD+   R+GF    C+
Sbjct: 427 SGIFYILNI-SQVCLAFAGNSDAT--DIAILGNVQQKTFDVVYDVAGGRIGFAPGGCE 481


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 137/484 (28%), Positives = 212/484 (43%), Gaps = 85/484 (17%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALH 61
           L+  FFF   SI  S   S  FS+   H +        P+Q+ YQ++   V  S+ R  H
Sbjct: 7   LTLSFFFLCFSISFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVVDAVHRSINRVNH 66

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
                +   +  +T  +T IS    G Y +S S GTPP     I+DTGS +VW  C    
Sbjct: 67  -----SNKNSLASTPESTVISYE--GDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCE--- 116

Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
            C+ C +   P F P  SSS + + C +  C     +S++   CND+     KNC     
Sbjct: 117 PCEQCYNQTTPKFNPSKSSSYKNISCSSKLC-----QSVRDTSCNDK-----KNC----- 161

Query: 182 SYLVLYGS-GLTEGIALSETLNLPNRI-----IPNFLVGCSVLS----SRQPAGIAGFGR 231
            Y + YG+   ++G    ETL L +        P  ++GC   +     R  +G+ G G 
Sbjct: 162 EYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGG 221

Query: 232 GKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG----LTYTPF 284
           G  SL +QL      KFSYCL+           S+ L N S  S K   G    ++    
Sbjct: 222 GPASLITQLGPSIGGKFSYCLVRM---------SITLKNMSMGSSKLNFGDVAIVSGHNV 272

Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
           ++ P V + ++F  +YY+ +   +VG +RV        ++    G  I+DS T  TF+  
Sbjct: 273 LSTPIVKKDHSF--FYYLTIEAFSVGDKRVEFAGSSKGVEE---GNIIIDSSTIVTFVPS 327

Query: 345 ELFEPLADEFVS----QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
           +++  L    V     + V + N   +L          C++V  ++   FP +  HFK G
Sbjct: 328 DVYTKLNSAIVDLVTLERVDDPNQQFSL----------CYNVSSDEEYDFPYMTAHFK-G 376

Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
           A++ L   N F  V     +C        ++GG   I G+F  Q++ V YDL+ + + FK
Sbjct: 377 ADILLYATNTFVEVAR-DVLCFAFA---PSNGGA--IFGSFSQQDFMVGYDLQQKTVSFK 430

Query: 461 QQLC 464
              C
Sbjct: 431 SVDC 434


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 166/389 (42%), Gaps = 59/389 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSRLL 145
           G Y + +  GTP +    + DTGS   W  C     C  YC   K P F P  S++   +
Sbjct: 94  GNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQ---PCVAYCYRQKEPLFDPTKSATYANI 150

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP 204
            C +  CS ++        C                 Y + YG G  T G    +TL L 
Sbjct: 151 SCSSSYCSDLYVSGCSGGHC----------------LYGIQYGDGSYTIGFYAQDTLTLA 194

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
              I NF  GC   +     + AG+ G GRGKTSLP Q   DK    F+YCL +     +
Sbjct: 195 YDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQA-YDKYGGVFAYCLPA----TS 249

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
             T  L L  G+  ++ + T          P + +R     +YYVG+  I VGG  + + 
Sbjct: 250 AGTGFLDLGPGAPAANARLT----------PMLVDRG--PTFYYVGMTGIKVGGHVLPIP 297

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
               +       GT+VDSGT  T + P  + PL   F S+ ++   Y+    A A + L 
Sbjct: 298 GSVFS-----TAGTLVDSGTVITRLPPSAYAPLRSAF-SKAMQGLGYS---AAPAFSILD 348

Query: 378 PCFDVPGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
            C+D+ G K GS   P + L F+GGA + +        V + S  CL    + + +    
Sbjct: 349 TCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGIL-YVADVSQACLAFAPNADDT--DV 405

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q + + V YD+  + +GF    C
Sbjct: 406 AIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 109/385 (28%), Positives = 167/385 (43%), Gaps = 54/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    G+PPQ +   +DT +   W PCT    C  C+S+    F P+ S++ + + C 
Sbjct: 98  YIVRAKIGSPPQTLLLAMDTSNDAAWIPCT---ACDGCTSTL---FAPEKSTTFKNVSCG 151

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P+C+ + + S     C                ++ + YGS       + +T+ L    I
Sbjct: 152 SPQCNQVPNPSCGTSAC----------------TFNLTYGSSSIAANVVQDTVTLATDPI 195

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
           P++  GC   +  +S  P G+ G GRG  SL SQ   L    FSYCL S K   F  + R
Sbjct: 196 PDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLR 255

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
              +           +   + YTP + NP    R++    YYV L  I VG + V +  +
Sbjct: 256 LGPV----------AQPIRIKYTPLLKNP---RRSSL---YYVNLVAIRVGRKVVDIPPE 299

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
            L  +     GT+ DSGT FT +    +  + DEF  ++         L   +L G   C
Sbjct: 300 ALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKAN--LTVTSLGGFDTC 357

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           + VP       P +   F  G  VTLP +N       GS  CL + +  +       ++ 
Sbjct: 358 YTVPIVA----PTITFMF-SGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIA 412

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q QN+ V YD+ N RLG  ++LC
Sbjct: 413 NMQQQNHRVLYDVPNSRLGVARELC 437


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 107/346 (30%), Positives = 160/346 (46%), Gaps = 43/346 (12%)

Query: 131 IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG 190
           +P   P  SSS+  + C +  C  +      C +      + S NC     SY   YG+ 
Sbjct: 12  LPLLYPTSSSSAAFVACGDRTCGELPRP--LCSNVAGG-GSGSGNC-----SYHYAYGNA 63

Query: 191 -----LTEGIALSETLNLPNRI--IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQL 240
                 TEGI ++ET    +     P    GC++ S       +G+ G GRGK SL +QL
Sbjct: 64  RDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQL 123

Query: 241 NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
           N++ F Y L S    D +  S +   + +  +         TP + NP V +      +Y
Sbjct: 124 NVEAFGYRLSS----DLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLP----FY 175

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           YVGL  I+VGG+ V++     + DR  G GG I DSGTT T +    +  + DE +SQM 
Sbjct: 176 YVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMG 235

Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----G 415
             +    A   + +     CF   G  T +FP + LHF GGA++ L  ENY   +    G
Sbjct: 236 FQKPPPAANDDDLI-----CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNG 289

Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR-NQRLGFK 460
           E +A C +VV   +A      I+GN    +++V +DL  N R+ F+
Sbjct: 290 E-TARCWSVVKSSQA----LTIIGNIMQMDFHVVFDLSGNARMLFQ 330


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 114/389 (29%), Positives = 164/389 (42%), Gaps = 60/389 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 177 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKL--FDPARSSTYANVS 234

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS      +  R C      +  +C      Y V YG G  + G    +TL L +
Sbjct: 235 CAAPACS-----DLDTRGC------SGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 278

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
              +  F  GC   +     + AG+ G GRGKTSLP Q   DK    F++CL +      
Sbjct: 279 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 333

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             T +  LD G+     +   LT TP +  N P+         +YYVGL  I VGG+ + 
Sbjct: 334 --TGTGYLDFGAGSPAAR---LTTTPMLVDNGPT---------FYYVGLTGIRVGGRLLY 379

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +             GTIVDSGT  T + P  +  L   F + M   R Y +   A A++ 
Sbjct: 380 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSLRSAFAAAM-SARGYKK---APAVSL 430

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
           L  C+D  G    + P + L F+GGA + +            S VCL    + +  GG  
Sbjct: 431 LDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASGIM-YAASASQVCLAFAANED--GGDV 487

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q++ + V YD+  + + F    C
Sbjct: 488 GIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 112/388 (28%), Positives = 165/388 (42%), Gaps = 57/388 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTP +    + DTGS   W  C       YC   K P F P  S++   + 
Sbjct: 159 GNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV--AYCYRQKEPLFDPTKSATYANIS 216

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  CS ++        C                 Y + YG G  T G    +TL L  
Sbjct: 217 CSSSYCSDLYVSGCSGGHC----------------LYGIQYGDGSYTIGFYAQDTLTLAY 260

Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDTT 258
             I NF  GC   +     + AG+ G GRGKTSLP Q   DK    F+YCL +     + 
Sbjct: 261 DTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQA-YDKYGGVFAYCLPA----TSA 315

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            T  L L  G+  ++ + T          P + +R     +YYVG+  I VGG  + +  
Sbjct: 316 GTGFLDLGPGAPAANARLT----------PMLVDRG--PTFYYVGMTGIKVGGHVLPIPG 363

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +       GT+VDSGT  T + P  + PL   F S+ ++   Y+    A A + L  
Sbjct: 364 SVFS-----TAGTLVDSGTVITRLPPSAYAPLRSAF-SKAMQGLGYS---AAPAFSILDT 414

Query: 379 CFDVPGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
           C+D+ G K GS   P + L F+GGA + +        V + S  CL    + + +     
Sbjct: 415 CYDLTGHKGGSIALPAVSLVFQGGACLDVDASGIL-YVADVSQACLAFAPNADDTD--VA 471

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q + + V YD+  + +GF    C
Sbjct: 472 IVGNTQQKTHGVLYDIGKKIVGFAPGAC 499


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 134/479 (27%), Positives = 198/479 (41%), Gaps = 76/479 (15%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALH 61
           L+ +FF     +  S      FS+   H +        P+Q+ YQ        S+ RA H
Sbjct: 7   LTLLFFSICFIVSFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFVDAARRSINRANH 66

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
               +        +T   +I     G Y ++ S GTPP  +  I+DTGS +VW  C    
Sbjct: 67  FY--KYSLANIPQSTVIPDI-----GEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCE--- 116

Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
            C+ C +   P F P  SSS + + C +  C     +S++   CND      KN  +   
Sbjct: 117 PCQECYNQTTPMFNPSKSSSYKNIPCPSKLC-----QSMEDTSCND------KNYCE--- 162

Query: 182 SYLVLYGSGLTEGIALS------ETLNLPNRIIPNFLVGC---SVLSSR-QPAGIAGFGR 231
            Y   YG     G  LS      E+ N      PN ++GC   ++LS     +GI GFG 
Sbjct: 163 -YSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGS 221

Query: 232 GKTSLPSQLNLD---KFSYC---LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV 285
           G  S  +QL      KFSYC   L S     +  TS L   + ++ S     G+  TP +
Sbjct: 222 GPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGD---GVVTTPIL 278

Query: 286 NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
                  +     +YY+ L   +VG +RV +       + D  G  I+DSGTT T +  +
Sbjct: 279 -------KKDPETFYYLTLEAFSVGNRRVEIGG---VPNGDNEGNIIIDSGTTLTSLTKD 328

Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
            +  L    V  +   R        +    L  C+ V  E    FP + +HFK GA+V L
Sbjct: 329 DYSFLESAVVDLVKLERV------DDPTQTLNLCYSVKAEGY-DFPIITMHFK-GADVDL 380

Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              + F  V +G   CL   + ++ +     I GN   QN  V YDL+ + + FK   C
Sbjct: 381 HPISTFVSVADG-VFCLAFESSQDHA-----IFGNLAQQNLMVGYDLQQKIVSFKPSDC 433


>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 342

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 89/258 (34%), Positives = 129/258 (50%), Gaps = 22/258 (8%)

Query: 214 GCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
           GC  LS+      +G+ G   G  SL SQL++ +FSYCL         +TS ++    + 
Sbjct: 97  GCGALSAGSLVGASGLMGLSPGTMSLISQLSVPRFSYCLTPFA---ERKTSPMLFGAMAD 153

Query: 271 HSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
                TTG +  T  + NP++      + YYYV L  +++G +R+RV    L ++ DG G
Sbjct: 154 LRKYNTTGPIQTTAILRNPAMD-----TFYYYVPLVGLSLGTKRLRVPAASLAINPDGTG 208

Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTG 388
           GTIVDSG+T   +A + F+ +  + V + VK   +   +    L     CF VP G    
Sbjct: 209 GTIVDSGSTMAHLAGKAFDAV-KKAVLEAVKLPVFNGTVEDYEL-----CFAVPSGVAMA 262

Query: 389 SF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNY 446
           +   P L LHF GGA + LP +NYF     G  +CL V    E  G P  I+GN Q QN 
Sbjct: 263 AVKTPPLVLHFDGGAAMALPRDNYFQEPRAG-LMCLAVARSPEDLGAPISIIGNVQQQNM 321

Query: 447 YVEYDLRNQRLGFKQQLC 464
           +V +D+ NQ+  F    C
Sbjct: 322 HVLFDVHNQKFSFAPTKC 339


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 110/385 (28%), Positives = 159/385 (41%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ++D+GS ++W  C     C  C     P F P  SSS   + 
Sbjct: 132 GEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCE---PCTQCYHQSDPVFNPADSSSYAGVS 188

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  CS + +       C                 Y V YG G  T+G    ETL    
Sbjct: 189 CASTVCSHVDNAGCHEGRCR----------------YEVSYGDGSYTKGTLALETLTFGR 232

Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
            +I N  +GC   +       AG+ G G G  S   QL       FSYCL+S        
Sbjct: 233 TLIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQ---- 288

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            SS +L  G    +    G  + P ++NP          +YYVGL  + VGG RV +   
Sbjct: 289 -SSGLLQFGR---EAVPVGAAWVPLIHNPRAQS------FYYVGLSGLGVGGLRVPISED 338

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L   G+GG ++D+GT  T +    +E   D F++Q     N  RA G         C
Sbjct: 339 VFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTT---NLPRASGVSIFD---TC 392

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F GG  +TLP  N+   V +  + C        +S G SII G
Sbjct: 393 YDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFA---PSSSGLSII-G 448

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   +  D  N  +GF   +C
Sbjct: 449 NIQQEGIEISVDGANGFVGFGPNVC 473


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 114/392 (29%), Positives = 165/392 (42%), Gaps = 56/392 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP + +  I DTGS L W  C      K C + + P F P  S +   + 
Sbjct: 152 GNYIVNVGLGTPKKDLSLIFDTGSDLTWTQC--QPCVKSCYAQQQPIFDPSTSKTYSNIS 209

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
           C +  CS +   +       + P  +S NC      Y + YG S  T G    + L L  
Sbjct: 210 CTSAACSSLKSAT------GNSPGCSSSNCV-----YGIQYGDSSFTIGFFAKDKLTLTQ 258

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
           N +   F+ GC   +     + AG+ G GR   S+  Q        FSYCL       T+
Sbjct: 259 NDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL------PTS 312

Query: 259 RTSSLIL----DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
           R S+  L     NG   S     G+T+TPF ++   A       YY++ +  I+VGG+ +
Sbjct: 313 RGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTA-------YYFIDVLGISVGGKAL 365

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            +           N GTI+DSGT  T +    +  L   F   M K         A AL+
Sbjct: 366 SISPMLFQ-----NAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPT------APALS 414

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV--TDREASG 432
            L  C+D+    + S P++  +F G A V L   N   +    S VCL      D ++ G
Sbjct: 415 LLDTCYDLSNYTSISIPKISFNFNGNANVELD-PNGILITNGASQVCLAFAGNGDDDSIG 473

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               I GN Q Q   V YD+   +LGF  + C
Sbjct: 474 ----IFGNIQQQTLEVVYDVAGGQLGFGYKGC 501


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 145/356 (40%), Gaps = 47/356 (13%)

Query: 126 CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLV 185
           C++   P F P  SS+   L C +  C ++    + C          +  C    P    
Sbjct: 88  CAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCN---------ATGCVYYYP---- 134

Query: 186 LYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS--RQPAGIAGFGRGKTSLPSQLNLD 243
            YG G T G   +ETL++     P    GCS  +      +GI G GR   SL SQ+ + 
Sbjct: 135 -YGMGFTAGYLATETLHVGGASFPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVG 193

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTP-FVNNPSVAERNAFSVYYYV 302
           +FSYCL S    D     S IL      S  K TG   +P  + NP +      S YYYV
Sbjct: 194 RFSYCLRS----DADAGDSPILFG----SLAKVTGGKSSPAILENPEMPS----SSYYYV 241

Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
            L  ITVG   + V        R       GGTIVDSGTT T++  E +  +   F+SQM
Sbjct: 242 NLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQM 301

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVV- 414
                 T   G     G   CFD      GS    P L L F GGAE  +   +Y  VV 
Sbjct: 302 ATANLTTTVNGTR--FGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVE 359

Query: 415 ----GEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
               G  +  CL V+   E     SI I+GN    + +V YDL      F    C 
Sbjct: 360 VDSQGRAAVECLLVLPASEKL---SISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 412


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 122/481 (25%), Positives = 198/481 (41%), Gaps = 67/481 (13%)

Query: 8   LCLSFIFFFTLLSIFPSSITS------LTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALH 61
           +  +F+F   LL    S +TS      L   L+         + + +   V+ S  R L 
Sbjct: 1   MARTFVFLLVLLCFRASLVTSSSTGAGLRMKLTHVDDKAGYTTEERVRRAVAVSRER-LA 59

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
               Q +   +   +   ++++  Y    +    G PPQ    ++DTGS+L+W  C    
Sbjct: 60  YTQQQQQLRASGDVSAPVHLATRQYIAEYL---IGDPPQRAAALIDTGSNLIWTQCGTTC 116

Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
             K C+   +P +   LS SS                +     C D     + N   +C 
Sbjct: 117 GLKACAKQDLPYY--NLSRSS----------------TFAAVPCADSAKLCAANGVHLCG 158

Query: 182 -----SYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS------RQPAGIAGFG 230
                ++   YG+G   G   +E     +        GC  L+          +G+ G G
Sbjct: 159 LDGSCTFAASYGAGSVFGSLGTEAFTFQSG-AAKLGFGCVSLTRITKGALNGASGLIGLG 217

Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           RG+ SL SQ    KFSYCL  +  +     SS +    S+        +T  PFV +P  
Sbjct: 218 RGRLSLVSQTGATKFSYCLTPYLRNHG--ASSHLFVGASASLSGGGGAVTSIPFVKSP-- 273

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG----NGGTIVDSGTTFTFMAPEL 346
            E   +S +YY+ L  I+VG  ++ +      L R      +GG I+D+G+  T +A   
Sbjct: 274 -EDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAA 332

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF---DVPGEKTGSFPELKLHFKGGAEV 403
           +  L+DE   Q+  NR+  +     A TGL  C    DV  +K    P L  HF GGA++
Sbjct: 333 YSALSDEVARQL--NRSLVQ---PPADTGLDLCVARQDV--DKV--VPVLVFHFGGGADM 383

Query: 404 TLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
            +   +Y+  V + +A C+ +       GG   ++GNFQ Q+ ++ YD+    L F+   
Sbjct: 384 AVSAGSYWGPVDKSTA-CMLI-----EEGGYETVIGNFQQQDVHLLYDIGKGELSFQTAD 437

Query: 464 C 464
           C
Sbjct: 438 C 438


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 123/442 (27%), Positives = 196/442 (44%), Gaps = 57/442 (12%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSIS 92
           LS F+ N  +   Q +N+ +  S++R  H         + +     ++++S+  G Y +S
Sbjct: 43  LSPFY-NSEETDLQRINNALRRSISRVHHFD--PIAAASVSPKAAESDVTSNR-GEYLMS 98

Query: 93  LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
           LS GTPP  I  I DTGS L+W  C     C+ C     P F PK S + R   C   +C
Sbjct: 99  LSLGTPPFKIMGIADTGSDLIWTQCK---PCERCYKQVDPLFDPKSSKTYRDFSCDARQC 155

Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
           S +   +     C  +     ++ T          G+  ++ I L  T   P    P  +
Sbjct: 156 SLLDQSTCSGNICQYQYSYGDRSYTM---------GNVASDTITLDSTTGSPVS-FPKTV 205

Query: 213 VGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLD---KFSYCL--LSHKFDDTTRTSSL 263
           +GC   +    S + +GI G G G  SL SQ+      KFSYCL  LS +  ++++    
Sbjct: 206 IGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKL--- 262

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
              N  S++     G+  TP +++ ++      S +Y++ L  ++VG +R++     L  
Sbjct: 263 ---NFGSNAVVSGPGVQSTPLLSSETM------SSFYFLTLEAMSVGNERIKFGDSSLGT 313

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG-LRPCFDV 382
              G G  I+DSGTT T +  + F  L+    +Q+   R       AE  +G L  C+  
Sbjct: 314 ---GEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRR-------AEDPSGFLSVCYSA 363

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
             +     P +  HF  GA+V L   N F  V +   VCL   +    + G S I GN  
Sbjct: 364 TSDL--KVPAITAHFT-GADVKLKPINTFVQVSD-DVVCLAFAS---TTSGIS-IYGNVA 415

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
             N+ VEY+++ + L FK   C
Sbjct: 416 QMNFLVEYNIQGKSLSFKPTDC 437


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 113/392 (28%), Positives = 166/392 (42%), Gaps = 56/392 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP + +  I DTGS L W  C      K C + + P F P  S +   + 
Sbjct: 152 GNYIVNVGLGTPKKDLSLIFDTGSDLTWTQC--QPCVKSCYAQQQPIFDPSASKTYSNIS 209

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
           C +  CS +   +       + P  +S NC      Y + YG S  T G    +TL L  
Sbjct: 210 CTSTACSGLKSAT------GNSPGCSSSNCV-----YGIQYGDSSFTVGFFAKDTLTLTQ 258

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
           N +   F+ GC   +     + AG+ G GR   S+  Q        FSYCL       T+
Sbjct: 259 NDVFDGFMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL------PTS 312

Query: 259 RTSSLIL----DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
           R S+  L     NG   S     G+T+TPF ++         + +Y++ +  I+VGG+ +
Sbjct: 313 RGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQG-------ATFYFIDVLGISVGGKAL 365

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            +           N GTI+DSGT  T +   ++  L   F   M K         A AL+
Sbjct: 366 SISPMLFQ-----NAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPT------APALS 414

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV--TDREASG 432
            L  C+D+    + S P++  +F G A V L   N   +    S VCL      D +  G
Sbjct: 415 LLDTCYDLSNYTSISIPKISFNFNGNANVDLE-PNGILITNGASQVCLAFAGNGDDDTIG 473

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               I GN Q Q   V YD+   +LGF  + C
Sbjct: 474 ----IFGNIQQQTLEVVYDVAGGQLGFGYKGC 501


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 163/392 (41%), Gaps = 57/392 (14%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           +S   G Y   +  G P +    +LDTGS + W  C     C  C     P F P+ SSS
Sbjct: 148 TSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ---PCTDCYQQTDPIFDPRSSSS 204

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              L C++ +C  +  E+  CR         +  C      Y V YG G  T G  + ET
Sbjct: 205 FASLPCESQQCQAL--ETSGCR---------ASKCL-----YQVSYGDGSFTVGEFVIET 248

Query: 201 LNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSH 252
           L   N  +I N  VGC         G+        G G G  SL SQ+    FSYCL+  
Sbjct: 249 LTFGNSGMINNVAVGCG----HDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCLV-- 302

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
             D  + +SS +  N ++ SD           VN P + +      +YYVGL  ++VGGQ
Sbjct: 303 --DRDSSSSSDLEFNSAAPSDS----------VNAP-LLKSGKVDTFYYVGLTGMSVGGQ 349

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + +      +D  G GG IVDSGT  T +  + +  L D FVS+      Y +     A
Sbjct: 350 LLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRT----PYLKKTNGFA 405

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
           L     C+D+  +   + P +   F GG  + LP +NY   V      C        +  
Sbjct: 406 L--FDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLS 463

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               I+GN Q Q   V YDL N  +GF    C
Sbjct: 464 ----IIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 120/393 (30%), Positives = 171/393 (43%), Gaps = 65/393 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK----YCSSSKIPSFIPKLSSSS 142
           G Y + +  G+P +    I+DTGS L W       QCK    YC     P F P  S + 
Sbjct: 11  GNYYVKVGLGSPARYYSMIVDTGSSLSWL------QCKPCVVYCHVQADPLFDPSASKTY 64

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL 201
           + L C + +CS +   ++     N+    TS N   +C  Y   YG S  + G    + L
Sbjct: 65  KSLSCTSSQCSSLVDATL-----NNPLCETSSN---VC-VYTASYGDSSYSMGYLSQDLL 115

Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
            L P++ +P F+ GC   S     + AGI G GR K S+  Q++      FSYCL     
Sbjct: 116 TLAPSQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL----- 170

Query: 255 DDTTRTSSLILDNGSSH---SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
              TR     L  G +    S  K T +T  P   NPS+         Y++ L  ITVGG
Sbjct: 171 --PTRGGGGFLSIGKASLAGSAYKFTPMTTDP--GNPSL---------YFLRLTAITVGG 217

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           + + V      +       TI+DSGT  T +   ++ P    FV  M  +  Y RA G  
Sbjct: 218 RALGVAAAQYRVP------TIIDSGTVITRLPMSVYTPFQQAFVKIM--SSKYARAPG-- 267

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
             + L  CF    +   S PE++L F+GGA++ L   N    V EG   CL    +   +
Sbjct: 268 -FSILDTCFKGNLKDMQSVPEVRLIFQGGADLNLRPVNVLLQVDEG-LTCLAFAGNNGVA 325

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                I+GN Q Q + V +D+   R+GF    C
Sbjct: 326 -----IIGNHQQQTFKVAHDISTARIGFATGGC 353


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 113/398 (28%), Positives = 164/398 (41%), Gaps = 56/398 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   ++ GTP       +DTGS + W  C     C+ C     P F P+ S+S R +G
Sbjct: 132 GEYMAKIAVGTPAVEALLAMDTGSDITWLQCQ---PCRRCYPQSGPVFDPRHSTSYREMG 188

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS--GLTEGIALSETLNLP 204
              P C  +                   +  ++   Y V YG     T G  + ETL   
Sbjct: 189 YDAPDCQALGRSG-------------GGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFA 235

Query: 205 NRI-IPNFLVGCS----VLSSRQPAGIAGFGRGKTSLPSQL-----NLDKFSYCLLSHKF 254
             + +P+  +GC      L +   AGI G GRG+ S PSQ+     N+  FSYCL     
Sbjct: 236 GGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFL 295

Query: 255 DDTTRT--SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
               R+  S+L + +G++      +   +TP V N ++A     + YY   +     G +
Sbjct: 296 SSPGRSVSSTLTIGDGAAAGSPPPS---FTPTVQNLNMA-----TFYYVRLVGVSVGGVR 347

Query: 313 RVRVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
              V    L LD   G GG I+DSGT  T +A          +++     R     LG  
Sbjct: 348 VPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARR-------AYIAFRDAFRAAAVDLGQV 400

Query: 372 ALTGLRPCFDV---PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV--T 426
           ++ G    FD     G +    P + +HF GG E+TLP +NY   V     VC       
Sbjct: 401 SIGGPSGFFDTCYTMGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTG 460

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           DR  S     I+GN Q Q + V Y++   R+GF    C
Sbjct: 461 DRSVS-----IIGNIQQQGFRVVYNIGGGRVGFAPNSC 493


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 117/392 (29%), Positives = 164/392 (41%), Gaps = 57/392 (14%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           +S   G Y   +  G P +    +LDTGS + W  C     C  C     P F P+ SSS
Sbjct: 148 TSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ---PCTDCYQQTDPIFDPRSSSS 204

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              L C++ +C  +  E+  CR         +  C      Y V YG G  T G  ++ET
Sbjct: 205 FASLPCESQQCQAL--ETSGCR---------ASKCL-----YQVSYGDGSFTVGEFVTET 248

Query: 201 LNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSH 252
           L   N  +I +  VGC         G+        G G G  SL SQ+    FSYCL+  
Sbjct: 249 LTFGNSGMINDVAVGCG----HDNEGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCLV-- 302

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
             D  + +SS +  N ++ SD           VN P + +      +YYVGL  ++VGGQ
Sbjct: 303 --DRDSSSSSDLEFNSAAPSDS----------VNAP-LLKSGKVDTFYYVGLTGMSVGGQ 349

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + +      +D  G GG IVDSGT  T +  + +  L D FVS+      Y +     A
Sbjct: 350 LLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRT----PYLKKTNGFA 405

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
           L     C+D+  +   + P +   F GG  + LP +NY   V      C        +  
Sbjct: 406 L--FDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLS 463

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               I+GN Q Q   V YDL N  +GF    C
Sbjct: 464 ----IIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 165/371 (44%), Gaps = 39/371 (10%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           I+DT S L W  C     C+ C   + P F P  S S   + C +  C  +   +     
Sbjct: 167 IVDTASELTWVQCA---PCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLAT--GGT 221

Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
                    ++ +    SY + Y  G  + G+   + L+L   +I  F+ GC   +   P
Sbjct: 222 SGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEVIDGFVFGCGTSNQGPP 281

Query: 224 ----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
               +G+ G GR + SL SQ  +D+F    SYCL      ++  + SL++ + SS   + 
Sbjct: 282 FGGTSGLMGLGRSQLSLVSQ-TMDQFGGVFSYCL---PLKESDSSGSLVIGDDSSVY-RN 336

Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
           +T + Y   V++P          +Y+V L  ITVGGQ V                 I+DS
Sbjct: 337 STPIVYASMVSDPLQGP------FYFVNLTGITVGGQEVESSGFSSGGGGGK---AIIDS 387

Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKL 395
           GT  T + P ++  +  EF+SQ  +   Y +A G    + L  CF++ G +    P LKL
Sbjct: 388 GTVITSLVPSIYNAVKAEFLSQFAE---YPQAPG---FSILDTCFNMTGLREVQVPSLKL 441

Query: 396 HFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
            F GG EV +      YF V  + S VCL +   +      + I+GN+Q +N  V +D  
Sbjct: 442 VFDGGVEVEVDSGGVLYF-VSSDSSQVCLAMAPLKSEY--ETNIIGNYQQKNLRVIFDTS 498

Query: 454 NQRLGFKQQLC 464
             ++GF Q+ C
Sbjct: 499 GSQVGFAQETC 509


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 132/442 (29%), Positives = 184/442 (41%), Gaps = 69/442 (15%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSH---SYGGYSISLSF 95
           +PS+   + L      S++R    +          T  T+  I S    S G Y ++L  
Sbjct: 48  DPSKTQAERLTDAFRRSVSRVGRFR---------PTAMTSDGIQSRIVPSAGEYLMNLYI 98

Query: 96  GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWI 155
           GTPP  +  I+DTGS L W  C     C +C    +P F PK SS+ R   C    C  +
Sbjct: 99  GTPPVPVIAIVDTGSDLTWTQCR---PCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLAL 155

Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-----IP 209
             +    R C+ E     K CT     +   Y  G  T G   SETL + +        P
Sbjct: 156 GKD----RSCSKE-----KKCT-----FRYSYADGSFTGGNLASETLTVDSTAGKPVSFP 201

Query: 210 NFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
            F  GC   S     +  +GI G G G+ SL SQL       FSYCLL    D +   SS
Sbjct: 202 GFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSS--ISS 259

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
            I  N  +       G   TP V       + +   +YY+ L  I+VG +R+  +  Y  
Sbjct: 260 RI--NFGASGRVSGYGTVSTPLV-------QKSPDTFYYLTLEGISVGKKRLP-YKGYSK 309

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
                 G  IVDSGTT+TF+  E +  L ++ V+  +K +      G  +L     C++ 
Sbjct: 310 KTEVEEGNIIVDSGTTYTFLPQEFYSKL-EKSVANSIKGKRVRDPNGIFSL-----CYNT 363

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
             E     P +  HFK  A V L   N F  + E   VC TV    +       +LGN  
Sbjct: 364 TAEINA--PIITAHFK-DANVELQPLNTFMRMQE-DLVCFTVAPTSDIG-----VLGNLA 414

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
             N+ V +DLR +R+ FK   C
Sbjct: 415 QVNFLVGFDLRKKRVSFKAADC 436


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 113/396 (28%), Positives = 171/396 (43%), Gaps = 63/396 (15%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y  + + GTPPQ +  ++D    LVW  CT    C+ C    +P F P  SS+ R 
Sbjct: 53  SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCT---PCQPCFEQDLPLFDPTKSSTFRG 109

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           L C +  C  I                +S+NCT     Y     +G T G A ++T  + 
Sbjct: 110 LPCGSHLCESIPE--------------SSRNCTSDVCIYEAPTKAGDTGGKAGTDTFAI- 154

Query: 205 NRIIPNFLVGCSVLSSRQ------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
                    GC V++ ++      P+GI G GR   SL +Q+N+  FSYCL         
Sbjct: 155 GAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGK------ 208

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV-AERNAFSVYYYVGLRRITVGGQRVRVW 317
             SS  L  G++         + TPFV   S  +  N  + YY V L  I  GG  ++  
Sbjct: 209 --SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAA 266

Query: 318 HKYLTLDRDGNGGTI-VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                     +G T+ +D+ +  +++A   ++ L           +  T A+G + +   
Sbjct: 267 SS--------SGSTVLLDTVSRASYLADGAYKAL----------KKALTAAVGVQPVASP 308

Query: 377 RPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR------ 428
              +D+  P    G  PEL   F GGA +T+P  NY    G G+ VCLT+ +        
Sbjct: 309 PKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGT-VCLTIGSSASLNLTG 367

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E  G  + ILG+ Q +N +V +DL+ + L FK   C
Sbjct: 368 ELEG--ASILGSLQQENVHVLFDLKEETLSFKPADC 401


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 116/385 (30%), Positives = 164/385 (42%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 184 GNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKL--FDPARSSTDANIS 241

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS ++                +K C+     Y V YG G  + G    +TL L +
Sbjct: 242 CAAPACSDLY----------------TKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSS 285

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
              I  F  GC   +     + AG+ G GRGKTSLP Q   DK+   + +H F   + + 
Sbjct: 286 YDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQA-YDKYG-GVFAHCFPARS-SG 342

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +  LD G   S   +T LT    V+N           +YYVGL  I VGG+ + +     
Sbjct: 343 TGYLDFGPGSSPAVSTKLTTPMLVDN--------GLTFYYVGLTGIRVGGKLLSIPPSVF 394

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
           T       GTIVDSGT  T + P  +  L   F S  +  R Y +   A AL+ L  C+D
Sbjct: 395 T-----TAGTIVDSGTVITRLPPAAYSSLRSAFAS-AIAARGYKK---APALSLLDTCYD 445

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILG 439
             G    + P + L F+GGA + +      Y A V   S  CL    + E       I+G
Sbjct: 446 FTGMSQVAIPTVSLLFQGGASLDVDASGIIYAASV---SQACLGFAANEEDD--DVGIVG 500

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q++ + V YD+  + +GF    C
Sbjct: 501 NTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 118/399 (29%), Positives = 164/399 (41%), Gaps = 75/399 (18%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y +  S GTPPQ +  + DTGS L+W  C        C     PS++P  SS+   
Sbjct: 87  SGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTS-CEPQGSPSYLPNASSTFAK 145

Query: 145 LGCQNPKCSWIHHESIQ-CRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALS 198
           L C +  CS +  +S+  C        A    C      Y   YG G      T+G    
Sbjct: 146 LPCSDRLCSLLRSDSVAWCA-------AAGAEC-----DYRYSYGLGDDDHHYTQGFLAR 193

Query: 199 ETLNLPNRIIPNFLVGCSV---LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
           ET  L    +P+   GC+          +G+ G GRG  SL SQLN   F YCL S    
Sbjct: 194 ETFTLGADAVPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTS---- 249

Query: 256 DTTRTSSLILDNGSS--HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
           D ++ S L+  + +S   +  ++TGL               A + +Y V LR I++G   
Sbjct: 250 DASKASPLLFGSLASLTGAQVQSTGLL--------------ASTTFYAVNLRSISIGSAT 295

Query: 314 VRVWHKYLTLDRDGNG---GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
                        G G   G + DSGTT T++A   +      F+SQ   ++        
Sbjct: 296 T-----------PGVGEPEGVVFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQ-------V 337

Query: 371 EALTGLRPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
           E   G   CF  P        + P + LHF  GA++ LPV NY   V +G  VC  V   
Sbjct: 338 EDTDGFEACFQKPANGRLSNAAVPTMVLHFD-GADMALPVANYVVEVEDG-VVCWIV--- 392

Query: 428 REASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                 PS+ I+GN    NY V +D+    L F+   C 
Sbjct: 393 ---QRSPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANCD 428


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 108/391 (27%), Positives = 163/391 (41%), Gaps = 56/391 (14%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           +S   G Y + +  G PP     +LDTGS + W  C     C  C     P F P  S+S
Sbjct: 142 TSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCA---PCSECYQQSDPIFDPVSSNS 198

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              + C  P+C     +S+   +C        +N T +   Y V YG G  T G   +ET
Sbjct: 199 YSPIRCDAPQC-----KSLDLSEC--------RNGTCL---YEVSYGDGSYTVGEFATET 242

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
           + L    + N  +GC         G+        G G GK S P+Q+N   FSYCL++  
Sbjct: 243 VTLGTAAVENVAIGCG----HNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNR- 297

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
             D+   S+L  ++           +   P   NP +        +YY+GL+ I+VGG+ 
Sbjct: 298 --DSDAVSTLEFNS------PLPRNVVTAPLRRNPEL------DTFYYLGLKGISVGGEA 343

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + +      +D  G GG I+DSGT  T +  E+++ L D FV      +       A  +
Sbjct: 344 LPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFV------KGAKGIPKANGV 397

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
           +    C+D+   ++   P +  HF  G E+ LP  NY   V      C        +   
Sbjct: 398 SLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLS- 456

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              I+GN Q Q   V +D+ N  +GF    C
Sbjct: 457 ---IMGNVQQQGTRVGFDIANSLVGFSADSC 484


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 165/387 (42%), Gaps = 58/387 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTPPQ +   +DT +   W PCT    C  C+S+    F P+ S++ + + C 
Sbjct: 97  YIVRAKIGTPPQTLLLAIDTSNDAAWIPCT---ACDGCTSTL---FAPEKSTTFKNVSCG 150

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P+C+ +   S     C                ++ + YGS       + +T+ L    I
Sbjct: 151 SPECNKVPSPSCGTSAC----------------TFNLTYGSSSIAANVVQDTVTLATDPI 194

Query: 209 PNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
           P +  GC   +   S  P G+ G GRG  SL SQ   L    FSYCL S K   F  + R
Sbjct: 195 PGYTFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLR 254

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
              +           +   + YTP + NP    R++    YYV L  I VG + V +   
Sbjct: 255 LGPV----------AQPIRIKYTPLLKNP---RRSSL---YYVNLFAIRVGRKIVDIPPA 298

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ--MVKNRNYTRALGAEALTGLR 377
            L  +     GT+ DSGT FT +   ++  + DEF  +  M    N T      +L G  
Sbjct: 299 ALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLT----VTSLGGFD 354

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            C+ VP       P +   F G   VTLP +N       GS  CL + +  +       +
Sbjct: 355 TCYTVPIVA----PTITFMFSG-MNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNV 409

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + N Q QN+ V YD+ N RLG  ++LC
Sbjct: 410 IANMQQQNHRVLYDVPNSRLGVARELC 436


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 128/441 (29%), Positives = 202/441 (45%), Gaps = 65/441 (14%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           NP++ S Q L + +  S++R  H  +   K  +        +++S+S G Y +++S GTP
Sbjct: 47  NPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQI--DLTSNS-GEYLMNISLGTP 103

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
           P  I  I DTGS L+W  C     C  C +   P F PK SS+ + + C + +C+ + ++
Sbjct: 104 PFPIMAIADTGSDLLWTQCK---PCDDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQ 160

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRI-----IPNFL 212
           +     C+ E       C     SY   YG    T+G    +TL L +       + N +
Sbjct: 161 A----SCSTE----DNTC-----SYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207

Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQL--NLD-KFSYCLLSHKFDDTTRTSSLIL 265
           +GC   ++    ++ +GI G G G  SL +QL  ++D KFSYCL+    ++  RTS +  
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSEN-DRTSKI-- 264

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
            N  +++    TG+  TP +         +   +YY+ L+ I+VG + V+    Y   D 
Sbjct: 265 -NFGTNAVVSGTGVVSTPLI-------AKSQETFYYLTLKSISVGSKEVQ----YPGSDS 312

Query: 326 -DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
             G G  I+DSGTT T +  E +  L D   S +   +        +  TGL  C+   G
Sbjct: 313 GSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKK------QDPQTGLSLCYSATG 366

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQM 443
           +     P + +HF  GA+V L   N F  + E   VC          G PS  I GN   
Sbjct: 367 DL--KVPAITMHFD-GADVNLKPSNCFVQISE-DLVCFAF------RGSPSFSIYGNVAQ 416

Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
            N+ V YD  ++ + FK   C
Sbjct: 417 MNFLVGYDTVSKTVSFKPTDC 437


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 121/417 (29%), Positives = 184/417 (44%), Gaps = 61/417 (14%)

Query: 74  TTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS 133
           T ++T  +S +     ++SL+ GTPPQ +  +LDTGS L W  C                
Sbjct: 55  TPSSTRKVSFYHNVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKKQQNINSV------- 107

Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE 193
           F P LSSS   + C +P C     + +    C+      S N   +  SY         E
Sbjct: 108 FNPHLSSSYTPIPCMSPICKTRTRDFLIPVSCD------SNNLCHVTVSYADFTS---LE 158

Query: 194 GIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFS 246
           G   S+T  +     P  + G   S  SS      +  G+ G  RG  S  +Q+   KFS
Sbjct: 159 GNLASDTFAISGSGQPGIIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPKFS 218

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV--NNP-SVAERNAFSVYYYVG 303
           YC+       + + +S +L  G + + K    L YTP V  N P    +R    V Y V 
Sbjct: 219 YCI-------SGKDASGVLLFGDA-TFKWLGPLKYTPLVKMNTPLPYFDR----VAYTVR 266

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------ 357
           L  I VG + ++V  +    D  G G T+VDSGT FTF+   ++  L +EFV+Q      
Sbjct: 267 LMGIRVGSKPLQVPKEIFAPDHTGAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLT 326

Query: 358 MVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV-- 414
           ++++ N+    GA  L     CF V  G    + P + + F+ GAE+++  E     V  
Sbjct: 327 LLEDPNFVFE-GAMDL-----CFRVRRGGVVPAVPAVTMVFE-GAEMSVSGERLLYRVGG 379

Query: 415 ------GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                 G G   CLT   + +  G  + ++G+   QN ++E+DL N R+GF    C+
Sbjct: 380 DGDVAKGNGDVYCLT-FGNSDLLGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTKCE 435


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 113/396 (28%), Positives = 171/396 (43%), Gaps = 63/396 (15%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y  + + GTPPQ +  ++D    LVW  CT    C+ C    +P F P  SS+ R 
Sbjct: 53  SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCT---PCQPCFEQDLPLFDPTKSSTFRG 109

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           L C +  C  I                +S+NCT     Y     +G T G+A ++T  + 
Sbjct: 110 LPCGSHLCESIPE--------------SSRNCTSDVCIYEAPTKAGDTGGMAGTDTFAI- 154

Query: 205 NRIIPNFLVGCSVLSSRQ------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
                    GC V++ ++      P+GI G GR   SL +Q+N+  FSYCL         
Sbjct: 155 GAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGK------ 208

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER-NAFSVYYYVGLRRITVGGQRVRVW 317
             SS  L  G++         + TPFV   S     N  + YY V L  I  GG  ++  
Sbjct: 209 --SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAA 266

Query: 318 HKYLTLDRDGNGGTI-VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                     +G T+ +D+ +  +++A   ++ L           +  T A+G + +   
Sbjct: 267 SS--------SGSTVLLDTVSRASYLADGAYKAL----------KKALTAAVGVQPVASP 308

Query: 377 RPCFDVPGEK--TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR------ 428
              +D+   K   G  PEL   F GGA +T+P  NY    G G+ VCLT+ +        
Sbjct: 309 PKPYDLCFSKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGT-VCLTIGSSASLNLTG 367

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E  G  + ILG+ Q +N +V +DL+ + L FK   C
Sbjct: 368 ELEG--ASILGSLQQENVHVLFDLKEETLSFKPADC 401


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 119/433 (27%), Positives = 190/433 (43%), Gaps = 61/433 (14%)

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL--------- 106
           + RAL + N + ++        T++ +  S     I L+ G   + + +I+         
Sbjct: 90  MRRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGGKNM 149

Query: 107 ----DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
               DTGS L W  C     C+ C + + P + P +SSS + + C +  C     + +  
Sbjct: 150 SLIVDTGSDLTWVQCQ---PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTC-----QDLVA 201

Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR 221
              N  P        +    Y+V YG G  T G   SE++ L +  + N + GC   +  
Sbjct: 202 ATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTKLENLVFGCGRNNKG 261

Query: 222 ---QPAGIAGFGRGKTSLPSQ----LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD- 273
                +G+ G GR   SL SQ     N   FSYCL S + D  + T S     G+  S  
Sbjct: 262 LFGGASGLMGLGRSSVSLVSQTLKTFN-GVFSYCLPSLE-DGASGTLSF----GNDFSVY 315

Query: 274 KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIV 333
           K +T + YTP V NP +        +Y + L   ++GG    V  K L+  R    G ++
Sbjct: 316 KNSTSVFYTPLVQNPQLRS------FYILNLTGASIGG----VELKTLSFGR----GILI 361

Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
           DSGT  T + P +++ +  EF+ Q      ++    A   + L  CF++   +  S P +
Sbjct: 362 DSGTVITRLPPSIYKAVKTEFLKQ------FSGFPSAPGYSILDTCFNLTSYEDISIPTI 415

Query: 394 KLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYVEYD 451
           K+ F+G AE+ + V   F  V  + S VCL + +   E   G   I+GN+Q +N  V YD
Sbjct: 416 KMIFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRVIYD 472

Query: 452 LRNQRLGFKQQLC 464
              +RLG   + C
Sbjct: 473 TTQERLGIAGENC 485


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 118/391 (30%), Positives = 166/391 (42%), Gaps = 62/391 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 177 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKL--FDPARSSTYANVS 234

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P C       +  R C      +  +C      Y V YG G  + G    +TL L +
Sbjct: 235 CAAPACF-----DLDTRGC------SGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 278

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
              +  F  GC   +     + AG+ G GRGKTSLP Q   DK    F++CL +      
Sbjct: 279 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 333

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             + +  LD G          LT TP +  N P+         +YYVG+  I VGGQ + 
Sbjct: 334 --SGTGYLDFGPGSPAAAGARLT-TPMLTDNGPT---------FYYVGMTGIRVGGQLLS 381

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +             GTIVDSGT  T + P  +  L   FVS M   R Y +   A A++ 
Sbjct: 382 IPQSVFA-----TAGTIVDSGTVITRLPPPAYSSLRSAFVSAMAA-RGYKK---APAVSL 432

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGG 433
           L  C+D  G    + P + L F+GGA + +      Y A V   S VCL    + +  GG
Sbjct: 433 LDTCYDFTGMSQVAIPTVSLLFQGGAILDVDASGIMYAASV---SQVCLGFAANED--GG 487

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              I+GN Q++ + V YD+  + +GF    C
Sbjct: 488 DVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 114/416 (27%), Positives = 167/416 (40%), Gaps = 55/416 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTN-------HYQCKYCSSSKIPSFIPKLSSS 141
           Y  S   G PPQ    ++DTGS LVW  C+              C    +P +   LS +
Sbjct: 78  YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137

Query: 142 SRLLGCQNPKCSW--IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
           +R + C +   +   +  E+  C         +  +   +  S    YG+G+  G+  ++
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGG----GSGDDACVVAAS----YGAGVALGVLGTD 189

Query: 200 TLNLPNRIIPNFLVGCSVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHK 253
               P+        GC   +   P      +GI G GRG  SL SQLN  +FSYCL  + 
Sbjct: 190 AFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPY- 248

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTG--------LTYTPFVNNPSVAERNAFSVYYYVGLR 305
           F DT   S L + +G         G        +T  PF  NP   + + FS +YY+ L 
Sbjct: 249 FRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNP---KDSPFSTFYYLPLV 305

Query: 306 RITVGGQRVRVWHKYLTLDRDG----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
            +  G   V +      L         GG ++DSG+ FT +       L  E   Q+  +
Sbjct: 306 GLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGS 365

Query: 362 RNYT---RALGAEALTGLRPCFDVPGEKTGSFPELKLHFK----GGAEVTLPVENYFAVV 414
            +       LG      +    D       + P L L F     GG E+ +P E Y+A V
Sbjct: 366 GSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARV 425

Query: 415 GEGSAVCLTVVTDREASGGPSI------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            E S  C+ VV+   ASG  ++      I+GNF  Q+  V YDL N  L F+   C
Sbjct: 426 -EASTWCMAVVS--SASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 159/385 (41%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ++D+GS +VW  C     C  C     P F P  S+S   + 
Sbjct: 41  GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCK---PCTQCYHQTDPLFDPADSASFMGVS 97

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN 205
           C +  C  + +       CN      S  C      Y V YG G  T+G    ETL L  
Sbjct: 98  CSSAVCDQVDNAG-----CN------SGRC-----RYEVSYGDGSSTKGTLALETLTLGR 141

Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
            ++ N  +GC  ++       AG+ G G G  S   QL+ ++   FSYCL+S        
Sbjct: 142 TVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVSR-----VT 196

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            S+  L+ GS   +    G  + P + NP          YYY+GL  + VG  +V +   
Sbjct: 197 NSNGFLEFGS---EAMPVGAAWIPLIRNPHSPS------YYYIGLSGLGVGDMKVPISED 247

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L   GNGG ++D+GT  T      +E   D F+ Q     N  RA G         C
Sbjct: 248 IFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQ---TGNLPRASGVSIFD---TC 301

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +++ G  +   P +  +F GG  +TLP  N+   V +    C               ILG
Sbjct: 302 YNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLS----ILG 357

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   +  D  N+ +GF   +C
Sbjct: 358 NIQQEGIQISVDGANEFVGFGPNVC 382


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 119/400 (29%), Positives = 170/400 (42%), Gaps = 46/400 (11%)

Query: 70  TTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS 129
           +T + T   T+ +S   G Y   +  G P Q   F+ DTGS + W  C        C   
Sbjct: 165 STNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 224

Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
             P F PK SSS   L C + +C  +           DE    + +C      Y V YG 
Sbjct: 225 IGPIFDPKSSSSYSPLSCDSEQCHLL-----------DEAACDANSCI-----YEVEYGD 268

Query: 190 G-LTEGIALSETLNLPN-RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK 244
           G  T G   +ET +  +   IPN  +GC   +        G+ G G G  SL SQL    
Sbjct: 269 GSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQLEATS 328

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
           FSYCL+     D    SS  LD    ++D+ +  LT +P V N      + F  + YV +
Sbjct: 329 FSYCLV-----DLDSESSSTLD---FNADQPSDSLT-SPLVKN------DRFPTFRYVKV 373

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
             ++VGG+ + +      +D  G+GG IVDSGTT T +  ++++ L D FV  + KN   
Sbjct: 374 IGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVG-LTKNLP- 431

Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
                A  ++    C+D+  +     P +     G   + LP +N    V      CL  
Sbjct: 432 ----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAF 487

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +     S  P  I+GN Q Q   V YDL N  +GF    C
Sbjct: 488 L----PSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 165/386 (42%), Gaps = 56/386 (14%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           GY++++  GTPPQ+   I DT S L W  C         +    P F P  SSS   + C
Sbjct: 90  GYTVTIGIGTPPQLHTLIADTASDLTWTQCNLFND---TAKQVEPLFDPAKSSSFAFVTC 146

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN-- 205
            +  C+             D P   +K C+     Y+  Y S    G+   E+  L +  
Sbjct: 147 SSKLCT------------EDNP--GTKRCSNKTCRYVYPYVSVEAAGVLAYESFTLSDNN 192

Query: 206 -RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             I  +F  GC  L+       +GI G      S+ SQL + KFSYCL  +      ++S
Sbjct: 193 QHICMSFGFGCGALTDGNLLGASGILGMSPAILSMVSQLAIPKFSYCLTPYT---DRKSS 249

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
            L     +     KTTG              + + + YYYV L  +++G +R+ V     
Sbjct: 250 PLFFGAWADLGRYKTTGPI------------QKSLTFYYYVPLVGLSLGTRRLDVPAATF 297

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            L +   GGT+VD G T   +A   F  L +  +  +         L    +   + CF 
Sbjct: 298 ALKQ---GGTVVDLGCTVGQLAEPAFTALKEAVLHTL------NLPLTNRTVKDYKVCFA 348

Query: 382 VP-GEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           +P G   G+   P L L+F GGA++ LP +NYF     G  +CL +V      GG   I+
Sbjct: 349 LPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYFQEPTAG-LMCLALV-----PGGGMSII 402

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q QN+++ +D+ + +  F   +C
Sbjct: 403 GNVQQQNFHLLFDVHDSKFLFAPTIC 428


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 121/396 (30%), Positives = 169/396 (42%), Gaps = 66/396 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y ++++ G+PP+ +  I DTGS LVW  C         +++    F P  SS+   + CQ
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL---- 203
              C     E++    C+D       NC     +YL  YG G  T G+  +ET       
Sbjct: 161 TDAC-----EALGRATCDD-----GSNC-----AYLYAYGDGSNTTGVLSTETFTFDDGG 205

Query: 204 ----PNRI-IPNFLVGCSVLSSRQ--PAGIAGFGRGKTSLPSQLNLD-----KFSYCLLS 251
               P ++ +     GCS  ++      G+ G G G  SL +QL        +FSYCL+ 
Sbjct: 206 SGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVP 265

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
           H  +    +S+L   N  + +D    G   TP V             YY V L  + VG 
Sbjct: 266 HSVN---ASSAL---NFGALADVTEPGAASTPLVAGD-------VDTYYTVVLDSVKVGN 312

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           +         T+    +   IVDSGTT TF+ P L  P+ DE       +R  T      
Sbjct: 313 K---------TVASAASSRIIVDSGTTLTFLDPSLLGPIVDEL------SRRITLPPVQS 357

Query: 372 ALTGLRPCFDVPG---EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
               L+ C++V G   E   S P+L L F GGA V L  EN F  V EG+ +CL +V   
Sbjct: 358 PDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGT-LCLAIVATT 416

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E    P  ILGN   QN +V YDL    + F    C
Sbjct: 417 EQQ--PVSILGNLAQQNIHVGYDLDAGTVTFAGADC 450


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 166/392 (42%), Gaps = 66/392 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           + +++ FGTP Q    I DTGS + W    PC+ H     C     P F P  S++  ++
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGH-----CYKQHDPIFDPTKSATYSVV 189

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS-ETLNLP 204
            C +P+C+                 A    C+     Y V YG G +    LS ETL+L 
Sbjct: 190 PCGHPQCA----------------AADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLT 233

Query: 205 N-RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
           + R +P F  GC   ++       G+ G GRG+ SL SQ        FSYCL S   D+T
Sbjct: 234 STRALPGFAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPS---DNT 290

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           T     I     + +D     + YT  V      ++  +  +Y+V L  I +GG  + V 
Sbjct: 291 THGYLTIGPTTPASNDD----VQYTAMV------QKQDYPSFYFVELVSIDIGGYILPVP 340

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
               T D     GT +DSGT  T++ PE +  L D F   M + +       A A     
Sbjct: 341 PTLFTDD-----GTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKP------APAYDPFD 389

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAV-CLTVVTDREASG 432
            C+D  G+     P +   F  G+   L   ++F ++        A+ CL  V     S 
Sbjct: 390 TCYDFTGQSAIFIPAVSFKFSDGSVFDL---SFFGILIFPDDTAPAIGCLGFVA--RPSA 444

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            P  I+GN Q +N  V YD+  +++GF    C
Sbjct: 445 MPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 114/389 (29%), Positives = 167/389 (42%), Gaps = 45/389 (11%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +SL  GTPPQ    +LDTGS L W  C +    K     ++P  +PK  ++S      + 
Sbjct: 68  VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKK-----RLPP-LPKPKTTSFDPSLSSS 121

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-I 208
                 +  I      D  L TS +  ++C  Y   Y  G L EG  + E       +  
Sbjct: 122 FSLLPCNHPICKPRIPDFTLPTSCDQNRLC-HYSYFYADGTLAEGNLVREKFTFSKSLST 180

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
           P  ++GC+  S+    GI G  RG+ S  SQ  + KFSYC+ S    + T    L  DN 
Sbjct: 181 PPVILGCAQASTEN-RGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYL-GDNP 238

Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
           +S   K  T LT+    ++P     N   + Y + ++ I + G+R+ V       D  G+
Sbjct: 239 NSSKFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGGS 293

Query: 329 GGTIVDSGTTFTFMAPELFEPLADE---FVSQMVKNRNYTRALGAEALTGLRPCFD--VP 383
           G T++DSG+  T++  E +E + +E    V  M+K + Y  A  A+       CFD  V 
Sbjct: 294 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK-KGYVYADVADM------CFDAGVT 346

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--------VTDREASGGPS 435
            E       +   F  G E+          VG G  V   V        +   E  G  S
Sbjct: 347 AEVGRRIGGISFEFDNGVEI---------FVGRGEGVLTEVEKGVKCVGIGRSERLGIGS 397

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+G    QN +VEYDL N+R+GF    C
Sbjct: 398 NIIGTVHQQNMWVEYDLANKRVGFGGAEC 426


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 109/389 (28%), Positives = 160/389 (41%), Gaps = 58/389 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP + +  + DTGS + W  C+    C+ C   + P F P LSSS + L 
Sbjct: 79  GDYFARIGVGTPARSVYMVADTGSDVSWLQCS---PCRKCYRQQDPIFNPSLSSSFKPLA 135

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C       ++ + C+ +       C      Y V YG G  T G   +ETL+   
Sbjct: 136 CASSICG-----KLKIKGCSRK-----NECM-----YQVSYGDGSFTVGDFSTETLSFGE 180

Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
             + +  +GC     R   G+        G GRG  S PSQ        FSYCL      
Sbjct: 181 HAVRSVAMGCG----RNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRR--- 233

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           ++   +SL+   G S   +K     +T  + N           YYYVGL RI V G  V 
Sbjct: 234 ESAIAASLVF--GPSAVPEKAR---FTKLLPN------RRLDTYYYVGLARIRVAGSPVN 282

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      +   G GG IVDSGT  + +    +  L D F       R+      A  ++ 
Sbjct: 283 IPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-------RSLVTFPSAPGISL 335

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
              C+D+   KT + P + L F GGA + LP +     V +    CL    + EA     
Sbjct: 336 FDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFS--- 392

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q Q + +  D + +++G     C
Sbjct: 393 -IIGNVQQQTFRISIDNQKEQMGIAPDQC 420


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 132/467 (28%), Positives = 194/467 (41%), Gaps = 61/467 (13%)

Query: 14  FFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTT 73
           F F LL +  +S     FS+   H +     + + +   +  LT A H ++         
Sbjct: 17  FLFHLLEVGLAS--GGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFH-RSASRVGRFRQ 73

Query: 74  TTTTTTNISSH---SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK 130
           +  T+  I S    S G Y ++LS GTPP  +  I+DTGS L W  C     C +C    
Sbjct: 74  SAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCR---PCTHCYKQV 130

Query: 131 IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG 190
           +P F PK SS+ R   C    C  +          ND      K CT     ++  Y  G
Sbjct: 131 VPFFDPKNSSTYRDSSCGTSFCLALG---------NDRSCRNGKKCT-----FMYSYADG 176

Query: 191 -LTEGIALSETLNLPNRI-----IPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQL 240
             T G    ETL + +        P F  GC   S        +GI G G  + S+ SQL
Sbjct: 177 SFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQL 236

Query: 241 NL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
                 +FSYCLL   F D++ +S +   N          G   TP V       +   +
Sbjct: 237 KSTINGRFSYCLLP-VFTDSSMSSRI---NFGRSGIVSGAGTVSTPLV------MKGPDT 286

Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
            YY + L   +VG +R+  +  +        G  IVDSGTT+T++  E +  L +E V+ 
Sbjct: 287 YYYLITLEGFSVGKKRLS-YKGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKL-EESVAH 344

Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
            +K +      G  +L     C++   ++  + P +  HFK  A V L   N F  + E 
Sbjct: 345 SIKGKRVRDPNGISSL-----CYNTTVDQIDA-PIITAHFK-DANVELQPWNTFLRMQE- 396

Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             VC TV+   +       ILGN    N+ V +DLR +R+ FK   C
Sbjct: 397 DLVCFTVLPTSDIG-----ILGNLAQVNFLVGFDLRKKRVSFKAADC 438


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 109/389 (28%), Positives = 160/389 (41%), Gaps = 58/389 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP + +  + DTGS + W  C+    C+ C   + P F P LSSS + L 
Sbjct: 12  GDYFARIGVGTPARSVYMVADTGSDVSWLQCS---PCRKCYRQQDPIFNPSLSSSFKPLA 68

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C       ++ + C+ +       C      Y V YG G  T G   +ETL+   
Sbjct: 69  CASSICG-----KLKIKGCSRK-----NKCM-----YQVSYGDGSFTVGDFSTETLSFGE 113

Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
             + +  +GC     R   G+        G GRG  S PSQ        FSYCL      
Sbjct: 114 HAVRSVAMGCG----RNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRR--- 166

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           ++   +SL+   G S   +K     +T  + N           YYYVGL RI V G  V 
Sbjct: 167 ESAIAASLVF--GPSAVPEKAR---FTKLLPN------RRLDTYYYVGLARIRVAGSPVN 215

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      +   G GG IVDSGT  + +    +  L D F       R+      A  ++ 
Sbjct: 216 IPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-------RSLVTFPSAPGISL 268

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
              C+D+   KT + P + L F GGA + LP +     V +    CL    + EA     
Sbjct: 269 FDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFS--- 325

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q Q + +  D + +++G     C
Sbjct: 326 -IIGNVQQQTFRISIDNQKEQMGIAPDQC 353


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 120/400 (30%), Positives = 171/400 (42%), Gaps = 46/400 (11%)

Query: 70  TTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS 129
           +T + T   T+ +S   G Y   +  G P Q   F+ DTGS + W  C        C   
Sbjct: 165 STNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 224

Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
             P F PK SSS   L C + +C  +           DE    + +C      Y V YG 
Sbjct: 225 IGPIFDPKSSSSYSPLSCDSEQCHLL-----------DEAACDANSCI-----YEVEYGD 268

Query: 190 G-LTEGIALSETLNLPN-RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK 244
           G  T G   +ET +  +   IPN  +GC   +       AG+ G G G  SL SQL    
Sbjct: 269 GSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATS 328

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
           FSYCL+     D    SS  LD    ++D+ +  LT +P V N      + F  + YV +
Sbjct: 329 FSYCLV-----DLDSESSSTLD---FNADQPSDSLT-SPLVKN------DRFPTFRYVKV 373

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
             ++VGG+ + +      +D  G+GG IVDSGTT T +  ++++ L D FV  + KN   
Sbjct: 374 IGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVG-LTKNLP- 431

Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
                A  ++    C+D+  +     P +     G   + LP +N    V      CL  
Sbjct: 432 ----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF 487

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +     S  P  I+GN Q Q   V YDL N  +GF    C
Sbjct: 488 L----PSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
          Length = 392

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 175/387 (45%), Gaps = 60/387 (15%)

Query: 98  PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
           P   I  ++DTGS++ W   T     K CS SK  S +P          C +PKC     
Sbjct: 42  PKDNISAVVDTGSNIFW---TTE---KECSRSKTRSMLP----------CCSPKCE--QR 83

Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETLNL---PNRIIP-- 209
            S  CR    E  A ++  T+   +Y + YG      T G+   + L +    ++ +P  
Sbjct: 84  ASCGCR--RSELKAEAEKETKC--TYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGS 139

Query: 210 ----NFLVGCSV---LSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
                  +GCS    L  + P+  G+ G GR  TSLP QLN  KFSYCL S++  D    
Sbjct: 140 QSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPDL--P 197

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           S L+L   ++  D  T  +     V   ++   + +   Y+V L+ I++GG R+      
Sbjct: 198 SYLLL---TAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPA---- 250

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
             +     G   VD+GT+FT +   +F  L  E + +++K R Y +          + C+
Sbjct: 251 --VSTKSGGNMFVDTGTSFTRLEGTVFAKLVTE-LDRIMKERKYVKE--QPGRNNGQICY 305

Query: 381 DVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
             P    +++   P++ LHF   A + LP ++Y       S +CL +  D+    G   +
Sbjct: 306 SPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKT--TSKLCLAI--DKSNIKGGISV 361

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           LGNFQMQN ++  D  N++L F +  C
Sbjct: 362 LGNFQMQNTHMLLDTGNEKLSFVRADC 388


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 173/389 (44%), Gaps = 46/389 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK-YCSSSKIPSFIPKLSSSSRLL 145
           G Y + +  GTP +    I+DTGS L W  C     C  YC     P F P +S + + L
Sbjct: 105 GNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQ---PCVIYCHVQVDPIFTPSVSKTYKAL 161

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
            C + +CS +   ++    C+        N T  C  Y   YG +  + G    + L L 
Sbjct: 162 SCSSSQCSSLKSSTLNAPGCS--------NATGAC-VYKASYGDTSFSIGYLSQDVLTLT 212

Query: 205 NRIIPN--FLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDD 256
               P+  F+ GC   +     + AGI G    K S+  QL+    + FSYCL S     
Sbjct: 213 PSAAPSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQ 272

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
              + S  L  G+S          +TP V NP +         Y++GL  ITV G+ + V
Sbjct: 273 PNSSVSGFLSIGASSLSSSP--YKFTPLVKNPKIPS------LYFLGLTTITVAGKPLGV 324

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                 +       TI+DSGT  T +   ++  L   FV  M+ ++ Y +A G    + L
Sbjct: 325 SASSYNVP------TIIDSGTVITRLPVAIYNALKKSFV--MIMSKKYAQAPG---FSIL 373

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF    ++  + PE+++ F+GGA + L V N    + +G+  CL +     AS  P  
Sbjct: 374 DTCFKGSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEIEKGT-TCLAIA----ASSNPIS 428

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           I+GN+Q Q + V YD+ N ++GF    C+
Sbjct: 429 IIGNYQQQTFTVAYDVANSKIGFAPGGCQ 457


>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
 gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
          Length = 389

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/404 (27%), Positives = 179/404 (44%), Gaps = 74/404 (18%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           + LS GTPPQ + F L   S   W  C++      C+++ +  F P LS+S   L C +P
Sbjct: 1   MDLSLGTPPQPLNFTLAVDSGFSWVACSSSCAIN-CTTASL--FQPGLSTSHTKLPCGSP 57

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN---- 205
            CS     S  C          S +C     SY   YG+  +  G  +S+   + +    
Sbjct: 58  SCSAFSAVSTSC--------GPSSSC-----SYNTSYGTNFSSAGDLVSDIATMDSVRNR 104

Query: 206 RIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSHKFDD 256
           ++  N  +GC      +L     +G  GF +G  S   QL+      KF YCL S  F  
Sbjct: 105 KVAANLSLGCGRDSGGLLELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDTF-- 162

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
                 L++ N    +   ++ + YTP + NP  AE       Y++ L  I++   + +V
Sbjct: 163 ---RGKLVIGNYKLRNASISSSMAYTPMITNPQAAE------LYFINLSTISIDKNKFQV 213

Query: 317 -WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN-RNYTRALG----- 369
               +L+   +G GGT++D+ T  ++        L  +F +Q+V+  +NYT  L      
Sbjct: 214 PIQGFLS---NGTGGTVIDTTTFLSY--------LTSDFYTQLVQAIKNYTTNLVEVSSS 262

Query: 370 -AEALTGLRPCFDVPGEKTGSFP---ELKLHFKGGAEVTLPVENYFAVVGEGSA---VCL 422
            A+AL G+  C+++       FP    L  HF GGA V   V  +F +    S    +C+
Sbjct: 263 VADAL-GVELCYNISANS--DFPPPATLTYHFLGGAGV--EVSTWFLLDDSDSVNNTICM 317

Query: 423 TVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            +   R  S GP++ ++G +Q  +  VEYDL   R GF  Q C 
Sbjct: 318 AI--GRSESVGPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGCN 359


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 162/382 (42%), Gaps = 48/382 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y   +  G P ++   + DTGS + W  C        C     P F PK SSS   L C 
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-R 206
           + +C  +        +CN      S  C      Y V YG G  T G   +ETL+  N  
Sbjct: 208 SQQCKLLDKA-----NCN------SDTCI-----YQVHYGDGSFTTGELATETLSFGNSN 251

Query: 207 IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
            IPN  +GC   +       AG+ G G G  SL SQL    FSYCL++   D    +SS 
Sbjct: 252 SIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSD----SSST 307

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
           +  N +  SD  T+     P V N      + F  Y YV +  I+VGG+ + +      +
Sbjct: 308 LEFNSNMPSDSLTS-----PLVKN------DRFHSYRYVKVVGISVGGKTLPISPTRFEI 356

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG-AEALTGLRPCFDV 382
           D  G GG IVDSGT  + +  +++E L + FV         T +L  A  ++    C++ 
Sbjct: 357 DESGLGGIIVDSGTIISRLPSDVYESLREAFV-------KLTSSLSPAPGISVFDTCYNF 409

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
            G+     P +      G  + LP  NY  ++      CL  +  + +      I+G+FQ
Sbjct: 410 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLS----IIGSFQ 465

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            Q   V YDL N  +GF    C
Sbjct: 466 QQGIRVSYDLTNSLVGFSTNKC 487


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 108/418 (25%), Positives = 165/418 (39%), Gaps = 50/418 (11%)

Query: 54  SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
           ++L R L  ++  +  +         +  +   G Y I +  G+PP+    ++D+GS +V
Sbjct: 107 ATLIRRLSPRDATSSYSVEEFGAEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIV 166

Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
           W  C     C  C     P F P  S+S   + C +  C  I +       C        
Sbjct: 167 WVQCQ---PCTQCYHQTDPVFDPADSASFMGVPCSSSVCERIENAGCHAGGCR------- 216

Query: 174 KNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ---PAGIAGF 229
                    Y V+YG G  T+G    ETL     ++ N  +GC   +       AG+ G 
Sbjct: 217 ---------YEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMFVGAAGLLGL 267

Query: 230 GRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
           G G  SL  QL       FSYCL+S   D      SL    G+        G  + P + 
Sbjct: 268 GGGSMSLVGQLGGQTGGAFSYCLVSRGTDSA---GSLEFGRGA-----MPVGAAWIPLIR 319

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
           NP          +YY+ L  + VGG +V +      L+  GNGG ++D+GT  T +    
Sbjct: 320 NPRAPS------FYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVA 373

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
           +    D F+ Q     N  RA G         C+++ G  +   P +  +F GG  +TLP
Sbjct: 374 YVAFRDAFIGQ---TGNLPRASGVSIFD---TCYNLNGFVSVRVPTVSFYFAGGPILTLP 427

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             N+   V +    C        AS     I+GN Q +   + +D  N  +GF   +C
Sbjct: 428 ARNFLIPVDDVGTFCFAFA----ASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 111/387 (28%), Positives = 162/387 (41%), Gaps = 55/387 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTP      + DTGS   W  C       YC   K P F P  S++   + 
Sbjct: 163 GNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCV--AYCYQQKEPLFTPTKSATYANIS 220

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  CS      +  R C      +  +C      Y V YG G  T G    +TL L  
Sbjct: 221 CTSSYCS-----DLDTRGC------SGGHCL-----YAVQYGDGSYTVGFYAQDTLTLGY 264

Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDTT 258
             + +F  GC   +     + AG+ G GRGKTS+P Q   DK    F+YC+ +      T
Sbjct: 265 DTVKDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQ-AYDKYSGVFAYCIPA------T 317

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            + +  LD G          LT     N P+         +YYVG+  I VGG  + +  
Sbjct: 318 SSGTGFLDFGPGAPAAANARLTPMLVDNGPT---------FYYVGMTGIKVGGHLLSIPA 368

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +     + G +VDSGT  T + P  +EPL   F   M +   Y     A A + L  
Sbjct: 369 TVFS-----DAGALVDSGTVITRLPPSAYEPLRSAFAKGM-EGLGYKT---APAFSILDT 419

Query: 379 CFDVPG-EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
           C+D+ G + + + P + L F+GGA + +        V + S  CL    + + +     I
Sbjct: 420 CYDLTGYQGSIALPAVSLVFQGGACLDVDASGIL-YVADVSQACLAFAANDDDTD--MTI 476

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +GN Q + Y V YDL  + +GF    C
Sbjct: 477 VGNTQQKTYSVLYDLGKKVVGFAPGAC 503


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 124/443 (27%), Positives = 190/443 (42%), Gaps = 74/443 (16%)

Query: 40  PSQDSYQNLNSLVSSSLTRALHI-KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           P+++ YQ+       S+ RA H  K+  T T  +T             GGY ++ S GTP
Sbjct: 45  PTENKYQHFVDAARRSINRANHFFKDSDTSTPESTVIP--------DRGGYLMTYSVGTP 96

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
           P  I  I DTGS +VW  C     C+ C +   P F P  SSS + + C +  C      
Sbjct: 97  PTKIYGIADTGSDIVWLQCE---PCEQCYNQTTPIFNPSKSSSYKNIPCSSKLC-----H 148

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNR-----IIPNFL 212
           S++   C+D+      N  Q    Y + YG S  ++G    +TL+L +        P  +
Sbjct: 149 SVRDTSCSDQ------NSCQ----YKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIV 198

Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
           +GC   ++       +GI G G G  SL +QL      KFSYCL+     ++  +S L  
Sbjct: 199 IGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSF 258

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
            + +  S     G+  TP +            V+Y++ L+  +VG +RV         D 
Sbjct: 259 GDAAVVSGD---GVVSTPLIKKD--------PVFYFLTLQAFSVGNKRVEFGGSSEGGDD 307

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVS----QMVKNRNYTRALGAEALTGLRPCFD 381
           +GN   I+DSGTT T +  +++  L    V       V + N   +L          C+ 
Sbjct: 308 EGN--IIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSL----------CYS 355

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
           +   +   FP + +HFK GA+V L   + F  + +G  VC       + S     I GN 
Sbjct: 356 LKSNEY-DFPIITVHFK-GADVELHSISTFVPITDG-IVCFAF----QPSPQLGSIFGNL 408

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
             QN  V YDL+ + + FK   C
Sbjct: 409 AQQNLLVGYDLQQKTVSFKPTDC 431


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 108/393 (27%), Positives = 164/393 (41%), Gaps = 60/393 (15%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           +S   G Y + +  G PP     +LDTGS + W  C     C  C     P F P  S+S
Sbjct: 142 TSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCA---PCSECYQQSDPIFDPISSNS 198

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              + C  P+C     +S+   +C        +N T +   Y V YG G  T G   +ET
Sbjct: 199 YSPIRCDEPQC-----KSLDLSEC--------RNGTCL---YEVSYGDGSYTVGEFATET 242

Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
           + L +  + N  +GC         G+        G G GK S P+Q+N   FSYCL++  
Sbjct: 243 VTLGSAAVENVAIGCG----HNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNRD 298

Query: 254 FD--DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
            D   T   +S +  N ++            P + NP +        +YY+GL+ I+VGG
Sbjct: 299 SDAVSTLEFNSPLPRNAAT-----------APLMRNPEL------DTFYYLGLKGISVGG 341

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           + + +      +D  G GG I+DSGT  T +  E+++ L D FV      +       A 
Sbjct: 342 EALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFV------KGAKGIPKAN 395

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
            ++    C+D+   ++   P +   F  G E+ LP  NY   V      C        + 
Sbjct: 396 GVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSL 455

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                I+GN Q Q   V +D+ N  +GF    C
Sbjct: 456 S----IIGNVQQQGTRVGFDIANSLVGFSVDSC 484


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 113/384 (29%), Positives = 165/384 (42%), Gaps = 58/384 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++  G+P +    ++DTGS + W  C     C  C S   P F P  SS+     C 
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCS 189

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
           +  C+ +  E   C         +S  C      Y V YG G  T G   S+TL L +  
Sbjct: 190 SAACAQLGQEGNGC---------SSSQC-----QYTVTYGDGSSTTGTYSSDTLALGSNA 235

Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTS 261
           +  F  GCS + S    Q  G+ G G G  SL SQ        FSYCL        T +S
Sbjct: 236 VRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCL------PATSSS 289

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           S  L  G+      T+G   TP + +  V        +Y V ++ I VGG+++ +     
Sbjct: 290 SGFLTLGAG-----TSGFVKTPMLRSSQVP------TFYGVRIQAIRVGGRQLSIPTSVF 338

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
           +       GTI+DSGT  T + P  +  L+  F + M   + Y  A  +     L  CFD
Sbjct: 339 S------AGTIMDSGTVLTRLPPTAYSALSSAFKAGM---KQYPSAPPSGI---LDTCFD 386

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGN 440
             G+ + S P + L F GGA V +  +    +    S +CL    + + S   S+ I+GN
Sbjct: 387 FSGQSSVSIPTVALVFSGGAVVDIASDGIM-LQTSNSILCLAFAANSDDS---SLGIIGN 442

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q + + V YD+    +GFK   C
Sbjct: 443 VQQRTFEVLYDVGGGAVGFKAGAC 466


>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
          Length = 204

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 109/222 (49%), Gaps = 26/222 (11%)

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
           KFSYCL S    D ++ S L+L + +    K T     TP + NPS         +YY+ 
Sbjct: 5   KFSYCLTSM---DDSKASVLLLGSLA----KATKDAISTPLLTNPSQPS------FYYLS 51

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           L  I VGG ++ +      +  DG+GG I+DSGTT T++   +F+ L  EF+SQ      
Sbjct: 52  LEGIPVGGTQLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQ------ 105

Query: 364 YTRALGAEALTGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
               L   + TGL  CF +P E T    P+L  HFKGG ++ LP E+Y     +    CL
Sbjct: 106 SNLQLDKSSSTGLDVCFSLPSETTQVEVPKLVFHFKGG-DLELPAESYMIADSKLGVACL 164

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +     AS G S I GN Q QN  V +DL  + + F    C
Sbjct: 165 AM----GASNGMS-IFGNVQQQNILVNHDLEKETISFVPTQC 201


>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
          Length = 415

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 175/387 (45%), Gaps = 60/387 (15%)

Query: 98  PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
           P   I  ++DTGS++ W   T     K CS SK  S +P          C +PKC     
Sbjct: 65  PKDNISAVVDTGSNIFW---TTE---KECSRSKTRSMLP----------CCSPKCE--QR 106

Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETLNL---PNRIIP-- 209
            S  CR    E  A ++  T+   +Y + YG      T G+   + L +    ++ +P  
Sbjct: 107 ASCGCR--RSELKAEAEKETKC--TYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGS 162

Query: 210 ----NFLVGCSV---LSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
                  +GCS    L  + P+  G+ G GR  TSLP QLN  KFSYCL S++  D    
Sbjct: 163 QSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPDL--P 220

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           S L+L   ++  D  T  +     V   ++   + +   Y+V L+ I++GG R+      
Sbjct: 221 SYLLL---TAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPA---- 273

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
             +     G   VD+GT+FT +   +F  L  E + +++K R Y +          + C+
Sbjct: 274 --VSTKSGGNMFVDTGTSFTRLEGTVFAKLVTE-LDRIMKERKYVKE--QPGRNNGQICY 328

Query: 381 DVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
             P    +++   P++ LHF   A + LP ++Y       S +CL +  D+    G   +
Sbjct: 329 SPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKT--TSKLCLAI--DKSNIKGGISV 384

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           LGNFQMQN ++  D  N++L F +  C
Sbjct: 385 LGNFQMQNTHMLLDTGNEKLSFVRADC 411


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 124/474 (26%), Positives = 199/474 (41%), Gaps = 72/474 (15%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALH 61
           L+ IFF+    I+ S  +    S+   H +        P+   +Q   ++V  S+ R  +
Sbjct: 7   LTLIFFYLCCFIYFSHASKKGLSIEMIHRDFSKSPLYHPTVTKFQRAYNVVHRSINRVNY 66

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
                +       +T T  +     G Y IS S GTPP  +   +DTGS++VW  C    
Sbjct: 67  FTKEFSLNKNQPVSTLTPEL-----GEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQ--- 118

Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
            C  C +   P F P  SSS + + C           S  C+D ND  ++ S N   +C 
Sbjct: 119 PCNTCFNQTSPIFNPSKSSSYKNIPCT----------SSTCKDTNDTHISCS-NGGDVCE 167

Query: 182 SYLVLYGSGLTEGIALSETLNLPNR-----IIPNFLVGCSVLS----SRQPAGIAGFGRG 232
             +   G   ++G   +++L L +      + PN ++GC  ++    + Q +G+ G GRG
Sbjct: 168 YSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRG 227

Query: 233 KTSLPSQLNL----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG--LTYTPFVN 286
             SL  Q+       KFSYCL+ +   D+  +S LI        D   +G  +  TP V 
Sbjct: 228 PMSLIKQVGSSSVGSKFSYCLIPYN-SDSNSSSKLIFG-----EDVVVSGEIVVSTPMV- 280

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
                + N    YY++ L   +VG  R+    +Y           ++DSGT  T M P L
Sbjct: 281 -----KVNGQENYYFLTLEAFSVGNNRI----EYGERSNASTQNILIDSGTPLT-MLPNL 330

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
           F      +V+Q VK     R    +    L  C++  G++  + P++  HF  GA+V L 
Sbjct: 331 FLSKLVSYVAQEVK---LPRIEPPDHHLSL--CYNTTGKQL-NVPDITAHFN-GADVKLN 383

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
               F    +G  +C   ++          I GN    N  ++YDL  + + FK
Sbjct: 384 SNGTFFPFEDG-IMCFGFISSNGLE-----IFGNIAQNNLLIDYDLEKEIISFK 431


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 108/385 (28%), Positives = 162/385 (42%), Gaps = 57/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           + +    GTP Q +   LDT +   W PC+    C  C S+ +  F    SSS R L CQ
Sbjct: 103 FVVRAKIGTPAQTLLLALDTSNDAAWIPCSG---CIGCPSTTV--FSSDKSSSFRPLPCQ 157

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P+C+ + + S     C                 + + YGS       + + L L    +
Sbjct: 158 SPQCNQVPNPSCSGSACG----------------FNLTYGSSTVAADLVQDNLTLATDSV 201

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
           P++  GC   +  SS  P G+ G GRG  SL  Q   L    FSYCL S K   F  + R
Sbjct: 202 PSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLR 261

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
              +           +   + YTP + NP    R++    YYV L  I VG + V +   
Sbjct: 262 LGPV----------AQPIRIKYTPLLRNP---RRSSL---YYVNLISIRVGRKIVDIPPS 305

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
            L  +     GT++DSGTTFT +    +  + DEF       R   R +   +L G   C
Sbjct: 306 ALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEF------RRRVGRNVTVSSLGGFDTC 359

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           + VP       P +   F  G  VTLP +N+      GS  CL +    +       ++ 
Sbjct: 360 YTVPIIS----PTITFMF-AGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIA 414

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           + Q QN+ + +D+ N R+G  ++ C
Sbjct: 415 SMQQQNHRILFDIPNSRVGVARESC 439


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 112/383 (29%), Positives = 163/383 (42%), Gaps = 35/383 (9%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +SL  GTP Q    +LDTGS L W  C +  + K        SF P LSSS   L C +P
Sbjct: 83  LSLPIGTPSQSQELVLDTGSQLSWIQC-HPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
            C              D  L TS +  ++C  Y   Y  G   EG  + E     N +  
Sbjct: 142 LCK---------PRIPDFTLPTSCDSNRLC-HYSYFYADGTFAEGNLVKEKFTFSNSQTT 191

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL--LSHKFDDTTRTSSLILD 266
           P  ++GC+   S    GI G   G+ S  SQ  + KFSYC+   S++    +  S  + +
Sbjct: 192 PPLILGCAK-ESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGE 250

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
           N +S   K  + LT+      P     N   + Y V L  I +G +R+ +       D  
Sbjct: 251 NPNSRGFKYVSLLTFPQSQRMP-----NLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAG 305

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD----- 381
           G+G T+VDSG+ FT +    ++ + +E V  +          G+ A      CFD     
Sbjct: 306 GSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADM----CFDGNHQM 361

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
           V G   G      L F+ G  V + VE    +V  G  +    +      G  S I+GN 
Sbjct: 362 VIGRLIGD-----LVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASNIIGNV 416

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
             QN +VE+D+ N+R+GF +  C
Sbjct: 417 HQQNLWVEFDVANRRVGFSKAEC 439


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 120/412 (29%), Positives = 174/412 (42%), Gaps = 98/412 (23%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y+  L  GTPPQ    I+DTGS + + PC+    CK C   + P F P+LSSS + 
Sbjct: 76  SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST---CKQCGKHQDPKFQPELSSSYKA 132

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL--- 201
           L C NP C           +C+DE     K C      Y   Y    +    LSE L   
Sbjct: 133 LKC-NPDC-----------NCDDE----GKLCV-----YERRYAEMSSSSGVLSEDLISF 171

Query: 202 NLPNRIIPNFLV-GCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDK------FSYCL 249
              +++ P   V GC       L S++  GI G GRGK S+  QL +DK      FS C 
Sbjct: 172 GNESQLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLSVVDQL-VDKGVIEDVFSLCY 230

Query: 250 LSHKFDDTTRTSSLILDNGS-------SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
              +        +++L   S       SHSD         PF            S YY +
Sbjct: 231 GGMEVG----GGAMVLGKISPPAGMVFSHSD---------PFR-----------SPYYNI 266

Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
            L+++ V G+ +++  K      +G  GT++DSGTT+ +   E F  + D  + ++   +
Sbjct: 267 DLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGTTYAYFPKEAFIAIKDAIIKEIPSLK 322

Query: 363 NYTRALGAEALTGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FA 412
                     + G  P     CF   G         FPE+ + F  G ++ L  ENY F 
Sbjct: 323 R---------IHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYLFR 373

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 A CL +  DR++    + +LG   ++N  V YD  N +LGF +  C
Sbjct: 374 HTKVRGAYCLGIFPDRDS----TTLLGGIVVRNTLVTYDRENDKLGFLKTNC 421


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 122/420 (29%), Positives = 177/420 (42%), Gaps = 61/420 (14%)

Query: 63  KNPQTKTTTTTTTTT-----------TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
           K+ Q + +TTTT +             ++ S+   G Y +++  GTP      + DTGS 
Sbjct: 124 KSIQRRVSTTTTVSRGKPKRNRPSLPASSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSD 183

Query: 112 LVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLA 171
             W  C       Y    K+  F P  SS+   + C  P CS ++               
Sbjct: 184 TTWVQCEPCVVVCYKQQEKL--FDPARSSTYANISCAAPACSDLY--------------- 226

Query: 172 TSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RIIPNFLVGCSVLSSR---QPAGI 226
             K C+     Y V YG G  + G    +TL L +   I  F  GC   +     + AG+
Sbjct: 227 -IKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNEGLYGEAAGL 285

Query: 227 AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
            G GRGKTSLP Q   DK+   + +H F   + + +  LD G       +  LT    V+
Sbjct: 286 LGLGRGKTSLPVQA-YDKYG-GVFAHCFPARS-SGTGYLDFGPGSLPAVSAKLTTPMLVD 342

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
           N           +YYVGL  I VGG+ + +     T       GTIVDSGT  T + P  
Sbjct: 343 NGPT--------FYYVGLTGIRVGGKLLSIPQSVFT-----TSGTIVDSGTVITRLPPAA 389

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
           +  L   F S M + R Y +   A AL+ L  C+D  G    + P + L F+GGA + + 
Sbjct: 390 YSSLRSAFASAMAE-RGYKK---APALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVH 445

Query: 407 VEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                Y A V   S  CL    ++E       I+GN Q++ + V YD+  + +GF    C
Sbjct: 446 ASGIIYAASV---SQACLGFAGNKEDD--DVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 108/385 (28%), Positives = 162/385 (42%), Gaps = 57/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           + +    GTP Q +   LDT +   W PC+    C  C S+ +  F    SSS R L CQ
Sbjct: 26  FVVRAKIGTPAQTLLLALDTSNDAAWIPCSG---CIGCPSTTV--FSSDKSSSFRPLPCQ 80

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P+C+ + + S     C                 + + YGS       + + L L    +
Sbjct: 81  SPQCNQVPNPSCSGSACG----------------FNLTYGSSTVAADLVQDNLTLATDSV 124

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
           P++  GC   +  SS  P G+ G GRG  SL  Q   L    FSYCL S K   F  + R
Sbjct: 125 PSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLR 184

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
              +           +   + YTP + NP    R++    YYV L  I VG + V +   
Sbjct: 185 LGPV----------AQPIRIKYTPLLRNP---RRSSL---YYVNLISIRVGRKIVDIPPS 228

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
            L  +     GT++DSGTTFT +    +  + DEF       R   R +   +L G   C
Sbjct: 229 ALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEF------RRRVGRNVTVSSLGGFDTC 282

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           + VP       P +   F  G  VTLP +N+      GS  CL +    +       ++ 
Sbjct: 283 YTVPIIS----PTITFMF-AGMNVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIA 337

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           + Q QN+ + +D+ N R+G  ++ C
Sbjct: 338 SMQQQNHRILFDIPNSRVGVARESC 362


>gi|383130042|gb|AFG45741.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 57/146 (39%), Positives = 86/146 (58%), Gaps = 5/146 (3%)

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           L YTPF+ N   A  + +  +YY+ LR +++G +R+ +  K  + D  GNGGTI+DSGTT
Sbjct: 15  LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDSKGNGGTIIDSGTT 73

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
           FT    E ++ +   F SQ+     + RA   EA TG+R C++V G      P+   HFK
Sbjct: 74  FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
           GG+++ LPV NYF+      ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 152/370 (41%), Gaps = 49/370 (13%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           + +  G PPQ    I D  +   W  C     C  C       F P  SSS  LL C+  
Sbjct: 189 VQIGVGGPPQKFYMIFDLQTDFTWLQCQ---PCIKCYDQPDSIFDPSQSSSYTLLSCETK 245

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNR-II 208
            C+ + + S     C+D+              Y + Y  G  TEG+ ++ET++  +   +
Sbjct: 246 HCNLLPNSS-----CSDDGYC----------RYNITYKDGTNTEGVLINETVSFESSGWV 290

Query: 209 PNFLVGCSVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
               +GCS   ++ P     G  G GRG  S PS++N    SYCL+  K  D   +S+L 
Sbjct: 291 DRVSLGCSN-KNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCLVESK--DGYSSSTLE 347

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
            ++         +G      + NP           YYVGL+ I VGG+++ V +   T+D
Sbjct: 348 FNS------PPCSGSVKAKLLQNPKAEN------LYYVGLKGIKVGGEKIDVPNSTFTID 395

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
             GNGG IV S +  T +  + +  + D FV+   K ++  R    +A      C+++  
Sbjct: 396 PYGNGGMIVSSSSLITMLENDTYNVVRDAFVA---KTQHLER---LKAFLQFDTCYNLSS 449

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
             T   P L+     G    LP E+Y   V +    C         S G   ILG  Q  
Sbjct: 450 NNTVELPILEFEVNDGKSWLLPKESYLYAVDKNGTFCFAFA----PSKGSFSILGTLQQY 505

Query: 445 NYYVEYDLRN 454
              V +DL N
Sbjct: 506 GTRVTFDLVN 515


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 104/380 (27%), Positives = 168/380 (44%), Gaps = 40/380 (10%)

Query: 93  LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
           ++ G   Q    I+DTGS L W  C     C+ C + + P F P  SSS   L C +P C
Sbjct: 68  VTVGIGGQNSTLIVDTGSDLTWVQC---LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTC 124

Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNF 211
             +   +          L ++KN T     Y + YG G  + G    E L L    I NF
Sbjct: 125 VALQPTA------GSSGLCSNKNSTSC--DYQIDYGDGSYSRGELGFEKLTLGKTEIDNF 176

Query: 212 LVGCSVLSSR---QPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLIL 265
           + GC   +       +G+ G  R + SL SQ   L    FSYCL +        + SL L
Sbjct: 177 IFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTT---GVGSSGSLTL 233

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
                 + K  + ++YT  + NP ++       +Y++ L  I++GG  + V      L  
Sbjct: 234 GGADFSNFKNISPISYTRMIQNPQMSN------FYFLNLTGISIGGVNLNVPR----LSS 283

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
           +    +++DSGT  T ++P +++    EF  Q    R           + L  CF++ G 
Sbjct: 284 NEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRT------TPGFSILNTCFNLTGY 337

Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVV-GEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
           +  + P +K  F+G AE+ + VE  F  V  + S +CL   +        ++I+GN+Q +
Sbjct: 338 EEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYED--QTMIIGNYQQK 395

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
           N  V Y+ +  ++GF  + C
Sbjct: 396 NQRVIYNSKESKVGFAGEPC 415


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 109/385 (28%), Positives = 153/385 (39%), Gaps = 55/385 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP     ++D+GS ++W  C     C+ C +   P F P  SSS   + 
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR---PCEQCYAQTDPLFDPAASSSFSGVS 184

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  +                 +  C      Y V YG G  T+G    ETL L  
Sbjct: 185 CGSAICRTLSGTGCGG-------GGDAGKC-----DYSVTYGDGSYTKGELALETLTLGG 232

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
             +    +GC   +S      AG+ G G G  SL  QL       FSYCL S        
Sbjct: 233 TAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRG---AGG 289

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
             SL+L                       +V      S +YYVGL  I VGG+R+ +   
Sbjct: 290 AGSLVLGR-------------------TEAVPRGRRASSFYYVGLTGIGVGGERLPLQDS 330

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L  DG GG ++D+GT  T +  E +  L   F   M           + A++ L  C
Sbjct: 331 LFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPR------SPAVSLLDTC 384

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F  GA +TLP  N    VG G+  CL       +S G S ILG
Sbjct: 385 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVG-GAVFCLAFA---PSSSGIS-ILG 439

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   +  D  N  +GF    C
Sbjct: 440 NIQQEGIQITVDSANGYVGFGPNTC 464


>gi|383130040|gb|AFG45740.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 57/146 (39%), Positives = 86/146 (58%), Gaps = 5/146 (3%)

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           L YTPF+ N   A  + +  +YY+ LR +++G +R+ +  K  + D  GNGGTI+DSGTT
Sbjct: 15  LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTT 73

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
           FT    E ++ +   F SQ+     + RA   EA TG+R C++V G      P+   HFK
Sbjct: 74  FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
           GG+++ LPV NYF+      ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 124/388 (31%), Positives = 164/388 (42%), Gaps = 66/388 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++S GTP      ++DTGS + W  C  H +    SS     F P  SS+     C 
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHC--HARAGAGSSLF---FDPGKSSTYTPFSCS 179

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
           +  C+ +      C          S N T  C  Y V YG G  T G   S+TL L    
Sbjct: 180 SAACTRLEGRDNGC----------SLNST--C-QYTVRYGDGSNTTGTYGSDTLALNSTE 226

Query: 207 IIPNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
            + NF  GCS        L   Q  G+ G G G  SL SQ        FSYCL +     
Sbjct: 227 KVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPA----- 281

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           TTR+S  +    S+     T+G     FV  P    R A   +Y+V L+ I VGG  V +
Sbjct: 282 TTRSSGFLTLGAST----GTSG-----FVTTPMFRSRRA-PTFYFVILQGINVGGDPVAI 331

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                        G+I+DSGT  T + P  +  L+  F + M   R Y RA    A + L
Sbjct: 332 SPTVFA------AGSIMDSGTIITRLPPRAYSALSAAFRAGM---RRYPRA---RAFSIL 379

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CFD  G+   S P ++L F GGA V L  +        GS  CL       A+GG   
Sbjct: 380 DTCFDFTGQDNVSIPAVELVFSGGAVVDLDADGIM----YGS--CLAFA---PATGGIGS 430

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q + + V +D+    LGF+   C
Sbjct: 431 IIGNVQQRTFEVLHDVGQSVLGFRPGAC 458


>gi|361067845|gb|AEW08234.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130032|gb|AFG45736.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130034|gb|AFG45737.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130036|gb|AFG45738.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130046|gb|AFG45743.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130048|gb|AFG45744.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130050|gb|AFG45745.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130054|gb|AFG45747.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130056|gb|AFG45748.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 57/146 (39%), Positives = 86/146 (58%), Gaps = 5/146 (3%)

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           L YTPF+ N   A  + +  +YY+ LR +++G +R+ +  K  + D  GNGGTI+DSGTT
Sbjct: 15  LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTT 73

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
           FT    E ++ +   F SQ+     + RA   EA TG+R C++V G      P+   HFK
Sbjct: 74  FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
           GG+++ LPV NYF+      ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 112/395 (28%), Positives = 169/395 (42%), Gaps = 58/395 (14%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP----SFIPKLSSSSRLL 145
           +ISL+ G+PPQ +  +LDTGS L W  C            K+P    +F P LSSS    
Sbjct: 60  TISLTIGSPPQNVTMVLDTGSELSWLHC-----------KKLPNLNSTFNPLLSSSYTPT 108

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
            C +  C        + RD    P +   N  ++C   +    +   EG   +ET +L  
Sbjct: 109 PCNSSVCM------TRTRDLTI-PASCDPN-NKLCHVIVSYADASSAEGTLAAETFSLAG 160

Query: 206 RIIPNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
              P  L GC         +    +  G+ G  RG  SL +Q+ L KFSYC+        
Sbjct: 161 AAQPGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSYCISGED---- 216

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
                L+L +G S      + L YTP V   + +      V Y V L  I V  + +++ 
Sbjct: 217 -AFGVLLLGDGPS----APSPLQYTPLVTA-TTSSPYFDRVAYTVQLEGIKVSEKLLQLP 270

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM------VKNRNYTRALGAE 371
                 D  G G T+VDSGT FTF+   ++  L DEF+ Q       +++ N+    GA 
Sbjct: 271 KSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFE-GAM 329

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS--AVCLTVVTDRE 429
            L     C+  P     + P + L F  GAE+ +  E     V +G     C T   + +
Sbjct: 330 DL-----CYHAPA-SLAAVPAVTLVFS-GAEMRVSGERLLYRVSKGRDWVYCFT-FGNSD 381

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             G  + ++G+   QN ++E+DL   R+GF +  C
Sbjct: 382 LLGIEAYVIGHHHQQNVWMEFDLVKSRVGFTETTC 416


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 161/382 (42%), Gaps = 48/382 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y   +  G P ++   + DTGS + W  C        C     P F PK SSS   L C 
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-R 206
           + +C  +        +CN      S  C      Y V YG G  T G   +ETL+  N  
Sbjct: 208 SQQCKLLDKA-----NCN------SDTCI-----YQVHYGDGSFTTGELATETLSFGNSN 251

Query: 207 IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
            IPN  +GC   +       AG+ G G G  SL SQL    FSYCL++   D    +SS 
Sbjct: 252 SIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSD----SSST 307

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
           +  N    SD  T+     P V N      + F  Y YV +  I+VGG+ + +      +
Sbjct: 308 LEFNSYMPSDSLTS-----PLVKN------DRFHSYRYVKVVGISVGGKTLPISPTRFEI 356

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG-AEALTGLRPCFDV 382
           D  G GG IVDSGT  + +  +++E L + FV         T +L  A  ++    C++ 
Sbjct: 357 DESGLGGIIVDSGTIISRLPSDVYESLREAFV-------KLTSSLSPAPGISVFDTCYNF 409

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
            G+     P +      G  + LP  NY  ++      CL  +  + +      I+G+FQ
Sbjct: 410 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLS----IIGSFQ 465

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            Q   V YDL N  +GF    C
Sbjct: 466 QQGIRVSYDLTNSIVGFSTNKC 487


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 104/380 (27%), Positives = 168/380 (44%), Gaps = 40/380 (10%)

Query: 93  LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
           ++ G   Q    I+DTGS L W  C     C+ C + + P F P  SSS   L C +P C
Sbjct: 147 VTVGIGGQNSTLIVDTGSDLTWVQC---LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTC 203

Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNF 211
             +   +          L ++KN T     Y + YG G  + G    E L L    I NF
Sbjct: 204 VALQPTA------GSSGLCSNKNSTSC--DYQIDYGDGSYSRGELGFEKLTLGKTEIDNF 255

Query: 212 LVGCSVLSSR---QPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLIL 265
           + GC   +       +G+ G  R + SL SQ   L    FSYCL +        + SL L
Sbjct: 256 IFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPT---TGVGSSGSLTL 312

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
                 + K  + ++YT  + NP ++       +Y++ L  I++GG  + V      L  
Sbjct: 313 GGADFSNFKNISPISYTRMIQNPQMSN------FYFLNLTGISIGGVNLNVPR----LSS 362

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
           +    +++DSGT  T ++P +++    EF  Q    R           + L  CF++ G 
Sbjct: 363 NEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRT------TPGFSILNTCFNLTGY 416

Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVV-GEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
           +  + P +K  F+G AE+ + VE  F  V  + S +CL   +        ++I+GN+Q +
Sbjct: 417 EEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYED--QTMIIGNYQQK 474

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
           N  V Y+ +  ++GF  + C
Sbjct: 475 NQRVIYNSKESKVGFAGEPC 494


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 104/385 (27%), Positives = 160/385 (41%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ++D+GS +VW  C    QC + S    P F P  S+S   + 
Sbjct: 138 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSD---PVFDPADSASFTGVS 194

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  + +       C                 Y V YG G  T+G    ETL    
Sbjct: 195 CSSSVCDRLENAGCHAGRCR----------------YEVSYGDGSYTKGTLALETLTFGR 238

Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
            ++ +  +GC   +       AG+ G G G  S   QL       FSYCL+S   D +  
Sbjct: 239 TMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSS-- 296

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
             SL+        +    G  + P V NP          +YY+GL  + VGG RV +  +
Sbjct: 297 -GSLVFGR-----EALPAGAAWVPLVRNPRAPS------FYYIGLAGLGVGGIRVPISEE 344

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L   G+GG ++D+GT  T +    ++   D F++Q     N  RA G         C
Sbjct: 345 VFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTA---NLPRATGVAIFD---TC 398

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F GG  +TLP  N+   + +    C        ++ G S ILG
Sbjct: 399 YDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFA---PSTSGLS-ILG 454

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   + +D  N  +GF   +C
Sbjct: 455 NIQQEGIQISFDGANGYVGFGPNIC 479


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 121/392 (30%), Positives = 163/392 (41%), Gaps = 62/392 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP +    +LDTGS +VW  C     C  C S   P F P LS+S   LG
Sbjct: 195 GEYFTRIGVGTPMREQYMVLDTGSDVVWIQCE---PCSKCYSQVDPIFNPSLSASFSTLG 251

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  CS++                 + NC      Y V YG G  T G   +E L    
Sbjct: 252 CNSAVCSYLD----------------AYNCHGGGCLYKVSYGDGSYTIGSFATEMLTFGT 295

Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
             + N  +GC        AG+        G G G  S PSQL       FSYCL+     
Sbjct: 296 TSVRNVAIGCG----HDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCLV----- 346

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
           D    SS  L+ G    +    G   TP + NPS+        +YYV L  I+VGG  + 
Sbjct: 347 DRFSESSGTLEFGP---ESVPLGSILTPLLTNPSLP------TFYYVPLISISVGGALLD 397

Query: 315 RVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEA 372
            V      +D   G GG IVDSGT  T +   +++ + D FV+        TR L  AE 
Sbjct: 398 SVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAG-------TRQLPKAEG 450

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
           ++    C+D+ G    + P +  HF  GA + LP +NY   +      C        A+ 
Sbjct: 451 VSIFDTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFA---PATS 507

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             S I+GN Q Q   V +D  N  +GF  + C
Sbjct: 508 DLS-IMGNIQQQGIRVSFDTANSLVGFALRQC 538


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 163/406 (40%), Gaps = 65/406 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   ++ GTP       LDT S L W  C     C+ C     P F P+ S+S   + 
Sbjct: 139 GDYIAKIAVGTPAVEALLALDTASDLTWLQCQ---PCRRCYPQSGPVFDPRHSTSYGEMN 195

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-------LTEGIALSE 199
              P C  +                 +K  T I   Y VLYG G        + G  + E
Sbjct: 196 YDAPDCQALGRSG----------GGDAKRGTCI---YTVLYGDGDGHGSTSTSVGDLVEE 242

Query: 200 TLNLPNRIIPNFL-VGCS----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLL 250
           TL     +   +L +GC      L     AGI G  RG+ S+P Q+        FSYCL+
Sbjct: 243 TLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLV 302

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                  + +S+L    G+          T  P    P+V  +N    +YYV L  ++VG
Sbjct: 303 DFISGPGSPSSTLTFGAGAVD--------TSPPASFTPTVLNQN-MPTFYYVRLIGVSVG 353

Query: 311 GQRV-RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           G RV  V  + L LD   G+GG I+DSGTT T +A     P    + +     R     L
Sbjct: 354 GVRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLA----RP---AYTAFRDAFRAAATGL 406

Query: 369 GAEALTGLRPCFDV---PGEKTG-----SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
           G  +  G    FD     G + G       P + +HF GG E++L  +NY   V     V
Sbjct: 407 GQVSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTV 466

Query: 421 CLTVV--TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           C       DR  S     ++GN   Q + V YD+  QR+GF    C
Sbjct: 467 CFAFAGTGDRSVS-----VIGNILQQGFRVVYDIGGQRVGFAPNSC 507


>gi|383130044|gb|AFG45742.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 56/146 (38%), Positives = 86/146 (58%), Gaps = 5/146 (3%)

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           L YTPF+ N   A  + ++ +YY+ LR +++G +R+ +  K  + D  GNGGTI+DSGTT
Sbjct: 15  LNYTPFLINTK-ASSSGYNTFYYIDLRGVSIGRKRLNLPSKLFSFDNKGNGGTIIDSGTT 73

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
           FT    E ++ +   F SQ+     + RA   EA TG+R C++  G      P+   HFK
Sbjct: 74  FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNASGVDHVLLPDFAFHFK 129

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
           GG+++ LPV NYF+      ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 133/482 (27%), Positives = 205/482 (42%), Gaps = 87/482 (18%)

Query: 10  LSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALH 61
           L+ + F+ L +IF     +  FS+   H +        P++  +Q + + V  S+ RA H
Sbjct: 9   LALVLFY-LCNIFYLEAFNGGFSVEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRANH 67

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
           +     ++  +  +  TT IS+   G Y IS S GTP   +  ILDTGS ++W  C    
Sbjct: 68  LN----QSFVSPNSPETTVISA--LGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQ--- 118

Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
            CK C     P F    S + + L C +  C     +S+Q   C     ++ K+C     
Sbjct: 119 PCKKCYEQTTPIFDSSKSQTYKTLPCPSNTC-----QSVQGTFC-----SSRKHCL---- 164

Query: 182 SYLVLYGSG-------LTEGIALSETLNLPNRIIPNFLVGC----SVLSSRQPAGIAGFG 230
            Y + Y  G         E + L  T   P +  P  ++GC    ++    + +GI G G
Sbjct: 165 -YSIHYVDGSQSLGDLSVETLTLGSTNGSPVQ-FPGTVIGCGRYNAIGIEEKNSGIVGLG 222

Query: 231 RGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
           RG  SL +QL+     KFSYCL+      +T +S L   N +  S +   G   TP  + 
Sbjct: 223 RGPMSLITQLSPSTGGKFSYCLVPGL---STASSKLNFGNAAVVSGR---GTVSTPLFS- 275

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
                +N   V+Y++ L   +VG  R+    ++ +    G G  I+DSGTT T +   ++
Sbjct: 276 -----KNGL-VFYFLTLEAFSVGRNRI----EFGSPGSGGKGNIIIDSGTTLTALPNGVY 325

Query: 348 EPL----ADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAE 402
             L    A   + Q V++ N    L          C+ V P +   S P +  HF  GA+
Sbjct: 326 SKLEAAVAKTVILQRVRDPNQVLGL----------CYKVTPDKLDASVPVITAHFS-GAD 374

Query: 403 VTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
           VTL   N F  V +   VC         +     + GN   QN  V YDL+   + FK  
Sbjct: 375 VTLNAINTFVQVAD-DVVCFAFQPTETGA-----VFGNLAQQNLLVGYDLQMNTVSFKHT 428

Query: 463 LC 464
            C
Sbjct: 429 DC 430


>gi|383130052|gb|AFG45746.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 57/146 (39%), Positives = 86/146 (58%), Gaps = 5/146 (3%)

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           L YTPF+ N   A  + +  +YY+ LR +++G +R+ +  K  + D  GNGGTI+DSGTT
Sbjct: 15  LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTT 73

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
           FT    E ++ +   F SQ+     + RA   EA TG+R C++V G      P+   HFK
Sbjct: 74  FTIFNEEFYKNITAAFSSQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
           GG+++ LPV NYF+      ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 112/389 (28%), Positives = 166/389 (42%), Gaps = 45/389 (11%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +SL  GTPPQ    +LDTGS L W  C +    K     ++P  +PK  ++S      + 
Sbjct: 68  VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKK-----RLPP-LPKPKTASFDPSLSSS 121

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-I 208
                 +  I      D  L TS +  ++C  Y   Y  G L EG  + E       +  
Sbjct: 122 FSLLPCNHPICKPRIPDFTLPTSCDQNRLC-HYSYFYADGTLAEGNLVREKFTFSKSLST 180

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
           P  ++GC+  S+    GI G   G+ S  SQ  + KFSYC+ S    + T    L  DN 
Sbjct: 181 PPVILGCAQASTEN-RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYL-GDNP 238

Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
           +S   K  T LT+    ++P     N   + Y + ++ I + G+R+ +       D  G+
Sbjct: 239 NSSKFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 293

Query: 329 GGTIVDSGTTFTFMAPELFEPLADE---FVSQMVKNRNYTRALGAEALTGLRPCFD--VP 383
           G T++DSG+  T++  E +E + +E    V  M+K + Y  A  A+       CFD  V 
Sbjct: 294 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK-KGYVYADVADM------CFDAGVT 346

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--------VTDREASGGPS 435
            E       +   F  G E+          VG G  V   V        +   E  G  S
Sbjct: 347 AEVGRRIGGISFEFDNGVEI---------FVGRGEGVLTEVEKGVKCVGIGRSERLGIGS 397

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+G    QN +VEYDL N+R+GF    C
Sbjct: 398 NIIGTVHQQNMWVEYDLANKRVGFGGAEC 426


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 113/393 (28%), Positives = 158/393 (40%), Gaps = 79/393 (20%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ILDTGS L W  C                             
Sbjct: 168 GEYFMDVLVGSPPKHFSLILDTGSDLNWIQC----------------------------- 198

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--LNLP 204
                        + C DC        +N  Q CP Y     S  T G    ET  +NL 
Sbjct: 199 -------------LPCYDC------FQQNDNQSCPYYYWYGDSSNTTGDFAVETFTVNLT 239

Query: 205 NR-------IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLS 251
                     + N + GC   +       AG+ G GRG  S  SQL       FSYCL+ 
Sbjct: 240 TNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVD 299

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
               DT  +S LI   G          L +T FV      + N    +YYV ++ I V G
Sbjct: 300 RN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFV----AGKENLVDTFYYVQIKSILVAG 352

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           + + +  +   +  DG GGTI+DSGTT ++ A    EP A EF+   +  +   +     
Sbjct: 353 EVLNIPEETWNISSDGAGGTIIDSGTTLSYFA----EP-AYEFIKNKIAEKAKGKYPVYR 407

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
               L PCF+V G      PEL + F  GA    P EN F  + E   VCL ++   +++
Sbjct: 408 DFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSA 466

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                I+GN+Q QN+++ YD +  RLG+    C
Sbjct: 467 FS---IIGNYQQQNFHILYDTKRSRLGYAPTKC 496


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 109/402 (27%), Positives = 170/402 (42%), Gaps = 63/402 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           Y +++  GTPP+    + DTGS L W    PC +      C   + P F P  SS+   +
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPD----SSCYPQQEPLFDPSKSSTYVDV 177

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE---TLN 202
            C  P+C   H   +Q   C       + +C      Y V YG       +L+E   TL+
Sbjct: 178 PCSAPEC---HIGGVQQTRCG------ATSC-----EYSVKYGDESETHGSLAEETFTLS 223

Query: 203 LPNRIIP---NFLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLD------KFS 246
            P+ + P     + GCS         +    AG+ G GRG +S+ SQ           FS
Sbjct: 224 PPSPLAPAATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFS 283

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           YCL        + T  L +  G++   ++ + L++TP +   S   R+A    Y V L  
Sbjct: 284 YCLPPRG----SSTGYLTIGGGAAAPQQQYSNLSFTPLITTIS-QLRSA----YVVNLAG 334

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           ++V G  V +     +L      G ++DSGT  T M    + PL DEF   M       +
Sbjct: 335 VSVNGAAVDIPASAFSL------GAVIDSGTVVTHMPAAAYYPLRDEFRLHM----GSYK 384

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAVCL 422
            L   ++  L  C+DV G+   + P + L F GGA + +       V+    G G ++ L
Sbjct: 385 MLPEGSMKLLDTCYDVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTL 444

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +     +    +I+GN Q + Y V +D+   R+GF    C
Sbjct: 445 ACLAFLPTNSAGLVIVGNMQQRAYNVVFDVDGGRIGFGPNGC 486


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 114/398 (28%), Positives = 173/398 (43%), Gaps = 72/398 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +    G+PP     ++DTGS L+W  C+    C  C   + P F P  SS+ +   
Sbjct: 87  GEYLMRFYIGSPPVERLAMVDTGSSLIWLQCS---PCHNCFPQETPLFEPLKSSTYKYAT 143

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C+ +       RDC          C      Y ++YG    + GI  +ETL+  +
Sbjct: 144 CDSQPCTLLQPSQ---RDC-----GKLGQCI-----YGIMYGDKSFSVGILGTETLSFGS 190

Query: 206 R------IIPNFLVGCSV------LSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLL 250
                    PN + GC V       +S +  GIAG G G  SL SQL      KFSYCLL
Sbjct: 191 TGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKFSYCLL 250

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
            +   D+T TS L      S +   T G+  TP +  PS+        YY++ L  +T+G
Sbjct: 251 PY---DSTSTSKLKF---GSEAIITTNGVVSTPLIIKPSLP------TYYFLNLEAVTIG 298

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
            + V       T   DGN   ++DSGT  T++    +    + FV+ +         LG 
Sbjct: 299 QKVVS------TGQTDGN--IVIDSGTPLTYLENTFY----NNFVASL------QETLGV 340

Query: 371 EAL----TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
           + L    + L+ CF  P     + P++   F  GA V L  +N    + + + +CL VV 
Sbjct: 341 KLLQDLPSPLKTCF--PNRANLAIPDIAFQFT-GASVALRPKNVLIPLTDSNILCLAVV- 396

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              +SG    + G+    ++ VEYDL  +++ F    C
Sbjct: 397 --PSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDC 432


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 92/337 (27%), Positives = 149/337 (44%), Gaps = 34/337 (10%)

Query: 31  FSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
             L+      S    Q L+  ++ S  R   +++           T    + + S G Y 
Sbjct: 31  LKLTHVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYL 90

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           + L+ GTPP     I+DTGS L+W  C     C  C+    P F  K S++ R L C++ 
Sbjct: 91  VDLAIGTPPLYYTAIMDTGSDLIWTQCA---PCLLCADQPTPYFDVKKSATYRALPCRSS 147

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-----NLP 204
           +C+ +                +S +C +    Y   YG +  T G+  +ET      N  
Sbjct: 148 RCASL----------------SSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANST 191

Query: 205 NRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
                N   GC  L++   A   G+ GFGRG  SL SQL   +FSYCL S+     +R  
Sbjct: 192 KVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLY 251

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
             +  N SS +    + +  TPFV NP++         Y++ L+ I++G + + +     
Sbjct: 252 FGVYANLSSTNTSSGSPVQSTPFVINPALPN------MYFLSLKAISLGTKLLPIDPLVF 305

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
            ++ DG GG I+DSGT+ T++  + +E +    VS +
Sbjct: 306 AINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI 342


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 135/476 (28%), Positives = 199/476 (41%), Gaps = 89/476 (18%)

Query: 10  LSFIFFFTLLSIFP-SSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRAL 60
           L  +F+F+L  I   S   +  FS+   H +        P+Q+ YQ++ +    S+ RA 
Sbjct: 6   LLILFYFSLCFIISLSHALNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSINRAN 65

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
           H        T  T T  +T I  H  G Y ++ S GTPP  +  I DTGS +VW  C   
Sbjct: 66  HFYK-----TALTNTPQSTVIPDH--GEYLMTYSVGTPPFKLYGIADTGSDIVWLQCE-- 116

Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
             CK C +   P F P  SS+ + + C +  C                     K+  Q  
Sbjct: 117 -PCKECYNQTTPKFKPSKSSTYKNIPCSSDLC---------------------KSGQQ-- 152

Query: 181 PSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC----SVLSSRQPAGIAGFGRGKTSL 236
                  G+   + + L  +   P    P  ++GC    +V      +GI G G G  SL
Sbjct: 153 -------GNLSVDTLTLESSTGHPISF-PKTVIGCGTDNTVSFEGASSGIVGLGGGPASL 204

Query: 237 PSQL--NLD-KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER 293
            +QL  ++D KFSYCLL +  +  T +     D      D    G+  TP V    +   
Sbjct: 205 ITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGD----GVVSTPIVKKDPI--- 257

Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG----TIVDSGTTFTFMAPELFEP 349
               V+YY+ L   +VG +R+         +   NGG     I+DSGTT T +  +++  
Sbjct: 258 ----VFYYLTLEAFSVGNKRIE-------FEGSSNGGHEGNIIIDSGTTLTVIPTDVYNN 306

Query: 350 LADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN 409
           L +  V ++VK +   R      L  L  C+ V  +    FP +  HFKG A+V L   +
Sbjct: 307 L-ESAVLELVKLK---RVNDPTRLFNL--CYSVTSDGY-DFPIITTHFKG-ADVKLHPIS 358

Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            F  V +G  VCL   T         + I GN   QN  V YDL+ + + FK   C
Sbjct: 359 TFVDVADG-IVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDC 413


>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
 gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 175/398 (43%), Gaps = 62/398 (15%)

Query: 89  YSISLSFGT--PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           Y I+   G   P   I  ++DTGS + W   T     K CS SK  S +P          
Sbjct: 110 YIITFYLGNQRPEDNISAVVDTGSDIFW---TTE---KECSRSKTRSMLP---------- 153

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETLNL 203
           C +PKC     +   C  C    L         C +Y ++YG      T G+   + L +
Sbjct: 154 CCSPKC----EQRASC-GCGRSELKAEAEKETKC-TYAIIYGGNANDSTAGVMYEDKLTI 207

Query: 204 ---PNRIIPN------FLVGCSV---LSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCL 249
               ++ +P+        +GCS    L  + P+  G+ G GR  TSLP QLN  KFSYCL
Sbjct: 208 VAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCL 267

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
            S++  D    S L+L   ++  D  T  +     V   ++   + +   Y+V L+ I++
Sbjct: 268 SSYQEPDL--PSYLLL---TAAPDMATGAVGGGAAVATTALQPNSDYKTLYFVHLQNISI 322

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           GG R      +  +     G   VD+G +FT +   +F  L  E + +++K R Y +   
Sbjct: 323 GGTR------FPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTE-LDRIMKERKYVKEQP 375

Query: 370 AEALTGLRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
                  + C+  P    +++   P++ LHF   A + LP ++Y       S +CL +  
Sbjct: 376 GR--NNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKT--TSKLCLAIYK 431

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                GG S +LGNFQMQN ++  D  N++L F +  C
Sbjct: 432 S-NIKGGIS-VLGNFQMQNTHMLLDTGNEKLSFVRADC 467


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 109/387 (28%), Positives = 164/387 (42%), Gaps = 51/387 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +S+  GTP + +  I DTGS L W  C      +YC + K P F+P  S++   + 
Sbjct: 129 GNYIVSVGLGTPKKYLSLIFDTGSDLTWTQC--QPCARYCYNQKDPVFVPSQSTTYSNIS 186

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +P CS +   +      N    + ++ C      Y + YG    + G    ETL L +
Sbjct: 187 CSSPDCSQLESGT-----GNQPGCSAARACI-----YGIQYGDQSFSVGYFAKETLTLTS 236

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
             +I NFL GC   +       AG+ G G+ K S+  Q        FSYCL        T
Sbjct: 237 TDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCL------PKT 290

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            +S+  L   +         L YTP      VA       +Y V +  + VGG ++ +  
Sbjct: 291 SSSTGYL---TFGGGGGGGALKYTPITKAHGVAN------FYGVDIVGMKVGGTQIPISS 341

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +       G I+DSGT  T + P+ +  L   F   M K   Y +   A  L+ L  
Sbjct: 342 SVFS-----TSGAIIDSGTVITRLPPDAYSALKSAFEKGMAK---YPK---APELSILDT 390

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG-SAVCLTVVTDREASGGPSII 437
           C+D+    T   P++   FKGG E+ L  +    + G   S VCL    +++ S     I
Sbjct: 391 CYDLSKYSTIQIPKVGFVFKGGEELDL--DGIGIMYGASTSQVCLAFAGNQDPS--TVAI 446

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +GN Q +   V YD+   ++GF    C
Sbjct: 447 IGNVQQKTLQVVYDVGGGKIGFGYNGC 473


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 108/381 (28%), Positives = 154/381 (40%), Gaps = 54/381 (14%)

Query: 93  LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
           +  G P Q   F+LDTGS + W  C        C     P F P+LSSS   + C + +C
Sbjct: 1   MRVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQC 60

Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN-LPNRIIPN 210
             +         C                 Y V YG G  T G   +ETL  + +  IPN
Sbjct: 61  QLLDEAGCNVNSC----------------IYKVEYGDGSFTIGELATETLTFVHSNSIPN 104

Query: 211 FLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
             +GC         G+        G G G  S+ SQL    FSYCL+     D    S  
Sbjct: 105 ISIGCG----HDNEGLFVGADGLIGLGGGAISISSQLKASSFSYCLV-----DIDSPSFS 155

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
            LD    ++D  +  L  +P V N      + F  + YV +  ++VGG+ + +      +
Sbjct: 156 TLD---FNTDPPSDSLI-SPLVKN------DRFPSFRYVKVIGMSVGGKPLPISSSRFEI 205

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
           D  G GG IVDSGTT T +  +++E L + F+         T    A  ++    C+D+ 
Sbjct: 206 DESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLT------TNLPPAPEISPFDTCYDLS 259

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
            +     P +     G   + LP +N    V      CL  V    ++  P  I+GNFQ 
Sbjct: 260 SQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFV----SATFPLSIIGNFQQ 315

Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
           Q   V YDL N  +GF    C
Sbjct: 316 QGIRVSYDLTNSLVGFSTNKC 336


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 133/487 (27%), Positives = 205/487 (42%), Gaps = 83/487 (17%)

Query: 5   ISALCLSFIFFFTLLSIFP-SSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSS 55
           ++ LC   +  F+L  I   S   S  FS+   H +        P+++ YQ+       S
Sbjct: 1   MNTLCFLTLSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRS 60

Query: 56  LTRALHI-KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
           + RA H  K+  T T  +T             GGY ++ S GTPP  I  I DTGS +VW
Sbjct: 61  INRANHFFKDSDTSTPESTVIP--------DRGGYLMTYSVGTPPTKIYGIADTGSDIVW 112

Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
             C     C+ C +   P F P  SSS + + C +  C      S++   C+D+      
Sbjct: 113 LQCE---PCEQCYNQTTPIFNPSKSSSYKNIPCLSKLC-----HSVRDTSCSDQ------ 158

Query: 175 NCTQICPSYLVLYG-SGLTEGIALSETLNLPNR-----IIPNFLVGCSVLSS----RQPA 224
           N  Q    Y + YG S  ++G    +TL+L +        P  ++GC   ++       +
Sbjct: 159 NSCQ----YKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASS 214

Query: 225 GIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTY 281
           GI G G G  SL +QL      KFSYCL+     ++  +S L   + +  S     G+  
Sbjct: 215 GIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVVSGD---GVVS 271

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
           TP +            V+Y++ L+  +VG +RV         D +GN   I+DSGTT T 
Sbjct: 272 TPLIKKD--------PVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGN--IIIDSGTTLTL 321

Query: 342 MAPELFEPLADEFVS----QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
           +  +++  L    V       V + N   +L          C+ +   +   FP +  HF
Sbjct: 322 IPSDVYTNLESAVVDLVKLDRVDDPNQQFSL----------CYSLKSNEY-DFPIITAHF 370

Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
           K GA++ L   + F  + +G  VC       + S     I GN   QN  V YDL+ + +
Sbjct: 371 K-GADIELHSISTFVPITDG-IVCFAF----QPSPQLGSIFGNLAQQNLLVGYDLQQKTV 424

Query: 458 GFKQQLC 464
            FK   C
Sbjct: 425 SFKPTDC 431


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 111/382 (29%), Positives = 154/382 (40%), Gaps = 53/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTPPQ +   LD      W PC     C  CSS+    F    S++ + LGC 
Sbjct: 35  YIVKAKVGTPPQTLLMALDNSYDAAWIPCKG---CVGCSST---VFNTVKSTTFKTLGCG 88

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P+C  + +           P+     CT     +   YGS         +T+ L    +
Sbjct: 89  APQCKQVPN-----------PICGGSTCT-----WNTTYGSSTILSNLTRDTIALSMDPV 132

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQL-NLDK--FSYCLLSHKFDDTTRTSS 262
           P +  GC   +  SS  P G+ GFGRG  S  SQ  NL K  FSYCL S  F     + S
Sbjct: 133 PYYAFGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPS--FRTLNFSGS 190

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L         KTT     P + NP    R++    YYV L  I VG + V +    L 
Sbjct: 191 LRLGPVGQPPRIKTT-----PLLKNP---RRSSL---YYVKLNGIRVGRKIVDIPRSALA 239

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +    +  + +EF       R         +L G   C+ V
Sbjct: 240 FNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEF-------RKRVGNATVSSLGGFDTCYSV 292

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F G   VT+P EN       G   CL +    +       ++ + Q
Sbjct: 293 PIVP----PTITFMFSG-MNVTMPPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQ 347

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ + +D+ N RLG  ++ C
Sbjct: 348 QQNHRILFDVPNSRLGVAREQC 369


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 118/447 (26%), Positives = 174/447 (38%), Gaps = 53/447 (11%)

Query: 33  LSRFHT--NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
           LS +H    PS    +++ +L  +   R L +    +K  ++   T+    S  +   Y 
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDARLLFLS---SKAASSGGVTSAPVASGQTPPSYV 80

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +    GTP Q +   LDT +   W  C     C  C +     FIP  SSS   L C + 
Sbjct: 81  VRAGLGTPVQQLLLALDTSADATWSHCA---PCDTCPAGS--RFIPASSSSYASLPCASD 135

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
            C     +          PL     C    P     + + L      S+TL L    I  
Sbjct: 136 WCPLFEGQPCPANQDASAPL---PACAFSKPFADTSFQASLG-----SDTLRLGKDAIAG 187

Query: 211 FLVGCSVLSSRQPA------GIAGFGRGKTSLPSQLNL---DKFSYCLLSHK---FDDTT 258
           +  GC V +   P       G+ G GRG  SL SQ        FSYCL S++   F  + 
Sbjct: 188 YAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSL 246

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
           R  +            +   + YTP + NP    R +    YYV +  ++VG   V+V  
Sbjct: 247 RLGAA----------GQPRNVRYTPLLTNP---HRPSL---YYVNVTGLSVGRTWVKVPA 290

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
                D     GT++DSGT  T     ++  L +EF  Q+     YT +LGA        
Sbjct: 291 GSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----FDT 344

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CF+      G  P + LH  GG ++TLP+EN           CL +    +       ++
Sbjct: 345 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 404

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            N Q QN  V  D+   R+GF ++ C 
Sbjct: 405 ANLQQQNVRVVVDVAGSRVGFAREPCN 431


>gi|15450651|gb|AAK96597.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 110

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 68/106 (64%), Gaps = 4/106 (3%)

Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
           NYTR    E  TGL PCF++ G+   + PEL   FKGGA++ LP+ NYF  VG    VCL
Sbjct: 3   NYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCL 62

Query: 423 TVVTDR----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           TVV+D+        GP+IILG+FQ QNY VEYDL N R GF ++ C
Sbjct: 63  TVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 108


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 111/391 (28%), Positives = 156/391 (39%), Gaps = 49/391 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +S G+PP     ++D+GS ++W  C     C  C     P F P  S++   + 
Sbjct: 169 GEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCK---PCLECYVQADPLFDPATSATFSGVS 225

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  +   +     C D  L   +        Y V Y  G  T+G    ETL L  
Sbjct: 226 CGSAICRILPTSA-----CGDGELGGCE--------YEVSYADGSYTKGALALETLTLGG 272

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH------K 253
             +   ++GC   +       AG+ G G G  SL  QL  +    FSYCL S        
Sbjct: 273 TAVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGA 332

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
            DD      L+L      S+    G  + P V NP          +YYVGL  I VG +R
Sbjct: 333 ADDDA--GWLVL----GRSEAVPEGAVWVPLVRNPRAPS------FYYVGLSGIEVGDER 380

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + +      L  DG G  ++D+GTT T +  E +  L D FV  +       RA G  + 
Sbjct: 381 LPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAG--AVPRAQGVSSS 438

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
             L  C+D+ G  +   P +   F G A + L   N    V  G   CL       +S G
Sbjct: 439 V-LDTCYDLSGYASVRVPTVSFCFDGDARLILAARNVLLEVDMG-IYCLAFA---PSSSG 493

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            S I+GN Q     +  D  N  +GF    C
Sbjct: 494 LS-IMGNTQQAGIQITVDSANGYIGFGPANC 523


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 157/383 (40%), Gaps = 55/383 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +++S GTP       +DTGS L W  CT       C S K P F P  SSS   + C 
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCT-PCAAPACYSQKDPLFDPAQSSSYAAVPCG 198

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNR 206
            P C  +   +  C         ++  C      Y+V YG G  T G+  S+TL L PN 
Sbjct: 199 GPVCGGLGIYASSC---------SAAQC-----GYVVSYGDGSKTTGVYSSDTLTLSPND 244

Query: 207 IIPNFLVGCSVLSSRQPA--GIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTS 261
            +  F  GC    S      G+ G GR + SL  Q        FSYCL        TR S
Sbjct: 245 AVRGFFFGCGHAQSGFTGNDGLLGLGREEASLVEQTAGTYGGVFSYCL-------PTRPS 297

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           +         S     G + T  +++P+ A       YY V L  I+VGGQ++ V     
Sbjct: 298 TTGYLTLGGPSGAAPPGFSTTQLLSSPNAA------TYYVVMLTGISVGGQQLSVPSSVF 351

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
                  GGT+VD+GT  T + P  +  L   F S M  +  Y     A A   L  C++
Sbjct: 352 A------GGTVVDTGTVITRLPPTAYAALRSAFRSGMA-SYGYPS---APATGILDTCYN 401

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
             G  T + P + L F GGA VTL  +         S  CL        S G   ILGN 
Sbjct: 402 FSGYGTVTLPNVALTFSGGATVTLGADGIL------SFGCLAFAP--SGSDGGMAILGNV 453

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q +++ V  D     +GFK   C
Sbjct: 454 QQRSFEVRID--GTSVGFKPSSC 474


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 118/447 (26%), Positives = 174/447 (38%), Gaps = 53/447 (11%)

Query: 33  LSRFHT--NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
           LS +H    PS    +++ +L  +   R L +    +K  ++   T+    S  +   Y 
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDARLLFLS---SKAASSGGVTSAPVASGQTPPSYV 80

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +    GTP Q +   LDT +   W  C     C  C +     FIP  SSS   L C + 
Sbjct: 81  VRAGLGTPVQQLLLALDTSADATWSHCA---PCDTCPAGS--RFIPASSSSYASLPCASD 135

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
            C     +          PL     C    P     + + L      S+TL L    I  
Sbjct: 136 WCPLFEGQPCPANQDASAPL---PACAFSKPFADTSFQASLG-----SDTLRLGKDAIAG 187

Query: 211 FLVGCSVLSSRQPA------GIAGFGRGKTSLPSQLNL---DKFSYCLLSHK---FDDTT 258
           +  GC V +   P       G+ G GRG  SL SQ        FSYCL S++   F  + 
Sbjct: 188 YAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSL 246

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
           R  +            +   + YTP + NP    R +    YYV +  ++VG   V+V  
Sbjct: 247 RLGAA----------GQPRNVRYTPLLTNP---HRPSL---YYVNVTGLSVGRTWVKVPA 290

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
                D     GT++DSGT  T     ++  L +EF  Q+     YT +LGA        
Sbjct: 291 GSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----FDT 344

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CF+      G  P + LH  GG ++TLP+EN           CL +    +       ++
Sbjct: 345 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 404

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            N Q QN  V  D+   R+GF ++ C 
Sbjct: 405 ANLQQQNVRVVVDVAGSRVGFAREPCN 431


>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 451

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 111/386 (28%), Positives = 165/386 (42%), Gaps = 47/386 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS-SRLL 145
           G Y + +  G+P Q+   +LDT +   W PCT    C  CSSS    + P+ S++    +
Sbjct: 106 GSYVVRVKLGSPNQLFFMVLDTSTDEAWVPCTG---CTGCSSSST-YYSPQASTTYGGAV 161

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
            C  P+C+       Q R     P   SK CT     +   Y         + ++L L  
Sbjct: 162 ACYAPRCA-------QARGALPCPYTGSKACT-----FNQSYAGSTFSATLVQDSLRLGI 209

Query: 206 RIIPNFLVGCS------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
             +P++  GC        L ++   G+        S  S+L    FSYCL S  F  +  
Sbjct: 210 DTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPS--FQSSYF 267

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           + SL L  G +   ++   +  TP + NP    R +    YYV L  +TVG  +V +  +
Sbjct: 268 SGSLKL--GPTGQPRR---IRTTPLLQNP---RRPSL---YYVNLTGVTVGRVKVPLPIE 316

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
           YL  D +   GTI+DSGT  T     ++  + DEF +Q VK   ++R        G   C
Sbjct: 317 YLAFDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQ-VKGPFFSRG-------GFDTC 368

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           F    E     P +KL F  G +VTLP EN       G   CL +            ++ 
Sbjct: 369 FVKTYENLT--PLIKLRFT-GLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIA 425

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
           N+Q QN  V +D  N R+G  ++LC 
Sbjct: 426 NYQQQNLRVLFDTVNNRVGIARELCN 451


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 118/447 (26%), Positives = 174/447 (38%), Gaps = 53/447 (11%)

Query: 33  LSRFHT--NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
           LS +H    PS    +++ +L  +   R L +    +K  ++   T+    S  +   Y 
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDARLLFLS---SKAASSGGITSAPVASGQTPPSYV 80

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +    GTP Q +   LDT +   W  C     C  C +     FIP  SSS   L C + 
Sbjct: 81  VRAGLGTPVQQLLLALDTSADATWSHCA---PCDTCPAGS--RFIPASSSSYASLPCASD 135

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
            C     +          PL     C    P     + + L      S+TL L    I  
Sbjct: 136 WCPLFEGQPCPANQDASAPL---PACAFSKPFADTSFQASLG-----SDTLRLGKDAIAG 187

Query: 211 FLVGCSVLSSRQPA------GIAGFGRGKTSLPSQLNL---DKFSYCLLSHK---FDDTT 258
           +  GC V +   P       G+ G GRG  SL SQ        FSYCL S++   F  + 
Sbjct: 188 YAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSL 246

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
           R  +            +   + YTP + NP    R +    YYV +  ++VG   V+V  
Sbjct: 247 RLGAA----------GQPRNVRYTPLLTNP---HRPSL---YYVNVTGLSVGRTWVKVPA 290

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
                D     GT++DSGT  T     ++  L +EF  Q+     YT +LGA        
Sbjct: 291 GSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----FDT 344

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CF+      G  P + LH  GG ++TLP+EN           CL +    +       ++
Sbjct: 345 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 404

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            N Q QN  V  D+   R+GF ++ C 
Sbjct: 405 ANLQQQNVRVVVDVAGSRVGFAREPCN 431


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 118/427 (27%), Positives = 180/427 (42%), Gaps = 51/427 (11%)

Query: 59  ALHIKNPQTKTTT----TTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
           AL  +  +T  +T    +TT+ TT  +  H     ++SL+ GTP Q I  +LDTGS L W
Sbjct: 33  ALRTQKHRTPISTPRLFSTTSKTTDKLLFHHNVTLTVSLTAGTPLQNITMVLDTGSELSW 92

Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
             C                F P  S +   + C +P C              D PL  S 
Sbjct: 93  LHCKKEPNFNSI-------FNPLASKTYTKIPCSSPTCE---------TRTRDLPLPVSC 136

Query: 175 NCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIA 227
           +  ++C   +    +   EG    ET  + +   P  + GC  S  SS      +  G+ 
Sbjct: 137 DPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATVFGCMDSGFSSNSEEDAKTTGLM 196

Query: 228 GFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
           G  RG  S  +Q+   KFSYC+      D   +  L+L   S    K    L YTP V  
Sbjct: 197 GMNRGSLSFVNQMGFRKFSYCI-----SDRDSSGVLLLGEASFSWLKP---LNYTPLVEM 248

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
            S        V Y V L  I V  + + +       D  G G T+VDSGT FTF+   ++
Sbjct: 249 -STPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVY 307

Query: 348 EPLADEFVSQ---MVKNRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAE 402
             L  EF+ Q   +++  N  R +   A+     C+ +   +    + P + L F+ GAE
Sbjct: 308 SALKQEFLLQTKGVLRVLNEPRYVFQGAMD---LCYLIEPTRAALPNLPVVNLMFR-GAE 363

Query: 403 VTLPVENYF-----AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
           +++  +         V G+ S  C T   + ++ G  S ++G+ Q QN ++EYDL   R+
Sbjct: 364 MSVSGQRLLYRVPGEVRGKDSVWCFT-FGNSDSLGIESFVIGHHQQQNVWMEYDLEKSRI 422

Query: 458 GFKQQLC 464
           GF +  C
Sbjct: 423 GFAEVRC 429


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 105/397 (26%), Positives = 169/397 (42%), Gaps = 52/397 (13%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           +IS++ GTPPQ +  ++DTGS L W  C  +      ++   P F P +SSS   + C +
Sbjct: 67  TISITVGTPPQNMSMVIDTGSELSWLHCNTNTT----ATIPYPFFNPNISSSYTPISCSS 122

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
           P C+             D P+  S +   +C + L    +  +EG   S+T    +   P
Sbjct: 123 PTCT---------TRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNP 173

Query: 210 NFLVGC-------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
             + GC       +  S     G+ G   G  SL SQL + KFSYC+    F      S 
Sbjct: 174 GIVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKFSYCISGSDF------SG 227

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPS---VAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           ++L   S+ S   +  L YTP V   +     +R+A    Y V L  I +  + + +   
Sbjct: 228 ILLLGESNFSWGGS--LNYTPLVQISTPLPYFDRSA----YTVRLEGIKISDKLLNISGN 281

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GL 376
               D  G G T+ D GT F+++   ++  L DEF++Q        RAL          +
Sbjct: 282 LFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQ---TNGTLRALDDPNFVFQIAM 338

Query: 377 RPCFDVPGEKTG--SFPELKLHFKG------GAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
             C+ VP  ++     P + L F+G      G ++   V  +  V G  S  C T   + 
Sbjct: 339 DLCYRVPVNQSELPELPSVSLVFEGAEMRVFGDQLLYRVPGF--VWGNDSVYCFT-FGNS 395

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           +  G  + I+G+   Q+ ++E+DL   R+G     C 
Sbjct: 396 DLLGVEAFIIGHHHQQSMWMEFDLVEHRVGLAHARCD 432


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 119/412 (28%), Positives = 174/412 (42%), Gaps = 98/412 (23%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y+  L  GTPPQ    I+DTGS + + PC+    CK C   + P F P+LS+S + 
Sbjct: 72  SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST---CKQCGKHQDPKFQPELSTSYQA 128

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL--- 201
           L C NP C           +C+DE     K C      Y   Y    +    LSE L   
Sbjct: 129 LKC-NPDC-----------NCDDE----GKLCV-----YERRYAEMSSSSGVLSEDLISF 167

Query: 202 NLPNRIIPNFLV-GCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDK------FSYCL 249
              +++ P   V GC       L S++  GI G GRGK S+  QL +DK      FS C 
Sbjct: 168 GNESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQL-VDKGVIEDVFSLCY 226

Query: 250 LSHKFDDTTRTSSLILDNGS-------SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
              +        +++L   S       SHSD         PF            S YY +
Sbjct: 227 GGMEVGG----GAMVLGKISPPPGMVFSHSD---------PFR-----------SPYYNI 262

Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
            L+++ V G+ +++  K      +G  GT++DSGTT+ +   E F  + D  + ++   +
Sbjct: 263 DLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLK 318

Query: 363 NYTRALGAEALTGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FA 412
                     + G  P     CF   G         FPE+ + F  G ++ L  ENY F 
Sbjct: 319 R---------IHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFR 369

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 A CL +  DR++    + +LG   ++N  V YD  N +LGF +  C
Sbjct: 370 HTKVRGAYCLGIFPDRDS----TTLLGGIVVRNTLVTYDRENDKLGFLKTNC 417


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 119/412 (28%), Positives = 174/412 (42%), Gaps = 98/412 (23%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y+  L  GTPPQ    I+DTGS + + PC+    CK C   + P F P+LS+S + 
Sbjct: 72  SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST---CKQCGKHQDPKFQPELSTSYQA 128

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL--- 201
           L C NP C           +C+DE     K C      Y   Y    +    LSE L   
Sbjct: 129 LKC-NPDC-----------NCDDE----GKLCV-----YERRYAEMSSSSGVLSEDLISF 167

Query: 202 NLPNRIIPNFLV-GCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDK------FSYCL 249
              +++ P   V GC       L S++  GI G GRGK S+  QL +DK      FS C 
Sbjct: 168 GNESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQL-VDKGVIEDVFSLCY 226

Query: 250 LSHKFDDTTRTSSLILDNGS-------SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
              +        +++L   S       SHSD         PF            S YY +
Sbjct: 227 GGMEVG----GGAMVLGKISPPPGMVFSHSD---------PFR-----------SPYYNI 262

Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
            L+++ V G+ +++  K      +G  GT++DSGTT+ +   E F  + D  + ++   +
Sbjct: 263 DLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLK 318

Query: 363 NYTRALGAEALTGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FA 412
                     + G  P     CF   G         FPE+ + F  G ++ L  ENY F 
Sbjct: 319 R---------IHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFR 369

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 A CL +  DR++    + +LG   ++N  V YD  N +LGF +  C
Sbjct: 370 HTKVRGAYCLGIFPDRDS----TTLLGGIVVRNTLVTYDRENDKLGFLKTNC 417


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 102/389 (26%), Positives = 161/389 (41%), Gaps = 40/389 (10%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  S GTPPQ +   +DT +   W PC   + C     +  PSF P  S++ R + C 
Sbjct: 94  YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGCP----TTAPSFNPASSATFRPVPCG 149

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
            P CS   + S     C    LA SKN       + + YG    +     + L +     
Sbjct: 150 APPCSQAPNPS-----CTS--LAKSKNSC----GFSLSYGDSSLDATLSQDNLAVTANGG 198

Query: 207 IIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDK------FSYCLLSHKFDDTTRT 260
           +I  +  GC   S+   A   G           +   K      FSYCL S+       +
Sbjct: 199 VIKGYTFGCLTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFS 258

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            SL L      + +K   +  TP + +P    R +    YYV +  + +G + V +    
Sbjct: 259 GSLTLGRKGQPAPEK---MKTTPLLASP---HRPSL---YYVAMTGVRIGKKSVPIPPSA 309

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE----FVSQMVKNRNYTRALGAEALTGL 376
           L  D     GT++DSGT F  +A   +  + DE        + +      ++   +L G 
Sbjct: 310 LAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGF 369

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C++V    T ++P + L F GG EV LP EN       GS  CL +          ++
Sbjct: 370 DTCYNV---STVAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAAL 426

Query: 437 -ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ++G+ Q QN+ V +D+ N R+GF ++ C
Sbjct: 427 NVIGSLQQQNHRVLFDVPNARVGFARERC 455


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 154/383 (40%), Gaps = 70/383 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y      GTP Q +   +D  +   W PC+    C  C++S  PSF P  SS+ R + C 
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCS---ACAGCAASS-PSFSPTQSSTYRTVPCG 157

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P+C+ +   S         P     +C      + + Y +   + +   ++L L N ++
Sbjct: 158 SPQCAQVPSPSC--------PAGVGSSC-----GFNLTYAASTFQAVLGQDSLALENNVV 204

Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
            ++  GC  + +      AG  R          L   +  LL             + D G
Sbjct: 205 VSYTFGCLRVVNGNSRAAAGAHR----------LRPRAALLL-------------VADQG 241

Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
                 +   +  TP + NP    R +    YYV +  I VG + V+V    L  +    
Sbjct: 242 HLGPIGQPKRIKTTPLLYNP---HRPSL---YYVNMIGIRVGSKVVQVPQSALAFNPVTG 295

Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG 388
            GTI+D+GT FT +A  ++  + D F       R   R   A  L G   C++V    T 
Sbjct: 296 SGTIIDAGTMFTRLAAPVYAAVRDAF-------RGRVRTPVAPPLGGFDTCYNV----TV 344

Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-------ILGNF 441
           S P +   F G   VTLP EN       G   CL +      + GPS        +L + 
Sbjct: 345 SVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAM------AAGPSDGVNAALNVLASM 398

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q QN  V +D+ N R+GF ++LC
Sbjct: 399 QQQNQRVLFDVANGRVGFSRELC 421


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 159/386 (41%), Gaps = 52/386 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y++++  GTP +    I DTGS L W  C      K C   K P   P  S+S + + 
Sbjct: 131 GDYAVTVGLGTPKKEFTLIFDTGSDLTWTQC--EPCAKTCYKQKEPRLDPTKSTSYKNIS 188

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-P 204
           C +  C  +  E               ++C+     Y V YG G  + G   +ETL L  
Sbjct: 189 CSSAFCKLLDTEG-------------GESCSSPTCLYQVQYGDGSYSIGFFATETLTLSS 235

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTT 258
           + +  NFL GC   +S   R  AG+ G GR K SLPSQ        FSYCL        +
Sbjct: 236 SNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCL------PAS 289

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            +S   L  G   S      + +TP      ++E    + +Y + +  ++VGG ++ +  
Sbjct: 290 SSSKGYLSFGGQVSKT----VKFTP------LSEDFKSTPFYGLDITELSVGGNKLSIDA 339

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +       GT++DSGT  T +    +  L+  F   M      T     +  +    
Sbjct: 340 SIFS-----TSGTVIDSGTVITRLPSTAYSALSSAFQKLM------TDYPSTDGYSIFDT 388

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+D    +T   P++ + FKGG E+ + V      V     VCL    + +     + I 
Sbjct: 389 CYDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDDV--KAAIF 446

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q + Y V YD    R+GF    C
Sbjct: 447 GNTQQKTYQVVYDDAKGRVGFAPSGC 472


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 109/385 (28%), Positives = 162/385 (42%), Gaps = 54/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  + + GTPPQ    ++D    LVW  C    QC  C     P F P  S++ R   C 
Sbjct: 51  YVANFTIGTPPQPASAVIDLAGELVWTQCK---QCSRCFEQDTPLFDPTASNTYRAEPCG 107

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P C  I  +S              +NC+    +Y     +G T G   ++T  +     
Sbjct: 108 TPLCESIPSDS--------------RNCSGNVCAYQASTNAGDTGGKVGTDTFAV-GTAK 152

Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
            +   GC V S       P+GI G GR   SL +Q  +  FSYCL  H   D  R S+L 
Sbjct: 153 ASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGRNSALF 209

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L  GSS           TPFVN       N  S YY V L  +  G   + +     T+ 
Sbjct: 210 L--GSSAKLAGGGKAASTPFVN--ISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV- 264

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL-TGLRP---CF 380
                  ++D+ +  +F+    +         Q VK +  T A+GA  + T + P   CF
Sbjct: 265 -------LLDTFSPISFLVDGAY---------QAVK-KAVTAAVGAPPMATPVEPFDLCF 307

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILG 439
              G  +G+ P+L   F+GGA +T+P  NY      G+ VCL +++    +    + +LG
Sbjct: 308 PKSG-ASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGT-VCLAMLSSARLNSTTELSLLG 365

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           + Q +N +  +DL  + L F+   C
Sbjct: 366 SLQQENIHFLFDLDKETLSFEPADC 390


>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
 gi|238008190|gb|ACR35130.1| unknown [Zea mays]
 gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
          Length = 269

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 87/262 (33%), Positives = 127/262 (48%), Gaps = 25/262 (9%)

Query: 210 NFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
           N   GC  L++   AG   I G   G  S+  QL++ KFSYCL    F D  +TS ++  
Sbjct: 23  NLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFSYCLTP--FTDH-KTSPVMFG 79

Query: 267 NGSSHSDKKTTGLTYT-PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
             +     KTTG   T P + NP         +YYYV +  I++G +R+ V    L L  
Sbjct: 80  AMADLGKYKTTGKVQTIPLLKNP------VEDIYYYVPMVGISIGSKRLDVPEAILALRP 133

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-- 383
           DG GGT++DS TT  ++    F+ L    +  M K     R++    +     CF++P  
Sbjct: 134 DGTGGTVLDSATTLAYLVEPAFKELKKAVMEGM-KLPAANRSIDDYPV-----CFELPRG 187

Query: 384 -GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
              +    P L LHF G AE++LP ++YF     G  +CL V+      G P++I GN Q
Sbjct: 188 MSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPG-MMCLAVM-QAPFEGAPNVI-GNVQ 244

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN +V YDL N++  +    C
Sbjct: 245 QQNMHVLYDLGNRKFSYAPTKC 266


>gi|383130038|gb|AFG45739.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 154

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 58/146 (39%), Positives = 86/146 (58%), Gaps = 6/146 (4%)

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           L YTPF+ N   A  + +  +YY+ LR +++G +R+ +  K  + D  GNGGTI+DSGTT
Sbjct: 15  LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTT 73

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
           FT    E ++ +   F SQ+     + RA   EA TG+R C++V G      P+   HFK
Sbjct: 74  FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
           GG+++ LPV NYF+     S +CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSDS-ICLTM 154


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 165/391 (42%), Gaps = 62/391 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 180 GNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKL--FDPARSSTYANVS 237

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS ++                ++ C+     Y V YG G  + G    +TL L +
Sbjct: 238 CAAPACSDLY----------------TRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSS 281

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
              +  F  GC   +     + AG+ G GRGKTSLP Q   DK    F++CL +      
Sbjct: 282 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 336

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             + +  LD G   S         TP +  N P+         +YYVG+  I VGGQ + 
Sbjct: 337 --SGTGYLDFGPG-SPAAVGARQTTPMLTDNGPT---------FYYVGMTGIRVGGQLLS 384

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +     +       GTIVDSGT  T + P  +  L   F S M   R Y +   A AL+ 
Sbjct: 385 IPQSVFS-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAMAA-RGYKK---APALSL 435

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGG 433
           L  C+D  G    + P++ L F+GGA + +      Y A +   S VCL    + +    
Sbjct: 436 LDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASL---SQVCLGFAANEDDD-- 490

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              I+GN Q++ + V YD+  + +GF    C
Sbjct: 491 DVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 107/382 (28%), Positives = 155/382 (40%), Gaps = 53/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTP Q +   +DT +   W PC+    C  CSS+    F    S++ + +GC+
Sbjct: 96  YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSG---CVGCSST---VFNNVKSTTFKTVGCE 149

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P+C  + +       C                ++ + YGS         + + L    I
Sbjct: 150 APQCKQVPNSKCGGSAC----------------AFNMTYGSSSIAANLSQDVVTLATDSI 193

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
           P++  GC   +  SS  P G+ G GRG  SL SQ   L    FSYCL S  F     + S
Sbjct: 194 PSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPS--FRSLNFSGS 251

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L         KTT     P + NP    R++    YYV L  I VG + V +    L 
Sbjct: 252 LRLGPVGQPKRIKTT-----PLLKNP---RRSSL---YYVNLMAIRVGRRVVDIPPSALA 300

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +    +  + D F  + V N   T      +L G   C+  
Sbjct: 301 FNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAF-RKRVGNATVT------SLGGFDTCYTS 353

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F G   VTLP +N        S  CL +    +       ++ N Q
Sbjct: 354 PIVA----PTITFMFSG-MNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQ 408

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ + +D+ N RLG  ++ C
Sbjct: 409 QQNHRILFDVPNSRLGVAREPC 430


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 111/391 (28%), Positives = 166/391 (42%), Gaps = 56/391 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK-YCSSSKIPSFIPKLSSSSRLL 145
           G Y + L  GTPP+    ILDTGS L W  C     C  YC +   P + P +S + + L
Sbjct: 123 GNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQ---PCAVYCHAQADPLYDPSVSKTYKKL 179

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL- 203
            C + +CS +   ++     ND    T  N       Y   YG +  + G    + L L 
Sbjct: 180 SCASVECSRLKAATL-----NDPLCETDSNACL----YTASYGDTSFSIGYLSQDLLTLT 230

Query: 204 PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
            ++ +P F  GC   +     + AGI G  R K S+ +QL+      FSYCL        
Sbjct: 231 SSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCL-------P 283

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGGQRV 314
           T  S        S      T   +TP +    NPS+         Y++ L  ITV G+ +
Sbjct: 284 TANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSL---------YFLRLTAITVSGRPL 334

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            +      +       T++DSGT  T +   ++  L   FV  M      T+   A A +
Sbjct: 335 DLAAAMYRVP------TLIDSGTVITRLPMSMYAALRQAFVKIMS-----TKYAKAPAYS 383

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
            L  CF    +   + PE+K+ F+GGA++TL   +      +G    +T +    +SG  
Sbjct: 384 ILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKG----ITCLAFAGSSGTN 439

Query: 435 SI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I I+GN Q Q Y + YD+   R+GF    C
Sbjct: 440 QIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 470


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 117/392 (29%), Positives = 163/392 (41%), Gaps = 83/392 (21%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y ++++ G+PP+ +  I DTGS LVW  C         +++    F P  SS+   + CQ
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL---- 203
              C  +   +     C+D       NC     +YL  YG G  T G+  +ET       
Sbjct: 161 TDACEALGRAT-----CDD-----GSNC-----AYLYAYGDGSNTTGVLSTETFTFDDGG 205

Query: 204 ----PNRI-IPNFLVGCSVLSSRQ--PAGIAGFGRGKTSLPSQLNLD-----KFSYCLLS 251
               P ++ I     GCS  ++      G+ G G G  SL +QL        +FSYCL+ 
Sbjct: 206 AGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVP 265

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
           H  +    +S+L   N  + +D    G   TP V N +VA   +  +             
Sbjct: 266 HSVN---ASSAL---NFGALADVTEPGAASTPLVGNKTVASAASSRI------------- 306

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
                               IVDSGTT TF+ P L  P+ DE       +R  T      
Sbjct: 307 --------------------IVDSGTTLTFLDPSLLGPIVDEL------SRRITLPPVQS 340

Query: 372 ALTGLRPCFDVPG---EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
               L+ C++V G   E   S P+L L F GGA V L  EN F  V EG+ +CL +V   
Sbjct: 341 PDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGT-LCLAIVATT 399

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
           E    P  ILGN   QN +V YDL    +G K
Sbjct: 400 EQQ--PVSILGNLAQQNIHVGYDLDAGTVGNK 429



 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 54/137 (39%), Positives = 66/137 (48%), Gaps = 12/137 (8%)

Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG---EKTG 388
           IVDSGTT TF+ P L  P+ DE       +R  T          L+ C++V G   E   
Sbjct: 440 IVDSGTTLTFLDPSLLGPIVDEL------SRRITLPPVQSPDGLLQLCYNVAGREVEAGE 493

Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYV 448
           S P+L L F GGA V L  EN F  V EG+ +CL +V   E    P  ILGN   QN +V
Sbjct: 494 SIPDLTLEFGGGAAVALKPENAFVAVQEGT-LCLAIVATTEQQ--PVSILGNLAQQNIHV 550

Query: 449 EYDLRNQRLGFKQQLCK 465
            YDL    + F    C 
Sbjct: 551 GYDLDAGTVTFAVADCA 567


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 159/387 (41%), Gaps = 56/387 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP +      DTGS L W  C        C     P F P  S+S + + 
Sbjct: 138 GAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCL--GGCFPQNQPKFDPTTSTSYKNVS 195

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +  C  I   +   +DC       S  C      Y + YGSG T G   +ETL + + 
Sbjct: 196 CSSEFCKLIAEGNYPAQDC------ISNTCL-----YGIQYGSGYTIGFLATETLAIASS 244

Query: 207 -IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTR 259
            +  NFL GCS  S        G+ G GR   +LPSQ      + FSYCL +      + 
Sbjct: 245 DVFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASP----SS 300

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           T  L      S + K T      P   +P + +       Y +    I+V G+ + +   
Sbjct: 301 TGHLSFGVEVSQAAKST------PI--SPKLKQ------LYGLNTVGISVRGRELPING- 345

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
             ++ R     TI+DSGTTFTF+    +  L   F   M    NYT   G  +    +PC
Sbjct: 346 --SISR-----TIIDSGTTFTFLPSPTYSALGSAFREMMA---NYTLTNGTSS---FQPC 392

Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
           +D    G  T + P + + F+GG EV + V      V     VCL        S     I
Sbjct: 393 YDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFA--DTGSDSDFAI 450

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            GN+Q + Y V YD+    +GF  + C
Sbjct: 451 FGNYQQKTYEVIYDVAKGMVGFAPKGC 477


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 129/484 (26%), Positives = 199/484 (41%), Gaps = 80/484 (16%)

Query: 5   ISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSL 56
           IS   L+ + F+ L +IF     +  FS+   H +        P++  +Q + + V  S+
Sbjct: 4   ISPSTLALVLFY-LCNIFYLEAFNGGFSVEMIHRDSSRSPFFRPTETQFQRVANAVHRSV 62

Query: 57  TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
            RA H          T T          + G Y IS S G PP  +  I+DTGS ++W  
Sbjct: 63  NRANHFHKAHKAAKATIT---------QNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQ 113

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
           C     C+ C +     F P  S++ ++L   +  C  +            E  + S + 
Sbjct: 114 CK---PCEKCYNQTTRIFDPSKSNTYKILPFSSTTCQSV------------EDTSCSSDN 158

Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNL--PNRIIPNF---LVGC----SVLSSRQPAGI 226
            ++C  Y + YG G  ++G    ETL L   N     F   ++GC    +V    + +GI
Sbjct: 159 RKMC-EYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGI 217

Query: 227 AGFGRGKTSLPSQLNL------DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
            G G G  SL +QL         KFSYCL S     +  +S L   + +  S     G  
Sbjct: 218 VGLGNGPVSLINQLRRRSSSIGRKFSYCLASM----SNISSKLNFGDAAVVSGD---GTV 270

Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
            TP V +          V+YY+ L   +VG  R+            GN   I+DSGTT T
Sbjct: 271 STPIVTHDP-------KVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGN--IIIDSGTTLT 321

Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
            +  +++  L       +  +R        + L  L  C+    ++  + P +  HF  G
Sbjct: 322 LLPNDIYSKLESAVADLVELDR------VKDPLKQLSLCYRSTFDELNA-PVIMAHFS-G 373

Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
           A+V L   N F  V +G   CL  ++ +    GP  I GN   QN+ V YDL+ + + FK
Sbjct: 374 ADVKLNAVNTFIEVEQG-VTCLAFISSKI---GP--IFGNMAQQNFLVGYDLQKKIVSFK 427

Query: 461 QQLC 464
              C
Sbjct: 428 PTDC 431


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 119/403 (29%), Positives = 168/403 (41%), Gaps = 68/403 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y +S+  GTP + +  + DTGS L W       QC  CSS      + P F P  SS+
Sbjct: 83  GNYVVSVGLGTPARDLTVVFDTGSDLSWV------QCGPCSSGGCYHQQDPLFAPSSSST 136

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSET 200
              + C  P+C            C+  P          CP Y V+YG    T G   ++T
Sbjct: 137 FSAVRCGEPECPRARQS------CSSSP------GDDRCP-YEVVYGDKSRTVGHLGNDT 183

Query: 201 LNLP-----------NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LD 243
           L L            +  +P F+ GC   ++    +  G+ G GRGK SL SQ      +
Sbjct: 184 LTLGTTPSTNASENNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGE 243

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
            FSYCL S   +     S        +H+        +TP +N      R+    +YYV 
Sbjct: 244 GFSYCLPSSSSNAHGYLSLGTPAPAPAHA-------RFTPMLN------RSNTPSFYYVK 290

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           L  I V G+ ++V  +          G IVDSGT  T +AP  +  L   F+S M K   
Sbjct: 291 LVGIRVAGRAIKVSSRPALWP----AGLIVDSGTVITRLAPRAYSALRTAFLSAMGK-YG 345

Query: 364 YTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
           Y R   A  L+ L  C+D       T S P + L F GGA +++       V     A C
Sbjct: 346 YKR---APRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQA-C 401

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           L    +   +G  + ILGN Q +   V YD+  Q++GF  + C
Sbjct: 402 LAFAPN--GNGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 129/471 (27%), Positives = 194/471 (41%), Gaps = 68/471 (14%)

Query: 8   LCLSFIFFFTLL-SIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQ 66
           LCL  I F   L S F   I     S S F+   ++  +Q + + V  S+ RA H     
Sbjct: 12  LCLYNICFSEALKSGFSVEIIHRDSSRSPFY-RATETQFQRVTNAVRRSMNRANHFNQIS 70

Query: 67  TKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
             +    +  T  +      G Y +S S GTPP  +  I+DT S ++W  C     C+ C
Sbjct: 71  VYSNAVESPVTLLDD-----GDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQ---LCETC 122

Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
            +   P F P  S + + L C +  C     +S+Q   C+ +         +IC   +  
Sbjct: 123 YNDTSPMFDPSYSKTYKNLPCSSTTC-----KSVQGTSCSSDE-------RKICEHTVNY 170

Query: 187 YGSGLTEGIALSETLNL-----PNRIIPNFLVGC--SVLSSRQPAGIAGFGRGKTSLPSQ 239
                ++G  + ET+ L     P    P  ++GC  +   S    GI G G G  SL  Q
Sbjct: 171 KDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCIRNTNVSFDSIGIVGLGGGPVSLVPQ 230

Query: 240 LN---LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
           L+     KFSYCL       + R+S L   + +  S   T             V+ R  F
Sbjct: 231 LSSSISKKFSYCLAPI----SDRSSKLKFGDAAMVSGDGT-------------VSTRIVF 273

Query: 297 ---SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
                +YY+ L   +VG  R+            G G  I+DSGTTFT +  +++  L + 
Sbjct: 274 KDWKKFYYLTLEAFSVGNNRIEFRSSSSR--SSGKGNIIIDSGTTFTVLPDDVYSKL-ES 330

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
            V+ +VK     RA   + L     C+    +K    P +  HF  GA+V L   N F +
Sbjct: 331 AVADVVK---LERA--EDPLKQFSLCYKSTYDKV-DVPVITAHF-SGADVKLNALNTF-I 382

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           V     VCL  ++ +  +     I GN   QN+ V YDL+ + + FK   C
Sbjct: 383 VASHRVVCLAFLSSQSGA-----IFGNLAQQNFLVGYDLQRKIVSFKPTDC 428


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 119/440 (27%), Positives = 178/440 (40%), Gaps = 63/440 (14%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           N  Q   Q  N  +  S++R  H    Q    T +     + I ++  G Y +SLS GTP
Sbjct: 47  NSQQTHLQRWNKAMRRSVSRVHHF---QRTAATVSPKEVESEIIANG-GEYLMSLSLGTP 102

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
           P  I  I DTGS L+W  CT    C  C     P F PK S + R L C   +C  +   
Sbjct: 103 PFEILAIADTGSDLIWTQCT---PCDKCYKQIAPLFDPKSSKTYRDLSCDTRQCQNLGES 159

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR-----IIPNFL 212
           S     C+ E         Q+C  Y   YG    T G    +T+ LP+        P  +
Sbjct: 160 S----SCSSE---------QLC-QYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTV 205

Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
           +GC   ++    ++ +GI G G G  SL SQ+      KFSYCL+    +    +S L  
Sbjct: 206 IGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHF 265

Query: 266 DNGSSHSDKKTTGLTYTPFVN-NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
              +  S    +G+  TP ++ NP          +YY+ L  ++VG +++          
Sbjct: 266 GRNAVVSG---SGVQSTPLISKNP--------DTFYYLTLEAMSVGDKKIEFGGSSFGGS 314

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
                  I+DSGT+ T      F   A   V   V N   T+          RP  D+  
Sbjct: 315 EG---NIIIDSGTSLTLFPVNFFTEFATA-VENAVINGERTQDASGLLSHCYRPTPDL-- 368

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
                 P +  HF  GA+V L   N F ++ +   +CL   + +  +     I GN    
Sbjct: 369 ----KVPVITAHFN-GADVVLQTLNTFILISD-DVLCLAFNSTQSGA-----IFGNVAQM 417

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
           N+ + YD++ + + FK   C
Sbjct: 418 NFLIGYDIQGKSVSFKPTDC 437


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 69/220 (31%), Positives = 106/220 (48%), Gaps = 16/220 (7%)

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
           FSYCL+     D   +S LI   G          L +T  V      + N    +YYV +
Sbjct: 154 FSYCLVDRN-SDANVSSKLIF--GEDKDLLSHPELNFTTLV----AGKENPVDTFYYVQI 206

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
           + I VGG+ V +  +   +  DG+GGTI+DSGTT ++ A   ++ + + F   M K + Y
Sbjct: 207 KSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAF---MAKVKGY 263

Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
                 +    L PC++V G +    P+  + F  GA    PVENYF  +     VCL +
Sbjct: 264 PV---VKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAI 320

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +    ++     I+GN+Q QN+++ YD +  RLGF    C
Sbjct: 321 LGTPPSALS---IIGNYQQQNFHILYDTKKSRLGFAPTKC 357


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 111/410 (27%), Positives = 167/410 (40%), Gaps = 62/410 (15%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++ ++ GTPPQ +  +LDTGS L W  C   Y     +    P+F    SSS   + C +
Sbjct: 56  TVPVAVGTPPQNVTMVLDTGSELSWLLCNGSY-----APPLTPAFNASGSSSYGAVPCPS 110

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
             C W   +      C+  P     N  ++  SY     +   +G+  ++T  L     P
Sbjct: 111 TACEWRGRDLPVPPFCDTPP----SNACRVSLSYA---DASSADGVLATDTFLLTGGAPP 163

Query: 210 ---NFLVGC---------------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLS 251
                  GC                   S    G+ G  RG  S  +Q    +F+YC+  
Sbjct: 164 VAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAP 223

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRITV 309
            +         L+ D+G          L YTP +    +++   +   V Y V L  I V
Sbjct: 224 GEGPGVL----LLGDDGGVAPP-----LNYTPLIE---ISQPLPYFDRVAYSVQLEGIRV 271

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           G   + +    LT D  G G T+VDSGT FTF+  + +  L  EF SQ    R     LG
Sbjct: 272 GCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQA---RLLLAPLG 328

Query: 370 AEALT---GLRPCFDVPGEK----TGSFPELKLHFKGGAEVTLPVENYFAVV-----GEG 417
                       CF  P  +    +G  PE+ L  + GAEV +  E    +V     GEG
Sbjct: 329 EPGFVFQGAFDACFRGPEARVAAASGLLPEVGLVLR-GAEVAVSGEKLLYMVPGERRGEG 387

Query: 418 SAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            A  +  +T  + + +G  + ++G+   QN +VEYDL+N R+GF    C 
Sbjct: 388 GAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 437


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 123/406 (30%), Positives = 166/406 (40%), Gaps = 61/406 (15%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y ++LS GTPP  I  I DTGS L W        C  C   K P F P  S++   
Sbjct: 76  SGGEYMMNLSIGTPPFPILAIADTGSDLTWL---QSKPCDQCYPQKGPIFDPSNSTTFHK 132

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL 203
           L C    C+ +   +  C D           C      Y   YG    T G   S+T+ +
Sbjct: 133 LPCTTAPCNALDESARSCTD--------PTTC-----GYTYSYGDHSYTTGYLASDTVTV 179

Query: 204 PNR--IIPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL---- 250
            N    I N   GC   +      Q +GI G G G  S  SQL      KFSYCLL    
Sbjct: 180 GNASVQIRNVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLEN 239

Query: 251 --SHKFDDTTRTSSLILDNGSSHSDKKTTGLTY--TPFVNNPSVAERNAFSVYYYVGLRR 306
             S +  D+  TS ++  +    S   T G+ +  TP VN          S YYY+ +  
Sbjct: 240 EISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEP-------STYYYLTIEA 292

Query: 307 ITVGGQRV---RVWHKYLTLDRDG-----NGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           ITVG +++       K  + D         G  I+DSGTT TF+  E +  L    V ++
Sbjct: 293 ITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEI 352

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
              R         +L     CF   G++    P +K+HF+GGA+V L   N F    EG 
Sbjct: 353 KMERVNDVKNSMFSL-----CFK-SGKEEVELPLMKVHFRGGADVELKPVNTFVRAEEG- 405

Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            VC T++   +       I GN    N+ V YDL  + + F    C
Sbjct: 406 LVCFTMLPTNDVG-----IYGNLAQMNFVVGYDLGKRTVSFLPADC 446


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 116/438 (26%), Positives = 169/438 (38%), Gaps = 53/438 (12%)

Query: 37  HTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG---YSISL 93
           H     D  +     V++ + R  H      K +        T++ S    G   Y + +
Sbjct: 88  HRRGFNDRMKRDAIRVATLVRRLSHGAPAAVKDSRYKVANFATDVISGMEAGSGEYFVRI 147

Query: 94  SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
             G+PP+    ++D+GS +VW  C     C  C     P F P  SSS   + C +  C 
Sbjct: 148 GVGSPPRNQYMVIDSGSDIVWVQCK---PCSRCYQQSDPVFDPADSSSFAGVSCGSDVCD 204

Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFL 212
            + +       C                 Y V YG G  T+G    ETL +   +I +  
Sbjct: 205 RLENTGCNAGRCR----------------YEVSYGDGSYTKGTLALETLTVGQVMIRDVA 248

Query: 213 VGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILD 266
           +GC   +       AG+ G G G  S   QL       FSYCL+S     T  T +L   
Sbjct: 249 IGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRG---TGSTGALEFG 305

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
            G+        G T+   + NP          +YY+GL  I VGG RV V  +   L   
Sbjct: 306 RGA-----LPVGATWISLIRNPRAPS------FYYIGLAGIGVGGVRVSVPEETFQLTEY 354

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK 386
           G  G ++D+GT  T      +    D F +Q     N  RA G         C+D+ G +
Sbjct: 355 GTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQ---TSNLPRAPGVSIFD---TCYDLNGFE 408

Query: 387 TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNY 446
           +   P +  +F  G  +TLP  N+   V  G   CL       +  G SII GN Q +  
Sbjct: 409 SVRVPTVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFA---PSPSGLSII-GNIQQEGI 464

Query: 447 YVEYDLRNQRLGFKQQLC 464
            + +D  N  +GF   +C
Sbjct: 465 QISFDGANGFVGFGPNIC 482


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 173/388 (44%), Gaps = 44/388 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTP +    I+DTGS L W  C       YC     P F P  S + + L 
Sbjct: 111 GNYYVKIGLGTPAKYFSMIVDTGSSLSWLQC--QPCVIYCHVQVDPIFTPSTSKTYKALP 168

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C + +CS +   ++    C+        N T  C  Y   YG +  + G    + L L  
Sbjct: 169 CSSSQCSSLKSSTLNAPGCS--------NATGAC-VYKASYGDTSFSIGYLSQDVLTLTP 219

Query: 206 RIIPN--FLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDT 257
              P+  F+ GC   +     + +GI G    K S+  QL+    + FSYCL S      
Sbjct: 220 SEAPSSGFVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPN 279

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           + + S  L  G+S        LT +P+   P V  +   S+ Y++ L  ITV G+ + V 
Sbjct: 280 SSSLSGFLSIGASS-------LTSSPYKFTPLVKNQKIPSL-YFLDLTTITVAGKPLGVS 331

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                     N  TI+DSGT  T +   ++  L   FV  ++ ++ Y +A G    + L 
Sbjct: 332 ASSY------NVPTIIDSGTVITRLPVAVYNALKKSFV--LIMSKKYAQAPG---FSILD 380

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            CF    ++  + PE+++ F+GGA + L   N    + +G+  CL +     AS  P  I
Sbjct: 381 TCFKGSVKEMSTVPEIQIIFRGGAGLELKAHNSLVEIEKGT-TCLAIA----ASSNPISI 435

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           +GN+Q Q + V YD+ N ++GF    C+
Sbjct: 436 IGNYQQQTFKVAYDVANFKIGFAPGGCQ 463


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 113/373 (30%), Positives = 162/373 (43%), Gaps = 50/373 (13%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           I+DTGS L W  C        C + + P F P  S +   + C +P C+         +D
Sbjct: 197 IVDTGSDLTWVQC-EPCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACA------ASLKD 249

Query: 165 CNDEPLATSK---NCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLS 219
               P + ++   N  Q C  Y + YG G  + G+   +TL L     +  F+ GC  LS
Sbjct: 250 ATGAPGSCARSAGNSEQRC-YYALSYGDGSFSRGVLAQDTLGLGTTTKLDGFVFGCG-LS 307

Query: 220 SRQ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
           +R      AG+ G GR   SL SQ        FSYCL +     TT T SL L  G S S
Sbjct: 308 NRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPA----TTTSTGSLSLGPGPSSS 363

Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
                 + YT  + +P+         +Y++ +    V           LT    G G  +
Sbjct: 364 FPN---MAYTRMIADPTQPP------FYFINITGAAV------GGGAALTAPGFGAGNVL 408

Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
           VDSGT  T +AP +++ +  EF  +      Y  A G    + L  C+D+ G    + P 
Sbjct: 409 VDSGTVITRLAPSVYKAVRAEFARRF----EYPAAPG---FSILDACYDLTGRDEVNVPL 461

Query: 393 LKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
           L L  +GGA+VT+      F V  +GS VCL + +       P  I+GN+Q +N  V YD
Sbjct: 462 LTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTP--IIGNYQQRNKRVVYD 519

Query: 452 LRNQRLGFKQQLC 464
               RLGF  + C
Sbjct: 520 TVGSRLGFADEDC 532


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 119/431 (27%), Positives = 187/431 (43%), Gaps = 57/431 (13%)

Query: 49  NSLVSS--SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG-GYSISLSFGTPPQIIPFI 105
           +S++SS  SL R  +++  +T+     T     N+ +   G  + ++ S G PP      
Sbjct: 49  DSILSSYQSLDRN-NVERRRTRRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVG 107

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
           +DTGS L+W  C     C  C     P F P  SS+   L   +P C             
Sbjct: 108 IDTGSDLLWVQCR---PCADCFRQSTPIFDPSKSSTYVDLSYDSPICP------------ 152

Query: 166 NDEPLATSKNCTQICPSYLVLYGSGLTEGIALS------ETLNLPNRIIPNFLVGCSVLS 219
            + P     +  Q    Y   Y  G T    L+      ET +     + + + GC   S
Sbjct: 153 -NSPQKKYNHLNQCI--YNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCG-HS 208

Query: 220 SR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
           +R     Q +GI G   G  S+ S+L   +FSYC+    FD     + L+L +G      
Sbjct: 209 NRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFDPHYTHNQLVLGDGV----- 261

Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
           K  G + TPF         + F+ +YYV L  I+VG  R+ +  +       G GG ++D
Sbjct: 262 KMEG-SSTPF---------HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 311

Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELK 394
           SGTT TF+A + F+PL++E + ++V  R + + +    + G         E    FPEL 
Sbjct: 312 SGTTATFLAKDGFDPLSNE-IQRLV--RGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELA 368

Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
            HF  GA++ L   + F V       CL V+     + G   ++G    Q+Y V YDL  
Sbjct: 369 FHFAEGADLVLDANSLF-VQKNQDVFCLAVLESNLKNIGS--VIGIMAQQHYNVAYDLIG 425

Query: 455 QRLGFKQQLCK 465
           +R+ F++  C+
Sbjct: 426 KRVYFQRTDCE 436


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 159/387 (41%), Gaps = 57/387 (14%)

Query: 92  SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
           + + GTPPQ    I+D    LVW  C+    C  C    +P F+P  SS+ R   C    
Sbjct: 70  NFTIGTPPQPASAIIDVAGELVWTQCS---MCSRCFKQDLPLFVPNASSTFRPEPCGTDA 126

Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
           C  I   +     C  E    SK               G T GI  ++T  +      + 
Sbjct: 127 CKSIPTSNCSSNMCTYEGTINSKL-------------GGHTLGIVATDTFAI-GTATASL 172

Query: 212 LVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
             GC V S       P+G+ G GR  +SL SQ+N+ KFSYCL  H   D+ + S L+L  
Sbjct: 173 GFGCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCLTPH---DSGKNSRLLL-- 227

Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
           GSS         T TPFV     +  +  S YY + L  I  G          + L   G
Sbjct: 228 GSSAKLAGGGNSTTTPFVK---TSPGDDMSQYYPIQLDGIKAG-------DAAIALPPSG 277

Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE-ALTGLRP---CFDVP 383
           N   +V +    +F+    ++ L  E           T+A+GA    T L+P   CF   
Sbjct: 278 N-TVLVQTLAPMSFLVDSAYQALKKEV----------TKAVGAAPTATPLQPFDLCFPKA 326

Query: 384 GEKTGSFPELKLHF-KGGAEVTLPVENYFAVVGEGSA-VCLTVVT----DREASGGPSII 437
           G    S P+L   F +G A +T+P   Y   VGE    VC+ +++    +  A      I
Sbjct: 327 GLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNI 386

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           LG+ Q +N +   DL  + L F+   C
Sbjct: 387 LGSLQQENTHFLLDLEKKTLSFEPADC 413


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 111/389 (28%), Positives = 160/389 (41%), Gaps = 58/389 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 178 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKL--FDPARSSTYANVS 235

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS ++        C                 Y V YG G  + G    +TL L +
Sbjct: 236 CAAPACSDLNIHGCSGGHC----------------LYGVQYGDGSYSIGFFAMDTLTLSS 279

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
              +  F  GC   +     + AG+ G GRGKTSLP Q   DK    F++CL +      
Sbjct: 280 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 334

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             T +  LD G+         LT TP +  N P+         +YYVG+  I VGGQ + 
Sbjct: 335 --TGTGYLDFGAGSLAAARARLT-TPMLTENGPT---------FYYVGMTGIRVGGQLLS 382

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +             GTIVDSGT  T + P  +  L     +  +  R Y +   A A++ 
Sbjct: 383 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSL-RYAFAAAMAARGYKK---APAVSL 433

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
           L  C+D  G    + P + L F+GGA + +            S VCL    + +  GG  
Sbjct: 434 LDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIM-YAASASQVCLAFAANED--GGDV 490

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q++ + V YD+  + +GF    C
Sbjct: 491 GIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 115/389 (29%), Positives = 163/389 (41%), Gaps = 54/389 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP +    +LDTGS +VW  C     C+ C S   P F P  S S   +G
Sbjct: 6   GEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE---PCRECYSQADPIFNPSSSVSFSTVG 62

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  CS      +   DC+         C      Y V YG G  T G   +ETL    
Sbjct: 63  CDSAVCS-----QLDANDCH------GGGCL-----YEVSYGDGSYTVGSYATETLTFGT 106

Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
             I N  +GC   +V      AG+ G G G  S P+QL       FSYCL+     D   
Sbjct: 107 TSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLV-----DRDS 161

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWH 318
            SS  L+ G    +    G  +TP V NP +        +YY+ +  I+VGG  +  V  
Sbjct: 162 ESSGTLEFG---PESVPIGSIFTPLVANPFLP------TFYYLSMVAISVGGVILDSVPS 212

Query: 319 KYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
           +   +D   G GG I+DSGT  T +    ++ L D F++     ++  R   A+ ++   
Sbjct: 213 EAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIA---GTQHLPR---ADGISIFD 266

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSI 436
            C+D+   ++ S P +  HF  GA   LP +N    +      C      D   S     
Sbjct: 267 TCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLS----- 321

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           I+GN Q Q   V +D  N  +GF    C+
Sbjct: 322 IMGNIQQQGIRVSFDSANSLVGFAIDQCQ 350


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 109/397 (27%), Positives = 169/397 (42%), Gaps = 67/397 (16%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y+  L  GTPPQ    I+DTGS + + PC+    C+ C   + P F P+ SS+ + 
Sbjct: 84  SNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCST---CEQCGKHQDPRFQPESSSTYKP 140

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS---ETL 201
           + C NP C           +C+DE     K CT       +   SGL     LS   E+ 
Sbjct: 141 MQC-NPSC-----------NCDDE----GKQCTYERRYAEMSSSSGLLAEDVLSFGNESE 184

Query: 202 NLPNRIIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLNLDKF---SYCLLSHK 253
             P R I     GC       L S++  GI G GRG  S+  QL + +    S+ L    
Sbjct: 185 LTPQRAI----FGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGG 240

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQ 312
            D      +++L N     D                 A  + + S YY + L+ + V G+
Sbjct: 241 MD--VVGGAMVLGNIPPPPDM--------------VFAHSDPYRSAYYNIELKELHVAGK 284

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+++  +      DG  GT++DSGTT+ ++  E F    D     ++K   + + +    
Sbjct: 285 RLKLNPRVF----DGKHGTVLDSGTTYAYLPEEAFVAFKDA----IIKEIKFLKQIHGPD 336

Query: 373 LTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTD 427
            +    CF   G         FPE+ + F  G +++L  ENY F       A CL +  +
Sbjct: 337 PSYNDICFSGAGRDVSQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQN 396

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +    P+ +LG   ++N  V YD  N ++GF +  C
Sbjct: 397 GK---DPTTLLGGIVVRNTLVTYDRDNDKIGFWKTNC 430


>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 441

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 119/427 (27%), Positives = 185/427 (43%), Gaps = 31/427 (7%)

Query: 42  QDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG-GYSISLSFGTPPQ 100
            + + N+N   S S    L I       +TT T     +IS + Y     ++L  GTPPQ
Sbjct: 27  HNKHHNVNDSFSLSFPLTLSI------NSTTKTNPIVPSISPYKYSMALVVTLPIGTPPQ 80

Query: 101 IIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESI 160
           +   +LDTGS + W  C N    +        SF P LSSS   L C +P C        
Sbjct: 81  LQQMVLDTGSQVSWIHCDNKKGPQKKQPPTTSSFDPSLSSSFFALPCNHPLCK------- 133

Query: 161 QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVL 218
                 D  L T  +  ++C  Y   Y  G + EG  + E + L P+   P  ++GC+  
Sbjct: 134 --PQVPDISLPTDCDANRLC-HYSFSYTDGTVVEGNLVRENIALSPSLTTPPIILGCAN- 189

Query: 219 SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
            S    GI G   G+ S P+Q  + KFSY +   +      + SL L N  + S  +   
Sbjct: 190 QSDDARGILGMNLGRLSFPNQAKITKFSYFVPVKQ--TQPGSGSLYLGNNPNSSCFRYVK 247

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           L    F  + S    N   + + + ++ I++GG+++ +       D  G G TI+DSG+ 
Sbjct: 248 LLT--FSKSQSQRMPNLDPLAFTLPMQGISIGGKKLNIPPSVFKPDTTGFGQTIIDSGSE 305

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHF 397
           F++M  + +  + +E V ++          G  A      CFD    + G    ++   F
Sbjct: 306 FSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVADI----CFDGDATEIGRLVGDMVFEF 361

Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
           + G E+ +P E     V +G   C  +    E  GG   I+GNF  QN +VE+DL   R+
Sbjct: 362 EKGVEIVIPKERVLIEV-DGGVHCFGI-GRAEGLGGGGNIIGNFYQQNLWVEFDLAKHRV 419

Query: 458 GFKQQLC 464
           GF+   C
Sbjct: 420 GFRGANC 426


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 114/445 (25%), Positives = 177/445 (39%), Gaps = 72/445 (16%)

Query: 34  SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
           +RF+    +D+ +      ++SL R L    P T       +   + +   S G Y + +
Sbjct: 89  TRFNARMQRDTKR------AASLLRRLAAGKP-TYAAEAFGSDVVSGMEQGS-GEYFVRI 140

Query: 94  SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
             G+PP+    ++D+GS ++W  C     C  C     P F P  SSS   + C +  CS
Sbjct: 141 GVGSPPRNQYVVMDSGSDIIWVQCE---PCTQCYHQSDPVFNPADSSSFSGVSCASTVCS 197

Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFL 212
            + + +     C                 Y V YG G  T+G    ET+     +I N  
Sbjct: 198 HVDNAACHEGRCR----------------YEVSYGDGSYTKGTLALETITFGRTLIRNVA 241

Query: 213 VGCS-------------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
           +GC              +     P    G   G+T          FSYCL+S   +    
Sbjct: 242 IGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTG-------GAFSYCLVSRGIE---- 290

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            SS +L+ G    +    G  + P ++NP          +YY+GL  + VGG RV +   
Sbjct: 291 -SSGLLEFGR---EAMPVGAAWVPLIHNPRAQS------FYYIGLSGLGVGGLRVSISED 340

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L   G+GG ++D+GT  T +    +E   D F++Q     N  RA G         C
Sbjct: 341 VFKLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTT---NLPRASGVSIFD---TC 394

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F GG  +TLP  N+   V +    C        +S G SII G
Sbjct: 395 YDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFA---PSSSGLSII-G 450

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   +  D  N  +GF   +C
Sbjct: 451 NIQQEGIQISVDGANGFVGFGPNVC 475


>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
           Group]
          Length = 260

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 86/261 (32%), Positives = 128/261 (49%), Gaps = 28/261 (10%)

Query: 209 PNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLIL 265
           P    GC++ S       +G+ G GRGK SL +QLN++ F Y L S    D +  S +  
Sbjct: 15  PGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS----DLSAPSPISF 70

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
            + +  +         TP + NP V +      +YYVGL  I+VGG+ V++     + DR
Sbjct: 71  GSLADVTGGNGDSFMSTPLLTNPVVQDLP----FYYVGLTGISVGGKLVQIPSGTFSFDR 126

Query: 326 D-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
             G GG I DSGTT T +    +  + DE +SQM   +    A   + +     CF   G
Sbjct: 127 STGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLI-----CF-TGG 180

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAVCLTVVTDREASGGPSIILGN 440
             T +FP + LHF GGA++ L  ENY   +    GE +A C +VV   +A      I+GN
Sbjct: 181 SSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGE-TARCWSVVKSSQA----LTIIGN 235

Query: 441 FQMQNYYVEYDLR-NQRLGFK 460
               +++V +DL  N R+ F+
Sbjct: 236 IMQMDFHVVFDLSGNARMLFQ 256


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 124/442 (28%), Positives = 197/442 (44%), Gaps = 70/442 (15%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           NP + S Q L + +  S+ R  H       T    T     +++S+S G Y +++S GTP
Sbjct: 47  NPMETSSQRLRNAIHRSVNRVFHF------TEKDNTPQPQIDLTSNS-GEYLMNVSIGTP 99

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
           P  I  I DTGS L+W  C     C  C +   P F PK SS+ + + C + +C+ + ++
Sbjct: 100 PFPIMAIADTGSDLLWTQCA---PCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQ 156

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRI-----IPNFL 212
                       A+       C SY + YG +  T+G    +TL L +       + N +
Sbjct: 157 ------------ASCSTNDNTC-SYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 203

Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQL--NLD-KFSYCL--LSHKFDDTTRTSSL 263
           +GC   ++    ++ +GI G G G  SL  QL  ++D KFSYCL  L+ K D T++    
Sbjct: 204 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKI--- 260

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
              N  +++    +G+  TP +       + +   +YY+ L+ I+VG ++++        
Sbjct: 261 ---NFGTNAIVSGSGVVSTPLI------AKASQETFYYLTLKSISVGSKQIQYSGSDSES 311

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
                G  I+DSGTT T +  E +  L D   S +   +        +  +GL  C+   
Sbjct: 312 ---SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKK------QDPQSGLSLCYSAT 362

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQ 442
           G+     P + +HF  GA+V L   N F  V E   VC          G PS  I GN  
Sbjct: 363 GDL--KVPVITMHFD-GADVKLDSSNAFVQVSE-DLVCFAF------RGSPSFSIYGNVA 412

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
             N+ V YD  ++ + FK   C
Sbjct: 413 QMNFLVGYDTVSKTVSFKPTDC 434


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/392 (27%), Positives = 169/392 (43%), Gaps = 47/392 (11%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
           +SL+ GTPPQ +  ++DTGS L W  C         + S   +F P  S+S + + C +P
Sbjct: 33  VSLTVGTPPQNVSMVIDTGSELSWLHCNK-------TLSYPTTFDPTRSTSYQTIPCSSP 85

Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
            C+             D P+  S +   +C + L    +  ++G   S+  ++ +  I  
Sbjct: 86  TCT---------NRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSDISG 136

Query: 211 FLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
            + GC  SV SS      +  G+ G  RG  S  SQL   KFSYC+    F      S L
Sbjct: 137 LVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFPKFSYCISGTDF------SGL 190

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
           +L   S+ +   +  L YTP +   S        V Y V L  I V  + + +       
Sbjct: 191 LLLGESNLT--WSVPLNYTPLIQI-STPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEP 247

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPCF 380
           D  G G T+VDSGT FTF+   ++  L   F++Q     +  R L          +  C+
Sbjct: 248 DHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQ---TSSVLRVLEDPDFVFQGAMDLCY 304

Query: 381 DVPGEK--TGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGG 433
            VP  +      P + L F+ GAE+T+  +     V     G  S  CL+   + +  G 
Sbjct: 305 LVPLSQRVLPLLPTVTLVFR-GAEMTVSGDRVLYRVPGELRGNDSVHCLS-FGNSDLLGV 362

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            + ++G+   QN ++E+DL   R+G  Q  C 
Sbjct: 363 EAYVIGHHHQQNVWMEFDLEKSRIGLAQVRCD 394


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 119/431 (27%), Positives = 187/431 (43%), Gaps = 57/431 (13%)

Query: 49  NSLVSS--SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG-GYSISLSFGTPPQIIPFI 105
           +S++SS  SL R  +++  +T+     T     N+ +   G  + ++ S G PP      
Sbjct: 17  DSILSSYQSLDRN-NVERRRTRRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVG 75

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
           +DTGS L+W  C     C  C     P F P  SS+   L   +P C             
Sbjct: 76  IDTGSDLLWVQCR---PCADCFRQSTPIFDPSKSSTYVDLSYDSPICP------------ 120

Query: 166 NDEPLATSKNCTQICPSYLVLYGSGLTEGIALS------ETLNLPNRIIPNFLVGCSVLS 219
            + P     +  Q    Y   Y  G T    L+      ET +     + + + GC   S
Sbjct: 121 -NSPQKKYNHLNQCI--YNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCG-HS 176

Query: 220 SR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
           +R     Q +GI G   G  S+ S+L   +FSYC+    FD     + L+L +G      
Sbjct: 177 NRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFDPHYTHNQLVLGDGV----- 229

Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
           K  G + TPF         + F+ +YYV L  I+VG  R+ +  +       G GG ++D
Sbjct: 230 KMEG-SSTPF---------HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279

Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELK 394
           SGTT TF+A + F+PL++E + ++V  R + + +    + G         E    FPEL 
Sbjct: 280 SGTTATFLAKDGFDPLSNE-IQRLV--RGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELA 336

Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
            HF  GA++ L   + F V       CL V+     + G   ++G    Q+Y V YDL  
Sbjct: 337 FHFAEGADLVLDANSLF-VQKNQDVFCLAVLESNLKNIGS--VIGIMAQQHYNVAYDLIG 393

Query: 455 QRLGFKQQLCK 465
           +R+ F++  C+
Sbjct: 394 KRVYFQRTDCE 404


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 124/442 (28%), Positives = 197/442 (44%), Gaps = 70/442 (15%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           NP + S Q L + +  S+ R  H       T    T     +++S+S G Y +++S GTP
Sbjct: 47  NPMETSSQRLRNAIHRSVNRVFHF------TEKDNTPQPQIDLTSNS-GEYLMNVSIGTP 99

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
           P  I  I DTGS L+W  C     C  C +   P F PK SS+ + + C + +C+ + ++
Sbjct: 100 PFPIMAIADTGSDLLWTQCA---PCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQ 156

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRI-----IPNFL 212
                       A+       C SY + YG +  T+G    +TL L +       + N +
Sbjct: 157 ------------ASCSTNDNTC-SYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 203

Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQL--NLD-KFSYCL--LSHKFDDTTRTSSL 263
           +GC   ++    ++ +GI G G G  SL  QL  ++D KFSYCL  L+ K D T++    
Sbjct: 204 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKI--- 260

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
              N  +++    +G+  TP +       + +   +YY+ L+ I+VG ++++        
Sbjct: 261 ---NFGTNAIVSGSGVVSTPLI------AKASQETFYYLTLKSISVGSKQIQYSGSDSES 311

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
                G  I+DSGTT T +  E +  L D   S +   +        +  +GL  C+   
Sbjct: 312 ---SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKK------QDPQSGLSLCYSAT 362

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQ 442
           G+     P + +HF  GA+V L   N F  V E   VC          G PS  I GN  
Sbjct: 363 GDL--KVPVITMHFD-GADVKLDSSNAFVQVSE-DLVCFAF------RGSPSFSIYGNVA 412

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
             N+ V YD  ++ + FK   C
Sbjct: 413 QMNFLVGYDTVSKTVSFKPTDC 434


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 113/406 (27%), Positives = 166/406 (40%), Gaps = 65/406 (16%)

Query: 66  QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
           Q+  T  TT  T+ N        Y I++  G+P +    ++D+GS + W  C     C  
Sbjct: 113 QSHVTVPTTLGTSLNTLE-----YLITVRLGSPAKTQTVLIDSGSDVSWVQCK---PCLQ 164

Query: 126 CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLV 185
           C S   P F P LSS+     C +  C+ +  +   C        ++S  C      Y+V
Sbjct: 165 CHSQVDPLFDPSLSSTYSPFSCSSAACAQLGQDGNGC--------SSSSQC-----QYIV 211

Query: 186 LYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN 241
            Y  G  T G   S+TL L +  I NF  GCS + S       G+ G G G  SL SQ  
Sbjct: 212 RYADGSSTTGTYSSDTLALGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTA 271

Query: 242 ---LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
                 FSYCL        T +SS  L  G+      T+G   TP + +  V        
Sbjct: 272 GTFGTAFSYCL------PPTPSSSGFLTLGAG-----TSGFVKTPMLRSSPVP------T 314

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           +Y V L  I VGG ++ +     +       G ++DSGT  T +    +  L+  F + M
Sbjct: 315 FYGVRLEAIRVGGTQLSIPTSVFS------AGMVMDSGTIITRLPRTAYSALSSAFKAGM 368

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
            + R       A   + +  CFD  G+ +   P + L F GGA V L           G+
Sbjct: 369 KQYRP------APPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLDANGIIL----GN 418

Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +     +D  + G    I+GN Q + + V YD+    +GFK   C
Sbjct: 419 CLAFAANSDDSSPG----IVGNVQQRTFEVLYDVGGGAVGFKAGAC 460


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 110/393 (27%), Positives = 169/393 (43%), Gaps = 44/393 (11%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++SL+ GTPPQ +  ++DTGS L W  C         ++S   +F    S S R + C +
Sbjct: 32  TVSLTVGTPPQNVSMVIDTGSELSWLYCNKTTT----TTSYPTTFNQTRSISYRPIPCSS 87

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
             C      + Q RD +   +  S +   +C + L    +  +EG   S+T ++    IP
Sbjct: 88  STC------TNQTRDFS---IPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASDIP 138

Query: 210 NFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
             + GC  SV SS      +  G+ G  RG  S  SQ+   KFSYC+    F      S 
Sbjct: 139 GMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISGTDF------SG 192

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           ++L   S+ +      L YTP V   S        + Y V L  I V  + + +      
Sbjct: 193 MLLLGESNFT--WAVPLNYTPLVQI-STPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFE 249

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPC 379
            D  G G T+VDSGT FTF+    +  L  EF++Q      + R L          +  C
Sbjct: 250 PDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTT---GFLRVLEDPDFVFQGAMDLC 306

Query: 380 FDVPGEKT--GSFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVVTDREASG 432
           + VP  +      P + L F  GAE+T+  E         + G  S  CL+   + +  G
Sbjct: 307 YRVPISQRVLPRLPTVSLVFN-GAEMTVADERVLYRVPGEIRGNDSVHCLS-FGNSDLLG 364

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             + ++G+   QN ++E+DL   R+G  Q  C 
Sbjct: 365 VEAYVIGHHHQQNVWMEFDLERSRIGLAQVRCD 397


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 117/389 (30%), Positives = 163/389 (41%), Gaps = 54/389 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP +    +LDTGS +VW  C     C+ C S   P F P  S S   +G
Sbjct: 152 GEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE---PCRECYSQADPIFNPSSSVSFSTVG 208

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  CS +        D ND        C      Y V YG G  T G   +ETL    
Sbjct: 209 CDSAVCSQL--------DAND--------CHGGGCLYEVSYGDGSYTVGSYATETLTFGT 252

Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
             I N  +GC   +V      AG+ G G G  S P+QL       FSYCL+     D   
Sbjct: 253 TSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLV-----DRDS 307

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWH 318
            SS  L+ G    +    G  +TP V NP +        +YY+ +  I+VGG  +  V  
Sbjct: 308 ESSGTLEFG---PESVPIGSIFTPLVANPFLP------TFYYLSMVAISVGGVILDSVPS 358

Query: 319 KYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
           +   +D   G GG I+DSGT  T +    ++ L D F++     ++  RA G   ++   
Sbjct: 359 EAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIA---GTQHLPRADG---ISIFD 412

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSI 436
            C+D+   ++ S P +  HF  GA   LP +N    +      C      D   S     
Sbjct: 413 TCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLS----- 467

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           I+GN Q Q   V +D  N  +GF    C+
Sbjct: 468 IMGNIQQQGIRVSFDSANSLVGFAIDQCQ 496


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 105/385 (27%), Positives = 148/385 (38%), Gaps = 68/385 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP     ++D+GS ++W  C     C+ C +   P F P  SSS   + 
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR---PCEQCYAQTDPLFDPAASSSFSGVS 184

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  +                 +  C      Y V YG G  T+G    ETL L  
Sbjct: 185 CGSAICRTLSGTGCGGG-------GDAGKC-----DYSVTYGDGSYTKGELALETLTLGG 232

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
             +    +GC   +S      AG+ G G G  SL  QL       FSYCL S        
Sbjct: 233 TAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGS 292

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            +S                                    +YYVGL  I VGG+R+ +   
Sbjct: 293 LAS-----------------------------------SFYYVGLTGIGVGGERLPLQDS 317

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L  DG GG ++D+GT  T +  E +  L   F   M           + A++ L  C
Sbjct: 318 LFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPR------SPAVSLLDTC 371

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F  GA +TLP  N    VG G+  CL       +S G S ILG
Sbjct: 372 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVG-GAVFCLAFA---PSSSGIS-ILG 426

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   +  D  N  +GF    C
Sbjct: 427 NIQQEGIQITVDSANGYVGFGPNTC 451


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 117/437 (26%), Positives = 192/437 (43%), Gaps = 67/437 (15%)

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL--------- 106
           + RAL + N + ++        T++ +  S     I L+ G   + + +I+         
Sbjct: 87  MRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGKNM 146

Query: 107 ----DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
               DTGS L W  C     C+ C + + P + P +SSS + + C +  C     + +  
Sbjct: 147 SLIVDTGSDLTWVQCQ---PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTC-----QDLVA 198

Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR 221
              N  P   +    +    Y+V YG G  T G   SE++ L +  + NF+ GC     R
Sbjct: 199 ATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG----R 254

Query: 222 QPAGI-------AGFGRGKTSLPSQ----LNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
              G+        G GR   SL SQ     N   FSYCL S   +D    S    ++ S 
Sbjct: 255 NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFN-GVFSYCLPS--LEDGASGSLSFGNDSSV 311

Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
           +++  +T ++YTP V NP +        +Y + L   ++GG    V  K  +  R    G
Sbjct: 312 YTN--STSVSYTPLVQNPQLRS------FYILNLTGASIGG----VELKSSSFGR----G 355

Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
            ++DSGT  T + P +++ +  EF+ Q      ++    A   + L  CF++   +  S 
Sbjct: 356 ILIDSGTVITRLPPSIYKAVKIEFLKQ------FSGFPTAPGYSILDTCFNLTSYEDISI 409

Query: 391 PELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYV 448
           P +K+ F+G AE+ + V   F  V  + S VCL + +   E   G   I+GN+Q +N  V
Sbjct: 410 PIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRV 466

Query: 449 EYDLRNQRLGFKQQLCK 465
            YD   +RLG   + C+
Sbjct: 467 IYDTTQERLGIVGENCR 483


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 111/415 (26%), Positives = 173/415 (41%), Gaps = 58/415 (13%)

Query: 59  ALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT 118
           +L+  N       +  +   T  +S+  G Y   +  GTP +    ++DTGS L W  C+
Sbjct: 107 SLYRANDDAAVDGSLASVPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCS 166

Query: 119 NHYQCKY-CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT 177
               C+  C     P F PK SSS   + C  P+C+ +   ++    C+          +
Sbjct: 167 ---PCRVSCHRQSGPVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSS---------S 214

Query: 178 QICPSYLVLYG-SGLTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGK 233
            +C  Y   YG S  + G    +T++  +  +PNF  GC   +     + AG+ G  R K
Sbjct: 215 DVC-IYQASYGDSSFSVGYLSKDTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNK 273

Query: 234 TSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
            SL  QL       FSYCL S          S    N   +S        YTP V++   
Sbjct: 274 LSLLYQLAPTLGYSFSYCLPSSSSSGYLSIGSY---NPGQYS--------YTPMVSS--- 319

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEP 349
                    Y++ L  +TV G+ + V   +Y +L       TI+DSGT  T +   +++ 
Sbjct: 320 ---TLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLP------TIIDSGTVITRLPTTVYDA 370

Query: 350 LADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN 409
           L+      M   +       A+A + L  CF V    +   P + + F GGA + L  +N
Sbjct: 371 LSKAVAGAMKGTKR------ADAYSILDTCF-VGQASSLRVPAVSMAFSGGAALKLSAQN 423

Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               V + S  CL     R A+     I+GN Q Q + V YD+++ R+GF    C
Sbjct: 424 LLVDV-DSSTTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSNRIGFAAGGC 472


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 120/437 (27%), Positives = 191/437 (43%), Gaps = 67/437 (15%)

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL--------- 106
           + RAL + N + ++        T++ +  S     I L+ G   + + +I+         
Sbjct: 39  MRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGKNM 98

Query: 107 ----DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
               DTGS L W  C     C+ C + + P + P +SSS + + C +  C     + +  
Sbjct: 99  SLIVDTGSDLTWVQCQ---PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTC-----QDLVA 150

Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR 221
              N  P   +    +    Y+V YG G  T G   SE++ L +  + NF+ GC     R
Sbjct: 151 ATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG----R 206

Query: 222 QPAGI-------AGFGRGKTSLPSQ----LNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
              G+        G GR   SL SQ     N   FSYCL S   +D   + SL   N SS
Sbjct: 207 NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFN-GVFSYCLPS--LEDGA-SGSLSFGNDSS 262

Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
                T+ ++YTP V NP +        +Y + L   ++GG    V  K  +  R    G
Sbjct: 263 VYTNSTS-VSYTPLVQNPQLRS------FYILNLTGASIGG----VELKSSSFGR----G 307

Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
            ++DSGT  T + P +++ +  EF+ Q      ++    A   + L  CF++   +  S 
Sbjct: 308 ILIDSGTVITRLPPSIYKAVKIEFLKQ------FSGFPTAPGYSILDTCFNLTSYEDISI 361

Query: 391 PELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYV 448
           P +K+ F+G AE+ + V   F  V  + S VCL + +   E   G   I+GN+Q +N  V
Sbjct: 362 PIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRV 418

Query: 449 EYDLRNQRLGFKQQLCK 465
            YD   +RLG   + C+
Sbjct: 419 IYDTTQERLGIVGENCR 435


>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
          Length = 382

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 116/230 (50%), Gaps = 26/230 (11%)

Query: 238 SQLNLDKFSYCLLS-HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
           SQL   KFSYCL S H+     +TSSL+  +  ++S+     +  TP + NP +      
Sbjct: 173 SQLGTQKFSYCLTSIHE----NKTSSLLFGS-LAYSNFNPGKIPRTPLIQNPFLPS---- 223

Query: 297 SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS 356
             YYY+ L+ ITVG   + +      L +DG+GG I+DSGTT T++  + F+ L + F+S
Sbjct: 224 --YYYLALKGITVGYTLLPIPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKNAFIS 281

Query: 357 QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS--FPELKLHFKGGAEVTLPVENYFAVV 414
           Q          +   + TGL  CF +P +       P+L  HFK G ++ LPVENY    
Sbjct: 282 QT------ELQVANSSTTGLDLCFHLPVKNAAEVKVPKLIFHFK-GLDLALPVENYMVSD 334

Query: 415 GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            E   +CL +    +A+G  S I GN Q QN  V +DL+   L      C
Sbjct: 335 PEMGLICLAI----DATGSLS-IFGNIQQQNMLVLHDLKKSTLSLVPTQC 379



 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 28/52 (53%), Gaps = 6/52 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKL 138
           G + ++L  GTPP   P I+DTGS L+W     H  CK    SK    IP++
Sbjct: 97  GEFVVNLMIGTPPVPFPAIMDTGSDLIW----THKLCKGVKPSKFS--IPRI 142


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 157/381 (41%), Gaps = 52/381 (13%)

Query: 92  SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
           + + GTPPQ     +D    LVW  C+   QC +C    +P F+P  SS+ +   C    
Sbjct: 57  NFTIGTPPQAASAFIDLTGELVWTQCS---QCIHCFKQDLPVFVPNASSTFKPEPCGTDV 113

Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
           C  I                T K  + +C    V    G T GI  ++T  +      + 
Sbjct: 114 CKSI---------------PTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPASL 158

Query: 212 LVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
             GC V S       P+G  G GR   SL +Q+ L +FSYCL  H   DT + S L L  
Sbjct: 159 GFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSRLFL-- 213

Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
               S K   G  +TPFV     +  +  S YY + L  I  G          +T+ R  
Sbjct: 214 --GASAKLAGGGAWTPFVKT---SPNDGMSQYYPIELEEIKAG-------DATITMPRGR 261

Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT 387
           N   +  +    + +   +++      ++ +      T  +GA        CF  P    
Sbjct: 262 NTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTAT-PVGAP----FEVCF--PKAGV 314

Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT----DREASGGPSIILGNFQM 443
              P+L   F+ GA +T+P  NY   VG  + VCL+V++    +  A  G + ILG+FQ 
Sbjct: 315 SGAPDLVFTFQAGAALTVPPANYLFDVGNDT-VCLSVMSIALLNITALDGLN-ILGSFQQ 372

Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
           +N ++ +DL    L F+   C
Sbjct: 373 ENVHLLFDLDKDMLSFEPADC 393


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 117/437 (26%), Positives = 192/437 (43%), Gaps = 67/437 (15%)

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL--------- 106
           + RAL + N + ++        T++ +  S     I L+ G   + + +I+         
Sbjct: 87  MRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGKNM 146

Query: 107 ----DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
               DTGS L W  C     C+ C + + P + P +SSS + + C +  C     + +  
Sbjct: 147 SLIVDTGSDLTWVQCQ---PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTC-----QDLVA 198

Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR 221
              N  P   +    +    Y+V YG G  T G   SE++ L +  + NF+ GC     R
Sbjct: 199 ATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG----R 254

Query: 222 QPAGI-------AGFGRGKTSLPSQ----LNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
              G+        G GR   SL SQ     N   FSYCL S   +D    S    ++ S 
Sbjct: 255 NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFN-GVFSYCLPS--LEDGASGSLSFGNDSSV 311

Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
           +++  +T ++YTP V NP +        +Y + L   ++GG    V  K  +  R    G
Sbjct: 312 YTN--STSVSYTPLVQNPQLRS------FYILNLTGASIGG----VELKSSSFGR----G 355

Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
            ++DSGT  T + P +++ +  EF+ Q      ++    A   + L  CF++   +  S 
Sbjct: 356 ILIDSGTVITRLPPSIYKAVKIEFLKQ------FSGFPTAPGYSILDTCFNLTSYEDISI 409

Query: 391 PELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYV 448
           P +K+ F+G AE+ + V   F  V  + S VCL + +   E   G   I+GN+Q +N  V
Sbjct: 410 PIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRV 466

Query: 449 EYDLRNQRLGFKQQLCK 465
            YD   +RLG   + C+
Sbjct: 467 IYDSTQERLGIVGENCR 483


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 159/387 (41%), Gaps = 52/387 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP   +  I DTGS L W  C      + C   K P F P  S+S   + 
Sbjct: 130 GNYIVTVGLGTPKNDLSLIFDTGSDLTWTQC--QPCVRTCYDQKEPIFNPSKSTSYYNVS 187

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C  +   +     C      ++ NC      Y + YG    + G    E   L N
Sbjct: 188 CSSAACGSLSSATGNAGSC------SASNCI-----YGIQYGDQSFSVGFLAKEKFTLTN 236

Query: 206 R-IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
             +      GC   +       AG+ G GR K S PSQ        FSYCL S      +
Sbjct: 237 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SAS 292

Query: 259 RTSSLILDN-GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
            T  L   + G S S K      +TP     ++ +  +F   Y + +  ITVGGQ++ + 
Sbjct: 293 YTGHLTFGSAGISRSVK------FTPI---STITDGTSF---YGLNIVAITVGGQKLPIP 340

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
               +       G ++DSGT  T + P+ +  L   F ++M K   Y    G      L 
Sbjct: 341 STVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSK---YPTTSGVSI---LD 389

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            CFD+ G KT + P++   F GGA V L  +  F V  + S VCL    + + S   + I
Sbjct: 390 TCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVF-KISQVCLAFAGNSDDSN--AAI 446

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            GN Q Q   V YD    R+GF    C
Sbjct: 447 FGNVQQQTLEVVYDGAGGRVGFAPNGC 473


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 106/398 (26%), Positives = 159/398 (39%), Gaps = 56/398 (14%)

Query: 75  TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
           +T  ++ ++   G Y   +  G P +    +LDTGS + W  C     C  C     P F
Sbjct: 143 STPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCK---PCSDCYQQSDPIF 199

Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTE 193
            P  SSS   L C   +C  +  E   CR+           C      Y V YG G  T 
Sbjct: 200 DPTASSSYNPLTCDAQQCQDL--EMSACRN---------GKCL-----YQVSYGDGSFTV 243

Query: 194 GIALSETLNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFS 246
           G  ++ET++     +    +GC         G+        G G G  SL SQ+    FS
Sbjct: 244 GEYVTETVSFGAGSVNRVAIGCG----HDNEGLFVGSAGLLGLGGGPLSLTSQIKATSFS 299

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           YCL+     D+ ++S+L  ++       +       P + N  V      + +YYV L  
Sbjct: 300 YCLVDR---DSGKSSTLEFNS------PRPGDSVVAPLLKNQKV------NTFYYVELTG 344

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           ++VGG+ V V  +   +D+ G GG IVDSGT  T +  + +  + D F       R  + 
Sbjct: 345 VSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYNSVRDAF------KRKTSN 398

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
              AE +     C+D+   ++   P +  HF G     LP +NY   V      C     
Sbjct: 399 LRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAP 458

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              +      I+GN Q Q   V +DL N  +GF    C
Sbjct: 459 TTSSMS----IIGNVQQQGTRVSFDLANSLVGFSPNKC 492


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 120/403 (29%), Positives = 165/403 (40%), Gaps = 71/403 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y +S+  GTP + +  + DTGS L W       QC  CSS      + P F P  SS+
Sbjct: 152 GNYVVSVGLGTPARDLTVVFDTGSDLSWV------QCGPCSSGGCYKQQDPLFAPSDSST 205

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSET 200
              + C   +C            C   P          CP Y V+YG    T+G   ++T
Sbjct: 206 FSAVRCGARECRARQS-------CGGSP------GDDRCP-YEVVYGDKSRTQGHLGNDT 251

Query: 201 LNLP-----------NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LD 243
           L L            +  +P F+ GC   ++    Q  G+ G GRGK SL SQ      +
Sbjct: 252 LTLGTMAPANASAENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGE 311

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
            FSYCL S         S        +H+        +TP +N      R     +YYV 
Sbjct: 312 GFSYCLPSSSSSAPGYLSLGTPVPAPAHAQ-------FTPMLN------RTTTPSFYYVK 358

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           L  I V G+ +RV    + L        IVDSGT  T +AP  +  L   F+S M K   
Sbjct: 359 LVGIRVAGRAIRVSSPRVALP------LIVDSGTVITRLAPRAYRALRAAFLSAMGKY-G 411

Query: 364 YTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
           Y R   A  L+ L  C+D       T S P + L F GGA +++       V     A C
Sbjct: 412 YKR---APRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQA-C 467

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           L    + +  G  + ILGN Q +   V YD+  Q++GF  + C
Sbjct: 468 LAFAPNGD--GRSAGILGNTQQRTLAVVYDVARQKIGFAAKGC 508


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 116/403 (28%), Positives = 172/403 (42%), Gaps = 50/403 (12%)

Query: 78   TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPK 137
            +  +S H     ++SL+ G+PPQ +  +LDTGS L W  C         S +    F P 
Sbjct: 989  SNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKK-------SPNLTSVFNPL 1041

Query: 138  LSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
             SSS   + C +P C        + RD    P   + +  ++C + +    +   EG   
Sbjct: 1042 SSSSYSPIPCSSPICR------TRTRDL---PNPVTCDPKKLCHAIVSYADASSLEGNLA 1092

Query: 198  SETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
            S+   + +  +P  L GC  S  SS      +  G+ G  RG  S  +QL L KFSYC+ 
Sbjct: 1093 SDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI- 1151

Query: 251  SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                  + R SS +L  G  H       LTYTP V   S        V Y V L  I VG
Sbjct: 1152 ------SGRDSSGVLLFGDLHLSW-LGNLTYTPLVQ-ISTPLPYFDRVAYTVQLDGIRVG 1203

Query: 311  GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
             + + +       D  G G T+VDSGT FTF+   ++  L +EF+ Q    +     LG 
Sbjct: 1204 NKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQ---TKGVLAPLGD 1260

Query: 371  EALT---GLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVC 421
                    +  C+ V  G K  + P + L F+ GAE+ +  E     V     G     C
Sbjct: 1261 PNFVFQGAMDLCYSVAAGGKLPTLPSVSLMFR-GAEMVVGGEVLLYRVPEMMKGNEWVYC 1319

Query: 422  LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            LT   + +  G  + ++G+   QN ++E+DL    + F   LC
Sbjct: 1320 LT-FGNSDLLGIEAFVIGHHHQQNVWMEFDL----VAFAADLC 1357


>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
          Length = 447

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 110/410 (26%), Positives = 166/410 (40%), Gaps = 62/410 (15%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++ ++ GTPPQ +  +LDTGS L W  C   Y     +    P+F    SSS   + C +
Sbjct: 56  TVPVAVGTPPQNVTMVLDTGSELSWLLCNGSY-----APPLTPAFNASGSSSYGAVPCPS 110

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
             C W   +      C+  P     N  ++  SY     +   +G+  ++T  L     P
Sbjct: 111 TACEWRGRDLPVPPFCDTPP----SNACRVSLSYA---DASSADGVLATDTFLLTGGAPP 163

Query: 210 ---NFLVGC---------------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLS 251
                  GC                   S    G+ G  RG  S  +Q    +F+YC+  
Sbjct: 164 VAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAP 223

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRITV 309
            +         L+ D+G          L YTP +    +++   +   V Y V L  I V
Sbjct: 224 GEGPGVL----LLGDDGGVAPP-----LNYTPLIE---ISQPLPYFDRVAYSVQLEGIRV 271

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           G   + +    LT D  G G T+VDSGT FTF+  + +  L  EF SQ    R     LG
Sbjct: 272 GCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQA---RLLLAPLG 328

Query: 370 AEALT---GLRPCFDVPGEK----TGSFPELKLHFKGGAEVTLPVENYFAVV-----GEG 417
                       CF  P  +    +G  P + L  + GAEV +  E    +V     GEG
Sbjct: 329 EPGFVFQGAFDACFRGPEARVAAASGLLPVVGLVLR-GAEVAVSGEKLLYMVPGERRGEG 387

Query: 418 SAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            A  +  +T  + + +G  + ++G+   QN +VEYDL+N R+GF    C 
Sbjct: 388 GAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 437


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 158/386 (40%), Gaps = 52/386 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ++D+GS +VW  C     C  C     P F P  S+S   + 
Sbjct: 41  GEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCK---PCTQCYHQTDPLFDPADSASFMGVS 97

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  + +       CN      S  C      Y V YG G  T+G    ETL    
Sbjct: 98  CSSAVCDRVENAG-----CN------SGRC-----RYEVSYGDGSYTKGTLALETLTFGR 141

Query: 206 RIIPNFLVGCSVLSSR----QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
            ++ N  +GC   S+R      AG+ G G G  S   QL+    + FSYCL+S      T
Sbjct: 142 TVVRNVAIGCG-HSNRGMFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVSRG----T 196

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            T+  +       S+    G  + P V NP          +YY+ L  + VG  RV V  
Sbjct: 197 NTNGFL----EFGSEAMPVGAAWIPLVRNPRAPS------FYYIRLLGLGVGDTRVPVSE 246

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
               L+  G+GG ++D+GT  T      +E   + F+ Q    +N  RA G         
Sbjct: 247 DVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQ---TQNLPRASGVSIFD---T 300

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+++ G  +   P +  +F GG  +T+P  N+   V +    C               IL
Sbjct: 301 CYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLS----IL 356

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q +   +  D  N+ +GF   +C
Sbjct: 357 GNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 111/374 (29%), Positives = 162/374 (43%), Gaps = 52/374 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTPPQ +   +DT +   W PCT    C  C+S+    F P+ S++ + + C 
Sbjct: 93  YIVRAKIGTPPQTLLLAMDTSNDAAWIPCT---ACDGCASTL---FAPEKSTTFKNVSCA 146

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P+C  + +       C      +S+N       + + YGS       + +T+ L    +
Sbjct: 147 APECKQVPNPG-----CG----VSSRN-------FNLTYGSSSIAANLVQDTITLATDPV 190

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
           P++  GC   +  +S  P G+ G GRG  SL SQ   L    FSYCL S  F     + S
Sbjct: 191 PSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGS 248

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L  G     K+   + YTP + NP    R++    YYV L  I VG + V +    L 
Sbjct: 249 LRL--GPVAQPKR---IKYTPLLKNP---RRSSL---YYVNLEAIRVGRKVVDIPPAALA 297

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +   ++  + DEF       R     L   +L G   C++V
Sbjct: 298 FNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEF------RRRVGPKLTVTSLGGFDTCYNV 351

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F  G  VTLP +N       GS  CL +    +       ++ N Q
Sbjct: 352 P----IVVPTITFIFT-GMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 406

Query: 443 MQNYYVEYDLRNQR 456
            QN+ V YD+ N R
Sbjct: 407 QQNHRVLYDVPNSR 420


>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
          Length = 464

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 108/400 (27%), Positives = 162/400 (40%), Gaps = 55/400 (13%)

Query: 105 ILDTGSHLVWFPCTN-------HYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW--I 155
           ++DTGS LVW  C+              C    +P +   LS ++R + C +   +   +
Sbjct: 77  VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136

Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
             E+  C         +  +   +  S    YG+G+  G+  ++    P+        GC
Sbjct: 137 APETAGCARGG----GSGDDACVVAAS----YGAGVALGVLGTDAFTFPSSSSVTLAFGC 188

Query: 216 SVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
              +   P      +GI G GRG  SL SQLN  +FSYCL  + F DT   S L + +G 
Sbjct: 189 VSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPY-FRDTVSPSHLFVGDGE 247

Query: 270 SHSDKKTTG--------LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
               +   G        +T  PF  NP   + + FS +YY+ L  +  G   V +     
Sbjct: 248 LAGLRAAAGGGGGGGAPVTTVPFAKNP---KDSPFSTFYYLPLVGLAAGNATVALPAGAF 304

Query: 322 TLDRDG----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT---RALGAEALT 374
            L         GG ++DSG+ FT +       L  E   Q+  + +       LG     
Sbjct: 305 DLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALEL 364

Query: 375 GLRPCFDVPGEKTGSFPELKLHFK----GGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
            +    D       + P L L F     GG E+ +P E Y+A V E S  C+ VV+   A
Sbjct: 365 CVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARV-EASTWCMAVVS--SA 421

Query: 431 SGGPSI------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           SG  ++      I+GNF  Q+  V YDL N  L F+   C
Sbjct: 422 SGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 461


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 116/393 (29%), Positives = 169/393 (43%), Gaps = 71/393 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS----SSKIPSFIPKLSSSS 142
           G Y I++ FGTP +    + DTGS + W       QCK C+    + + P F P LSS+ 
Sbjct: 14  GNYVITVGFGTPTRTQTVVFDTGSDVNWL------QCKPCAVRCYAQQEPLFDPSLSSTY 67

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
           R + C  P C  +                +++ C+     Y V YG G  T G    +T 
Sbjct: 68  RNVSCTEPACVGL----------------STRGCSSSTCLYGVFYGDGSSTIGFLAMDTF 111

Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKT-SLPSQLNL---DKFSYCLLSHK 253
            L P +   NF+ GC   ++   +  AG+ G GR  T SL SQ+     + FSYCL S  
Sbjct: 112 MLTPAQKFKNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPS-- 169

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
               T +++  L+ G+  +        YT  + +  V         Y++ L  I+VGG R
Sbjct: 170 ----TSSATGYLNIGNPQNTPG-----YTAMLTDTRVPT------LYFIDLIGISVGGTR 214

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + +           + GTI+DSGT  T + P  +  L     + M +   YT    A A+
Sbjct: 215 LSLSSTVFQ-----SVGTIIDSGTVITRLPPTAYSALKTAVRAAMTQ---YTL---APAV 263

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV--TDREAS 431
           T L  C+D     +  +P + LHF  G +V +P    F V    S VCL     TD    
Sbjct: 264 TILDTCYDFSRTTSVVYPVIVLHFA-GLDVRIPATGVFFVF-NSSQVCLAFAGNTDSTMI 321

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           G    I+GN Q     V YD   +R+GF    C
Sbjct: 322 G----IIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 113/418 (27%), Positives = 172/418 (41%), Gaps = 50/418 (11%)

Query: 56  LTRALHIKNP-QTKTTTTTTTTTTTNISSH-SYGGYSISLSFGTPPQIIPFILDTGSHLV 113
           LTRA  ++   +     TT       +  H S   Y ++L+ GTPPQ +  I+D G  LV
Sbjct: 16  LTRAHELRRGLELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELV 75

Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
           W  C  H  C+ C    +P F    SS+ R   C    C     ESI  R        + 
Sbjct: 76  WTQCAQH--CRRCFKQDLPLFDTNASSTFRPEPCGAAVC-----ESIPTR--------SC 120

Query: 174 KNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ----PAGIAGF 229
                    Y      G T G   ++ + +          GC+V S        +G  G 
Sbjct: 121 AGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGL 180

Query: 230 GRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPS 289
           GR   SL +Q+N   FSYCL      DT ++S+L L   S+       G   TPFV   S
Sbjct: 181 GRTNLSLAAQMNATAFSYCLAP---PDTGKSSALFL-GASAKLAGAGKGAGTTPFVKT-S 235

Query: 290 VAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEP 349
               +  S  Y + L       + +R  +  + + + GN   +V + T  T +   ++  
Sbjct: 236 TPPHSGLSRSYLLRL-------EAIRAGNATIAMPQSGN-TIMVSTATPVTALVDSVYRD 287

Query: 350 LADEFVSQMVKNRNYTRALGAEALTGLRPCFDV---PGEKTGSFPELKLHFKGGAEVTLP 406
           L           +    A+GA  +      +D+       +G  P+L L F+GGAE+T+P
Sbjct: 288 L----------RKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTVP 337

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           V +Y    G  +A C+ ++    A GG S ILG+ Q  N ++ +DL  + L F+   C
Sbjct: 338 VSSYLFDAGNDTA-CVAIL-GSPALGGVS-ILGSLQQVNIHLLFDLDKETLSFEPADC 392


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 118/431 (27%), Positives = 186/431 (43%), Gaps = 57/431 (13%)

Query: 49  NSLVSS--SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG-GYSISLSFGTPPQIIPFI 105
           +S++SS  SL R  +++  +T+           N+ +   G  + ++ S G PP      
Sbjct: 17  DSILSSYQSLDRN-NVERRRTRRAAFIXDEIQANMVADDRGQAFLVNFSVGRPPVPQLVG 75

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
           +DTGS L+W  C     C  C     P F P  SS+   L   +P C             
Sbjct: 76  IDTGSDLLWVQCR---PCADCFRQSTPIFDPSKSSTYVDLSYDSPICP------------ 120

Query: 166 NDEPLATSKNCTQICPSYLVLYGSGLTEGIALS------ETLNLPNRIIPNFLVGCSVLS 219
            + P     +  Q    Y   Y  G T    L+      ET +     + + + GC   S
Sbjct: 121 -NSPQKKYNHLNQCI--YNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCG-HS 176

Query: 220 SR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
           +R     Q +GI G   G  S+ S+L   +FSYC+    FD     + L+L +G      
Sbjct: 177 NRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFDPHYTHNQLVLGDGV----- 229

Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
           K  G + TPF         + F+ +YYV L  I+VG  R+ +  +       G GG ++D
Sbjct: 230 KMEG-SSTPF---------HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279

Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELK 394
           SGTT TF+A + F+PL++E + ++V  R + + +    + G         E    FPEL 
Sbjct: 280 SGTTATFLAKDGFDPLSNE-IQRLV--RGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELA 336

Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
            HF  GA++ L   + F V       CL V+     + G   ++G    Q+Y V YDL  
Sbjct: 337 FHFAEGADLVLDANSLF-VQKNQDVFCLAVLESNLKNIGS--VIGIMAQQHYNVAYDLIG 393

Query: 455 QRLGFKQQLCK 465
           +R+ F++  C+
Sbjct: 394 KRVYFQRTDCE 404


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 159/387 (41%), Gaps = 52/387 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP   +  I DTGS L W  C      + C   K P F P  S+S   + 
Sbjct: 102 GNYIVTVGLGTPKNDLSLIFDTGSDLTWTQC--QPCVRTCYDQKEPIFNPSKSTSYYNVS 159

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C  +   +     C      ++ NC      Y + YG    + G    E   L N
Sbjct: 160 CSSAACGSLSSATGNAGSC------SASNCI-----YGIQYGDQSFSVGFLAKEKFTLTN 208

Query: 206 R-IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
             +      GC   +       AG+ G GR K S PSQ        FSYCL S      +
Sbjct: 209 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SAS 264

Query: 259 RTSSLILDN-GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
            T  L   + G S S K      +TP     ++ +  +F   Y + +  ITVGGQ++ + 
Sbjct: 265 YTGHLTFGSAGISRSVK------FTPI---STITDGTSF---YGLNIVAITVGGQKLPIP 312

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
               +       G ++DSGT  T + P+ +  L   F ++M K   Y    G      L 
Sbjct: 313 STVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSK---YPTTSGVSI---LD 361

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            CFD+ G KT + P++   F GGA V L  +  F V  + S VCL    + + S   + I
Sbjct: 362 TCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVF-KISQVCLAFAGNSDDSN--AAI 418

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            GN Q Q   V YD    R+GF    C
Sbjct: 419 FGNVQQQTLEVVYDGAGGRVGFAPNGC 445


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 110/383 (28%), Positives = 154/383 (40%), Gaps = 56/383 (14%)

Query: 92  SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
           + + GTPPQ     +D    LVW  C+   QC +C    +P F+P  SS+ +   C    
Sbjct: 27  NFTIGTPPQAASAFIDLTGELVWTQCS---QCIHCFKQDLPVFVPNASSTFKPEPCGTDV 83

Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
           C  I                T K  + +C    V    G T GI  ++T  +      + 
Sbjct: 84  CKSI---------------PTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTAAPASL 128

Query: 212 LVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
             GC V S       P+G  G GR   SL +Q+ L +FSYCL  H   DT + S L L  
Sbjct: 129 GFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSRLFL-- 183

Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
               S K   G  +TPFV     +  +  S YY + L  I  G          +T+ R  
Sbjct: 184 --GASAKLAGGGAWTPFVKT---SPNDGMSQYYPIELEEIKAG-------DATITMPRGR 231

Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV--PGE 385
           N        T     A      L D  V Q  K         A   T +   F+V  P  
Sbjct: 232 N--------TVLVQTAVVRVSLLVDS-VYQEFKKAVMASVGAAPTATPVGEPFEVCFPKA 282

Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT----DREASGGPSIILGNF 441
                P+L   F+ GA +T+P  NY   VG  + VCL+V++    +  A  G + ILG+F
Sbjct: 283 GVSGAPDLVFTFQAGAALTVPPANYLFDVGNDT-VCLSVMSIALLNITALDGLN-ILGSF 340

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
           Q +N ++ +DL    L F+   C
Sbjct: 341 QQENVHLLFDLDKDMLSFEPADC 363


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 161/388 (41%), Gaps = 59/388 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 177 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKL--FDPASSSTYANVS 234

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS +           D    +  +C      Y V YG G  + G    +TL L +
Sbjct: 235 CAAPACSDL-----------DVSGCSGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 278

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
              +  F  GC   +     + AG+ G GRGKTSLP Q        F++CL +       
Sbjct: 279 YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARS----- 333

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            T +  LD G+      TT    TP +  N P+         +YYVG+  I VGG+ + +
Sbjct: 334 -TGTGYLDFGAGSPPATTT----TPMLTGNGPT---------FYYVGMTGIRVGGRLLPI 379

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                        GTIVDSGT  T + P  +  L     +  +  R Y +   A A++ L
Sbjct: 380 APSVFAA-----AGTIVDSGTVITRLPPAAYSSL-RSAFAAAMAARGYRK---AAAVSLL 430

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C+D  G    + P + L F+GGA + +        V   S VCL    + +  GG   
Sbjct: 431 DTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV-SASQVCLAFAGNED--GGDVG 487

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q++ + V YD+  + +GF    C
Sbjct: 488 IVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 124/486 (25%), Positives = 214/486 (44%), Gaps = 76/486 (15%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHT--------NPSQDSYQNLNSLV 52
           M S  +++ LS   F + +    ++   L F+    H         NP++   Q + + +
Sbjct: 1   MVSLFTSVLLSLCLFSSHILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAI 60

Query: 53  SSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHL 112
             S  R  H  +        +  +  T+I+    G Y ++LS GTPP  I  + DTGS+L
Sbjct: 61  HRSFNRVSHFTD--LSEMDASLNSPQTDITPCG-GEYLMNLSLGTPPSPIMAVADTGSNL 117

Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
           +W  C     C  C +   P F PK SS+ + + C + +C+ + +++     C+ E    
Sbjct: 118 IWTQCK---PCDDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQA----SCSTE---- 166

Query: 173 SKNCTQICPSYLVLYGSG-------LTEGIALSETLNLPNRIIPNFLVGC----SVLSSR 221
            K C     SYLV Y  G         + + L  T N P + + N ++GC    +V    
Sbjct: 167 DKTC-----SYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQ-LKNIIIGCGQNNAVTFRN 220

Query: 222 QPAGIAGFGRGKTSLPSQL--NLD-KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
           + +G+ G G G  SL  QL  ++D KFSYCL+    D T++       N  +++     G
Sbjct: 221 KSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPEN-DQTSKI------NFGTNAVVSGPG 273

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
              TP V    V  R+ F   YY+ L+ I+VG + ++      T D +  G  ++DSGTT
Sbjct: 274 TVSTPLV----VKSRDTF---YYLTLKSISVGSKNMQ------TPDSNIKGNMVIDSGTT 320

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
            T +  + +  + +   S +  +++    +G+        C++   +   + P + +HF+
Sbjct: 321 LTLLPVKYYIEIENAVASLINADKSKDERIGSSL------CYNATADL--NIPVITMHFE 372

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
           G      P  ++F V  +   VCL        +G    I GN   +N+ V YD  ++ + 
Sbjct: 373 GADVKLYPYNSFFKVTED--LVCLAFGMSFYRNG----IYGNVAQKNFLVGYDTASKTMS 426

Query: 459 FKQQLC 464
           FK   C
Sbjct: 427 FKPTDC 432


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 124/412 (30%), Positives = 173/412 (41%), Gaps = 72/412 (17%)

Query: 89  YSISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           Y I LS GTP PQ +   LDTGS LVW  C     C  C +   P+F    S ++  + C
Sbjct: 100 YLIHLSIGTPRPQRVALTLDTGSDLVWTQCA----CHVCFAQPFPTFDALASQTTLAVPC 155

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--- 203
            +P C+   +           PL+        C  YL  Y    +T G  + +T      
Sbjct: 156 SDPICTSGKY-----------PLSGCTFNDNTC-FYLYDYADKSITSGRIVEDTFTFRSP 203

Query: 204 ---------PNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
                        +PN   GC      +  S + +GIAGF RG  SLPSQL + +FS+C 
Sbjct: 204 QGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNE-SGIAGFSRGPMSLPSQLKVARFSHCF 262

Query: 250 LSHKFDDTTRTSSLIL------DNGSSHSDKKTTGLTYTPFVN-NPSVAERNAFSVYYYV 302
            +       RTS + L      DN  +H+   T  +  TPF N N S+         YY+
Sbjct: 263 TAIA---DARTSPVFLGGAPGPDNLGAHA---TGPVQSTPFANSNGSL---------YYL 307

Query: 303 GLRRITVGGQR--VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-- 358
            L+ ITVG  R  +            G+GGTI+DSGT    +   ++  L   FV+++  
Sbjct: 308 TLKGITVGKTRLPLNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKL 367

Query: 359 -VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV--- 414
            V N +   A         R     P     + P++ LH   GA+  LP E+Y   +   
Sbjct: 368 PVANESAADAESTLCFEAARSASLPPEAPAPALPKVVLHV-AGADWDLPRESYVLDLLED 426

Query: 415 --GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             G GS +CL +     A      I+GNFQ QN +V YDL   +L F    C
Sbjct: 427 EDGSGSGLCLVM---NSAGDSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARC 475


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 96/382 (25%), Positives = 156/382 (40%), Gaps = 49/382 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTP Q +   +DT +   W PCT    C  CS++    F P  S++ + +GC 
Sbjct: 98  YIVKAKIGTPAQTLLLAMDTSNDASWVPCT---ACVGCSTTT--PFAPAKSTTFKKVGCG 152

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
             +C  + + +     C                ++   YG+       + +T+ L    +
Sbjct: 153 ASQCKQVRNPTCDGSAC----------------AFNFTYGTSSVAASLVQDTVTLATDPV 196

Query: 209 PNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
           P +  GC      S +  +   G+        +   +L    FSYCL S K    T   S
Sbjct: 197 PAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFK----TLNFS 252

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
             L  G     K+   + +TP + NP    R++    YYV L  I VG + V +  + L 
Sbjct: 253 GSLRLGPVAQPKR---IKFTPLLKNP---RRSSL---YYVNLVAIRVGRRIVDIPPEALA 303

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            + +   GT+ DSGT FT +    +  + +EF  ++  ++  T      +L G   C+  
Sbjct: 304 FNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLT----VTSLGGFDTCYTA 359

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F  G  VTLP +N       GS  CL +    +       ++ N Q
Sbjct: 360 PIVA----PTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQ 414

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ V +D+ N RLG  ++LC
Sbjct: 415 QQNHRVLFDVPNSRLGVARELC 436


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 114/418 (27%), Positives = 172/418 (41%), Gaps = 50/418 (11%)

Query: 56  LTRALHIKNP-QTKTTTTTTTTTTTNISSH-SYGGYSISLSFGTPPQIIPFILDTGSHLV 113
           LTRA  ++   +     TT       +  H S   Y ++L+ GTPPQ +  I+D G  LV
Sbjct: 16  LTRAHELRRGLELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELV 75

Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
           W  C  H  C+ C    +P F    SS+ R   C    C     ESI  R        + 
Sbjct: 76  WTQCAQH--CRRCFKQDLPLFDTNASSTFRPEPCGAAVC-----ESIPTR--------SC 120

Query: 174 KNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ----PAGIAGF 229
                    Y      G T G   ++ + +          GC+V S        +G  G 
Sbjct: 121 AGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGL 180

Query: 230 GRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPS 289
           GR   SL +Q+N   FSYCL      DT ++S+L L   S+       G   TPFV   S
Sbjct: 181 GRTNLSLAAQMNATAFSYCLAPP---DTGKSSALFL-GASAKLAGAGKGAGTTPFVKT-S 235

Query: 290 VAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEP 349
               +  S  Y + L       + +R  +  + + + GN  T V + T  T +   ++  
Sbjct: 236 TPPNSGLSRSYLLRL-------EAIRAGNATIAMPQSGNTIT-VSTATPVTALVDSVYRD 287

Query: 350 LADEFVSQMVKNRNYTRALGAEALTGLRPCFDV---PGEKTGSFPELKLHFKGGAEVTLP 406
           L           +    A+GA  +      +D+       +G  P+L L F+GGAE+T+P
Sbjct: 288 L----------RKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTVP 337

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           V +Y    G  +A C+ ++    A GG S ILG+ Q  N ++ +DL  + L F+   C
Sbjct: 338 VSSYLFDAGNDTA-CVAIL-GSPALGGVS-ILGSLQQVNIHLLFDLDKETLSFEPADC 392


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 129/464 (27%), Positives = 188/464 (40%), Gaps = 85/464 (18%)

Query: 34  SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
           S FH +PS  +   +      S  RA  +     +    +     + ++S  +  Y +++
Sbjct: 47  SPFH-DPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSADGFVSELTSTPFE-YLMAV 104

Query: 94  SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS-------FIPKLSSSSRLLG 146
           + GTPP  +  I DTGS L+W  C+        ++++          F P  S++ RL+ 
Sbjct: 105 NIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKSTTFRLVD 164

Query: 147 CQNPKCSWIHHESI----QCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETL 201
           C +  CS +   S     +CR                   Y   YG G  T G+  +ET 
Sbjct: 165 CDSVACSELPEASCGADSKCR-------------------YSYSYGDGSHTSGVLSTETF 205

Query: 202 NLPNRI----------IPNFLVGCSV--LSSRQPAGIAGFGRGKTSLPSQLNLD-----K 244
              +            + N   GCS   + S    G+ G G G  SL SQL  D     +
Sbjct: 206 TFADAPGARGDGTTTRVANVNFGCSTTFVGSSVGDGLVGLGGGDLSLVSQLGADTSLGRR 265

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
           FSYCL+ +    + + SS +  N    +     G   TP +  PS  +      YY V L
Sbjct: 266 FSYCLVPY----SVKASSAL--NFGPRAAVTDPGAVTTPLI--PSQVK-----AYYIVEL 312

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
           R + VG +         T +       IVDSGTT TF+   L +PL  E   ++      
Sbjct: 313 RSVKVGNK---------TFEAPDRSPLIVDSGTTLTFLPEALVDPLVKELTGRI----KL 359

Query: 365 TRALGAEALTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
             A   E L  L  CFDV G + G      P++ +   GGA VTL  EN F  V EG+ +
Sbjct: 360 PPAQSPERLLPL--CFDVSGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGT-L 416

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           CL V    E    P+ I+GN   QN +V YDL    + F    C
Sbjct: 417 CLAVSAMSEQF--PASIIGNIAQQNMHVGYDLDKGTVTFAPAAC 458


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 161/388 (41%), Gaps = 59/388 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 181 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKL--FDPASSSTYANVS 238

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS +           D    +  +C      Y V YG G  + G    +TL L +
Sbjct: 239 CAAPACSDL-----------DVSGCSGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 282

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
              +  F  GC   +     + AG+ G GRGKTSLP Q        F++CL +       
Sbjct: 283 YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARS----- 337

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            T +  LD G+      TT    TP +  N P+         +YYVG+  I VGG+ + +
Sbjct: 338 -TGTGYLDFGAGSPPATTT----TPMLTGNGPT---------FYYVGMTGIRVGGRLLPI 383

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                        GTIVDSGT  T + P  +  L     +  +  R Y +   A A++ L
Sbjct: 384 APSVFAA-----AGTIVDSGTVITRLPPAAYSSL-RSAFAAAMAARGYRK---AAAVSLL 434

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C+D  G    + P + L F+GGA + +        V   S VCL    + +  GG   
Sbjct: 435 DTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV-SASQVCLAFAGNED--GGDVG 491

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q++ + V YD+  + +GF    C
Sbjct: 492 IVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 161/385 (41%), Gaps = 54/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  + + GTPPQ    ++D    LVW  C    QC  C     P F P  S++ R   C 
Sbjct: 51  YVANFTIGTPPQPASAVIDLAGELVWTQCK---QCSRCFEQDTPLFDPTASNTYRAEPCG 107

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P C  I  +S              +NC+    +Y     +G T G   ++T  +     
Sbjct: 108 TPLCESIPSDS--------------RNCSGNVCAYQASTNAGDTGGKVGTDTFAV-GTAK 152

Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
            +   GC V S       P+GI G GR   SL +Q  +  FSYCL  H   D  + S+L 
Sbjct: 153 ASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGKNSALF 209

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L  GSS           TPFVN       N  S YY V L  +  G   + +     T+ 
Sbjct: 210 L--GSSAKLAGGGKAASTPFVN--ISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV- 264

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL-TGLRP---CF 380
                  ++D+ +  +F+    +         Q VK +  T A+GA  + T + P   CF
Sbjct: 265 -------LLDTFSPISFLVDGAY---------QAVK-KAVTVAVGAPPMATPVEPFDLCF 307

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILG 439
              G  +G+ P+L   F+GGA +T+   NY      G+ VCL +++    +    + +LG
Sbjct: 308 PKSG-ASGAAPDLVFTFRGGAAMTVAASNYLLDYKNGT-VCLAMLSSARLNSTTELSLLG 365

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           + Q +N +  +DL  + L F+   C
Sbjct: 366 SLQQENIHFLFDLDKETLSFEPADC 390


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 118/399 (29%), Positives = 153/399 (38%), Gaps = 98/399 (24%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
           +S G Y+++LS GTPP     + DTGS L+W  C     C  C++   P F P  SS+  
Sbjct: 85  NSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCA---PCTECAARPAPPFQPASSSTFS 141

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
            L C +  C ++       R CN      +  C    P     YG G T G   +ETL++
Sbjct: 142 KLPCASSLCQFLTSPY---RTCN------ATGCVYYYP-----YGMGFTAGYLATETLHV 187

Query: 204 PNRIIPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
                P    GCS  +      +GI G GR   SL SQ+ + +FSYCL S+         
Sbjct: 188 GGASFPGVTFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRSNA-------- 239

Query: 262 SLILDNGSS----HSDKKTTG--LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
               D G S     S  K TG  +  TP + NP +      S YYYV L  ITVG   + 
Sbjct: 240 ----DAGDSPILFGSLAKVTGGNVQSTPLLENPEMPS----SSYYYVNLTGITVGATDLP 291

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +    LT            +GT F                                   G
Sbjct: 292 MAMANLT----------TVNGTRF-----------------------------------G 306

Query: 376 LRPCFD---VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTD 427
              CFD     G      P L L F GGAE  +   +YF VV     G  +  CL V+  
Sbjct: 307 FDLCFDATAAGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLVL-- 364

Query: 428 REASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             AS   SI I+GN    + +V YDL      F    C 
Sbjct: 365 -PASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 402


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/381 (26%), Positives = 151/381 (39%), Gaps = 68/381 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + ++ GTPP  +  +LDTGS L+W  C     C+ C     P + P  S++   + C+
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQC--DAPCRRCFPQPAPLYAPARSATYANVSCR 149

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
           +P C  +     +C   +         C     +Y   YG G  T+G+  +ET  L  + 
Sbjct: 150 SPMCQALQSPWSRCSPPD-------TGC-----AYYFSYGDGTSTDGVLATETFTLGSDT 197

Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
            +     GC   ++ S+   +G+ G GRG  SL SQL + +                   
Sbjct: 198 AVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRPR----------------- 240

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
                 S   +        P   +P               L  ITVG   + +      L
Sbjct: 241 -----RSCRARAAARGGGAPTTTSP---------------LEGITVGDTLLPIDPAVFRL 280

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
              G+GG I+DSGTTFT +    F  LA    S++         L + A  GL  CF   
Sbjct: 281 TPMGDGGVIIDSGTTFTALEERAFVALARALASRV------RLPLASGAHLGLSLCFAAA 334

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
             +    P L LHF  GA++ L  E+Y          CL +V+ R  S     +LG+ Q 
Sbjct: 335 SPEAVEVPRLVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARGMS-----VLGSMQQ 388

Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
           QN ++ YDL    L F+   C
Sbjct: 389 QNTHILYDLERGILSFEPAKC 409


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 161/385 (41%), Gaps = 54/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  + + GTPPQ    ++D    LVW  C    QC  C     P F P  S++ R   C 
Sbjct: 51  YVANFTIGTPPQPASAVIDLAGELVWTQCK---QCGRCFEQGTPLFDPTASNTYRAEPCG 107

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P C  I  +               +NC+    +Y     +G T G   ++T  +     
Sbjct: 108 TPLCESIPSDV--------------RNCSGNVCAYEASTNAGDTGGKVGTDTFAV-GTAK 152

Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
            +   GC V S       P+GI G GR   SL +Q  +  FSYCL  H   D  + S+L 
Sbjct: 153 ASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGKNSALF 209

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L  GSS           TPFVN       N  S YY V L  +  G   + +     T+ 
Sbjct: 210 L--GSSAKLAGGGKAASTPFVN--ISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV- 264

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL-TGLRP---CF 380
                  ++D+ +  +F+    +         Q VK +  T A+GA  + T + P   CF
Sbjct: 265 -------LLDTFSPISFLVDGAY---------QAVK-KAVTVAVGAPPMATPVEPFDLCF 307

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILG 439
              G  +G+ P+L   F+GGA +T+P  NY      G+ VCL +++    +    + +LG
Sbjct: 308 PKSG-ASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGT-VCLAMLSSARLNSTTELSLLG 365

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           + Q +N +  +DL  + L F+   C
Sbjct: 366 SLQQENIHFLFDLDKETLSFEPADC 390


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 165/384 (42%), Gaps = 58/384 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++  G+P      ++DTGS + W  C     C  C S   P F P  SS+     C 
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCG 184

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI 207
           +  C+ +  E   C        ++S  C      Y+V YG G  T G   S+TL L +  
Sbjct: 185 SAACAQLGQEGNGC--------SSSSQC-----QYIVTYGDGSSTTGTYSSDTLALGSSA 231

Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTS 261
           + +F  GCS + S    Q  G+ G G G  SL SQ    L + FSYCL        T +S
Sbjct: 232 VKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL------PPTPSS 285

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           S  L      +     G   + FV  P +   +    +Y V L+ I VGG+++ +     
Sbjct: 286 SGFL------TLGAAGGSGTSGFVKTP-MLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF 338

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
           +       GT++DSGT  T + P  +  L+  F + M   + Y  A  +  L     CFD
Sbjct: 339 S------AGTVMDSGTVITRLPPTAYSALSSAFKAGM---KQYPPAQPSGIL---DTCFD 386

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGN 440
             G+ + S P + L F GGA V+L             + CL    + + S   S+ I+GN
Sbjct: 387 FSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAANSDDS---SLGIIGN 437

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q + + V YD+    +GF+   C
Sbjct: 438 VQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 158/386 (40%), Gaps = 63/386 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +S+  G+P + +  I DTGS L W  C           S   +F P  S+S   + 
Sbjct: 132 GNYIVSIGLGSPKKDLMLIFDTGSDLTWARC-----------SAAETFDPTKSTSYANVS 180

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS +   +     C       +  C      Y + YG G  + G    E L + +
Sbjct: 181 CSTPLCSSVISATGNPSRC------AASTCV-----YGIQYGDGSYSIGFLGKERLTIGS 229

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTT 258
             I  NF  GC         + AG+ G GR K S+ SQ        FSYCL S       
Sbjct: 230 TDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPS------- 282

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            +S+  L  GSS S        +TP  + P        S +Y + L  ITVGGQ++ +  
Sbjct: 283 SSSTGFLSFGSSQSKSA----KFTPLSSGP--------SSFYNLDLTGITVGGQKLAIPL 330

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +       GTI+DSGT  T + P  +  L   F   M      +  +G + L+ L  
Sbjct: 331 SVFS-----TAGTIIDSGTVVTRLPPAAYSALRSAFRKAMA-----SYPMG-KPLSILDT 379

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+D    KT   P++ + F GG +V +     F   G    VCL    +  A    + I 
Sbjct: 380 CYDFSKYKTIKVPKIVISFSGGVDVDVDQAGIFVANGL-KQVCLAFAGNTGAR--DTAIF 436

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q +N+ V YD+   ++GF    C
Sbjct: 437 GNTQQRNFEVVYDVSGGKVGFAPASC 462


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 119/396 (30%), Positives = 167/396 (42%), Gaps = 70/396 (17%)

Query: 86  YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           +G Y +  S GTP      I DTGS L W  CT    CK C   + P F P  SS+   +
Sbjct: 85  HGEYLMRFSLGTPSVERLAIFDTGSDLSWLQCT---PCKTCYPQEAPLFDPTQSSTYVDV 141

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEG------IALS 198
            C++  C+       +C         +SK C      YL  YG+   T G      I+ S
Sbjct: 142 PCESQPCTLFPQNQREC--------GSSKQCI-----YLHQYGTDSFTIGRLGYDTISFS 188

Query: 199 ET-LNLPNRIIPNFLVGCSVLS------SRQPAGIAGFGRGKTSLPSQLNLD---KFSYC 248
            T +       P  + GC+  S      S +  G  G G G  SL SQL      KFSYC
Sbjct: 189 STGMGQGGATFPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYC 248

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           ++       + TS+  L  GS      T  +  TPF+ NPS      +  YY + L  IT
Sbjct: 249 MVPF-----SSTSTGKLKFGSM---APTNEVVSTPFMINPS------YPSYYVLNLEGIT 294

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VG ++V        L     G  I+DS    T +   ++     +F+S + +  N   A 
Sbjct: 295 VGQKKV--------LTGQIGGNIIIDSVPILTHLEQGIYT----DFISSVKEAINVEVA- 341

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
             +A T    C   P     +FPE   HF  GA+V L  +N F  + + + VC+TVV  +
Sbjct: 342 -EDAPTPFEYCVRNPTNL--NFPEFVFHFT-GADVVLGPKNMFIAL-DNNLVCMTVVPSK 396

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             S     I GN+   N+ VEYDL  +++ F    C
Sbjct: 397 GIS-----IFGNWAQVNFQVEYDLGEKKVSFAPTNC 427


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 133/494 (26%), Positives = 205/494 (41%), Gaps = 95/494 (19%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLT-----FSLSRFHTNP--------SQDSYQN 47
           MA+ IS      +FF  +L +   S T++      F+ S FH +         S   Y  
Sbjct: 1   MAATIS------LFFHLILFLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDR 54

Query: 48  LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILD 107
           L +    SL+R+  + N   +  T+      ++I   S G Y +S+S GTPP     I D
Sbjct: 55  LANAFRRSLSRSAALLN---RAATSGAVGLQSSIGPGS-GEYLMSVSIGTPPVDYLGIAD 110

Query: 108 TGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCND 167
           TGS L W  C     C  C     P F P  S+S   + C    C  +          +D
Sbjct: 111 TGSDLTWAQC---LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAV----------DD 157

Query: 168 EPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ---P 223
                   C      Y   YG    ++G    E + + +  + + ++GC   SS      
Sbjct: 158 GHCGVQGVC-----DYSYTYGDRTYSKGDLGFEKITIGSSSVKS-VIGCGHASSGGFGFA 211

Query: 224 AGIAGFGRGKTSLPSQLNLD-----KFSYC---LLSHKFDDTTRTSSLILDNGSSHSDKK 275
           +G+ G G G+ SL SQ++       +FSYC   LLSH         + ++          
Sbjct: 212 SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGENAVVSG-------- 263

Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
             G+  TP ++  +V        YYY+ L  I++G +R   + K         G  I+DS
Sbjct: 264 -PGVVSTPLISKNTV-------TYYYITLEAISIGNERHMAFAK--------QGNVIIDS 307

Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD--VPGEKTGSFPEL 393
           GTT T +  EL++ +    + ++VK +      G+     L  CFD  +    +   P +
Sbjct: 308 GTTLTILPKELYDGVVSSLL-KVVKAKRVKDPHGS-----LDLCFDDGINAAASLGIPVI 361

Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEY 450
             HF GGA V L   N F  V + +  CLT+   + AS  P+    I+GN    N+ + Y
Sbjct: 362 TAHFSGGANVNLLPINTFRKVAD-NVNCLTL---KAAS--PTTEFGIIGNLAQANFLIGY 415

Query: 451 DLRNQRLGFKQQLC 464
           DL  +RL FK  +C
Sbjct: 416 DLEAKRLSFKPTVC 429


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 134/482 (27%), Positives = 197/482 (40%), Gaps = 93/482 (19%)

Query: 5   ISALCLSFIFFFTLLSIFPSSITSLT--FSLSRFHTN--------PSQDSYQNLNSLVSS 54
           +SA     + FFT+     S   +L   F+L   H +        P+Q+ Y+ + + V  
Sbjct: 1   MSAHSFLTLLFFTIFCFIISLSHALNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRR 60

Query: 55  SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
           S+ R  H      K + T+T  +T N      G Y +S S GTPP  +   +DTGS LVW
Sbjct: 61  SINRVNHFY----KYSLTSTPQSTVNSDK---GEYLMSYSIGTPPFKVFGFVDTGSDLVW 113

Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
             C     CK C     P F P LSSS + + C +  C  +   S   R           
Sbjct: 114 LQCE---PCKQCYPQITPIFDPSLSSSYQNIPCLSDTCHSMRTTSCDVR----------- 159

Query: 175 NCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ----PAGIAGFG 230
                   YL +      E + L  T    +   P  ++GC   ++       +GI G G
Sbjct: 160 -------GYLSV------ETLTLDSTTGY-SVSFPKTMIGCGYRNTGTFHGPSSGIVGLG 205

Query: 231 RGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
            G  SLPSQL      KFSYCL               L N +S  +     + Y      
Sbjct: 206 SGPMSLPSQLGTSIGGKFSYCL------------GPWLPNSTSKLNFGDAAIVYGDGAMT 253

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI-VDSGTTFTFMAPEL 346
             + +++A S  YY+ L   +VG + +            GN G I +DSGTTFTF+  ++
Sbjct: 254 TPIVKKDAQSG-YYLTLEAFSVGNKLIEFGGP----TYGGNEGNILIDSGTTFTFLPYDV 308

Query: 347 ---FEPLADEFVS-QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAE 402
              FE    E+++ + V++ N T  L          C++V      + P +  HFK GA+
Sbjct: 309 YYRFESAVAEYINLEHVEDPNGTFKL----------CYNVAYHGFEA-PLITAHFK-GAD 356

Query: 403 VTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
           + L   + F  V +G A CL  +  + A      I GN   QN  V Y+L    + FK  
Sbjct: 357 IKLYYISTFIKVSDGIA-CLAFIPSQTA------IFGNVAQQNLLVGYNLVQNTVTFKPV 409

Query: 463 LC 464
            C
Sbjct: 410 DC 411


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 114/395 (28%), Positives = 161/395 (40%), Gaps = 65/395 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +    GTPP       DTGS L+W  C+    C  C     P F P  SS+     
Sbjct: 88  GEYLMRFYIGTPPVERLATADTGSDLIWVQCS---PCASCFPQSTPLFQPLKSSTFMPTT 144

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS--GLTEGIALSETLNLP 204
           C++  C+ +  E   C          S  C      Y   YG     +EG+  +ETL   
Sbjct: 145 CRSQPCTLLLPEQKGC--------GKSGECI-----YTYKYGDQYSFSEGLLSTETLRFD 191

Query: 205 NR------IIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCL 249
           ++        PN   GC      +V  S +  GI G G G  SL SQ+      KFSYCL
Sbjct: 192 SQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCL 251

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           L      +T TS L   N S  + +   G+  TP +  P +        YY++ L  +TV
Sbjct: 252 LPL---GSTSTSKLKFGNESIITGE---GVVSTPMIIKPWLP------TYYFLNLEAVTV 299

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
             + V       T   DGN   I+DSGT  T++    +   A           +    L 
Sbjct: 300 AQKTVP------TGSTDGN--VIIDSGTLLTYLGESFYYNFAASL------QESLAVELV 345

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
            + L+ L  CF  P      FPE+   F  GA V+L   N F +  + + VCL ++    
Sbjct: 346 QDVLSPLPFCF--PYRDNFVFPEIAFQFT-GARVSLKPANLFVMTEDRNTVCL-MIAPSS 401

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            SG    I G+F   ++ VEYDL  +++ F+   C
Sbjct: 402 VSG--ISIFGSFSQIDFQVEYDLEGKKVSFQPTDC 434


>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 450

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 106/405 (26%), Positives = 166/405 (40%), Gaps = 59/405 (14%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++S+  GTPPQ +  +LDTGS L      +   C   S S    F    S +   + C +
Sbjct: 66  TVSVVVGTPPQNVTMVLDTGSEL------SGLLCNGSSLSPPAPFNASASLTYSAVDCSS 119

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
           P C W     +  R   D P +TS      C   +    +   +G  +++T  L  + +P
Sbjct: 120 PACVW-RGRDLPVRPFCDAPPSTS------CRVSISYADASSADGHLVADTFILGTQAVP 172

Query: 210 NFLVGC-------------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
             L GC             +   S    G+ G  RG  S  +Q    +F+YC+   +   
Sbjct: 173 A-LFGCITSYSSSTAINSSATDPSEAATGLLGMNRGSLSFVTQTATLRFAYCIAPGQGPG 231

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRITVGGQRV 314
                                 L YTP +    +++   +   V Y V L  I VG   +
Sbjct: 232 ILLLGG---------DGGAAPPLNYTPLIE---ISQPLPYFDRVAYSVQLEGIRVGSALL 279

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
           ++    LT D  G G T+VDSGT FTF+  + +  L  EF++Q    R+    LG     
Sbjct: 280 QIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFLNQA---RSLLAPLGEPGFV 336

Query: 375 ---GLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVEN-YFAVVGE------GSAV 420
                  CF  P E+  +     PE+ L  + GAEV +  E   ++V GE        AV
Sbjct: 337 FQGAFDACFRGPEERVSAASRLLPEVGLVLR-GAEVAVAGEKLLYSVPGERRGEEGAEAV 395

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                 + + +G  + ++G+   Q+ +VEYDL+N R+GF    C+
Sbjct: 396 WCLTFGNSDMAGMSAYVIGHHHQQDVWVEYDLQNGRVGFAPARCE 440


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 113/409 (27%), Positives = 166/409 (40%), Gaps = 79/409 (19%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + +S++ GTPP  +  I DTGS L W  C     C+ C     P F  K SS+ +   
Sbjct: 83  GEFFMSITIGTPPIKVFAIADTGSDLTWVQCK---PCQQCYKENGPIFDKKKSSTYKSEP 139

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C  +      C + N+           IC  Y   YG    ++G   +ET+++ +
Sbjct: 140 CDSRNCQALSSTERGCDESNN-----------IC-KYRYSYGDQSFSKGDVATETVSIDS 187

Query: 206 R-----IIPNFLVGCSVLSSRQPAGIAGFGRGKT----------------SLPSQLN--- 241
                   P  + GC            G+  G T                SL SQL    
Sbjct: 188 ASGSPVSFPGTVFGC------------GYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI 235

Query: 242 LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD-KKTTGLTYTPFVNNPSVAERNAFSVYY 300
             KFSYC LSHK   T  TS + L   S  S   K +G+  TP V+   +        YY
Sbjct: 236 SKKFSYC-LSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-------TYY 287

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDG-----NGGTIVDSGTTFTFMAPELFEPLADEFV 355
           Y+ L  I+VG +++         + DG     +G  I+DSGTT T +    F+  +    
Sbjct: 288 YLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVE 347

Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG 415
             +   +  +   G      L  CF     + G  PE+ +HF  GA+V L   N F  + 
Sbjct: 348 ESVTGAKRVSDPQGL-----LSHCFKSGSAEIG-LPEITVHFT-GADVRLSPINAFVKLS 400

Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E   VCL++V   E +     I GNF   ++ V YDL  + + F+   C
Sbjct: 401 E-DMVCLSMVPTTEVA-----IYGNFAQMDFLVGYDLETRTVSFQHMDC 443


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 114/422 (27%), Positives = 173/422 (40%), Gaps = 61/422 (14%)

Query: 58  RALHIKNPQTKTTTTTTTTTTTNISSHSYG--GYSISLSFGTPPQIIPFILDTGSHLVW- 114
           R + I  P T         T  + +  S G   + +++ FGTP Q    + DTGS + W 
Sbjct: 87  RGIPISYPPTIPPAEAPAVTIPDSTGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWI 146

Query: 115 --FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
              PC+ H     C     P F P  S++   + C +P+C+    +              
Sbjct: 147 QCLPCSGH-----CYKQHDPIFDPTKSATYSAVPCGHPQCAAAGGKC------------- 188

Query: 173 SKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN-RIIPNFLVGC---SVLSSRQPAGIA 227
           S N T +   Y V YG G  T G+   ETL+L + R +P F  GC   ++       G+ 
Sbjct: 189 SSNGTCL---YKVQYGDGSSTAGVLSHETLSLTSARALPGFAFGCGETNLGDFGDVDGLI 245

Query: 228 GFGRGKTSLPSQLNLDKFS---YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
           G GRG+ SL SQ      +   YCL S+       TS   L  G++     + G+ YT  
Sbjct: 246 GLGRGQLSLSSQAAASFGAAFSYCLPSYN------TSHGYLTIGTTTPASGSDGVRYTAM 299

Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
           +      ++  +  +Y+V L  I VGG  + V     T D     GT++DSGT  T++ P
Sbjct: 300 I------QKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRD-----GTLLDSGTVLTYLPP 348

Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVT 404
           E +  L D F   M + +       A A      C+D  G+     P +   F  G+   
Sbjct: 349 EAYTALRDRFKFTMTQYKP------APAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFD 402

Query: 405 LPVENYFAVVGEGSAV--CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
           L          + +    CL  V     S  P  I+GN Q +N  + YD+  +++GF   
Sbjct: 403 LSPFGVLIFPDDTAPATGCLAFVP--RPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSG 460

Query: 463 LC 464
            C
Sbjct: 461 SC 462


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 160/388 (41%), Gaps = 59/388 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 178 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKL--FDPASSSTYANVS 235

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS +           D    +  +C      Y V YG G  + G    +TL L +
Sbjct: 236 CAAPACSDL-----------DVSGCSGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 279

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
              +  F  GC   +     + AG+ G GRGKTSLP Q        F++CL         
Sbjct: 280 YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRS----- 334

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            T +  LD G+      TT    TP +  N P+         +YYVG+  I VGG+ + +
Sbjct: 335 -TGTGYLDFGAGSPPATTT----TPMLTGNGPT---------FYYVGMTGIRVGGRLLPI 380

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                        GTIVDSGT  T + P  +  L     +  +  R Y +   A A++ L
Sbjct: 381 APSVFAA-----AGTIVDSGTVITRLPPAAYSSL-RSAFAAAMAARGYRK---AAAVSLL 431

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C+D  G    + P + L F+GGA + +        V   S VCL    + +  GG   
Sbjct: 432 DTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV-SASQVCLAFAGNED--GGDVG 488

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q++ + V YD+  + +GF    C
Sbjct: 489 IVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 158/386 (40%), Gaps = 61/386 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +S+  GTP + +  + DTGS L W  C     C  C     P F P  S++   + C 
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCK---PCNNCYKQHDPLFDPSQSTTYSAVPCG 244

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
                         ++C D    +S  C      Y V+YG    T+G    +TL L   +
Sbjct: 245 -------------AQECLDSGTCSSGKC-----RYEVVYGDMSQTDGNLARDTLTLGPSS 286

Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
             +  F+ GC    +    +  G+ G GR + SL SQ        FSYCL S      + 
Sbjct: 287 DQLQGFVFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPS------SW 340

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            +   L  GS+ +          P     ++  R+    +YY+ L  I V G+ VRV   
Sbjct: 341 RAEGYLSLGSAAA---------PPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPA 391

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
                     GT++DSGT  T +    +  L   F   M   R Y RA    AL+ L  C
Sbjct: 392 VFKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFM---RRYKRA---PALSILDTC 440

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-IL 438
           +D  G      P + L F GGA + L       V    S  CL   ++ + +   S+ IL
Sbjct: 441 YDFTGRTKVQIPSVALLFDGGATLNLGFGGVLYVANR-SQACLAFASNGDDT---SVGIL 496

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q + + V YDL NQ++GF  + C
Sbjct: 497 GNMQQKTFAVVYDLANQKIGFGAKGC 522


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 111/389 (28%), Positives = 161/389 (41%), Gaps = 58/389 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 178 GNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKL--FDPARSSTYANVS 235

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS ++        C                 Y V YG G  + G    +TL L +
Sbjct: 236 CAAPACSDLNIHGCSGGHC----------------LYGVQYGDGSYSIGFFAMDTLTLSS 279

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
              +  F  GC   +     + AG+ G GRGKTSLP Q   DK    F++CL +      
Sbjct: 280 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 334

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             T +  LD G+      +  LT TP +  N P+         +YYVG+  I VGGQ + 
Sbjct: 335 --TGTGYLDFGAGSLAAASARLT-TPMLTDNGPT---------FYYVGMTGIRVGGQLLS 382

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +             GTIVDSGT  T + P  +  L     +  +  R Y +   A A++ 
Sbjct: 383 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSL-RYAFAAAMAARGYKK---APAVSL 433

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
           L  C+D  G    + P + L F+GGA + +            S VCL    + +  GG  
Sbjct: 434 LDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIM-YAASASQVCLAFAANED--GGDV 490

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q++ + V YD+  + +GF    C
Sbjct: 491 GIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 119/424 (28%), Positives = 181/424 (42%), Gaps = 47/424 (11%)

Query: 58  RALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC 117
           R  HI   +   TT T ++TT  +  H     ++SL+ G+PPQ +  +LDTGS L W  C
Sbjct: 38  RHSHISTARKYFTTATASSTTNKLLFHHNVSLTVSLTVGSPPQNVTMVLDTGSELSWLHC 97

Query: 118 TNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT 177
               + ++ +S     F P  S +   + C +P C        + RD     +  S + T
Sbjct: 98  K---KTQFLNS----VFNPLSSKTYSKVPCLSPTC------KTRTRDLT---IPVSCDAT 141

Query: 178 QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFG 230
           ++C   +    +   EG    ET  L +   P  + GC  S  SS      +  G+ G  
Sbjct: 142 KLCHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGFSSNSEEDSKTTGLIGMN 201

Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           RG  S  +Q+   KFSYC+    FD       L+L N S    K    L+YTP V   S 
Sbjct: 202 RGSLSFVNQMGYPKFSYCI--SGFDS---AGVLLLGNASFPWLKP---LSYTPLV-QIST 252

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
                  V Y V L  I V  + + +       D  G G T+VDSGT FTF+   ++  L
Sbjct: 253 PLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTAL 312

Query: 351 ADEFVSQMVKNRNYTRALGAEALT---GLRPCF--DVPGEKTGSFPELKLHFKGGAEVTL 405
            +EF+SQ    R   + L  +       +  C+  D       + P + L F+ GAE+++
Sbjct: 313 KNEFLSQ---TRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQ-GAEMSV 368

Query: 406 PVENYF-----AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
             E         V G  S  C T   + +  G  + ++G+   QN ++E+DL   R+G  
Sbjct: 369 SGERLLYRVPGEVRGRDSVWCFT-FGNSDLLGVEAFVIGHHHQQNVWMEFDLEKSRIGLA 427

Query: 461 QQLC 464
              C
Sbjct: 428 DVRC 431


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 111/388 (28%), Positives = 158/388 (40%), Gaps = 50/388 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G P +     LDTGS + W  C     C  C S   P + P  SSS R + 
Sbjct: 10  GEYFARMGIGNPQRSYYLELDTGSDVTWIQCA---PCSSCYSQVDPIYDPSNSSSYRRVY 66

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
           C +  C  + + + Q   C                SY V+YG S  + G    E+  L P
Sbjct: 67  CGSALCQALDYSACQGMGC----------------SYRVVYGDSSASSGDLGIESFYLGP 110

Query: 205 NR--IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDD 256
           N    + N   GC   +S   R  AG+ G G G  S  SQ+       FSYCL+      
Sbjct: 111 NSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQL 170

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            +R+S LI    +     +     +TP + NP +      + +YY  L  I+VGG  + +
Sbjct: 171 QSRSSPLIFGRTAIPFAAR-----FTPLLKNPRI------NTFYYAVLTGISVGGTPLPI 219

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                 L  +G GG I+DSGT+ T + P  +  L D +      +RN   A G   L   
Sbjct: 220 PPAQFALTGNGTGGAILDSGTSVTRVVPPAYAVLRDAY---RAASRNLPPAPGVYLLD-- 274

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF+  G  T   P L LHF  G ++ LP  N    V      CL        S  P  
Sbjct: 275 -TCFNFQGLPTVQIPSLVLHFDNGVDMVLPGGNILIPVDRSGTFCLAFAP----SSMPIS 329

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ++GN Q Q + + +DL+   +    + C
Sbjct: 330 VIGNVQQQTFRIGFDLQRSLIAIAPREC 357


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 112/388 (28%), Positives = 165/388 (42%), Gaps = 66/388 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++  G+P      ++DTGS + W  C     C  C S   P F P  SS+     C 
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCG 184

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI 207
           +  C+ +  E   C        ++S  C      Y+V YG G  T G   S+TL L +  
Sbjct: 185 SADCAQLGQEGNGC--------SSSSQC-----QYIVTYGDGSSTTGTYSSDTLALGSSA 231

Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTS 261
           + +F  GCS + S    Q  G+ G G G  SL SQ    L + FSYCL        T +S
Sbjct: 232 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL------PPTPSS 285

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           S  L      +     G   + FV  P +   +    +Y V L+ I VGG+++ +     
Sbjct: 286 SGFL------TLGAAGGSGTSGFVKTP-MLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF 338

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
           +       GT++DSGT  T + P  +  L+  F + M   + Y  A  +  L     CFD
Sbjct: 339 S------AGTVMDSGTVITRLPPTAYSALSSAFKAGM---KQYPPAQPSGIL---DTCFD 386

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLP-----VENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             G+ + S P + L F GGA V+L      + N  A  G          +D  + G    
Sbjct: 387 FSGQSSVSIPSVALVFSGGAVVSLDASGIILSNCLAFAGN---------SDDSSLG---- 433

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q + + V YD+    +GF+   C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 118/454 (25%), Positives = 192/454 (42%), Gaps = 64/454 (14%)

Query: 31  FSLSRFHTNPSQDSY--------QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNIS 82
           FS+   H + S+  +        Q ++++V+ S+ RA ++ +  + +       T   I 
Sbjct: 27  FSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKPT---II 83

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
            ++   Y +S S GTPP  +  ++DTGS  +WF C     CK C +   P F P  SS+ 
Sbjct: 84  PYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCK---PCKPCLNQTSPIFNPSKSSTY 140

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
           + + C +P C     E  +C          S N  + C   +       ++G    +TL 
Sbjct: 141 KNIRCSSPICK--RGEKTRC----------SSNRKRKCEYEITYLDRSGSQGDISKDTLT 188

Query: 203 LPNR-----IIPNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLL 250
           L +        P  ++GC    S+ +    +GI GFGRG  S+ SQL      KFSYCL 
Sbjct: 189 LNSNDGSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLA 248

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
           S  F     +S L   + +  S     G+  TP + +  V         Y+  L   +VG
Sbjct: 249 S-LFSKANISSKLYFGDMAVVSGH---GVVSTPLIQSFYVGN-------YFTNLEAFSVG 297

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
              +++    L  D +GN   ++DSG+T T +  +++  L    +S MVK +        
Sbjct: 298 DHIIKLKDSSLIPDNEGNA--VIDSGSTITQLPNDVYSQLETAVIS-MVKLKRV-----K 349

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
           +    L  C+    +K    P +  HF+ GA+V L   N F  +     +C        +
Sbjct: 350 DPTQQLSLCYKTTLKKY-EVPIITAHFR-GADVKLNAFNTFIQMNH-EVMCFAF----NS 402

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           S  P ++ GN   QN+ V YD     + FK   C
Sbjct: 403 SAFPWVVYGNIAQQNFLVGYDTLKNIISFKPTNC 436


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 115/415 (27%), Positives = 168/415 (40%), Gaps = 55/415 (13%)

Query: 71  TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK 130
           + +++ TT  +  H     + SL+ GTPPQ I  +LDTGS L W  C            K
Sbjct: 49  SNSSSKTTGKLLFHHNVTLTASLTIGTPPQNITMVLDTGSELSWLRC-----------KK 97

Query: 131 IPSFI----PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
            P+F     P  S +   + C +  C             +D  L  + +  ++C   +  
Sbjct: 98  EPNFTSIFNPLASKTYTKIPCSSQTCK---------TRTSDLTLPVTCDPAKLCHFIISY 148

Query: 187 YGSGLTEGIALSETLNLPNRIIPNFLVGC-------SVLSSRQPAGIAGFGRGKTSLPSQ 239
             +   EG    ET    +   P  + GC       +     +  G+ G  RG  S  +Q
Sbjct: 149 ADASSVEGHLAFETFRFGSLTRPATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQ 208

Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
           +   KFSYC+     D T      +L   + +S  K   L YTP V   S        V 
Sbjct: 209 MGFRKFSYCI--SGLDST----GFLLLGEARYSWLKP--LNYTPLV-QISTPLPYFDRVA 259

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM- 358
           Y V L  I V  + + +       D  G G T+VDSGT FTF+   ++  L  EF+ Q  
Sbjct: 260 YSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTA 319

Query: 359 ----VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF--- 411
               V N       GA  L  L    D       + P +KL F+ GAE+++  +      
Sbjct: 320 GVLRVLNEPQYVFQGAMDLCYL---IDSTSSTLPNLPVVKLMFR-GAEMSVSGQRLLYRV 375

Query: 412 --AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              V G+ S  C T     E  G  S ++G+ Q QN ++EYDL N R+GF +  C
Sbjct: 376 PGEVRGKDSVWCFTFGNSDEL-GISSFLIGHHQQQNVWMEYDLENSRIGFAELRC 429


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 162/398 (40%), Gaps = 47/398 (11%)

Query: 81  ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
           I SH  G Y + +  G+PP     + DTGS ++W  C+    C  C +   P F P  S+
Sbjct: 115 IVSHGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCS---PCSDCYAQGDPLFDPANSA 171

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSE 199
           S   + C +  C      S           ++S         Y V YG    T G+   E
Sbjct: 172 SFSPVPCNSGVCRAAARYS-----------SSSCGGGGGECEYKVSYGDKSYTNGVLALE 220

Query: 200 TLNLPNRI-IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSH 252
           TL L     +    +GC   +     + AG+ G G G  SL  QL       FSYCL  +
Sbjct: 221 TLTLDGGTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGY 280

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
              + + + SL+L       D   TG  + P V NP          +YYVG+  + V G+
Sbjct: 281 YSGEGSGSGSLVL----GREDAAPTGAVWVPLVRNPDAPS------FYYVGVNGLGVAGE 330

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+++      L  DG GG ++D+GT  T +  E +  L   F     +      A  A  
Sbjct: 331 RLQLQDGLFDLGDDGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEG-----APRAPG 385

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKG------GAEVTLPVENYFAVVGEGSAVCLTVVT 426
           ++    C+D+ G  +   P + L+F G       A +TLP  N    V +G   CL    
Sbjct: 386 VSLFDTCYDLSGYASVRVPTVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAA 445

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               + GPS ILGN Q Q   +  D  +  +GF    C
Sbjct: 446 ---VASGPS-ILGNIQQQGIEITVDSASGYVGFGPATC 479


>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
 gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
          Length = 500

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 113/439 (25%), Positives = 187/439 (42%), Gaps = 81/439 (18%)

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
           H+ +P TK + TT               Y   ++  TP   +  ++D G   +W  C NH
Sbjct: 34  HLFSPVTKDSATTLQ-------------YIAQINQRTPLVPLNLVVDLGGKFLWVDCENH 80

Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
           Y                 SS+ R + C + +CS    +S  C DC   P     N   + 
Sbjct: 81  YT----------------SSTYRPVRCPSAQCSLAKSDS--CGDCFSSPKPGCNNTCGLI 122

Query: 181 PSYLVLYGSGLTEGIALSETLNLP---------NRIIPNFLVGCSVLS-----SRQPAGI 226
           P   + + +  T G    + L++          N ++  FL  C+  S     +   +G+
Sbjct: 123 PDNTITHSA--TRGDLAEDVLSIQSTSGFNTGQNVVVSRFLFSCAPTSLLRGLAGGASGM 180

Query: 227 AGFGRGKTSLPSQLN-----LDKFSYCLLSHK----FDDTTRTSSLILDNGSSHS---DK 274
           AG GR K +LPSQL        KF++C  S      F D     S + DN S  +   D 
Sbjct: 181 AGLGRTKIALPSQLASAFIFKRKFAFCFSSSDGVIIFGDGPY--SFLADNPSLPNVVFDS 238

Query: 275 KTTGLTYTPFVNNPSVAERNAF-----SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
           K+  LTYTP + N  V+  +AF     SV Y++G++ I + G+ V +    L++D  G G
Sbjct: 239 KS--LTYTPLLIN-HVSTASAFLQGESSVEYFIGVKTIKIDGKVVSLNSSLLSIDNKGVG 295

Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG- 388
           GT + +   +T +   +++ + D FV   V  RN T    +          ++PG   G 
Sbjct: 296 GTKISTVDPYTVLEASIYKAVTDAFVKASVA-RNITTEDSSPPFEFCYSFDNLPGTPLGA 354

Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG----PSIILGNFQMQ 444
           S P ++L  +     ++   N    + +   +CL  V     +GG     SI++G +Q++
Sbjct: 355 SVPTIELLLQNNVIWSMFGANSMVNIND-EVLCLGFV-----NGGVNLRTSIVIGGYQLE 408

Query: 445 NYYVEYDLRNQRLGFKQQL 463
           N  +++DL   RLGF   +
Sbjct: 409 NNLLQFDLAASRLGFSNTI 427


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 111/403 (27%), Positives = 164/403 (40%), Gaps = 76/403 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y      G PPQ    ++DTGS LVW  C+   + K C+   +P +    SS+   + C 
Sbjct: 90  YVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLR-KVCARQALPYYNSSASSTFAPVPCA 148

Query: 149 NPKCSWIHHESIQCRDCNDEPL---ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
              C+            ND+ +     +  C+ I       YG+G+  G   +E     +
Sbjct: 149 ARICA-----------ANDDIIHFCDLAAGCSVIAG-----YGAGVVAGTLGTEAFAFQS 192

Query: 206 -------------RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
                        RI+   L G S        G+ G GRG+ SL SQ    KFSYCL  +
Sbjct: 193 GTAELAFGCVTFTRIVQGALHGAS--------GLIGLGRGRLSLVSQTGATKFSYCLTPY 244

Query: 253 KFDDTTRTSSLILDNGSS---HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
            F +   T  L +   +S   H D  T     T FV  P        S +YY+ L  +TV
Sbjct: 245 -FHNNGATGHLFVGASASLGGHGDVMT-----TQFVKGPK------GSPFYYLPLIGLTV 292

Query: 310 GGQRVRVWHKYLTLDRDG----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
           G  R+ +      L        +GG I+DSG+ FT +  + ++ LA E  +++       
Sbjct: 293 GETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARL------N 346

Query: 366 RALGAEALTGLRPCFDVPGEKTGS-FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
            +L A           V     G   P +  HF+GGA++ +P E+Y+A V +        
Sbjct: 347 GSLVAPPPDADDGALCVARRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDK------AA 400

Query: 425 VTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                AS GP     ++GN+Q QN  V YDL N    F+   C
Sbjct: 401 ACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 443


>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
          Length = 454

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 163/411 (39%), Gaps = 66/411 (16%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIP---------KLSS 140
           ++ ++ G PPQ +  +LDTGS L W           C+ S++PS  P           SS
Sbjct: 63  TVPVAVGAPPQNVTMVLDTGSELSWL---------RCNGSRVPSTPPPQAPAAFNGSASS 113

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           +     C +P+C W      + RD    P       +  C   L    +   +GI  ++T
Sbjct: 114 TYAAAHCSSPECQW------RGRDLPVPPFCAGPP-SNSCRVSLSYADASSADGILAADT 166

Query: 201 LNLPNRIIPNFLVGC----------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
             L        L GC          +   S    G+ G  RG  S  +Q    +F+YC+ 
Sbjct: 167 FLLGGAPPVRALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCIA 226

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRIT 308
                       L++  G   +      L YTP +    ++    +   V Y V L  I 
Sbjct: 227 PGD------GPGLLVLGGDGAALAPQ--LNYTPLIQ---ISRPLPYFDRVAYSVQLEGIR 275

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VG   + +    L  D  G G T+VDSGT FTF+  + + PL  EF++Q          L
Sbjct: 276 VGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQ---TSALLAPL 332

Query: 369 GAEALT---GLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVV-----GE 416
           G            CF     +  +     PE+ L  + GAEV +  E     V     GE
Sbjct: 333 GESDFVFQGAFDACFRASEARVAAASQMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGE 391

Query: 417 GSAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           G A  +  +T  + + +G  + ++G+   QN +VEYDL+N R+GF    C 
Sbjct: 392 GGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 442


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 162/389 (41%), Gaps = 58/389 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP      + DTGS   W  C       Y    K+  F P  SS+   + 
Sbjct: 176 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKL--FDPVRSSTYANVS 233

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C  P CS ++        C                 Y V YG G  + G    +TL L +
Sbjct: 234 CAAPACSDLNIHGCSGGHC----------------LYGVQYGDGSYSIGFFAMDTLTLSS 277

Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
              +  F  GC   +     + AG+ G GRGKTSLP Q   DK    F++CL +      
Sbjct: 278 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 332

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             T +  LD G+      +  LT TP +  N P+         +YY+G+  I VGGQ + 
Sbjct: 333 --TGTGYLDFGAGSPAAASARLT-TPMLTDNGPT---------FYYIGMTGIRVGGQLLS 380

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +             GTIVDSGT  T + P  +  L     +  +  R Y +   A A++ 
Sbjct: 381 IPQSVFA-----TAGTIVDSGTVITRLPPPAYSSL-RYAFAAAMAARGYKK---APAVSL 431

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
           L  C+D  G    + P + L F+GGA + +            S VCL    + +  GG  
Sbjct: 432 LDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIM-YAASASQVCLAFAANED--GGDV 488

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q++ + V YD+  + +GF   +C
Sbjct: 489 GIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 112/388 (28%), Positives = 165/388 (42%), Gaps = 66/388 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++  G+P      ++DTGS + W  C     C  C S   P F P  SS+     C 
Sbjct: 52  YLITVGLGSPATSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCG 108

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI 207
           +  C+ +  E   C        ++S  C      Y+V YG G  T G   S+TL L +  
Sbjct: 109 SADCAQLGQEGNGC--------SSSSQC-----QYIVTYGDGSSTTGTYSSDTLALGSSA 155

Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTS 261
           + +F  GCS + S    Q  G+ G G G  SL SQ    L + FSYCL        T +S
Sbjct: 156 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL------PPTPSS 209

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           S  L      +     G   + FV  P +   +    +Y V L+ I VGG+++ +     
Sbjct: 210 SGFL------TLGAAGGSGTSGFVKTP-MLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF 262

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
           +       GT++DSGT  T + P  +  L+  F + M   + Y  A  +  L     CFD
Sbjct: 263 S------AGTVMDSGTVITRLPPTAYSALSSAFKAGM---KQYPPAQPSGIL---DTCFD 310

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLP-----VENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             G+ + S P + L F GGA V+L      + N  A  G          +D  + G    
Sbjct: 311 FSGQSSVSIPSVALVFSGGAVVSLDASGIILSNCLAFAGN---------SDDSSLG---- 357

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q + + V YD+    +GF+   C
Sbjct: 358 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 128/457 (28%), Positives = 187/457 (40%), Gaps = 86/457 (18%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           NP       LN+    S++R+  + N  ++T   +             G + +S++ GTP
Sbjct: 42  NPKNTVTDRLNAAFLRSISRSRRLNNILSQTDLQSGLIGAD-------GEFFMSITIGTP 94

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
           P  +  I DTGS L W  C     C+ C     P F  K SS+ +   C +  C   H  
Sbjct: 95  PMKVFAIADTGSDLTWVQCK---PCQQCYKENGPIFDKKKSSTYKSEPCDSRNC---HAL 148

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR-----IIPNFL 212
           S   R C++     SKN   +C  Y   YG    ++G   +ET+++ +        P  +
Sbjct: 149 SSSERGCDE-----SKN---VC-KYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTV 199

Query: 213 VGCSVLSSRQPAGIAGFGRGKT----------------SLPSQLN---LDKFSYCLLSHK 253
            GC            G+  G T                SL SQL      KFSYC LSHK
Sbjct: 200 FGC------------GYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYC-LSHK 246

Query: 254 FDDTTRTSSLILDNGSSHSD-KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
              T  TS + L   S  S   K +G+  TP V+            YYY+ L  I+VG +
Sbjct: 247 SATTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEP-------RTYYYLTLEAISVGKK 299

Query: 313 RVRVWHKYLTLDRDG-----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           ++         +  G     +G  I+DSGTT T +    F+      V ++V      R 
Sbjct: 300 KIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFG-AAVEELVTGAK--RV 356

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
              + L  L  CF     + G  PE+ +HF  GA+V L   N F  V E   VCL++V  
Sbjct: 357 SDPQGL--LSHCFKSGSAEIG-LPEITVHFT-GADVRLSPINAFVKVSE-DMVCLSMVPT 411

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            E +     I GNF   ++ V YDL  + + F++  C
Sbjct: 412 TEVA-----IYGNFAQMDFLVGYDLETRTVSFQRMDC 443


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 112/390 (28%), Positives = 166/390 (42%), Gaps = 61/390 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP----SFIPKLSSSS 142
           G Y +++  GTP +    + DTGS + W       QC+ C  S  P     F P  S+S 
Sbjct: 133 GNYVVTVGLGTPKEDFTLVFDTGSGITW------TQCQPCLGSCYPQKEQKFDPTKSTSY 186

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETL 201
             + C +  C+ +      C   N   L            Y ++YG    ++G   +ETL
Sbjct: 187 NNVSCSSASCNLLPTSERGCSASNSTCL------------YQIIYGDQSYSQGFFATETL 234

Query: 202 NLPNR-IIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
            + +  +  NFL GC   ++    Q AG+ G      SLPSQ       +FSYCL S   
Sbjct: 235 TISSSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPS--- 291

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
              T +S+  L+ G   S  +T G  +TP           AFS +Y + +  I+V G ++
Sbjct: 292 ---TPSSTGYLNFGGKVS--QTAG--FTPI--------SPAFSSFYGIDIVGISVAGSQL 336

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            +     T       G I+DSGT  T + P  ++ L + F  +M    NY +  G E L 
Sbjct: 337 PIDPSIFT-----TSGAIIDSGTVITRLPPTAYKALKEAFDEKM---SNYPKTNGDELL- 387

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
               C+D     T SFP++ + FKGG EV +       +V     VCL    +++ S   
Sbjct: 388 --DTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVCLAFAANKDDS--E 443

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             I GN Q + Y V YD     +GF    C
Sbjct: 444 FGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 109/395 (27%), Positives = 163/395 (41%), Gaps = 68/395 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP +    +LDTGS + W  C     C+ C S   P F P  S+S   +G
Sbjct: 155 GEYFTRIGVGTPTREQYMVLDTGSDVAWIQCE---PCRECYSQADPIFNPSYSASFSTVG 211

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  CS      +   DC+      S  C      Y   YG G  + G   +ETL    
Sbjct: 212 CDSAVCS-----QLDAYDCH------SGGCL-----YEASYGDGSYSTGSFATETLTFGT 255

Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDK---FSYCLLSHKFD 255
             + N  +GC      +  G+        G G G  S P+Q+       FSYCL+  + D
Sbjct: 256 TSVANVAIGCG----HKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRESD 311

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
                SS  L  G         G  +TP   NP +        +YY+ +  I+VGG  + 
Sbjct: 312 -----SSGPLQFGPK---SVPVGSIFTPLEKNPHLP------TFYYLSVTAISVGGALLD 357

Query: 315 RVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVS---QMVKNRNYTRALGA 370
            +  +   +D   G+GG I+DSGT  T +    ++ + D FV+   Q+ +          
Sbjct: 358 SIPPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRT--------- 408

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
           +A++    C+D+ G +  S P +  HF  GA + LP +NY   +      C        A
Sbjct: 409 DAVSIFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAF-----A 463

Query: 431 SGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               S+ I+GN Q Q+  V +D  N  +GF    C
Sbjct: 464 PAASSVSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 110/395 (27%), Positives = 165/395 (41%), Gaps = 45/395 (11%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI---PSFIPKLSSSSRLLG 146
           ++SL+ GTPPQ +  +LDTGS L W  C    Q    + +      SF P+ S++   + 
Sbjct: 64  TVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVP 123

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C + +CS         RD    P  +    ++ C   L       ++G   ++   +   
Sbjct: 124 CGSTQCS--------SRDLPAPP--SCDGASRQCHVSLSYADGSASDGALATDVFAVGEA 173

Query: 207 IIPNFLVGC-SVLSSRQPAGIA-----GFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
                  GC S      P G+A     G  RG  S  +Q +  +FSYC+      D    
Sbjct: 174 PPLRSAFGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCI-----SDRDDA 228

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
             L+L     HSD     L YTP    P++       V Y V L  I VGG+ + +    
Sbjct: 229 GVLLL----GHSDLPFLPLNYTPLYQ-PTLPLPYFDRVAYSVQLLGIRVGGKALPIPASV 283

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLR 377
           L  D  G G T+VDSGT FTF+  + +  L  EF+ Q    +   RAL   +      L 
Sbjct: 284 LAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQ---TKPLLRALDDPSFAFQEALD 340

Query: 378 PCFDVPGEK---TGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGSAV----CLTVVTDRE 429
            CF VP  +   +   P + L F  GAE+++  +   + V GE        CLT   + +
Sbjct: 341 TCFRVPAGRPPPSARLPPVTLLFN-GAEMSVAGDRLLYKVPGEHRGADGVWCLT-FGNAD 398

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                + ++G+    N +VEYDL   R+G     C
Sbjct: 399 MVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKC 433


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 111/419 (26%), Positives = 169/419 (40%), Gaps = 67/419 (15%)

Query: 64  NPQTKTTTTTTTTTTTNISSHS-----YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT 118
           + + K    TTT     +  HS      G Y   +  G+P Q    ++DTGS   W  C+
Sbjct: 83  DSRRKGFEMTTTPAEVEMPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNCS 142

Query: 119 NHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ 178
             ++   C+S K    + +L S S    C  P    ++       D +    +++K    
Sbjct: 143 KSFEAVTCASRKCKVDLSELFSLSV---CPKPSDPCLY-------DISYADGSSAKG--- 189

Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCS------VLSSRQPAGIAGFGRG 232
                   +G   T+ I +  T N     + N  +GC+      V  + +  GI G G  
Sbjct: 190 -------FFG---TDSITVGLT-NGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFA 238

Query: 233 KTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPS 289
           K S   +       KFSYCL+ H    + R+ S  L  G  H+ K    +  T  +    
Sbjct: 239 KDSFIDKAANKYGAKFSYCLVDHL---SHRSVSSNLTIGGHHNAKLLGEIRRTELI---- 291

Query: 290 VAERNAFSVYYYVGLRRITVGGQRVR----VWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
                 F  +Y V +  I++GGQ ++    VW      D +  GGT++DSGTT T +   
Sbjct: 292 -----LFPPFYGVNVVGISIGGQMLKIPPQVW------DFNAEGGTLIDSGTTLTSLLLP 340

Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
            +E + +     + K +  T     E    L  CFD  G      P L  HF GGA    
Sbjct: 341 AYEAVFEALTKSLTKVKRVT----GEDFDALEFCFDAEGFDDSVVPRLVFHFAGGARFEP 396

Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           PV++Y   V      C+ +V   +  GG S+I GN   QN+  E+DL    +GF    C
Sbjct: 397 PVKSYIIDVAP-LVKCIGIVP-IDGIGGASVI-GNIMQQNHLWEFDLSTNTVGFAPSTC 452


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 109/391 (27%), Positives = 161/391 (41%), Gaps = 63/391 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +S+  GTP + +  + DTGS L W  C     C  C     P F P  S++   + C 
Sbjct: 138 YIVSVGLGTPKRDLLVVFDTGSDLSWVQCK---PCDGCYQQHDPLFDPSQSTTYSAVPCG 194

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL---- 203
             +C  +   S     C                 Y V+YG    T+G    +TL L    
Sbjct: 195 AQECRRLDSGSCSSGKCR----------------YEVVYGDMSQTDGNLARDTLTLGPSS 238

Query: 204 ---PNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
               +  +  F+ GC    +    +  G+ G GR + SL SQ        FSYCL S   
Sbjct: 239 SSSSSDQLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSS-- 296

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             +T    L L + +  + + T  +T +   + PS         +YY+ L  I V G+ V
Sbjct: 297 --STAEGYLSLGSAAPPNARFTAMVTRS---DTPS---------FYYLNLVGIKVAGRTV 342

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
           RV             GT++DSGT  T +    +  L   F   M +  +Y RA    AL+
Sbjct: 343 RVSPAVFR-----TPGTVIDSGTVITRLPSRAYAALRSSFAGLM-RRYSYKRA---PALS 393

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
            L  C+D  G      P + L F GGA + L       V  + S  CL   ++ + +   
Sbjct: 394 ILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLYVANK-SQACLAFASNGDDT--- 449

Query: 435 SI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           SI ILGN Q + + V YD+ NQ++GF  + C
Sbjct: 450 SIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480


>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
 gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
          Length = 452

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 162/411 (39%), Gaps = 66/411 (16%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIP---------KLSS 140
           ++ ++ G PPQ +  +LDTGS L W           C+ S++PS  P           SS
Sbjct: 61  TVPVAVGAPPQNVTMVLDTGSELSWL---------RCNGSRVPSTPPPQAPAAFNGSASS 111

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           +     C +P+C W      + RD    P          C   L    +   +GI  ++T
Sbjct: 112 TYAAAHCSSPECQW------RGRDLPVPPFCAGPPSXS-CRVSLSYADASSADGILAADT 164

Query: 201 LNLPNRIIPNFLVGC----------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
             L        L GC          +   S    G+ G  RG  S  +Q    +F+YC+ 
Sbjct: 165 FLLGGAPPVXALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCIA 224

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRIT 308
                       L++  G   +      L YTP +    ++    +   V Y V L  I 
Sbjct: 225 PGD------GPGLLVLGGDGAALAPQ--LNYTPLIQ---ISRPLPYFDRVAYSVQLEGIR 273

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VG   + +    L  D  G G T+VDSGT FTF+  + + PL  EF++Q          L
Sbjct: 274 VGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQ---TSALLAPL 330

Query: 369 GAEALT---GLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVV-----GE 416
           G            CF     +  +     PE+ L  + GAEV +  E     V     GE
Sbjct: 331 GESDFVFQGAFDACFRASEARVAAASXMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGE 389

Query: 417 GSAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           G A  +  +T  + + +G  + ++G+   QN +VEYDL+N R+GF    C 
Sbjct: 390 GGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 440


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 115/417 (27%), Positives = 169/417 (40%), Gaps = 67/417 (16%)

Query: 71  TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS- 129
           T  T   +  ++ HS   Y +++  GTP +    + DTGS L W       QCK C+ S 
Sbjct: 109 TAATIPASLGLAFHSLE-YVVTIGIGTPARNFTVLFDTGSDLTWV------QCKPCTDSC 161

Query: 130 ---KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
              + P F P  SS+   + C  P+C     + + C     E              Y V 
Sbjct: 162 YQQQEPLFDPSKSSTYVDVPCGTPQCKIGGGQDLTCGGTTCE--------------YSVK 207

Query: 187 YGS-GLTEGIALSETLNLPNRIIP--NFLVGCS------VLSSRQP---AGIAGFGRGKT 234
           YG   +T G    E   L     P    + GCS      V  + +    AG+ G GRG +
Sbjct: 208 YGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDS 267

Query: 235 SLPSQLNL----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           S+ SQ       D FSYCL          +S+  L  G++   +    L++TP V + S 
Sbjct: 268 SILSQTRRGNSGDVFSYCLPPRG------SSAGYLTIGAAAPPQSN--LSFTPLVTDNS- 318

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
                 S  Y V L  I+V G  + +      +      GT++DSGT  T M    +  L
Sbjct: 319 ----QLSSVYVVNLVGISVSGAALPIDASAFYI------GTVIDSGTVITHMPAAAYYVL 368

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN- 409
            DEF   M     YT  L    +  L  C+DV G    + P + L F GGA + +     
Sbjct: 369 RDEFRRHM---GGYTM-LPEGHVESLDTCYDVTGHDVVTAPPVALEFGGGARIDVDASGI 424

Query: 410 --YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              FAV   G ++ L  +     +    +I+GN Q + Y V +D+  +R+GF    C
Sbjct: 425 LLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGFGANGC 481


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 109/409 (26%), Positives = 156/409 (38%), Gaps = 66/409 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + L  GTPP      +DT S L+W  C     C  C     P F P++SS+   L 
Sbjct: 87  GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ---PCTGCYHQVDPMFNPRVSSTYAALP 143

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +  C  +     +C   +DE           C       G+  TEG    + L +   
Sbjct: 144 CSSDTCDELDVH--RCGHDDDES----------CQYTYTYSGNATTEGTLAVDKLVIGED 191

Query: 207 IIPNFLVGCSVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
                  GCS  S+      Q +G+ G GRG  SL SQL++ +F+YCL        +R  
Sbjct: 192 AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPP----ASRIP 247

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
             ++    + + +  T     P   +P       +  YYY+ L  + +G + + +     
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPR------YPSYYYLNLDGLLIGDRAMSLPPTTT 301

Query: 322 TLDR----------------------DGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           T                         D N  G I+D  +T TF+   L+    DE V+ +
Sbjct: 302 TTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDL 357

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVG 415
                  R  G+    GL  CF +P          P + L F  G  + L     FA   
Sbjct: 358 EVEIRLPRGTGSS--LGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDR 414

Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E   +CL V     A  G   ILGNFQ QN  V Y+LR  R+ F Q  C
Sbjct: 415 ESGMMCLMV---GRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 112/388 (28%), Positives = 165/388 (42%), Gaps = 66/388 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++  G+P      ++DTGS + W  C     C  C S   P F P  SS+     C 
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCG 254

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI 207
           +  C+ +  E   C        ++S  C      Y+V YG G  T G   S+TL L +  
Sbjct: 255 SADCAQLGQEGNGC--------SSSSQC-----QYIVTYGDGSSTTGTYSSDTLALGSSA 301

Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTS 261
           + +F  GCS + S    Q  G+ G G G  SL SQ    L + FSYCL        T +S
Sbjct: 302 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL------PPTPSS 355

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           S  L      +     G   + FV  P +   +    +Y V L+ I VGG+++ +     
Sbjct: 356 SGFL------TLGAAGGSGTSGFVKTP-MLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF 408

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
           +       GT++DSGT  T + P  +  L+  F + M   + Y  A  +  L     CFD
Sbjct: 409 S------AGTVMDSGTVITRLPPTAYSALSSAFKAGM---KQYPPAQPSGILD---TCFD 456

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLP-----VENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             G+ + S P + L F GGA V+L      + N  A  G          +D  + G    
Sbjct: 457 FSGQSSVSIPSVALVFSGGAVVSLDASGIILSNCLAFAGN---------SDDSSLG---- 503

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q + + V YD+    +GF+   C
Sbjct: 504 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 531


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 125/482 (25%), Positives = 202/482 (41%), Gaps = 75/482 (15%)

Query: 5   ISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRAL 60
           I +L +  IF  +   +  ++     F++   H +    P  +  +N    V+ +L R++
Sbjct: 4   IFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSI 63

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW---FPC 117
                 +  T   T T    I ++  G Y + LS GTPP  I  + DTGS ++W    PC
Sbjct: 64  ------SHNTGLVTNTVEAPIYNNR-GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPC 116

Query: 118 TNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT 177
           TN YQ        +P F P  S++ R + C +P CS+   ++     C+ +P     +CT
Sbjct: 117 TNCYQ------QDLPMFNPSKSTTYRKVSCSSPVCSFTGEDN----SCSFKP-----DCT 161

Query: 178 QICPSYLVLYG-SGLTEGIALSETLNL---PNRII--PNFLVGCSVLSS----RQPAGIA 227
                Y + YG +  ++G    +TL +     R++  P   +GC   ++       +GI 
Sbjct: 162 -----YSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIV 216

Query: 228 GFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
           G G G  SL  Q+      KFSYCL     DD          N  S+++   +G   TP 
Sbjct: 217 GLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKL----NFGSNANVSGSGAVSTP- 271

Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
                +   + F  +Y + L+ ++VG  R   ++        G    I+DSGTT T +  
Sbjct: 272 -----IYISDKFKSFYSLKLKAVSVG--RNNTFYSTANSILGGKANIIIDSGTTLTLLPV 324

Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVT 404
           +L+      F   +  + N  R         L  CF+   +     P + +HF+ GA + 
Sbjct: 325 DLYH----NFAKAISNSINLQRTDDPNQF--LEYCFETTTDDY-KVPFIAMHFE-GANLR 376

Query: 405 LPVENYFAVVGEGSAVCLTV--VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
           L  EN    V + + +CL      D + S     I GN    N+ V YD+ N  L FK  
Sbjct: 377 LQRENVLIRVSD-NVICLAFAGAQDNDIS-----IYGNIAQINFLVGYDVTNMSLSFKPM 430

Query: 463 LC 464
            C
Sbjct: 431 NC 432


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 157/386 (40%), Gaps = 53/386 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  SL  GTP   +   LDTGS   W  C     C  C       F P  SS+   + C 
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCK---PCPDCYEQHEALFDPSKSSTYSDITCS 190

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCT--QICPSYLVLYGSGLTEGIALSETLNL-PN 205
           +             R+C +   +   NC+  + CP  +       T G    +TL L P 
Sbjct: 191 S-------------RECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPT 237

Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
             +P F+ GC   +  S  +  G+ G GRGK SL SQ+       FSYCL S     +  
Sbjct: 238 DAVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPS-----SPS 292

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            +  +  +G++ +       T      +PS         +YY+ L  ITV G+ ++V   
Sbjct: 293 ATGYLSFSGAAAAAPTNAQFTEMVAGQHPS---------FYYLNLTGITVAGRAIKVPPS 343

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
                     GTI+DSGT F+ + P  +  L     S M +   Y RA    + T    C
Sbjct: 344 VFAT----AAGTIIDSGTAFSCLPPSAYAALRSSVRSAMGR---YKRA---PSSTIFDTC 393

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD-REASGGPSIIL 438
           +D+ G +T   P + L F  GA V L            S  CL  + +  + S G   +L
Sbjct: 394 YDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLG---VL 450

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q +   V YD+ NQ++GF    C
Sbjct: 451 GNTQQRTLAVIYDVDNQKVGFGANGC 476


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 109/409 (26%), Positives = 156/409 (38%), Gaps = 66/409 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + L  GTPP      +DT S L+W  C     C  C     P F P++SS+   L 
Sbjct: 87  GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ---PCTGCYHQVDPMFNPRVSSTYAALP 143

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +  C  +     +C   +DE           C       G+  TEG    + L +   
Sbjct: 144 CSSDTCDELDVH--RCGHDDDES----------CQYTYTYSGNATTEGTLAVDKLVIGED 191

Query: 207 IIPNFLVGCSVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
                  GCS  S+      Q +G+ G GRG  SL SQL++ +F+YCL        +R  
Sbjct: 192 AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPP----ASRIP 247

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
             ++    + + +  T     P   +P       +  YYY+ L  + +G + + +     
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPR------YPSYYYLNLDGLLIGDRTMSLPPTTT 301

Query: 322 TLDR----------------------DGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           T                         D N  G I+D  +T TF+   L+    DE V+ +
Sbjct: 302 TTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDL 357

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVG 415
                  R  G+    GL  CF +P          P + L F  G  + L     FA   
Sbjct: 358 EVEIRLPRGTGSS--LGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDR 414

Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E   +CL V     A  G   ILGNFQ QN  V Y+LR  R+ F Q  C
Sbjct: 415 ESGMMCLMV---GRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 157/388 (40%), Gaps = 50/388 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G+P +     LDTGS + W  C     C  C S   P + P  SSS R + 
Sbjct: 43  GEYFARMGIGSPQRSYYLELDTGSDVTWIQCA---PCSSCYSQVDPIYDPSNSSSYRRVY 99

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
           C +  C  + + + Q   C                SY V+YG S  + G    E+  L P
Sbjct: 100 CGSALCQALDYSACQGMGC----------------SYRVVYGDSSASSGDLGIESFYLGP 143

Query: 205 NR--IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDD 256
           N    + N   GC   +S   R  AG+ G G G  S  SQ+       FSYCL+      
Sbjct: 144 NSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQL 203

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            +R+S LI    +     +     +TP + NP +        +YY  L  I+VGG  + +
Sbjct: 204 QSRSSPLIFGRTAIPFAAR-----FTPLLKNPRI------DTFYYAILTGISVGGTALPI 252

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                 L  +G GG I+DSGT+ T + P  +  L D +      +RN   A G   L   
Sbjct: 253 PPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAY---RAASRNLPPAPGVYLLD-- 307

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF+  G  T   P L LHF    ++ LP  N    V      CL        S  P  
Sbjct: 308 -TCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAP----SSMPIS 362

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ++GN Q Q + + +DL+   +    + C
Sbjct: 363 VIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|383125861|gb|AFG43521.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
          Length = 134

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 55/141 (39%), Positives = 83/141 (58%), Gaps = 7/141 (4%)

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
           +FD+  + S ++L +    +      L YTPF+ N      + + VYYY+GLR +++GG+
Sbjct: 1   RFDEENQKSLMVLGD---KAFPNGIPLNYTPFLTNYRAPPSSQYGVYYYIGLRAVSIGGK 57

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+++  K L  D  GNGGTI+DSGTTFT    E+F+ +A  F SQ+     Y RA+  EA
Sbjct: 58  RMKLPSKLLRFDAKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI----EYRRAVDVEA 113

Query: 373 LTGLRPCFDVPGEKTGSFPEL 393
           LTG+  C++V G +    PE 
Sbjct: 114 LTGMGLCYNVSGLENIVLPEF 134


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 166/385 (43%), Gaps = 54/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTP Q +   +DT +   W PC+    C  C +S    F P  S+S R + C 
Sbjct: 54  YVVRARLGTPAQQLLLAVDTSNDAAWIPCSG---CAGCPTSS--PFNPAASASYRPVPCG 108

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P+C    + S     C+      +K+C      + + Y     +     +TL +   ++
Sbjct: 109 SPQCVLAPNPS-----CSPN----AKSC-----GFSLSYADSSLQAALSQDTLAVAGDVV 154

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
             +  GC   +  ++  P G+ G GRG  S  SQ   +    FSYCL S K   F  T R
Sbjct: 155 KAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLR 214

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
               +  NG     K T      P + NP    R++    YYV +  I VG + V +   
Sbjct: 215 ----LGRNGQPRRIKTT------PLLANP---HRSSL---YYVNMTGIRVGKKVVSIPAS 258

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
            L  D     GT++DSGT FT +   ++  L DE     V+ R    A    +L G   C
Sbjct: 259 ALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDE-----VRRRVGAGAAAVSSLGGFDTC 313

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           ++     T ++P + L F G  +VTLP EN       G+  CL +    +       ++ 
Sbjct: 314 YNT----TVAWPPVTLLFDG-MQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIA 368

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           + Q QN+ V +D+ N R+GF ++ C
Sbjct: 369 SMQQQNHRVLFDVPNGRVGFARESC 393


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 100/385 (25%), Positives = 153/385 (39%), Gaps = 69/385 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ++D+GS +VW  C    QC + S    P F P  S+S   + 
Sbjct: 199 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSD---PVFDPADSASFTGVS 255

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  + +       C                 Y V YG G  T+G    ETL    
Sbjct: 256 CSSSVCDRLENAGCHAGRCR----------------YEVSYGDGSYTKGTLALETLTFGR 299

Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
            ++ +  +GC   +       AG+ G G G  S   QL       FSYCL+S        
Sbjct: 300 TMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSA------- 352

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
                                + P V NP          +YY+GL  + VGG RV +  +
Sbjct: 353 --------------------AWVPLVRNPRAPS------FYYIGLAGLGVGGIRVPISEE 386

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L   G+GG ++D+GT  T +    ++   D F++Q     N  RA G         C
Sbjct: 387 VFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTA---NLPRATGVAIFD---TC 440

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F GG  +TLP  N+   + +    C        ++ G S ILG
Sbjct: 441 YDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFA---PSTSGLS-ILG 496

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   + +D  N  +GF   +C
Sbjct: 497 NIQQEGIQISFDGANGYVGFGPNIC 521


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 115/420 (27%), Positives = 173/420 (41%), Gaps = 57/420 (13%)

Query: 59  ALHIKNPQTKTTTTTTTTTTTNI-----SSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
           ++H K  +  TT   + + +T++     S+   G Y +++  GTP   +  I DTGS L 
Sbjct: 98  SIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 157

Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
           W  C      + C   K P F P  S+S   + C +  C  +   +     C      ++
Sbjct: 158 WTQC--QPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSC------SA 209

Query: 174 KNCTQICPSYLVLYGS-GLTEGIALSETLNL-PNRIIPNFLVGCSVLSS---RQPAGIAG 228
            NC      Y + YG    + G    +   L  + +      GC   +       AG+ G
Sbjct: 210 SNCI-----YGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLG 264

Query: 229 FGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDN-GSSHSDKKTTGLTYTPF 284
            GR K S PSQ        FSYCL S      + T  L   + G S S K      +TP 
Sbjct: 265 LGRDKLSFPSQTATAYNKIFSYCLPS----SASYTGHLTFGSAGISRSVK------FTPI 314

Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
               ++ +  +F   Y + +  ITVGGQ++ +     +       G ++DSGT  T + P
Sbjct: 315 ---STITDGTSF---YGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPP 363

Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVT 404
           + +  L   F ++M K   Y    G      L  CFD+ G KT + P++   F GGA V 
Sbjct: 364 KAYAALRSSFKAKMSK---YPTTSGVSI---LDTCFDLSGFKTVTIPKVAFSFSGGAVVE 417

Query: 405 LPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           L  +  F    + S VCL    + + S   + I GN Q Q   V YD    R+GF    C
Sbjct: 418 LGSKGIFYAF-KISQVCLAFAGNSDDSN--AAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 474


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 121/418 (28%), Positives = 178/418 (42%), Gaps = 57/418 (13%)

Query: 63  KNPQTKTTTTTTTTTTTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
           KN   + T     +TT    S S  G   Y + +  GTP + +  + DTGS L W     
Sbjct: 17  KNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTGSDLTW----- 71

Query: 120 HYQCKYCSSS----KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
             QC+ C+ S    +   F P  SSS   + C +  C+ +  + I+  +C+    +T  +
Sbjct: 72  -TQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIK-SECSS---STDAS 126

Query: 176 CTQICPSYLVLYGSGLTE-GIALSETLNL-PNRIIPNFLVGCSVLSS---RQPAGIAGFG 230
           C      Y   YG   T  G    E L +    I+ +FL GC   +       AG+ G G
Sbjct: 127 CI-----YDAKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFNGSAGLMGLG 181

Query: 231 RGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
           R   S+  Q   N +K FSYCL +      T +S   L  G+S +   +  L YTP    
Sbjct: 182 RHPISIVQQTSSNYNKIFSYCLPA------TSSSLGHLTFGASAATNAS--LIYTPL--- 230

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
            +++  N+F   Y + +  I+VGG ++      ++      GG+I+DSGT  T +AP ++
Sbjct: 231 STISGDNSF---YGLDIVSISVGGTKLPA----VSSSTFSAGGSIIDSGTVITRLAPTVY 283

Query: 348 EPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPV 407
             L   F   M K   Y  A  A     L  C+D+ G K  S P +   F GG  V L  
Sbjct: 284 AALRSAFRRXMEK---YPVANEAGL---LDTCYDLSGYKEISVPRIDFEFSGGVTVELXH 337

Query: 408 ENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                V  E   VCL    +   S     + GN Q +   V YD++  R+GF    CK
Sbjct: 338 RGILXVESE-QQVCLAFAAN--GSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGCK 392


>gi|383125857|gb|AFG43519.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125863|gb|AFG43522.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125867|gb|AFG43524.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125869|gb|AFG43525.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125871|gb|AFG43526.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125873|gb|AFG43527.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125877|gb|AFG43529.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
          Length = 134

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 55/141 (39%), Positives = 84/141 (59%), Gaps = 7/141 (4%)

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
           +FD+  + S ++L + +  +      L YTPF+ N      + + VYYY+GLR +++GG+
Sbjct: 1   RFDEENQKSLMVLGDKAFPTG---IPLNYTPFLTNYRAPPSSQYGVYYYIGLRAVSIGGK 57

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+++  K L  D  GNGGTI+DSGTTFT    E+F+ +A  F SQ+     Y RA+  EA
Sbjct: 58  RMKLPSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI----EYRRAVDVEA 113

Query: 373 LTGLRPCFDVPGEKTGSFPEL 393
           LTG+  C++V G +    PE 
Sbjct: 114 LTGMGLCYNVSGLENIVLPEF 134


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 166/385 (43%), Gaps = 54/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTP Q +   +DT +   W PC+    C  C +S    F P  S+S R + C 
Sbjct: 107 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSG---CAGCPTSS--PFNPAASASYRPVPCG 161

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P+C    + S     C+      +K+C      + + Y     +     +TL +   ++
Sbjct: 162 SPQCVLAPNPS-----CSPN----AKSC-----GFSLSYADSSLQAALSQDTLAVAGDVV 207

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
             +  GC   +  ++  P G+ G GRG  S  SQ   +    FSYCL S K   F  T R
Sbjct: 208 KAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLR 267

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
               +  NG     K T      P + NP    R++    YYV +  I VG + V +   
Sbjct: 268 ----LGRNGQPRRIKTT------PLLANP---HRSSL---YYVNMTGIRVGKKVVSIPAS 311

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
            L  D     GT++DSGT FT +   ++  L DE     V+ R    A    +L G   C
Sbjct: 312 ALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDE-----VRRRVGAGAAAVSSLGGFDTC 366

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           ++     T ++P + L F G  +VTLP EN       G+  CL +    +       ++ 
Sbjct: 367 YNT----TVAWPPVTLLFDG-MQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIA 421

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           + Q QN+ V +D+ N R+GF ++ C
Sbjct: 422 SMQQQNHRVLFDVPNGRVGFARESC 446


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 125/482 (25%), Positives = 202/482 (41%), Gaps = 75/482 (15%)

Query: 5   ISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRAL 60
           I +L +  IF  +   +  ++     F++   H +    P  +  +N    V+ +L R++
Sbjct: 4   IFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSI 63

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF---PC 117
                 +  T   T T    I ++  G Y + LS GTPP  I  + DTGS ++W    PC
Sbjct: 64  ------SHNTGLVTNTVEAPIYNNR-GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPC 116

Query: 118 TNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT 177
           TN YQ        +P F P  S++ R + C +P CS+   ++     C+ +P     +CT
Sbjct: 117 TNCYQ------QDLPMFNPSKSTTYRKVSCSSPVCSFTGEDN----SCSFKP-----DCT 161

Query: 178 QICPSYLVLYG-SGLTEGIALSETLNL---PNRII--PNFLVGCSVLSS----RQPAGIA 227
                Y + YG +  ++G    +TL +     R++  P   +GC   ++       +GI 
Sbjct: 162 -----YSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIV 216

Query: 228 GFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
           G G G  SL  Q+      KFSYCL     DD          N  S+++   +G   TP 
Sbjct: 217 GLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKL----NFGSNANVSGSGAVSTP- 271

Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
                +   + F  +Y + L+ ++VG  R   ++        G    I+DSGTT T +  
Sbjct: 272 -----IYISDKFKSFYSLKLKAVSVG--RNNTFYSTANSILGGKANIIIDSGTTLTLLPV 324

Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVT 404
           +L+      F   +  + N  R         L  CF+   +     P + +HF+ GA + 
Sbjct: 325 DLYH----NFAKAISNSINLQRTDDPNQF--LEYCFETTTDDY-KVPFIAMHFE-GANLR 376

Query: 405 LPVENYFAVVGEGSAVCLTV--VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
           L  EN    V + + +CL      D + S     I GN    N+ V YD+ N  L FK  
Sbjct: 377 LQRENVLIRVSD-NVICLAFAGAQDNDIS-----IYGNIAQINFLVGYDVTNMSLSFKPM 430

Query: 463 LC 464
            C
Sbjct: 431 NC 432


>gi|359806276|ref|NP_001241217.1| uncharacterized protein LOC100818868 precursor [Glycine max]
 gi|255644718|gb|ACU22861.1| unknown [Glycine max]
          Length = 450

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 109/422 (25%), Positives = 180/422 (42%), Gaps = 81/422 (19%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           YS S+  GTPP  +  ++D     +WF C N Y                 SS+   + C 
Sbjct: 50  YSTSIDMGTPPLTLDLVIDIRERFLWFECGNDYN----------------SSTYYPVRCG 93

Query: 149 NPKCSWIHHESIQCRDCNDEPLAT--SKNCTQICP--SYLVLYGSG-----------LTE 193
             KC     +   C  C + PL T  + N   + P   +   + SG            T 
Sbjct: 94  TKKCK--KAKGTACITCTNHPLKTGCTNNTCGVDPFNPFGEFFVSGDVGEDILSSLHSTS 151

Query: 194 GIALSETLNLPNRI----------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQL--- 240
           G     TL++P  +          +  FL G +    +   G+ G  R   SLP+QL   
Sbjct: 152 GARAPSTLHVPRFVSTCVYPDKFGVEGFLQGLA----KGKKGVLGLARTAISLPTQLAAK 207

Query: 241 -NLD-KFSYCLLS-HKFDDTTRTSSLILDNGSSH--SDKKTTGLTYTPFVNNPS----VA 291
            NL+ KF+ CL S  K++   +   L +  G  +      +  L+YTP + NP     + 
Sbjct: 208 YNLEPKFALCLPSTSKYN---KLGDLFVGGGPYYLPPHDASKFLSYTPILTNPQSTGPIF 264

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
           + +  S  Y++ ++ I + G+ V V    L++DR GNGG  + +   +T     +++PL 
Sbjct: 265 DADP-SSEYFIDVKSIKLDGKIVNVNTSLLSIDRQGNGGCKLSTVVPYTKFHTSIYQPLV 323

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRP---CFD---VPGEKTG-SFPELKLHFKGGAEVT 404
           ++FV Q    +        + +T + P   CFD   +    TG + P + L  KGG +  
Sbjct: 324 NDFVKQAALRK-------IKRVTSVAPFGACFDSRTIGKTVTGPNVPTIDLVLKGGVQWR 376

Query: 405 LPVENYFAVVGEGSAVCLTVVTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQ 461
           +   N    V + + +CL  V      G P   SI++G +QM++  +E+DL + +LGF  
Sbjct: 377 IYGANSMVKVSK-NVLCLGFVDGGLEPGSPIATSIVIGGYQMEDNLLEFDLVSSKLGFSS 435

Query: 462 QL 463
            L
Sbjct: 436 SL 437


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 110/404 (27%), Positives = 174/404 (43%), Gaps = 53/404 (13%)

Query: 71  TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSS 129
            +  T  + + S+   G Y +++  G+P + + FI DTGS L W  C     C  YC   
Sbjct: 129 ASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCE---PCVGYCYQQ 185

Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
           +   F P  S S   + C +P C  +   +       + P  +S  C      Y + YG 
Sbjct: 186 REHIFDPSTSLSYSNVSCDSPSCEKLESAT------GNSPGCSSSTCL-----YGIRYGD 234

Query: 190 G-LTEGIALSETLNLPN-RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL-- 242
           G  + G    E L+L +  +  NF  GC   +       AG+ G  R   SL SQ     
Sbjct: 235 GSYSIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKY 294

Query: 243 -DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNA-FSVYY 300
              FSYCL       ++ +S+  L  GS   D K   + +TP       +E N+ +  +Y
Sbjct: 295 GKVFSYCL------PSSSSSTGYLSFGSGDGDSKA--VKFTP-------SEVNSDYPSFY 339

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
           ++ +  I+VG +++ +     +       GTI+DSGT  + + P ++  +   F   M  
Sbjct: 340 FLDMVGISVGERKLPIPKSVFS-----TAGTIIDSGTVISRLPPTVYSSVQKVFRELM-- 392

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
             +Y R  G   L     C+D+   KT   P++ L+F GGAE+ L  E    V+ + S V
Sbjct: 393 -SDYPRVKGVSILD---TCYDLSKYKTVKVPKIILYFSGGAEMDLAPEGIIYVL-KVSQV 447

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           CL    + +       I+GN Q +  +V YD    R+GF    C
Sbjct: 448 CLAFAGNSDDD--EVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 489


>gi|361067987|gb|AEW08305.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125859|gb|AFG43520.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125865|gb|AFG43523.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125875|gb|AFG43528.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
          Length = 134

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 55/141 (39%), Positives = 83/141 (58%), Gaps = 7/141 (4%)

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
           +FD+  + S ++L +    +      L YTPF+ N      + + VYYY+GLR +++GG+
Sbjct: 1   RFDEENQKSLMVLGD---KAFPNGIPLNYTPFLTNYRAPPSSQYGVYYYIGLRAVSIGGK 57

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+++  K L  D  GNGGTI+DSGTTFT    E+F+ +A  F SQ+     Y RA+  EA
Sbjct: 58  RMKLPSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI----EYRRAVDVEA 113

Query: 373 LTGLRPCFDVPGEKTGSFPEL 393
           LTG+  C++V G +    PE 
Sbjct: 114 LTGMGLCYNVSGLENIVLPEF 134


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 155/385 (40%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ++D+GS +VW  C     CK C     P F P  S S   + 
Sbjct: 129 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ---PCKLCYKQSDPVFDPAKSGSYTGVS 185

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  I +       C                 Y V+YG G  T+G    ETL    
Sbjct: 186 CGSSVCDRIENSGCHSGGCR----------------YEVMYGDGSYTKGTLALETLTFAK 229

Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
            ++ N  +GC   +       AG+ G G G  S   QL+      F YCL+S   D T  
Sbjct: 230 TVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST-- 287

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
             SL+        +    G ++ P V NP          +YYVGL+ + VGG R+ +   
Sbjct: 288 -GSLVFGR-----EALPVGASWVPLVRNPRAPS------FYYVGLKGLGVGGVRIPLPDG 335

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L   G+GG ++D+GT  T +    +    D F SQ     N  RA G         C
Sbjct: 336 VFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTA---NLPRASGVSIFD---TC 389

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F  G  +TLP  N+   V +    C        AS     I+G
Sbjct: 390 YDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFA----ASPTGLSIIG 445

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   V +D  N  +GF   +C
Sbjct: 446 NIQQEGIQVSFDGANGFVGFGPNVC 470


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 121/425 (28%), Positives = 181/425 (42%), Gaps = 60/425 (14%)

Query: 52  VSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
           + S L++ L  +N   +  +TT    +  +   +   Y + +  GTP + +  I DTGS+
Sbjct: 105 IQSRLSKNLGGENRVKELDSTTLPAKSGRLIGSA--DYYVVVGLGTPKRDLSLIFDTGSY 162

Query: 112 LVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCND 167
           L W       QC+ C+ S    + P F P  SSS   + C +  C+     S  C     
Sbjct: 163 LTW------TQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLCTQF--RSAGCSS--- 211

Query: 168 EPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-PNRIIPNFLVGCSVLSS---RQ 222
              +T  +C      Y V YG + ++ G    E L +    I+ +FL GC   +    R 
Sbjct: 212 ---STDASCI-----YDVKYGDNSISRGFLSQERLTITATDIVHDFLFGCGQDNEGLFRG 263

Query: 223 PAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
            AG+ G  R   S   Q   +    FSYCL S      T +S   L  G+S +      L
Sbjct: 264 TAGLMGLSRHPISFVQQTSSIYNKIFSYCLPS------TPSSLGHLTFGASAA--TNANL 315

Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
            YTPF    +++  N+F   Y + +  I+VGG ++      ++      GG+I+DSGT  
Sbjct: 316 KYTPF---STISGENSF---YGLDIVGISVGGTKLPA----VSSSTFSAGGSIIDSGTVI 365

Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
           T + P  +  L   F   M+K   Y  A G   L     C+D  G K  S P +   F G
Sbjct: 366 TRLPPTAYAALRSAFRQFMMK---YPVAYGTRLLD---TCYDFSGYKEISVPRIDFEFAG 419

Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
           G +V LP+     + GE SA  L +      +G    I GN Q +   V YD+   R+GF
Sbjct: 420 GVKVELPLVG--ILYGE-SAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGF 476

Query: 460 KQQLC 464
               C
Sbjct: 477 GAAGC 481


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 155/385 (40%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ++D+GS +VW  C     CK C     P F P  S S   + 
Sbjct: 130 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ---PCKLCYKQSDPVFDPAKSGSYTGVS 186

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  I +       C                 Y V+YG G  T+G    ETL    
Sbjct: 187 CGSSVCDRIENSGCHSGGCR----------------YEVMYGDGSYTKGTLALETLTFAK 230

Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
            ++ N  +GC   +       AG+ G G G  S   QL+      F YCL+S   D T  
Sbjct: 231 TVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST-- 288

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
             SL+        +    G ++ P V NP          +YYVGL+ + VGG R+ +   
Sbjct: 289 -GSLVFGR-----EALPVGASWVPLVRNPRAPS------FYYVGLKGLGVGGVRIPLPDG 336

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L   G+GG ++D+GT  T +    +    D F SQ     N  RA G         C
Sbjct: 337 VFDLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTA---NLPRASGVSIFD---TC 390

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  +   P +  +F  G  +TLP  N+   V +    C        AS     I+G
Sbjct: 391 YDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFA----ASPTGLSIIG 446

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   V +D  N  +GF   +C
Sbjct: 447 NIQQEGIQVSFDGANGFVGFGPNVC 471


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 110/406 (27%), Positives = 164/406 (40%), Gaps = 66/406 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + L  GTP       +DT S LVW  C     C  C     P F PKLSSS  ++ 
Sbjct: 90  GEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQ---PCVSCYRQLDPVFNPKLSSSYAVVP 146

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +  C+ +  +  +C + +D            C       G G+T+G    + L +   
Sbjct: 147 CTSDTCAQL--DGHRCHEDDD----------GACQYTYKYSGHGVTKGTLAIDKLAIGGD 194

Query: 207 IIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS- 261
           +    + GCS  S   PA    G+ G GRG  SL SQL++ +F YCL        +RTS 
Sbjct: 195 VFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHRFMYCLPPP----MSRTSG 250

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ--------- 312
            L+L  G+      +  +T T       ++    +  YYY+ L  + VG Q         
Sbjct: 251 KLVLGAGADAVRNMSDRVTVT-------MSSSTRYPSYYYLNLDGLAVGDQTPGTTRNAT 303

Query: 313 ----------RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
                             +        G IVD  +T +F+   L++ LAD+   ++    
Sbjct: 304 SPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEI---- 359

Query: 363 NYTRALGAEALTGLRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
              RA  +  L GL  CF +P   G      P + L F G     L ++     V +G  
Sbjct: 360 RLPRATPSLRL-GLDLCFILPEGVGMDRVYVPTVSLSFDG---RWLELDRDRLFVTDGRM 415

Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           +CL +      S     ILGNFQ+QN  V ++LR  ++ F +  C 
Sbjct: 416 MCLMIGRTSGVS-----ILGNFQLQNMRVLFNLRRGKITFAKASCD 456


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 111/393 (28%), Positives = 171/393 (43%), Gaps = 63/393 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ    I+DTGS + + PC+    C++C   + P F P LS + + + 
Sbjct: 87  GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST---CEHCGRHQDPKFQPDLSETYQPVK 143

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C  P C+    ++ QC    D   A   + + +    +V +G+       LSE    P R
Sbjct: 144 C-TPDCN-CDGDTNQC--MYDRQYAEMSSSSGVLGEDVVSFGN-------LSELA--PQR 190

Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDD 256
            +     GC       L S++  GI G GRG  S+  QL       D FS C        
Sbjct: 191 AV----FGCENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGG 246

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
                ++IL   S   D           V   S  +R   S YY + L+ + V G+++++
Sbjct: 247 ----GAMILGGISPPED----------MVFTHSDPDR---SPYYNINLKEMHVAGKKLQL 289

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
             K      DG  GT++DSGTT+ ++    F  LA  F   ++K RN  + +        
Sbjct: 290 NPKVF----DGKHGTVLDSGTTYAYLPETAF--LA--FKRAIMKERNSLKQINGPDPNYK 341

Query: 377 RPCFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREAS 431
             CF   G    +   SFP + + F+ G +++L  ENY F       A CL V ++    
Sbjct: 342 DICFTGAGIDVSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGR-- 399

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             P+ +LG   ++N  V YD  N ++GF +  C
Sbjct: 400 -DPTTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 114/396 (28%), Positives = 162/396 (40%), Gaps = 55/396 (13%)

Query: 89  YSISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS---FIPKLSSSSRL 144
           Y +S+  GTP PQ    + DTGS L W  C   Y CK C          F    SSS R 
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNC--EYWCKSCPKPNPHPGRVFRANDSSSFRT 176

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSGLTEGIALSET--- 200
           + C +  C     +     +C         N    C   Y  L G     G+  +ET   
Sbjct: 177 IPCSSDDCKIELQDYFSLTEC--------PNPNAPCLFDYRYLNGPRAI-GVFANETVTV 227

Query: 201 -LNLPNRI-IPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSH 252
            LN   +I + + L+GC+   +     P G+ G G  K SL  +L     +KFSYCL+ H
Sbjct: 228 GLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDH 287

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRRIT 308
                           SS + K        P +  P +          + +Y V +  I+
Sbjct: 288 L---------------SSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGIS 332

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VGG  + +      +   G GG IVDSGT+ T +A E ++ + D       K++   + +
Sbjct: 333 VGGSMLSISSDIWNVT--GVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHK---KVV 387

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
             E       CF+  G    + P L +HF  GA    PV++Y   V EG   CL ++   
Sbjct: 388 PIELPELNNFCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIK-CLGII--- 443

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +A    S ILGN   QN+  EYDL   +LGF    C
Sbjct: 444 KADFPGSSILGNVMQQNHLWEYDLGRGKLGFGPSSC 479


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 124/484 (25%), Positives = 199/484 (41%), Gaps = 79/484 (16%)

Query: 11  SFIFFFTLL------SIFPSSITSLTFSLSRFHT--------NPSQDSYQNLNSLVSSSL 56
           +F+F F LL      S   +S T   FS++  H         NPS    + + + V  S 
Sbjct: 3   AFVFCFLLLCSHSIASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERIKNTVLRSF 62

Query: 57  TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
            R+   K     +     +  T  I       Y +    GTPP     I DTGS L+W  
Sbjct: 63  ARS---KRRLRLSQNDDRSPGTITIPDEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQ 119

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
           C     C+ C     P F P+ SS+ + + C +  C+ +      C       +  S  C
Sbjct: 120 CA---PCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRAC-------VGKSGQC 169

Query: 177 TQICPSYLVLYGS-GLTEGIALSETLNLPNR----IIPNFLVGC------SVLSSRQPAG 225
                 Y  +YG   L  GI   E++N  ++      P    GC      +V  S++  G
Sbjct: 170 Y-----YQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMG 224

Query: 226 IAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYT 282
           + G G G  SL SQL      KFSYC     F   +  S+  +  G+    K+  G+  T
Sbjct: 225 LVGLGVGPLSLISQLGYQIGRKFSYC-----FPPLSSNSTSKMRFGNDAIVKQIKGVVST 279

Query: 283 PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFM 342
           P +       ++    YYY+ L  +++G ++V+      T +   +G  ++DSGT+FT +
Sbjct: 280 PLI------IKSIGPSYYYLNLEGVSIGNKKVK------TSESQTDGNILIDSGTSFTIL 327

Query: 343 APELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAE 402
               +    ++FV+ +VK      A+    L     CF+  G++   FP++   F  GA+
Sbjct: 328 KQSFY----NKFVA-LVKEVYGVEAVKIPPLV-YNFCFENKGKRK-RFPDVVFLFT-GAK 379

Query: 403 VTLPVENYFAVVGEGSAVCLTVV--TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
           V +   N F    + + +C+  +  +D + S     I GN     Y VEYDL+   + F 
Sbjct: 380 VRVDASNLFE-AEDNNLLCMVALPTSDEDDS-----IFGNHAQIGYQVEYDLQGGMVSFA 433

Query: 461 QQLC 464
              C
Sbjct: 434 PADC 437


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 109/423 (25%), Positives = 159/423 (37%), Gaps = 38/423 (8%)

Query: 57  TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
           TR L  +    +          + +  H     ++SL+ GTPPQ +  +LDTGS L W  
Sbjct: 34  TRPLLFELRARQVPAGALPRPASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLL 93

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
           C           S + SF P+ S +   + C + +C        + RD    P       
Sbjct: 94  CAPGGGGGGGGRSAL-SFRPRASLTFASVPCDSAQC--------RSRDLPSPP--ACDGA 142

Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-SRQPAGIA-----GFG 230
           ++ C   L       ++G   +E   +          GC   +    P G+A     G  
Sbjct: 143 SKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMN 202

Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           RG  S  SQ +  +FSYC+      D      L+L     HSD     L YTP    P++
Sbjct: 203 RGALSFVSQASTRRFSYCI-----SDRDDAGVLLL----GHSDLPFLPLNYTPLYQ-PAM 252

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
                  V Y V L  I VGG+ + +    L  D  G G T+VDSGT FTF+  + +  L
Sbjct: 253 PLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAL 312

Query: 351 ADEFVSQMVKNRNYTRALGAEALT---GLRPCFDVPGEKT--GSFPELKLHFKGGAEVTL 405
             EF  Q    + +  AL             CF VP  +      P + L F G      
Sbjct: 313 KAEFSRQ---TKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVA 369

Query: 406 PVENYFAVVGE---GSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
                + V GE   G  V      + +     + ++G+    N +VEYDL   R+G    
Sbjct: 370 GDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPI 429

Query: 463 LCK 465
            C 
Sbjct: 430 RCD 432


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 119/442 (26%), Positives = 188/442 (42%), Gaps = 57/442 (12%)

Query: 33  LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSIS 92
           LS  H NPS   Y +L      S +R+  +    T  T+ +T    + I   S G + +S
Sbjct: 39  LSPLH-NPSLSRYDSLIDAFRRSFSRSATL---LTHLTSVSTACIRSPIIPDS-GEFLMS 93

Query: 93  LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
           +  GTPP  +  I DTGS L W  C     C+ C +   P F P+ SSS R + C +  C
Sbjct: 94  IFIGTPPVNVIAIADTGSDLTWTQC---LPCRECFNQSQPIFNPRRSSSYRKVSCASDTC 150

Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRIIPNF 211
                 S++   C  +         Q C SY   YG    T G   S+ + + +  +P  
Sbjct: 151 -----RSLESYHCGPD--------LQSC-SYGYSYGDRSFTYGDLASDQITIGSFKLPKT 196

Query: 212 LVGCSVLSSRQPAGIAGFGRGKTSLP----SQLNL-----DKFSYCLLSHKFDDTTRTSS 262
           ++GC   +     G+     G         SQ+        +FSYCL +  F +   T +
Sbjct: 197 VIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTF-FSNANITGT 255

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           +     +  S ++   +  TP V  P   +      +Y++ L  I+VG +R +  +    
Sbjct: 256 ISFGRKAVVSGRQ---VVSTPLV--PRSPD-----TFYFLTLEAISVGKKRFKAANGISA 305

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
           +   GN   I+DSGTT T +   L+  +    +++++K +      G   L     C+  
Sbjct: 306 MTNHGN--IIIDSGTTLTLLPRSLYYGVFST-LARVIKAKRVDDPSGILEL-----CYSA 357

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
                 + P +  HF GGA+V L   N FA V + +  CLT     + +     I GN  
Sbjct: 358 GQVDDLNIPIITAHFAGGADVKLLPVNTFAPVAD-NVTCLTFAPATQVA-----IFGNLA 411

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
             N+ V YDL N+RL F+ +LC
Sbjct: 412 QINFEVGYDLGNKRLSFEPKLC 433


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 111/383 (28%), Positives = 157/383 (40%), Gaps = 65/383 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y ++L   TPP  +  + DTGS LVW  C            K+P+     SSS   L C 
Sbjct: 76  YLMALDVSTPPVRMLALADTGSSLVWLKC------------KLPAAHTPASSSYARLPCD 123

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
              C  +  ++  CR         + +   IC           T G    +      R+ 
Sbjct: 124 AFACKALG-DAASCR--------ATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTRL- 173

Query: 209 PNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTTRT 260
            +F  GC+  +   S    G+ G   G  SL SQL+       KFSYCL+ +    ++ T
Sbjct: 174 -DF--GCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYS---SSET 227

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            S  L+ GS      + G   TP V     A RN    +Y + L  I V G+ V +    
Sbjct: 228 VSSSLNFGSHAIVSSSPGAATTPLV-----AGRN--KSFYTIALDSIKVAGKPVPLQTTT 280

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
             L        IVDSGT  T++   + +PL    V+ +       R    E L  +  C+
Sbjct: 281 TKL--------IVDSGTMLTYLPKAVLDPL----VAALTAAIKLPRVKSPETLYAV--CY 326

Query: 381 DV----PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
           DV    P +   S P++ L   GG EV LP  N F V  +G+ VCL +V     S  P  
Sbjct: 327 DVRRRAPEDVGKSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVE----SHLPEF 382

Query: 437 ILGNFQMQNYYVEYDLRNQRLGF 459
           ILGN   QN +V +DL  + + F
Sbjct: 383 ILGNVAQQNLHVGFDLERRTVSF 405


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/385 (25%), Positives = 149/385 (38%), Gaps = 56/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +   FGTP Q +   +DT +   W PCT    C  CS++    F P  S++ + +GC 
Sbjct: 106 YIVRAKFGTPAQTLLLAMDTSNDAAWVPCT---ACVGCSTTT--PFAPPKSTTFKKVGCG 160

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
             +C  + + +     C                ++   YG+       + +T+ L    +
Sbjct: 161 ASQCKQVRNPTCDGSAC----------------AFNFTYGTSSVAASLVQDTVTLATDPV 204

Query: 209 PNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
           P +  GC      S L  +   G+        +   +L    FSYCL S K         
Sbjct: 205 PAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFK--------- 255

Query: 263 LILDNGSSHSDKKTTGLTYT---PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
               N S H D            P   NP    R++    YYV L  I VG + V +  +
Sbjct: 256 --TLNFSGHXDLXPVAQPRDQVYPSFKNP---RRSSL---YYVNLVAIRVGRRIVDIPPE 307

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
            L  +     GT+ DSGT FT     L EP      ++  +  +  + L   +L G   C
Sbjct: 308 ALAFNPXTGAGTVFDSGTVFT----RLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTC 363

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           + VP       P +   F  G  VTLP +N       GS  CL +    +       ++ 
Sbjct: 364 YTVPIVA----PTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIA 418

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q QN+ V +D+ N RLG  ++LC
Sbjct: 419 NMQQQNHRVLFDVPNSRLGVARELC 443


>gi|356518052|ref|XP_003527698.1| PREDICTED: basic 7S globulin 2-like [Glycine max]
          Length = 447

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 167/396 (42%), Gaps = 63/396 (15%)

Query: 92  SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
           ++  GTP      ++D G   +W  C+N    +Y SSSK            R + C++ K
Sbjct: 59  TIGIGTPQHSTNLVIDLGGENLWHDCSNR---RYNSSSK------------RKIVCKSKK 103

Query: 152 CSWIHHESIQCRDCN----DEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI 207
           C     E   C         +P     +CT    + L  + S  T    + +T+ L +  
Sbjct: 104 CP----EGAACVSTGCIGPYKPGCAISDCTITVSNPLAQFSSSYT---MVEDTIFLSHTY 156

Query: 208 IPNFLVGCSVLSS-----------RQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLS 251
           IP FL GC  L             R   GI GF   + +LPSQL L      KFS C  S
Sbjct: 157 IPGFLAGCVDLDDGLSGNALQGLPRTSKGIIGFSHSELALPSQLVLSNKLIPKFSLCFPS 216

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRI 307
              ++     ++ +  G  H   ++  L  TP V NP    +V+   A S+ Y++ ++ I
Sbjct: 217 S--NNLKGFGNIFIGAGGGHPQVESKFLQTTPLVVNPVATGAVSIYGAPSIEYFIDVKAI 274

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
            + G  + +    L++D+ GNGGT + + T +T +   L++P   EF+++  + R   R 
Sbjct: 275 KIDGHVLNLNSSLLSIDKKGNGGTKISTMTPWTELHSSLYKPFVQEFINK-AEGRRMKR- 332

Query: 368 LGAEALTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
                +     CFD   +    TG + P + L   GGA+ T+   N   V+   +  CL 
Sbjct: 333 --VAPVPPFDACFDTSTIRNSITGLAVPSIDLVLPGGAQWTIYGANSMTVMTSKNVACLA 390

Query: 424 VVTD----REASG---GPSIILGNFQMQNYYVEYDL 452
            V      +E        S+++G  Q+++  +  D+
Sbjct: 391 FVDGGMKPKEMHSIQLEASVVIGGHQLEDNLLVIDM 426


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 112/390 (28%), Positives = 164/390 (42%), Gaps = 56/390 (14%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           GY IS   GTPP  +  ++DT +  +WF C     CK C ++  P F P  SS+ + + C
Sbjct: 88  GYIISFLIGTPPFQLYGVMDTANDNIWFQCN---PCKPCFNTTSPMFDPSKSSTYKTIPC 144

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-PNR 206
            +PKC  +  E+  C          S +  ++C       G   ++G    +TL L  N 
Sbjct: 145 SSPKCKNV--ENTHC----------SSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNN 192

Query: 207 IIP----NFLVGCSVLSSRQP-----AGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
             P    N ++GC    ++ P     +G  G GRG  S  SQLN     KFSYCL+   F
Sbjct: 193 DTPISFKNIVIGCG-HRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVP-LF 250

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
            +   +  L   + S  S     G   TP           A  + Y   L  ++VG   +
Sbjct: 251 SNEGISGKLHFGDKSVVSG---VGTVSTPIT---------AGEIGYSTTLNALSVGDHII 298

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
           +  +   T   D  G TI+DSGTT T +   ++  L +  V+ MVK     RA       
Sbjct: 299 KFENS--TSKNDNLGNTIIDSGTTLTILPENVYSRL-ESIVTSMVK---LERAKSPNQ-- 350

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
             + C+     K    P +  HF  GA+V L   N F  + +   VC   V+       P
Sbjct: 351 QFKLCYKAT-LKNLDVPIITAHFN-GADVHLNSLNTFYPI-DHEVVCFAFVS---VGNFP 404

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             I+GN   QN+ V +DL+   + FK   C
Sbjct: 405 GTIIGNIAQQNFLVGFDLQKNIISFKPTDC 434


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 104/398 (26%), Positives = 170/398 (42%), Gaps = 56/398 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  GTPP      +DTGS ++W  C +   C   S  +I    F P  SS+S +
Sbjct: 73  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 132

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
           + C + +C    +  IQ  D      AT  +    C SY   YG G  T G  +S+ ++L
Sbjct: 133 IACSDQRC----NNGIQSSD------ATCSSQNNQC-SYTFQYGDGSGTSGYYVSDMMHL 181

Query: 204 ---------PNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
                     N   P  + GCS         S R   GI GFG+ + S+ SQL+    + 
Sbjct: 182 NTIFEGSVTTNSTAP-VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAP 240

Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
            + SH    D++    L+L       +     + YT  V  P+         +Y + L+ 
Sbjct: 241 RVFSHCLKGDSSGGGILVL------GEIVEPNIVYTSLV--PA-------QPHYNLNLQS 285

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I V GQ +++           + GTIVDSGTT  ++A E ++P      + + ++ +   
Sbjct: 286 IAVNGQTLQIDSSVFATSN--SRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTVV 343

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
           + G +       C+ +    T  FP++ L+F GGA + L  ++Y           +  + 
Sbjct: 344 SRGNQ-------CYLITSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIG 396

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ++  G    ILG+  +++  V YDL  QR+G+    C
Sbjct: 397 FQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDC 434


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 109/422 (25%), Positives = 159/422 (37%), Gaps = 38/422 (9%)

Query: 57  TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
           TR L  +    +          + +  H     ++SL+ GTPPQ +  +LDTGS L W  
Sbjct: 33  TRPLLFELRARQVPAGALPRPASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLL 92

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
           C           S + SF P+ S +   + C + +C        + RD    P       
Sbjct: 93  CAPGGGGGGGGRSAL-SFRPRASLTFASVPCGSAQC--------RSRDLPSPP--ACDGA 141

Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-SRQPAGIA-----GFG 230
           ++ C   L       ++G   +E   +          GC   +    P G+A     G  
Sbjct: 142 SKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMN 201

Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           RG  S  SQ +  +FSYC+      D      L+L     HSD     L YTP    P++
Sbjct: 202 RGALSFVSQASTRRFSYCI-----SDRDDAGVLLL----GHSDLPFLPLNYTPLYQ-PAM 251

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
                  V Y V L  I VGG+ + +    L  D  G G T+VDSGT FTF+  + +  L
Sbjct: 252 PLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAL 311

Query: 351 ADEFVSQMVKNRNYTRALGAEALT---GLRPCFDVPGEKT--GSFPELKLHFKGGAEVTL 405
             EF  Q    + +  AL             CF VP  +      P + L F G      
Sbjct: 312 KAEFSRQ---TKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVA 368

Query: 406 PVENYFAVVGE---GSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
                + V GE   G  V      + +     + ++G+    N +VEYDL   R+G    
Sbjct: 369 GDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPI 428

Query: 463 LC 464
            C
Sbjct: 429 RC 430


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 123/441 (27%), Positives = 191/441 (43%), Gaps = 69/441 (15%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           N ++ S Q + + +  S    L   N      +  +  T+        G Y +++S GTP
Sbjct: 42  NSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNR------GEYLMNISIGTP 95

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
           P  I  I DTGS L+W  C     C+ C     P F PK SS+ R + C           
Sbjct: 96  PVPILAIADTGSDLIWTQCN---PCEDCYQQTSPLFDPKESSTYRKVSCS---------- 142

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNR-----IIPNFL 212
           S QCR   D   +T +N      SY + YG +  T+G    +T+ + +       + N +
Sbjct: 143 SSQCRALEDASCSTDENTC----SYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMI 198

Query: 213 VGC--SVLSSRQPA--GIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
           +GC      +  PA  GI G G G TSL SQL      KFSYCL+   F   T  +S I 
Sbjct: 199 IGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLV--PFTSETGLTSKI- 255

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
                  +  T G+     V + S+ +++  + YY++ L  I+VG ++++      T+  
Sbjct: 256 -------NFGTNGIVSGDGVVSTSMVKKDP-ATYYFLNLEAISVGSKKIQFTS---TIFG 304

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
            G G  ++DSGTT T +    +  L +  V+  +K        G  +L     C+     
Sbjct: 305 TGEGNIVIDSGTTLTLLPSNFYYEL-ESVVASTIKAERVQDPDGILSL-----CY----R 354

Query: 386 KTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
            + SF  P++ +HFKGG +V L   N F  V E  + C     + + +     I GN   
Sbjct: 355 DSSSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVS-CFAFAANEQLT-----IFGNLAQ 407

Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
            N+ V YD  +  + FK+  C
Sbjct: 408 MNFLVGYDTVSGTVSFKKTDC 428


>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
          Length = 191

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 97/197 (49%), Gaps = 13/197 (6%)

Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
           G      K   L +T  V      + N    +YYV ++ + VGG+ + +  +   L  +G
Sbjct: 5   GEDKELLKHLNLNFTSLVG----GKENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEG 60

Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT 387
            GGTI+DSGTT ++ A   +E +   FV+++       R    +    L+PC++V G + 
Sbjct: 61  VGGTIIDSGTTLSYFAEPAYEIIKQAFVNKV------KRYPILDDFPILKPCYNVSGVEK 114

Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYY 447
              P   + F  GA  T PVENYF  +     VCL ++    ++     I+GN+Q QN++
Sbjct: 115 LELPSFGIVFGDGAIWTFPVENYFIKLEPEDIVCLAILGTPHSAMS---IIGNYQQQNFH 171

Query: 448 VEYDLRNQRLGFKQQLC 464
           + YD +  RLGF  + C
Sbjct: 172 ILYDTKRSRLGFAPRRC 188


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 164/392 (41%), Gaps = 65/392 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           + +++  GTP Q    I DTGS L W  C       +C   + P F P  SS+   + C 
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 208

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNR 206
            P+C+                L +  N T +   YLV YG G  T G+   +TL L  +R
Sbjct: 209 EPQCAAAGG------------LCSEDNTTCL---YLVHYGDGSSTTGVLSRDTLALTSSR 253

Query: 207 IIPNFLVGCSVLSSRQPAGIAGFGR---------GKTSLPSQLNLD---KFSYCLLSHKF 254
            +  F  GC   +      +  FGR         G+ SLPSQ        FSYCL S   
Sbjct: 254 ALAGFPFGCGTRN------LGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPS--- 304

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             +  T+  +    +  +D  T    YT  +  P       F  +Y+V L  I +GG  +
Sbjct: 305 --SNSTTGYLTIGATPATD--TGAAQYTAMLRKPQ------FPSFYFVELVSIDIGGYIL 354

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            V     T      GGT++DSGT  T++  + +E L D F   M +   YT A   + L 
Sbjct: 355 PVPPAVFT-----RGGTLLDSGTVLTYLPAQAYELLRDRFRLTMER---YTPAPPNDVLD 406

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAVCLTVVTDREASG 432
               C+D  GE     P +   F  GA   L   ++F V+   + +  CL      +A G
Sbjct: 407 A---CYDFAGESEVIVPAVSFRFGDGAVFEL---DFFGVMIFLDENVGCLAFAA-MDAGG 459

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            P  I+GN Q ++  V YD+  +++GF    C
Sbjct: 460 LPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 121/281 (43%), Gaps = 40/281 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L+ GTPP+ +   LDTGS LVW  C     C+ C    IP   P  SS+   L C 
Sbjct: 86  YLVHLAVGTPPRPVALTLDTGSDLVWTQCA---PCRDCFDQGIPLLDPAASSTYAALPCG 142

Query: 149 NPKCSWIHHESIQCRDC------NDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
            P+C  +   S   R C       D+ +   K    I         +G   G     +L 
Sbjct: 143 APRCRALPFTSCGGRSCVYVYHYGDKSVTVGK----IATDRFTFGDNGRRNG---DGSLP 195

Query: 203 LPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
              R+      GC      V  S +  GIAGFGRG+ SLPSQLN   FSYC  S  FD  
Sbjct: 196 ATRRLT----FGCGHFNKGVFQSNE-TGIAGFGRGRWSLPSQLNATSFSYCFTS-MFDSK 249

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           +   +L     + +S   +  +  TP   NPS          Y++ L+ I+VG  R+ V 
Sbjct: 250 SSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPS------LYFLSLKGISVGKTRLPVP 303

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
                        TI+DSG + T +  E++E +  EF +Q+
Sbjct: 304 ETKFR-------STIIDSGASITTLPEEVYEAVKAEFAAQV 337


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 124/471 (26%), Positives = 185/471 (39%), Gaps = 84/471 (17%)

Query: 24  SSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRALHIKNP-------------- 65
           SS++  T +L+  H      PS         L+     RA HI+                
Sbjct: 47  SSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQRKFAMNAAVDGAGDLQ 106

Query: 66  QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQ 122
           Q+K +++  T   +++ +  Y    IS+  GTP       +DTGS + W    PC N   
Sbjct: 107 QSKVSSSVPTKLGSSLDTLEY---VISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPN--- 160

Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS 182
              C +     F P  SS+ R + C   +C+ +  +   C        AT+  C      
Sbjct: 161 -PPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCG-------ATNYEC-----Q 207

Query: 183 YLVLYGSG-LTEGIALSETLNL--PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSL 236
           Y V YG G  T G    +TL L   +  +  F  GCS L S    Q  G+ G G G  SL
Sbjct: 208 YGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSL 267

Query: 237 PSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER 293
            SQ      + FSYCL                 +GSS       G   + FV    +  +
Sbjct: 268 VSQTAAAYGNSFSYCLPP--------------TSGSSGFLTLGGGGGASGFVTTRMLRSK 313

Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
                +Y   L+ I VGG+++ +             G++VDSGT  T + P  +  L+  
Sbjct: 314 Q-IPTFYGARLQDIAVGGKQLGLSPSVFA------AGSVVDSGTIITRLPPTAYSALSSA 366

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
           F + M + R+      A A + L  CFD  G+   S P + L F GGA + L        
Sbjct: 367 FKAGMKQYRS------APARSILDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIM-- 418

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              G+ +      D   +G    I+GN Q + + V YD+ +  LGF+   C
Sbjct: 419 --YGNCLAFAATGDDGTTG----IIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 102/391 (26%), Positives = 168/391 (42%), Gaps = 56/391 (14%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y+  L  GTPPQ    I+DTGS + + PC++   C++C   + P F P  SS+   
Sbjct: 84  SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSD---CEHCGKHQDPRFQPDESSTYHP 140

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           + C N  C+   H+ + C    +   A   + + +    ++ +G         +++  +P
Sbjct: 141 VKC-NMDCN-CDHDGVNC--VYERRYAEMSSSSGVLGEDIISFG---------NQSEVVP 187

Query: 205 NRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
            R +     GC       L S++  GI G GRG+ S+  QL              D    
Sbjct: 188 QRAV----FGCENVETGDLYSQRADGIMGLGRGQLSIVDQL-------------VDKNVI 230

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVRVWH 318
             S  L  G  H       L   P   +   +  + + S YY + L+ I V G+ +++  
Sbjct: 231 NDSFSLCYGGMHVGGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKL-- 288

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              T DR    GT++DSGTT+ ++  E F    D  +    K+ N  +  G +       
Sbjct: 289 SPSTFDR--KHGTVLDSGTTYAYLPEEAFVAFRDAIIK---KSHNLKQIHGPDPNYN-DI 342

Query: 379 CFDVPGEK----TGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGG 433
           CF   G      + +FPE+ + F  G +++L  ENY F       A CL +  +    G 
Sbjct: 343 CFSGAGRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRN----GD 398

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + +LG   ++N  V YD  N+++GF +  C
Sbjct: 399 STTLLGGIIVRNTLVTYDRENEKIGFWKTNC 429


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 108/392 (27%), Positives = 153/392 (39%), Gaps = 42/392 (10%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++SL+ GTPPQ +  +LDTGS L W  C         S+    SF P+ SS+   + C +
Sbjct: 86  TVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAM---SFRPRASSTFAAVPCAS 142

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
            +C        + RD    P       +  C   L       ++G   ++   + +    
Sbjct: 143 AQC--------RSRDLPSPP--ACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPL 192

Query: 210 NFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
               GC      S       AG+ G  RG  S  SQ +  +FSYC+      D      L
Sbjct: 193 RAAFGCMSSAFDSSPDGVASAGLLGMNRGALSFVSQASTRRFSYCI-----SDRDDAGVL 247

Query: 264 ILDNGSSHSDKKT-TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           +L     HSD  T   L YTP    P++       V Y V L  I VGG+ + +    L 
Sbjct: 248 LL----GHSDLPTFLPLNYTPMYQ-PALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLA 302

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPC 379
            D  G G T+VDSGT FTF+  + +  L  EF  Q    R    AL   +         C
Sbjct: 303 PDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFTRQA---RPLLPALDDPSFAFQEAFDTC 359

Query: 380 FDVPGEK---TGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGG 433
           F VP  +   T   P + L F G           + V GE   G  V      + +    
Sbjct: 360 FRVPQGRSPPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPI 419

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            + ++G+    N +VEYDL   R+G     C 
Sbjct: 420 MAYVIGHHHQMNVWVEYDLERGRVGLAPVRCD 451


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 113/397 (28%), Positives = 173/397 (43%), Gaps = 66/397 (16%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           ++G Y +    GTPP     I DT S L+W  C+    C+ C     P F P  SS+   
Sbjct: 86  NHGEYLMRFYIGTPPVERLAIADTASDLIWVQCS---PCETCFPQDTPLFEPHKSSTFAN 142

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
           L C +  C+     S     C   PL  +     +C  Y   YG G  T+G+  +E+++ 
Sbjct: 143 LSCDSQPCT-----SSNIYYC---PLVGN-----LC-LYTNTYGDGSSTKGVLCTESIHF 188

Query: 204 PNRII--PNFLVGCSVLS------SRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH 252
            ++ +  P  + GC   +      S +  GI G G G  SL SQL      KFSYCLL  
Sbjct: 189 GSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPF 248

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
                T TS++ L  G+  +     G+  TP + +P       +  YY++ L  IT+G +
Sbjct: 249 -----TSTSTIKLKFGND-TTITGNGVVSTPLIIDPH------YPSYYFLHLVGITIGQK 296

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            ++V     T D   NG  I+D GT  T++    +      FV+ +        ALG   
Sbjct: 297 MLQVR----TTDHT-NGNIIIDLGTVLTYLEVNFYH----NFVTLL------REALGISE 341

Query: 373 LTGLRP-----CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
                P     CF  P +   +FP++   F  GA+V L  +N F    + + +CL V+ D
Sbjct: 342 TKDDIPYPFDFCF--PNQANITFPKIVFQFT-GAKVFLSPKNLFFRFDDLNMICLAVLPD 398

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             A G    + GN    ++ VEYD + +++ F    C
Sbjct: 399 FYAKGFS--VFGNLAQVDFQVEYDRKGKKVSFAPADC 433


>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
 gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
          Length = 467

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 105/426 (24%), Positives = 167/426 (39%), Gaps = 78/426 (18%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++ ++ G PPQ +  +LDTGS L W           C+ S++PS  P+  + +   G  +
Sbjct: 60  TVPVAVGAPPQNVTMVLDTGSELSWL---------LCNGSRVPSTPPQPQAPAAFNGSAS 110

Query: 150 -----------PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS 198
                      P+C W      + RD    P       +  C   L    +   +G+  +
Sbjct: 111 STYAAAHCSSSPECQW------RGRDLPVPPFCAGPP-SNSCRVSLSYADASSADGVLAA 163

Query: 199 ETLNLPNRIIPNFLVGC--------------------SVLSSRQPAGIAGFGRGKTSLPS 238
           +T  L        L GC                    +  SS    G+ G  RG  S  +
Sbjct: 164 DTFLLGGAPPVRALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVT 223

Query: 239 QLNLDKFSYCLLSHKFDDTTRTSSLILD-NGSSHSDKKTTGLTYTPFVNNPSVAERNAF- 296
           Q    +F+YC+             L+L  +G   +      L YTP +    +++   + 
Sbjct: 224 QTGTLRFAYCIAPGDGPGL-----LVLGGDGDGAALSAAPQLNYTPLIE---MSQPLPYF 275

Query: 297 -SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
             V Y V L  I VG   + +    L  D  G G T+VDSGT FTF+  + + PL  EF+
Sbjct: 276 DRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFL 335

Query: 356 SQMVKNRNYTRALGAEALT---GLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLP 406
           +Q          LG            CF     +  +       PE+ L  + GAEV + 
Sbjct: 336 NQ---TSALLAPLGEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLR-GAEVAVG 391

Query: 407 VENYFAVV-----GEGSAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
            E    +V     GEG +  +  +T  + + +G  + ++G+   QN +VEYDL+N R+GF
Sbjct: 392 GEKLLYMVPGERRGEGGSEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNSRVGF 451

Query: 460 KQQLCK 465
               C 
Sbjct: 452 APARCD 457


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 107/397 (26%), Positives = 167/397 (42%), Gaps = 55/397 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
           G Y   +  GTPP+     +DTGS ++W  C++   C   S   I    F    SS++RL
Sbjct: 79  GLYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARL 138

Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
           + C +P C S I   + QC          S  C     SY   YG G  T G  +S+T  
Sbjct: 139 VPCSHPICTSQIQTTATQCP-------PQSNQC-----SYAFQYGDGSGTSGYYVSDTFY 186

Query: 203 ----LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSY 247
               L   +I N     + GCS   S       +   GI GFG+G+ S+ SQL+    + 
Sbjct: 187 FDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITP 246

Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
            + SH          +++       +    G+ Y+P V  PS         +Y + L+ I
Sbjct: 247 RVFSHCLKGEDSGGGILV-----LGEILEPGIVYSPLV--PS-------QPHYNLDLQSI 292

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
            V GQ + +           N GTI+D+GTT  ++  E ++P    FVS +         
Sbjct: 293 AVSGQLLPI--DPAAFATSSNRGTIIDTGTTLAYLVEEAYDP----FVSAITAA---VSQ 343

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
           L    +     C+ V    +  FP +  +F GGA + L  E Y   +   +   L  +  
Sbjct: 344 LATPTINKGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGF 403

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ++  GG + ILG+  +++    YDL +QR+G+    C
Sbjct: 404 QKIQGGIT-ILGDLVLKDKIFVYDLAHQRIGWANYDC 439


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 116/456 (25%), Positives = 177/456 (38%), Gaps = 70/456 (15%)

Query: 45  YQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIP 103
           ++ L   +  S  R   I  P+   T++            S GG Y + L  GTP     
Sbjct: 44  HELLRRAIQRSRDRLASIA-PRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCFT 102

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH-HESIQC 162
             +DT S L+W  C     C  C     P F P  S+S  ++ C +  C  +  H   + 
Sbjct: 103 AAIDTASDLIWTQCQ---PCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARD 159

Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ 222
            D +DE      +  Q   SY    G+  T GI   + L + + +    + GCS  S   
Sbjct: 160 GDSDDE------DACQYTYSY---GGNATTRGILAVDRLAIGDDVFRGVVFGCSSSSVGG 210

Query: 223 P----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
           P    +G+ G GRG  SL SQL++ +F YCL         R   L+L   ++ + +  + 
Sbjct: 211 PPPQVSGVVGLGRGALSLVSQLSVRRFMYCLPPPVSRSAGR---LVLGADAAATVRNASE 267

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG--------- 329
               P     S   R  +  YYY+ L  I++ G R   +     ++    G         
Sbjct: 268 RVVVPM----STGSR--YPSYYYLNLDGISI-GDRAMSFRSRNRMNATTPGTAAGAPASP 320

Query: 330 -----------------GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
                            G I+D  +T TF+   L+E + D+   ++   R      G+ +
Sbjct: 321 VSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEIRLPR------GSGS 374

Query: 373 LTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
             GL  CF +P     S    P + L F+ G  + L  E  F        +CL V     
Sbjct: 375 DLGLDLCFILPEGVPMSRVYAPPVSLAFE-GVWLRLDKEQMFVEDRASGMMCLMVGKTDG 433

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            S     ILGN+Q QN  V Y+LR  R+ F +  C+
Sbjct: 434 VS-----ILGNYQQQNMQVMYNLRRGRITFIKTACE 464


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 111/391 (28%), Positives = 167/391 (42%), Gaps = 54/391 (13%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           G+ ++LS G+PP     ++DTGS L+W  C     C  C       F P  S S + LGC
Sbjct: 103 GFLVNLSIGSPPVTQLVVVDTGSSLLWVQCL---PCINCFQQSTSWFDPLKSVSFKTLGC 159

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIA-----LSETLN 202
             P  ++I+        CN    A  K         L   G   ++GI      L ETL+
Sbjct: 160 GFPGYNYIN-----GYKCNRFNQAEYK---------LRYLGGDSSQGILAKESLLFETLD 205

Query: 203 LPNRIIPNFLVGC---SVLSSRQPAGIAGFGRGK---TSLPSQLNLDKFSYCLLSHKFDD 256
                  N   GC   ++ ++   A    FG G     ++ +QL  +KFSYC+     ++
Sbjct: 206 EGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG-NKFSYCI--GDINN 262

Query: 257 TTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
              T + L+L  GS              ++   S   +  F  +YYV L+ I+VG + ++
Sbjct: 263 PLYTHNHLVLGQGS--------------YIEGDSTPLQIHFG-HYYVTLQSISVGSKTLK 307

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      +  DG+GG ++DSG T+T +A   FE L DE V  M       R        G
Sbjct: 308 IDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLM--KGLLERIPTQRKFEG 365

Query: 376 LRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
           L  CF  V       FP +  HF GGA++ L   + F   G G   CL ++         
Sbjct: 366 L--CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHG-GDRFCLAILPSNSELLNL 422

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           S+I G    QNY V +DL   ++ F++  C+
Sbjct: 423 SVI-GILAQQNYNVGFDLEQMKVFFRRIDCQ 452


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 104/400 (26%), Positives = 165/400 (41%), Gaps = 59/400 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  G+PP+     +DTGS ++W  C+    C  C SS     ++  F P  SS+
Sbjct: 89  GLYFTRVKLGSPPKEYFVQIDTGSDILWVACS---PCTGCPSSSGLNIQLEFFNPDTSST 145

Query: 142 SRLLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSE 199
           S  + C + +C + +      C+  ++ P             Y   YG G  T G  +S+
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCG-----------YTFTYGDGSGTSGYYVSD 194

Query: 200 TLN----LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDK 244
           T+     + N    N     + GCS         + R   GI GFG+ + S+ SQLN   
Sbjct: 195 TMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLG 254

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
            S  + SH    +     +++       +    GL YTP V  PS         +Y + L
Sbjct: 255 VSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLNL 300

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
             I V GQ++ +     T       GTIVDSGTT  ++A   ++P  +   + +  +   
Sbjct: 301 ESIVVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPS--- 355

Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
            R+L ++       CF        SFP + L+F GG  +T+  ENY           L  
Sbjct: 356 VRSLVSKG----NQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWC 411

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +  +   G    ILG+  +++    YDL N R+G+    C
Sbjct: 412 IGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 451


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 151/384 (39%), Gaps = 52/384 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTP Q +   +DT S + W PC+    C  C S+   +F P  S+S + + C 
Sbjct: 115 YIVKALIGTPAQPLLLAMDTSSDVAWIPCSG---CVGCPSNT--AFSPAKSTSFKNVSCS 169

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P+C  + + +   R C                S+ + YGS         +T+ L    I
Sbjct: 170 APQCKQVPNPTCGARAC----------------SFNLTYGSSSIAANLSQDTIRLAADPI 213

Query: 209 PNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
             F  GC        ++   +   G+        S    +    FSYCL S  F   T +
Sbjct: 214 KAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPS--FRSLTFS 271

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            SL L   S     K     YT  + NP    R++    YYV L  I VG + V +    
Sbjct: 272 GSLRLGPTSQPQRVK-----YTQLLRNP---RRSSL---YYVNLVAIRVGRKVVDLPPAA 320

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           +  +     GTI DSGT +T +A  ++E + +EF  ++        +LG          F
Sbjct: 321 IAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGG---------F 371

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           D         P +   FK G  +T+P +N       GS  CL +    E       ++ +
Sbjct: 372 DTCYSGQVKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIAS 430

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q QN+ V  D+ N RLG  ++ C
Sbjct: 431 MQQQNHRVLIDVPNGRLGLARERC 454


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 103/380 (27%), Positives = 155/380 (40%), Gaps = 47/380 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y  S   GTPPQ +   LD  S LVW  C                F P  S++   + 
Sbjct: 98  GMYVFSYGIGTPPQQVSGALDISSDLVWTAC-----------GATAPFNPVRSTTVADVP 146

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
           C +  C     ++     C     A S  C     +Y  +YG G   T G+  +E     
Sbjct: 147 CTDDACQQFAPQT-----CGAGAGAGSSEC-----AYTYMYGGGAANTTGLLGTEAFTFG 196

Query: 205 NRIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
           +  I   + GC   +V      +G+ G GRG  SL SQL +D+FSY       DD+  T 
Sbjct: 197 DTRIDGVVFGCGLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAP---DDSVDTQ 253

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           S IL       D  T   ++T    +  +   +A    YYV L  I V G+ + +     
Sbjct: 254 SFIL-----FGDDATPQTSHT---LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTF 305

Query: 322 TL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
            L ++DG+GG  +      T +    ++PL      Q V ++    A+   AL GL  C+
Sbjct: 306 DLRNKDGSGGVFLSITDLVTVLEEAAYKPL-----RQAVASKIGLPAVNGSAL-GLDLCY 359

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
                     P + L F GGA + L + NYF +       CLT++    +S G   +LG+
Sbjct: 360 TGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLACLTIL---PSSAGDGSVLGS 416

Query: 441 FQMQNYYVEYDLRNQRLGFK 460
                 ++ YD+   +L F+
Sbjct: 417 LIQVGTHMMYDINGSKLVFE 436


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 104/400 (26%), Positives = 165/400 (41%), Gaps = 59/400 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  G+PP+     +DTGS ++W  C+    C  C SS     ++  F P  SS+
Sbjct: 89  GLYFTRVKLGSPPKEYFVQIDTGSDILWVACS---PCTGCPSSSGLNIQLEFFNPDTSST 145

Query: 142 SRLLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSE 199
           S  + C + +C + +      C+  ++ P             Y   YG G  T G  +S+
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCG-----------YTFTYGDGSGTSGYYVSD 194

Query: 200 TLN----LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDK 244
           T+     + N    N     + GCS         + R   GI GFG+ + S+ SQLN   
Sbjct: 195 TMYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLG 254

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
            S  + SH    +     +++       +    GL YTP V  PS         +Y + L
Sbjct: 255 VSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLNL 300

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
             I V GQ++ +     T       GTIVDSGTT  ++A   ++P  +   + +  +   
Sbjct: 301 ESIVVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPS--- 355

Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
            R+L ++       CF        SFP + L+F GG  +T+  ENY           L  
Sbjct: 356 VRSLVSKG----NQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWC 411

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +  +   G    ILG+  +++    YDL N R+G+    C
Sbjct: 412 IGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 451


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 127/464 (27%), Positives = 185/464 (39%), Gaps = 74/464 (15%)

Query: 23  PSSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRALHIK---------NPQTKT 69
           PS+   +T  L   H      PS     +L   +     RA +IK         + +   
Sbjct: 55  PSTSGGITVPLHHRHGPCSPVPSNKMPASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSD 114

Query: 70  TTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS 129
             T  TT  T++S+  Y    I++  G+P       +DTGS + W  C     C  C S 
Sbjct: 115 AATVPTTLGTSLSTLEY---VITVGIGSPAVTQTMSMDTGSDVSWVQCK---PCSQCHSE 168

Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
               F P  SS+     C +  C  +  +S Q   C      +S  C      Y+V Y  
Sbjct: 169 VDSLFDPSASSTYSPFSCSSAACVQLS-QSQQGNGC------SSSQC-----QYIVSYVD 216

Query: 190 GL-TEGIALSETLNLPNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL-- 242
           G  T G   S+TL L +  I  F  GCS   S     Q  G+ G G    SL SQ     
Sbjct: 217 GSSTTGTYSSDTLTLGSNAIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTF 276

Query: 243 -DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
              FSYCL        T  SS  L  G++      +G   TP + +  +        YY 
Sbjct: 277 GKAFSYCL------PPTPGSSGFLTLGAA----SRSGFVKTPMLRSTQIP------TYYG 320

Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
           V L  I VGGQ++ +     +       G+++DSGT  T + P  +  L+  F + M K 
Sbjct: 321 VLLEAIRVGGQQLNIPTSVFS------AGSVMDSGTVITRLPPTAYSALSSAFKAGMKKY 374

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
                   A+    L  CFD  G+ + S P + L F GGA V L   ++  ++ E    C
Sbjct: 375 PP------AQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVNL---DFNGIMLELDNWC 425

Query: 422 LTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           L    + + S   S+  +GN Q + + V YD+    +GF+   C
Sbjct: 426 LAFAANSDDS---SLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 153/384 (39%), Gaps = 52/384 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + +  GTP Q +   +DT S + W PC+    C  C S+   +F P  S+S + + C 
Sbjct: 99  YIVKVLIGTPAQPLLLAMDTSSDVAWIPCSG---CVGCPSNT--AFSPAKSTSFKNVSCS 153

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P+C  + + +   R C                S+ + YGS         +T+ L    I
Sbjct: 154 APQCKQVPNPACGARAC----------------SFNLTYGSSSIAANLSQDTIRLAADPI 197

Query: 209 PNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
             F  GC        ++   +   G+        S    +    FSYCL S  F   T +
Sbjct: 198 KAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPS--FRSLTFS 255

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            SL L   S     K     YT  + NP    R++    YYV L  I VG + V +    
Sbjct: 256 GSLRLGPTSQPQRVK-----YTQLLRNP---RRSSL---YYVNLVAIRVGRKVVDLPPAA 304

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           +  +     GTI DSGT +T +A  ++E + +EF  ++        +LG          F
Sbjct: 305 IAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGG---------F 355

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           D         P +   FKG   +T+P +N       GS  CL + +  E       ++ +
Sbjct: 356 DTCYSGQVKVPTITFMFKG-VNMTMPADNLMLHSTAGSTSCLAMASAPENVNSVVNVIAS 414

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q QN+ V  D+ N RLG  ++ C
Sbjct: 415 MQQQNHRVLIDVPNGRLGLARERC 438


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 151/384 (39%), Gaps = 52/384 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTP Q +   +DT S + W PC+    C  C S+   +F P  S+S + + C 
Sbjct: 99  YIVKALIGTPAQPLLLAMDTSSDVAWIPCSG---CVGCPSNT--AFSPAKSTSFKNVSCS 153

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P+C  + + +   R C                S+ + YGS         +T+ L    I
Sbjct: 154 APQCKQVPNPTCGARAC----------------SFNLTYGSSSIAANLSQDTIRLAADPI 197

Query: 209 PNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
             F  GC        ++   +   G+        S    +    FSYCL S  F   T +
Sbjct: 198 KAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPS--FRSLTFS 255

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            SL L   S     K     YT  + NP    R++    YYV L  I VG + V +    
Sbjct: 256 GSLRLGPTSQPQRVK-----YTQLLRNP---RRSSL---YYVNLVAIRVGRKVVDLPPAA 304

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           +  +     GTI DSGT +T +A  ++E + +EF  ++        +LG          F
Sbjct: 305 IAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGG---------F 355

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           D         P +   FK G  +T+P +N       GS  CL +    E       ++ +
Sbjct: 356 DTCYSGQVKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIAS 414

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q QN+ V  D+ N RLG  ++ C
Sbjct: 415 MQQQNHRVLIDVPNGRLGLARERC 438


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 159/387 (41%), Gaps = 58/387 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTPPQ +   +DT +   W PC     C  C +S  P F P  S+S R + C 
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAG---CAGCPTSSAPPFDPAASTSYRSVPCG 166

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P C+               P A      + C   L    S L   ++  ++L +    +
Sbjct: 167 SPLCA-------------QAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGDAV 212

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
             +  GC   +  ++  P G+ G GRG  S  SQ   +    FSYCL S K   F  T R
Sbjct: 213 KTYTFGCLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLR 272

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
               +  NG     K T      P + NP        S  YYV +  I VG + V +   
Sbjct: 273 ----LGRNGQPPRIKTT------PLLANPH------RSSLYYVNMTGIRVGRKVVPIPPP 316

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA--EALTGLR 377
            L  D     GT++DSGT FT +    +  + DE            R +GA   +L G  
Sbjct: 317 ALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEV----------RRRVGAPVSSLGGFD 366

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            CF+       ++P + L F G  +VTLP EN       G+  CL +    +       +
Sbjct: 367 TCFNT---TAVAWPPVTLLFDG-MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNV 422

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + + Q QN+ V +D+ N R+GF ++ C
Sbjct: 423 IASMQQQNHRVLFDVPNGRVGFARERC 449


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 172/394 (43%), Gaps = 58/394 (14%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           +++L+ G PPQ I  +LDTGS L W  C         S +    F P  SS+   + C +
Sbjct: 66  TVTLAVGDPPQNISMVLDTGSELSWLHCKK-------SPNLGSVFNPVSSSTYSPVPCSS 118

Query: 150 PKCSWIHHESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           P C        + RD    P+ A+    T +C   +    +   EG    ET  + +   
Sbjct: 119 PICR------TRTRDL---PIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR 169

Query: 209 PNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
           P  L GC  S LSS      +  G+ G  RG  S  +QL   KFSYC+        + +S
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI------SGSDSS 223

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
             +L   +S+S      + YTP V   S        V Y V L  I VG + + +     
Sbjct: 224 GFLLLGDASYS--WLGPIQYTPLVLQ-STPLPYFDRVAYTVQLEGIRVGSKILSLPKSVF 280

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNY----TRALGAE 371
             D  G G T+VDSGT FTF+   ++  L +EF++Q      +V + ++    T  L  +
Sbjct: 281 VPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYK 340

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGS-----AVCLTVV 425
             +  RP F          P + L F+ GAE+++  +   + V G GS       C T  
Sbjct: 341 VGSTTRPNFS-------GLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCFT-F 391

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
            + +  G  + ++G+   QN ++E+DL   R+GF
Sbjct: 392 GNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGF 425


>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 440

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 104/400 (26%), Positives = 166/400 (41%), Gaps = 48/400 (12%)

Query: 72  TTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI 131
           T +T    +  + + G Y + +  GTP Q++  +LDT +   + PC+    C  CS +  
Sbjct: 83  TVSTAPIASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSG---CTGCSDT-- 137

Query: 132 PSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL 191
            +F PK S+S   L C  P+C  +    + C      P   +  C     S+   Y    
Sbjct: 138 -TFSPKASTSYGPLDCSVPQCGQVR--GLSC------PATGTGAC-----SFNQSYAGSS 183

Query: 192 TEGIALSETLNLPNRIIPNFLVGC--SVLSSRQPAGIAGFGRGKTSLPSQLNLDK----F 245
                + ++L L   +IPN+  GC  ++  +  PA                +       F
Sbjct: 184 FSATLVQDSLRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIF 243

Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
           SYCL S  F     + SL L         +TT L  +P  + PS+         YYV   
Sbjct: 244 SYCLPS--FKSYYFSGSLKLGPVGQPKSIRTTPLLRSP--HRPSL---------YYVNFT 290

Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
            I+VG   V    +YL  + +   GTI+DSGT  T     ++  + +EF  Q V    +T
Sbjct: 291 GISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQ-VGGTTFT 349

Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
            ++GA        CF    E     P + LHF+G  ++ LP+EN       GS  CL + 
Sbjct: 350 -SIGA-----FDTCFVKTYETLA--PPITLHFEG-LDLKLPLENSLIHSSAGSLACLAMA 400

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
              +       ++ NFQ QN  + +D  N ++G  +++C 
Sbjct: 401 AAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAREVCN 440


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 103/373 (27%), Positives = 159/373 (42%), Gaps = 37/373 (9%)

Query: 102 IPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC-SWIHHESI 160
           +  I+DTGS L W  C     C  C + + P F P  S+S   + C    C + +   + 
Sbjct: 177 LTVIVDTGSDLTWVQCK---PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATG 233

Query: 161 QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLS 219
               C           ++ C  Y + YG G  + G+  ++T+ L    +  F+ GC  LS
Sbjct: 234 VPGSCATVGGGGGGGKSERC-YYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCG-LS 291

Query: 220 SRQ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
           +R      AG+ G GR + SL SQ        FSYCL +    D   + SL    G + S
Sbjct: 292 NRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSL---GGDTSS 348

Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
            +  T ++YT  + +P      A   +Y++ +   +VGG  V                 +
Sbjct: 349 YRNATPVSYTRMIADP------AQPPFYFMNVTGASVGGAAVAAAGLGAA-------NVL 395

Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
           +DSGT  T +AP ++  +  EF  Q        R   A   + L  C+++ G      P 
Sbjct: 396 LDSGTVITRLAPSVYRAVRAEFARQF----GAERYPAAPPFSLLDACYNLTGHDEVKVPL 451

Query: 393 LKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
           L L  +GGA++T+      F    +GS VCL + +       P  I+GN+Q +N  V YD
Sbjct: 452 LTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTP--IIGNYQQKNKRVVYD 509

Query: 452 LRNQRLGFKQQLC 464
               RLGF  + C
Sbjct: 510 TVGSRLGFADEDC 522


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 100/385 (25%), Positives = 150/385 (38%), Gaps = 62/385 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+P      ++D+GS +VW  C     C  C +   P F P  S+S   + 
Sbjct: 127 GEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCE---PCDQCYNQTDPIFNPATSASFIGVA 183

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C+ +  + + CR            C      Y V YG G  T+G    ET+ +  
Sbjct: 184 CSSNVCNQLD-DDVACR---------KGRC-----GYQVAYGDGSYTKGTLALETITIGR 228

Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
            +I +  +GC   +       AG+ G G G  S   QL       F YCL+S        
Sbjct: 229 TVIQDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAM----- 283

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
                             G  + P ++NP       +  +YYV L  + VGG RV +  +
Sbjct: 284 ----------------PVGAMWVPLIHNP------FYPSFYYVSLSGLAVGGIRVPISEQ 321

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L   G GG ++D+GT  T +    +    D F++Q     N  RA G         C
Sbjct: 322 IFQLTDIGTGGVVMDTGTAITRLPTVAYNAFRDAFIAQTT---NLPRAPGVSIFD---TC 375

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           +D+ G  T   P +  +F GG  +T P  N+     +    C        +  G SII G
Sbjct: 376 YDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPADDVGTFCFAFA---PSPSGLSII-G 431

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q +   V  D  N  +GF   +C
Sbjct: 432 NIQQEGIQVSIDGTNGFVGFGPNVC 456


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 109/451 (24%), Positives = 176/451 (39%), Gaps = 63/451 (13%)

Query: 34  SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
           SR   NPS  S    +    S+     H KNP    ++TTT           +G Y  S+
Sbjct: 52  SRVKANPSPSSAAQKSLFPYSAHIFQQHTKNPAALRSSTTTL-------GRKFGEYYTSI 104

Query: 94  SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
             G+P Q    I+DTGS L W  C     CK C+ S    +    S+S R + C N +  
Sbjct: 105 KLGSPGQEAILIVDTGSELTWLQC---LPCKVCAPSVDTIYDAARSASYRPVTCNNSQL- 160

Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-LTEGIALSETLNLPNRI---- 207
                      C++    T   C +     +   YG G  + G   ++TL +   +    
Sbjct: 161 -----------CSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKP 209

Query: 208 --IPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
             + +F  GC+     L     +GI G   GK +LP QL      KFS+C    +     
Sbjct: 210 VTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCF-PDRSSHLN 268

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            T  +   N     ++    + YT      S  +R     +Y+V L+ +++        H
Sbjct: 269 STGVVFFGNAELPHEQ----VQYTSVALTNSELQRK----FYHVALKGVSINS------H 314

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT-RALGAEALTGLR 377
           + + L R      I+DSG++F+        P   +     +K+R  + + L  ++   L 
Sbjct: 315 ELVFLPR--GSVVILDSGSSFS----SFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLG 368

Query: 378 PCFDVPGEKTG----SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
            CF V  +       + P L L F+ G  + +P       V              +    
Sbjct: 369 TCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPN 428

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           P  ++GN+Q QN +VEYD++  R+GF +  C
Sbjct: 429 PVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 113/397 (28%), Positives = 174/397 (43%), Gaps = 64/397 (16%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           +++L+ G+PPQ I  +LDTGS L W  C         S +    F P  SS+   + C +
Sbjct: 62  TVTLAVGSPPQNISMVLDTGSELSWLHCKK-------SPNLGSVFNPVSSSTYSPVPCSS 114

Query: 150 PKCSWIHHESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           P C              D P+ A+    T  C   +    +   EG    +T  + +   
Sbjct: 115 PICR---------TRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTR 165

Query: 209 PNFLVGC--SVLSS-----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
           P  L GC  S LSS      +  G+ G  RG  S  +QL   KFSYC+        + +S
Sbjct: 166 PGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI------SGSDSS 219

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPS---VAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            ++L   +S+S      + YTP V   +     +R    V Y V L  I VG + + +  
Sbjct: 220 GILLLGDASYS--WLGPIQYTPLVLQTTPLPYFDR----VAYTVQLEGIRVGSKILSLPK 273

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNY----TRAL 368
                D  G G T+VDSGT FTF+   ++  L +EF++Q      +V + N+    T  L
Sbjct: 274 SVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDL 333

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGS-----AVCL 422
                +  RP F      TG  P + L F+ GAE+++  +   + V G GS       C 
Sbjct: 334 CYRVGSSTRPNF------TG-LPVISLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCF 385

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
           T   + +  G  + ++G+   QN ++E+DL   R+GF
Sbjct: 386 T-FGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGF 421


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 158/370 (42%), Gaps = 37/370 (10%)

Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC-SWIHHESIQCR 163
           I+DTGS L W  C     C  C + + P F P  S+S   + C    C + +   +    
Sbjct: 179 IVDTGSDLTWVQCK---PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 235

Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ 222
            C           ++ C  Y + YG G  + G+  ++T+ L    +  F+ GC  LS+R 
Sbjct: 236 SCATVGGGGGGGKSERC-YYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCG-LSNRG 293

Query: 223 ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
                AG+ G GR + SL SQ        FSYCL +    D   + SL    G + S + 
Sbjct: 294 LFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSL---GGDTSSYRN 350

Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
            T ++YT  + +P      A   +Y++ +   +VGG  V                 ++DS
Sbjct: 351 ATPVSYTRMIADP------AQPPFYFMNVTGASVGGAAVAAAGLGAA-------NVLLDS 397

Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKL 395
           GT  T +AP ++  +  EF  Q        R   A   + L  C+++ G      P L L
Sbjct: 398 GTVITRLAPSVYRAVRAEFARQF----GAERYPAAPPFSLLDACYNLTGHDEVKVPLLTL 453

Query: 396 HFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
             +GGA++T+      F    +GS VCL + +       P  I+GN+Q +N  V YD   
Sbjct: 454 RLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTP--IIGNYQQKNKRVVYDTVG 511

Query: 455 QRLGFKQQLC 464
            RLGF  + C
Sbjct: 512 SRLGFADEDC 521


>gi|383143511|gb|AFG53183.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
          Length = 135

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 56/153 (36%), Positives = 82/153 (53%), Gaps = 18/153 (11%)

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           YCL     D    +S +++ N +   D     LTYTP + NP       +  +YY+GL  
Sbjct: 1   YCL-----DYVNNSSKIVVGNKAVPGD---ISLTYTPLIINP------IYPFFYYLGLEA 46

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           +++G +R+ +     T D  GNGGTI+DSGT+FT     ++  +A EF SQ+     Y R
Sbjct: 47  VSIGRKRMNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQI----GYKR 102

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
             GAE+ TGL  C++V G +   FP+   HFKG
Sbjct: 103 VPGAESTTGLGLCYNVSGVENTQFPQFAFHFKG 135


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 111/394 (28%), Positives = 170/394 (43%), Gaps = 58/394 (14%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           +++L+ G PPQ I  +LDTGS L W  C         S +    F P  SS+   + C +
Sbjct: 66  TVTLAVGDPPQNISMVLDTGSELSWLHCKK-------SPNLGSVFNPVSSSTYSPVPCSS 118

Query: 150 PKCSWIHHESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           P C              D P+ A+    T +C   +    +   EG    ET  + +   
Sbjct: 119 PICR---------TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR 169

Query: 209 PNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
           P  L GC  S LSS      +  G+ G  RG  S  +QL   KFSYC+        + +S
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI------SGSDSS 223

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
             +L   +S+S      + YTP V   S        V Y V L  I VG + + +     
Sbjct: 224 VFLLLGDASYS--WLGPIQYTPLVLQ-STPLPYFDRVAYTVQLEGIRVGSKILSLPKSVF 280

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNY----TRALGAE 371
             D  G G T+VDSGT FTF+   ++  L +EF++Q      +V + ++    T  L  +
Sbjct: 281 VPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYK 340

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGS-----AVCLTVV 425
             +  RP F          P + L F+ GAE+++  +   + V G GS       C T  
Sbjct: 341 VGSTTRPNFS-------GLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCFT-F 391

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
            + +  G  + ++G+   QN ++E+DL   R+GF
Sbjct: 392 GNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGF 425


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 110/402 (27%), Positives = 174/402 (43%), Gaps = 63/402 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
           G Y   +  GTPP+ +   +DTGS ++W  C +   C   S  +I    F P  SS+S L
Sbjct: 75  GLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSL 134

Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
           + C + +C S +      C   N++       CT     Y   YG G  T G  +S+ ++
Sbjct: 135 ISCLDRRCRSGVQTSDASCSGRNNQ-------CT-----YTFQYGDGSGTSGYYVSDLMH 182

Query: 203 --------LPNRIIPNFLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
                   L      + + GCS+L       S R   GI GFG+   S+ SQL+    + 
Sbjct: 183 FASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAP 242

Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
            + SH    D +    L+L       +     + Y+P V  PS         +Y + L+ 
Sbjct: 243 RVFSHCLKGDNSGGGVLVL------GEIVEPNIVYSPLV--PS-------QPHYNLNLQS 287

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I+V GQ VR+           N GTIVDSGTT  ++A E + P      + + ++     
Sbjct: 288 ISVNGQIVRIAPSVFATSN--NRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVL 345

Query: 367 ALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFA---VVGEGSAVCL 422
           + G +       C+ +        FP++ L+F GGA + L  ++Y      +GEGS  C+
Sbjct: 346 SRGNQ-------CYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWCI 398

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                ++ SG    ILG+  +++    YDL  QR+G+    C
Sbjct: 399 GF---QKISGQSITILGDLVLKDKIFVYDLAGQRIGWANYDC 437


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 115/393 (29%), Positives = 154/393 (39%), Gaps = 77/393 (19%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
           +   G Y  S+  GTPP     +LDTGS +VW  C    QC Y  S ++  F P+ S S 
Sbjct: 136 AQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQC-YAQSGRV--FDPRRSRSY 192

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
             + C  P C  +                    C      Y V YG G +T G   +ETL
Sbjct: 193 AAVRCGAPPCRGLDAGGGG------GCDRRRGTCL-----YQVAYGDGSVTAGDLATETL 241

Query: 202 NLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
                  +P   VGC   +       AG+ G GRG+ SLP+Q       +FSYC      
Sbjct: 242 WFARGARVPRVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQGSDL 301

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
           D  T     I+     H                                     VGG RV
Sbjct: 302 DHRT-----IIRTVHQH-------------------------------------VGGARV 319

Query: 315 R-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R V  + L LD   G GG I+DSGT+ T +A  ++  + + F +     R     L    
Sbjct: 320 RGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLR-----LAPGG 374

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREAS 431
            +    C+D+ G +    P + +H  GGAEV LP ENY   V      CL +  TD    
Sbjct: 375 FSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTD---- 430

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GG SI+ GN Q Q + V +D   QR+    + C
Sbjct: 431 GGVSIV-GNIQQQGFRVVFDGDRQRVALVPKSC 462


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 117/478 (24%), Positives = 199/478 (41%), Gaps = 68/478 (14%)

Query: 7   ALCLSFIFFFTLLSIFPSSITS--LTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRAL 60
           A   S +F  +  S+F S++T+    F++   H +    P  +S +     + ++L R+ 
Sbjct: 2   APVFSLLFLISTASVF-SAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
           H      + T    + T      ++ G Y + +S GTPP  I  + DTGS ++W  C   
Sbjct: 61  H------RNTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCK-- 112

Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
             C  C     P F P  S++ + + C +P CS+    S     C+D+       C    
Sbjct: 113 -PCSNCYQQNAPMFDPSKSTTYKNVACSSPVCSY----SGDGSSCSDD-----SECL--- 159

Query: 181 PSYLVLYGSG-------LTEGIALSETLNLPNRIIPNFLVGCSVLSS----RQPAGIAGF 229
             Y + YG           + + +  T   P    P  ++GC   ++       +GI G 
Sbjct: 160 --YSIAYGDDSHSQGNLAVDTVTMQSTSGRP-VAFPRTVIGCGHDNAGTFNANVSGIVGL 216

Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
           GRG  SL +QL      KFSYCL+      T  ++ L   N  S+++   +G   TP   
Sbjct: 217 GRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKL---NFGSNANVSGSGTVSTP--- 270

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
              +     +  +Y + L  ++VG  +         L  + N   I+DSGTT T++   L
Sbjct: 271 ---IYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGESN--IIIDSGTTLTYLPSAL 325

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
                + F S + ++ +   A        L  CF    +     P + +HF+ GA+V L 
Sbjct: 326 L----NSFGSAISQSMSLPHAQDPSEF--LDYCFATTTDDY-EMPPVTMHFE-GADVPLQ 377

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            EN F  + + + +CL   +  + +     I GN    N+ V YD++N  + F+   C
Sbjct: 378 RENLFVRLSDDT-ICLAFGSFPDDN---IFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 103/398 (25%), Positives = 169/398 (42%), Gaps = 56/398 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  GTPP      +DTGS ++W  C +   C   S  +I    F P  SS+S +
Sbjct: 76  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSM 135

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
           + C + +C    +   Q  D      AT  +    C SY   YG G  T G  +S+ ++L
Sbjct: 136 IACSDQRC----NNGKQSSD------ATCSSQNNQC-SYTFQYGDGSGTSGYYVSDMMHL 184

Query: 204 ---------PNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
                     N   P  + GCS         S R   GI GFG+ + S+ SQL+    + 
Sbjct: 185 NTIFEGSMTTNSTAP-VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAP 243

Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
            + SH    D++    L+L       +     + YT  V  P+         +Y + L+ 
Sbjct: 244 RIFSHCLKGDSSGGGILVL------GEIVEPNIVYTSLV--PA-------QPHYNLNLQS 288

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I+V GQ +++           + GTIVDSGTT  ++A E ++P      + + ++     
Sbjct: 289 ISVNGQTLQIDSSVFATSN--SRGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVV 346

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
           + G +       C+ +    T  FP++ L+F GGA + L  ++Y           +  + 
Sbjct: 347 SRGNQ-------CYLITSSVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIG 399

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ++  G    ILG+  +++  V YDL  QR+G+    C
Sbjct: 400 FQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDC 437


>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 108/432 (25%), Positives = 179/432 (41%), Gaps = 51/432 (11%)

Query: 40  PSQDSYQN-LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           P  D++ N + ++ S    R  ++    ++ T +T    +    + + G Y + +  GTP
Sbjct: 51  PKADTWDNRIINMASKDPVRVKYLSTLVSQKTVSTAPIASGQ--AFNIGNYVVRVKLGTP 108

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
            Q++  +LDT +   + PC+    C  CS +   +F PK S+S   L C  P+C  +   
Sbjct: 109 GQLLFMVLDTSTDEAFVPCSG---CTGCSDT---TFSPKASTSYGPLDCSVPQCGQVR-- 160

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC--S 216
            + C      P   +  C     S+   Y         + + L L   +IP +  GC  +
Sbjct: 161 GLSC------PATGTGAC-----SFNQSYAGSSFSATLVQDALRLATDVIPYYSFGCVNA 209

Query: 217 VLSSRQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDTTRTSSLILDNGSSHS 272
           +  +  PA                +       FSYCL S  F     + SL L       
Sbjct: 210 ITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPS--FKSYYFSGSLKLGPVGQPK 267

Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
             +TT L  +P  + PS+         YYV    I+VG   V    +YL  + +   GTI
Sbjct: 268 SIRTTPLLRSP--HRPSL---------YYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTI 316

Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
           +DSGT  T     ++  + +EF  Q V    +T ++GA        CF    E     P 
Sbjct: 317 IDSGTVITRFVEPVYNAVREEFRKQ-VGGTTFT-SIGA-----FDTCFVKTYETLA--PP 367

Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDL 452
           + LHF+G  ++ LP+EN       GS  CL +    +       ++ NFQ QN  + +D+
Sbjct: 368 ITLHFEG-LDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDI 426

Query: 453 RNQRLGFKQQLC 464
            N ++G  +++C
Sbjct: 427 VNNKVGIAREVC 438


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 103/398 (25%), Positives = 164/398 (41%), Gaps = 59/398 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
           Y   +  G+PP+     +DTGS ++W  C+    C  C SS     ++  F P  SS+S 
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACS---PCTGCPSSSGLNIQLEFFNPDTSSTSS 173

Query: 144 LLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETL 201
            + C + +C + +      C+  ++ P             Y   YG G  T G  +S+T+
Sbjct: 174 KIPCSDDRCTAALQTSEAVCQTSDNSPCG-----------YTFTYGDGSGTSGYYVSDTM 222

Query: 202 N----LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFS 246
                + N    N     + GCS         + R   GI GFG+ + S+ SQLN    S
Sbjct: 223 YFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVS 282

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
             + SH    +     +++       +    GL YTP V  PS         +Y + L  
Sbjct: 283 PKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLNLES 328

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I V GQ++ +     T       GTIVDSGTT  ++A   ++P  +   + +  +    R
Sbjct: 329 IVVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPS---VR 383

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
           +L ++       CF        SFP + L+F GG  +T+  ENY           L  + 
Sbjct: 384 SLVSKG----NQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIG 439

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +   G    ILG+  +++    YDL N R+G+    C
Sbjct: 440 WQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 477


>gi|383143501|gb|AFG53178.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143503|gb|AFG53179.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143507|gb|AFG53181.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143509|gb|AFG53182.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143517|gb|AFG53186.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143519|gb|AFG53187.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
          Length = 135

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 56/153 (36%), Positives = 82/153 (53%), Gaps = 18/153 (11%)

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           YCL     D    +S +++ N +   D     LTYTP + NP       +  +YY+GL  
Sbjct: 1   YCL-----DYVNNSSKIVVGNKAVPGD---ISLTYTPLIINP------IYPFFYYLGLEA 46

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           +++G +R+ +     T D  GNGGTI+DSGT+FT     ++  +A EF SQ+     Y R
Sbjct: 47  VSIGRKRLNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQI----GYKR 102

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
             GAE+ TGL  C++V G +   FP+   HFKG
Sbjct: 103 VPGAESTTGLGLCYNVSGVENTQFPQFAFHFKG 135


>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
          Length = 193

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 71/210 (33%), Positives = 97/210 (46%), Gaps = 22/210 (10%)

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           D T+ S L+L    S  +   T    TP + NP          +YY+ L  I+VG  ++ 
Sbjct: 2   DDTKQSVLLL---GSLPNVNATKQVTTPLITNPLQPS------FYYISLEVISVGDTKLS 52

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +      +  DG+GG I+DSGTT T++    F+ L  EF SQ          +     TG
Sbjct: 53  IEQSTFEVSDDGSGGVIIDSGTTITYIEENAFDSLKKEFTSQT------KLPVDKSGSTG 106

Query: 376 LRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
           L  CF +P  KT    P+L  HFKGG ++ LP ENY          CL +     AS G 
Sbjct: 107 LDVCFSLPSGKTEVEIPKLVFHFKGG-DLELPGENYMIADSSLGVACLAM----GASNGM 161

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           S I GN Q QN  V +DL+ + + F    C
Sbjct: 162 S-IFGNIQQQNILVNHDLQKETITFIPTQC 190


>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 437

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 110/394 (27%), Positives = 157/394 (39%), Gaps = 66/394 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTP Q +  +LDT +   W PC+    C  CSS+   +       S   L 
Sbjct: 95  GNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSG---CTGCSSTTFSTNTSSTYGS---LD 148

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE-TLNLPN 205
           C   +C+ +   S         P   S +C      +   YG   +    L E +L L N
Sbjct: 149 CSMAQCTQVRGFSC--------PATGSSSCV-----FNQSYGGDSSFSATLVEDSLRLVN 195

Query: 206 RIIPNFLVGC--SVLSSRQP------------AGIAGFGRGKTSLPSQLNLDKFSYCLLS 251
            +IPNF  GC  S+     P            + IA  G   + L        FSYCL S
Sbjct: 196 DVIPNFAFGCINSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGL--------FSYCLPS 247

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
             F     + SL L  G +   K    + YTP + NP    R +    YYV L  ++VG 
Sbjct: 248 --FKSYYFSGSLKL--GPAGQPKS---IRYTPLLRNP---HRPSL---YYVNLTGVSVGR 294

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
             V +  + L  + +   GTI+DSGT  T     ++  + DEF  Q+        A    
Sbjct: 295 TLVPIAPELLAFNPNTGAGTIIDSGTVITRFVQPIYTAIRDEFRKQV--------AGPFS 346

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
           +L     CF    E     P + LHF G   + LP+EN       GS  CL +       
Sbjct: 347 SLGAFDTCFAATNEAVA--PAVTLHFTG-LNLVLPMENSLIHSSAGSLACLAMAAAPNNV 403

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                ++ N Q QN  + +D+ N RLG  ++LC 
Sbjct: 404 NSVLNVIANLQQQNLRLLFDVPNSRLGIARELCN 437


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/388 (25%), Positives = 158/388 (40%), Gaps = 74/388 (19%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L  GTPP  I  +LDTGS  +W  C     C +C +   P F P  SS+ + + C 
Sbjct: 65  YLMKLQIGTPPFEIEAVLDTGSEHIWTQC---LPCVHCYNQTAPIFDPSKSSTFKEIRC- 120

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
                          D +D            CP  LV  G   T+G  ++ET+ + +   
Sbjct: 121 ---------------DTHDHS----------CPYELVYGGKSYTKGTLVTETVTIHSTSG 155

Query: 207 ---IIPNFLVGCSVLSSR-QP--AGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDT 257
              ++P  ++GC   +S  +P  AG+ G  RG  SL +Q+  +     SYC         
Sbjct: 156 QPFVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAG------ 209

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-V 316
                     G+S  +     +     V + +V  + A   +YY+ L  ++VG  R+  V
Sbjct: 210 ---------KGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 260

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
              +  L     G  ++DSG+T T+  PE +  L  + V Q+V    + R   ++ L   
Sbjct: 261 GTPFHAL----KGNIVIDSGSTLTYF-PESYCNLVRKAVEQVVTAVRFPR---SDILCYY 312

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
               D+       FP + +HF GGA++ L   N +     G   CL ++ +         
Sbjct: 313 SKTIDI-------FPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIE---EA 362

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I GN    N+ V YD  +  + FK   C
Sbjct: 363 IFGNRAQNNFLVGYDSSSLLVSFKPTNC 390


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 111/389 (28%), Positives = 158/389 (40%), Gaps = 52/389 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTP Q +   +DT S + W PC     C  CSS+    F    S++ + LGCQ
Sbjct: 101 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNG---CLGCSSTL---FNSPASTTYKSLGCQ 154

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQ-------ICPSYLVLYGSGLTEGIALSETL 201
             +C  + H           PL TS +          +C   L   GS L   ++  +T+
Sbjct: 155 AAQCKQVLHLL--------SPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLAANLS-QDTI 205

Query: 202 NLPNRIIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
            L    +P +  GC        L ++   G+        S    L    FSYCL S  F 
Sbjct: 206 TLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FK 263

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
               + SL L  G     K+   + YTP + NP    R +    Y+V L  + VG + V 
Sbjct: 264 SLNFSGSLRL--GPVGQPKR---IKYTPLLKNP---RRPSL---YFVNLMAVRVGRRVVD 312

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           V     T +     GTI DSGT FT +    +  + D F     +NR   R L   +L G
Sbjct: 313 VPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAF-----RNR-VGRNLTVTSLGG 366

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
              C+ VP       P +   F G   VTLP +N       GS  CL +    +      
Sbjct: 367 FDTCYTVPIAA----PTITFMFTG-MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVL 421

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ++ N Q QN+ + YD+ N RLG  ++LC
Sbjct: 422 NVIANLQQQNHRLLYDVPNSRLGVARELC 450


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 113/397 (28%), Positives = 164/397 (41%), Gaps = 64/397 (16%)

Query: 89  YSISLSFGTPP-QIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           Y I++  G+PP +    ++DTGS + W  C   +Q   C     P F P LSS+     C
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQ--QCRPQVDPLFDPSLSSTYSPFSC 197

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLPN 205
            +  C+ +  E       N    ++S  C      Y+ +YG G   T G   S+TL L +
Sbjct: 198 SSAACAQLFQEG------NANGCSSSGQC-----QYIAMYGDGSVGTTGTYSSDTLALGS 246

Query: 206 R----IIPNFLVGCSVLSSRQPAGIAGFGRGKT-------SLPSQ----LNLDKFSYCLL 250
                ++  F  GCS        GI G   G         SL SQ         FSYCL 
Sbjct: 247 NSNTVVVSKFRFGCS----HAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCL- 301

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
                  T +SS  L  G++ +   + G   TP + +  V        +Y V L  I VG
Sbjct: 302 -----PPTPSSSGFLTLGAAGT--SSAGFVKTPMLRSSQVP------AFYGVRLEAIRVG 348

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           G+++ +           + G I+DSGT  T + P  +  L+  F + M   + Y  A  +
Sbjct: 349 GRQLSIPTTVF------SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGM---KQYPPAPSS 399

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHF--KGGAEVTLPVENYFAVVGEGSAVCLT-VVTD 427
                L  CFD+ G+ + S P + L F   GGA V L        +   S  CL  V T 
Sbjct: 400 AGGGFLDTCFDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATS 459

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + S G   I+GN Q + + V YD+    +GFK   C
Sbjct: 460 DDGSTG---IIGNVQQRTFQVLYDVAGGAVGFKAGAC 493


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 103/387 (26%), Positives = 164/387 (42%), Gaps = 39/387 (10%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + L  GTPP  I   +DTGS+++W PC N   CK C +     F P  SS+ +   
Sbjct: 96  GNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCIN---CKDCFNQSSSIFNPLASSTYQDAP 152

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C + +C      S  C+  N    +  +     CP+     G    + + L+ +   P  
Sbjct: 153 CDSYQC---ETTSSSCQSDNVCLYSCDEKHQLNCPN-----GRIAVDTMTLTSSDGRPFP 204

Query: 207 I-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
           +   +F+ G S+  +    G+ G GRG  SL S+   L+  KFSYCL  +     ++  +
Sbjct: 205 LPYSDFVCGNSIYKTFAGVGVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQPSKI-N 263

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
             L +  S  D +    T             +  S  YYV L  I+VG +R  +++    
Sbjct: 264 FGLQSFISDDDLEVVSTTLG----------HHRHSGNYYVTLEGISVGEKRQDLYYVDDP 313

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN-----RNYTRALGAEALTGLR 377
                 G  ++DSGT FT +  + ++ L       + +N      N       +    L 
Sbjct: 314 F-APPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLS 372

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
           PCF    E    FP++ +HF   A+V L  +N F  V E   VC      +    G S +
Sbjct: 373 PCFWYYPEL--KFPKITIHFT-DADVELSDDNSFIRVAE-DVVCFAFAATQP---GQSTV 425

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            G++Q  N+ + YDL+   + FK+  C
Sbjct: 426 YGSWQQMNFILGYDLKRGTVSFKRTDC 452


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 113/396 (28%), Positives = 168/396 (42%), Gaps = 70/396 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSS 142
           G Y + ++ GTP   +   LDTGS + W       QC+ C  S        F P+ SSS 
Sbjct: 43  GNYLVKMALGTPKLSLSLALDTGSDITW------TQCEPCVGSCYRQAQTKFDPRKSSSY 96

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
           + + C +  C  I  +S   R C       S  C      Y V YG G  + G   +E L
Sbjct: 97  KNVSCSSSSCRIIT-DSGGARGC------VSSTCI-----YKVQYGDGSYSVGFFATEKL 144

Query: 202 NL-PNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLP------------SQLNLDKFSYC 248
            + P+ +I NFL GC     +Q AG   FGR    L             S+   + F+YC
Sbjct: 145 TISPSDVISNFLFGCG----QQNAGR--FGRIAGLLGLGRGKLSLALQTSEKYNNLFTYC 198

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           L S     ++ T  L L  G      K T L+   F N P          +Y + ++ ++
Sbjct: 199 LPSFS---SSSTGHLTL-GGQVPKSVKFTPLS-PAFKNTP----------FYGIDIKGLS 243

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           VGG  + +     +     N G I+DSGT  T + P ++  L+ +F  Q++K+   T   
Sbjct: 244 VGGHVLPIDASVFS-----NAGAIIDSGTVITRLQPTVYSALSSKF-QQLMKDYPKT--- 294

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
             +  + L  C+D  G ++ S P +   FKGG EV +       V+     VCL    + 
Sbjct: 295 --DGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPND 352

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +   G  ++ GN Q Q Y V +DL   R+GF    C
Sbjct: 353 DD--GDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 107/394 (27%), Positives = 162/394 (41%), Gaps = 61/394 (15%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y+  L  GTPPQ    I+DTGS + + PC++   C+ C   + P F P LSS+ R 
Sbjct: 73  SNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSS---CEQCGKHQDPRFQPDLSSTYRP 129

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS---ETL 201
           + C NP C           +C+DE     K CT       +   SG+     +S   E+ 
Sbjct: 130 VKC-NPSC-----------NCDDE----GKQCTYERRYAEMSSSSGVIAEDVVSFGNESE 173

Query: 202 NLPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
             P R +     GC       L S++  GI G GRG+ S+  QL              D 
Sbjct: 174 LKPQRAV----FGCENVETGDLYSQRADGIMGLGRGRLSVVDQL-------------VDK 216

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVR 315
                S  L  G          L       N   +  N + S YY + L+ + V G+ ++
Sbjct: 217 GVIGDSFSLCYGGMDVGGGAMVLGQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLK 276

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +  K      D   GT++DSGTT+ +     F  L D  + ++   R+  +  G +    
Sbjct: 277 LKPKVF----DEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEI---RHLKQIPGPDP-NY 328

Query: 376 LRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREA 430
              CF   G +       FPE+ + F  G +++L  ENY F       A CL +  +   
Sbjct: 329 HDICFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQN--- 385

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               + +LG   ++N  V YD  N ++GF +  C
Sbjct: 386 GNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNC 419


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/388 (25%), Positives = 157/388 (40%), Gaps = 74/388 (19%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L  GTPP  I  +LDTGS  +W  C     C +C +   P F P  SS+ + + C 
Sbjct: 59  YLMKLQIGTPPFEIEAVLDTGSEHIWTQC---LPCVHCYNQTAPIFDPSKSSTFKEIRCD 115

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
                  H  S                    CP  LV  G   T+G  ++ET+ + +   
Sbjct: 116 T------HDHS--------------------CPYELVYGGKSYTKGTLVTETVTIHSTSG 149

Query: 207 ---IIPNFLVGCSVLSSR-QP--AGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDT 257
              ++P  ++GC   +S  +P  AG+ G  RG  SL +Q+  +     SYC         
Sbjct: 150 QPFVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAG------ 203

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-V 316
                     G+S  +     +     V + +V  + A   +YY+ L  ++VG  R+  V
Sbjct: 204 ---------KGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 254

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
              +  L     G  ++DSG+T T+  PE +  L  + V Q+V    + R   ++ L   
Sbjct: 255 GTPFHAL----KGNIVIDSGSTLTYF-PESYCNLVRKAVEQVVTAVRFPR---SDILCYY 306

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
               D+       FP + +HF GGA++ L   N +     G   CL ++ +         
Sbjct: 307 SKTIDI-------FPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIE---EA 356

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I GN    N+ V YD  +  + FK   C
Sbjct: 357 IFGNRAQNNFLVGYDSSSLLVSFKPTNC 384


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 111/418 (26%), Positives = 171/418 (40%), Gaps = 81/418 (19%)

Query: 80  NISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK--------- 130
           N SS S   Y   +  G P Q +  I+DTGS ++WF C     C+ CSS K         
Sbjct: 79  NGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCK---LCQGCSSKKNVIVCSSII 135

Query: 131 ----IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
               I  + P+LS ++    C +P CS    E   CR  N+            C   +  
Sbjct: 136 MQGPITLYDPELSITASPATCSDPLCS----EGGSCRGNNNS-----------CAYDISY 180

Query: 187 YGSGLTEGIALSETLNLPNRIIPN--FLVGCSV-LSSRQPA-GIAGFGRGKTSLPSQLNL 242
             +  + GI   + ++L ++   N    +GC+  +S   P  GI GFGR K S+P+QL  
Sbjct: 181 EDTSSSTGIYFRDVVHLGHKASLNTTMFLGCATSISGLWPVDGIMGFGRSKVSVPNQLAA 240

Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
              SY +  H          +++       + +   + YTP + N          + Y V
Sbjct: 241 QAGSYNIFYHCLSGEKEGGGILVLG----KNDEFPEMVYTPMLAN---------DIVYNV 287

Query: 303 GLRRITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGT---TFTFMAPELFEPLADEFVSQM 358
            L  ++V  + + +       +   GNGGTI+DSGT   TF   A  LF     +F + +
Sbjct: 288 KLVSLSVNSKALPIEASEFEYNATVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAI 347

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYF-AVV 414
                      A   +   PCF    ++      FP + L F GGA + L   NY  AVV
Sbjct: 348 PT---------APLESSGSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEAVV 398

Query: 415 GEGSA----------VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
               +          VC++       S G S ILG+  +++  V YD+   R+G+ +Q
Sbjct: 399 SRKLSESTHFQGVRLVCIS------WSVGNSTILGDAILKDKVVVYDMEKSRIGWVKQ 450


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 126/480 (26%), Positives = 193/480 (40%), Gaps = 85/480 (17%)

Query: 13  IFFFTLLSIFPSSITSLT-----FSLSRFHTNP--------SQDSYQNLNSLVSSSLTRA 59
           IFF  +L +   S T++      F+ S FH +         S   Y  L +    SL+R+
Sbjct: 7   IFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRS 66

Query: 60  LHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
             + N   +  T         ++  S G Y +S+S GTPP     + DTGS L+W  C  
Sbjct: 67  ATLLN---RAATNGALDLQAPLTPGS-GEYLMSVSIGTPPVDYIGMADTGSDLMWAQC-- 120

Query: 120 HYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQI 179
              C  C     P F P  S+S   + C +  C  I       +   D            
Sbjct: 121 -LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCD------------ 167

Query: 180 CPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCS---VLSSRQPAGIAGFGRGKTS 235
              Y   YG    T+G    E + + +  + + ++GC           +G+ G G G+ S
Sbjct: 168 ---YSYTYGDQTYTKGDLGFEKITIGSSSVKS-VIGCGHESGGGFGFASGVIGLGGGQLS 223

Query: 236 LPSQLNLD-----KFSYC---LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
           L SQ++       +FSYC   LLSH         + ++            G+  TP ++ 
Sbjct: 224 LVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSG---------PGVVSTPLISK 274

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
             V        YYYV L  I++G +R      ++   + GN   I+DSGTT +F+  EL+
Sbjct: 275 NPV-------TYYYVTLEAISIGNER------HMASAKQGN--VIIDSGTTLSFLPKELY 319

Query: 348 EPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVT- 404
               D  VS ++K     R         L  CFD  +    +   P +   F GGA V  
Sbjct: 320 ----DGVVSSLLKVVKAKRVKDPGNFWDL--CFDDGINVATSSGIPIITAQFSGGANVNL 373

Query: 405 LPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           LPV  +  V    + + LT  +  +  G    I+GN  + N+ + YDL  +RL FK  +C
Sbjct: 374 LPVNTFQKVANNVNCLTLTPASPTDEFG----IIGNLALANFLIGYDLEAKRLSFKPTVC 429


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 160/382 (41%), Gaps = 50/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTPPQ +   +DT +   W PC+    C  C ++    F P  S S R + C 
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSG---CAGCPTTT--PFNPAASKSYRAVPCG 162

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P CS         R  N      +K+C      + + Y     E     ++L + N ++
Sbjct: 163 SPACS---------RAPNPSCSLNTKSC-----GFSLTYADSSLEAALSQDSLAVANDVV 208

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
            ++  GC   +  ++  P G+ G GRG  S  SQ   +    FSYCL S  F     + +
Sbjct: 209 KSYTFGCLQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPS--FKSLNFSGT 266

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L         KTT     P + NP    R++    YYV +  I VG + V +    L 
Sbjct: 267 LRLGRKGQPLRIKTT-----PLLVNP---HRSSL---YYVSMTGIRVGKKVVPIPPAALA 315

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            D     GT++DSGT FT +    +  + DE        R   R     +L G   C++ 
Sbjct: 316 FDPATGAGTVLDSGTMFTRLVAPAYVAVRDEV-------RRRIRGAPLSSLGGFDTCYNT 368

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
               T  +P +   F G  +VTLP +N       G+  CL +    +       ++ + Q
Sbjct: 369 ----TVKWPPVTFMFTG-MQVTLPADNLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQ 423

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ + +D+ N R+GF ++ C
Sbjct: 424 QQNHRILFDVPNGRVGFAREQC 445


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 161/389 (41%), Gaps = 57/389 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           + +++ FG+P Q     +DTGS + W    PC+ H     C     P F P  S++   +
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGH-----CYKQHDPVFDPTKSATYSAV 215

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLP 204
            C +P+C+    +            + S  C      Y V YG G  T G+   ETL+L 
Sbjct: 216 PCGHPQCAAAGGK-----------CSNSGTCL-----YKVTYGDGSSTAGVLSHETLSLS 259

Query: 205 N-RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
           + R +P F  GC   +  +     G+ G GRG  SLPSQ        FSYCL S+   DT
Sbjct: 260 STRDLPGFAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSY---DT 316

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           T    L + + +  +      + YT  +      ++  +   Y+V +  I +GG  + V 
Sbjct: 317 TH-GYLTMGSTTPAASNDDDDVQYTAMI------QKEDYPSLYFVEVVSIDIGGYILPVP 369

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
               T D     GT+ DSGT  T++ PE +  L D F   M + +       A A     
Sbjct: 370 PTVFTRD-----GTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKP------APAYDPFD 418

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAV-CLTVVTDREASGGPS 435
            C+D  G      P +   F  GA   L PV           A  CL  V     S  P 
Sbjct: 419 TCYDFTGHNAIFMPAVAFKFSDGAVFDLSPVAILIYPDDTAPATGCLAFVP--RPSTMPF 476

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q +   V YD+  +++GF Q  C
Sbjct: 477 NIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 130/464 (28%), Positives = 189/464 (40%), Gaps = 77/464 (16%)

Query: 23  PSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRA-LHIKN-PQTKTTTTTTTTTTTN 80
           P   TS  F+L R HT       +++ +  S  +    LH K+ P         TT   +
Sbjct: 23  PKQCTSYRFTL-RLHT-------KSIKTKESPKIKPGYLHSKSTPAPSRLDNLWTTEIAD 74

Query: 81  ISSH-----SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFI 135
           I SH     +   +  ++S G PP     ++DTGS L W  C     CK C    IP F 
Sbjct: 75  IVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQC---LPCK-CYPQTIPFFH 130

Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGI 195
           P  SS+ R     N  C    H   Q     DE        T  C  +L       T GI
Sbjct: 131 PSRSSTYR-----NASCESAPHAMPQI--FRDEK-------TGNCRYHLRYRDFSNTRGI 176

Query: 196 ALSETLNLPNR---II--PNFLVGCSVLSS--RQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
              E L        +I  PN + GC   +S   Q +G+ G G G  S+ ++    KFSYC
Sbjct: 177 LAKEKLTFQTSDEGLISKPNIVFGCGQDNSGFTQYSGVLGLGPGTFSIVTRNFGSKFSYC 236

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
             S   D T   + LIL NG+      T                   F   YY+ L+ I+
Sbjct: 237 FGS-LIDPTYPHNFLILGNGARIEGDPT---------------PLQIFQDRYYLDLQAIS 280

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF------VSQMVKN- 361
           +G + + +        R   GGT++D+G + T +A E +E L++E       V + VK+ 
Sbjct: 281 LGEKLLDIEPGIFQRYR-SKGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDW 339

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
             YT       L       D+ G     FP +  HF GGAE+ L VE+ F     G + C
Sbjct: 340 EQYTNHCYEGNLK-----LDLYG-----FPVVTFHFAGGAELALDVESLFVSSESGDSFC 389

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           L +  +   +     ++G    QNY V Y+LR  ++ F++  C+
Sbjct: 390 LAMTMN---TFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCE 430


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 113/401 (28%), Positives = 168/401 (41%), Gaps = 72/401 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
           G Y   +  GTPPQ     +DTGS + W  C     CK  S+  +P   F P+ S+S   
Sbjct: 46  GLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKTS 105

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNC---TQICPSYLVLYGSG-LTEGIALSET 200
           + C + +C                 LA++  C   +  CP Y  LYG G  T G  +++ 
Sbjct: 106 ISCTDEECY----------------LASNSKCSFNSMSCP-YSTLYGDGSSTAGYLINDV 148

Query: 201 LNLPNRIIPN-----------FLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
           L+       N           F  G +   +    G+ GFG+ + SLPSQL+    S  +
Sbjct: 149 LSFNQVPSGNSTATSGTARLTFGCGSNQTGTWLTDGLVGFGQAEVSLPSQLSKQNVSVNI 208

Query: 250 LSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
            +H    D   + +L++ +       +  GL YTP V   S         +Y V L  I 
Sbjct: 209 FAHCLQGDNKGSGTLVIGH------IREPGLVYTPIVPKQS---------HYNVELLNIG 253

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           V G  V     +   D   +GG I+DSGTT T+    L +P  D+F     K R+  R+ 
Sbjct: 254 VSGTNVTTPTAF---DLSNSGGVIMDSGTTLTY----LVQPAYDQF---QAKVRDCMRS- 302

Query: 369 GAEALTGLRP-CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF--AVVGEG-SAVCLTV 424
                 G+ P  F       G FP + L+F GGA + L   +Y    ++  G SA C + 
Sbjct: 303 ------GVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSW 356

Query: 425 VTDREASGGPS-IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +      G  S  I G+  +++  V YD  N R+G+K   C
Sbjct: 357 LESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDC 397


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 108/385 (28%), Positives = 160/385 (41%), Gaps = 57/385 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++S GTP       +DTGS + W  C      + CSS K   F P  S++     C 
Sbjct: 130 YVITVSLGTPAVTQVMSIDTGSDVSWVQCA-PCAAQSCSSQKDKLFDPAKSATYSAFSCS 188

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNL-PNR 206
           + +C+ +  E   C + + +              Y+V Y     T G   S+TL L  + 
Sbjct: 189 SAQCAQLGGEGNGCLNSHCQ--------------YIVKYVDHSNTTGTYGSDTLGLTTSD 234

Query: 207 IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
            + NF  GCS  ++    Q  G+ G G    SL SQ        FSYCL       ++ +
Sbjct: 235 AVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCL-----PPSSSS 289

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           +   L  G++     ++  + TP V       R     +Y V L+ ITV G ++ V    
Sbjct: 290 AGGFLTLGAAAGGTSSSRYSRTPLV-------RFNVPTFYGVFLQAITVAGTKLNVPASV 342

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG-LRPC 379
            +      G ++VDSGT  T + P  ++ L   F  +M       +A  + A  G L  C
Sbjct: 343 FS------GASVVDSGTVITQLPPTAYQALRTAFKKEM-------KAYPSAAPVGILDTC 389

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           FD  G KT   P + L F  GA + L V   F       A CL       A  G + ILG
Sbjct: 390 FDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIFY------AGCLAFTA--TAQDGDTGILG 441

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q + + + +D+    LGF+   C
Sbjct: 442 NVQQRTFEMLFDVGGSTLGFRPGAC 466


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 124/471 (26%), Positives = 185/471 (39%), Gaps = 84/471 (17%)

Query: 24  SSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRALHIKNP-------------- 65
           SS++  T +L+  H      PS         L+     RA HI+                
Sbjct: 47  SSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQRKFAMNAAVDGAGDLQ 106

Query: 66  QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQ 122
           Q+K +++  T   +++ +  Y    IS+  GTP       +DTGS + W    PC N   
Sbjct: 107 QSKVSSSVPTKLGSSLDTLEY---VISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPN--- 160

Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS 182
              C +     F P  SS+ R + C   +C+ +  +   C        AT+  C      
Sbjct: 161 -PPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCG-------ATNYEC-----Q 207

Query: 183 YLVLYGSG-LTEGIALSETLNL--PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSL 236
           Y V YG G  T G    +TL L   +  +  F  GCS + S    Q  G+ G G G  SL
Sbjct: 208 YGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSL 267

Query: 237 PSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER 293
            SQ      + FSYCL                 +GSS       G   + FV    +  R
Sbjct: 268 VSQTAAAYGNSFSYCLPPT--------------SGSSGFLTLGGGGGVSGFVTTRMLRSR 313

Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
                +Y   L+ I VGG+++ +             G++VDSGT  T + P  +  L+  
Sbjct: 314 Q-IPTFYGARLQDIAVGGKQLGLSPSVFA------AGSVVDSGTIITRLPPTAYSALSSA 366

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
           F + M + R+      A A + L  CFD  G+   S P + L F GGA + L        
Sbjct: 367 FKAGMKQYRS------APARSILDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIM-- 418

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              G+ +      D   +G    I+GN Q + + V YD+ +  LGF+   C
Sbjct: 419 --YGNCLAFAATGDDGTTG----IIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 88/298 (29%), Positives = 140/298 (46%), Gaps = 42/298 (14%)

Query: 179 ICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGF---GRGKT 234
           IC +Y + YG G  T G    E L     ++ +F+ GC   +     G++G    GR   
Sbjct: 132 IC-NYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDL 190

Query: 235 SLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
           SL SQ   +    FSYCL S    +   + SLIL  G+S   + ++ ++Y   + NP + 
Sbjct: 191 SLISQTSGIFGGVFSYCLPS---TERKGSGSLIL-GGNSSVYRNSSPISYAKMIENPQLY 246

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
                  +Y++ L  I++GG  ++           G    +VDSGT  T + P +++ L 
Sbjct: 247 N------FYFINLTGISIGGVALQAPSV-------GPSRILVDSGTVITRLPPTIYKALK 293

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
            EF+ Q      +T    A A + L  CF++   +    P +K+HF+G AE+T+ V   F
Sbjct: 294 AEFLKQ------FTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVF 347

Query: 412 AVV-GEGSAVCLTVVT----DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             V  + S VCL + +    D  A      ILGN+Q +N  V YD +  ++GF  + C
Sbjct: 348 YFVKSDASQVCLALASLEYQDEVA------ILGNYQQKNLRVIYDTKETKVGFALETC 399


>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 522

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 126/485 (25%), Positives = 200/485 (41%), Gaps = 88/485 (18%)

Query: 9   CLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLV-SSSLTRALHIKNPQT 67
           C   IF F +   F   +   + S  RF   P + S++  N+LV    L R   + + ++
Sbjct: 24  CNGRIFTFEMHHRFSDEVKQWSDSTGRFVKFPPKGSFEYFNALVLRDWLIRGRRLSDSES 83

Query: 68  KTTTT-TTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK-- 124
           +++ T +   +T+ ISS  +  Y+ ++  GTP       LDTGS L W PC +  +C   
Sbjct: 84  ESSLTFSDGNSTSRISSLGFLHYT-TVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPT 141

Query: 125 ----YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
               Y S  ++  + PK+S++++ + C N  C+    +  QC       L T   C    
Sbjct: 142 EGATYASEFELSIYNPKISTTNKKVTCNNSLCA----QRNQC-------LGTFSTC---- 186

Query: 181 PSYLVLYGSGL--TEGIALSETLNL------PNRIIPNFLVGC------SVLSSRQPAGI 226
             Y+V Y S    T GI + + ++L      P R+      GC      S L    P G+
Sbjct: 187 -PYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGL 245

Query: 227 AGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTY 281
            G G  K S+PS L       D FS C      D   R S    D GSS  ++       
Sbjct: 246 FGLGMEKISVPSVLAREGLVADSFSMCF---GHDGVGRIS--FGDKGSSDQEE------- 293

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
           TPF  NPS          Y + + R+ VG   +           D     + D+GT+FT+
Sbjct: 294 TPFNLNPSHPN-------YNITVTRVRVGTTLI-----------DDEFTALFDTGTSFTY 335

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHFKGG 400
           +   ++  +++ F SQ    R+       ++      C+D+  +   S  P L L  KG 
Sbjct: 336 LVDPMYTTVSESFHSQAQDKRH-----SPDSRIPFEYCYDMSNDANASLIPSLSLTMKGN 390

Query: 401 AEVTLPVENYFAVVGEGSAV-CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
           +  T+  +    +  EG  V CL +V   E +     I+G   M  Y V +D     L +
Sbjct: 391 SHFTIN-DPIIVISTEGELVYCLAIVKSSELN-----IIGQNYMTGYRVVFDREKLVLAW 444

Query: 460 KQQLC 464
           K+  C
Sbjct: 445 KKFDC 449


>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
 gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
 gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 524

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 126/487 (25%), Positives = 200/487 (41%), Gaps = 90/487 (18%)

Query: 9   CLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLV-SSSLTRALHIKNPQT 67
           C   IF F +   F   +   + S  RF   P + S++  N+LV    L R   +   ++
Sbjct: 24  CNGRIFTFEMHHRFSDEVKQWSDSTGRFAKFPPKGSFEYFNALVLRDWLIRGRRLSESES 83

Query: 68  KTTTTTTTT---TTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
           ++ ++ T +   +T+ ISS  +  Y+ ++  GTP       LDTGS L W PC +  +C 
Sbjct: 84  ESESSLTFSDGNSTSRISSLGFLHYT-TVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCA 141

Query: 125 ------YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ 178
                 Y S  ++  + PK+S++++ + C N  C+    +  QC       L T   C  
Sbjct: 142 PTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCA----QRNQC-------LGTFSTC-- 188

Query: 179 ICPSYLVLYGSGL--TEGIALSETLNL------PNRIIPNFLVGC------SVLSSRQPA 224
               Y+V Y S    T GI + + ++L      P R+      GC      S L    P 
Sbjct: 189 ---PYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPN 245

Query: 225 GIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
           G+ G G  K S+PS L       D FS C      D   R S    D GSS  ++     
Sbjct: 246 GLFGLGMEKISVPSVLAREGLVADSFSMCF---GHDGVGRIS--FGDKGSSDQEE----- 295

Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
             TPF  NPS          Y + + R+ VG   +           D     + D+GT+F
Sbjct: 296 --TPFNLNPSHPN-------YNITVTRVRVGTTLI-----------DDEFTALFDTGTSF 335

Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHFK 398
           T++   ++  +++ F SQ    R+       ++      C+D+  +   S  P L L  K
Sbjct: 336 TYLVDPMYTTVSESFHSQAQDKRH-----SPDSRIPFEYCYDMSNDANASLIPSLSLTMK 390

Query: 399 GGAEVTLPVENYFAVVGEGSAV-CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
           G +  T+  +    +  EG  V CL +V   E +     I+G   M  Y V +D     L
Sbjct: 391 GNSHFTIN-DPIIVISTEGELVYCLAIVKSSELN-----IIGQNYMTGYRVVFDREKLVL 444

Query: 458 GFKQQLC 464
            +K+  C
Sbjct: 445 AWKKFDC 451


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 108/389 (27%), Positives = 159/389 (40%), Gaps = 61/389 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRL 144
           Y + +  GTPP     + DTGS   W       QC+ C  S    K   F P  SS+   
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWV------QCRPCVVSCYKQKDRLFDPAKSSTYAN 216

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
           + C +P C+      +    CN      + +C      Y + YG G  T G    +TL +
Sbjct: 217 VSCADPACA-----DLDASGCN------AGHCL-----YGIQYGDGSYTVGFFAKDTLAV 260

Query: 204 PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
               I  F  GC   +     Q AG+ G GRG TS+  Q        FSYCL +      
Sbjct: 261 AQDAIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPA------ 314

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           +  ++  L+ G        +    TP + +           +YYVGL  I VGG+++   
Sbjct: 315 SSAATGYLEFGPLSPSSSGSNAKTTPMLTDKG-------PTFYYVGLTGIRVGGKQLGAI 367

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
            + +      N GT+VDSGT  T + P+          +  +    Y +   A A + L 
Sbjct: 368 PESVF----SNSGTLVDSGTVITRL-PDTAYAALSSAFAAAMAASGYKK---AAAYSILD 419

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT--DREASGGPS 435
            C+D  G    S P + L F+GGA + L        + + S VCL   +  D E+ G   
Sbjct: 420 TCYDFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQ-SQVCLGFASNGDDESVG--- 475

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN Q + Y V YD+  + +GF    C
Sbjct: 476 -IVGNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 99/327 (30%), Positives = 140/327 (42%), Gaps = 48/327 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + +  GTP Q +  +LDT +   W PC+    C  CSS+   +F+P  S++   L C 
Sbjct: 45  YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSG---CTGCSST---TFLPNASTTLGSLDCS 98

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
             +CS +   S           AT  +      SY    G        + + + L N +I
Sbjct: 99  EAQCSQVRGFSCP---------ATGSSACLFNQSY---GGDSSLAATLVQDAITLANDVI 146

Query: 209 PNFLVGC-SVLS--SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
           P F  GC + +S  S  P G+ G GRG  SL SQ        FSYCL S  F     + S
Sbjct: 147 PGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGS 204

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L         +TT L   P  + PS+         YYV L  ++VG  +V +  + L 
Sbjct: 205 LKLGPVGQPKSIRTTPLLRNP--HRPSL---------YYVNLTGVSVGRIKVPIPSEQLV 253

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            D +   GTI+DSGT  T     ++  + DEF  Q+        +LGA        CF  
Sbjct: 254 FDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV---NGPISSLGA-----FDTCFAA 305

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVEN 409
             E     P + LHF+ G  + LP+EN
Sbjct: 306 TNEAEA--PAVTLHFE-GLNLVLPMEN 329


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 110/392 (28%), Positives = 162/392 (41%), Gaps = 65/392 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           + +++  GTP Q    I DTGS L W  C       +C   + P F P  SS+   + C 
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 203

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNR 206
            P+C+                L +  N T +   YLV YG G  T G+   +TL L  +R
Sbjct: 204 EPQCAAAGD------------LCSEDNTTCL---YLVRYGDGSSTTGVLSRDTLALTSSR 248

Query: 207 IIPNFLVGCSVLSSRQPAGIAGFGR---------GKTSLPSQLNLD---KFSYCLLSHKF 254
            +  F  GC   +      +  FGR         G+ SLPSQ        FSYCL S   
Sbjct: 249 ALTGFPFGCGTRN------LGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPS--- 299

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             +  T+  +    +  +D  T    YT  +  P       F  +Y+V L  I +GG  +
Sbjct: 300 --SNSTTGYLTIGATPATD--TGAAQYTAMLRKPQ------FPSFYFVELVSIDIGGYVL 349

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            V     T      GGT++DSGT  T++  + +  L D F   M +   YT A   + L 
Sbjct: 350 PVPPAVFT-----RGGTLLDSGTVLTYLPAQAYALLRDRFRLTMER---YTPAPPNDVLD 401

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAVCLTVVTDREASG 432
               C+D  GE     P +   F  GA   L   ++F V+   + +  CL      +  G
Sbjct: 402 A---CYDFAGESEVVVPAVSFRFGDGAVFEL---DFFGVMIFLDENVGCLAFAA-MDTGG 454

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            P  I+GN Q ++  V YD+  +++GF    C
Sbjct: 455 LPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 108/382 (28%), Positives = 152/382 (39%), Gaps = 53/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  + GTP Q     LDT +   W PC     C  CSS+   S     S++ + LGC 
Sbjct: 90  YIVKANVGTPAQTFLMALDTSNDAAWIPCNG---CVGCSSTVFNSVT---STTFKTLGCD 143

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P+C  + +           P      CT     +   YG          +T+ L   I+
Sbjct: 144 APQCKQVPN-----------PTCGGSTCT-----WNTTYGGSTILSNLTRDTIALSTDIV 187

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
           P +  GC   +  SS  P G+ G GRG  S  SQ   L    FSYCL S  F     + +
Sbjct: 188 PGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPS--FRTLNFSGT 245

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L         KTT     P + NP    R++    YYV L  I VG + V +    L 
Sbjct: 246 LRLGPAGQPLRIKTT-----PLLKNP---RRSSL---YYVNLIGIRVGRKIVDIPASALA 294

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +   ++  + DEF       R         +L G   C+  
Sbjct: 295 FNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEF-------RKRVGNAIVSSLGGFDTCYTG 347

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F G   VTLP +N       GS  CL +    +       ++ N Q
Sbjct: 348 PIVA----PTMTFMFSG-MNVTLPTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQ 402

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ + +D+ N R+G  ++ C
Sbjct: 403 QQNHRILFDVPNSRIGVAREPC 424


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 99/350 (28%), Positives = 148/350 (42%), Gaps = 55/350 (15%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           S G Y  + + GTPPQ +  ++D    LVW  CT    C+ C    +P F P  SS+ R 
Sbjct: 53  SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCT---PCQPCFEQDLPLFDPTKSSTFRG 109

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           L C +  C  I                +S+NCT     Y     +G T G A ++T  + 
Sbjct: 110 LPCGSHLCESIPE--------------SSRNCTSDVCIYEAPTKAGDTGGKAGTDTFAI- 154

Query: 205 NRIIPNFLVGCSVLSSRQ------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
                    GC V++ ++      P+GI G GR   SL +Q+N+  FSYCL         
Sbjct: 155 GAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGK------ 208

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER-NAFSVYYYVGLRRITVGGQRVRVW 317
             SS  L  G++         + TPFV   S     N  + YY V L  I  GG  ++  
Sbjct: 209 --SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAA 266

Query: 318 HKYLTLDRDGNGGTI-VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                     +G T+ +D+ +  +++A   ++ L           +  T A+G + +   
Sbjct: 267 SS--------SGSTVLLDTVSRASYLADGAYKAL----------KKALTAAVGVQPVASP 308

Query: 377 RPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
              +D+  P    G  PEL   F GGA +T+P  NY    G G+ VCLT+
Sbjct: 309 PKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGT-VCLTI 357


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 110/407 (27%), Positives = 173/407 (42%), Gaps = 69/407 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+PP+     +DTGS ++W  C++   C   S  +IP   F P  S+++ L
Sbjct: 82  GLYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAAL 141

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
           + C + +C+      IQ  D     L +S+  T  C  Y   YG G  T G  +++ ++L
Sbjct: 142 VSCSDQRCT----AGIQSSD----SLCSSR--TNQC-GYTFQYGDGSGTSGYYVADLMHL 190

Query: 204 PNRIIPNFLVG-------------CSVL-------SSRQPAGIAGFGRGKTSLPSQLNLD 243
              ++ +  +              CS L       S R   GI GFG+ + S+ SQL   
Sbjct: 191 DTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQ 250

Query: 244 K-----FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
                 FS+CL   K DD+     L+L       +     + YTP V  PS    N +  
Sbjct: 251 GITPRVFSHCL---KGDDSGG-GVLVL------GEIVEPNIVYTPLV--PSQPHYNLY-- 296

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
                L+ I+V GQ + +           N GTIVDSGTT  ++A   ++P      S +
Sbjct: 297 -----LQSISVAGQTLAIDPS--VFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVV 349

Query: 359 VKN-RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
             N R Y        L+    C+ V       FP++ L+F GGA + L  ++Y       
Sbjct: 350 SLNARTY--------LSKGNQCYLVTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSV 401

Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               +  V  ++  G    ILG+  +++    YD+ NQR+G+    C
Sbjct: 402 GGAAVWCVGFQKTPGQQITILGDLVLKDKIFVYDIANQRVGWTNYDC 448


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 120/413 (29%), Positives = 176/413 (42%), Gaps = 79/413 (19%)

Query: 86  YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCS-SSKIPSFIPKLSSSSR 143
           YG +  +L  GTP +    I+DTGS + + PC +   C + C    K  +F P  SSSS 
Sbjct: 59  YGYFYATLHLGTPARQFAVIVDTGSTITYVPCAS---CGRNCGPHHKDAAFDPASSSSSA 115

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
           ++GC + KC            C   P   S+   + C           + G+ +S+ L L
Sbjct: 116 VIGCDSDKCI-----------CGRPPCGCSEK--RECTYQRTYAEQSSSAGLLVSDQLQL 162

Query: 204 PNRIIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHK 253
            +  +   + GC       + +++  GI G G  + SL +QL       D F+ C  S +
Sbjct: 163 RDGAV-EVVFGCETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVE 221

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
            D      +L+L  G   + +    L YT  +++       A   YY V L  + VGGQ+
Sbjct: 222 GD-----GALML--GDVDAAEYDVALQYTALLSSL------AHPHYYSVQLEALWVGGQQ 268

Query: 314 VRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           + V       +R   G GT++DSGTTFT++  E F+ L  E VS       Y    G  +
Sbjct: 269 LPV-----KPERYEEGYGTVLDSGTTFTYLPSEAFQ-LFKEAVSA------YALEHGLNS 316

Query: 373 LTGLRP-----------CFDV---PGEKTGS-----FPELKLHFKGGAEV-TLPVENYFA 412
           + G  P           CF      G    S     FP  +L F  G  + T P+   F 
Sbjct: 317 VKGPDPKEKSFAQFHDICFGGAPHAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLFM 376

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             GE  A CL V  D  ASG    +LG    +N  V+YD RN+R+GF    C+
Sbjct: 377 HTGEMGAYCLGVF-DNGASG---TLLGGISFRNILVQYDRRNRRVGFGAASCQ 425


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 104/377 (27%), Positives = 158/377 (41%), Gaps = 30/377 (7%)

Query: 91  ISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           I+++ GTP  Q +  ++D  S+ VW  C        C      +F P  S++   L C +
Sbjct: 90  INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSS 149

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSETLNLPNRI 207
             C  +  E+     C     A +      C SY + YG  +  T G   ++T       
Sbjct: 150 DMCLPVLRET-----CGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATA 204

Query: 208 IPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
           +P  + GCS  S    AG   + G GRG  SL SQL   KFSY LL+ +  D     S+I
Sbjct: 205 VPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVI 264

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTL 323
              G     K   G + TP +++        +  +YYV L  + V G R+  +      L
Sbjct: 265 -RFGDDAVPKTKRGRS-TPLLSS------TLYPDFYYVNLTGVRVDGNRLDAIPAGTFDL 316

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
             +G GG I+ S T  T++     E  A + V   V +R    A+   A   L  C++  
Sbjct: 317 RANGTGGVILSSTTPVTYL-----EQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNAS 371

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
                  P+L L F GGA++ L   NYF +  +    CLT++  +  S     +LG    
Sbjct: 372 SMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGS-----VLGTLLQ 426

Query: 444 QNYYVEYDLRNQRLGFK 460
               + YD+   RL F+
Sbjct: 427 TGTNMIYDVDAGRLTFE 443


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 108/411 (26%), Positives = 174/411 (42%), Gaps = 74/411 (18%)

Query: 85  SYGGYSISLSF-----GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPK 137
           +Y  Y + L F     G+PP+     +DTGS ++W  C +   C   S   IP   F P 
Sbjct: 74  TYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPG 133

Query: 138 LSSSSRLLGCQNPKCSWIHHESIQCRD--CNDEPLATSKNCTQICPSYLVLYGSGL-TEG 194
            SS++ L+ C + +CS      +Q  D  C+ +       C      Y   YG G  T G
Sbjct: 134 SSSTASLISCSDQRCSL----GVQSSDAGCSSQ----GNQCI-----YTFQYGDGSGTSG 180

Query: 195 IALSETLNLPNRII--------PNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQ 239
             +S+ LN  + I+         + + GCS+        S R   GI GFG+   S+ SQ
Sbjct: 181 YYVSDLLNF-DAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQ 239

Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
           ++    +  + SH          +++       D     + Y+P V  PS         +
Sbjct: 240 MSSQGITPKVFSHCLKGDGGGGGILVLGEIVEED-----IVYSPLV--PS-------QPH 285

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD---EFVS 356
           Y + L+ I+V G+ + +  +        N GTIVDSGTT  ++A E ++P      E VS
Sbjct: 286 YNLNLQSISVNGKSLAIDPEVFATST--NRGTIVDSGTTLAYLAEEAYDPFVSAITEAVS 343

Query: 357 QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA---V 413
           Q V+            L+    C+ +     G FP + L+F GG  + L  E+Y      
Sbjct: 344 QSVR----------PLLSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNS 393

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +G+ +  C+     ++  G    ILG+  +++    YDL  QR+G+    C
Sbjct: 394 IGDAAVWCIGF---QKIQGQGITILGDLVLKDKIFVYDLAGQRIGWANYDC 441


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 104/377 (27%), Positives = 158/377 (41%), Gaps = 30/377 (7%)

Query: 91  ISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           I+++ GTP  Q +  ++D  S+ VW  C        C      +F P  S++   L C +
Sbjct: 90  INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSS 149

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSETLNLPNRI 207
             C  +  E+     C     A +      C SY + YG  +  T G   ++T       
Sbjct: 150 DMCLPVLRET-----CGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATA 204

Query: 208 IPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
           +P  + GCS  S    AG   + G GRG  SL SQL   KFSY LL+ +  D     S+I
Sbjct: 205 VPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVI 264

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTL 323
              G     K   G + TP +++        +  +YYV L  + V G R+  +      L
Sbjct: 265 -RFGDDAVPKTKRGQS-TPLLSS------TLYPDFYYVNLTGVRVDGNRLDAIPAGTFDL 316

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
             +G GG I+ S T  T++     E  A + V   V +R    A+   A   L  C++  
Sbjct: 317 RANGTGGVILSSTTPVTYL-----EQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNAS 371

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
                  P+L L F GGA++ L   NYF +  +    CLT++  +  S     +LG    
Sbjct: 372 SMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGS-----VLGTLLQ 426

Query: 444 QNYYVEYDLRNQRLGFK 460
               + YD+   RL F+
Sbjct: 427 TGTNMIYDVDAGRLTFE 443


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 105/402 (26%), Positives = 162/402 (40%), Gaps = 60/402 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + L  GTP       +DT S LVW  C     C  C     P F P+LSSS  ++ 
Sbjct: 86  GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQ---PCVSCYRQLDPIFNPRLSSSYAVVP 142

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +  CS +  +  +C + +D          Q C       G+ +T G    + L +   
Sbjct: 143 CSSDTCSQL--DGHRCDEDDD----------QACRYNYKYSGNAVTNGTLAIDKLAVGGN 190

Query: 207 IIPNFLVGCSVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT-S 261
           +    ++GCS  S   P    +G+ G  RG  SL SQL++ +F YCL        +RT  
Sbjct: 191 VFHAVVLGCSDSSVGGPPPQASGLVGLARGPLSLLSQLSVRRFMYCLPPP----MSRTPG 246

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK-- 319
            L+L  G+     +      T      +++    +  YYY+    + VG Q      +  
Sbjct: 247 KLVLGAGAGADAVRNVSDRVT-----VTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPT 301

Query: 320 ------------YLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
                               N  G IVD  +T +F+   L++ LAD+   ++       R
Sbjct: 302 SPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEI----RLPR 357

Query: 367 ALGAEALTGLRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
           A  +  L GL  CF +P   G      P + + F G     L +E     + +G  +CL 
Sbjct: 358 ATPSTRL-GLDLCFILPEGVGIDRVYVPTVSMSFDG---RWLELERDRLFLEDGRMMCLM 413

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           +      S     ILGN+Q QN +V Y+LR  ++ F +  C 
Sbjct: 414 IGRTSGVS-----ILGNYQQQNMHVLYNLRRGKITFAKASCD 450


>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
          Length = 565

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 83/255 (32%), Positives = 118/255 (46%), Gaps = 29/255 (11%)

Query: 215 CSVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
           C V     P+ G+ GF RG  S PSQ   +    FSYCL S+K  + + T  L    G +
Sbjct: 333 CVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYKSSNFSGTLRL----GPA 388

Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
              K+   +  TP ++NP    R +    YYV +  I VGG+ V V    L  D     G
Sbjct: 389 GQPKR---IKTTPLLSNP---HRPSL---YYVNMVGIRVGGRPVAVPASALAFDPASGHG 439

Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
           TIVD+GT FT ++  ++  + D F       R+  RA  A  L G   C++V    T S 
Sbjct: 440 TIVDAGTMFTRLSAPVYAAVCDVF-------RSRVRAPVAGPLGGFDTCYNV----TISV 488

Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVE 449
           P +   F G   VTLP EN           CL +      S    + ++ + Q QN+ V 
Sbjct: 489 PTVTFLFDGRVSVTLPEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHRVL 548

Query: 450 YDLRNQRLGFKQQLC 464
           +D+ N R+GF ++LC
Sbjct: 549 FDVANGRVGFSRELC 563


>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 598

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 131/286 (45%), Gaps = 49/286 (17%)

Query: 193 EGIALSETLNLPNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDK----F 245
           + +AL + ++    ++  +  GC  +    S  P G+ GFG G  S PSQ N D     F
Sbjct: 346 DALALHDDVD----VVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQ-NKDVYGFVF 400

Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
           SYCL S+K  + + T  L    G +   K+   +  TP ++NP    R +    YYV + 
Sbjct: 401 SYCLPSYKSSNFSSTLRL----GPAGQPKR---IKMTPLLSNP---HRPSL---YYVNMV 447

Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
            I VGG+ + V    L  D     GTIVD+GT FT ++  ++  + D F       R+  
Sbjct: 448 GIHVGGRPMLVPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVF-------RSRV 500

Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
           RA     L G   C++V    T S P +   F G   VTLP EN           CL + 
Sbjct: 501 RAPVTGPLGGFDTCYNV----TISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIACLAM- 555

Query: 426 TDREASGGPSI-------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                + GPS        +L + Q QN+ V +D+ N R+GF ++LC
Sbjct: 556 -----AAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVGFSRELC 596


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 157/374 (41%), Gaps = 56/374 (14%)

Query: 104 FILDTGSHLVWFPCTNHYQCK-YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
            ILDTGS L W  C     C  YC +   P + P +S + + L C + +CS +   ++  
Sbjct: 1   MILDTGSSLSWLQCQ---PCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATL-- 55

Query: 163 RDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-PNRIIPNFLVGCSVLSS 220
              ND    T  N       Y   YG +  + G    + L L  ++ +P F  GC   + 
Sbjct: 56  ---NDPLCETDSNACL----YTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQ 108

Query: 221 R---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
               + AGI G  R K S+ +QL+      FSYCL        T  S        S    
Sbjct: 109 GLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCL-------PTANSGSSGGGFLSIGSI 161

Query: 275 KTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
             T   +TP +    NPS+         Y++ L  ITV G+ + +      +       T
Sbjct: 162 SPTSYKFTPMLTDSKNPSL---------YFLRLTAITVSGRPLDLAAAMYRVP------T 206

Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
           ++DSGT  T +   ++  L   FV  M      T+   A A + L  CF    +   + P
Sbjct: 207 LIDSGTVITRLPMSMYAALRQAFVKIMS-----TKYAKAPAYSILDTCFKGSLKSISAVP 261

Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEY 450
           E+K+ F+GGA++TL   +      +G    +T +    +SG   I I+GN Q Q Y + Y
Sbjct: 262 EIKMIFQGGADLTLRAPSILIEADKG----ITCLAFAGSSGTNQIAIIGNRQQQTYNIAY 317

Query: 451 DLRNQRLGFKQQLC 464
           D+   R+GF    C
Sbjct: 318 DVSTSRIGFAPGSC 331


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 108/411 (26%), Positives = 174/411 (42%), Gaps = 74/411 (18%)

Query: 85  SYGGYSISLSF-----GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPK 137
           +Y  Y + L F     G+PP+     +DTGS ++W  C +   C   S   IP   F P 
Sbjct: 59  TYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPG 118

Query: 138 LSSSSRLLGCQNPKCSWIHHESIQCRD--CNDEPLATSKNCTQICPSYLVLYGSGL-TEG 194
            SS++ L+ C + +CS      +Q  D  C+ +       C      Y   YG G  T G
Sbjct: 119 SSSTASLISCSDQRCSL----GVQSSDAGCSSQ----GNQCI-----YTFQYGDGSGTSG 165

Query: 195 IALSETLNLPNRII--------PNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQ 239
             +S+ LN  + I+         + + GCS+        S R   GI GFG+   S+ SQ
Sbjct: 166 YYVSDLLNF-DAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQ 224

Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
           ++    +  + SH          +++       D     + Y+P V  PS         +
Sbjct: 225 MSSQGITPKVFSHCLKGDGGGGGILVLGEIVEED-----IVYSPLV--PS-------QPH 270

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD---EFVS 356
           Y + L+ I+V G+ + +  +        N GTIVDSGTT  ++A E ++P      E VS
Sbjct: 271 YNLNLQSISVNGKSLAIDPEVFATST--NRGTIVDSGTTLAYLAEEAYDPFVSAITEAVS 328

Query: 357 QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA---V 413
           Q V+            L+    C+ +     G FP + L+F GG  + L  E+Y      
Sbjct: 329 QSVR----------PLLSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNS 378

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +G+ +  C+     ++  G    ILG+  +++    YDL  QR+G+    C
Sbjct: 379 IGDAAVWCIGF---QKIQGQGITILGDLVLKDKIFVYDLAGQRIGWANYDC 426


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 99/327 (30%), Positives = 140/327 (42%), Gaps = 48/327 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + +  GTP Q +  +LDT +   W PC+    C  CSS+   +F+P  S++   L C 
Sbjct: 45  YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSG---CTGCSST---TFLPNASTTLGSLDCS 98

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
             +CS +   S           AT  +      SY    G        + + + L N +I
Sbjct: 99  EAQCSQVRGFSCP---------ATGSSACLFNQSY---GGDSSLAATLVQDAITLANDVI 146

Query: 209 PNFLVGC-SVLS--SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
           P F  GC + +S  S  P G+ G GRG  SL SQ        FSYCL S  F     + S
Sbjct: 147 PGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGS 204

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L         +TT L   P  + PS+         YYV L  ++VG  +V +  + L 
Sbjct: 205 LKLGPVGQPKSIRTTPLLRNP--HRPSL---------YYVNLTGVSVGRIKVPIPSEQLV 253

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            D +   GTI+DSGT  T     ++  + DEF  Q+        +LGA        CF  
Sbjct: 254 FDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV---NGPISSLGA-----FDTCFAE 305

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVEN 409
             E     P + LHF+ G  + LP+EN
Sbjct: 306 TNEAEA--PAVTLHFE-GLNLVLPMEN 329


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 114/405 (28%), Positives = 174/405 (42%), Gaps = 65/405 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRLLG 146
           Y   +  G P +     +DTGS ++W  C     C   S+  IP   + P+ SS++ L+ 
Sbjct: 2   YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETLNLPN 205
           C +P C  +         C+      + NC      Y+  YG G T EG  + + +   N
Sbjct: 62  CSDPLC--VRGRRFAEAQCSQ----ATNNC-----EYIFSYGDGSTSEGYYVRDAMQY-N 109

Query: 206 RIIPN--------FLVGCSV-----LSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLL 250
            I  N         L GCS+     LS+ Q A  GI GFG+ + S+P+QL   +    + 
Sbjct: 110 VISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVF 169

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
           SH  +   R   +++  G +       G+TYTP V +         SV+Y V LR I+V 
Sbjct: 170 SHCLEGEKRGGGILVIGGIAEP-----GMTYTPLVPD---------SVHYNVVLRGISVN 215

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
             R+ +  +  +   D   G I+DSGTT  +     +    + FV  +   R  T A   
Sbjct: 216 SNRLPIDAEDFSSTNDT--GVIMDSGTTLAYFPSGAY----NVFVQAI---REATSATPV 266

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVV 425
                   CF V G  +  FP + L+F+GGA + L  +NY      A  G     C+   
Sbjct: 267 RVQGMDTQCFLVSGRLSDLFPNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQ 325

Query: 426 TDREASGGPS-----IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           +   +S GP       ILG+  +++  V YDL N R+G+    CK
Sbjct: 326 SS-SSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNCK 369


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 133/484 (27%), Positives = 202/484 (41%), Gaps = 68/484 (14%)

Query: 1   MASYISALCL-SFIFFFTLLSI------FPSSITSLTFSLSRFHTNPSQDSYQNLNSLVS 53
           MA+ +S L + + IF  TL+ I      F   + +     S F+ NP +   Q + S V 
Sbjct: 1   MAASVSLLAIVTLIFSGTLVPIDAAKDGFTVELINRDSPKSPFY-NPRETPTQRIVSAVR 59

Query: 54  SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
            S++R  H  +P   +   T T  +  IS+   G Y +  S GTP   I  I DTGS L+
Sbjct: 60  RSMSRVHHF-SPTKNSDIFTDTAQSEMISNQ--GEYLMKFSLGTPAFDILAIADTGSDLI 116

Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
           W  C     C  C     P F PK SS+ R + C   +C  +   +     C+ E    +
Sbjct: 117 WTQCK---PCDQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGA----SCSGE---GN 166

Query: 174 KNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR-----IIPNFLVGCSVLS----SRQP 223
           K C      Y   YG    T G   ++T+ L +      ++P  ++GC   +    + + 
Sbjct: 167 KTC-----HYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKG 221

Query: 224 AGIAGFGRGKTSLPSQLN--LD-KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
           +GI G G G  SL SQL   +D KFSYCL+     + T +S L   N  S+      G+ 
Sbjct: 222 SGIVGLGGGPISLISQLGSTIDGKFSYCLVPLS-SNATNSSKL---NFGSNGIVSGGGVQ 277

Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
            TP ++            +Y++ L  ++VG +R++             G  I+DSGTT T
Sbjct: 278 STPLISKDP-------DTFYFLTLEAVSVGSERIKFPGSSFGTSE---GNIIIDSGTTLT 327

Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
               + F  L+   V   V         G  +L     C+ +  +    FP +  HF  G
Sbjct: 328 LFPEDFFSELSSA-VQDAVAGTPVEDPSGILSL-----CYSIDADL--KFPSITAHFD-G 378

Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
           A+V L   N F  V + + +C         +     I GN    N+ V YDL  + + FK
Sbjct: 379 ADVKLNPLNTFVQVSD-TVLCFAFNPINSGA-----IFGNLAQMNFLVGYDLEGKTVSFK 432

Query: 461 QQLC 464
              C
Sbjct: 433 PTDC 436


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 108/382 (28%), Positives = 152/382 (39%), Gaps = 53/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  + GTP Q     LDT +   W PC     C  CSS+   S     S++ + LGC 
Sbjct: 90  YIVKANVGTPAQTFLMALDTSNDAAWIPCNG---CVGCSSTVFNSVT---STTFKTLGCD 143

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
            P+C  + +           P      CT     +   YG          +T+ L   I+
Sbjct: 144 APQCKQVPN-----------PTCGGSTCT-----WNTTYGGSTILSNLTRDTIALSTDIV 187

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
           P +  GC   +  SS  P G+ G GRG  S  SQ   L    FSYCL S  F     + +
Sbjct: 188 PGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPS--FRTLNFSGT 245

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L         KTT     P + NP    R++    YYV L  I VG + V +    L 
Sbjct: 246 LRLGPAGQPLRIKTT-----PLLKNP---RRSSL---YYVNLIGIRVGRKIVDIPASALA 294

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +   ++  + DEF       R         +L G   C+  
Sbjct: 295 FNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEF-------RKRVGNAIVSSLGGFDTCYTG 347

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F G   VTLP +N       GS  CL +    +       ++ N Q
Sbjct: 348 PIVA----PTMTFMFSG-MNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQ 402

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ + +D+ N R+G  ++ C
Sbjct: 403 QQNHRILFDVPNSRIGVAREPC 424


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 107/403 (26%), Positives = 166/403 (41%), Gaps = 62/403 (15%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSS 140
           + S G Y   +  G+PP+     +DTGS ++W  C    +C   +   IP   +  K SS
Sbjct: 72  ADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSS 131

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSE 199
           +S+ +GC++  CS+I    +Q   C        K C     SY V+YG G T +G  + +
Sbjct: 132 TSKNVGCEDDFCSFI----MQSETC-----GAKKPC-----SYHVVYGDGSTSDGDFIKD 177

Query: 200 TLNLPN-----RIIP---NFLVGCSVLSSRQPA-------GIAGFGRGKTSLPSQLNLDK 244
            + L       R  P     + GC    S Q         GI GFG+  TS+ SQL    
Sbjct: 178 NITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGG 237

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
            +  + SH  D+       I   G   S    T    TP V N          V+Y V L
Sbjct: 238 STKRIFSHCLDNMNGGG--IFAVGEVESPVVKT----TPIVPN---------QVHYNVIL 282

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS-QMVKNRN 363
           + + V G  + +     +   +G+GGTI+DSGTT  ++   L+  L ++  + Q VK   
Sbjct: 283 KGMDVDGDPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHM 340

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
                          CF        +FP + LHF+   ++++   +Y   + E    C  
Sbjct: 341 VQETFA---------CFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRE-DMYCFG 390

Query: 424 VVTD--REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +       G   I+LG+  + N  V YDL N+ +G+    C
Sbjct: 391 WQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 433


>gi|449527083|ref|XP_004170542.1| PREDICTED: LOW QUALITY PROTEIN: basic 7S globulin-like [Cucumis
           sativus]
          Length = 432

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 114/423 (26%), Positives = 176/423 (41%), Gaps = 80/423 (18%)

Query: 81  ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
           ++ H  G Y   +   TP   +   +D G   +W  C   Y                +SS
Sbjct: 34  VTKHPSGQYITQIRQRTPLVPVKLTVDLGGQFMWVDCDRGY----------------VSS 77

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC---PSYLVLYGSGLTEGIAL 197
           S + + C++ +CS    +S  C DC   P     N T  C   P   ++  S  T G   
Sbjct: 78  SYKPVRCRSAQCSL--SKSTSCGDCFSPPXPGCNNNT--CGHFPGNTIIQLS--TSGEVT 131

Query: 198 SETLNL-------PNRI--IPNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLNLD 243
           S+ L++       P R   IPNFL  C    +L   +   +G+AGFGR   SLPSQ +  
Sbjct: 132 SDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFSAA 191

Query: 244 -----KFSYCLLSHKFDDTTRTSSLILD-NGSSH---SDKKTTGLTYTPFVNNP----SV 290
                KF+ CL       +TR+  +I   NG  H   +   T  LTYTP   NP     V
Sbjct: 192 FSFNRKFAVCL-----SGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVSTAGV 246

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
           +     S  Y++G++ I    + V +    L +D +GNGGT + +   +T +   ++  L
Sbjct: 247 STSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIYNAL 306

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN- 409
                 ++   RN  R     A+     C+     K+ SF   +L   G   + L ++N 
Sbjct: 307 VKTITREL---RNIPR---VAAVAPFGVCY-----KSKSFGSTRLG-PGMPSIDLILQNK 354

Query: 410 --YFAVVGEGSAV-------CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
              + + G  S V       CL  V D       +I++G +QM++  +E+DL   RLGF 
Sbjct: 355 KVIWRIFGANSMVQVNEEVLCLGFV-DGGVEARTAIVIGAYQMEDNLLEFDLATSRLGFS 413

Query: 461 QQL 463
             L
Sbjct: 414 STL 416


>gi|383143497|gb|AFG53176.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143499|gb|AFG53177.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143505|gb|AFG53180.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143513|gb|AFG53184.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143515|gb|AFG53185.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
          Length = 135

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 55/153 (35%), Positives = 81/153 (52%), Gaps = 18/153 (11%)

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           YCL     D    +S +++ N +   D     LTYTP + NP       +  +YY+GL  
Sbjct: 1   YCL-----DYVNNSSKIVVGNKAVPGD---ISLTYTPLIINP------IYPFFYYLGLEA 46

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           +++G +R+ +     T D  GNGGTI+DSGT+FT     ++  +A EF SQ+     Y R
Sbjct: 47  VSIGRKRLNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQI----GYKR 102

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
             GAE+ T L  C++V G +   FP+   HFKG
Sbjct: 103 VPGAESTTALGLCYNVSGVENIQFPQFAFHFKG 135


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 166/391 (42%), Gaps = 61/391 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP +    I DTGS + W  C      K C   K P   P  S+S + + 
Sbjct: 117 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQC--EPCVKTCYKQKEPRLNPSTSTSYKNIS 174

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS----YLVLYGSG-LTEGIALSETL 201
           C +  C  +               A+ K  +Q C S    Y V YG G  + G   +ETL
Sbjct: 175 CSSALCKLV---------------ASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETL 219

Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
            L  + +  NFL GC   ++      AG+ G GR K +LPSQ        FSYCL +   
Sbjct: 220 TLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPAS-- 277

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             ++    L L    S S K T           P  A+ ++ + +Y + +  ++VGG++ 
Sbjct: 278 --SSSKGYLSLGGQVSKSVKFT-----------PLSADFDS-TPFYGLDITGLSVGGRK- 322

Query: 315 RVWHKYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
                 L++D    + GT++DSGT  T ++P  +  L+  F + M    +Y    G    
Sbjct: 323 ------LSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT---DYPSTSGYSI- 372

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
                C+D     T   P++ + FKGG E+ + V      V     VCL    + + S  
Sbjct: 373 --FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS-- 428

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + I GN Q + Y V YD    R+GF    C
Sbjct: 429 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 459


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 166/391 (42%), Gaps = 61/391 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP +    I DTGS + W  C      K C   K P   P  S+S + + 
Sbjct: 129 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQC--EPCVKTCYKQKEPRLNPSTSTSYKNIS 186

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS----YLVLYGSG-LTEGIALSETL 201
           C +  C  +               A+ K  +Q C S    Y V YG G  + G   +ETL
Sbjct: 187 CSSALCKLV---------------ASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETL 231

Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
            L  + +  NFL GC   ++      AG+ G GR K +LPSQ        FSYCL +   
Sbjct: 232 TLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPAS-- 289

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             ++    L L    S S K T           P  A+ ++ + +Y + +  ++VGG++ 
Sbjct: 290 --SSSKGYLSLGGQVSKSVKFT-----------PLSADFDS-TPFYGLDITGLSVGGRK- 334

Query: 315 RVWHKYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
                 L++D    + GT++DSGT  T ++P  +  L+  F + M    +Y    G    
Sbjct: 335 ------LSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT---DYPSTSGYSI- 384

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
                C+D     T   P++ + FKGG E+ + V      V     VCL    + + S  
Sbjct: 385 --FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS-- 440

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + I GN Q + Y V YD    R+GF    C
Sbjct: 441 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 471


>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 537

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 131/286 (45%), Gaps = 49/286 (17%)

Query: 193 EGIALSETLNLPNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDK----F 245
           + +AL + ++    ++  +  GC  +    S  P G+ GFG G  S PSQ N D     F
Sbjct: 285 DALALHDDVD----VVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQ-NKDVYGFVF 339

Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
           SYCL S+K  + + T  L    G +   K+   +  TP ++NP    R +    YYV + 
Sbjct: 340 SYCLPSYKSSNFSSTLRL----GPAGQPKR---IKMTPLLSNP---HRPSL---YYVNMV 386

Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
            I VGG+ + V    L  D     GTIVD+GT FT ++  ++  + D F       R+  
Sbjct: 387 GIHVGGRPMLVPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVF-------RSRV 439

Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
           RA     L G   C++V    T S P +   F G   VTLP EN           CL + 
Sbjct: 440 RAPVTGPLGGFDTCYNV----TISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIACLAM- 494

Query: 426 TDREASGGPSI-------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                + GPS        +L + Q QN+ V +D+ N R+GF ++LC
Sbjct: 495 -----AAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVGFSRELC 535


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 155/380 (40%), Gaps = 51/380 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y  S   GTPPQ +   LD  S LVW  C                F P  S++   + 
Sbjct: 98  GMYVFSYGIGTPPQQVSGALDISSDLVWTAC-----------GATAPFNPVRSTTVADVP 146

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
           C +  C     ++     C     A +  C     +Y  +YG G   T G+  +E     
Sbjct: 147 CTDDACQQFAPQT-----CG----AGASEC-----AYTYMYGGGAANTTGLLGTEAFTFG 192

Query: 205 NRIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
           +  I   + GC   +V      +G+ G GRG  SL SQL +D+FSY       DD+  T 
Sbjct: 193 DTRIDGVVFGCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAP---DDSVDTQ 249

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           S IL       D  T   ++T    +  +   +A    YYV L  I V G+ + +     
Sbjct: 250 SFIL-----FGDDATPQTSHT---LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTF 301

Query: 322 TL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
            L ++DG+GG  +      T +    ++PL      Q V ++    A+   AL GL  C+
Sbjct: 302 DLRNKDGSGGVFLSITDLVTVLEEAAYKPL-----RQAVASKIGLPAVNGSAL-GLDLCY 355

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
                     P + L F GGA + L + NYF +       CLT++    +S G   +LG+
Sbjct: 356 TGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLACLTIL---PSSAGDGSVLGS 412

Query: 441 FQMQNYYVEYDLRNQRLGFK 460
                 ++ YD+   +L F+
Sbjct: 413 LIQVGTHMMYDINGSKLVFE 432


>gi|449432733|ref|XP_004134153.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
          Length = 432

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 114/423 (26%), Positives = 176/423 (41%), Gaps = 80/423 (18%)

Query: 81  ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
           ++ H  G Y   +   TP   +   +D G   +W  C   Y                +SS
Sbjct: 34  VTKHPSGQYITQIRQRTPLVPVKLTVDLGGQFMWVDCDRGY----------------VSS 77

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC---PSYLVLYGSGLTEGIAL 197
           S + + C++ +CS    +S  C DC   P     N T  C   P   ++  S  T G   
Sbjct: 78  SYKPVRCRSAQCSL--SKSTSCGDCFSPPRPGCNNNT--CGHFPGNTIIQLS--TSGEVT 131

Query: 198 SETLNL-------PNRI--IPNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLNL- 242
           S+ L++       P R   IPNFL  C    +L   +   +G+AGFGR   SLPSQ +  
Sbjct: 132 SDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFSAA 191

Query: 243 ----DKFSYCLLSHKFDDTTRTSSLILD-NGSSH---SDKKTTGLTYTPFVNNP----SV 290
                KF+ CL       +TR+  +I   NG  H   +   T  LTYTP   NP     V
Sbjct: 192 FSFNRKFAVCL-----SGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVSTAGV 246

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
           +     S  Y++G++ I    + V +    L +D +GNGGT + +   +T +   ++  L
Sbjct: 247 STSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIYNAL 306

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN- 409
                 ++   RN  R     A+     C+     K+ SF   +L   G   + L ++N 
Sbjct: 307 VKTITREL---RNIPR---VAAVAPFGVCY-----KSKSFGSTRLG-PGMPSIDLILQNK 354

Query: 410 --YFAVVGEGSAV-------CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
              + + G  S V       CL  V D       +I++G +QM++  +E+DL   RLGF 
Sbjct: 355 KVIWRIFGANSMVQVNEEVLCLGFV-DGGVEARTAIVIGAYQMEDNLLEFDLATSRLGFS 413

Query: 461 QQL 463
             L
Sbjct: 414 STL 416


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 80/268 (29%), Positives = 116/268 (43%), Gaps = 38/268 (14%)

Query: 210 NFLVGCS------VLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRT 260
           N  +GC+      V  +    GI G G  K S   +   +   KFSYCL+ H    + R 
Sbjct: 264 NLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHL---SHRN 320

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR----V 316
            S  L  G  H+ K    +  T  +          F  +Y V +  I++GGQ ++    V
Sbjct: 321 VSSYLTIGGHHNAKLLGEIKRTELI---------LFPPFYGVNVVGISIGGQMLKIPPQV 371

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
           W      D +  GGT++DSGTT T +    +EP+ +  +  + K +  T     E    L
Sbjct: 372 W------DFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVT----GEDFGAL 421

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CFD  G      P L  HF GGA    PV++Y   V      C+ +V   +  GG S+
Sbjct: 422 DFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAP-LVKCIGIVP-IDGIGGASV 479

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I GN   QN+  E+DL    +GF   +C
Sbjct: 480 I-GNIMQQNHLWEFDLSTNTIGFAPSIC 506


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 108/424 (25%), Positives = 174/424 (41%), Gaps = 69/424 (16%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSY------GGYSISLSFGTPPQIIPFILDTGSHLVWF 115
           I NP  +     T+   +N     Y      G Y+  L  GTPPQ    I+DTGS + + 
Sbjct: 50  ISNPHRRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYV 109

Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
           PC+    C+ C   + P F P+ SS+ + + C N  C     + +QC           + 
Sbjct: 110 PCST---CEQCGRHQDPKFDPESSSTYKPIKC-NIDC-ICDSDGVQC--------VYERQ 156

Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-----SRQPAGIAGFG 230
             ++  S  VL    ++ G   +++  +P R +     GC  +      S++  GI G G
Sbjct: 157 YAEMSTSSGVLGEDVISFG---NQSELIPQRAV----FGCENMETGDLFSQRADGIMGLG 209

Query: 231 RGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV 285
            G  SL  QL       D FS C             +++L   S  SD      TY+  V
Sbjct: 210 TGDLSLVDQLVEKGAINDSFSLCYGGMDIG----GGAMVLGGISPPSDMI---FTYSDPV 262

Query: 286 NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
            +P          YY V L+ I V G+++ +         DG  G ++DSGTT+ ++  E
Sbjct: 263 RSP----------YYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYLPAE 308

Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGA 401
            F    D  + ++    +  + +          CF   G    E +  FP + + F+ G 
Sbjct: 309 AFSAFKDAIMDEI----HSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQ 364

Query: 402 EVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
           +++L  ENYF    +   A CL +    E     + +LG   ++N  V YD  N ++GF 
Sbjct: 365 KLSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGIVVRNTLVMYDRANSKIGFW 421

Query: 461 QQLC 464
           +  C
Sbjct: 422 KTNC 425


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 118/413 (28%), Positives = 174/413 (42%), Gaps = 75/413 (18%)

Query: 85  SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSS 142
           S G Y   +  G P +     +DTGS ++W  C     C   S+  IP   + P+ SS++
Sbjct: 25  SGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTT 84

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETL 201
            L+ C +P C  +         C+     T+ NC      Y+  YG G T EG  + + +
Sbjct: 85  SLVSCSDPLC--VRGRRFAEAQCSQ----TTNNC-----EYIFSYGDGSTSEGYYVRDAM 133

Query: 202 NLPNRIIPN--------FLVGCSV-----LSSRQPA--GIAGFGRGKTSLPSQLNLDK-- 244
              N I  N         L GCS+     LS+ Q A  GI GFG+ + S+P+QL   +  
Sbjct: 134 QY-NVISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNI 192

Query: 245 ---FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
              FS+CL   K          I +           G+TYTP V +         SV+Y 
Sbjct: 193 PRVFSHCLEGEKRGGGILVIGGIAE----------PGMTYTPLVPD---------SVHYN 233

Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
           V LR I+V   R+ +  +  +   D   G I+DSGTT  +     +    + FV  +   
Sbjct: 234 VVLRGISVNSNRLPIDAEDFSSTNDT--GVIMDSGTTLAYFPSGAY----NVFVQAI--- 284

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF-----AVVGE 416
           R  T A           CF V G  +  FP + L+F+GGA + L  +NY      A  G 
Sbjct: 285 REATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGA-MELQPDNYLMWGGTAPTGT 343

Query: 417 GSAVCLTVVTDREASGGPS-----IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               C+   +   +S GP       ILG+  +++  V YDL N R+G+    C
Sbjct: 344 TDVWCIGWQS-SSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 395


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 163/387 (42%), Gaps = 49/387 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTP + +  I DTGS L W  C    +  Y     I  F P  S+S   + 
Sbjct: 143 GNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAI--FDPSKSTSYSNIT 200

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
           C +  C+ +   +     C+    A++K C      Y + YG S  + G    E L++  
Sbjct: 201 CTSTLCTQLSTATGNEPGCS----ASTKACI-----YGIQYGDSSFSVGYFSRERLSVTA 251

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTT 258
             I+ NFL GC   +       AG+ G GR   S   Q   +    FSYCL        T
Sbjct: 252 TDIVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCL------PAT 305

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            +S+  L  G++     T+ + YTPF    S   R   S +Y + +  I+VGG ++ V  
Sbjct: 306 SSSTGRLSFGTT----TTSYVKYTPF----STISRG--SSFYGLDITGISVGGAKLPVSS 355

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +      GG I+DSGT  T + P  +  L   F   M K  +      A  L+ L  
Sbjct: 356 STFS-----TGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPS------AGELSILDT 404

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+D+ G +  S P++   F GG  V LP +     V     VCL    + + S     I 
Sbjct: 405 CYDLSGYEVFSIPKIDFSFAGGVTVQLPPQGIL-YVASAKQVCLAFAANGDDS--DVTIY 461

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           GN Q +   V YD+   R+GF    CK
Sbjct: 462 GNVQQKTIEVVYDVGGGRIGFGAGGCK 488


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 108/424 (25%), Positives = 174/424 (41%), Gaps = 69/424 (16%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSY------GGYSISLSFGTPPQIIPFILDTGSHLVWF 115
           I NP  +     T+   +N     Y      G Y+  L  GTPPQ    I+DTGS + + 
Sbjct: 50  ISNPHRRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYV 109

Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
           PC+    C+ C   + P F P+ SS+ + + C N  C     + +QC           + 
Sbjct: 110 PCST---CEQCGRHQDPKFDPESSSTYKPIKC-NIDC-ICDSDGVQC--------VYERQ 156

Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-----SRQPAGIAGFG 230
             ++  S  VL    ++ G   +++  +P R +     GC  +      S++  GI G G
Sbjct: 157 YAEMSTSSGVLGEDVISFG---NQSELIPQRAV----FGCENMETGDLFSQRADGIMGLG 209

Query: 231 RGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV 285
            G  SL  QL       D FS C             +++L   S  SD      TY+  V
Sbjct: 210 TGDLSLVDQLVEKGAINDSFSLCYGGMDIG----GGAMVLGGISPPSDMI---FTYSDPV 262

Query: 286 NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
            +P          YY V L+ I V G+++ +         DG  G ++DSGTT+ ++  E
Sbjct: 263 RSP----------YYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYLPAE 308

Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGA 401
            F    D  + ++    +  + +          CF   G    E +  FP + + F+ G 
Sbjct: 309 AFSAFKDAIMDEI----HSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQ 364

Query: 402 EVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
           +++L  ENYF    +   A CL +    E     + +LG   ++N  V YD  N ++GF 
Sbjct: 365 KLSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGIVVRNTLVMYDRANSKIGFW 421

Query: 461 QQLC 464
           +  C
Sbjct: 422 KTNC 425


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 107/403 (26%), Positives = 166/403 (41%), Gaps = 62/403 (15%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSS 140
           + S G Y   +  G+PP+     +DTGS ++W  C    +C   +   IP   +  K SS
Sbjct: 68  ADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSS 127

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSE 199
           +S+ +GC++  CS+I    +Q   C        K C     SY V+YG G T +G  + +
Sbjct: 128 TSKNVGCEDDFCSFI----MQSETC-----GAKKPC-----SYHVVYGDGSTSDGDFIKD 173

Query: 200 TLNLPN-----RIIP---NFLVGCSVLSSRQPA-------GIAGFGRGKTSLPSQLNLDK 244
            + L       R  P     + GC    S Q         GI GFG+  TS+ SQL    
Sbjct: 174 NITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGG 233

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
            +  + SH  D+       I   G   S    T    TP V N          V+Y V L
Sbjct: 234 STKRIFSHCLDNMNGGG--IFAVGEVESPVVKT----TPIVPN---------QVHYNVIL 278

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS-QMVKNRN 363
           + + V G  + +     +   +G+GGTI+DSGTT  ++   L+  L ++  + Q VK   
Sbjct: 279 KGMDVDGDPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHM 336

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
                          CF        +FP + LHF+   ++++   +Y   + E    C  
Sbjct: 337 VQETFA---------CFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRE-DMYCFG 386

Query: 424 VVTD--REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +       G   I+LG+  + N  V YDL N+ +G+    C
Sbjct: 387 WQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 429


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 110/401 (27%), Positives = 165/401 (41%), Gaps = 79/401 (19%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ    I+DTGS + + PC+    CK+C S + P F P+ S +     
Sbjct: 91  GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST---CKHCGSHQDPKFRPEASET----- 142

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN---- 202
            Q  KC+W      QC +C+D+     K CT     Y   Y    T    L E +     
Sbjct: 143 YQPVKCTW------QC-NCDDD----RKQCT-----YERRYAEMSTSSGVLGEDVVSFGN 186

Query: 203 ----LPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYC 248
                P R I     GC       + +++  GI G GRG  S+  QL       D FS C
Sbjct: 187 QSELSPQRAI----FGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLC 242

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
                          I       S       T++  V +P          YY + L+ I 
Sbjct: 243 YGGMGVGGGAMVLGGI-------SPPADMVFTHSDPVRSP----------YYNIDLKEIH 285

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           V G+R+ +  K      DG  GT++DSGTT+ ++    F  LA  F   ++K  +  + +
Sbjct: 286 VAGKRLHLNPKVF----DGKHGTVLDSGTTYAYLPESAF--LA--FKHAIMKETHSLKRI 337

Query: 369 GAEALTGLRPCFDVP----GEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLT 423
                     CF        + + SFP +++ F  G +++L  ENY F       A CL 
Sbjct: 338 SGPDPHYNDICFSGAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLG 397

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           V ++      P+ +LG   ++N  V YD  + ++GF +  C
Sbjct: 398 VFSN---GNDPTTLLGGIVVRNTLVMYDREHSKIGFWKTNC 435


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/249 (31%), Positives = 111/249 (44%), Gaps = 28/249 (11%)

Query: 224 AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYT 282
           +GI GFGR   SL SQL++ +FSYCL S+    + R S+L+  + S       TG +  T
Sbjct: 76  SGIVGFGRNPLSLVSQLSIRRFSYCLTSYA---SRRQSTLLFGSLSDGVYGDATGRVQTT 132

Query: 283 PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFM 342
           P + +P          +YYV    +TVG +R+R+      L  DG+GG IVDSGT  T +
Sbjct: 133 PLLQSPQNP------TFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLL 186

Query: 343 APELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS-------FPELKL 395
              +   +   F  Q+        A G     G+  CF VP     S        P + L
Sbjct: 187 PAAVLAEVVRAFRQQL----RLPFANGGNPEDGV--CFLVPAAWRRSSSTSQMPVPRMVL 240

Query: 396 HFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQ 455
           HF+ GA++ LP  NY         +CL +      SG     +GN   Q+  V YDL  +
Sbjct: 241 HFQ-GADLDLPRRNYVLDDHRRGRLCLLLAD----SGDDGSTIGNLVQQDMRVLYDLEAE 295

Query: 456 RLGFKQQLC 464
            L      C
Sbjct: 296 TLSIAPARC 304


>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
          Length = 492

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 143/351 (40%), Gaps = 52/351 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +S S GTPPQ++  +LD  S  VW  C+    C  C +       P  +S+     
Sbjct: 95  GMYVLSFSVGTPPQVVTGVLDITSDFVWMQCS---ACATCGADA-----PAATSAP---- 142

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETLNL 203
              P  +++          +D    T+  C      Y  +YG G    T G+   +    
Sbjct: 143 ---PFYAFLSF--------HDTRAPTTPPC-----GYSYVYGGGAANTTAGLLAVDAFAF 186

Query: 204 PNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
                   + GC+V +     G+ G GRG+ S  SQL + +FSY L     DD     S 
Sbjct: 187 ATVRADGVIFGCAVATEGDIGGVIGLGRGELSPVSQLQIGRFSYYLAP---DDAVDVGSF 243

Query: 264 I--LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
           I  LD+    + +          V+ P VA R + S+ YYV L  I V G+ + +     
Sbjct: 244 ILFLDDAKPRTSRA---------VSTPLVASRASRSL-YYVELAGIRVDGEDLAIPRGTF 293

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            L  DG+GG ++      TF+     +  A + V Q + ++   RA     L GL  C+ 
Sbjct: 294 DLQADGSGGVVLSITIPVTFL-----DAGAYKVVRQAMASKIELRAADGSEL-GLDLCYT 347

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
                T   P + L F GGA + L + NYF +       CLT++      G
Sbjct: 348 SESLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDG 398


>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
          Length = 431

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 105/392 (26%), Positives = 156/392 (39%), Gaps = 42/392 (10%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++ ++ GTPPQ +  +LDTGS L W  C   Y       S        L        C  
Sbjct: 56  TVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTRRSTRRWRGRDLPVPPF---CDT 112

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
           P        S  CR       A+S +      ++L+  G+      A    +   +    
Sbjct: 113 PP-------SNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFGCITSYSSTTA 165

Query: 210 NFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
               G     S    G+ G  RG  S  +Q    +F+YC+   +         L+ D+G 
Sbjct: 166 TNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVL----LLGDDGG 221

Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
                    L YTP +    +++   +   V Y V L  I VG   + +    LT D  G
Sbjct: 222 VAPP-----LNYTPLIE---ISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTG 273

Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPCFDVPG 384
            G T+VDSGT FTF+  + +  L  EF SQ    R     LG            CF  P 
Sbjct: 274 AGQTMVDSGTQFTFLLADAYAALKAEFTSQA---RLLLAPLGEPGFVFQGAFDACFRGPE 330

Query: 385 EK----TGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVT--DREASGG 433
            +    +G  PE+ L  + GAEV +  E    +V     GEG A  +  +T  + + +G 
Sbjct: 331 ARVAAASGLLPEVGLVLR-GAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGM 389

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            + ++G+   QN +VEYDL+N R+GF    C 
Sbjct: 390 SAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 421


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 113/387 (29%), Positives = 160/387 (41%), Gaps = 48/387 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTP + +  I DTGS L W  C    +  Y     I  F P  S+S   + 
Sbjct: 144 GNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVI--FDPSKSTSYSNIT 201

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
           C +  C+ +   +     C+    A++K C      Y + YG S  + G    E L +  
Sbjct: 202 CTSALCTQLSTATGNDPGCS----ASTKACI-----YGIQYGDSSFSVGYFSRERLTVTA 252

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTT 258
             ++ NFL GC   +       AG+ G GR   S   Q        FSYCL S      T
Sbjct: 253 TDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCLPS------T 306

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            +S+  L  G + + +    L YTPF    S   R   S +Y + +  I VGG ++ V  
Sbjct: 307 SSSTGHLSFGPAATGRY---LKYTPF----STISRG--SSFYGLDITAIAVGGVKLPVSS 357

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +      GG I+DSGT  T + P  +  L   F   M K  +      A  L+ L  
Sbjct: 358 STFS-----TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPS------AGELSILDT 406

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+D+ G K  S P ++  F GG  V LP +     V     VCL    + + S     I 
Sbjct: 407 CYDLSGYKVFSIPTIEFSFAGGVTVKLPPQGIL-FVASTKQVCLAFAANGDDS--DVTIY 463

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           GN Q +   V YD+   R+GF    CK
Sbjct: 464 GNVQQRTIEVVYDVGGGRIGFGAGGCK 490


>gi|388508700|gb|AFK42416.1| unknown [Lotus japonicus]
          Length = 440

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 103/402 (25%), Positives = 178/402 (44%), Gaps = 76/402 (18%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR-----LLGCQNPK 151
           TP   +   LD G   +W  C N    +Y SS+    F P    SS+     L GC   K
Sbjct: 59  TPLVPVKLTLDLGGGYLWVNCENR---QYVSST----FKPARCGSSQCSLFGLTGCSGDK 111

Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
                        C   P  T    +     +  +     T+G   ++ +++PN +   F
Sbjct: 112 I------------CGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFL---F 156

Query: 212 LVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTTRTSSL 263
           + G  V+    ++   G+AG GR + SLPSQ +       KF+ CL ++   D      +
Sbjct: 157 ICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGV----M 212

Query: 264 ILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAF----SVYYYVGLRRITVGGQRVRVWH 318
              +G  + ++  +  LTYTP + NP     +AF    SV Y++G++ + V  + V +  
Sbjct: 213 FFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSVKVSEKNVPLNT 272

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
             L+++++G GGT + +   +T M   +++ +AD FV          ++LGA  ++ + P
Sbjct: 273 TLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFV----------KSLGAPTVSPVAP 322

Query: 379 ---CF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV---TDR 428
              CF   D+   + G   P + L  + G E   P+    ++V     +CL  V   ++ 
Sbjct: 323 FGTCFATKDISFSRIGPGVPAIDLVLQNGVE--WPIIGANSMVQFDDVICLGFVDAGSNP 380

Query: 429 EAS------GGP----SIILGNFQMQNYYVEYDLRNQRLGFK 460
           +AS      GG     SI +G  Q++N  +++DL   RLGF+
Sbjct: 381 KASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGFR 422


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 107/450 (23%), Positives = 174/450 (38%), Gaps = 63/450 (14%)

Query: 35  RFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLS 94
           R   NPS  S    +    S+     H KNP    ++TTT           +G Y  S+ 
Sbjct: 53  RVKANPSPSSAAQKSLFPYSAHIFQQHTKNPAALRSSTTTL-------GRKFGEYYTSIK 105

Query: 95  FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
            G+P Q    I+DTGS L W  C     CK C+ S    +    S S + + C N +   
Sbjct: 106 LGSPGQEAILIVDTGSELTWLKC---LPCKVCAPSVDTIYDAARSVSYKPVTCNNSQL-- 160

Query: 155 IHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-LTEGIALSETLNLPNRI----- 207
                     C++    T   C +     +   YG G  + G   ++TL +   +     
Sbjct: 161 ----------CSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210

Query: 208 -IPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
            + +F  GC+     L     +GI G   GK +LP QL      KFS+C    +      
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCF-PDRSSHLNS 269

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           T  +   N     ++    + YT      S  +R     +Y+V L+ +++        H+
Sbjct: 270 TGVVFFGNAELPHEQ----VQYTSVALTNSELQRK----FYHVALKGVSINS------HE 315

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT-RALGAEALTGLRP 378
            + L R      I+DSG++F+        P   +     +K+R  + + L  ++   L  
Sbjct: 316 LVLLPR--GSVVILDSGSSFS----SFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGT 369

Query: 379 CFDVPGEKTG----SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
           CF V  +       + P L L F+ G  + +P       V              +    P
Sbjct: 370 CFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNP 429

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             ++GN+Q QN +VEYD++  R+GF +  C
Sbjct: 430 VNVIGNYQQQNLWVEYDIQRSRVGFARASC 459


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 166/391 (42%), Gaps = 61/391 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  GTP +    I DTGS + W  C      K C   K P   P  S+S + + 
Sbjct: 69  GDYVVTVGLGTPKKEFTLIFDTGSDITWTQC--EPCVKTCYKQKEPRLNPSTSTSYKNIS 126

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS----YLVLYGSG-LTEGIALSETL 201
           C +  C  +               A+ K  +Q C S    Y V YG G  + G   +ETL
Sbjct: 127 CSSALCKLV---------------ASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETL 171

Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
            L  + +  NFL GC   ++      AG+ G GR K +LPSQ        FSYCL +   
Sbjct: 172 TLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPAS-- 229

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             ++    L L    S S K T           P  A+ ++ + +Y + +  ++VGG++ 
Sbjct: 230 --SSSKGYLSLGGQVSKSVKFT-----------PLSADFDS-TPFYGLDITGLSVGGRQ- 274

Query: 315 RVWHKYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
                 L++D    + GT++DSGT  T ++P  +  L+  F + M    +Y    G    
Sbjct: 275 ------LSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT---DYPSTSGYSI- 324

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
                C+D     T   P++ + FKGG E+ + V      V     VCL    + + S  
Sbjct: 325 --FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS-- 380

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + I GN Q + Y V YD    R+GF    C
Sbjct: 381 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 105/417 (25%), Positives = 171/417 (41%), Gaps = 91/417 (21%)

Query: 81  ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
           + S  Y  + ++ S G PP     ++DTGS L W  C   + C  CS   +P F P  SS
Sbjct: 85  VPSPRYVVFLMNFSIGEPPIPQLAVMDTGSSLTWVMC---HPCSSCSQQSVPIFDPSKSS 141

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           +               + ++ C +CN   +   +     CP  +   GSG ++GI   E 
Sbjct: 142 T---------------YSNLSCSECNKCDVVNGE-----CPYSVEYVGSGSSQGIYAREQ 181

Query: 201 LNLPN-----RIIPNFLVGC----SVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSY 247
           L L         +P+ + GC    S+ S+  P     G+ G G G+ SL       KFSY
Sbjct: 182 LTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFG-KKFSY 240

Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           C+ + + +   + + L+L + ++     TT                N  +  YYV L  I
Sbjct: 241 CIGNLR-NTNYKFNRLVLGDKANMQGDSTT---------------LNVINGLYYVNLEAI 284

Query: 308 TVGGQRVRV----WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE---------F 354
           ++GG+++ +    + + +T   D N G I+DSG   T++    FE L+ E          
Sbjct: 285 SIGGRKLDIDPTLFERSIT---DNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLV 341

Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
           ++Q  K+  YT                V  +    FP +  HF  GA + L V + F   
Sbjct: 342 LAQQDKHNPYTLCYSG-----------VVSQDLSGFPLVTFHFAEGAVLDLDVTSMFIQT 390

Query: 415 GEGSAVCLTVV------TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            E +  C+ ++       D E+       +G    QNY V YDL   R+ F++  C+
Sbjct: 391 TE-NEFCMAMLPGNYFGDDYESFSS----IGMLAQQNYNVGYDLNRMRVYFQRIDCE 442


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 103/403 (25%), Positives = 168/403 (41%), Gaps = 62/403 (15%)

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSS 140
           + S G Y   +  G+PP+     +DTGS ++W  C    +C   +   IP   +  K SS
Sbjct: 71  ADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASS 130

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSE 199
           +S+ +GC++  CS+I    +Q   C        K C     SY V+YG G T +G  + +
Sbjct: 131 TSKNVGCEDAFCSFI----MQSETC-----GAKKPC-----SYHVVYGDGSTSDGDFVKD 176

Query: 200 TLNLPN-----RIIP---NFLVGCSVLSSRQPA-------GIAGFGRGKTSLPSQLNLDK 244
            + L       R  P     + GC    S Q         GI GFG+  TS+ SQL    
Sbjct: 177 NITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGG 236

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
               + SH  D+        +       + ++  +  TP V N          V+Y V L
Sbjct: 237 SVKRIFSHCLDNMNGGGIFAI------GEVESPVVKTTPLVPN---------QVHYNVIL 281

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS-QMVKNRN 363
           + + V G+ + +     +   +G+GGTI+DSGTT  ++   L+  L ++  + Q VK   
Sbjct: 282 KGMDVDGEPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHM 339

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
                          CF        +FP + LHF+   ++++   +Y   + E    C  
Sbjct: 340 VQETFA---------CFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRE-DMYCFG 389

Query: 424 VVTDREAS--GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +    +  G   I+LG+  + N  V YDL N+ +G+    C
Sbjct: 390 WQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 432


>gi|358347314|ref|XP_003637703.1| Basic 7S globulin [Medicago truncatula]
 gi|355503638|gb|AES84841.1| Basic 7S globulin [Medicago truncatula]
          Length = 454

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 105/417 (25%), Positives = 163/417 (39%), Gaps = 67/417 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           YS S+  GTP   +  ++D     +WF C + Y                 S++   + C 
Sbjct: 50  YSTSIKLGTPAVPLDLVIDIRERFLWFECDDSYN----------------STTYNPIQCG 93

Query: 149 NPKCSWIHHESIQCRDCNDEPLAT--SKNCTQICPSYLVLYGSGLTEGIALSETLNLP-- 204
             KC         C DC + P  T  + N   + P     +G     G    + L+ P  
Sbjct: 94  TKKCK--QARGTGCIDCTNHPFKTGCTNNTCGVEP--FNPFGGFFVSGDVGEDILSFPRV 149

Query: 205 --------NRIIPNFLVGCSVLS-----------SRQPAGIAGFGRGKTSLPSQL----N 241
                   N  +P F+  C               S+   G+ G  R   SLP+Q+     
Sbjct: 150 TSDGRRVTNVRVPRFISSCVYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIATRFK 209

Query: 242 LD-KFSYCLLSHKFDDTTRTSSLILDNG----SSHSDKKTTGLTYTPFVNNPSVAE---R 293
           LD KF+ CL S    +     SL +  G     S+ D  +  L YTP + N         
Sbjct: 210 LDRKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGPIFD 269

Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
           N  S  Y++ ++ I V    V      L++++ G GGT + +    T +   ++ PL + 
Sbjct: 270 NFPSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPLLNA 329

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVEN 409
           FV +  + R   R    +A+     CFD   +     G + P + L  KGG E  +   N
Sbjct: 330 FVKK-AEIRKIKR---VKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKGGVEWRIFGAN 385

Query: 410 YFAVVGEGSAVCLTVVTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
               V E + +CL  V       GP   SII+G  Q+++  VE+DL + +LGF   L
Sbjct: 386 SMVKVNE-NVLCLGFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFSSSL 441


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 104/401 (25%), Positives = 166/401 (41%), Gaps = 59/401 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  G P +     +DTGS ++W  C+    C  C +S     ++ SF P  SS+
Sbjct: 3   GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCS---PCTGCPTSSGLNIQLESFNPDSSST 59

Query: 142 SRLLGCQNPKCS--WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALS 198
           +  + C + +C+  +   E+I C+  N +    S  C      Y   YG G  T G  +S
Sbjct: 60  ASRITCSDDRCTAGFQTGEAI-CQTSNSQ----SSPC-----GYTFTYGDGSGTSGYYVS 109

Query: 199 ETL----NLPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLD 243
           +T+     + N    N     + GCS         + R   GI GFG+ + S+ SQLN  
Sbjct: 110 DTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSL 169

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
             S  + SH    +     +++       +    GL YTP V  PS         +Y + 
Sbjct: 170 GVSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLN 215

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           L  I V GQ++ +     T       GTIVDSGTT  ++A   ++P      + +  +  
Sbjct: 216 LESIAVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR 273

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
              + G++       CF        SFP + L+F GG  +++  ENY           L 
Sbjct: 274 SLVSKGSQ-------CFITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLW 326

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +  +   G    ILG+  +++    YDL N R+G+    C
Sbjct: 327 CIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 367


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 105/391 (26%), Positives = 157/391 (40%), Gaps = 61/391 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + LS GTPP  I    DTGS LVWF C     C  C   + P F P+ SSS   + C 
Sbjct: 60  YLMELSIGTPPIKIYAEADTGSDLVWFQCI---PCTKCYKQQNPMFDPRSSSSYTNITCG 116

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
              C+ +           D  L ++   T  C        + +T+G+   ETL L +   
Sbjct: 117 TESCNKL-----------DSSLCSTDQKT--CNYTYSYADNSITQGVLAQETLTLTSTTG 163

Query: 207 ---IIPNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHK 253
                   + GC    S  + R+  G+ G GRG  SL SQ+        + FS CL+   
Sbjct: 164 EPVAFQGIIFGCGHNNSGFNDRE-MGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFN 222

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
            D +  TS +    G   S+    G   TP ++             Y+  L  I+V    
Sbjct: 223 TDPSI-TSQMNFGKG---SEVLGNGTVSTPLISKDGTG--------YFATLLGISVEDIN 270

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           +  +    +L     G  ++DSGTT T++  E +  L ++     V+N+    AL    +
Sbjct: 271 LP-FSNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQ-----VRNK---VALEPFRI 321

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
            G   C+  P    G  P L +HF+GG  +  P + +  V  +    C  V    E    
Sbjct: 322 DGYELCYQTPTNLNG--PTLTIHFEGGDVLLTPAQMFIPV--QDDNFCFAVFDTNEE--- 374

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +  GN+   NY + +DL  Q + FK   C
Sbjct: 375 -YVTYGNYAQSNYLIGFDLERQVVSFKATDC 404


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 100/387 (25%), Positives = 159/387 (41%), Gaps = 50/387 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  + + GTPPQ    I+D    LVW  C+    C+ C    +P F+P  SS+ +   C 
Sbjct: 45  YVANFTIGTPPQPASAIVDVAGELVWTQCS---ACRRCFKQDLPVFVPNASSTFKPEPCG 101

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
              C     ESI  R C+ +  +     TQ+          G T G A ++T  +    +
Sbjct: 102 TAVC-----ESIPTRSCSGDVCSYKGPPTQL---------RGNTSGFAATDTFAIGTATV 147

Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
                GC V S       P+G  G GR   SL +Q+ L +FSYCL      +T ++S L 
Sbjct: 148 -RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPR---NTGKSSRLF 203

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L  GSS     +   +  PF+     +  +  S YY + L  I  G   +          
Sbjct: 204 L--GSSAKLAGSESTSTAPFIKT---SPDDDGSNYYLLSLDAIRAGNTTIATAQ------ 252

Query: 325 RDGNGGTIV-DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF-DV 382
              +GG +V  + + F+ +    ++    + V++ V                L  CF   
Sbjct: 253 ---SGGILVMHTVSPFSLLVDSAYKAF-KKAVTEAVGGAAAPPMATPPQPFDL--CFKKA 306

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT----DREASGGPSII 437
            G    + P+L   F+G A +T+P   Y   VG E    C  +++    +R    G S +
Sbjct: 307 AGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVS-V 365

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           LG+ Q ++ +  YDL+ + L F+   C
Sbjct: 366 LGSLQQEDVHFLYDLKKETLSFEPADC 392


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 161/386 (41%), Gaps = 52/386 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  G+PP+    ++D+GS +VW  C     C  C     P F P  S++   + 
Sbjct: 135 GEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQ---PCSECYQQSDPVFDPAGSATYAGIS 191

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
           C +  C  + +       CND        C      Y V YG G  T G    ETL    
Sbjct: 192 CDSSVCDRLDNAG-----CND------GRC-----RYEVSYGDGSYTRGTLALETLTFGR 235

Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
            +I N  +GC  ++       AG+ G G G  S   QL       FSYCL+S     T  
Sbjct: 236 VLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRG---TES 292

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           T +L    G+        G  + P + NP          +YYVGL  + VGG RV +  +
Sbjct: 293 TGTLEFGRGA-----MPVGAAWVPLIRNPRAPS------FYYVGLSGLGVGGIRVPIPEQ 341

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              L   G GG ++D+GT  T +    +E   D F+ Q     N  R   ++ ++    C
Sbjct: 342 IFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTA---NLPR---SDRVSIFDTC 395

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYF-AVVGEGSAVCLTVVTDREASGGPSIIL 438
           +++ G  +   P +  +F GG  +TLP  N+   V GEG+  C        AS     I+
Sbjct: 396 YNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGT-FCFAFA----ASASGLSII 450

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q +   +  D  N  +GF   +C
Sbjct: 451 GNIQQEGIQISIDGSNGFVGFGPTIC 476


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 106/400 (26%), Positives = 167/400 (41%), Gaps = 65/400 (16%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
           + HSY  +  +L  GTP +    I+DTGS + + PC +   C +C       F P  S++
Sbjct: 8   TRHSY--FYTTLKLGTPERTFSVIIDTGSTITYIPCKD---CSHCGKHTAEWFDPDKSTT 62

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
           ++ L C +P C+     +  C  CN++    S+   +   S          EG  + +T 
Sbjct: 63  AKKLACGDPLCNC---GTPSCT-CNNDRCYYSRTYAERSSS----------EGWMIEDTF 108

Query: 202 NLPNRIIPNFLV-GCSVLSS----RQPA-GIAGFGRGKTSLPSQLNL-----DKFSYCLL 250
             P+   P  LV GC    +    RQ A GI G G    +  SQL       D FS C  
Sbjct: 109 GFPDSDSPVRLVFGCENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLC-F 167

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
            +  D       + L  G++          YTP + +  +        YY V +  ITV 
Sbjct: 168 GYPKDGILLLGDVTLPEGAN--------TVYTPLLTHLHLH-------YYNVKMDGITVN 212

Query: 311 GQRVRVWHKYLTLDR---DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           GQ        L  D    D   GT++DSGTTFT++  + F+ +A + V   V+ +     
Sbjct: 213 GQT-------LAFDASVFDRGYGTVLDSGTTFTYLPTDAFKAMA-KAVGDYVEKKGLQST 264

Query: 368 LGAEALTG---LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
            GA+        +   D   +    FP  +  F GGA++TLP   Y   + + +  CL +
Sbjct: 265 PGADPQYNDICWKGAPDQFKDLDKYFPPAEFVFGGGAKLTLPPLRYL-FLSKPAEYCLGI 323

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +    G    ++G   +++  V YD RN ++GF    C
Sbjct: 324 FDN----GNSGALVGGVSVRDVVVTYDRRNSKVGFTTMAC 359


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 155/382 (40%), Gaps = 52/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTP Q +   +DT S + W PC     C  CSS+    F    S++ + LGCQ
Sbjct: 36  YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNG---CLGCSSTL---FNSPASTTYKSLGCQ 89

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
             +C  +   +     C             +C   L   GS L   ++  +T+ L    +
Sbjct: 90  AAQCKQVPKPT-----CGGG----------VCSFNLTYGGSSLAANLS-QDTITLATDAV 133

Query: 209 PNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
           P +  GC        L ++   G+        S    L    FSYCL S  F     + S
Sbjct: 134 PGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGS 191

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L  G     K+   + YTP + NP    R +    Y+V L  + VG + V V     T
Sbjct: 192 LRL--GPVGQPKR---IKYTPLLKNP---RRPSL---YFVNLMAVRVGRRVVDVPPGSFT 240

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +    +  + D F +++ +N      L   +L G   C+ V
Sbjct: 241 FNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRN------LTVTSLGGFDTCYTV 294

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F G   VTLP +N       GS  CL +    +       ++ N Q
Sbjct: 295 PIAA----PTITFMFTG-MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQ 349

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ + YD+ N RLG  ++LC
Sbjct: 350 QQNHRLLYDVPNSRLGVARELC 371


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 108/394 (27%), Positives = 169/394 (42%), Gaps = 47/394 (11%)

Query: 88  GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           G+ ++LS G+PP     ++DTGS L+W  C     C  C       F P  S S + LGC
Sbjct: 103 GFLVNLSIGSPPVTQLVVVDTGSSLLWVQCL---PCINCFQQSTSWFDPLKSVSFKTLGC 159

Query: 148 QNPKCSWIHHESIQCRDCNDEPLAT---SKNCTQICPSYLVLYGSGLTEGI-----ALSE 199
             P  ++I+    +C   N           + +Q   +   L    L EG      A+S 
Sbjct: 160 GFPGYNYIN--GYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAIST 217

Query: 200 TLNLPNRIIPNFLVGC---SVLSSRQPAGIAGFGRGK---TSLPSQLNLDKFSYCLLSHK 253
            ++   +   N   GC   ++ ++   A    FG G     ++ +QL  +KFSYC+    
Sbjct: 218 QISKIKK--SNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG-NKFSYCI--GD 272

Query: 254 FDDTTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
            ++   T + L+L  GS              ++   S   +  F  +YYV L+ I+VG +
Sbjct: 273 INNPLYTHNHLVLGQGS--------------YIEGDSTPLQIHFG-HYYVTLQSISVGSK 317

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            +++      +  DG+GG ++DSG T+T +A   FE L DE V  M       R      
Sbjct: 318 TLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLM--KGLLERIPTQRK 375

Query: 373 LTGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
             GL  CF  V       FP +  HF GGA++ L   + F   G G   CL ++      
Sbjct: 376 FEGL--CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHG-GDRFCLAILPSNSEL 432

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
              S+I G    QNY V +DL   ++ F++  C+
Sbjct: 433 LNLSVI-GILAQQNYNVGFDLEQMKVFFRRIDCQ 465


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 104/405 (25%), Positives = 168/405 (41%), Gaps = 64/405 (15%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-SFIPKL-SSS 141
           +S G Y   +  GTPP+     +DTGS ++W  C     C   S   I  +F   + SS+
Sbjct: 73  NSVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSST 132

Query: 142 SRLLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-------LTE 193
           + L+ C +P C S +   + +C          S    Q   SY   YG G       +++
Sbjct: 133 AALIPCSDPICTSRVQGAAAEC----------SPRVNQC--SYTFQYGDGSGTSGYYVSD 180

Query: 194 GIALSETLNLPNRI--IPNFLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNL-- 242
            +  S  +  P  +      + GCS+  S       +   GI GFG G  S+ SQL+   
Sbjct: 181 AMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRG 240

Query: 243 ---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
                FS+CL              IL+            + Y+P V  PS         +
Sbjct: 241 ITPKVFSHCLKGDGDGGGVLVLGEILE----------PSIVYSPLV--PS-------QPH 281

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           Y + L+ I V GQ + +     ++  +  GGTIVD GTT  ++  E ++PL     + + 
Sbjct: 282 YNLNLQSIAVNGQLLPINPAVFSISNN-RGGTIVDCGTTLAYLIQEAYDPLVTAINTAVS 340

Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
           ++   T + G +       C+ V       FP + L+F+GGA + L  E Y    G    
Sbjct: 341 QSARQTNSKGNQ-------CYLVSTSIGDIFPSVSLNFEGGASMVLKPEQYLMHNGYLDG 393

Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +  +  ++   G S ILG+  +++  V YD+  QR+G+    C
Sbjct: 394 AEMWCIGFQKFQEGAS-ILGDLVLKDKIVVYDIAQQRIGWANYDC 437


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 118/483 (24%), Positives = 202/483 (41%), Gaps = 71/483 (14%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSR------FHTNPSQDSYQNLNSLVSS 54
           M SY + L  S + F  L  I  SS+ S  F   +         +P+  S++     V  
Sbjct: 1   MNSYSATLLCSLLGFNLLAVILSSSVDSRDFDYQQRSVILPLFISPTNSSHRR----VLD 56

Query: 55  SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
              R  H++N     ++        ++ ++ Y  Y+  L  G+PPQ    I+DTGS + +
Sbjct: 57  RDHRLRHLQNLVKPHSSNARMRLHDDLLTNGY--YTTRLWIGSPPQEFALIVDTGSTVTY 114

Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
            PC+N   C  C + + P F P+LSS+ + + C N  C+      +QC           +
Sbjct: 115 VPCSN---CVQCGNHQDPRFQPELSSTYQPVKC-NADCN-CDENGVQC--------TYER 161

Query: 175 NCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS-----RQPAGIAGF 229
              ++  S  VL    ++ G    E+  +P R +     GC  + S     ++  GI G 
Sbjct: 162 RYAEMSTSSGVLAEDVMSFG---KESELVPQRAV----FGCETMESGDLYTQRADGIMGL 214

Query: 230 GRGKTSLPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
           GRG  S+  QL    +   S+ L     D      +++L   SS       G+ ++   +
Sbjct: 215 GRGTLSVMDQLVGKGVVSNSFSLCYGGMD--VGGGAMVLGGISS-----PPGMVFSH--S 265

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
           +PS       S YY + L+ I V G+ +++  +      DG  G I+DSGTT+ +   + 
Sbjct: 266 DPSR------SPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAYFPEKA 315

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAE 402
           +    D     ++K  ++ + +          CF   G    E    FPE+ + F  G +
Sbjct: 316 YYAFKD----AIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQK 371

Query: 403 VTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
           ++L  ENY F       A CL +  +       + +LG   ++N  V Y+  N  +GF +
Sbjct: 372 ISLSPENYLFRHTKVSGAYCLGIFKN---GNDQTTLLGGIIVRNTLVTYNRENSTIGFWK 428

Query: 462 QLC 464
             C
Sbjct: 429 TNC 431


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 104/391 (26%), Positives = 155/391 (39%), Gaps = 44/391 (11%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           ++SL+ GTPPQ +  +LDTGS L W  C         + S    F P+ S++   + C +
Sbjct: 62  TVSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAADS----FRPRASATFAAVPCGS 117

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
            +CS         RD    P  +    ++ C   L       ++G   ++   + +    
Sbjct: 118 ARCS--------SRDLPAPP--SCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPL 167

Query: 210 NFLVGC-SVLSSRQP-----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
               GC S      P     AG+ G  RG  S  +Q +  +FSYC+      D      L
Sbjct: 168 RSAFGCMSAAYDSSPDAVATAGLLGMNRGALSFVTQASTRRFSYCI-----SDRDDAGVL 222

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
           +L     HSD     L YTP    P+        V Y V L  I VGG+ + +    L  
Sbjct: 223 LL----GHSDLPFLPLNYTPLYQ-PTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAP 277

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPCF 380
           D  G G T+VDSGT FTF+  + +  +  EF+ Q    +    AL   +         CF
Sbjct: 278 DHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQ---TKPLLPALEDPSFAFQEAFDTCF 334

Query: 381 DVPGEK---TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV----CLTVVTDREASGG 433
            VP  +   +   P + L F G           + V GE        CLT   + +    
Sbjct: 335 RVPKGRPPPSARLPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLT-FGNADMVPL 393

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + ++G+    N +VEYDL   R+G     C
Sbjct: 394 TAYVIGHHHQMNLWVEYDLERGRVGLAPVKC 424


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 118/483 (24%), Positives = 202/483 (41%), Gaps = 71/483 (14%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSR------FHTNPSQDSYQNLNSLVSS 54
           M SY + L  S + F  L  I  SS+ S  F   +         +P+  S++     V  
Sbjct: 1   MNSYSATLLCSLLGFNLLAVILSSSVDSRDFDYQQRSVILPLFISPTNSSHRR----VLD 56

Query: 55  SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
              R  H++N     ++        ++ ++ Y  Y+  L  G+PPQ    I+DTGS + +
Sbjct: 57  RDHRLRHLQNLVKPHSSNARMRLHDDLLTNGY--YTTRLWIGSPPQEFALIVDTGSTVTY 114

Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
            PC+N   C  C + + P F P+LSS+ + + C N  C+      +QC           +
Sbjct: 115 VPCSN---CVQCGNHQDPRFQPELSSTYQPVKC-NADCN-CDENGVQC--------TYER 161

Query: 175 NCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS-----RQPAGIAGF 229
              ++  S  VL    ++ G    E+  +P R +     GC  + S     ++  GI G 
Sbjct: 162 RYAEMSTSSGVLAEDVMSFG---KESELVPQRAV----FGCETMESGDLYTQRADGIMGL 214

Query: 230 GRGKTSLPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
           GRG  S+  QL    +   S+ L     D      +++L   SS       G+ ++   +
Sbjct: 215 GRGTLSVMDQLVGKGVVSNSFSLCYGGMD--VGGGAMVLGGISS-----PPGMVFSH--S 265

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
           +PS       S YY + L+ I V G+ +++  +      DG  G I+DSGTT+ +   + 
Sbjct: 266 DPSR------SPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAYFPEKA 315

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAE 402
           +    D     ++K  ++ + +          CF   G    E    FPE+ + F  G +
Sbjct: 316 YYAFKD----AIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQK 371

Query: 403 VTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
           ++L  ENY F       A CL +  +       + +LG   ++N  V Y+  N  +GF +
Sbjct: 372 ISLSPENYLFRHTKVSGAYCLGIFKN---GNDQTTLLGGIIVRNTLVTYNRENSTIGFWK 428

Query: 462 QLC 464
             C
Sbjct: 429 TNC 431


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 160/386 (41%), Gaps = 65/386 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +++  GTP + +P I DTGS L+W  C     CK C   K+P F P  S+S + L C 
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCK---PCKAC-YPKVPVFDPTKSASFKGLPCS 187

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNLPNRI 207
           +  C  I                +S  CT     YL  Y  +  + G   +ET++  +  
Sbjct: 188 SKLCQSIRQG------------CSSPKCT-----YLTAYVDNSSSTGTLATETISFSHLK 230

Query: 208 --IPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLN--LDK-FSYCLLSHKFDDTTR 259
               N L+GCS   S +    +GI G  R   SL SQ     DK FSYC+ S     T  
Sbjct: 231 YDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIPS-----TPG 285

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
           ++  +   G   +D     + ++P         + A S  Y + +  I+VGG+++ +   
Sbjct: 286 STGHLTFGGKVPND-----VRFSP-------VSKTAPSSDYDIKMTGISVGGRKLLIDAS 333

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
              +       + +DSG   T + P+ +  L   F   M   + Y   L  +    L  C
Sbjct: 334 AFKI------ASTIDSGAVLTRLPPKAYSALRSVFREMM---KGYP-LLDQDDF--LDTC 381

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT-DREASGGPSIIL 438
           +D     T + P + + F+GG E+ + V      V      CL     D E S     I 
Sbjct: 382 YDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEVS-----IF 436

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GNFQ + Y V +D   +R+GF    C
Sbjct: 437 GNFQQKTYTVVFDGAKERIGFAPGGC 462


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 109/390 (27%), Positives = 164/390 (42%), Gaps = 73/390 (18%)

Query: 95  FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
            GTPPQ    I+DTGS + + PC +   C  C + + P F P LS +   + C NP C+ 
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNS---CDQCGNHQDPKFQPDLSDTYHPVKC-NPDCT- 56

Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVG 214
              E+ QC    +   A   + + I    LV +G+       +SE    P R +     G
Sbjct: 57  CDTENDQC--TYERQYAEMSSSSGILGEDLVSFGN-------MSEL--KPQRAV----FG 101

Query: 215 CS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLI 264
           C       L S+   GI G GRG  S+  QL       D FS C    +        +++
Sbjct: 102 CENAETGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----VGGGAMV 157

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L   S  SD           V + S  +R   S YY + LR + V G+++ +  +     
Sbjct: 158 LGQISPPSD----------MVFSHSDPDR---SPYYNIELRGLHVAGKKLDINPQVF--- 201

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP-----C 379
            DG  GTI+DSGTT+ ++    F P      S++          G + + G  P     C
Sbjct: 202 -DGKHGTILDSGTTYAYLPEAAFLPFIQAITSEL---------HGLKQIRGPDPNYNDVC 251

Query: 380 FDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGP 434
           F   G    E   +FP + + F  G + +L  ENY F       A CL V  + +    P
Sbjct: 252 FSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGK---DP 308

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + +LG   ++N  V YD  + ++GF +  C
Sbjct: 309 TTLLGGIVVRNTLVTYDREHSKVGFWKTNC 338


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 104/401 (25%), Positives = 166/401 (41%), Gaps = 59/401 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  G P +     +DTGS ++W  C+    C  C +S     ++ SF P  SS+
Sbjct: 89  GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCS---PCTGCPTSSGLNIQLESFNPDSSST 145

Query: 142 SRLLGCQNPKCS--WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALS 198
           +  + C + +C+  +   E+I C+  N +    S  C      Y   YG G  T G  +S
Sbjct: 146 ASRITCSDDRCTAGFQTGEAI-CQTSNSQ----SSPC-----GYTFTYGDGSGTSGYYVS 195

Query: 199 ETL----NLPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLD 243
           +T+     + N    N     + GCS         + R   GI GFG+ + S+ SQLN  
Sbjct: 196 DTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSL 255

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
             S  + SH    +     +++       +    GL YTP V  PS         +Y + 
Sbjct: 256 GVSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLN 301

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           L  I V GQ++ +     T       GTIVDSGTT  ++A   ++P      + +  +  
Sbjct: 302 LESIAVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR 359

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
              + G++       CF        SFP + L+F GG  +++  ENY           L 
Sbjct: 360 SLVSKGSQ-------CFITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLW 412

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +  +   G    ILG+  +++    YDL N R+G+    C
Sbjct: 413 CIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 453


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 108/402 (26%), Positives = 171/402 (42%), Gaps = 64/402 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   L  GTPP+     +DTGS ++W  C +   C   S   IP   F P  S ++ L
Sbjct: 50  GLYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASL 109

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
           + C + +CS     S        + + +++N   +C  Y   YG G  T G  +S+ L+ 
Sbjct: 110 ISCSDQRCSLGLQSS--------DSVCSAQN--NLC-GYNFQYGDGSGTSGYYVSDLLHF 158

Query: 203 ---LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLD----- 243
              L   ++ N     + GCS L       S R   GI GFG+   S+ SQL        
Sbjct: 159 DTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPR 218

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
            FS+CL   K DD+     L+L       +     + YTP V  PS         +Y + 
Sbjct: 219 AFSHCL---KGDDSG-GGILVL------GEIVEPNIVYTPLV--PS-------QPHYNLN 259

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN-R 362
           ++ I+V GQ + +           + GTI+DSGTT  ++A   ++P      S +  + R
Sbjct: 260 MQSISVNGQTLAIDPS--VFGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIVSPSVR 317

Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
            Y        L+    C+ +       FP++ L+F GGA + L  ++Y           L
Sbjct: 318 PY--------LSKGNHCYLISSSINDIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAAL 369

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +  ++  G    ILG+  +++    YD+ NQR+G+    C
Sbjct: 370 WCIGFQKIQGQGITILGDLVLKDKIFVYDIANQRIGWANYDC 411


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 104/401 (25%), Positives = 166/401 (41%), Gaps = 59/401 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  G P +     +DTGS ++W  C+    C  C +S     ++ SF P  SS+
Sbjct: 87  GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCS---PCTGCPTSSGLNIQLESFNPDSSST 143

Query: 142 SRLLGCQNPKCS--WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALS 198
           +  + C + +C+  +   E+I C+  N +    S  C      Y   YG G  T G  +S
Sbjct: 144 ASRITCSDDRCTAGFQTGEAI-CQTSNSQ----SSPC-----GYTFTYGDGSGTSGYYVS 193

Query: 199 ETL----NLPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLD 243
           +T+     + N    N     + GCS         + R   GI GFG+ + S+ SQLN  
Sbjct: 194 DTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSL 253

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
             S  + SH    +     +++       +    GL YTP V  PS         +Y + 
Sbjct: 254 GVSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLN 299

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           L  I V GQ++ +     T       GTIVDSGTT  ++A   ++P      + +  +  
Sbjct: 300 LESIAVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR 357

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
              + G++       CF        SFP + L+F GG  +++  ENY           L 
Sbjct: 358 SLVSKGSQ-------CFITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLW 410

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +  +   G    ILG+  +++    YDL N R+G+    C
Sbjct: 411 CIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 451


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 155/391 (39%), Gaps = 67/391 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  GTP      +LDTGS +VW P              +P  +  +   S    
Sbjct: 120 GEYFAQVGVGTPATTALMVLDTGSDVVWAPV-----------RALPPLLRAVRQGSSTGA 168

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
              P   W     I CR  +       +N       Y V YG G +T G   SETL    
Sbjct: 169 APAPTPRWNCVAPI-CRRLDSAGCDRRRNSCL----YQVAYGDGSVTAGDFASETLTFAR 223

Query: 206 RI-IPNFLVGCSVLSSRQPAGIAG-----FGRGKTSLPSQLNLD---KFSYCLLSHKFDD 256
              +    +GC      +   IA       GRG+ S PSQ+       FSYCL+      
Sbjct: 224 GARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLV------ 275

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR- 315
                    D  SS   + +     TP             + +YYV L   +VGG RV+ 
Sbjct: 276 ---------DRTSSRRARPSRRWGGTP-----------RMATFYYVHLLGFSVGGARVKG 315

Query: 316 VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
           V    L L+   G GG I+DSGT+ T +A  ++E + D F +  V  R     +     +
Sbjct: 316 VSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLR-----VSPGGFS 370

Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGG 433
               C+++ G +    P + +H  GGA V LP ENY   V      C  +  TD    GG
Sbjct: 371 LFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD----GG 426

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            SII GN Q Q + V +D   QR+GF  + C
Sbjct: 427 VSII-GNIQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 107/402 (26%), Positives = 172/402 (42%), Gaps = 63/402 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
           G Y   +  GTPP+     +DTGS ++W  C +   C   S  +I    F P+ SS+S L
Sbjct: 75  GLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSSL 134

Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
           + C + +C S +      C   N++       CT     Y   YG G  T G  +S+ ++
Sbjct: 135 ISCSDRRCRSGVQTSDASCSSQNNQ-------CT-----YTFQYGDGSGTSGYYVSDLMH 182

Query: 203 --------LPNRIIPNFLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
                   L      + + GCS+L       S R   GI GFG+   S+ SQL+L   + 
Sbjct: 183 FAGIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAP 242

Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
            + SH    D +    L+L       +     + Y+P V +           +Y + L+ 
Sbjct: 243 RVFSHCLKGDNSGGGVLVL------GEIVEPNIVYSPLVQSQP---------HYNLNLQS 287

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I+V GQ V +           N GTIVDSGTT  ++A E + P  +   + + ++     
Sbjct: 288 ISVNGQIVPIAPAVFATSN--NRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVL 345

Query: 367 ALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFA---VVGEGSAVCL 422
           + G +       C+ +        FP++ L+F GGA + L  ++Y      +GEGS  C+
Sbjct: 346 SRGNQ-------CYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCI 398

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                +   G    ILG+  +++    YDL  QR+G+    C
Sbjct: 399 GF---QRIPGQSITILGDLVLKDKIFVYDLAGQRIGWANYDC 437


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 113/402 (28%), Positives = 164/402 (40%), Gaps = 81/402 (20%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ    I+DTGS + + PC+    C++C S + P F P+ S +     
Sbjct: 91  GYYTARLWIGTPPQRFALIVDTGSTVTYVPCST---CRHCGSHQDPKFRPEDSET----- 142

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE------- 199
            Q  KC+W      QC   ND      K CT     Y   Y    T   AL E       
Sbjct: 143 YQPVKCTW------QCNCDNDR-----KQCT-----YERRYAEMSTSSGALGEDVVSFGN 186

Query: 200 -TLNLPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYC 248
            T   P R I     GC       + +++  GI G GRG  S+  QL       D FS C
Sbjct: 187 QTELSPQRAI----FGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLC 242

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
                          I       S       T +  V +P          YY + L+ I 
Sbjct: 243 YGGMGVGGGAMVLGGI-------SPPADMVFTRSDPVRSP----------YYNIDLKEIH 285

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           V G+R+ +  K      DG  GT++DSGTT+ ++    F  LA  F   ++K  +  + +
Sbjct: 286 VAGKRLHLNPKVF----DGKHGTVLDSGTTYAYLPESAF--LA--FKHAIMKETHSLKRI 337

Query: 369 GAEALTGLRPCF-----DVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCL 422
                     CF     DV  + + SFP +++ F  G +++L  ENY F       A CL
Sbjct: 338 SGPDPRYNDICFSGAEIDV-SQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCL 396

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            V ++      P+ +LG   ++N  V YD  + ++GF +  C
Sbjct: 397 GVFSN---GNDPTTLLGGIVVRNTLVMYDREHTKIGFWKTNC 435


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 108/388 (27%), Positives = 156/388 (40%), Gaps = 67/388 (17%)

Query: 89  YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           Y + +SFGTP  PQ++  ++DTGS + W       QCK CSS      K P + P  SS+
Sbjct: 79  YVVRVSFGTPAVPQVV--VIDTGSDVSWL------QCKPCSSGQCFPQKDPLYDPSHSST 130

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
              + C +  C  +        D       + K C      + + Y  G  T G    + 
Sbjct: 131 YSAVPCASDVCKKL------AADAYGSGCTSGKQC-----GFAISYADGTSTVGAYSQDK 179

Query: 201 LNL-PNRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
           L L P  I+ NF  GC            G+ G GR + SL ++     FSYCL S     
Sbjct: 180 LTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPSV---- 234

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           +++   L L  G     K  +G  +TP    P    +  FS    V L  I VGG+++ +
Sbjct: 235 SSKPGFLALGAG-----KNPSGFVFTPMGTVPG---QPTFST---VTLAGINVGGKKLDL 283

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                +      GG IVDSGT  T +    +  L   F   M   R             L
Sbjct: 284 RPSAFS------GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNG-------DL 330

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C+++ G K    P++ L F GGA + L V N   V G     CL          G + 
Sbjct: 331 DTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG-----CLAFA--ESGPDGSAG 383

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +LGN   + + V +D    + GF+ + C
Sbjct: 384 VLGNVNQRAFEVLFDTSTSKFGFRAKAC 411


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 122/486 (25%), Positives = 194/486 (39%), Gaps = 72/486 (14%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLV 52
           MA       L+ +  + L +I         FS+   H +        P++  +Q + + V
Sbjct: 1   MAMITRYCSLALVLLWCLYNISFLKANDGGFSVEMIHRDSSRSPLYRPTETPFQRVANAV 60

Query: 53  SSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHL 112
             S+ R  H K    K   +T +  +T ++S   G Y +  S G+PP  +  I+DTGS +
Sbjct: 61  RRSINRGNHFK----KAFVSTDSAESTVVASQ--GEYLMRYSVGSPPFQVLGIVDTGSDI 114

Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
           +W  C     C+ C     P F P  S + + L C +  C     ES++   C+ +    
Sbjct: 115 LWLQCE---PCEDCYKQTTPIFDPSKSKTYKTLPCSSNTC-----ESLRNTACSSD---- 162

Query: 173 SKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI-----IPNFLVGCSVLSSRQPAGI 226
                 +C  Y + YG G  ++G    ETL L +        P  ++GC   +       
Sbjct: 163 -----NVC-EYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEE 216

Query: 227 AGFGRGKTSLPSQLNLD-------KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
                G    P  L          KFSYCL +  F ++  +S L   + +  S + T   
Sbjct: 217 GSGIVGLGGGPVSLISQLSSSIGGKFSYCL-APIFSESNSSSKLNFGDAAVVSGRGTVST 275

Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
              P              V+Y++ L   +VG  R+       +    G+G  I+DSGTT 
Sbjct: 276 PLDPLNGQ----------VFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTL 325

Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
           T +  E +  L +  VS ++K     RA     L  L  C+    ++    P +  HFK 
Sbjct: 326 TLLPQEDYLNL-ESAVSDVIK---LERARDPSKLLSL--CYKTTSDEL-DLPVITAHFK- 377

Query: 400 GAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
           GA+V L P+  +  V  E   VC   ++ +  +     I GN   QN  V YDL  + + 
Sbjct: 378 GADVELNPISTFVPV--EKGVVCFAFISSKIGA-----IFGNLAQQNLLVGYDLVKKTVS 430

Query: 459 FKQQLC 464
           FK   C
Sbjct: 431 FKPTDC 436


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 106/387 (27%), Positives = 160/387 (41%), Gaps = 57/387 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
           G Y   +  GTP +    ++DTGS L W  C+    C   C     P F P+ SSS   +
Sbjct: 119 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCS---PCLVSCHRQSGPVFNPRSSSSYASV 175

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
            C  P+C  +   ++    C+     TS  C      Y   YG S  + G    +T++  
Sbjct: 176 SCSAPQCDALTTATLNPSTCS-----TSNVCI-----YQASYGDSSFSVGYLSKDTVSFG 225

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
           +  +PNF  GC   +     Q AG+ G  R K SL  QL       FSYCL +       
Sbjct: 226 STSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGY 285

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV-W 317
            +      N   +S        YTP      +A+ +     Y++ +  ITV G+ + V  
Sbjct: 286 LSIGSY--NPGQYS--------YTP------MAKSSLDDSLYFIKMTGITVAGKPLSVSA 329

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
             Y +L       TI+DSGT  T +  +++  L+      M   +   R   A A + L 
Sbjct: 330 SAYSSLP------TIIDSGTVITRLPTDVYSALSKAVAGAM---KGTPR---ASAFSILD 377

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            CF     +    P++ + F GGA + L   N    V + +  CL     R A+     I
Sbjct: 378 TCFQGQASRL-RVPQVSMAFAGGAALKLKATNLLVDV-DSATTCLAFAPARSAA-----I 430

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +GN Q Q + V YD++N ++GF    C
Sbjct: 431 IGNTQQQTFSVVYDVKNSKIGFAAGGC 457


>gi|388493426|gb|AFK34779.1| unknown [Medicago truncatula]
          Length = 454

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 105/417 (25%), Positives = 163/417 (39%), Gaps = 67/417 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           YS S+  GTP   +  ++D     +WF C + Y                 S++   + C 
Sbjct: 50  YSTSIKLGTPAVPLDLVIDIRERFLWFECDDSYN----------------STTYNPIQCG 93

Query: 149 NPKCSWIHHESIQCRDCNDEPLAT--SKNCTQICPSYLVLYGSGLTEGIALSETLNLP-- 204
             KC         C DC + P  T  + N   + P     +G     G    + L+ P  
Sbjct: 94  TKKCK--QARGTGCIDCTNHPSKTGCTNNTCGVEP--FNPFGGFFVSGDVGEDILSFPRV 149

Query: 205 --------NRIIPNFLVGCSVLS-----------SRQPAGIAGFGRGKTSLPSQL----N 241
                   N  +P F+  C               S+   G+ G  R   SLP+Q+     
Sbjct: 150 TSDGRRVTNVRVPRFISSCVYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIATRFK 209

Query: 242 LD-KFSYCLLSHKFDDTTRTSSLILDNG----SSHSDKKTTGLTYTPFVNNPSVAE---R 293
           LD KF+ CL S    +     SL +  G     S+ D  +  L YTP + N         
Sbjct: 210 LDRKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGPIFD 269

Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
           N  S  Y++ ++ I V    V      L++++ G GGT + +    T +   ++ PL + 
Sbjct: 270 NFPSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPLLNA 329

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVEN 409
           FV +  + R   R    +A+     CFD   +     G + P + L  KGG E  +   N
Sbjct: 330 FVKK-AEIRKIKR---VKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKGGVEWRIFGAN 385

Query: 410 YFAVVGEGSAVCLTVVTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
               V E + +CL  V       GP   SII+G  Q+++  VE+DL + +LGF   L
Sbjct: 386 SMVKVNE-NVLCLGFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFSSSL 441


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 116/410 (28%), Positives = 156/410 (38%), Gaps = 78/410 (19%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +++  GTPP  +  I DTGS LVW  C         ++     F+P  SS+   +GC 
Sbjct: 110 YLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGCD 169

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSGLTEGIALS-ETLNLPNR 206
              C                 L+++ +C+      YL  YG G      LS ET      
Sbjct: 170 TKAC---------------RALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTI 214

Query: 207 I----------------------IPNFLVGCSVLSSR--QPAGIAGFGRGKTSLPSQLNL 242
                                  I     GCS  ++   +  G+ G G G  SL SQL  
Sbjct: 215 ADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGGGPVSLASQLGA 274

Query: 243 D-----KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
                 KFSYCL    + +T  +S+L   N  S +     G   TP +            
Sbjct: 275 TTSLGRKFSYCLA--PYANTNASSAL---NFGSRAVVSEPGAASTPLITG-------EVE 322

Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
            YY + L  I V G +         +        IVDSGTT T++   L  PL    V  
Sbjct: 323 TYYTIALDSINVAGTKRPTTAAQAHI--------IVDSGTTLTYLDSALLTPL----VKD 370

Query: 358 MVKNRNYTRALGAEALTGLRPCFD---VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
           + +     RA   E +  L  C+D   V GE     P++ L   GG EVTL  +N F VV
Sbjct: 371 LTRRIKLPRAESPEKILDL--CYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVV 428

Query: 415 GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            EG  +CL +V   E       ILGN   QN +V YDL    + F    C
Sbjct: 429 QEG-VLCLALVATSERQS--VSILGNIAQQNLHVGYDLEKGTVTFAAADC 475


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 155/382 (40%), Gaps = 52/382 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +    GTP Q +   +DT S + W PC     C  CSS+    F    S++ + LGCQ
Sbjct: 101 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNG---CLGCSSTL---FNSPASTTYKSLGCQ 154

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
             +C  +   +     C             +C   L   GS L   ++  +T+ L    +
Sbjct: 155 AAQCKQVPKPT-----CGGG----------VCSFNLTYGGSSLAANLS-QDTITLATDAV 198

Query: 209 PNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
           P +  GC        L ++   G+        S    L    FSYCL S  F     + S
Sbjct: 199 PGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGS 256

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
           L L  G     K+   + YTP + NP    R +    Y+V L  + VG + V V     T
Sbjct: 257 LRL--GPVGQPKR---IKYTPLLKNP---RRPSL---YFVNLMAVRVGRRVVDVPPGSFT 305

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
            +     GTI DSGT FT +    +  + D F +++ +N      L   +L G   C+ V
Sbjct: 306 FNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRN------LTVTSLGGFDTCYTV 359

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
           P       P +   F G   VTLP +N       GS  CL +    +       ++ N Q
Sbjct: 360 PIAA----PTITFMFTG-MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQ 414

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN+ + YD+ N RLG  ++LC
Sbjct: 415 QQNHRLLYDVPNSRLGVARELC 436


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 109/390 (27%), Positives = 164/390 (42%), Gaps = 73/390 (18%)

Query: 95  FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
            GTPPQ    I+DTGS + + PC +   C  C + + P F P LS +   + C NP C+ 
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNS---CDQCGNHQDPKFQPDLSDTYHPVKC-NPDCT- 56

Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVG 214
              E+ QC    +   A   + + I    LV +G+       +SE    P R +     G
Sbjct: 57  CDTENDQC--TYERQYAEMSSSSGILGEDLVSFGN-------MSEL--KPQRAV----FG 101

Query: 215 CS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLI 264
           C       L S+   GI G GRG  S+  QL       D FS C    +        +++
Sbjct: 102 CENAETGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----VGGGAMV 157

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L   S  SD           V + S  +R   S YY + LR + V G+++ +  +     
Sbjct: 158 LGQISPPSD----------MVFSHSDPDR---SPYYNIELRGLHVAGKKLDINPQVF--- 201

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP-----C 379
            DG  GTI+DSGTT+ ++    F P      S++          G + + G  P     C
Sbjct: 202 -DGKHGTILDSGTTYAYLPEAAFLPFIQAITSEL---------HGLKQIRGPDPNYNDVC 251

Query: 380 FDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGP 434
           F   G    E   +FP + + F  G + +L  ENY F       A CL V  + +    P
Sbjct: 252 FSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGKD---P 308

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + +LG   ++N  V YD  + ++GF +  C
Sbjct: 309 TTLLGGIVVRNTLVTYDREHSKVGFWKTNC 338


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 115/389 (29%), Positives = 152/389 (39%), Gaps = 68/389 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS-----FIPKLSSSSR 143
           Y +++S GTP       +DTGS + W       QCK CS+    S     F P  SS+  
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWV------QCKPCSAPACNSQRDQLFDPAKSSTYS 196

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
            + C    CS +      C         +   C      Y+V YG G  T G+  S+TL 
Sbjct: 197 AVPCGADACSELRIYEAGC---------SGSQC-----GYVVSYGDGSNTTGVYGSDTLA 242

Query: 203 L-PNRIIPNFLVGCSVLSSRQPAGIAGF---GRGKTSLPSQLNL---DKFSYCLLSHKFD 255
           L P   +  FL GC    +   AGI G    GR   SL SQ        FSYCL S +  
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQ-- 300

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             +    L L   SS S   TTGL               A   +Y V L  I+VGGQ+V 
Sbjct: 301 --SAAGYLTLGGPSSASGFATTGLL-----------TAWAAPTFYMVMLTGISVGGQQVA 347

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           V            GGT+VD+GT  T + P  +  L   F   +     Y     A A   
Sbjct: 348 VPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPC-GYPS---APANGI 397

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
           L  C+D       + P + L F GGA + L            S+ CL    +     G +
Sbjct: 398 LDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL------SSGCLAFAPN--GGDGDA 449

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ILGN Q +++ V +D     +GF    C
Sbjct: 450 AILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 104/385 (27%), Positives = 159/385 (41%), Gaps = 58/385 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I+++ GTP       +DTGS + W  C      + CSS K   F P +S++     C 
Sbjct: 129 YVITVTIGTPAVTQVMSIDTGSDVSWVQCAP-CAAQSCSSQKDKLFDPAMSATYSAFSCG 187

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
           + +C+ +  E   C     +              Y+V YG G  T G   S+TL+L  + 
Sbjct: 188 SAQCAQLGDEGNGCLKSQCQ--------------YIVKYGDGSNTAGTYGSDTLSLTSSD 233

Query: 207 IIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
            + +F  GCS  ++    +  G+ G G    SL SQ        FSYCL       ++  
Sbjct: 234 AVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPS---SSGG 290

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
             L L      S  +    ++TP V       R +   +Y V L+ ITV G  + V    
Sbjct: 291 GFLTLGAAGGASSSR---YSHTPMV-------RFSVPTFYGVFLQGITVAGTMLNVPASV 340

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG-LRPC 379
                  +G ++VDSGT  T + P  ++ L   F  +M       +A  + A  G L  C
Sbjct: 341 F------SGASVVDSGTVITQLPPTAYQALRTAFKKEM-------KAYPSAAPVGSLDTC 387

Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
           FD  G  T + P + L F  GA + L +           A CL       A  G + ILG
Sbjct: 388 FDFSGFNTITVPTVTLTFSRGAAMDLDISGILY------AGCLAFTA--TAHDGDTGILG 439

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N Q + + + +D+  + +GF+   C
Sbjct: 440 NVQQRTFEMLFDVGGRTIGFRSGAC 464


>gi|222635172|gb|EEE65304.1| hypothetical protein OsJ_20543 [Oryza sativa Japonica Group]
          Length = 274

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 80/246 (32%), Positives = 107/246 (43%), Gaps = 72/246 (29%)

Query: 225 GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSS------HSDKKTTG 278
           GIAGFGRG+ SLPSQLN+  FSYC  S  FD  T++SS++    ++      H    T  
Sbjct: 88  GIAGFGRGRWSLPSQLNVTSFSYCFTS-MFD--TKSSSVVTLGAAAAELLHTHHAAHTGD 144

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
           +  T  + NPS          Y+V LR I+VGG RV V    L         TI+DSG +
Sbjct: 145 VRTTRLIKNPSQPS------LYFVPLRGISVGGARVAVPESRL------RSSTIIDSGAS 192

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
            T +  +++E +  EFVSQ                                         
Sbjct: 193 ITTLPEDVYEAVKAEFVSQ----------------------------------------- 211

Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
                 LP  NY  V  + +A  L VV D  A+ G  +++GN+Q QN +V YDL N  L 
Sbjct: 212 ------LPRGNY--VFEDYAARVLCVVLD--AAAGEQVVIGNYQQQNTHVVYDLENDVLS 261

Query: 459 FKQQLC 464
           F    C
Sbjct: 262 FAPARC 267


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 108/388 (27%), Positives = 156/388 (40%), Gaps = 67/388 (17%)

Query: 89  YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           Y + +SFGTP  PQ++  ++DTGS + W       QCK CSS      K P + P  SS+
Sbjct: 113 YVVRVSFGTPAVPQVV--VIDTGSDVSWL------QCKPCSSGQCFPQKDPLYDPSHSST 164

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
              + C +  C  +        D       + K C      + + Y  G  T G    + 
Sbjct: 165 YSAVPCASDVCKKL------AADAYGSGCTSGKQC-----GFAISYADGTSTVGAYSQDK 213

Query: 201 LNL-PNRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
           L L P  I+ NF  GC            G+ G GR + SL ++     FSYCL S     
Sbjct: 214 LTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPSV---- 268

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           +++   L L  G     K  +G  +TP    P    +  FS    V L  I VGG+++ +
Sbjct: 269 SSKPGFLALGAG-----KNPSGFVFTPMGTVPG---QPTFST---VTLAGINVGGKKLDL 317

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                +      GG IVDSGT  T +    +  L   F   M   R             L
Sbjct: 318 RPSAFS------GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNG-------DL 364

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C+++ G K    P++ L F GGA + L V N   V G     CL          G + 
Sbjct: 365 DTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG-----CLAFA--ESGPDGSAG 417

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +LGN   + + V +D    + GF+ + C
Sbjct: 418 VLGNVNQRAFEVLFDTSTSKFGFRAKAC 445


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 128/491 (26%), Positives = 189/491 (38%), Gaps = 87/491 (17%)

Query: 4   YISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIK 63
           Y S L +SF F         SS      ++   H +       N +  VS  L  A    
Sbjct: 8   YCSLLAISFFFASN------SSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRS 61

Query: 64  NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
             +++  TT T   +  IS+   G Y +S+S GTPP  +  I DTGS L W  C     C
Sbjct: 62  ISRSRRFTTKTDLQSGLISNG--GEYFMSISIGTPPSKVFAIADTGSDLTWVQCK---PC 116

Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
           + C     P F  K SS+ +   C +  C  +      C +  D           IC  Y
Sbjct: 117 QQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKD-----------IC-KY 164

Query: 184 LVLYG-SGLTEGIALSETL-----NLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKT--- 234
              YG +  T+G   +ET+     +  +   P  + GC            G+  G T   
Sbjct: 165 RYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGC------------GYNNGGTFEE 212

Query: 235 -------------SLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGS--SHSDKKT 276
                        SL SQL      KFSYC LSH    T  TS + L   S  S+  K +
Sbjct: 213 TGSGIIGLGGGPLSLVSQLGSSIGKKFSYC-LSHTAATTNGTSVINLGTNSIPSNPSKDS 271

Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN---GGTIV 333
             LT TP +             YY++ L  +TVG  ++        L+   +   G  I+
Sbjct: 272 ATLT-TPLIQKDP-------ETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNIII 323

Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
           DSGTT T +    ++         +   +  +   G      L  CF   G+K    P +
Sbjct: 324 DSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGL-----LTHCFK-SGDKEIGLPAI 377

Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
            +HF   A+V L   N F  + E + VCL+++   E +     I GN    ++ V YDL 
Sbjct: 378 TMHFT-NADVKLSPINAFVKLNEDT-VCLSMIPTTEVA-----IYGNMVQMDFLVGYDLE 430

Query: 454 NQRLGFKQQLC 464
            + + F++  C
Sbjct: 431 TKTVSFQRMDC 441


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 107/386 (27%), Positives = 163/386 (42%), Gaps = 56/386 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
           G Y   +  GTP      ++DTGS L W  C+    C   C     P F PK SS+   +
Sbjct: 120 GNYVTRMGLGTPATQYVMVVDTGSSLTWLQCS---PCLVSCHRQSGPVFNPKSSSTYASV 176

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
           GC   +CS +   ++    C+     +S  C      Y   YG S  + G    +T++  
Sbjct: 177 GCSAQQCSDLPSATLNPSACS-----SSNVCI-----YQASYGDSSFSVGYLSKDTVSFG 226

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
           +  +PNF  GC   +     + AG+ G  R K SL  QL       F+YCL       ++
Sbjct: 227 STSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL---PSSSSS 283

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
              SL   N   +S        YTP V++ S+ +       Y++ L  +TV G  + V  
Sbjct: 284 GYLSLGSYNPGQYS--------YTPMVSS-SLDDS-----LYFIKLSGMTVAGNPLSVSS 329

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +        TI+DSGT  T +   ++  L+    + M   +  +R   A A + L  
Sbjct: 330 SAYSSLP-----TIIDSGTVITRLPTSVYSALSKAVAAAM---KGTSR---ASAYSILDT 378

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CF     +  S P + + F GGA + L  +N    V + S  CL     R A+     I+
Sbjct: 379 CFKGQASRV-SAPAVTMSFAGGAALKLSAQNLLVDV-DDSTTCLAFAPARSAA-----II 431

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q Q + V YD+++ R+GF    C
Sbjct: 432 GNTQQQTFSVVYDVKSSRIGFAAGGC 457


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 91/300 (30%), Positives = 134/300 (44%), Gaps = 31/300 (10%)

Query: 172 TSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSR---QPAGI 226
           T++ C+     Y V YG G  T G    +TL L +   I  F  GC   +     + AG+
Sbjct: 12  TTRGCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGL 71

Query: 227 AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
            G GRGKTSLP Q   DK+   + +H F   +  +   L+ G   S   +  L+ TP + 
Sbjct: 72  LGLGRGKTSLPVQ-TYDKYG-GVFAHCFPARSSGTGY-LEFGPGSSPAVSAKLSTTPMLI 128

Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
           +           +YYVG+  I VGG+ + +             GTIVDSGT  T + P  
Sbjct: 129 DTG-------PTFYYVGMTGIRVGGKLLPIPQSVFA-----AAGTIVDSGTVITRLPPAA 176

Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
           +  L   F + M   R Y R   A AL+ L  C+D+ G    + P + L F+GG  + + 
Sbjct: 177 YSSLRSAFAASMAA-RGYKR---APALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVD 232

Query: 407 VEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                Y A V   S  CL    +  A      I+GN Q++ + V YD+ ++ +GF    C
Sbjct: 233 ASGIIYAASV---SQACLGFAGNEAAD--DVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 110/392 (28%), Positives = 162/392 (41%), Gaps = 60/392 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
           Y ++L  GTP      ++DTGS L W       QCK C++S     K P F P  SS+  
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWV------QCKPCNASDCYPQKDPLFDPSKSSTFA 178

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
            + C +  C  +  +      C +        C      Y + YG+G +TEG+  +ETL 
Sbjct: 179 TIPCASDACKQLPVDGYD-NGCTNNTSGMPPQC-----GYAIEYGNGAITEGVYSTETLA 232

Query: 203 L-PNRIIPNFLVGCSVLSSRQP----AGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKF 254
           L  + ++ +F  GC       P     G+ G G    SL SQ   +    FSYCL     
Sbjct: 233 LGSSAVVKSFRFGCGS-DQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCL----- 286

Query: 255 DDTTRTSSLILDNGSSHS-DKKTTGLTYTPF-VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
                + +  L  G+ +S +   +G  +TP    +P +A       +Y V L  I+VGG+
Sbjct: 287 -PPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIA------TFYVVTLTGISVGGK 339

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + +             G IVDSGT  T +    ++ L   F S M +       L   A
Sbjct: 340 ALDIPPAVFAK------GNIVDSGTVITGIPTTAYKALRTAFRSAMAE-----YPLLPPA 388

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
            + L  C++  G  T + P++ L F GGA V L V +   V       CL      + S 
Sbjct: 389 DSALDTCYNFTGHGTVTVPKVALTFVGGATVDLDVPSGVLV-----EDCLAFADAGDGSF 443

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           G   I+GN   +   V YD     LGF+   C
Sbjct: 444 G---IIGNVNTRTIEVLYDSGKGHLGFRAGAC 472


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 129/486 (26%), Positives = 199/486 (40%), Gaps = 91/486 (18%)

Query: 1   MASYISALCLSFIFFFTLLSIFPSSITSLT-----FSLSRFHTNPSQDSYQNLNSLVSSS 55
           MA+ IS      +FF  +L +   S T++      F+ S FH    +DS   L+ L  SS
Sbjct: 1   MAATIS------LFFHLILFLISFSQTTIINGDNGFTTSLFH----RDSL--LSPLEFSS 48

Query: 56  LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
           L+    + N   ++ + +        +S + G  S  +  GTPP     I DTGS L W 
Sbjct: 49  LSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSII--GTPPVDYLGIADTGSDLTWA 106

Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH--HESIQCRDCNDEPLATS 173
            C     C  C     P F P  S+S   + C    C  +   H  +Q            
Sbjct: 107 QC---LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQ------------ 151

Query: 174 KNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ---PAGIAGF 229
                +C  Y   YG    ++G    E + + +  + + ++GC   SS      +G+ G 
Sbjct: 152 ----GVC-DYSYTYGDRTYSKGDLGFEKITIGSSSVKS-VIGCGHASSGGFGFASGVIGL 205

Query: 230 GRGKTSLPSQLNLD-----KFSYCL---LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTY 281
           G G+ SL SQ++       +FSYCL   LSH         + ++            G+  
Sbjct: 206 GGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSG---------PGVVS 256

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
           TP ++  +V        YYY+ L  I++G +R   + K         G  I+DSGTT +F
Sbjct: 257 TPLISKNTV-------TYYYITLEAISIGNERHMAFAK--------QGNVIIDSGTTLSF 301

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKG 399
           +  EL+    D  VS ++K     R         L  CFD  +    +   P +   F G
Sbjct: 302 LPKELY----DGVVSSLLKVVKAKRVKDPGNFWDL--CFDDGINVATSSGIPIITAQFSG 355

Query: 400 GAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
           GA V L PV  +  V    + + LT  +  +  G    I+GN  + N+ + YDL  +RL 
Sbjct: 356 GANVNLLPVNTFQKVANNVNCLTLTPASPTDEFG----IIGNLALANFLIGYDLEAKRLS 411

Query: 459 FKQQLC 464
           FK  +C
Sbjct: 412 FKPTVC 417


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 108/394 (27%), Positives = 176/394 (44%), Gaps = 57/394 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
           G Y + +  G+P +    I+DTGS   W    PCT      YC   + P F P  S + +
Sbjct: 101 GNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCT-----IYCHIQEDPVFNPSASKTYK 155

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLN 202
            + C + +CS +   ++    C+ +    S  C      Y   YG S  + G    + L 
Sbjct: 156 TVPCSSSQCSSLKSATLNEPTCSKQ----SNACV-----YKASYGDSSFSLGYLSQDVLT 206

Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
           L P++ + +F+ GC   +     +  GI G    + S+ SQL+    + FSYCL +  F 
Sbjct: 207 LTPSQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPT-SFS 265

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
                    L  G+S S   ++   +TP + NP+          Y++ L  ITV G+ + 
Sbjct: 266 TPNSPKEGFLSIGTS-SLTPSSSYKFTPLLKNPNNPS------LYFIDLESITVAGRPLG 318

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           V      +       TI+DSGT  T +   ++  L + +V+  + ++ Y +A G   ++ 
Sbjct: 319 VAASSYKVP------TIIDSGTVITRLPTPVYTTLKNAYVT--ILSKKYQQAPG---ISL 367

Query: 376 LRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
           L  CF   G   G     P++++ FKGGA++ L   N    + E    CL +      +G
Sbjct: 368 LDTCFK--GSLAGISEVAPDIRIIFKGGADLQLKGHNSLVEL-ETGITCLAM------AG 418

Query: 433 GPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             SI I+GN+Q Q   V YD+ N R+GF    C+
Sbjct: 419 SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGCQ 452


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 108/394 (27%), Positives = 176/394 (44%), Gaps = 57/394 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
           G Y + +  G+P +    I+DTGS   W    PCT      YC   + P F P  S + +
Sbjct: 101 GNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCT-----IYCHIQEDPVFNPSASKTYK 155

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLN 202
            + C + +CS +   ++    C+ +    S  C      Y   YG S  + G    + L 
Sbjct: 156 TVPCSSSQCSSLKSATLNEPTCSKQ----SNACV-----YKASYGDSSFSLGYLSQDVLT 206

Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
           L P++ + +F+ GC   +     +  GI G    + S+ SQL+    + FSYCL +  F 
Sbjct: 207 LTPSQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPT-SFS 265

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
                    L  G+S S   ++   +TP + NP+          Y++ L  ITV G+ + 
Sbjct: 266 TPNSPKEGFLSIGTS-SLTPSSSYKFTPLLKNPNNPS------LYFIDLESITVAGRPLG 318

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           V      +       TI+DSGT  T +   ++  L + +V+  + ++ Y +A G   ++ 
Sbjct: 319 VAASSYKVP------TIIDSGTVITRLPTPVYTTLKNAYVT--ILSKKYQQAPG---ISL 367

Query: 376 LRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
           L  CF   G   G     P++++ FKGGA++ L   N    + E    CL +      +G
Sbjct: 368 LDTCFK--GSLAGISEVAPDIRIIFKGGADLQLKGHNSLVEL-ETGITCLAM------AG 418

Query: 433 GPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             SI I+GN+Q Q   V YD+ N R+GF    C+
Sbjct: 419 SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGCQ 452


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 114/389 (29%), Positives = 152/389 (39%), Gaps = 68/389 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS-----FIPKLSSSSR 143
           Y +++S GTP       +DTGS + W       QCK CS+    S     F P  SS+  
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWV------QCKPCSAPACNSQRDQLFDPAKSSTYS 196

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
            + C    CS +      C         +   C      Y+V YG G  T G+  S+TL 
Sbjct: 197 AVPCGADACSELRIYEAGC---------SGSQC-----GYVVSYGDGSNTTGVYGSDTLA 242

Query: 203 L-PNRIIPNFLVGCSVLSSRQPAGIAGF---GRGKTSLPSQLNL---DKFSYCLLSHKFD 255
           L P   +  FL GC    +   AGI G    GR   SL SQ        FSYCL S +  
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQ-- 300

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             +    L L   +S S   TTGL               A   +Y V L  I+VGGQ+V 
Sbjct: 301 --SAAGYLTLGGPTSASGFATTGLL-----------TAWAAPTFYMVMLTGISVGGQQVA 347

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           V            GGT+VD+GT  T + P  +  L   F    +    Y     A A   
Sbjct: 348 VPASAFA------GGTVVDTGTVITRLPPTAYAALRSAF-RGAIAPYGYPS---APANGI 397

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
           L  C+D       + P + L F GGA + L            S+ CL    +     G +
Sbjct: 398 LDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL------SSGCLAFAPN--GGDGDA 449

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ILGN Q +++ V +D     +GF    C
Sbjct: 450 AILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 133/504 (26%), Positives = 196/504 (38%), Gaps = 98/504 (19%)

Query: 1   MASYISALCLSFIFFFTLLSIFP-SSITSLTFSLSRFHT--------NPSQDSYQNLNSL 51
           MA+  S     FI F +++S F      +  FS +  H         NP    +  L + 
Sbjct: 1   MAAVSSIYVSLFIAFISMVSAFSLVEARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNS 60

Query: 52  VSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
              S++RA   K      + +      ++I     G Y + +S G P   I  I DTGS 
Sbjct: 61  FHRSISRANRFK----PNSISARALVQSDIVPGG-GEYLMRISIGNPQVEILAIADTGSD 115

Query: 112 LVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLA 171
           L+W  C     C+ C     P F P+ SSS R + C N  C+ +  E+  C         
Sbjct: 116 LIWVQCQ---PCEMCYKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSC--------- 163

Query: 172 TSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIA---- 227
            ++   + C  Y   YG          ++ +  +  I  F +G +  +S   A IA    
Sbjct: 164 DARGFVKTC-GYTYSYG---------DQSFSDGHLAIERFGIGST--NSNTSAAIAYFQE 211

Query: 228 -GFGRG--------------------KTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSL 263
             FG G                      SL SQL      KFSYCL+    + +  TS +
Sbjct: 212 VAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTS-EQSNYTSKI 270

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV---RVWHKY 320
              N     D   +G  Y   V+ P + ++     YYY+ L  I+V  +R+    +W+  
Sbjct: 271 NFGN-----DINISGSNYN-VVSTPLLPKKP--ETYYYLTLEAISVENKRLPYTNLWNGE 322

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           +       G  I+DSGTT TF+  E F  L D  V + VK    +   G         CF
Sbjct: 323 VE-----KGNIIIDSGTTLTFLDSEFFNNL-DSAVEEAVKGERVSDPHGL-----FNICF 371

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
               EK    P +  HF  GA+V L   N FA V E   +C T++   + +     I GN
Sbjct: 372 --KDEKAIELPIITAHFT-GADVELQPVNTFAKV-EEDLLCFTMIPSNDIA-----IFGN 422

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
               N+ V YDL  + + F    C
Sbjct: 423 LAQMNFLVGYDLEKKAVSFLPTDC 446


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 106/397 (26%), Positives = 168/397 (42%), Gaps = 71/397 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ    I+D+GS + + PC +   C+ C + + P F P LSS+   + 
Sbjct: 86  GYYTTRLHIGTPPQEFALIVDSGSTVTYVPCAS---CEQCGNHQDPRFQPDLSSTYSPVK 142

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C N  C+    +  QC    +   A   + + +    +V +G+         E+   P R
Sbjct: 143 C-NVDCT-CDSDKNQC--TYERQYAEMSSSSGVLGEDIVSFGT---------ESELKPQR 189

Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
            +     GC       L S+   GI G GRG+ S+  QL       D FS C        
Sbjct: 190 AV----FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG- 244

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVR 315
                +++L      +     G+ YT           NA  S YY + L+ + V G+ +R
Sbjct: 245 ---GGAMVLG-----AMPAPPGMIYT---------HSNAVRSPYYNIELKEMHVAGKALR 287

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-------VKNRNYTRAL 368
           V  +      DG  GT++DSGTT+ ++  + F    D   SQ+         + NY    
Sbjct: 288 VDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDIC 343

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTD 427
            A A   +    +V       FP++ + F  G +++L  ENY F       A CL V  +
Sbjct: 344 FAGAGRNVSQLSEV-------FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQN 396

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +    P+ +LG   ++N  V YD  N+++GF +  C
Sbjct: 397 GKD---PTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 106/399 (26%), Positives = 167/399 (41%), Gaps = 75/399 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ    I+D+GS + + PC +   C+ C + + P F P LSS+   + 
Sbjct: 86  GYYTTRLHIGTPPQEFALIVDSGSTVTYVPCAS---CEQCGNHQDPRFQPDLSSTYSPVK 142

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C N  C+    +  QC    +   A   + + +    +V +G+         E+   P R
Sbjct: 143 C-NVDCT-CDSDKNQC--TYERQYAEMSSSSGVLGEDIVSFGT---------ESELKPQR 189

Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
            +     GC       L S+   GI G GRG+ S+  QL       D FS C        
Sbjct: 190 AV----FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG- 244

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVR 315
                +++L      +     G+ YT           NA  S YY + L+ + V G+ +R
Sbjct: 245 ---GGAMVLG-----AMPAPPGMIYT---------HSNAVRSPYYNIELKEMHVAGKALR 287

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           V  +      DG  GT++DSGTT+ ++  + F    D   SQ+   +          + G
Sbjct: 288 VDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKK---------IRG 334

Query: 376 LRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV 425
             P     CF   G         FP++ + F  G +++L  ENY F       A CL V 
Sbjct: 335 PDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 394

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + +    P+ +LG   ++N  V YD  N+++GF +  C
Sbjct: 395 QNGKD---PTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 110/398 (27%), Positives = 160/398 (40%), Gaps = 58/398 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS--SKIPSFIPKLSSSSRL 144
           G Y   +  GTP +     +DTGS ++W  C    +C   S     +  +  K S++S  
Sbjct: 153 GLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDA 212

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           +GC +  CS             D PL   K   Q    Y VLYG G +      +     
Sbjct: 213 VGCDDNFCSLY-----------DGPLPGCKPGLQCL--YSVLYGDGSSTTGYFVQDFVQY 259

Query: 205 NRIIPNF---------LVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
           NRI  NF         + GC          SS    GI GFG+  +S+ SQL        
Sbjct: 260 NRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKK 319

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           + SH  D+        +D G   +     G    P VN   + +  A   +Y V ++ I 
Sbjct: 320 VFSHCLDN--------VDGGGIFA----IGEVVEPKVNITPLVQNQA---HYNVVMKEIE 364

Query: 309 VGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           VGG  + V    + + DR G   TI+DSGTT  +   E++ PL ++ +SQ    R +T  
Sbjct: 365 VGGDPLDVPSDAFESGDRKG---TIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV- 420

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVT 426
              +A T    CFD  G     FP + LHF     +T+ P E  F V      +      
Sbjct: 421 --EQAFT----CFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSG 474

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +   G    +LG+  + N  V YDL  Q +G+ +  C
Sbjct: 475 AQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 512


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 107/402 (26%), Positives = 164/402 (40%), Gaps = 65/402 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
           G Y   +  G PP+     +DTGS ++W  C N  +C  K     K+  + P+ S+S+  
Sbjct: 80  GLYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATR 139

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-LTEGIALSETLN 202
           + C +  C+  ++  +Q              CT+  P  Y V+YG G  T G  + + L 
Sbjct: 140 IYCDDDFCAATYNGVLQ-------------GCTKDLPCQYSVVYGDGSSTAGFFVKDNLQ 186

Query: 203 LPNRIIPNF---------LVGCSV-------LSSRQPAGIAGFGRGKTSLPSQLNLDKFS 246
             +R+  N          + GC          SS    GI GFG+  +S+ SQL      
Sbjct: 187 F-DRVTGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKV 245

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV-NNPSVAERNAFSVYYYVGLR 305
             + +H  D+       I   G   S K  T    TP V N P          +Y V ++
Sbjct: 246 KRVFAHCLDNVKGGG--IFAIGEVVSPKVNT----TPMVPNQP----------HYNVVMK 289

Query: 306 RITVGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
            I VGG  + +    + T DR    GTI+DSGTT  ++   ++E +  + VS+    + +
Sbjct: 290 EIEVGGNVLELPTDIFDTGDRR---GTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLH 346

Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
           T     E  T    CF   G     FP +K HF G   +T+   +Y   + E    C   
Sbjct: 347 TV---EEQFT----CFQYTGNVNEGFPVVKFHFNGSLSLTVNPHDYLFQIHE-EVWCFGW 398

Query: 425 VTD--REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                +   G    +LG+  + N  V YDL NQ +G+    C
Sbjct: 399 QNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQAIGWTDYNC 440


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 110/398 (27%), Positives = 160/398 (40%), Gaps = 58/398 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
           G Y   +  GTP +     +DTGS ++W  C    +C  K      +  +  K S++S  
Sbjct: 72  GLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDA 131

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           +GC +  CS             D PL   K   Q    Y VLYG G +      +     
Sbjct: 132 VGCDDNFCSLY-----------DGPLPGCKPGLQCL--YSVLYGDGSSTTGYFVQDFVQY 178

Query: 205 NRIIPNF---------LVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
           NRI  NF         + GC          SS    GI GFG+  +S+ SQL        
Sbjct: 179 NRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKK 238

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           + SH  D+        +D G   +     G    P VN   + +  A   +Y V ++ I 
Sbjct: 239 VFSHCLDN--------VDGGGIFA----IGEVVEPKVNITPLVQNQA---HYNVVMKEIE 283

Query: 309 VGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           VGG  + V    + + DR G   TI+DSGTT  +   E++ PL ++ +SQ    R +T  
Sbjct: 284 VGGDPLDVPSDAFESGDRKG---TIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV- 339

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVT 426
              +A T    CFD  G     FP + LHF     +T+ P E  F V      +      
Sbjct: 340 --EQAFT----CFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSG 393

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +   G    +LG+  + N  V YDL  Q +G+ +  C
Sbjct: 394 AQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 431


>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
 gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
          Length = 437

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 111/433 (25%), Positives = 177/433 (40%), Gaps = 52/433 (12%)

Query: 40  PSQDSYQN-LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           P  DS+ N + ++ S    R  ++     + T T+    +    + + G Y + +  GTP
Sbjct: 50  PKADSWDNRVINMASKDPARMSYLSTLVAQKTATSAPIASGQ--TFNIGNYVVRVKIGTP 107

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
            Q++  +LDT +   + P +    C  CS++   +F P +S+S   L C  P+C  +   
Sbjct: 108 GQLLFMVLDTSTDEAFVPSSG---CIGCSAT---TFYPNVSTSFVPLDCSVPQCGQVR-- 159

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC--S 216
            + C      P   S  C     S+   Y         + ++L L   +IP++  G   +
Sbjct: 160 GLSC------PATGSGAC-----SFNQSYAGSTFSATLVQDSLRLATDVIPSYSFGSINA 208

Query: 217 VLSSRQPA----GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
           +  S  PA    G+        S    +    FSYCL S  F     + SL L       
Sbjct: 209 ISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCLPS--FKSYYFSGSLKLGPVGQPK 266

Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
             +TT L + P  + PS+         YYV L  I+VG   V +  + L  +     GTI
Sbjct: 267 SIRTTPLLHNP--HRPSL---------YYVNLTAISVGRVYVPLPSELLAFNPSTGAGTI 315

Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
           +DSGT  T     ++  + DEF  Q+    +   +LGA        CF    E     P 
Sbjct: 316 IDSGTVITRFVEPIYNAVRDEFRKQVTGPFS---SLGA-----FDTCFVKNYETLA--PA 365

Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDL 452
           + LHF    ++ LP+EN       GS  CL +            ++ NFQ QN  V +D 
Sbjct: 366 ITLHFT-DLDLKLPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDT 424

Query: 453 RNQRLGFKQQLCK 465
            N ++G  ++LC 
Sbjct: 425 VNNKVGIARELCN 437


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 50/387 (12%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  + + GTPPQ    I+D    LVW  C+    C+ C    +P F+P  SS+ +   C 
Sbjct: 62  YVANFTIGTPPQPASAIVDVAGELVWTQCS---ACRRCFKQDLPVFVPNASSTFKPEPCG 118

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
              C     ESI  R C+ +  +     TQ+          G T G A ++T  +    +
Sbjct: 119 TAVC-----ESIPTRSCSGDVCSYKGPPTQL---------RGNTSGFAATDTFAIGTATV 164

Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
                GC V S       P+G  G GR   SL +Q+ L +FSYCL      +T ++S L 
Sbjct: 165 -RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPR---NTGKSSRLF 220

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L + +  +  ++T  +  PF+      + +    YY + L  I  G   +          
Sbjct: 221 LGSSAKLAGGEST--STAPFIKTSPDDDSHH---YYLLSLDAIRAGNTTIATAQ------ 269

Query: 325 RDGNGGTIV-DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF-DV 382
              +GG +V  + + F+ +    +     + V++ V                L  CF   
Sbjct: 270 ---SGGILVMHTVSPFSLLVDSAYRAF-KKAVTEAVGGAAAPPMATPPQPFDL--CFKKA 323

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT----DREASGGPSII 437
            G    + P+L   F+G A +T+P   Y   VG E    C  +++    +R    G S +
Sbjct: 324 AGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVS-V 382

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           LG+ Q ++ +  YDL+ + L F+   C
Sbjct: 383 LGSLQQEDVHFLYDLKKETLSFEPADC 409


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 109/393 (27%), Positives = 171/393 (43%), Gaps = 59/393 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLL 145
           G Y +S + G P   +   LDT + L+W  C+N + QC+         F+   S +  + 
Sbjct: 73  GEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEME 132

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP 204
            C +  C+ +       + CN    ++ K C      Y ++YG    T GI  S++    
Sbjct: 133 PCGSNFCNSL----TGFQTCN----SSDKWC-----KYRLVYGDNKATSGILSSDSFGFD 179

Query: 205 NR----IIPNFL-VGCSVL----SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
                 +   FL  GCS        +   G  G  +   SL SQL + KFSYCL+   F+
Sbjct: 180 TSDGMLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLV--PFN 237

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
           +   TS +    GS       T    TP +   S A        YYV +  I++G     
Sbjct: 238 NLGSTSKMYF--GS----LPVTSGGQTPLLYPNSDA--------YYVKVLGISIGNDEPH 283

Query: 316 ---VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
              V+  Y   D     G I+D+G T++ +  + F+ L  +F++  +K+    +    E 
Sbjct: 284 FDGVFDVYEVRD-----GWIIDTGITYSSLETDAFDSLLAKFLT--LKDFPQRKDDPKER 336

Query: 373 LTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
                 CF++       SFP++ +HF G A++ L VE+ F  + +    CL ++     S
Sbjct: 337 F---ELCFELQNANDLESFPDVTVHFDG-ADLILNVESTFVKIEDDGIFCLALL----RS 388

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           G P  ILGNFQ+QNY+V YDL  Q + F    C
Sbjct: 389 GSPVSILGNFQLQNYHVGYDLEAQVISFAPVDC 421


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 108/396 (27%), Positives = 158/396 (39%), Gaps = 62/396 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
           Y  + + GTPPQ +  I+D    LVW       QC  C SS     ++P F P  S++ R
Sbjct: 62  YVANFTIGTPPQAVSGIVDLSGELVW------TQCAACRSSGCFKQELPVFDPSASNTYR 115

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
              C +P C     +SI  R+C+ +       C    PS       G T GIA ++ + +
Sbjct: 116 AEQCGSPLC-----KSIPTRNCSGD-----GECGYEAPSMF-----GDTFGIASTDAIAI 160

Query: 204 PNRIIPNFLVGCSVLSSRQ-------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
            N        GC V S          P+G  G GR   SL  Q N+  FSYCL  H    
Sbjct: 161 GNAE-GRLAFGCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTAFSYCLAPHG--- 216

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
             + S+L L   +  +    +         + S    +    YY V L  I  G   V  
Sbjct: 217 PGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAA 276

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA--DEFVSQMVKNRNYTRALGAEALT 374
                       GG I       T +  E F PL+   +   Q ++ +  T ALG+ ++ 
Sbjct: 277 ASS--------GGGAI-------TILQLETFRPLSYLPDAAYQALE-KVVTAALGSPSMA 320

Query: 375 GLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA-VCLTVVTD---R 428
                FD+          P+L   F+GGA +T P   Y    G G+  VCL++++     
Sbjct: 321 NPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLD 380

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            A  G S ILG+   +N +  +DL  + L F+   C
Sbjct: 381 SADDGVS-ILGSLLQENVHFLFDLEKETLSFEPADC 415


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 103/402 (25%), Positives = 165/402 (41%), Gaps = 64/402 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+P +     +DTGS ++W  C     C + S   I    F    SS++ L
Sbjct: 81  GLYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAAL 140

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
           + C +P CS+    +     C+ +    +  C     SY   YG G  T G  +S+T+  
Sbjct: 141 VSCADPICSYAVQTATS--GCSSQ----ANQC-----SYTFQYGDGSGTTGYYVSDTMYF 189

Query: 204 PNRIIPNFLV---------GCSVLSS-------RQPAGIAGFGRGKTSLPSQLNL----- 242
              ++   +V         GCS   S       +   GI GFG G  S+ SQL+      
Sbjct: 190 DTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTP 249

Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
             FS+CL   +          IL+            + Y+P V  PS+        +Y +
Sbjct: 250 KVFSHCLKGGENGGGVLVLGEILE----------PSIVYSPLV--PSLP-------HYNL 290

Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
            L+ I V GQ + +           N GTIVDSGTT  ++  E + P  D   + + +  
Sbjct: 291 NLQSIAVNGQLLPIDSNVFATTN--NQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFS 348

Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
               + G +       C+ V       FP++ L+F GGA + L  E+Y    G   +  +
Sbjct: 349 KPIISKGNQ-------CYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAM 401

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +  ++   G + ILG+  +++    YDL NQR+G+    C
Sbjct: 402 WCIGFQKVERGFT-ILGDLVLKDKIFVYDLANQRIGWADYNC 442


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 138/490 (28%), Positives = 207/490 (42%), Gaps = 75/490 (15%)

Query: 1   MASY-ISALCLSFIFFFTLLSI--FPSSITSLTFSLSRFHT--------NPSQDSYQNLN 49
           MA++ I+ L L F+ F  L+S     +S+ + +F+ S  H         NP    +  L 
Sbjct: 1   MAAFSITHLSL-FVIFVALISKTSLTASMNNGSFTASLIHRDSPISPLYNPKNTYFDRLQ 59

Query: 50  SLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTG 109
           S    S++RA    N  T  + +   T   +I     G Y + +S GTPP  +  I DTG
Sbjct: 60  SSFHRSISRA----NRFTPNSVSAAKTLEYDIIPGG-GEYFMRISIGTPPIEVLVIADTG 114

Query: 110 SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEP 169
           S L+W  C     C+ C   K P F PK SS+ R + C+   C+ ++ +   C       
Sbjct: 115 SDLIWVQCQ---PCQECYKQKSPIFNPKQSSTYRRVLCETRYCNALNSDMRAC------- 164

Query: 170 LATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PNRIIPNFLVGCSVLS----SRQ 222
             ++    + C  Y   YG    T G   +E   +   N  I     GC   +       
Sbjct: 165 --SAHGFFKAC-GYSYSYGDHSFTMGYLATERFIIGSTNNSIQELAFGCGNSNGGNFDEV 221

Query: 223 PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
            +GI G G G  SL SQL     +KFSYCL+            ++  + S  S   T   
Sbjct: 222 GSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDT--- 278

Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN---GGTIVDSG 336
               +V+ P V++      +YY+ L  I+VG +R+     Y     DGN   G  I+DSG
Sbjct: 279 ----YVSTPLVSKEP--ETFYYLTLEAISVGNERL----AYENSRNDGNVEKGNIIIDSG 328

Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTG-SFPELK 394
           TT TF+  +L+  L  E V +        +A+  E ++     F +   +K G   P + 
Sbjct: 329 TTLTFLDSKLYNKL--ELVLE--------KAVEGERVSDPNGIFSICFRDKIGIELPIIT 378

Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
           +HF   A+V L   N FA   E   +C T++     S G + I GN    N+ V YDL  
Sbjct: 379 VHFT-DADVELKPINTFA-KAEEDLLCFTMI----PSNGIA-IFGNLAQMNFLVGYDLDK 431

Query: 455 QRLGFKQQLC 464
             + F    C
Sbjct: 432 NCVSFMPTDC 441


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 103/386 (26%), Positives = 152/386 (39%), Gaps = 51/386 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y IS+  G+P      ++DTGS + W  C        C +     F P  SS+     C 
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 194

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLP-NR 206
              C+ +  +S +   C+      +K+  Q    Y+V YG G  T G   S+ L L  + 
Sbjct: 195 AAACAQL-GDSGEANGCD------AKSRCQ----YIVKYGDGSNTTGTYSSDVLTLSGSD 243

Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
           ++  F  GCS          +  G+ G G    SL SQ        FSYCL +       
Sbjct: 244 VVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATP----A 299

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            +  L L   +S      +    TP + +  V        YY+  L  I VGG+++ +  
Sbjct: 300 SSGFLTLGAPASGGGGGASRFATTPMLRSKKV------PTYYFAALEDIAVGGKKLGLSP 353

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
                      G++VDSGT  T + P  +  L+  F + M +   Y R   AE L  L  
Sbjct: 354 SVFAA------GSLVDSGTVITRLPPAAYAALSSAFRAGMTR---YAR---AEPLGILDT 401

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CF+  G    S P + L F GGA V L      +    G  +      D +A G     +
Sbjct: 402 CFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVS----GGCLAFAPTRDDKAFG----TI 453

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q + + V YD+     GF+   C
Sbjct: 454 GNVQQRTFEVLYDVGGGVFGFRAGAC 479


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 104/419 (24%), Positives = 176/419 (42%), Gaps = 65/419 (15%)

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
           H++  ++ +T T       ++    YG Y+  +  GTPPQ    I+DTGS L + PC+  
Sbjct: 66  HLQRSESHSTATARMPLYDDLIP--YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST- 122

Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
             C+ C   + P+F P  SS+ + L C   +C+    E + C    D   A   + + + 
Sbjct: 123 --CEQCGKHQDPNFQPDWSSTYQPLKCSM-ECT-CDSEMMHC--VYDRQYAEMSSSSGVL 176

Query: 181 PSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-----SRQPAGIAGFGRGKTS 235
              +V +G          ++   P R +     GC  +      S++  GI G GRG  S
Sbjct: 177 GEDIVSFG---------KQSELKPQRTV----FGCENVETGDIYSQRADGIMGLGRGDLS 223

Query: 236 LPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           +  QL       + FS C     +         ++  G S       G+ +T   ++P  
Sbjct: 224 IVDQLVEKGVIGNSFSLC-----YGGMDVGGGAMVLGGIS----PPAGMVFTH--SDP-- 270

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
               A S YY + L+ I + G+++ +      +  DG  GTI+DSGTT+ ++     EP 
Sbjct: 271 ----ARSAYYNIDLKEIHIAGKQLPI----NPMVFDGKYGTILDSGTTYAYLP----EPA 318

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAEVTLP 406
              F   ++K  N  + +          CF   G    + + +FP + L F  G  ++L 
Sbjct: 319 FKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLS 378

Query: 407 VENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ENY F       A CL +  +       + +LG   ++N  V YD  + ++GF +  C
Sbjct: 379 PENYLFQHSKAHGAYCLGIFQNENDQ---TTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 107/398 (26%), Positives = 166/398 (41%), Gaps = 73/398 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ+   I+DTGS + + PC+    C+ C   + P F P+ SS+ + + 
Sbjct: 82  GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST---CEQCGRHQDPKFQPESSSTYQPVK 138

Query: 147 CQNPKCSWIHHESIQCRDCNDEPL--ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           C           +I C +C+ + +     +   ++  S  VL G  L      SE    P
Sbjct: 139 C-----------TIDC-NCDSDRMQCVYERQYAEMSTSSGVL-GEDLISFGNQSEL--AP 183

Query: 205 NRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQL---NLDKFSYCLLSHKFDD 256
            R +     GC       L S+   GI G GRG  S+  QL   N+   S+ L     D 
Sbjct: 184 QRAV----FGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMD- 238

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
                +++L   S  SD       Y+  V +P          YY + L+ I V G+R+ +
Sbjct: 239 -VGGGAMVLGGISPPSD---MAFAYSDPVRSP----------YYNIDLKEIHVAGKRLPL 284

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                    DG  GT++DSGTT+ ++    F    D  V ++            + ++G 
Sbjct: 285 NANVF----DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQS---------LKKISGP 331

Query: 377 RP-----CFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVT 426
            P     CF   G    + + SFP + + F+ G + TL  ENY F       A CL V  
Sbjct: 332 DPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQ 391

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +       + +LG   ++N  V YD    ++GF +  C
Sbjct: 392 N---GNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNC 426


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 104/419 (24%), Positives = 176/419 (42%), Gaps = 65/419 (15%)

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
           H++  ++ +T T       ++    YG Y+  +  GTPPQ    I+DTGS L + PC+  
Sbjct: 66  HLQRSESHSTATARMPLYDDLIP--YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST- 122

Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
             C+ C   + P+F P  SS+ + L C   +C+    E + C    D   A   + + + 
Sbjct: 123 --CEQCGKHQDPNFQPDWSSTYQPLKCSM-ECT-CDSEMMHC--VYDRQYAEMSSSSGVL 176

Query: 181 PSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-----SRQPAGIAGFGRGKTS 235
              +V +G          ++   P R +     GC  +      S++  GI G GRG  S
Sbjct: 177 GEDIVSFG---------KQSELKPQRTV----FGCENVETGDIYSQRADGIMGLGRGDLS 223

Query: 236 LPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
           +  QL       + FS C     +         ++  G S       G+ +T   ++P  
Sbjct: 224 IVDQLVEKGVIGNSFSLC-----YGGMDVGGGAMVLGGIS----PPAGMVFTH--SDP-- 270

Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
               A S YY + L+ I + G+++ +      +  DG  GTI+DSGTT+ ++     EP 
Sbjct: 271 ----ARSAYYNIDLKEIHIAGKQLPI----NPMVFDGKYGTILDSGTTYAYLP----EPA 318

Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAEVTLP 406
              F   ++K  N  + +          CF   G    + + +FP + L F  G  ++L 
Sbjct: 319 FKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLS 378

Query: 407 VENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            ENY F       A CL +  +       + +LG   ++N  V YD  + ++GF +  C
Sbjct: 379 PENYLFQHSKAHGAYCLGIFQNENDQ---TTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 106/387 (27%), Positives = 158/387 (40%), Gaps = 62/387 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  S GTPPQ +   +DT +   W PC     C  C +S    F P  S+S R + C 
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAG---CAGCPTSSAAPFDPAASASYRTVPCG 168

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P C+               P A      + C   L    S L   ++  ++L +    +
Sbjct: 169 SPLCA-------------QAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGNAV 214

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
             +  GC   +  ++  P G+ G GRG  S  SQ   +    FSYCL S K   F  T R
Sbjct: 215 KAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLR 274

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
               +  NG     K T      P + NP        S  YYV +  + VG + V +   
Sbjct: 275 ----LGRNGQPQRIKTT------PLLANPH------RSSLYYVNMTGVRVGRKVVPI--- 315

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA--EALTGLR 377
               D     GT++DSGT FT +    +  + DE            R +GA   +L G  
Sbjct: 316 -PAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEV----------RRRVGAPVSSLGGFD 364

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            CF+       ++P + L F G  +VTLP EN       G+  CL +    +       +
Sbjct: 365 TCFNT---TAVAWPPMTLLFDG-MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNV 420

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + + Q QN+ V +D+ N R+GF ++ C
Sbjct: 421 IASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|383161173|gb|AFG63169.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
          Length = 133

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 64/145 (44%), Positives = 85/145 (58%), Gaps = 20/145 (13%)

Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPAGIAGFG 230
           C++ICP + + YG+G   G  LS+TL LP      R I NF  GCSVLSS Q AGIAGFG
Sbjct: 1   CSKICPHFSLTYGTGNATGRLLSDTLTLPLEDGGRREIKNFAFGCSVLSS-QVAGIAGFG 59

Query: 231 RGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
            G  S+PSQL     DKF+YCL     D  + +S ++L N +   D     LTYTP + N
Sbjct: 60  NGGLSMPSQLAPLIGDKFAYCL-----DYRSNSSKIVLGNKAVPRDLP---LTYTPLLFN 111

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQ 312
           P     + FS Y+Y+ L  +++GG+
Sbjct: 112 P--VNPSVFS-YFYLALEAVSIGGK 133


>gi|285741|dbj|BAA03413.1| EDGP precursor [Daucus carota]
          Length = 433

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 97/392 (24%), Positives = 171/392 (43%), Gaps = 61/392 (15%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
            ++D G   +W  C  +Y                +SS+ R + C+  +CS     SI C 
Sbjct: 57  LVVDLGGRFLWVDCDQNY----------------VSSTYRPVRCRTSQCSL--SGSIACG 98

Query: 164 DCNDEPLATSKNCT-QICPSYLVL---YGSGLTEGIALSETLN--LPNRII--PNFLVGC 215
           DC + P     N T  + P   V+    G  + E +   E+ +     R++  P F+  C
Sbjct: 99  DCFNGPRPGCNNNTCGVFPENPVINTATGGEVAEDVVSVESTDGSSSGRVVTVPRFIFSC 158

Query: 216 SVLSSRQ-----PAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
           +  S  Q       G+AG GR + +LPSQ         KF+ CL       +T ++S+I+
Sbjct: 159 APTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL-----SGSTSSNSVII 213

Query: 266 DNGSSH--------SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGGQR 313
                +        SDK    LTYTP + NP    + + +   SV Y++G++ I +  + 
Sbjct: 214 FGNDPYTFLPNIIVSDKT---LTYTPLLTNPVSTSATSTQGEPSVEYFIGVKSIKINSKI 270

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           V +    L++   G GGT + +   +T +   +++ + + F+ +    RN TR       
Sbjct: 271 VALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKESAA-RNITRVASVAPF 329

Query: 374 TGLRPCFDVPGEKTG-SFPELKLHFKGGAEV-TLPVENYFAVVGEGSAVCLTVVTDREAS 431
                  ++   + G S P + L  +  + V T+   N    + + + VCL VV D  ++
Sbjct: 330 GACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND-NVVCLGVV-DGGSN 387

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
              SI++G  Q+++  V++DL   R+GF   L
Sbjct: 388 LRTSIVIGGHQLEDNLVQFDLATSRVGFSGTL 419


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 110/384 (28%), Positives = 150/384 (39%), Gaps = 53/384 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y ++ S GTP       +DTGS L W  C        C S K P F P  SSS   + C 
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
            P C+ +              +  +  C+     Y+V YG G  T G+  S+TL L  + 
Sbjct: 200 GPVCAGLG-------------IYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246

Query: 207 IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRT 260
            +  F  GC    S       G+ G GR + SL  Q        FSYCL       T  +
Sbjct: 247 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCL------PTKPS 300

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
           ++  L  G         G + T  + +P+         YY V L  I+VGGQ++ V    
Sbjct: 301 TAGYLTLGLGGPSGAAPGFSTTQLLPSPNA------PTYYVVMLTGISVGGQQLSVPASA 354

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
                   GGT+VD+GT  T + P  +  L   F S M      T    A +   L  C+
Sbjct: 355 FA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPT----APSNGILDTCY 404

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
           +  G  T + P + L F  GA V L  +         S  CL        S G   ILGN
Sbjct: 405 NFAGYGTVTLPNVALTFGSGATVMLGADGIL------SFGCLAFAP--SGSDGGMAILGN 456

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q +++ V  D     +GFK   C
Sbjct: 457 VQQRSFEVRID--GTSVGFKPSSC 478


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 121/442 (27%), Positives = 176/442 (39%), Gaps = 66/442 (14%)

Query: 40  PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPP 99
           P++  +Q + + +  S+ RA H   P    +T T  +T       S G Y +S S GTPP
Sbjct: 49  PTETQFQRVANALRRSINRANHFNKPNLVASTNTAESTVI----ASQGEYLMSYSVGTPP 104

Query: 100 QIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
             I  I+DTGS ++W  C     C+ C +   P F P  S + + L C +  C  +   +
Sbjct: 105 FQILGIVDTGSDIIWLQCQ---PCEDCYNQTTPIFDPSQSKTYKTLPCSSNICQSVQSAA 161

Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-----PNRIIPNFLV 213
             C   NDE       C      Y + YG +  ++G    ETL L      +   P  ++
Sbjct: 162 -SCSSNNDE-------C-----EYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVI 208

Query: 214 GCSVLSSRQPAGIAGFGRGKTSLPSQLNLD-------KFSYCLLSHKFDDTTRTSSLILD 266
           GC   +            G    P  L          KFSYC L+  F  +  +S L   
Sbjct: 209 GCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYC-LAPLFSQSNSSSKLNFG 267

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
           + +  S +   G   TP V       +N    +Y++ L   +VG  R+            
Sbjct: 268 DEAVVSGR---GTVSTPIV------PKNGLG-FYFLTLEAFSVGDNRIEFGSSSFES-SG 316

Query: 327 GNGGTIVDSGTTFTFMAPE----LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
           G G  I+DSGTT T +  +    L   +AD    + V++ +            LR C+  
Sbjct: 317 GEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPS----------KFLRLCYRT 366

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
                 + P +  HFK GA+V L   + F  V EG  VC      R +  GP  I GN  
Sbjct: 367 TSSDELNVPVITAHFK-GADVELNPISTFIEVDEG-VVCFAF---RSSKIGP--IFGNLA 419

Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
            QN  V YDL  Q + FK   C
Sbjct: 420 QQNLLVGYDLVKQTVSFKPTDC 441


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 110/404 (27%), Positives = 168/404 (41%), Gaps = 57/404 (14%)

Query: 76  TTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFI 135
           T  T +S H Y  Y + LS GTPP      +DTGS L+W  C     C  C     P F 
Sbjct: 47  TAQTPVSVHHYD-YLMELSIGTPPVKTYAQVDTGSDLIWLQCI---PCTNCYKQLNPMFD 102

Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEG 194
           P+ SS+   +   +  CS ++  S     C+ +      NC     +Y   Y    +TEG
Sbjct: 103 PQSSSTYSNIAYGSESCSKLYSTS-----CSPD----QNNC-----NYTYSYEDDSITEG 148

Query: 195 IALSETLNLPNR-----IIPNFLVGC-----SVLSSRQPAGIAGFGRGKTSLPSQLNL-- 242
           +   ETL L +       +   + GC      V + ++  GI G GRG  SL SQ+    
Sbjct: 149 VLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKE-MGIIGLGRGPLSLVSQIGSSF 207

Query: 243 --DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
               FS CL+    + +  TS +    G   S+    G+  TP V+      +N    +Y
Sbjct: 208 GGKMFSQCLVPFHTNPSI-TSPMSFGKG---SEVLGNGVVSTPLVS------KNTHQAFY 257

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
           +V L  I+V    +  ++   +L+    G  ++DSGT  T +  + +  L +E     V+
Sbjct: 258 FVTLLGISVEDINLP-FNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEE-----VR 311

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
           N+     +  +   G + C+  P    G+   L  HF+ GA+V L     F  V +G   
Sbjct: 312 NKVALDPIPIDPTLGYQLCYRTPTNLKGT--TLTAHFE-GADVLLTPTQIFIPVQDG-IF 367

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           C    +      G   I GN    NY + +DL  Q + FK   C
Sbjct: 368 CFAFTSTFSNEYG---IYGNHAQSNYLIGFDLEKQLVSFKATDC 408


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 110/409 (26%), Positives = 171/409 (41%), Gaps = 71/409 (17%)

Query: 77  TTTNISSHSY--------GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS 128
           T+ N+ +H++        G + + ++FGTPPQ    ILDTGS + W  C     C +C  
Sbjct: 107 TSGNLKNHAHNNNLFDEDGNFLVDVAFGTPPQKFKLILDTGSSITWTQCK---ACVHCLK 163

Query: 129 SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG 188
                F   L+SS+   G   P                          + +  +Y + YG
Sbjct: 164 DSHRHF-DSLASSTYSFGSCIP--------------------------STVGNTYNMTYG 196

Query: 189 SGLTE-GIALSETLNL-PNRIIPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQL-- 240
              T  G    +T+ L P+ +   F  GC   +         G+ G G+G+ S  SQ   
Sbjct: 197 DKSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTAS 256

Query: 241 NLDK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
              K FSYCL     ++ +  S L  +  +S S    + L +T  VN P  +     S Y
Sbjct: 257 KFKKVFSYCLP----EENSIGSLLFGEKATSQS----SSLKFTSLVNGPGTSGLEE-SGY 307

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           Y+V L  I+VG +R+ +           + GTI+DSGT  T +    +   +    +   
Sbjct: 308 YFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQRAY---SALKAAFKK 359

Query: 360 KNRNYTRALGAEALTG-LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV--GE 416
               Y  + G       L  C+++ G K    PE  LHF  GA+V L   N   VV   +
Sbjct: 360 AMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRL---NGKRVVWGND 416

Query: 417 GSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            S +CL    + +++  P + I+GN Q  +  V YD+R +R+GF    C
Sbjct: 417 ASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGC 465


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 109/407 (26%), Positives = 159/407 (39%), Gaps = 77/407 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +S+S GTPP     I DTGS L W  C     C+ C     P F  K SS+ +   
Sbjct: 83  GEYFMSISIGTPPSKFLAIADTGSDLTWVQCK---PCQQCYKQNTPLFDKKKSSTYKTES 139

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +  C+ +      C +        S+N  +    Y   YG    T+G   +ET+++ +
Sbjct: 140 CDSITCNALSEHEEGCDE--------SRNACK----YRYSYGDESFTKGEVATETISIDS 187

Query: 206 R-----IIPNFLVGCSVLSSRQPAGIAGFGRGKT----------------SLPSQLNL-- 242
                   P    GC            G+  G T                SL SQL    
Sbjct: 188 SSGSPVSFPGTAFGC------------GYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSI 235

Query: 243 -DKFSYCLLSHKFDDTTRTSSLILDNGSSHSD-KKTTGLTYTPFVNNPSVAERNAFSVYY 300
             KFSYC LSH    T  TS + L   S  S   K + +  TP +             YY
Sbjct: 236 GKKFSYC-LSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDP-------ETYY 287

Query: 301 YVGLRRITVGGQRV-RVWHKYLTLDRDGN--GGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
           ++ L  ITVG  ++        +L+R     G  I+DSGTT T +    ++         
Sbjct: 288 FLTLEAITVGKTKLPYTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEES 347

Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
           +   +  +   G      L  CF   G+K    P + +HF  GA+V L   N F  + E 
Sbjct: 348 VTGAKRVSDPQGI-----LTHCFK-SGDKEIGLPTITMHFT-GADVKLSPINSFVKLSE- 399

Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             VCL+++   E +     I GN    ++ V YDL  + + F++  C
Sbjct: 400 DIVCLSMIPTTEVA-----IYGNMVQMDFLVGYDLETKTVSFQRMDC 441


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 98/387 (25%), Positives = 160/387 (41%), Gaps = 44/387 (11%)

Query: 98  PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
           PPQ I  ++DTGS L W  C      +  + + + +F P  SSS   + C +P C     
Sbjct: 82  PPQNISMVIDTGSELSWLRCN-----RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTR 136

Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI-IPNFLVGC- 215
           + +    C+ +         ++C + L    +  +EG   +E  +  N     N + GC 
Sbjct: 137 DFLIPASCDSD---------KLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCM 187

Query: 216 SVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
             +S   P       G+ G  RG  S  SQ+   KFSYC+       T      +L   S
Sbjct: 188 GSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-----SGTDDFPGFLLLGDS 242

Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
           + +    T L YTP +   S        V Y V L  I V G+ + +    L  D  G G
Sbjct: 243 NFT--WLTPLNYTPLIRI-STPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAG 299

Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNYTRALGAEALTGLRPCFDVP 383
            T+VDSGT FTF+   ++  L  +F++Q      + ++  +      +    + P F + 
Sbjct: 300 QTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISP-FRIR 358

Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPSIIL 438
                  P + L F+G AE+ +  +     V     G  S  C T   + +  G  + ++
Sbjct: 359 TGILHRLPTVSLVFEG-AEIAVSGQPLLYRVPHLTAGNDSVYCFTF-GNSDLMGMEAYVI 416

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           G+   QN ++E+DL+  R+G     C 
Sbjct: 417 GHHHQQNMWIEFDLQRSRIGLAPVQCD 443


>gi|384482418|pdb|3VLB|A Chain A, Crystal Structure Of Xeg-Edgp
 gi|384482420|pdb|3VLB|C Chain C, Crystal Structure Of Xeg-Edgp
          Length = 413

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/392 (24%), Positives = 171/392 (43%), Gaps = 61/392 (15%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
            ++D G   +W  C  +Y                +SS+ R + C+  +CS     SI C 
Sbjct: 37  LVVDLGGRFLWVDCDQNY----------------VSSTYRPVRCRTSQCSL--SGSIACG 78

Query: 164 DCNDEPLATSKNCT-QICPSYLVL---YGSGLTEGIALSETLN--LPNRII--PNFLVGC 215
           DC + P     N T  + P   V+    G  + E +   E+ +     R++  P F+  C
Sbjct: 79  DCFNGPRPGCNNNTCGVFPENPVINTATGGEVAEDVVSVESTDGSSSGRVVTVPRFIFSC 138

Query: 216 SVLSSRQ-----PAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
           +  S  Q       G+AG GR + +LPSQ         KF+ CL       +T ++S+I+
Sbjct: 139 APTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL-----SGSTSSNSVII 193

Query: 266 DNGSSH--------SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGGQR 313
                +        SDK    LTYTP + NP    + + +   SV Y++G++ I +  + 
Sbjct: 194 FGNDPYTFLPNIIVSDKT---LTYTPLLTNPVSTSATSTQGEPSVEYFIGVKSIKINSKI 250

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           V +    L++   G GGT + +   +T +   +++ + + F+ +    RN TR       
Sbjct: 251 VALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKESAA-RNITRVASVAPF 309

Query: 374 TGLRPCFDVPGEKTG-SFPELKLHFKGGAEV-TLPVENYFAVVGEGSAVCLTVVTDREAS 431
                  ++   + G S P + L  +  + V T+   N    + + + VCL VV D  ++
Sbjct: 310 GACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND-NVVCLGVV-DGGSN 367

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
              SI++G  Q+++  V++DL   R+GF   L
Sbjct: 368 LRTSIVIGGHQLEDNLVQFDLATSRVGFSGTL 399


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 117/425 (27%), Positives = 177/425 (41%), Gaps = 57/425 (13%)

Query: 52  VSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
           + S L++ L  +N   +  +TT    + ++   +   Y + +  GTP + +  + DTGS 
Sbjct: 101 IQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSA--NYFVVVGLGTPKRDLSLVFDTGSD 158

Query: 112 LVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCND 167
           L W       QC+ C+ S    +   F P  SSS   + C +  C+ +    I+ R C+ 
Sbjct: 159 LTW------TQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSAGIKSR-CSS 211

Query: 168 EPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNL-PNRIIPNFLVGCSVLSS---RQ 222
              A    C      Y + YG   T  G    E L +    I+ +FL GC   +      
Sbjct: 212 STTA----CI-----YGIQYGDKSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFSG 262

Query: 223 PAGIAGFGRGKTSLPSQLN--LDK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
            AG+ G GR   S   Q +   +K FSYCL S      T +S   L  G+S +      L
Sbjct: 263 SAGLIGLGRHPISFVQQTSSIYNKIFSYCLPS------TSSSLGHLTFGASAA--TNANL 314

Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
            YTP     +++  N F   Y + +  I+VGG ++      ++      GG+I+DSGT  
Sbjct: 315 KYTPL---STISGDNTF---YGLDIVGISVGGTKLPA----VSSSTFSAGGSIIDSGTVI 364

Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
           T +AP  +  L   F   M K   Y  A   + L     C+D  G K  S P++   F G
Sbjct: 365 TRLAPTAYAALRSAFRQGMEK---YPVA-NEDGL--FDTCYDFSGYKEISVPKIDFEFAG 418

Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
           G  V LP+     +      VCL    +   +     I GN Q +   V YD+   R+GF
Sbjct: 419 GVTVELPLVGIL-IGRSAQQVCLAFAAN--GNDNDITIFGNVQQKTLEVVYDVEGGRIGF 475

Query: 460 KQQLC 464
               C
Sbjct: 476 GAAGC 480


>gi|384482417|pdb|3VLA|A Chain A, Crystal Structure Of Edgp
          Length = 413

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/392 (24%), Positives = 171/392 (43%), Gaps = 61/392 (15%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
            ++D G   +W  C  +Y                +SS+ R + C+  +CS     SI C 
Sbjct: 37  LVVDLGGRFLWVDCDQNY----------------VSSTYRPVRCRTSQCSL--SGSIACG 78

Query: 164 DCNDEPLATSKNCT-QICPSYLVL---YGSGLTEGIALSETLN--LPNRII--PNFLVGC 215
           DC + P     N T  + P   V+    G  + E +   E+ +     R++  P F+  C
Sbjct: 79  DCFNGPRPGCNNNTCGVFPENPVINTATGGEVAEDVVSVESTDGSSSGRVVTVPRFIFSC 138

Query: 216 SVLSSRQ-----PAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
           +  S  Q       G+AG GR + +LPSQ         KF+ CL       +T ++S+I+
Sbjct: 139 APTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL-----SGSTSSNSVII 193

Query: 266 DNGSSH--------SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGGQR 313
                +        SDK    LTYTP + NP    + + +   SV Y++G++ I +  + 
Sbjct: 194 FGNDPYTFLPNIIVSDKT---LTYTPLLTNPVSTSATSTQGEPSVEYFIGVKSIKINSKI 250

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           V +    L++   G GGT + +   +T +   +++ + + F+ +    RN TR       
Sbjct: 251 VALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKESAA-RNITRVASVAPF 309

Query: 374 TGLRPCFDVPGEKTG-SFPELKLHFKGGAEV-TLPVENYFAVVGEGSAVCLTVVTDREAS 431
                  ++   + G S P + L  +  + V T+   N    + + + VCL VV D  ++
Sbjct: 310 GACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND-NVVCLGVV-DGGSN 367

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
              SI++G  Q+++  V++DL   R+GF   L
Sbjct: 368 LRTSIVIGGHQLEDNLVQFDLATSRVGFSGTL 399


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 115/427 (26%), Positives = 171/427 (40%), Gaps = 53/427 (12%)

Query: 52  VSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
           V++++ R+++  N   K +   +T T  +    S G Y +S S GTPP  I  ++DTGS 
Sbjct: 60  VANAMRRSINRANHFNKKSFVASTNTAESTVKASQGEYLMSYSVGTPPFEILGVVDTGSG 119

Query: 112 LVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLA 171
           + W  C    +C+ C     P F P  S + + L C +  C  +    I    C+ +   
Sbjct: 120 ITWMQCQ---RCEDCYEQTTPIFDPSKSKTYKTLPCSSNMCQSV----ISTPSCSSD--- 169

Query: 172 TSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-----PNRIIPNFLVGCS-------VL 218
                 +I   Y + YG G  ++G    ETL L      +   PN ++GC          
Sbjct: 170 ------KIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVIGCGHNNKGTFQG 223

Query: 219 SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
                 G+ G      S  S     KFSYC L+  F  +  +S L   + +  S     G
Sbjct: 224 EGSGVVGLGGGPVSLISQLSSSIGGKFSYC-LAPMFSQSNSSSKLNFGDAAVVSG---LG 279

Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYLTLDRDGNGGTIVDSGT 337
              TP V+      +    V+YY+ L   +VG +R+  V     +   +G G  I+DSGT
Sbjct: 280 AVSTPLVS------KTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDSGT 333

Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
           T T +  E +  L       +  NR       ++    L  C+          P +  HF
Sbjct: 334 TLTLLPQEDYSNLESAVADAIQANRV------SDPSNFLSLCYQTTPSGQLDVPVITAHF 387

Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
           K GA+V L   + F  V EG  VC    +    S     I GN    N  V YDL  Q +
Sbjct: 388 K-GADVELNPISTFVQVAEG-VVCFAFHSSEVVS-----IFGNLAQLNLLVGYDLMEQTV 440

Query: 458 GFKQQLC 464
            FK   C
Sbjct: 441 SFKPTDC 447


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 111/393 (28%), Positives = 165/393 (41%), Gaps = 62/393 (15%)

Query: 89  YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           Y ++L FGTP  PQ++  ++DTGS L W       QC+ C+SS     K P F P  SS+
Sbjct: 122 YVVTLGFGTPAVPQVL--LIDTGSDLSWV------QCQPCNSSTCYPQKDPVFDPSASST 173

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
              + C +  C  +  +S     C +     S +   +C  Y + YG+G  T G+  +ET
Sbjct: 174 YAPVPCGSEACRDLDPDSYA-NGCTN-----SSSGASLC-QYGIQYGNGDTTVGVYSTET 226

Query: 201 LNLPNR---IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLS 251
           L L      ++ NF  GC ++         G+ G G    SL SQ        FSYCL +
Sbjct: 227 LTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPA 286

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
                   T+  +     +     T G  +TP      V E    + +Y V L  I+VGG
Sbjct: 287 GN-----STAGFLALGAPATGGNNTAGFQFTPL----QVVE----TTFYLVKLTGISVGG 333

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           +++ +            GG I+DSGT  T +    +  L   F S M    +    L   
Sbjct: 334 KQLDIEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAM----SAYPLLPPN 383

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
               L  C+D  G    + P + L F+GG  + L V +   + G     CL  V    AS
Sbjct: 384 DDEDLDTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVLLDG-----CLAFVAG--AS 436

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            G + I+GN   + + V YD     +GF+   C
Sbjct: 437 DGDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 104/380 (27%), Positives = 161/380 (42%), Gaps = 56/380 (14%)

Query: 93  LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLLGCQNPK 151
           +  GTP      ++DTGS L W  C+    C   C     P F PK SS+   +GC   +
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCS---PCLVSCHRQSGPVFNPKSSSTYASVGCSAQQ 57

Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRIIPN 210
           CS +   ++    C+          + +C  Y   YG S  + G    +T++  +  +PN
Sbjct: 58  CSDLPSATLNPSACSS---------SNVCI-YQASYGDSSFSVGYLSKDTVSFGSTSLPN 107

Query: 211 FLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLI 264
           F  GC   +     + AG+ G  R K SL  QL       F+YCL       ++   SL 
Sbjct: 108 FYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL---PSSSSSGYLSLG 164

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
             N   +S        YTP V++ S+ +       Y++ L  +TV G  + V     +  
Sbjct: 165 SYNPGQYS--------YTPMVSS-SLDDS-----LYFIKLSGMTVAGNPLSVSSSAYSSL 210

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
                 TI+DSGT  T +   ++  L+    + M   +  +RA    A + L  CF    
Sbjct: 211 P-----TIIDSGTVITRLPTSVYSALSKAVAAAM---KGTSRA---SAYSILDTCFKGQA 259

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
            +  S P + + F GGA + L  +N    V + S  CL     R A+     I+GN Q Q
Sbjct: 260 SRV-SAPAVTMSFAGGAALKLSAQNLLVDVDD-STTCLAFAPARSAA-----IIGNTQQQ 312

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
            + V YD+++ R+GF    C
Sbjct: 313 TFSVVYDVKSSRIGFAAGGC 332


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 107/387 (27%), Positives = 158/387 (40%), Gaps = 62/387 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  S GTPPQ +   +DT +   W PC     C  C +S    F P  S+S R + C 
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAG---CAGCPTSSAAPFDPASSASYRTVPCG 168

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P C+               P A      + C   L    S L   ++  ++L +    +
Sbjct: 169 SPLCA-------------QAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGNAV 214

Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
             +  GC   +  ++  P G+ G GRG  S  SQ   +    FSYCL S K   F  T R
Sbjct: 215 KAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLR 274

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
               +  NG     K T      P + NP        S  YYV +  I VG + V +   
Sbjct: 275 ----LGRNGQPQRIKTT------PLLANPH------RSSLYYVNMTGIRVGRKVVPI--- 315

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA--EALTGLR 377
               D     GT++DSGT FT +    +  + DE            R +GA   +L G  
Sbjct: 316 -PAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEV----------RRRVGAPVSSLGGFD 364

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
            CF+       ++P + L F G  +VTLP EN       G+  CL +    +       +
Sbjct: 365 TCFNT---TAVAWPPVTLLFDG-MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNV 420

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + + Q QN+ V +D+ N R+GF ++ C
Sbjct: 421 IASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 104/409 (25%), Positives = 178/409 (43%), Gaps = 59/409 (14%)

Query: 78  TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-SFIP 136
           +++ S+  YG Y+  +  GTPP+     +DTGS ++W  C     C   S   I  +F  
Sbjct: 73  SSDPSTLGYGLYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFD 132

Query: 137 KL-SSSSRLLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEG 194
            + SS++ L+ C +P C S I   + QC          S    Q   ++    GSG T G
Sbjct: 133 TVGSSTAALVPCSDPMCASAIQGAAAQC----------SPQVNQCSYTFQYEDGSG-TSG 181

Query: 195 IALSETL--------NLPNRIIPN--FLVGCSVLSS-------RQPAGIAGFGRGKTSLP 237
           + +S+ +        + P  +  +   + GCS   S       +   GI GFG G+ S+ 
Sbjct: 182 VYVSDAMYFDMILGQSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVV 241

Query: 238 SQLNLDKFSYCLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
           SQL+    +  + SH    D      L+L       +     + Y+P V  PS       
Sbjct: 242 SQLSSRGITPKVFSHCLKGDGNGGGILVL------GEILEPSIVYSPLV--PS------- 286

Query: 297 SVYYYVGLRRITVGGQRVRVWHK-YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
             +Y + L+ I V GQ + +    + T D+    GTI+DSGTT +++  E ++PL +   
Sbjct: 287 QPHYNLNLQSIAVNGQVLSINPAVFATSDKR---GTIIDSGTTLSYLVQEAYDPLVNAVD 343

Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG 415
           + + +      + G++       C+ V      SFP +  +F+GGA + L    Y    G
Sbjct: 344 TAVSQFATSFISKGSQ-------CYLVLTSIDDSFPTVSFNFEGGASMDLKPSQYLLNRG 396

Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 +  +  ++   G + ILG+  +++  V YDL  Q++G+    C
Sbjct: 397 FQDGAKMWCIGFQKVQEGVT-ILGDLVLKDKIVVYDLARQQIGWTNYDC 444


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/396 (25%), Positives = 162/396 (40%), Gaps = 54/396 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRLLG 146
           Y   L  G+PP+     +DTGS ++W  C++   C   S   IP   F P  S ++ L+ 
Sbjct: 90  YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN--- 202
           C + +CS      +  +  +    A +  C      Y   YG G  T G  +S+ L+   
Sbjct: 150 CSDQRCS------LGLQSSDSVCAAQNNQC-----GYTFQYGDGSGTSGYYVSDLLHFDT 198

Query: 203 -LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
            L   ++ N     + GCS L +       R   GI GFG+   S+ SQL     +  + 
Sbjct: 199 ILGGSVMKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVF 258

Query: 251 SHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           SH    D +    L+L       +     + YTP V  PS         +Y + L+ I V
Sbjct: 259 SHCLKGDDSGGGILVL------GEIVEPNIVYTPLV--PS-------QPHYNLNLQSIYV 303

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
            GQ + +           N GTI+DSGTT  ++    ++P      S +  + +   + G
Sbjct: 304 NGQTLAIDPSVFATSS--NQGTIIDSGTTLAYLTEAAYDPFISAITSTVSPSVSPYLSKG 361

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
            +       C+         FP++ L+F GG  + L  ++Y       +   L  V  ++
Sbjct: 362 NQ-------CYLTSSSINDVFPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQK 414

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             G    ILG+  +++    YD+  QR+G+    CK
Sbjct: 415 IQGQEITILGDLVLKDKIFVYDIAGQRIGWANYDCK 450


>gi|223974335|gb|ACN31355.1| unknown [Zea mays]
          Length = 91

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 46/85 (54%), Positives = 56/85 (65%), Gaps = 9/85 (10%)

Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGS--AVCLTVVTDREASGG-------PSIILG 439
           + PEL   F+GGA + LPVENYF V G G+  A+CL VVTD     G       P+IILG
Sbjct: 2   ALPELSFRFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILG 61

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           +FQ QNY VEYDL  +RLGF++Q C
Sbjct: 62  SFQQQNYLVEYDLEKERLGFRRQSC 86


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 125/440 (28%), Positives = 174/440 (39%), Gaps = 91/440 (20%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSH---SYGGYSISLSF 95
           +PS+   + L      S++R    +          T  T+  I S    S G Y ++L  
Sbjct: 48  DPSKTQAERLTDAFRRSVSRVGRFR---------PTAMTSDGIQSRIVPSAGEYLMNLYI 98

Query: 96  GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWI 155
           GTPP  +  I+DTGS L W  C     C +C    +P F PK SS+ R   C    C  +
Sbjct: 99  GTPPVPVIAIVDTGSDLTWTQCR---PCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLAL 155

Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-----IP 209
             +    R C+ E     K CT     +   Y  G  T G   SETL + +        P
Sbjct: 156 GKD----RSCSKE-----KKCT-----FRYSYADGSFTGGNLASETLTVDSTAGKPVSFP 201

Query: 210 NFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
            F  GC   S     +  +GI G G G+ SL SQL       FSYCLL    D +   SS
Sbjct: 202 GFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSS--ISS 259

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
            I  N  +       G   TP                      R+   G     + K   
Sbjct: 260 RI--NFGASGRVSGYGTVSTPL---------------------RLPYKG-----YSKKTE 291

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
           ++    G  IVDSGTT+TF+  E +  L ++ V+  +K +      G  +L     C++ 
Sbjct: 292 VEE---GNIIVDSGTTYTFLPQEFYSKL-EKSVANSIKGKRVRDPNGIFSL-----CYNT 342

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
             E     P +  HFK  A V L   N F  + E   VC TV    +       +LGN  
Sbjct: 343 TAEINA--PIITAHFK-DANVELQPLNTFMRMQE-DLVCFTVAPTSDIG-----VLGNLA 393

Query: 443 MQNYYVEYDLRNQRLGFKQQ 462
             N+ V +DLR +R GF ++
Sbjct: 394 QVNFLVGFDLRKKR-GFSKK 412



 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 66/137 (48%), Gaps = 14/137 (10%)

Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT 387
            G  IVDSGTT+T++  E +  L +E V+  +K +      G  +L     C++   ++ 
Sbjct: 417 EGNIIVDSGTTYTYLPLEFYVKL-EESVAHSIKGKRVRDPNGISSL-----CYNTTVDQI 470

Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYY 447
            + P +  HFK  A V L   N F  + E   VC TV+   +       ILGN    N+ 
Sbjct: 471 DA-PIITAHFKD-ANVELQPWNTFLRMQE-DLVCFTVLPTSDIG-----ILGNLAQVNFL 522

Query: 448 VEYDLRNQRLGFKQQLC 464
           V +DLR +R+ FK   C
Sbjct: 523 VGFDLRKKRVSFKAADC 539


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 109/390 (27%), Positives = 161/390 (41%), Gaps = 70/390 (17%)

Query: 89  YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           Y  ++SFGTP  PQ++  ++DTGS L W       QCK CSS      K P F P  SS+
Sbjct: 112 YVATVSFGTPAVPQVV--VIDTGSDLTWL------QCKPCSSGQCSPQKDPLFDPSHSST 163

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
              + C + +C  +  ++      N +P             + + Y  G  T G+   + 
Sbjct: 164 YSAVPCASGECKKLAADAYGSGCSNGQPCG-----------FAISYVDGTSTVGVYGKDK 212

Query: 201 LNL-PNRIIPNFLVGCSVLSSRQPAGIAGFGRGKT---SLPSQ-LNLDKFSYCLLSHKFD 255
           L L P  I+ +F  GC    S  P    G         SL +Q      FSYCL +    
Sbjct: 213 LTLAPGAIVKDFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVN-- 270

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
             ++   L    G     +  +G  +TP    P    +  FS    V L  ITVGG+++ 
Sbjct: 271 --SKPGFLAFGAG-----RNPSGFVFTPMGRVPG---QPTFST---VTLAGITVGGKKLD 317

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +     +      GG IVDSGT  T +   ++  L   F   M   + Y    G      
Sbjct: 318 LRPSAFS------GGMIVDSGTVVTVLQSTVYRALRAAFREAM---KAYRLVHG-----D 363

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGP 434
           L  C+D+ G K    P++ L F GGA + L V N   V G     CL    T ++ + G 
Sbjct: 364 LDTCYDLTGYKNVVVPKIALTFSGGATINLDVPNGILVNG-----CLAFAETGKDGTAG- 417

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +LGN   + + V +D    + GF+ + C
Sbjct: 418 --VLGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 109/392 (27%), Positives = 164/392 (41%), Gaps = 58/392 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +S GTPP  I  I DTGS L W  C     C  C   + P F P+ S+S R + 
Sbjct: 23  GHYLMEVSIGTPPFKIYGIADTGSDLTWTSCV---PCNKCYKQRNPIFDPQKSTSYRNIS 79

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-- 203
           C          +S  C   +    +  K+C     +Y   Y S  +T+G+   ET+ L  
Sbjct: 80  C----------DSKLCHKLDTGVCSPQKHC-----NYTYAYASAAITQGVLAQETITLSS 124

Query: 204 -PNRIIP--NFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
                +P    + GC   ++     +  GI G G G  S  SQ+       +FS CL+  
Sbjct: 125 TKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPF 184

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
              D + +S + L  GS  S K          V+ P VA+++     Y+V L  I+VG  
Sbjct: 185 H-TDVSVSSKMSLGKGSEVSGKGV--------VSTPLVAKQD--KTPYFVTLLGISVGNT 233

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            +            GN    +DSGT  T +  +L+    D  V+Q V++    + +  + 
Sbjct: 234 YLHFNGSSSQSVEKGN--VFLDSGTPPTILPTQLY----DRLVAQ-VRSEVAMKPVTNDL 286

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
             G + C+       G  P L  HF+GG    LP + +  V  +    CL   T+  + G
Sbjct: 287 DLGPQLCYRTKNNLRG--PVLTAHFEGGDVKLLPTQTF--VSPKDGVFCLG-FTNTSSDG 341

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           G   + GNF   NY + +DL  Q + FK   C
Sbjct: 342 G---VYGNFAQSNYLIGFDLDRQVVSFKPMDC 370


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 102/396 (25%), Positives = 167/396 (42%), Gaps = 69/396 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ    I+D+GS + + PC +   C+ C + + P F P LSSS   + 
Sbjct: 87  GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCAS---CEQCGNHQDPRFQPDLSSSYSPVK 143

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C N  C+    +  QC    +   A   + + +    +V +G          E+   P R
Sbjct: 144 C-NVDCT-CDSDKKQC--TYERQYAEMSSSSGVLGEDIVSFGR---------ESELKPQR 190

Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
            +     GC       L S+   GI G GRG+ S+  QL       D FS C        
Sbjct: 191 AV----FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIG- 245

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
                +++L    + SD      +++  + +P          YY + L+ I V G+ +RV
Sbjct: 246 ---GGAMVLGGVPAPSDMV---FSHSDPLRSP----------YYNIELKEIHVAGKALRV 289

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK-------NRNYTRALG 369
             +      +   GT++DSGTT+ ++  + F    D   S++         + NY     
Sbjct: 290 DSRVF----NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICF 345

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDR 428
           A A   +    +V       FP++ + F  G +++L  ENY F       A CL V  + 
Sbjct: 346 AGAGRNVSKLHEV-------FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNG 398

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +    P+ +LG   ++N  V YD  N+++GF +  C
Sbjct: 399 K---DPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 431


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 108/390 (27%), Positives = 158/390 (40%), Gaps = 62/390 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
           G Y   L  GTP      ++DTGS L W  C+    C   C     P + P+ SS+   +
Sbjct: 132 GNYVTELGLGTPATSYAMVVDTGSSLTWLQCS---PCVVSCHRQVGPLYDPRASSTYATV 188

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
            C   +C  +   ++    C+            +C  Y   YG S  + G    +T++  
Sbjct: 189 PCSASQCDELQAATLNPSACSVR---------NVC-IYQASYGDSSFSVGYLSRDTVSFG 238

Query: 205 NRIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
           +   PNF  GC      L  R  AG+ G  R K SL  QL       FSYCL        
Sbjct: 239 SGSYPNFYYGCGQDNEGLFGRS-AGLIGLARNKLSLLYQLAPSLGYSFSYCL-------P 290

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           T  S+  L  G   S       +YTP      +A  +  +  Y+V L  ++VGG  + V 
Sbjct: 291 TPASTGYLSIGPYTSGH----YSYTP------MASSSLDASLYFVTLSGMSVGGSPLAVS 340

Query: 318 -HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
             +Y +L       TI+DSGT  T +   ++  L+    + MV  ++      A A + L
Sbjct: 341 PAEYSSLP------TIIDSGTVITRLPTAVYTALSKAVAAAMVGVQS------APAFSIL 388

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPS 435
             CF     +    P + + F GGA + L  +N    V + S  CL    TD       +
Sbjct: 389 DTCFQGQASQL-RVPAVAMAFAGGATLKLATQNVLIDV-DDSTTCLAFAPTDS------T 440

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            I+GN Q Q + V YD+   R+GF    C 
Sbjct: 441 TIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 102/404 (25%), Positives = 162/404 (40%), Gaps = 64/404 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y +++  G P ++    +DTGS L W  C     C+ C+      + PK    +R++ 
Sbjct: 29  GLYYMAMRIGNPAKLYYLDMDTGSDLTWLQC--DAPCRSCAVGPHGLYDPK---RARVVD 83

Query: 147 CQNPKCSWIHHE-----SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
           C+ P C+ +        S   R C+                Y V Y  G  T GI + +T
Sbjct: 84  CRRPTCAQVQRGGQFTCSGDVRQCD----------------YEVDYVDGSSTMGILVEDT 127

Query: 201 LNL----PNRIIPNFLVGCSVLS----SRQPA---GIAGFGRGKTSLPSQLNLDKFSYCL 249
           + L      R     ++GC        ++ PA   G+ G    K SLPSQL     +  +
Sbjct: 128 ITLVLTNGTRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNV 187

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           + H     +     +       +     G+T+TP +  P V         Y   LR I  
Sbjct: 188 IGHCLAGGSNGGGYLF---FGDTLVPALGMTWTPMIGRPLVEG-------YQARLRSIKY 237

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN---RNYTR 366
           GG+ +      L    D  GG + DSGT+FT++ P  +  +    V Q  ++   R  T 
Sbjct: 238 GGEVLE-----LEGTTDDVGGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTD 292

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG------GAEVTLPVENYFAVVGEGSAV 420
                   G  P F+   + +  F  + L F G      G  + L  E Y  V  +G+ V
Sbjct: 293 TTLPFCWRGPSP-FESVADVSAYFKTVTLDFGGSTWWSSGKLLELSPEGYLIVSTQGN-V 350

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           CL V+    AS   + ILG+  M+ Y V YD   +++G+ ++ C
Sbjct: 351 CLGVLDASVASLEVTNILGDISMRGYLVVYDNMREQIGWVRRNC 394


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 110/394 (27%), Positives = 159/394 (40%), Gaps = 61/394 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + L  GTPP  I   +DTGS L+W  C     C  C +   P F P  SS+   + 
Sbjct: 62  GQYLMELYIGTPPIKISGTVDTGSDLIWVQCV---PCLGCYNQINPMFDPLKSSTYTNIS 118

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +P C   +       +C+ E     K C      Y   Y  S LT+G+   ET+ L +
Sbjct: 119 CDSPLCYKPY-----IGECSPE-----KRC-----DYTYGYADSSLTKGVLAQETVTLTS 163

Query: 206 RI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
                  +   L GC   ++        G+ G G G TSL SQ+       KFS CL+  
Sbjct: 164 NTGKPISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPF 223

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
              D T +S +    GS    +   G+  TP V      +R      YYV L  I+    
Sbjct: 224 -LTDITISSQMSFGKGSEVLGE---GVVTTPLV------QREQDMTSYYVTLLGIS---- 269

Query: 313 RVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
              V   YL ++     G  +VDSGT    +  +L++ +  E     VKN+     +  +
Sbjct: 270 ---VEDTYLPMNSTIEKGNMLVDSGTPPNILPQQLYDRVYVE-----VKNKVPLEPITDD 321

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDREA 430
              G + C+       G  P L  HF+G   +  P++ +     E   V CL +     +
Sbjct: 322 PSLGPQLCYRTQTNLKG--PTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANS 379

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             G   I GNF   NY + +DL  Q + FK   C
Sbjct: 380 DPG---IYGNFAQTNYLIGFDLDRQIVSFKPTDC 410


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 121/475 (25%), Positives = 190/475 (40%), Gaps = 98/475 (20%)

Query: 22  FPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNI 81
           F + + S    LS F+ NPS+  +  L      S++RA H +     T +  +   + N 
Sbjct: 35  FSTDLISRDSPLSPFY-NPSETQFDRLQKAFHRSISRANHFRANGVSTNSIQSPVISNN- 92

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
                G Y +++S GTPP  +  I DTGS L+W  C     C  C     P F P  S +
Sbjct: 93  -----GEYLMNISLGTPPVSMHGIADTGSDLLWRQCK---PCDSCYEQIEPIFDPAKSKT 144

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
            ++L C+   CS +  +      C+D+       C      Y   YG G  T G    +T
Sbjct: 145 YQILSCEGKSCSNLGGQG----GCSDD-----NTCI-----YSYSYGDGSHTSGDLAVDT 190

Query: 201 LNLPNRI-----IPNFLVGCSVLSSRQPAGIAGFGRGKT----------------SLPSQ 239
           L + +       +P  + GC            G   G T                S+ SQ
Sbjct: 191 LTIGSTTGRPVSVPKVVFGC------------GHNNGGTFELHGSGLVGLGGGPLSMISQ 238

Query: 240 LNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
           L      +FSYCL+    +D + +S +      S       G   TP      +A R   
Sbjct: 239 LRPLIGGRFSYCLVPLG-NDPSVSSKMHF---GSRGIVSGAGAVSTP------LASRQP- 287

Query: 297 SVYYYVGLRRITVGGQRV--RVWHKYLTLDRDGN-GGTIVDSGTTFTFMAPELFEPLADE 353
             +YY+ L  ++VG +++  + + K  +   D + G  I+DSGTT T +  + +  L   
Sbjct: 288 DTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESN 347

Query: 354 FVSQM----VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN 409
            VS +    V++ N   +L    L+GLR             P +  HF  GA++ L   N
Sbjct: 348 VVSAIGGKPVRDPNNVFSLCYSNLSGLR------------IPTITAHFV-GADLELKPLN 394

Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            F  V E    C  ++   + +     I GN    N+ V YDL+++ + FK   C
Sbjct: 395 TFVQVQE-DLFCFAMIPVSDLA-----IFGNLAQMNFLVGYDLKSRTVSFKPTDC 443


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 102/388 (26%), Positives = 159/388 (40%), Gaps = 54/388 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
           G Y   L  GTP      ++D+GS L W  C     C   C     P + P+ SS+   +
Sbjct: 106 GNYITRLGLGTPTTTYVMVVDSGSSLTWLQCA---PCAVSCHPQAGPLYDPRASSTYAAV 162

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP 204
            C  P+C+ +   ++    C+          + +C  Y   YG G  + G    +T++L 
Sbjct: 163 PCSAPQCAELQAATLNPSSCSG---------SGVC-QYQASYGDGSFSFGYLSKDTVSLS 212

Query: 205 NR-IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDT 257
           +    P F  GC   +V    + AG+ G  R K SL SQL     + F+YCL       +
Sbjct: 213 SSGSFPGFYYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCL-----PTS 267

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV- 316
              S+  L  GS+  +K     +YT  V++   A        Y+V L  ++V G  + V 
Sbjct: 268 AAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDAS------LYFVSLAGMSVAGSPLAVP 321

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
             +Y +L       TI+DSGT  T +   ++  L+    + +        ++       L
Sbjct: 322 SSEYGSLP------TIIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYSI-------L 368

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
           + CF     K    P + + F GGA + L   N    V E +       TD  A      
Sbjct: 369 QTCFKGQVAKL-PVPAVNMAFAGGATLRLTPGNVLVDVNETTTCLAFAPTDSTA------ 421

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN Q Q + V YD++  R+GF    C
Sbjct: 422 IIGNTQQQTFSVVYDVKGSRIGFAAGGC 449


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 110/394 (27%), Positives = 154/394 (39%), Gaps = 63/394 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  SL  GTP   +   LDTGS   W  C     C  C   + P F P  SS+   + C 
Sbjct: 139 YVASLRLGTPATELVVELDTGSDQSWVQCK---PCADCYEQRDPVFDPTASSTYSAVPCG 195

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNLPNR- 206
             +C     E        +     +KNC      Y V Y     T G    +TL L    
Sbjct: 196 AREC----QELASSSSSRNCSSDNNKNC-----PYEVSYDDDSHTVGDLARDTLTLSPSP 246

Query: 207 ------IIPNFLVGCSVLSSRQPAGIAG-------FGRGKTSLPSQLNLD---KFSYCLL 250
                  +P F+ GC        AG  G        G GK SLPSQ+       FSYCL 
Sbjct: 247 SPSPADTVPGFVFGC----GHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLP 302

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
           S      +    L     ++ ++ + T +          V  ++  S  YY+ L  I V 
Sbjct: 303 SSP----SAAGYLSFGGAAARANAQFTEM----------VTGQDPTS--YYLNLTGIVVA 346

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           G+ ++V             GTI+DSGT F+ + P  +  L   F S M + R Y RA  +
Sbjct: 347 GRAIKVPASAFAT----AAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYR-YKRAPSS 401

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
                   C+D  G +T   P ++L F  GA V L          + +  CL  V + + 
Sbjct: 402 PIFD---TCYDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDVAQTCLAFVPNHDL 458

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 ILGN Q +   V YD+ +QR+GF ++ C
Sbjct: 459 G-----ILGNTQQRTLAVIYDVGSQRIGFGRKGC 487


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 106/398 (26%), Positives = 166/398 (41%), Gaps = 59/398 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  GTPP+     +DTGS ++W  CT+   C   S  +I    F P +SSS+ L
Sbjct: 82  GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--QICPSYLVLYGSGL-TEGIALSETL 201
           + C + +C    + + Q          T   C+   +C SY   YG G  T G  +S+ +
Sbjct: 142 VSCSDRRC----YSNFQ----------TESGCSPNNLC-SYSFKYGDGSGTSGFYISDFM 186

Query: 202 NLPNRIIPN--------FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFS 246
           +    I           F+ GCS L +       R   GI G G+G  S+ SQL +   +
Sbjct: 187 SFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLA 246

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
             + SH          +++       D       YTP V  PS         +Y V L+ 
Sbjct: 247 PRVFSHCLKGDKSGGGIMVLGQIKRPDT-----VYTPLV--PS-------QPHYNVNLQS 292

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I V GQ + +     T+      GTI+D+GTT  ++  E + P      + + +   Y R
Sbjct: 293 IAVNGQILPIDPSVFTIAT--GDGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQ---YGR 347

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
            +  E+      CF++       FPE+ L F GGA + L    Y  +    S   +  + 
Sbjct: 348 PITYESYQ----CFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQIF-SSSGSSIWCIG 402

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +  S     ILG+  +++  V YDL  QR+G+ +  C
Sbjct: 403 FQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 103/402 (25%), Positives = 162/402 (40%), Gaps = 64/402 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+P +     +DTGS ++W  C     C + S   I    F    SS++ L
Sbjct: 81  GLYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAAL 140

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
           + C +P CS+        +    E  + +  C     SY   YG G  T G  +S+T+  
Sbjct: 141 VSCGDPICSY------AVQTATSECSSQANQC-----SYTFQYGDGSGTTGYYVSDTMYF 189

Query: 203 ----LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNL----- 242
               L   ++ N     + GCS   S       +   GI GFG G  S+ SQL+      
Sbjct: 190 DTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTP 249

Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
             FS+CL   +          IL+            + Y+P V  PS         +Y +
Sbjct: 250 KVFSHCLKGGENGGGVLVLGEILE----------PSIVYSPLV--PS-------QPHYNL 290

Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
            L+ I V GQ + +           N GTIVDSGTT  ++  E + P      + + +  
Sbjct: 291 NLQSIAVNGQLLPIDSNVFATTN--NQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFS 348

Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
               + G +       C+ V       FP++ L+F GGA + L  E+Y    G      +
Sbjct: 349 KPIISKGNQ-------CYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAM 401

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +  ++   G + ILG+  +++    YDL NQR+G+    C
Sbjct: 402 WCIGFQKVEQGFT-ILGDLVLKDKIFVYDLANQRIGWADYDC 442


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 107/397 (26%), Positives = 157/397 (39%), Gaps = 57/397 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS--SKIPSFIPKLSSSSRL 144
           G Y   +  GTP +     +DTGS ++W  C    +C   S     +  +  K S++S  
Sbjct: 153 GLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDA 212

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           +GC +  CS             D PL   K   Q    Y VLYG G +      +     
Sbjct: 213 VGCDDNFCSLY-----------DGPLPGCKPGLQCL--YSVLYGDGSSTTGYFVQDFVQY 259

Query: 205 NRIIPNF---------LVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
           NRI  NF         + GC          SS    GI GFG+  +S+ SQL        
Sbjct: 260 NRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKK 319

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           + SH  D+        +D G   +     G    P VN   + +  A   +Y V ++ I 
Sbjct: 320 VFSHCLDN--------VDGGGIFA----IGEVVEPKVNITPLVQNQA---HYNVVMKEIE 364

Query: 309 VGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           VGG  + V    + + DR G   TI+DSGTT  +   E++ PL ++ +SQ    R +T  
Sbjct: 365 VGGDPLDVPSDAFESGDRKG---TIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV- 420

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
              +A T    CFD  G     FP + LHF     +T+    Y         +       
Sbjct: 421 --EQAFT----CFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGA 474

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +   G    +LG+  + N  V YDL  Q +G+ +  C
Sbjct: 475 QTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 511


>gi|255552237|ref|XP_002517163.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543798|gb|EEF45326.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 469

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 93/394 (23%), Positives = 171/394 (43%), Gaps = 54/394 (13%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
           TP   +  I+D G+  +W  C   Y                +SSS   + C +  C   +
Sbjct: 88  TPLVPVKLIVDLGARFMWVDCEEGY----------------VSSSYTPVSCDSLLCKLAN 131

Query: 157 HESIQCR-DCNDEPLATSKN--CTQICPSYLVLYGSG--LTEGIALSETLN--LPNRII- 208
             S+ C  +CN  P     N  C     + ++  G+   + + +   ++ N   P+RI+ 
Sbjct: 132 --SLACATECNSTPKPGCHNNTCAHSPENPVIRLGTSGQIGQDVVSLQSFNGKTPDRIVS 189

Query: 209 -PNFLVGCS---VLSSRQPA--GIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDT 257
            PNF   C    +L +      G+AG G    SLP+Q +       KF+ CL      ++
Sbjct: 190 VPNFPFVCGPTFLLENLADGVTGLAGLGNSNISLPAQFSSAFGFPKKFAVCL-----SNS 244

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRRITVGGQR 313
           T+++ LI      +S+     LTYTP ++NP      ++    SV Y++G++ I +GG+ 
Sbjct: 245 TKSNGLIFFGDGPYSNLPND-LTYTPLIHNPVSTAGGSYLGEASVEYFIGVKSIRIGGKD 303

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           V+     L++D +G GGT + +   +T +   +++ +   FV +M  ++ +   +    +
Sbjct: 304 VKFNKTLLSIDSEGKGGTKISTVDPYTVLHTSIYKAVVKAFVKEM--DKKFIPQV-QPPI 360

Query: 374 TGLRPCFDVPGEKTGSF----PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
                CF      +  F    P + L  +G   VT  +    ++V   S V      D  
Sbjct: 361 APFGACFQSIVIDSNEFGPVLPFIDLVLEGQGSVTWRIWGANSMVKISSLVMCLGFVDGG 420

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
                SI++G  Q+++  +++DL + +LGF   L
Sbjct: 421 IEPRTSIVIGGRQIEDNLLQFDLASSKLGFSSSL 454


>gi|383161172|gb|AFG63168.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
 gi|383161174|gb|AFG63170.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
 gi|383161175|gb|AFG63171.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
          Length = 133

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 62/145 (42%), Positives = 85/145 (58%), Gaps = 20/145 (13%)

Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPAGIAGFG 230
           C++ICP + + YG+G   G  LS+TL LP      R I NF  GC+V+SS Q AGIAGFG
Sbjct: 1   CSKICPHFSLTYGTGNATGRLLSDTLTLPLEDGGRREIKNFATGCAVVSS-QVAGIAGFG 59

Query: 231 RGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
            G  S+PSQL     DKF+YCL     D  + +S ++L N +   D     LTYTP + N
Sbjct: 60  NGGLSMPSQLAPLIGDKFAYCL-----DYRSNSSKIVLGNKAVPRDLP---LTYTPLLFN 111

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQ 312
           P     + FS Y+Y+ L  +++GG+
Sbjct: 112 P--VNPSVFS-YFYLALETVSIGGK 133


>gi|388509650|gb|AFK42891.1| unknown [Lotus japonicus]
          Length = 347

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 82/302 (27%), Positives = 148/302 (49%), Gaps = 52/302 (17%)

Query: 192 TEGIALSETLNLPNRIIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLD----- 243
           T+G   ++ +++PN +   F+ G  V+    ++   G+AG GR + SLPSQ +       
Sbjct: 47  TDGTTPTKVVSVPNFL---FICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHR 103

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAF----SV 298
           KF+ CL ++   D      +   +G  + ++  +  LTYTP + NP     +AF    SV
Sbjct: 104 KFAICLTANSGADGV----MFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSV 159

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
            Y++G++ I V  + V +    L+++++G GGT + +   +T M   +++ +AD FV   
Sbjct: 160 EYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFV--- 216

Query: 359 VKNRNYTRALGAEALTGLRP---CF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYF 411
                  ++LGA  ++ + P   CF   D+   + G   P + L  + G E   P+    
Sbjct: 217 -------KSLGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVE--WPIIGAN 267

Query: 412 AVVGEGSAVCLTVV---TDREAS------GGP----SIILGNFQMQNYYVEYDLRNQRLG 458
           ++V     +CL  V   ++ +AS      GG     SI +G  Q++N  +++DL   RLG
Sbjct: 268 SMVQFDDVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLG 327

Query: 459 FK 460
           F+
Sbjct: 328 FR 329


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 85/290 (29%), Positives = 135/290 (46%), Gaps = 42/290 (14%)

Query: 179 ICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKT 234
           IC +Y + YG G  T G    E L     ++ +F+ GC   +       +G+ G GR   
Sbjct: 75  IC-NYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDL 133

Query: 235 SLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
           SL SQ   +    FSYCL S    +   + SLIL  G+S   + ++ ++Y   + NP + 
Sbjct: 134 SLISQTSGIFGGVFSYCLPS---TERKGSGSLIL-GGNSSVYRNSSPISYAKMIENPQLY 189

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
                  +Y++ L  I++GG  ++           G    +VDSGT  T + P +++ L 
Sbjct: 190 N------FYFINLTGISIGGVALQA-------PSVGPSRILVDSGTVITRLPPTIYKALK 236

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
            EF+ Q      +T    A A + L  CF++   +    P +K+HF+G AE+T+ V   F
Sbjct: 237 AEFLKQ------FTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVF 290

Query: 412 AVV-GEGSAVCLTVVT----DREASGGPSIILGNFQMQNYYVEYDLRNQR 456
             V  + S VCL + +    D  A      ILGN+Q +N  V YD +  +
Sbjct: 291 YFVKSDASQVCLALASLEYQDEVA------ILGNYQQKNLRVIYDTKETK 334


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 156/391 (39%), Gaps = 72/391 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS-----SKIPSFIPKLSSSSR 143
           Y ++ S GTP       +DTGS L W       QCK C++      K P F P  SSS  
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWV------QCKPCAAPSCYRQKDPLFDPAQSSSYA 190

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
            + C    C+ +   +  C         ++  C      Y+V YG G  T G+  S+TL 
Sbjct: 191 AVPCGRSACAGLGIYASAC---------SAAQC-----GYVVSYGDGSNTTGVYSSDTLT 236

Query: 203 L-PNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
           L  N  +  FL GC    S        G+ GFGR + SL  Q        FSYCL +   
Sbjct: 237 LAANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTK-- 294

Query: 255 DDTTRTSSLILDNGSSHSDK-KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
             ++ T  L L   S  +    TT L  +P  N P+         YY V L  I+VGGQ 
Sbjct: 295 --SSTTGYLTLGGPSGVAPGFSTTQLLPSP--NAPT---------YYVVMLTGISVGGQP 341

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + V             GT+VD+GT  T + P  +  L   F S M    +      A  +
Sbjct: 342 LSVPASAFA------AGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPS------APPI 389

Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
             L  C+   G  T +   + L F  GA +TL  +         S  CL   +    S G
Sbjct: 390 GILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIM------SFGCLAFAS--SGSDG 441

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              ILGN Q +++ V  D     +GF+   C
Sbjct: 442 SMAILGNVQQRSFEVRID--GSSVGFRPSSC 470


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 97/386 (25%), Positives = 156/386 (40%), Gaps = 42/386 (10%)

Query: 98  PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
           PPQ I  ++DTGS L W  C      +  + + + +F P  SSS   + C +P C     
Sbjct: 82  PPQNISMVIDTGSELSWLRCN-----RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTR 136

Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI-IPNFLVGC- 215
           + +    C+ +         ++C + L    +  +EG   +E  +  N     N + GC 
Sbjct: 137 DFLIPASCDSD---------KLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCM 187

Query: 216 SVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
             +S   P       G+ G  RG  S  SQ+   KFSYC+       T      +L   S
Sbjct: 188 GSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-----SGTDDFPGFLLLGDS 242

Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
           + +    T L YTP +   S        V Y V L  I V G+ + +    L  D  G G
Sbjct: 243 NFT--WLTPLNYTPLIRI-STPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAG 299

Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS 389
            T+VDSGT FTF+   ++  L   F+++                  +  C+ +   +  S
Sbjct: 300 QTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRS 359

Query: 390 -----FPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVVTDREASGGPSIILG 439
                 P + L F+G AE+ +  +          VG  S  C T   + +  G  + ++G
Sbjct: 360 GILHRLPTVSLVFEG-AEIAVSGQPLLYRVPHLTVGNDSVYCFTF-GNSDLMGMEAYVIG 417

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
           +   QN ++E+DL+  R+G     C 
Sbjct: 418 HHHQQNMWIEFDLQRSRIGLAPVECD 443


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 119/446 (26%), Positives = 177/446 (39%), Gaps = 73/446 (16%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           NPS+  YQ L      S+ R  H +  +       +   +        G Y +++S GTP
Sbjct: 50  NPSETKYQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGG------GAYLMNISLGTP 103

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
           P  +  I DTGS L+W  C     C  C     P F PK S + + L C N  C  +  +
Sbjct: 104 PVPMLGIADTGSDLIWRQC---LPCPNCYEQVEPLFDPKESETYKTLDCDNEFCQDLGQQ 160

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN-----RIIPNFL 212
                 C+D+       CT     Y   YG    T G   S+TL + +        P   
Sbjct: 161 G----SCDDD-----NTCT-----YSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIA 206

Query: 213 VGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
            GC   +    + +  G+ G G G  SL  QL+ +   +FSYCL+    D T   SS I 
Sbjct: 207 FGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDST--VSSKI- 263

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
            N         +G   TP +       +     +YY+ L  ++VG + V    K  + ++
Sbjct: 264 -NFGKSGVVSGSGTVSTPLI-------KGTPDTFYYLTLEGLSVGSETVAF--KGFSENK 313

Query: 326 DG-----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
                   G  I+DSGTT T +  + +  +              T A+G +  T     F
Sbjct: 314 SSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESAL----------TNAIGGQTTTDPNGIF 363

Query: 381 DVPGEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
            +      +   P +  HF  GA+V LP  N F  V E   VC +++     +     I 
Sbjct: 364 SLCYSSVNNLEIPTITAHFT-GADVQLPPLNTFVQVQE-DLVCFSMIPSSNLA-----IF 416

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN    N+ V YDL+N ++ FKQ  C
Sbjct: 417 GNLAQINFLVGYDLKNNKVSFKQTDC 442


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 123/466 (26%), Positives = 181/466 (38%), Gaps = 91/466 (19%)

Query: 31  FSLSRFHT--------NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNIS 82
           FSL+  H         NP+   +  L +  S S++R    K      T      +  N  
Sbjct: 34  FSLNLIHRDSPLSPLYNPNHTDFDRLRNAFSRSISRVNVFK------TKAVDINSFQNDL 87

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
             + G Y + +S GTP   +  I DTGS L W  C     C  C   K P F P  SSS 
Sbjct: 88  VPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQC---LPCDPCYRQKSPLFDPSRSSSY 144

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY---LVLYGSGLTEGIALSE 199
           R + C +  C+ +        D +++      N  +   SY       G+  TE   +  
Sbjct: 145 RHMLCGSRFCNAL--------DVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGS 196

Query: 200 TLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKT----------------SLPSQLN-- 241
           T + P  + P  + GC            G G G T                SL SQL+  
Sbjct: 197 TSSRPVHLSP-IVFGC------------GTGNGGTFDELGSGIVGLGGGALSLVSQLSSI 243

Query: 242 -LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
              KFSYCL+    + +  TS +     S  S  +         V+ P V+++     YY
Sbjct: 244 IKGKFSYCLVPLS-EQSNVTSKIKFGTDSVISGPQV--------VSTPLVSKQP--DTYY 292

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
           YV L  I+VG +R+   +  L  + +  G  I+DSGTT TF+  E F  L          
Sbjct: 293 YVTLEAISVGNKRLPYTNGLLNGNVE-KGNVIIDSGTTLTFLDSEFFTEL---------- 341

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
            R     + AE ++  R  F V     G    P + +HF   A+V L   N F V  +  
Sbjct: 342 ERVLEETVKAERVSDPRGLFSVCFRSAGDIDLPVIAVHFN-DADVKLQPLNTF-VKADED 399

Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +C T+++  +       I GN    ++ V YDL  + + FK   C
Sbjct: 400 LLCFTMISSNQIG-----IFGNLAQMDFLVGYDLEKRTVSFKPTDC 440


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 109/405 (26%), Positives = 163/405 (40%), Gaps = 87/405 (21%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ    I+DTGS + + PC++   C+ C   + P F P LSS+ + + 
Sbjct: 11  GYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSS---CEQCGRHQDPKFQPDLSSTYQSVK 67

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL----N 202
           C           +I C +C+DE     + C      Y   Y    T    L E +    N
Sbjct: 68  C-----------NIDC-NCDDE----KQQCV-----YERQYAEMSTSSGVLGEDIISFGN 106

Query: 203 LPNRIIPNFLVGCSVLS-----SRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSH 252
           L        + GC  +      S+   GI G GRG  S+   L       D FS C    
Sbjct: 107 LSALAPQRAVFGCENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGM 166

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
                    +++L   S  S+           V + S   R   S YY + L+ I V G 
Sbjct: 167 ----GIGGGAMVLGGISPPSN----------MVFSQSDPVR---SPYYNIDLKEIHVAG- 208

Query: 313 RVRVWHKYLTLDR---DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
                 K L L+    DG  GTI+DSGTT+ ++    F    D  + ++           
Sbjct: 209 ------KPLPLNPTVFDGKHGTILDSGTTYAYLPEAAFVSFKDAIMKEL---------HS 253

Query: 370 AEALTGLRP-----CFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSA 419
            + + G  P     CF   G    + + SFP +++ F  G ++ L  ENY F       A
Sbjct: 254 LKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGNGQKLLLSPENYLFRHSKVHGA 313

Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            CL +  + +    P+ +LG   ++N  V YD  N ++GF +  C
Sbjct: 314 YCLGIFQNGK---DPTTLLGGIVVRNTLVLYDRENSKIGFWKTNC 355


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 115/416 (27%), Positives = 179/416 (43%), Gaps = 55/416 (13%)

Query: 60  LHIKN-PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT 118
           LH K+ P ++     T +  T I + +   +  ++S G PP     ++DTGS L W  C 
Sbjct: 50  LHSKSTPASRLDNLWTVSHVTPIPNPA--AFLANISIGNPPVPQLLLIDTGSDLTWIHC- 106

Query: 119 NHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ 178
               CK C    IP F P  SS+ R     N  C    H   Q     DE        T 
Sbjct: 107 --LPCK-CYPQTIPFFHPSRSSTYR-----NASCVSAPHAMPQI--FRDEK-------TG 149

Query: 179 ICPSYLVLYGSGLTEGIALSETLNLP---NRII--PNFLVGCSVLSS--RQPAGIAGFGR 231
            C  +L       T GI   E L      + +I   N + GC   +S   + +G+ G G 
Sbjct: 150 NCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNSGFTKYSGVLGLGP 209

Query: 232 GKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
           G  S+ ++    KFSYC  S   + T   + LIL NG+     K  G        +P+  
Sbjct: 210 GTFSIVTRNFGSKFSYCFGSLT-NPTYPHNILILGNGA-----KIEG--------DPTPL 255

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
           +   F   YY+ L+ I+ G + + +        R   GGT++D+G + T +A E +E L+
Sbjct: 256 Q--IFQDRYYLDLQAISFGEKLLDIEPGTFQRYR-SQGGTVIDTGCSPTILAREAYETLS 312

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVEN 409
           +E     +      R    +  T   PC++  +  +  G FP +  HF GGAE+ L VE+
Sbjct: 313 EEI--DFLLGEVLRRVKDWDQYT--TPCYEGNLKLDLYG-FPVVTFHFAGGAELALDVES 367

Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            F     G + CL +  +   +     ++G    QNY V Y+LR  ++ F++  C+
Sbjct: 368 LFVSSESGDSFCLAMTMN---TFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCE 420


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 122/424 (28%), Positives = 176/424 (41%), Gaps = 69/424 (16%)

Query: 58  RALHIK-------NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGS 110
           RA +IK       + +     T  TT  T++S+  Y    I++  G+P       +DTGS
Sbjct: 87  RAAYIKRKFSGAGDIEQSDAATVPTTLGTSLSTLEY---VITVGIGSPAVTQTMSMDTGS 143

Query: 111 HLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPL 170
            + W  C     C  C S     F P  SS+     C +  C+ +  +S +   C     
Sbjct: 144 DVSWVQCK---PCSQCHSEVDSLFDPSSSSTYSPFSCSSAPCAQL-SQSQEGNGC----- 194

Query: 171 ATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRIIPNFLVGCSVLSS----RQPAG 225
             S  C      Y+V YG S  T G   S+TL L +  + +F  GCS   S     Q  G
Sbjct: 195 -MSSQC-----QYIVNYGDSSSTTGTYSSDTLTLGSSAMTDFQFGCSQSESGGFNDQTDG 248

Query: 226 IAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSS-LILDNGSSHSDKKTTGLTY 281
           + G G G  SL SQ        FSYCL       T+ +S  L L  GSS       G   
Sbjct: 249 LMGLGGGAQSLASQTAGTFGTAFSYCL-----PPTSGSSGFLTLGTGSS-------GFVK 296

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
           TP + +  +        YY V L  I VG Q++ +           + G+++DSGT  T 
Sbjct: 297 TPMLRSTQIP------TYYVVLLESIKVGSQQLNLPTSVF------SAGSLMDSGTIITR 344

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGA 401
           + P  +  L+  F + M   + Y  A  +     L  CFD  G+ + S P + L F GGA
Sbjct: 345 LPPTAYSALSSAFKAGM---QQYPPATPSGI---LDTCFDFSGQSSISIPTVTLVFSGGA 398

Query: 402 EVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFK 460
            V L  +     +   S  CL    + + S   S+ I+GN Q + + V YD+    +GFK
Sbjct: 399 AVDLAFDGIMLEI-SSSIRCLAFTPNGDDS---SLGIIGNVQQRTFEVLYDVGGGAVGFK 454

Query: 461 QQLC 464
              C
Sbjct: 455 AGAC 458


>gi|357440775|ref|XP_003590665.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
 gi|355479713|gb|AES60916.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
          Length = 435

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 103/406 (25%), Positives = 174/406 (42%), Gaps = 64/406 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y   ++  TP   +  I+D G   +W  C N Y                +SS+ R   C+
Sbjct: 47  YKAQINQRTPLVPLNVIVDLGGQFLWVDCENKY----------------ISSTYRPARCR 90

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCT-QICPSYLVLYGSGLTEGIALSETLNLP--- 204
           + +CS  + +   C DC   P     N T  + P   + + +  T G    + L++    
Sbjct: 91  SAQCSLANSDG--CGDCFSSPKPGCNNNTCGVTPDNSITHTA--TSGELAEDVLSIQSSN 146

Query: 205 ------NRIIPNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLN-----LDKFSYC 248
                 N ++  FL  C+   +L   +   +G+AG GR K +LPSQL        KF+ C
Sbjct: 147 GFNPGQNVVVSRFLFSCAPTFLLKGLATGASGMAGLGRTKIALPSQLASAFSFARKFAIC 206

Query: 249 LLSHK----FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS-----VY 299
           L S K    F D        L N    SD     LTYTP + NP V+  +AFS       
Sbjct: 207 LSSSKGVVLFGDGPYG---FLPNVVFDSDS----LTYTPLLINP-VSTASAFSQGQPSAE 258

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           Y++G++ I +  + V +    L++D +G GGT + +   +T +   +++ + D FV    
Sbjct: 259 YFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKASA 318

Query: 360 KNRNYTRALGAEALTGLRPCF-DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEG 417
             RN  R     ++     C+ ++ G + G + P ++L  +    V         V    
Sbjct: 319 A-RNIKR---VGSVAPFEFCYTNLTGTRLGAAVPTIELFLQNENVVWRIFGANSMVSIND 374

Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
             +CL  V   + +   SI++G +Q++N  +++DL   +LGF   L
Sbjct: 375 EVLCLGFVNGGKNT-RTSIVIGGYQLENNLLQFDLAASKLGFSSLL 419


>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 104/385 (27%), Positives = 158/385 (41%), Gaps = 50/385 (12%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTP Q++  +LDT +   + P +    C  CS++   +F P  S+S   L 
Sbjct: 96  GNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSG---CIGCSAT---TFSPNASTSYVPLE 149

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C  P+CS +    + C      P   S  C     S+   Y         + ++L L   
Sbjct: 150 CSVPQCSQVR--GLSC------PATGSGAC-----SFNKSYAGSTYSATLVQDSLRLATD 196

Query: 207 IIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
           +IP++  G       S + ++   G+        S    L    FSYCL S  F     +
Sbjct: 197 VIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPS--FKSYYFS 254

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            SL L         +TT L   P    PS+         Y+V L  ITVG   V    + 
Sbjct: 255 GSLKLGPVGQPKSIRTTPLLRNP--RRPSL---------YFVNLTGITVGKVNVPFPKEL 303

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           L  D +   GTI+DSGT  T     ++  + DEF  Q+    +   +LGA        CF
Sbjct: 304 LAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFS---SLGA-----FDTCF 355

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILG 439
               E     P + LHF    ++ LP+EN       GS  CL +  T +  +     ++ 
Sbjct: 356 VKNYETLA--PAITLHFT-DLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIA 412

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N+Q QN  V +D  N ++G  ++LC
Sbjct: 413 NYQQQNLRVLFDTVNNKVGIARELC 437


>gi|357440781|ref|XP_003590668.1| Basic 7S globulin [Medicago truncatula]
 gi|355479716|gb|AES60919.1| Basic 7S globulin [Medicago truncatula]
          Length = 434

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 106/429 (24%), Positives = 182/429 (42%), Gaps = 70/429 (16%)

Query: 65  PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
           P+      T    TTN        Y   ++  TP   +  I+D G   +W  C N Y   
Sbjct: 30  PKALVLPVTKDVATTN-------QYKAQINQRTPLVPLNIIVDLGGLFLWVDCENQY--- 79

Query: 125 YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT-QICPSY 183
                        +SS+ R   C++ +CS    +   C  C   P     N T  + P  
Sbjct: 80  -------------ISSTYRPARCRSAQCSLAKFD--DCGVCFSSPKPGCNNNTCSVAPGN 124

Query: 184 LVLYGS---GLTEGIALSETLNL----PNRIIPNFLVGCS---VLS--SRQPAGIAGFGR 231
            V   +    L E I   ++ N      N ++  FL  C+   +L   +   +G+AG GR
Sbjct: 125 SVTQSAMSGELAEDILSIQSSNGFNPGQNVMVSRFLFSCARTFLLEGLASGASGMAGLGR 184

Query: 232 GKTSLPSQLN-----LDKFSYCLLSHK----FDDTTR--TSSLILDNGSSHSDKKTTGLT 280
            K +LPSQL        KF+ CL S K    F D       +++ D+ S         LT
Sbjct: 185 NKLALPSQLASAFSFAKKFAICLSSSKGVVLFGDGPYGFLPNVVFDSKS---------LT 235

Query: 281 YTPFVNNP---SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR-DGNGGTIVDSG 336
           YTP + NP   +   ++  S  Y++G++ I + G+ V +    L++D  +G GGT + + 
Sbjct: 236 YTPLLINPFSTAAFAKSEPSAEYFIGVKTIKIDGKVVSLDTSLLSIDSSNGAGGTKISTV 295

Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF-DVPGEKTGS-FPELK 394
             +T +   +++ + D FV      RN  R    +++     C+ +V G + G+  P ++
Sbjct: 296 DPYTVLEASIYKAVTDAFVKASAA-RNIKRV---DSVAPFEFCYTNVTGTRLGADVPTIE 351

Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
           L+ +      +   N    + +   +CL  V   E +   SI++G +Q++N  +++DL  
Sbjct: 352 LYLQNNVIWRIFGANSMVNIND-EVLCLGFVIGGENTWA-SIVIGGYQLENNLLQFDLAA 409

Query: 455 QRLGFKQQL 463
            +LGF   L
Sbjct: 410 SKLGFSSLL 418


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 117/403 (29%), Positives = 168/403 (41%), Gaps = 53/403 (13%)

Query: 71  TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSS 129
           T  TT    + S    G Y +++  GTP +    I DTGS L W  C     C K C + 
Sbjct: 135 TAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCE---PCVKSCYNQ 191

Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG- 188
           K   F P  S+S   + C +  C  +   +    +C       S  C      Y + YG 
Sbjct: 192 KEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNC------ASSTCV-----YGIQYGD 240

Query: 189 SGLTEGIALSETLNL-PNRIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQL--NL 242
           S  + G    E L+L    +  +F  GC   +       AG+ G GR K SL SQ     
Sbjct: 241 SSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRY 300

Query: 243 DK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
           +K FSYCL S     ++ T  L     +S S       ++TP      +A  +  S +Y 
Sbjct: 301 NKIFSYCLPSS----SSSTGFLTFGGSTSKS------ASFTP------LATISGGSSFYG 344

Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
           + L  I+VGG+++ +     +       GTI+DSGT  T + P  +  L+  F   M   
Sbjct: 345 LDLTGISVGGRKLAISPSVFS-----TAGTIIDSGTVITRLPPAAYSALSSTFRKLM--- 396

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
              ++   A AL+ L  CFD     T S P++ L F GG  V +     F  V + + VC
Sbjct: 397 ---SQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVDIDKTGIF-YVNDLTQVC 452

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           L    + +AS     I GN Q +   V YD    R+GF    C
Sbjct: 453 LAFAGNSDAS--DVAIFGNVQQKTLEVVYDGAAGRVGFAPAGC 493


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 109/400 (27%), Positives = 166/400 (41%), Gaps = 63/400 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  GTPP+     +DTGS ++W  CT+   C   S  +I    F P +SSS+ L
Sbjct: 82  GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--QICPSYLVLYGSGL-TEGIALSETL 201
           + C + +C    + + Q          T   C+   +C SY   YG G  T G  +S+ +
Sbjct: 142 VSCSDRRC----YSNFQ----------TESGCSPNNLC-SYSFKYGDGSGTSGYYISDFM 186

Query: 202 NLPNRIIPN--------FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFS 246
           +    I           F+ GCS L S       R   GI G G+G  S+ SQL +   +
Sbjct: 187 SFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLA 246

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
             + SH          +++       D       YTP V  PS         +Y V L+ 
Sbjct: 247 PRVFSHCLKGDKSGGGIMVLGQIKRPDT-----VYTPLV--PS-------QPHYNVNLQS 292

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN--RNY 364
           I V GQ + +     T+      GTI+D+GTT  ++  E + P       Q V N    Y
Sbjct: 293 IAVNGQILPIDPSVFTIAT--GDGTIIDTGTTLAYLPDEAYSPFI-----QAVANAVSQY 345

Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
            R +  E+      CF++       FP++ L F GGA + L    Y  +    S   +  
Sbjct: 346 GRPITYESYQ----CFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIF-SSSGSSIWC 400

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +  +  S     ILG+  +++  V YDL  QR+G+ +  C
Sbjct: 401 IGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440


>gi|388516731|gb|AFK46427.1| unknown [Medicago truncatula]
          Length = 435

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 103/406 (25%), Positives = 175/406 (43%), Gaps = 64/406 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y   ++  TP   +  I+D G   +W  C N Y                +SS+ R   C+
Sbjct: 47  YKAQINQRTPLVPLNVIVDLGGQFLWVDCENKY----------------ISSTYRPARCR 90

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCT-QICPSYLVLYGSGLTEGIALSETLNLP--- 204
           + +CS  + +   C DC   P     N T  + P   + + +  T G    + L++    
Sbjct: 91  SAQCSLANSDG--CGDCFSSPKPGCNNNTCGVTPDNSITHTA--TSGELAEDVLSIQSSN 146

Query: 205 ------NRIIPNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLN-----LDKFSYC 248
                 N ++  FL  C+   +L   +   +G+AG GR K +LPSQL        KF+ C
Sbjct: 147 GFNPGQNVVVSRFLFSCAPTFLLKGLATGASGMAGLGRTKIALPSQLASAFSFARKFAIC 206

Query: 249 LLSHK----FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS-----VY 299
           L S K    F D        L N    SD     LTYTP + NP V+  +AFS       
Sbjct: 207 LSSSKGVVLFGDGPYG---FLPNVVFDSDS----LTYTPLLINP-VSTASAFSQGQPSAE 258

Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
           Y++G++ I +  + V +    L++D +G GGT + +   +T +   +++ + D FV +  
Sbjct: 259 YFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFV-KAP 317

Query: 360 KNRNYTRALGAEALTGLRPCF-DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEG 417
             RN  R     ++     C+ ++ G + G + P ++L  +    V         V    
Sbjct: 318 AARNIKR---VGSVAPFEFCYTNLTGTRLGAAVPTIELFLQNENVVWRIFGANSMVSIND 374

Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
             +CL  V   + +   SI++G +Q++N  +++DL   +LGF   L
Sbjct: 375 EVLCLGFVNGGKNT-RTSIVIGGYQLENNLLQFDLAASKLGFSSLL 419


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 112/455 (24%), Positives = 174/455 (38%), Gaps = 91/455 (20%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           NPS+  YQ L      S+ R  H +         +     +N+ S   G Y +++S GTP
Sbjct: 50  NPSETKYQRLQKAFRRSILRGNHFR-----AIRASPNDIQSNVISGG-GSYLMNISLGTP 103

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
           P  +  I DTGS L+W  C     C  C     P F PK S + + LGC N  C  +  +
Sbjct: 104 PVSMLGIADTGSDLIWRQC---LPCDDCYKQVEPLFDPKKSKTYKTLGCNNDFCQDLGQQ 160

Query: 159 SIQCRDCNDEPLATSK--------------------NCTQICPSYL--VLYGSGLTEGIA 196
                 C D+   TS                       T+  P+    + +G G + G  
Sbjct: 161 G----SCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCGHSNGGT 216

Query: 197 LSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
            +E  +    +    L     LSS+    + G               +FSYCL+    D 
Sbjct: 217 FNEKDSGLIGLGGGPLSLVMQLSSK----VGG---------------QFSYCLVPLSSDS 257

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           T   SS I  N    +    +G   TP +       +     +YY+ L  +++G ++V  
Sbjct: 258 T--ASSKI--NFGKSAVVSGSGTVSTPLI-------KGTPDTFYYLTLEGMSLGSEKVAF 306

Query: 317 WHKYLTLDRDGNGGT-----IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
             K  + ++           I+DSGTT T +  + +  +              T+ +G +
Sbjct: 307 --KGFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESAL----------TKVIGGQ 354

Query: 372 ALTGLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
             T  R  F +   G K    P +  HF  GA+V LP  N F V  +   VC +++    
Sbjct: 355 TTTDPRGTFSLCYSGVKKLEIPTITAHFI-GADVQLPPLNTF-VQAQEDLVCFSMIPSSN 412

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +     I GN    N+ V YDL+N ++ FK   C
Sbjct: 413 LA-----IFGNLSQMNFLVGYDLKNNKVSFKPTDC 442


>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
          Length = 746

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 111/407 (27%), Positives = 167/407 (41%), Gaps = 75/407 (18%)

Query: 86  YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-----SFIPKLSS 140
           YG +  +L  GTP +    I+DTGS + + PC++      C S   P     +F P+ SS
Sbjct: 75  YGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSS------CGSGCGPNHQDAAFDPEASS 128

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
           ++  + C +PKCS     S +C  C+ +    +++  +   S  +L    L + +AL + 
Sbjct: 129 TASRISCTSPKCSC---GSPRC-GCSTQQCTYTRSYAEQSSSSGIL----LEDVLALHD- 179

Query: 201 LNLPNRIIPNFLVGCSVLSS----RQPA-GIAGFGRGKTSLPSQLNL-----DKFSYCLL 250
             LP   I   + GC    +    RQ A G+ G G    S+ +QL       D FS C  
Sbjct: 180 -GLPGAPI---IFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLC-- 233

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
              F       +L+L +        +  L YTP + +           YY V +  + V 
Sbjct: 234 ---FGMVEGDGALLLGDAEV---PGSISLQYTPLLTS------TTHPFYYNVKMLSLAVE 281

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
           GQ + V         D   GT++DSGTTFT+M   +F+  A            Y  + G 
Sbjct: 282 GQLLPVSQSLF----DQGYGTVLDSGTTFTYMPSPVFKAFAGAV-------EKYALSHGL 330

Query: 371 EALTGLRPCFD------VPGEK-----TGSFPELKLHFKGGAEVTLPVENY-FAVVGEGS 418
           + + G  P FD       P        +  FP +++ F  G  + L   NY F       
Sbjct: 331 KRVPGPDPQFDDICFGQAPSHDDLEALSSVFPSMEVQFDQGTSLVLGPLNYLFVHTFNSG 390

Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
             CL V  +  A      +LG    +N  V YD  NQR+GF   LCK
Sbjct: 391 KYCLGVFDNGRAG----TLLGGITFRNVLVRYDRANQRVGFGPALCK 433


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 157/386 (40%), Gaps = 66/386 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++  G+P      ++DTGS + W  C     C  C S     F P  SS+     C 
Sbjct: 127 YLITVGMGSPAVAQTMLIDTGSDVSWVQCK---PCSQCHSQADSLFDPSSSSTYSAFSCT 183

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETLNLPNRI 207
           +  C+ +         C                 Y V YG G T  G   S+TL L +  
Sbjct: 184 SAACAQLRQRGCSSSQCQ----------------YTVKYGDGSTGSGTYSSDTLALGSST 227

Query: 208 IPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTR 259
           + NF  GCS      L   Q AG+ G G G  SL +Q        FSYCL        T 
Sbjct: 228 VENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCL------PPTP 281

Query: 260 TSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            SS  L  G+S     T+G +  TP + +  V        YY V L+ I VGG+++ +  
Sbjct: 282 GSSGFLTLGAS-----TSGFVVKTPMLRSTQVPS------YYGVLLQAIRVGGRQLNIPA 330

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
                    + G+I+DSGT  T +    +  L+  F + M   + Y     A+ +     
Sbjct: 331 SAF------SAGSIMDSGTIITRLPRTAYSALSSAFKAGM---KQYPP---AQPMGIFDT 378

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CFD  G+ + S P + L F GGA V L  +        GS +     +D  + G    I+
Sbjct: 379 CFDFSGQSSVSIPTVALVFSGGAVVDLASDGIIL----GSCLAFAANSDDTSLG----II 430

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q + + V YD+    +GFK   C
Sbjct: 431 GNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
 gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
          Length = 407

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 100/404 (24%), Positives = 156/404 (38%), Gaps = 68/404 (16%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC-TNHYQCKYCSSSKIPSFIPKLSSSS 142
           H  G + ++++ G P +     +DTGS+L W  C      CK C+    P + PK     
Sbjct: 35  HPTGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRPK----- 89

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETL 201
           +L+ C +P C  +H +    +DC +EP      C      Y + Y  G T  G+ L +  
Sbjct: 90  KLVPCADPLCDALHKDLGTTKDCREEP----DQC-----HYQINYADGTTSLGVLLLDKF 140

Query: 202 NLPNRIIPNFLVGCSVLSSRQPA----------GIAGFGRGKTSLPSQLNLDKFSYCLLS 251
           +LP     N   GC     + P           GI G GRG   L SQL           
Sbjct: 141 SLPTGSARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQL----------- 189

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
                  + S  +  N   H    + G  Y  F+   +V   +   +Y Y     I+   
Sbjct: 190 -------KHSGAVSKNVIGHC-LSSKGGGYL-FIGEENVPSSHLHIIYIYC----ISREP 236

Query: 312 QRVRVWHKYLTLDRDGNG----GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
                    L L R+  G      I DSG+T+T++   L   L     + ++K+   +  
Sbjct: 237 NHYSPGQATLHLGRNPIGTKPFKAIFDSGSTYTYLPENLHAQLVSALKASLIKS---SLK 293

Query: 368 LGAEALTGLRPCFDVPG--EKTGSFPE-----LKLHFKGGAEVTLPVENYFAVVGEGSAV 420
           L ++  T L  C+  P   +     P+     + L F  G  +T+P ENY  + G G+A 
Sbjct: 294 LVSDTDTRLHLCWKGPKPFKTVHDLPKEFKSLVTLKFDHGVTMTIPPENYLIITGHGNA- 352

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           C  ++   E  G    ++G   MQ   V +D    RL +    C
Sbjct: 353 CFGIL---ELPGYDLFVIGGISMQEQLVIHDNEKGRLAWMPSPC 393


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 147/374 (39%), Gaps = 61/374 (16%)

Query: 103 PFILDTGSHLVWFPCTNHYQCKY--CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESI 160
           P  +DT   L W  C     C    C   +   F P+ S +S  + C +  C  +     
Sbjct: 147 PMSIDTSIDLPWIQCA---PCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGA 203

Query: 161 QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVL 218
            C          S N  Q    Y V YG G  T G  + + L L P+ ++ NF  GCS  
Sbjct: 204 GC----------SNNQCQ----YFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHA 249

Query: 219 S----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
                S   +G    G G+ SL SQ      + FSYC+          +SS  L  G   
Sbjct: 250 VRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCV-------PDPSSSGFLSLGGPA 302

Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
                     TP V NPS+         Y V LR I VGG+R+ V            GG 
Sbjct: 303 DGGGAGRFARTPLVRNPSI-----IPTLYLVRLRGIEVGGRRLNVPPVVFA------GGA 351

Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
           ++DS    T + P  +  L   F S M     Y R  G  A  GL  C+D     + + P
Sbjct: 352 VMDSSVIITQLPPTAYRALRLAFRSAMAA---YPRVAGGRA--GLDTCYDFVRFTSVTVP 406

Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEY 450
            + L F GGA V L   +   V+ EG   CL  V T  + + G    +GN Q Q + V Y
Sbjct: 407 AVSLVFDGGAVVRL---DAMGVMVEG---CLAFVPTPGDFALG---FIGNVQQQTHEVLY 457

Query: 451 DLRNQRLGFKQQLC 464
           D+    +GF++  C
Sbjct: 458 DVGGGSVGFRRGAC 471


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 100/392 (25%), Positives = 169/392 (43%), Gaps = 61/392 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ+   I+DTGS + + PC+    C+ C   + P F P LSS+ + + 
Sbjct: 79  GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST---CEQCGRHQDPKFQPDLSSTYQPVK 135

Query: 147 CQ-NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
           C  +  C    ++ +QC    +   A     + +    +V +G+         ++   P 
Sbjct: 136 CTLDCNCD---NDRMQC--VYERQYAEMSTSSGVLGEDVVSFGN---------QSELAPQ 181

Query: 206 RIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQL---NLDKFSYCLLSHKFDDT 257
           R +     GC       L S+   GI G GRG  S+  QL   N+   S+ L     D  
Sbjct: 182 RAV----FGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD-- 235

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
               +++L   S  SD          F  +  V      S YY + L+ I V G+R+ + 
Sbjct: 236 VGGGAMVLGGISPPSDMV--------FAQSDPVR-----SPYYNIDLKEIHVAGKRLPLN 282

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
                   DG  G+++DSGTT+ ++  E F    +  V ++   +++++  G +      
Sbjct: 283 PSVF----DGKHGSVLDSGTTYAYLPEEAFLAFKEAIVKEL---QSFSQISGPDPNYN-D 334

Query: 378 PCFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASG 432
            CF   G    + + +FP + + F  G + +L  ENY F       A CL +  + +   
Sbjct: 335 LCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGK--- 391

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            P+ +LG   ++N  V YD    ++GF +  C
Sbjct: 392 DPTTLLGGIVVRNTLVLYDREQTKIGFWKTNC 423


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 108/402 (26%), Positives = 168/402 (41%), Gaps = 66/402 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+PP+     +DTGS ++W  C +   C   S   I    F    SS++  
Sbjct: 64  GLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQ 123

Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
           + C +P C S +   + QC    D+       C     SY   YG G  T G  +S+TL 
Sbjct: 124 VRCSDPICTSAVQTTATQCSSQTDQ-------C-----SYTFQYGDGSGTSGYYVSDTLY 171

Query: 203 ----LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSY 247
               L   +I N     + GCS   S       +   GI GFG+G+ S+ SQL+    + 
Sbjct: 172 FDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITP 231

Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
            + SH    D +    L+L       +    G+ Y+P V  PS         +Y + L  
Sbjct: 232 RVFSHCLKGDGSGGGILVL------GEILEPGIVYSPLV--PS-------QPHYNLNLLS 276

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I V GQ + +           + GTIVDSGTT  ++  E ++P    FVS +        
Sbjct: 277 IAVNGQLLPI--DPAAFATSNSQGTIVDSGTTLAYLVAEAYDP----FVSAV-------N 323

Query: 367 ALGAEALTGL----RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
           A+ + ++T +      C+ V    +  FP    +F GGA + L  E+Y    G      +
Sbjct: 324 AIVSPSVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAM 383

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +  ++  G    ILG+  +++    YDL  QR+G+    C
Sbjct: 384 WCIGFQKVQG--VTILGDLVLKDKIFVYDLVRQRIGWANYDC 423


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 147/374 (39%), Gaps = 61/374 (16%)

Query: 103 PFILDTGSHLVWFPCTNHYQCKY--CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESI 160
           P  +DT   L W  C     C    C   +   F P+ S +S  + C +  C  +     
Sbjct: 163 PMSIDTSIDLPWIQCA---PCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGA 219

Query: 161 QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVL 218
            C          S N  Q    Y V YG G  T G  + + L L P+ ++ NF  GCS  
Sbjct: 220 GC----------SNNQCQ----YFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHA 265

Query: 219 S----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
                S   +G    G G+ SL SQ      + FSYC+          +SS  L  G   
Sbjct: 266 VRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCV-------PDPSSSGFLSLGGPA 318

Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
                     TP V NPS+         Y V LR I VGG+R+ V            GG 
Sbjct: 319 DGGGAGRFARTPLVRNPSI-----IPTLYLVRLRGIEVGGRRLNVPPVVFA------GGA 367

Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
           ++DS    T + P  +  L   F S M     Y R  G  A  GL  C+D     + + P
Sbjct: 368 VMDSSVIITQLPPTAYRALRLAFRSAMAA---YPRVAGGRA--GLDTCYDFVRFTSVTVP 422

Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEY 450
            + L F GGA V L   +   V+ EG   CL  V T  + + G    +GN Q Q + V Y
Sbjct: 423 AVSLVFDGGAVVRL---DAMGVMVEG---CLAFVPTPGDFALG---FIGNVQQQTHEVLY 473

Query: 451 DLRNQRLGFKQQLC 464
           D+    +GF++  C
Sbjct: 474 DVGGGSVGFRRGAC 487


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 107/396 (27%), Positives = 157/396 (39%), Gaps = 62/396 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
           Y  + + GTPPQ +  I+D    LVW       QC  C SS     ++P F P  S++ R
Sbjct: 62  YVANFTIGTPPQAVSGIVDLSGELVW------TQCAACRSSGCFKQELPVFDPSASNTYR 115

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
              C +P C     +SI  R+C+ +       C    PS       G T GIA ++ + +
Sbjct: 116 AEQCGSPLC-----KSIPTRNCSGD-----GECGYEAPSMF-----GDTFGIASTDAIAI 160

Query: 204 PNRIIPNFLVGCSVLSSRQ-------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
            N        GC V S          P+G  G GR   SL  Q N+  FSYCL  H    
Sbjct: 161 GNAE-GRLAFGCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTAFSYCLALHG--- 216

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
             + S+L L   +  +    +         + S    +    YY V L  I  G   V  
Sbjct: 217 PGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAA 276

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA--DEFVSQMVKNRNYTRALGAEALT 374
                       GG I       T +  E F PL+   +   Q ++ +  T ALG+ ++ 
Sbjct: 277 ASS--------GGGAI-------TVLQLETFRPLSYLPDAAYQALE-KVVTAALGSPSMA 320

Query: 375 GLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA-VCLTVVTD---R 428
                FD+          P+L   F+GGA +T     Y    G G+  VCL++++     
Sbjct: 321 NPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLD 380

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            A  G S ILG+   +N +  +DL  + L F+   C
Sbjct: 381 SADDGVS-ILGSLLQENVHFLFDLEKETLSFEPADC 415


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 108/392 (27%), Positives = 165/392 (42%), Gaps = 59/392 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + LS GTPP  I  I DTGS L W  C     C  C   + P F P+ S++ R + 
Sbjct: 70  GHYLMELSIGTPPFKIYGIADTGSDLTWTSCV---PCNNCYKQRNPMFDPQKSTTYRNIS 126

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-- 203
           C          +S  C   +    +  K C     +Y   Y S  +T G+   ET+ L  
Sbjct: 127 C----------DSKLCHKLDTGVCSPQKRC-----NYTYAYASAAITRGVLAQETITLSS 171

Query: 204 -PNRIIP--NFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
              + +P    + GC   ++        GI G G G  SL SQ+       +FS CL+  
Sbjct: 172 TKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPF 231

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
              D + +S +    GS  S K          V+ P VA+++     Y+V L  I+V   
Sbjct: 232 H-TDVSVSSKMSFGKGSKVSGKGV--------VSTPLVAKQD--KTPYFVTLLGISVENT 280

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            +        +++   G   +DSGT  T +  +L+    D+ V+Q V++    + +  + 
Sbjct: 281 YLHFNGSSQNVEK---GNMFLDSGTPPTILPTQLY----DQVVAQ-VRSEVAMKPVTDDP 332

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
             G + C+       G  P L  HF+ GA+V L     F    +G   CL   T+  + G
Sbjct: 333 DLGPQLCYRTKNNLRG--PVLTAHFE-GADVKLSPTQTFISPKDG-VFCLG-FTNTSSDG 387

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           G   + GNF   NY + +DL  Q + FK + C
Sbjct: 388 G---VYGNFAQSNYLIGFDLDRQVVSFKPKDC 416


>gi|224090425|ref|XP_002308984.1| predicted protein [Populus trichocarpa]
 gi|222854960|gb|EEE92507.1| predicted protein [Populus trichocarpa]
          Length = 416

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 100/405 (24%), Positives = 158/405 (39%), Gaps = 81/405 (20%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
           TP   +   LD G   +W  C   Y                +SSS +   C   +CS   
Sbjct: 41  TPLVPVEVTLDLGGQYLWVDCQQGY----------------VSSSKKNPSCNTAQCSLAV 84

Query: 157 HESIQC----RDCNDEP------LATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           +    C    + C   P        TS   TQ   S     GS              P R
Sbjct: 85  YRLKTCTVDKKFCVLSPDNTATRTGTSDYLTQDVVSIQSTDGSN-------------PGR 131

Query: 207 II--PNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKF 254
           ++  PNFL  C+   +L   ++   G+AG GR K SLPSQ +       KF+ CL S   
Sbjct: 132 VVSVPNFLFSCAPTFILQGLAKGVKGMAGLGRTKISLPSQFSAAFSFPKKFAICLTS--- 188

Query: 255 DDTTRTSSLILDNGS----SHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRR 306
             +     +I  +G      H+D  +  L YTP + NP       F    S  Y++G++ 
Sbjct: 189 --SNAKGVVIFGDGPYVLLPHADDLSQSLIYTPLILNPVSTASGYFEGEPSTDYFIGVKS 246

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I +    V +    L+++R+G GGT + +   +T M   ++  + D FV ++ K  N  R
Sbjct: 247 IKINENVVPLNASLLSINREGYGGTKISTVNAYTVMETTIYNAVTDSFVRELAK-ANVPR 305

Query: 367 ALGAEALTGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV----- 420
                         ++   + G + P++ L  +           Y+ + G  S V     
Sbjct: 306 VASVAPFGACFNSKNIGSTRVGPAVPQIDLVLQSK-------NVYWRIFGANSMVQVKDD 358

Query: 421 --CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
             CL  V D   +   SI++G  Q+++  +++DL   RLGF   L
Sbjct: 359 VLCLGFV-DGGVNPRTSIVIGGHQLEDNLLQFDLAASRLGFSSSL 402


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 103/398 (25%), Positives = 169/398 (42%), Gaps = 73/398 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ    I+D+GS + + PC +   C+ C + + P F P LSS+   + 
Sbjct: 83  GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCAS---CEQCGNHQDPRFQPDLSSTYSPVK 139

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C +  C+    +S QC    +   A   + + +    +V +G+         E+   P R
Sbjct: 140 C-SADCTCDSDKS-QC--TYERQYAEMSSSSGVLGEDIVSFGT---------ESELKPQR 186

Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
            +     GC       L S+   GI G GRG+ S+  QL       D FS C        
Sbjct: 187 AV----FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG- 241

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
                +++L    +  D      + +  V +P          YY + L+ I V G+ +R+
Sbjct: 242 ---GGAMVLGAMPAPPDMV---FSRSDPVRSP----------YYNIELKEIHVAGKALRL 285

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
             +      D   GT++DSGTT+ ++  + F    D   S++       R L  + + G 
Sbjct: 286 DPRIF----DSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKV-------RPL--KKIRGP 332

Query: 377 RP-----CFDVPGEKTG----SFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVT 426
            P     CF   G        +FP++ + F  G +++L  ENY F       A CL V  
Sbjct: 333 DPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQ 392

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + +    P+ +LG   ++N  V YD  N+++GF +  C
Sbjct: 393 NGK---DPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 427


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 106/389 (27%), Positives = 156/389 (40%), Gaps = 59/389 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
           G Y   L  GTP      ++DTGS L W  C+    C   C     P F P+ SS+   +
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCS---PCVVSCHRQVGPLFDPRASSTYASV 188

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
            C   +C     + +Q    N    + S  C      Y   YG S  + G   ++T++  
Sbjct: 189 RCSASQC-----DELQAATLNPSACSASNVCI-----YQASYGDSSFSVGSLSTDTVSFG 238

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
           +   P+F  GC   +     + AG+ G  R K SL  QL       FSYCL        T
Sbjct: 239 STRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-------PT 291

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW- 317
             S+  L  G  ++       +YTP      +A  +  +  Y++ L  ++VGG  + V  
Sbjct: 292 AASTGYLSIGPYNTGHY---YSYTP------MASSSLDASLYFITLSGMSVGGSPLAVSP 342

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
            +Y +L       TI+DSGT  T +   +   L+      M   +       A A + L 
Sbjct: 343 SEYSSLP------TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQR------APAFSILD 390

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSI 436
            CF+    +    P + + F GGA + L   N    V + S  CL    TD  A      
Sbjct: 391 TCFEGQASQL-RVPTVAMAFAGGASMKLTTRNVLIDV-DDSTTCLAFAPTDSTA------ 442

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           I+GN Q Q + V YD+   R+GF    C 
Sbjct: 443 IIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|23954367|emb|CAD27730.1| xylanase inhibitor [Triticum aestivum]
 gi|56201268|dbj|BAD72880.1| xylanase inhibitor TAXI-I [Triticum aestivum]
          Length = 402

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 98/388 (25%), Positives = 158/388 (40%), Gaps = 72/388 (18%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL-------GCQNPKCSWIH 156
            +LD    LVW           C   + P+ IP  SS + LL       GC  P C    
Sbjct: 47  LVLDVAGPLVW---------STCDGGQPPAEIP-CSSPTCLLANAYPAPGCPAPSCGSDK 96

Query: 157 HESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
           H+    + C   P    S  C     S+   + +  T+G      +N+        L  C
Sbjct: 97  HD----KPCTAYPYNPVSGACAAGSLSH-TRFVANTTDGSKPVSKVNV------GVLAAC 145

Query: 216 S---VLSS--RQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
           +   +L+S  R   G+AG      +LP+Q+       ++F  CL       T      I 
Sbjct: 146 APSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCL------PTGGPGVAIF 199

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
             G     + T  + YTP V           S  +Y+  R I VG  RV V    L    
Sbjct: 200 GGGPVPWPQFTQSMPYTPLVTK-------GGSPAHYISARSIVVGDTRVPVPEGALA--- 249

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEF----VSQMVKNRNYTRALGAEALTGLRPCFD 381
              GG ++ +   +  + P+++ PL D F     +Q        RA+ A A  G+  C+D
Sbjct: 250 --TGGVMLSTRLPYVLLRPDVYRPLMDAFTKALAAQHANGAPVARAVEAVAPFGV--CYD 305

Query: 382 VP--GEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG---- 433
               G   G +  P ++L   GG++ T+  +N    V +G+A C+  V  +  + G    
Sbjct: 306 TKTLGNNLGGYAVPNVQLGLDGGSDWTMTGKNSMVDVKQGTA-CVAFVEMKGVAAGDGRA 364

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
           P++ILG  QM+++ +++D+  +RLGF +
Sbjct: 365 PAVILGGAQMEDFVLDFDMEKKRLGFSR 392


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 111/437 (25%), Positives = 176/437 (40%), Gaps = 72/437 (16%)

Query: 44  SYQNLNSLVSSSLTRAL-HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQII 102
           +Y N   L +SS  R L    NP  +        T         G Y+  L  GTP Q  
Sbjct: 53  AYPNATRLPASSARRGLGDGHNPNARMRLHDDLLTN--------GYYTTRLYIGTPSQEF 104

Query: 103 PFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
             I+D+GS + + PC     C+ C + + P F P LSS+   + C N  C+   +E  QC
Sbjct: 105 ALIVDSGSTVTYVPCAT---CEQCGNHQDPRFQPDLSSTYSPVKC-NVDCT-CDNERSQC 159

Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCS-----V 217
                      +   ++  S  VL    ++ G    E+   P R +     GC       
Sbjct: 160 --------TYERQYAEMSSSSGVLGEDIMSFG---KESELKPQRAV----FGCENTETGD 204

Query: 218 LSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
           L S+   GI G GRG+ S+  QL       D FS C         T    ++L    +  
Sbjct: 205 LFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT----MVLGGMPAPP 260

Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
           D      +++  V +P          YY + L+ I V G+ +R+  K      +   GT+
Sbjct: 261 DMV---FSHSNPVRSP----------YYNIELKEIHVAGKALRLDPKIF----NSKHGTV 303

Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS--- 389
           +DSGTT+ ++  + F    D   +++    N  + +          CF   G        
Sbjct: 304 LDSGTTYAYLPEQAFVAFKDAVTNKV----NSLKKIRGPDPNYKDICFAGAGRNVSQLSE 359

Query: 390 -FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYY 447
            FP++ + F  G +++L  ENY F       A CL V  + +    P+ +LG   ++N  
Sbjct: 360 VFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGK---DPTTLLGGIVVRNTL 416

Query: 448 VEYDLRNQRLGFKQQLC 464
           V YD  N+++GF +  C
Sbjct: 417 VTYDRHNEKIGFWKTNC 433


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 106/389 (27%), Positives = 156/389 (40%), Gaps = 59/389 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
           G Y   L  GTP      ++DTGS L W  C+    C   C     P F P+ SS+   +
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCS---PCVVSCHRQVGPLFDPRASSTYTSV 188

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
            C   +C     + +Q    N    + S  C      Y   YG S  + G   ++T++  
Sbjct: 189 RCSASQC-----DELQAATLNPSACSASNVCI-----YQASYGDSSFSVGYLSTDTVSFG 238

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
           +   P+F  GC   +     + AG+ G  R K SL  QL       FSYCL        T
Sbjct: 239 STSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-------PT 291

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW- 317
             S+  L  G  ++       +YTP      +A  +  +  Y++ L  ++VGG  + V  
Sbjct: 292 AASTGYLSIGPYNTGHY---YSYTP------MASSSLDASLYFITLSGMSVGGSPLAVSP 342

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
            +Y +L       TI+DSGT  T +   +   L+      M   +       A A + L 
Sbjct: 343 SEYSSLP------TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQR------APAFSILD 390

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSI 436
            CF+    +    P + + F GGA + L   N    V + S  CL    TD  A      
Sbjct: 391 TCFEGQASQL-RVPTVVMAFAGGASMKLTTRNVLIDV-DDSTTCLAFAPTDSTA------ 442

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           I+GN Q Q + V YD+   R+GF    C 
Sbjct: 443 IIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 167/389 (42%), Gaps = 54/389 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           Y + +S GTPP      +DTGS L W  C N   +C   ++     F P  SS+   +GC
Sbjct: 6   YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 65

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS--YLVLYGSG-LTEGIALSETLNLP 204
               C+ +H +           LA    C +   +  Y + YGSG  + G    + L L 
Sbjct: 66  STEACNGMHMD-----------LAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA 114

Query: 205 -NRIIPNFLVGCSV--LSSRQPAGIAGFGRGKTSLPSQL----NLDKFSYCLLSHKFDDT 257
            NR I NF+ GC    L +   AGI GFG    S  +Q+    +   FSYC       D 
Sbjct: 115 SNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPR----DH 170

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
               SL +   +   +   T L Y  + + P+          Y +    + V G R+ + 
Sbjct: 171 ENEGSLTIGPYARDINLMWTKLIY--YDHKPA----------YAIQQLDMMVNGIRLEI- 217

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
             Y+ + +     TIVDSGT  T++   +F+ L D+ +++ ++ + YTR          R
Sbjct: 218 DPYIYISK----MTIVDSGTADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDER-----R 267

Query: 378 PCF--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
            CF  +        FP +++     + + LPVEN F      + +C T + D     G  
Sbjct: 268 ICFISNSGSANWNDFPTVEMKLI-RSTLKLPVENAF-YESSNNVICSTFLPDDAGVRGVQ 325

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +LGN  ++++ + +D++    GFK + C
Sbjct: 326 -MLGNRAVRSFKLVFDIQAMNFGFKARAC 353


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 167/389 (42%), Gaps = 54/389 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           Y + +S GTPP      +DTGS L W  C N   +C   ++     F P  SS+   +GC
Sbjct: 25  YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 84

Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS--YLVLYGSG-LTEGIALSETLNLP 204
               C+ +H +           LA    C +   +  Y + YGSG  + G    + L L 
Sbjct: 85  STEACNGMHMD-----------LAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA 133

Query: 205 -NRIIPNFLVGCSV--LSSRQPAGIAGFGRGKTSLPSQL----NLDKFSYCLLSHKFDDT 257
            NR I NF+ GC    L +   AGI GFG    S  +Q+    +   FSYC       D 
Sbjct: 134 SNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPR----DH 189

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
               SL +   +   +   T L Y  + + P+          Y +    + V G R+ + 
Sbjct: 190 ENEGSLTIGPYARDINLMWTKLIY--YDHKPA----------YAIQQLDMMVNGIRLEI- 236

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
             Y+ + +     TIVDSGT  T++   +F+ L D+ +++ ++ + YTR          R
Sbjct: 237 DPYIYISK----MTIVDSGTADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDER-----R 286

Query: 378 PCF--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
            CF  +        FP +++     + + LPVEN F      + +C T + D     G  
Sbjct: 287 ICFISNSGSANWNDFPTVEMKLI-RSTLKLPVENAF-YESSNNVICSTFLPDDAGVRGVQ 344

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +LGN  ++++ + +D++    GFK + C
Sbjct: 345 -MLGNRAVRSFKLVFDIQAMNFGFKARAC 372


>gi|55669876|pdb|1T6E|X Chain X, Crystal Structure Of The Triticum Aestivum Xylanase
           Inhibitor I
 gi|55669877|pdb|1T6G|A Chain A, Crystal Structure Of The Triticum Aestivum Xylanase
           Inhibitor-i In Complex With Aspergillus Niger Xylanase-i
 gi|55669878|pdb|1T6G|B Chain B, Crystal Structure Of The Triticum Aestivum Xylanase
           Inhibitor-i In Complex With Aspergillus Niger Xylanase-i
          Length = 381

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 98/388 (25%), Positives = 158/388 (40%), Gaps = 72/388 (18%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL-------GCQNPKCSWIH 156
            +LD    LVW           C   + P+ IP  SS + LL       GC  P C    
Sbjct: 26  LVLDVAGPLVW---------STCDGGQPPAEIP-CSSPTCLLANAYPAPGCPAPSCGSDK 75

Query: 157 HESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
           H+    + C   P    S  C     S+   + +  T+G      +N+        L  C
Sbjct: 76  HD----KPCTAYPYNPVSGACAAGSLSH-TRFVANTTDGSKPVSKVNV------GVLAAC 124

Query: 216 S---VLSS--RQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
           +   +L+S  R   G+AG      +LP+Q+       ++F  CL       T      I 
Sbjct: 125 APSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCL------PTGGPGVAIF 178

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
             G     + T  + YTP V           S  +Y+  R I VG  RV V    L    
Sbjct: 179 GGGPVPWPQFTQSMPYTPLVTK-------GGSPAHYISARSIVVGDTRVPVPEGALA--- 228

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEF----VSQMVKNRNYTRALGAEALTGLRPCFD 381
              GG ++ +   +  + P+++ PL D F     +Q        RA+ A A  G+  C+D
Sbjct: 229 --TGGVMLSTRLPYVLLRPDVYRPLMDAFTKALAAQHANGAPVARAVEAVAPFGV--CYD 284

Query: 382 VP--GEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG---- 433
               G   G +  P ++L   GG++ T+  +N    V +G+A C+  V  +  + G    
Sbjct: 285 TKTLGNNLGGYAVPNVQLGLDGGSDWTMTGKNSMVDVKQGTA-CVAFVEMKGVAAGDGRA 343

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
           P++ILG  QM+++ +++D+  +RLGF +
Sbjct: 344 PAVILGGAQMEDFVLDFDMEKKRLGFSR 371


>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
 gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
          Length = 408

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 104/432 (24%), Positives = 162/432 (37%), Gaps = 61/432 (14%)

Query: 40  PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPP 99
           PS    +++ +L  +   R L + +    ++   T+      S  +   Y +    GTP 
Sbjct: 32  PSPSPLESIIALARADDARLLFLSSKAASSSGGVTSAPVA--SGQTPPSYVVRAGLGTPV 89

Query: 100 QIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
           Q +   LDT +   W  C     C  C +     FIP  SSS   L C +  C      +
Sbjct: 90  QQLLLALDTSADATWSHCA---PCDTCPAGS--RFIPASSSSYASLPCASDWCPLFRRPA 144

Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVG---CS 216
           +       EP                    G     A    L   +R   + ++    C 
Sbjct: 145 VP-----GEP--------------------GRVGAAADVRLLQAASRTPRSGVLAATRCG 179

Query: 217 VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHK---FDDTTRTSSLILDNGSSHSD 273
              +  PA  +G     +   S+ N   FSYCL S++   F  + R        G++   
Sbjct: 180 WARTPSPATRSGPMSLLSQTGSRYN-GVFSYCLPSYRSYYFSGSLRL-------GAAGQP 231

Query: 274 KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIV 333
           +    + YTP + NP    R +    YYV +  ++VG   V+        D     GT++
Sbjct: 232 RN---VRYTPLLTNP---HRPSL---YYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVI 282

Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
           DSGT  T     ++  L DEF  Q+     YT +LGA        CF+      G  P +
Sbjct: 283 DSGTVITRWTAPVYAALRDEFRRQVAAPSGYT-SLGA-----FDTCFNTDEVAAGGAPPV 336

Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
            LH  GG ++TLP+EN           CL +    +       ++ N Q QN  V  D+ 
Sbjct: 337 TLHMGGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVA 396

Query: 454 NQRLGFKQQLCK 465
             R+GF ++ C 
Sbjct: 397 GSRVGFAREPCN 408


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 101/397 (25%), Positives = 167/397 (42%), Gaps = 54/397 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G PP+     +DTGS ++W  C +   C   S  +IP   F P  S+++ L
Sbjct: 81  GLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASL 140

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
           + C +  C+      +Q  D        S  C     +Y+  YG G  T G  + + ++L
Sbjct: 141 VSCSDQICAL----GVQSSD--SACFGQSNQC-----AYVFQYGDGSGTSGYYVMDMIHL 189

Query: 204 PNRI--------IPNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
              I          + + GCS         S R   GI GFG+   S+ SQL+    +  
Sbjct: 190 DVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPK 249

Query: 249 LLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           + SH    D +    L+L       +     + YTP V  PS         +Y + L+ I
Sbjct: 250 VFSHCLKGDDSGGGILVL------GEIVEPNVVYTPLV--PS-------QPHYNLNLQSI 294

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           +V GQ + +           + GTI+DSGTT  ++A E +       V+ +V     +++
Sbjct: 295 SVNGQVLPISPAVFATSS--SQGTIIDSGTTLAYLAEEAYNAFVVA-VTNIV-----SQS 346

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
             +  L G R C+      +  FP++ L+F GGA + L  ++Y           +  +  
Sbjct: 347 TQSVVLKGNR-CYVTSSSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGF 405

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ++  G    ILG+  +++    YDL NQR+G+    C
Sbjct: 406 QKIPGQGITILGDLVLKDKIFIYDLANQRIGWTNYDC 442


>gi|356535355|ref|XP_003536212.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 444

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 103/410 (25%), Positives = 179/410 (43%), Gaps = 89/410 (21%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
           TP   +   +D G    W  C   Y                +SS+S+   C + +CS   
Sbjct: 60  TPLVPVKLTVDLGGGYFWVNCEKGY----------------VSSTSKPARCGSAQCSLFG 103

Query: 157 HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL-NLPNRII--PNFLV 213
                   CN E    S++ +    + +  +G    + +A++ T  N P R++  P FL 
Sbjct: 104 -----LYGCNVEDKICSRSLSNTV-TGVSTFGEIHADVVAINATDGNNPVRVVSVPKFLF 157

Query: 214 --GCSVLSSRQPAGI---AGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSL 263
             G +V+ +   +G+   AG GR K SLPSQ +     L KF+ CL S     +T T+ +
Sbjct: 158 ICGANVVQNGLASGVTGMAGLGRTKVSLPSQFSSAFSFLRKFAICLSS-----STMTNGV 212

Query: 264 IL------DNGSSHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRRITVGGQR 313
           +       + G  +SD     LT+TP + NP     + F    SV Y++G++ I V  + 
Sbjct: 213 MFFGDGPYNFGYLNSDLSKV-LTFTPLITNPVSTAPSYFQGEPSVEYFIGVKSIRVSDKN 271

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           V +    L++DR+G GGT + +   +T +   +++ +++ FV          +A+GA  +
Sbjct: 272 VPLNTTLLSIDRNGIGGTKISTVNPYTVLETTIYKAVSEAFV----------KAVGAPTV 321

Query: 374 TGLRP---CF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
             + P   CF   D+   + G + P++ L  +   EV   +    ++V     +CL  V 
Sbjct: 322 APVAPFGTCFATKDIQSTRMGPAVPDINLVLQN--EVVWSIIGANSMVYTNDVICLGFV- 378

Query: 427 DREASGGP----------------SIILGNFQMQNYYVEYDLRNQRLGFK 460
             +A   P                SI +G  Q++N  +++DL   RLGF+
Sbjct: 379 --DAGSDPSTAQVGFVVGYSQPITSITIGAHQLENNMLQFDLATSRLGFR 426


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 162/387 (41%), Gaps = 59/387 (15%)

Query: 92  SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
           ++S G PP     ++DTGS ++W  CT    C  C +     F P +SS+   L C+ P 
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCT---PCTNCDNHLGLLFDPSMSSTFSPL-CKTP- 158

Query: 152 CSWIHHESIQCRDCNDEPLAT--SKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRII 208
           C +       C  C+  P     + N T           SG+      + ET +     I
Sbjct: 159 CDFK-----GCSRCDPIPFTVTYADNST----------ASGMFGRDTVVFETTDEGTSRI 203

Query: 209 PNFLVGC--SVLSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
           P+ L GC  ++     P   GI G   G  SL +++   KFSYC+     D       LI
Sbjct: 204 PDVLFGCGHNIGQDTDPGHNGILGLNNGPDSLATKIG-QKFSYCI-GDLADPYYNYHQLI 261

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L  G+      T      PF            + +YYV +  I+VG +R+ +  +   + 
Sbjct: 262 LGEGADLEGYST------PF---------EVHNGFYYVTMEGISVGEKRLDIAPETFEMK 306

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT----RALGAEALTGLRPCF 380
           ++  GG I+D+G+T TF+   +   L+ E        RN      R    E    ++  +
Sbjct: 307 KNRTGGVIIDTGSTITFLVDSVHRLLSKEV-------RNLLGWSFRQTTIEKSPWMQCFY 359

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--VTDREASGGPSIIL 438
                    FP +  HF  GA++ L   ++F  + + +  C+TV  V+       PS+I 
Sbjct: 360 GSISRDLVGFPVVTFHFADGADLALDSGSFFNQLND-NVFCMTVGPVSSLNLKSKPSLI- 417

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           G    Q+Y V YDL NQ + F++  C+
Sbjct: 418 GLLAQQSYSVGYDLVNQFVYFQRIDCE 444


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 102/400 (25%), Positives = 165/400 (41%), Gaps = 77/400 (19%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ    I+D+GS + + PC++   C+ C + + P F P LSSS   + 
Sbjct: 86  GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSS---CEQCGNHQDPRFQPDLSSSYSPVK 142

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL---NL 203
           C N  C+           C+ +     K CT     Y   Y    +    L E +     
Sbjct: 143 C-NVDCT-----------CDSD----KKQCT-----YERQYAEMSSSSGVLGEDIVSFGR 181

Query: 204 PNRIIPNFLV-GCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSH 252
            + + P   + GC       L S+   GI G GRG+ S+  QL       D FS C    
Sbjct: 182 ESELKPQHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGM 241

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTP---FVNNPSVAERNAFSVYYYVGLRRITV 309
                    +++L            G+   P   F N+  +      S YY + L+ I V
Sbjct: 242 DIG----GGAMVL-----------GGMLAPPDMIFSNSDPLR-----SPYYNIELKEIHV 281

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
            G+ +RV  +      +   GT++DSGTT+ ++  + F    +   S++    +  + + 
Sbjct: 282 AGKALRVESRIF----NSKHGTVLDSGTTYAYLPEQAFVAFKEAVTSKV----HSLKKIR 333

Query: 370 AEALTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTV 424
               +    CF   G         FP++ + F  G +++L  ENY F       A CL V
Sbjct: 334 GPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV 393

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             + +    P+ +LG   ++N  V YD  N+++GF +  C
Sbjct: 394 FQNGK---DPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 430


>gi|383134454|gb|AFG48206.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134458|gb|AFG48208.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134460|gb|AFG48209.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134462|gb|AFG48210.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134464|gb|AFG48211.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134466|gb|AFG48212.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134468|gb|AFG48213.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134470|gb|AFG48214.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134474|gb|AFG48216.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134486|gb|AFG48222.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
          Length = 136

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 70/113 (61%), Gaps = 8/113 (7%)

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           IT+GGQR+++     T D++GNGG IVDSGTTFT +   L+  + ++  S +     Y+R
Sbjct: 1   ITIGGQRLKLPSSLTTFDKEGNGGLIVDSGTTFTMLPESLYRRVLNKLKSAI----RYSR 56

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPEL---KLHFKGGAEVTLPVENYFAVVGE 416
           ++  EA  GL  C+++P    GSFP L    LHFK  A +TLP ENY +++ +
Sbjct: 57  SVKYEAALGLDLCYELP-SAGGSFPVLPTFSLHFKDNATITLPAENYMSMMSD 108


>gi|361066669|gb|AEW07646.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
          Length = 136

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 46/113 (40%), Positives = 68/113 (60%), Gaps = 8/113 (7%)

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           IT+GGQR+++     T D++GNGG IVDSGTTFT +   L+     E + ++     Y+R
Sbjct: 1   ITIGGQRLKLPSSLTTFDKEGNGGLIVDSGTTFTMLPESLYR----EVLKKLKSAIRYSR 56

Query: 367 ALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVGE 416
           ++  EA  GL  C+++P E  GS   FP   LHFK  A + LP ENY +++ +
Sbjct: 57  SVRYEAALGLDLCYELPSE-VGSFPVFPTFSLHFKDNATIRLPAENYMSMMSD 108


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 105/407 (25%), Positives = 161/407 (39%), Gaps = 73/407 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  G P +     +DTGS ++W  C+    C  C +S     ++  F P  SS+
Sbjct: 87  GLYFTRVKLGNPAKEYFVQIDTGSDILWVACS---PCTGCPTSSGLNIQLEFFNPDSSST 143

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
           S  + C + +C+            +D P   S  C      Y   YG G  T G  +S+T
Sbjct: 144 SSRIPCSDDRCTAALQTGEAVCQSSDSP---SSPC-----GYTFTYGDGSGTSGFYVSDT 195

Query: 201 LN----LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQL----- 240
           +     + N    N     + GCS       + + R   GI GFG+ + S+ SQL     
Sbjct: 196 MYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGV 255

Query: 241 NLDKFSYCLLSHKFDDTTRTSSLILDNGSS---HSDKKTTGLTYTPFVNNPSVAERNAFS 297
           +   FS+CL                DNG       +    GL +TP V  PS        
Sbjct: 256 SPKTFSHCLKGS-------------DNGGGILVLGEIVEPGLVFTPLV--PS-------Q 293

Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
            +Y + L  I V GQ++ +             GTIVDSGTT  ++    ++P  +   + 
Sbjct: 294 PHYNLNLESIAVSGQKLPIDSSLFATSN--TQGTIVDSGTTLVYLVDGAYDPFINAIAAA 351

Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
           +  +     + G +       CF        SFP   L+FKGG  +T+  ENY    G  
Sbjct: 352 VSPSVRSVVSKGIQ-------CFVTTSSVDSSFPTATLYFKGGVSMTVKPENYLLQQGSV 404

Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               L  +  + + G    ILG+  +++    YDL N R+G+    C
Sbjct: 405 DNNVLWCIGWQRSQG--ITILGDLVLKDKIFVYDLANMRMGWADYDC 449


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 108/395 (27%), Positives = 156/395 (39%), Gaps = 69/395 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
           Y +++  GTP      ++DTGS L W       QC+ C+S+     K P F P  SS+  
Sbjct: 124 YVVTVGLGTPSVSQVLLIDTGSDLSWV------QCQPCNSTTCYPQKDPLFDPSKSSTYA 177

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLA---TSKNCTQICPSYLVLYGSG-LTEGIALSE 199
            + C           +  CRD  D+       S +    C  + + YG G  T G+  +E
Sbjct: 178 PIPCN----------TDACRDLTDDGYGGGCASGDGAAQC-GFAITYGDGSQTRGVYSNE 226

Query: 200 TLNL-PNRIIPNFLVGCS---VLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSH 252
           TL L P   + +F  GC      ++ +  G+ G G    SL  Q   +    FSYCL + 
Sbjct: 227 TLALAPGVAVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPA- 285

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
             ++     +L      S     T+G  +TP +      E   F   Y V +  ITVGG+
Sbjct: 286 -LNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIR-----EEETF---YVVNMTGITVGGE 336

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + V     +      GG I+DSGT  T +    +  L   F   M     Y      E 
Sbjct: 337 PIDVPPSAFS------GGMIIDSGTVVTELQHTAYNALQAAFRKAMAA---YPLVRNGE- 386

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
              L  C+D  G    + P++ L F GGA + L V N   +       CL          
Sbjct: 387 ---LDTCYDFSGYSNVTLPKVALTFSGGATIDLDVPNGILL-----DDCLAF-----QES 433

Query: 433 GPSI---ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GP     ILGN   +   V YD    R+GF+  +C
Sbjct: 434 GPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAVC 468


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 109/395 (27%), Positives = 150/395 (37%), Gaps = 73/395 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           + + + FGTP Q    ILDTGS L W    PC+ H     C     P F P  SSS   +
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGH-----CYRQHDPDFDPAKSSSYAAV 191

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL- 203
            C  P C+                 A    C      Y V YG G  T G+   +TL   
Sbjct: 192 PCGTPVCA-----------------AAGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFN 234

Query: 204 PNRIIPNFLVGCSVLSSRQPAGIAGFGR---------GKTSLPSQLNLD---KFSYCLLS 251
            +     F  GC          I  FG          GK SLPSQ        FSYCL S
Sbjct: 235 SSSKFTGFTFGCGE------KNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPS 288

Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
           +       T+   L+ G++     T  + YT  +  P       +  +Y++ L  I +GG
Sbjct: 289 YN------TTPGYLNIGATKP-TSTVPVQYTAMIKKPQ------YPSFYFIELVSINIGG 335

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
             + V     T       GT++DSGT  T++ P  +  L D F   M  N+       A 
Sbjct: 336 YILPVPPSVFT-----KTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKP------AP 384

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV--CLTVVTDRE 429
               L  C+D  G+     P +  +F  GA   L          +   +  CL  V+   
Sbjct: 385 PYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLDFYGIMIFPDDAKPLIGCLAFVSRPA 444

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           A   P  I+GN Q +   V YD+ +Q++GF    C
Sbjct: 445 AM--PFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477


>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
          Length = 382

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 81/265 (30%), Positives = 118/265 (44%), Gaps = 33/265 (12%)

Query: 212 LVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
           LVG    S R     P+G+ G GRG+ SL SQ    KFSYCL  + F +   T  L +  
Sbjct: 136 LVGLRAPSRRARSMAPSGLMGLGRGRLSLVSQTGATKFSYCLTPY-FHNNGATGHLFV-- 192

Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
           G+S S      +  T FV  P        S +YY+ L  +TVG  R+ +      L    
Sbjct: 193 GASASLGGHGDVMTTQFVKGPK------GSPFYYLPLIGLTVGETRLPIPATVFDLREVA 246

Query: 328 ----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
               +GG I+DSG+ FT +  + ++ LA E  +++        +L A           V 
Sbjct: 247 PGLFSGGVIIDSGSPFTSLVHDAYDALASELAARL------NGSLVAPPPDADDGALCVA 300

Query: 384 GEKTGS-FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP---SIILG 439
               G   P +  HF+GGA++ +P E+Y+A V + +A             GP     ++G
Sbjct: 301 RRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASA------GPYRRQSVIG 354

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           N+Q QN  V YDL N    F+   C
Sbjct: 355 NYQQQNMRVLYDLANGDFSFQPADC 379


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 111/440 (25%), Positives = 192/440 (43%), Gaps = 57/440 (12%)

Query: 37  HTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFG 96
           H  P++ +   +   +  S  R  +I+     +       T +   S +     ++LS G
Sbjct: 49  HYKPNETAKDRMELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPSLTGRTILVNLSIG 108

Query: 97  TP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
            P  PQ++  ++DTGS ++W  C     C  C +     F P +SS+   L C+ P C +
Sbjct: 109 QPSIPQLV--VMDTGSDILWIMCN---PCTNCDNHLGLLFDPSMSSTFSPL-CKTP-CGF 161

Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEG--IALSETLNLPNRIIPNFL 212
                   + C  +P+  +        SY+    +  T G  I + ET +     I + +
Sbjct: 162 --------KGCKCDPIPFTI-------SYVDNSSASGTFGRDILVFETTDEGTSQISDVI 206

Query: 213 VGC--SVLSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
           +GC  ++  +  P   GI G   G  SL +Q+   KFSYC+  +  D     + L L  G
Sbjct: 207 IGCGHNIGFNSDPGYNGILGLNNGPNSLATQIG-RKFSYCI-GNLADPYYNYNQLRLGEG 264

Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
           +      T      PF           +  +YYV +  I+VG +R+ +  +   + R+G 
Sbjct: 265 ADLEGYST------PF---------EVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGT 309

Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC-FDVPGEKT 387
           GG I+DSGTT T++     + L +E V  ++K  ++ + +   A   L  C + +     
Sbjct: 310 GGVILDSGTTITYLVDSAHKLLYNE-VRNLLK-WSFRQVIFENAPWKL--CYYGIISRDL 365

Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--VTDREASGGPSIILGNFQMQN 445
             FP +  HF  GA++ L   ++F+        C+TV   +    +  PS+I G    Q+
Sbjct: 366 VGFPVVTFHFVDGADLALDTGSFFS--QRDDIFCMTVSPASILNTTISPSVI-GLLAQQS 422

Query: 446 YYVEYDLRNQRLGFKQQLCK 465
           Y V YDL NQ + F++  C+
Sbjct: 423 YNVGYDLVNQFVYFQRIDCE 442


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 109/398 (27%), Positives = 155/398 (38%), Gaps = 69/398 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
           Y ++L  GTP      ++DTGS L W       QCK C +      K P F P  SSS  
Sbjct: 91  YVVTLGIGTPAVQQTVLIDTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSYA 144

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN 202
            + C +  C  +   +     C       S     +C  Y + YG+   T G+  +ETL 
Sbjct: 145 SVPCDSDACRKLAAGAYG-HGCT----GVSGGAAALC-EYGIEYGNRATTTGVYSTETLT 198

Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
           L P  ++ +F  GC         +  G+ G G    SL SQ +      FSYCL      
Sbjct: 199 LKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL------ 252

Query: 256 DTTRTSSLILDNGS---SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
             T   +  L  G+   S S    +GL++TP    PSV        +Y V L  I+VGG 
Sbjct: 253 PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSV------PTFYIVTLTGISVGGA 306

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + +     +       G ++DSGT  T +    +  L   F S M + R    + G   
Sbjct: 307 PLAIPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGV- 359

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP------VENYFAVVGEGSAVCLTVVT 426
              L  C+D  G    + P + L F GGA + L       V+   A  G G        T
Sbjct: 360 ---LDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDGCLAFAGAG--------T 408

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           D         I+GN   + + V YD     +GF+   C
Sbjct: 409 DNAIG-----IIGNVNQRTFEVLYDSGKGTVGFRAGAC 441


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 106/401 (26%), Positives = 164/401 (40%), Gaps = 61/401 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI--PSFIPKLSSSSRL 144
           G Y   +  G+P +     +DTGS ++W  C    +C   S   I    + PK S +S  
Sbjct: 67  GLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEF 126

Query: 145 LGCQNPKCSWIHHESI-QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
           + C++  CS  +   I  C+  N             CP Y + YG G  T G  + + L 
Sbjct: 127 VSCEHNFCSSTYEGRILGCKAENP------------CP-YSISYGDGSATTGYYVQDYLT 173

Query: 203 LPNRIIPN---------FLVGCSVL------SSRQPA--GIAGFGRGKTSLPSQLNLDKF 245
             NR+  N          + GC         SS + A  GI GFG+  +S+ SQL     
Sbjct: 174 F-NRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGK 232

Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
              + SH  D  T     I   G     K  T    TP V  P++A       +Y V L+
Sbjct: 233 VKKIFSHCLD--TNVGGGIFSIGEVVEPKVKT----TPLV--PNMA-------HYNVILK 277

Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
            I V G  +++     T D +   GT++DSGTT  ++   +++ L  + +++  + + Y 
Sbjct: 278 NIEVDGDILQLPSD--TFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVY- 334

Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL--T 423
             L  E  +    CF   G     FP +KLHF+    +T+   +Y       S  C+   
Sbjct: 335 --LVEEQYS----CFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQ 388

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                  +G    +LG+F + N  V YDL N  +G+    C
Sbjct: 389 KSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNC 429


>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 148/365 (40%), Gaps = 52/365 (14%)

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
           +DT S + W PC     C  CSS+    F    S++ + LGCQ  +C  +   +     C
Sbjct: 1   MDTSSDVAWIPCNG---CLGCSSTL---FNSPASTTYKSLGCQAAQCKQVPKPT-----C 49

Query: 166 NDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC------SVLS 219
                        +C   L   GS L   ++  +T+ L    +P +  GC        L 
Sbjct: 50  GGG----------VCSFNLTYGGSSLAANLS-QDTITLATDAVPGYSFGCIQKATGGSLP 98

Query: 220 SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
           ++   G+        S    L    FSYCL S  F     + SL L  G     K+   +
Sbjct: 99  AQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGSLRL--GPVGQPKR---I 151

Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
            YTP + NP    R +    Y+V L  + VG + V V     T +     GTI DSGT F
Sbjct: 152 KYTPLLKNP---RRPSL---YFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVF 205

Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
           T +    +  + D F +++ +N      L   +L G   C+ VP       P +   F G
Sbjct: 206 TRLVTPAYIAVRDAFRNRVGRN------LTVTSLGGFDTCYTVPIAA----PTITFMFTG 255

Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
              VTLP +N       GS  CL +    +       ++ N Q QN+ + YD+ N RLG 
Sbjct: 256 -MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGV 314

Query: 460 KQQLC 464
            ++LC
Sbjct: 315 ARELC 319


>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
          Length = 761

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 80/263 (30%), Positives = 118/263 (44%), Gaps = 30/263 (11%)

Query: 215 CSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
           C   +  +  G+ G  RG  S  +Q+ L KFSYC+          +S ++L   SS S  
Sbjct: 431 CRTRTHSKTTGLIGMNRGSLSFVTQMGLQKFSYCISGQD------SSGILLFGESSFSWL 484

Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
           K   L YTP V   S        V Y V L  I V    +++       D  G G T+VD
Sbjct: 485 K--ALKYTPLVQI-STPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVD 541

Query: 335 SGTTFTFMAPELFEPLADEFVSQ------MVKNRNYTRALGAEALTGLRPCFDVPGEKT- 387
           SGT FTF+   ++  L +EFV Q      ++++ N+    GA  L     C+ VP  +  
Sbjct: 542 SGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQ-GAMDL-----CYRVPLTRRT 595

Query: 388 -GSFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVVTDREASGGPSIILGNF 441
               P + L F+ GAE+++  E         + G  S  C T   + E  G  S I+G+ 
Sbjct: 596 LPPLPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYCFT-FGNSELLGVESYIIGHH 653

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
             QN ++E+DL   R+GF +  C
Sbjct: 654 HQQNVWMEFDLAKSRVGFAEVRC 676



 Score = 42.0 bits (97), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 38/79 (48%), Gaps = 15/79 (18%)

Query: 78  TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS---- 133
           ++ +S H     ++SL+ G+PPQ +  +LDTGS L W  C            K P+    
Sbjct: 364 SSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHC-----------KKAPNLHSV 412

Query: 134 FIPKLSSSSRLLGCQNPKC 152
           F P  SSS   + C +P C
Sbjct: 413 FDPLRSSSYSPIPCTSPTC 431


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 98/398 (24%), Positives = 160/398 (40%), Gaps = 56/398 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   L  GTPP+     +DTGS ++W  C +   C   S  +I    F P  S ++  
Sbjct: 79  GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
           + C + +CSW    S       D   +   N   +C +Y   YG G  T G  +S+ L  
Sbjct: 139 ISCSDQRCSWGIQSS-------DSGCSVQNN---LC-AYTFQYGDGSGTSGFYVSDVLQF 187

Query: 203 --------LPNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
                   +PN   P  + GCS       V S R   GI GFG+   S+ SQL     + 
Sbjct: 188 DMIVGSSLVPNSTAP-VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAP 246

Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
            + SH          +++       +     + +TP V  PS         +Y V L  I
Sbjct: 247 RVFSHCLKGENGGGGILV-----LGEIVEPNMVFTPLV--PS-------QPHYNVNLLSI 292

Query: 308 TVGGQRVRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           +V GQ + +     +     NG GTI+D+GTT  +++   + P  +   + + ++     
Sbjct: 293 SVNGQALPINPSVFSTS---NGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVV 349

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
           + G +       C+ +       FP + L+F GGA + L  ++Y           +  + 
Sbjct: 350 SKGNQ-------CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIG 402

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +        ILG+  +++    YDL  QR+G+    C
Sbjct: 403 FQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 155/392 (39%), Gaps = 74/392 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSR 143
           Y +++S GTP       +DTGS + W       QCK C      S + P F P  SSS  
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWV------QCKPCPSPPCYSQRDPLFDPTRSSSYS 195

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
            + C    CS +   S  C         +   C      Y+V YG G  T G+  S+TL 
Sbjct: 196 AVPCAAASCSQLALYSNGC---------SGGQC-----GYVVSYGDGSTTTGVYSSDTLT 241

Query: 203 LP-NRIIPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
           L  +  +  FL GC        AG+    G GR   SL SQ +      FSYCL      
Sbjct: 242 LTGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCL------ 295

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGGQ 312
             T+ S   +  G   S   T G + TP +   N+P+         YY V L  I+VGGQ
Sbjct: 296 PPTQNSVGYISLGGPSS---TAGFSTTPLLTASNDPT---------YYIVMLAGISVGGQ 343

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + +             G +VD+GT  T + P  +  L   F + M     Y     A A
Sbjct: 344 PLSIDASVFA------SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAP-YGYPS---APA 393

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
              L  C+D     T + P + + F GGA + L       ++  G         D +AS 
Sbjct: 394 TGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSG---ILTSGCLAFAPTGGDSQAS- 449

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               ILGN Q +++ V +D     +GF    C
Sbjct: 450 ----ILGNVQQRSFEVRFD--GSTVGFMPASC 475


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 166/387 (42%), Gaps = 54/387 (13%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
           + +S GTPP      +DTGS L W  C N   +C   ++     F P  SS+   +GC  
Sbjct: 1   MGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCST 60

Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPS--YLVLYGSG-LTEGIALSETLNLP-N 205
             C+ +H +           LA    C +   +  Y + YGSG  + G    + L L  N
Sbjct: 61  EACNGMHMD-----------LAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASN 109

Query: 206 RIIPNFLVGCSV--LSSRQPAGIAGFGRGKTSLPSQL----NLDKFSYCLLSHKFDDTTR 259
           R I NF+ GC    L +   AGI GFG    S  +Q+    +   FSYC       D   
Sbjct: 110 RSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPR----DHEN 165

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
             SL +   +   +   T L Y  + + P+          Y +    + V G R+ +   
Sbjct: 166 EGSLTIGPYARDINLMWTKLIY--YDHKPA----------YAIQQLDMMVNGIRLEI-DP 212

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
           Y+ + +     TIVDSGT  T++   +F+ L D+ +++ ++ + YTR          R C
Sbjct: 213 YIYISK----MTIVDSGTADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDER-----RIC 262

Query: 380 F--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
           F  +        FP +++     + + LPVEN F      + +C T + D     G   +
Sbjct: 263 FISNSGSANWNDFPTVEMKLI-RSTLKLPVENAF-YESSNNVICSTFLPDDAGVRGVQ-M 319

Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           LGN  ++++ + +D++    GFK + C
Sbjct: 320 LGNRAVRSFKLVFDIQAMNFGFKARAC 346


>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
 gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
 gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
 gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
 gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
 gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
 gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 112/435 (25%), Positives = 183/435 (42%), Gaps = 79/435 (18%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
           + N Q +  T++++T    I   S   +   +++S G PP +    +DTGS L W    P
Sbjct: 85  LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
           C  H  C   S+   P F P  S +SR + C + KC  + ++  +Q  +C    +    +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKENS 198

Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
           CT     Y V YG+G   + G  +++TL + +  + + + GCS  V  S   AGI GFG 
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252

Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
                       P  L+   FSYCL +    D T+   +IL       D+      YTP 
Sbjct: 253 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 304

Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
              +N P+          Y + +  +   GQR+             +   IVDSG   T 
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
           + P  F  L D+ ++Q + +  Y R   A   + +  C+    + +G            +
Sbjct: 345 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
            P L++ F GGA + LP  N F        +C+T   +       S ILGN   +++   
Sbjct: 402 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457

Query: 450 YDLRNQRLGFKQQLC 464
           +D++ ++ GFK   C
Sbjct: 458 FDIQGKQFGFKYAAC 472


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 126/487 (25%), Positives = 195/487 (40%), Gaps = 85/487 (17%)

Query: 11  SFIFFF----TLLSIFP---SSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIK 63
           +  FFF    +LL+  P    S T  +F++   H +     + N      SS+TR+  I+
Sbjct: 3   ALAFFFAASCSLLATLPFTEPSKTPSSFTIDLIHHDSPPSPFYN------SSMTRSQLIR 56

Query: 64  NPQTKTTTTTT--------------TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTG 109
           N   ++ +                  ++   I   + G Y + +  GTP      I DTG
Sbjct: 57  NAAMRSISRANQLSLSLSHSLNQLKESSPEPIIIPNNGNYLMRIYIGTPSVERLAIADTG 116

Query: 110 SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEP 169
           S L W  C+     K C +   P + P  SS+  LL C +  C+ + +    C D  D  
Sbjct: 117 SDLTWVQCSPCDNTK-CFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGDCI 175

Query: 170 LATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC------SVLSSRQP 223
            A +             YG   ++ I L   + L          GC      +   S + 
Sbjct: 176 YAYTYGDNSYS------YGGLSSDSIRL---MLLQLHYNSKICFGCGFQNKFTADKSGKT 226

Query: 224 AGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
            GI G G G  SL SQL  +   KFSYCLL    +  ++     L  G + +  +  G+ 
Sbjct: 227 TGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSK-----LKFGEA-AIVQGNGVV 280

Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
            TP +  P +        +YY+ L  ITVG + V+      T   DGN   I+DSG+T T
Sbjct: 281 STPLIIKPDLP-------FYYLNLEGITVGAKTVK------TGQTDGN--IIIDSGSTLT 325

Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD---VPGEKTGSFPELKLHF 397
           ++     E   +EFVS +VK       +  E    +   FD      E   + P++  HF
Sbjct: 326 YLE----ESFYNEFVS-LVK-----ETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHF 375

Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
            GG  V  P+     V+ E + +C TVV           I GN    +++V YD++  ++
Sbjct: 376 TGGDVVLKPMNT--LVLIEDNLICSTVVPSHFDGIA---IFGNLGQIDFHVGYDIQGGKV 430

Query: 458 GFKQQLC 464
            F    C
Sbjct: 431 SFAPTDC 437


>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
 gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
 gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
 gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
 gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
 gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
 gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
 gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
 gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
 gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
 gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
 gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
 gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
 gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
 gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
 gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
 gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
 gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
 gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
 gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
 gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
 gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
 gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
 gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
 gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
 gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
 gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
 gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
 gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
 gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
 gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
 gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
 gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
 gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
 gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
 gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
 gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
 gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
 gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
 gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
 gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
 gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
 gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
 gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
 gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
 gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
 gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
 gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
 gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
 gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
 gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
 gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
 gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
 gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 112/435 (25%), Positives = 183/435 (42%), Gaps = 79/435 (18%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
           + N Q +  T++++T    I   S   +   +++S G PP +    +DTGS L W    P
Sbjct: 85  LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
           C  H  C   S+   P F P  S +SR + C + KC  + ++  +Q  +C    +    +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 198

Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
           CT     Y V YG+G   + G  +++TL + +  + + + GCS  V  S   AGI GFG 
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252

Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
                       P  L+   FSYCL +    D T+   +IL       D+      YTP 
Sbjct: 253 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 304

Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
              +N P+          Y + +  +   GQR+             +   IVDSG   T 
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
           + P  F  L D+ ++Q + +  Y R   A   + +  C+    + +G            +
Sbjct: 345 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
            P L++ F GGA + LP  N F        +C+T   +       S ILGN   +++   
Sbjct: 402 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457

Query: 450 YDLRNQRLGFKQQLC 464
           +D++ ++ GFK   C
Sbjct: 458 FDIQGKQFGFKYAAC 472


>gi|356576537|ref|XP_003556387.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 438

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 98/404 (24%), Positives = 174/404 (43%), Gaps = 77/404 (19%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW-- 154
           TP   +   +D G   +W  C   Y                +SS+SR   C + +CS   
Sbjct: 54  TPLVAVKLTVDLGGGYLWVNCEKGY----------------VSSTSRPARCGSAQCSLFG 97

Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVG 214
           ++  S + + C   P  T    +     +  +     T+G   ++ +++P  +   F+ G
Sbjct: 98  LYGCSTEDKICGRSPSNTVTGVSTYGDIHADVVAVNSTDGNNPTKVVSVPKFL---FICG 154

Query: 215 CSVLSSRQPAGI---AGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTTRTSSLIL- 265
            +V+     +G+   AG GR K SLPSQ         KF+ CL S     +T T+ ++  
Sbjct: 155 SNVVQKGLASGVTGMAGLGRTKVSLPSQFASAFSFHRKFAICLSS-----STMTNGVMFF 209

Query: 266 -----DNGSSHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRRITVGGQRVRV 316
                + G  +SD     LT+TP ++NP     + F    SV Y++G++ I V  + V +
Sbjct: 210 GDGPYNFGYLNSDLSKV-LTFTPLISNPVSTAPSYFQGEPSVEYFIGVKSIKVSDKNVAL 268

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
               L++DR+G GGT + +   +T M   +++ +++ FV +          +GA  +  +
Sbjct: 269 NTTLLSIDRNGIGGTKISTVNPYTVMETTIYKAVSEVFVKE----------VGAPTVAPV 318

Query: 377 RP---CF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
            P   CF   D+   + G + P + L  +     T+   N    V +   +CL  V    
Sbjct: 319 APFGTCFATKDIGSTRMGPAVPGIDLVLQNDVVWTIIGANSMVYVND--VICLGFVDAGS 376

Query: 430 A---------SGGP----SIILGNFQMQNYYVEYDLRNQRLGFK 460
           +         +GG     SI +G  Q++N  +++DL   RLGF+
Sbjct: 377 SPSVAQVGFVAGGSHPRTSITIGAHQLENNLLQFDLATSRLGFR 420


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 97/397 (24%), Positives = 162/397 (40%), Gaps = 54/397 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   L  GTPP+     +DTGS ++W  C +   C   S  +I    F P  S ++  
Sbjct: 79  GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
           + C + +CSW    S       D   +   N   +C +Y   YG G  T G  +S+ L  
Sbjct: 139 ISCSDQRCSWGIQSS-------DSGCSVQNN---LC-AYTFQYGDGSGTSGFYVSDVLQF 187

Query: 203 ---LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
              + + ++PN     + GCS       V S R   GI GFG+   S+ SQL     +  
Sbjct: 188 DMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPR 247

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           + SH          +++       +     + +TP V  PS         +Y V L  I+
Sbjct: 248 VFSHCLKGENGGGGILV-----LGEIVEPNMVFTPLV--PS-------QPHYNVNLLSIS 293

Query: 309 VGGQRVRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           V GQ + +     +     NG GTI+D+GTT  +++   + P  +   + + ++     +
Sbjct: 294 VNGQALPINPSVFSTS---NGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVS 350

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
            G +       C+ +       FP + L+F GGA + L  ++Y           +  +  
Sbjct: 351 KGNQ-------CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGF 403

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +        ILG+  +++    YDL  QR+G+    C
Sbjct: 404 QRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440


>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
 gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
          Length = 518

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 114/414 (27%), Positives = 173/414 (41%), Gaps = 82/414 (19%)

Query: 77  TTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK------YCSSSK 130
           +T  ISS  +  Y+ ++S GTP +     LDTGS L W PC +  +C       Y S  +
Sbjct: 92  STFRISSLGFLHYT-TVSLGTPGKKFLVALDTGSDLFWVPC-DCSRCAPTEGTTYASDFE 149

Query: 131 IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG 190
           +  + PK SS+SR + C N  C+   H + +C       L T  NC      Y+V Y S 
Sbjct: 150 LSIYNPKGSSTSRKVTCDNSLCA---HRN-RC-------LGTFSNC-----PYMVSYVSA 193

Query: 191 L--TEGIALSETLNL---PNR---IIPNFLVGC------SVLSSRQPAGIAGFGRGKTSL 236
              T GI + + L+L    NR   +      GC      S L    P G+ G G  K S+
Sbjct: 194 ETSTSGILVEDVLHLTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISV 253

Query: 237 PSQLNLDKFSYCLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
           PS L+ + F+    S  F  D   R          S  DK +     TPF         N
Sbjct: 254 PSILSKEGFTADSFSMCFGPDGIGRI---------SFGDKGSPDQEETPF-------NLN 297

Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
           A    Y + + ++ VG   +           D +   + DSGT+FT++   ++  +   F
Sbjct: 298 ALHPTYNITVTQVRVGTTLI-----------DLDFTALFDSGTSFTYLVDPIYTNVLKSF 346

Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
            SQ   +R        ++      C+D+ PGE T   P + L  KGG++   PV +   +
Sbjct: 347 HSQAQDSRR-----PPDSRIPFEFCYDMSPGENTSLIPSMSLTMKGGSQ--FPVYDPIII 399

Query: 414 VGEGSAV--CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           +   S +  C+ VV   E +     I+G   M  Y + +D     LG+K+  C 
Sbjct: 400 ISSQSELIYCMAVVRSAELN-----IIGQNFMTGYRIIFDREKLVLGWKEFECD 448


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 92/345 (26%), Positives = 148/345 (42%), Gaps = 56/345 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  GTPP      +DTGS ++W  C +   C   S  +I    F P  SS+S +
Sbjct: 23  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 82

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
           + C + +C    +  IQ  D      AT  +    C SY   YG G  T G  +S+ ++L
Sbjct: 83  IACSDQRC----NNGIQSSD------ATCSSQNNQC-SYTFQYGDGSGTSGYYVSDMMHL 131

Query: 204 ---------PNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
                     N   P  + GCS         S R   GI GFG+ + S+ SQL+    + 
Sbjct: 132 NTIFEGSVTTNSTAP-VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAP 190

Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
            + SH    D++    L+L       +     + YT  V  P+         +Y + L+ 
Sbjct: 191 RVFSHCLKGDSSGGGILVL------GEIVEPNIVYTSLV--PA-------QPHYNLNLQS 235

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I V GQ +++           + GTIVDSGTT  ++A E ++P      + + ++ +   
Sbjct: 236 IAVNGQTLQIDSSVFATSN--SRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAV 293

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
           + G +       C+ +    T  FP++ L+F GGA + L  ++Y 
Sbjct: 294 SRGNQ-------CYLITSSVTEVFPQVSLNFAGGASMILRPQDYL 331


>gi|449432735|ref|XP_004134154.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
 gi|449527085|ref|XP_004170543.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
          Length = 435

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 106/403 (26%), Positives = 165/403 (40%), Gaps = 71/403 (17%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
           TP   +   +D G   +W  C   Y                +SSS +   C++ +CS + 
Sbjct: 52  TPLVPVKLTVDLGGQFMWVDCDRGY----------------VSSSYKPARCRSAQCS-LA 94

Query: 157 HESIQCRDCNDEPLATSKNCT-QICPSYLVLYGSGLTEGIALSETLNLPNRI-------I 208
            +S  C  C   P     N T  + P   ++  S   E  +   +++  N         I
Sbjct: 95  SKSSACGQCFSPPRPGCNNNTCSLFPGNTIIRLSTSGEVASDVVSVSSTNGFNPTRAVSI 154

Query: 209 PNFLVGCS---VLSSRQPA--GIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTT 258
           PNFL  C    +L    P   G+AGFGR   SLPSQ         KF+ CL       +T
Sbjct: 155 PNFLFVCGSTFLLEGLAPGVTGMAGFGRNGISLPSQFAAAFSFNRKFAVCL-----SGST 209

Query: 259 RTSSLILD-NGSSH---SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVG 310
            +  +I   NG  H   +   T   TYTP   NP     V+     S  Y++G+  I V 
Sbjct: 210 SSPGVIFSGNGPYHFLPNIDLTNSFTYTPLFINPVSTAGVSSAGEKSTEYFIGVTSIVVN 269

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
            + V +    L +D +GNGGT + +   FT +   +++ L   F +++ K       +GA
Sbjct: 270 SKPVPLNTTLLKIDSNGNGGTKISTVNPFTVLESSIYKALVKAFTTEVSK----VPRVGA 325

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN---YFAVVGEGSAV------- 420
            A      C+      + SFP  +L   G   + L ++N    +++ G  S V       
Sbjct: 326 VA--PFEVCYS-----SKSFPSTRLG-AGVPTIDLVLQNKKVIWSMFGANSMVQVNDEVL 377

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
           CL  V D       +I++G  Q+++  +E+DL   RLGF   L
Sbjct: 378 CLGFV-DGGVDVRTAIVIGAHQIEDKLLEFDLATSRLGFTPTL 419


>gi|255552253|ref|XP_002517171.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543806|gb|EEF45334.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 437

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 105/406 (25%), Positives = 171/406 (42%), Gaps = 75/406 (18%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
           TP   +   +D G  L+W  C   Y                +SSS R L C +  CS  +
Sbjct: 53  TPLVPVKLTVDLGGSLMWINCEEGY----------------VSSSYRPLSCDSALCSLSN 96

Query: 157 HESIQCRDCNDEPLATSKN--CTQICPSYLVLYGSG--LTEGIALSETLNLPN--RII-- 208
            +S   ++C   P     N  C Q   + +V  G+G  L + +   ++ +  N  RI+  
Sbjct: 97  SQSCN-KECYSSPKPGCYNNTCGQSSNNRVVYIGTGGDLGQDVVALQSFDGKNLGRIVSV 155

Query: 209 PNFLVGCSVLS-----SRQPAGIAGFGRGKTSLP----SQLNLDK-FSYCLLSHKFDDTT 258
           PNF   C +       +    G+AG GR   SLP    S +   K FS CL S     +T
Sbjct: 156 PNFPFVCGITWLLDDLADGVTGMAGLGRSNISLPAYFSSAIGFSKTFSICLSS-----ST 210

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNP-------SVAERNAFSVYYYVGLRRITVGG 311
           +++ +I+  G   S   +  L Y   + NP       S+ E +A    YY+G++ I V G
Sbjct: 211 KSNGVIV-FGDGPSSIVSNDLIYIRLILNPVGTPGYSSLGESSA---DYYIGVKSIRVDG 266

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT-----R 366
           + V+     L++D+DGNGGT++ +   +T +   +++ L   F+ ++V   +        
Sbjct: 267 KEVKFDKTLLSIDKDGNGGTMLSTVNPYTVLHTSIYKALLKAFIKKLVFRFSLVVPSVPV 326

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFP--ELKLHFKGGAEVTLPVENYFAVVGEGSAV---- 420
             GA   +     F    E     P   L+L  + G  V      Y+ ++G  S V    
Sbjct: 327 PFGACVFSN---GFRTTEEFLSYVPIINLELESEQGNSV------YWRILGANSMVAVNS 377

Query: 421 ---CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
              CL  +        P II+G  Q+++  + +DL + RLGF   L
Sbjct: 378 YTMCLAFIDGGSQPRTP-IIIGGHQLEDNLLHFDLASSRLGFSSSL 422


>gi|361066667|gb|AEW07645.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134456|gb|AFG48207.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134472|gb|AFG48215.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134476|gb|AFG48217.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134478|gb|AFG48218.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134480|gb|AFG48219.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134482|gb|AFG48220.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
 gi|383134484|gb|AFG48221.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
          Length = 136

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/113 (40%), Positives = 69/113 (61%), Gaps = 8/113 (7%)

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           IT+GGQR+++     T D++GNGG IVDSGTTFT +   L+  + ++  S +     Y+R
Sbjct: 1   ITIGGQRLKLPSSLTTFDKEGNGGLIVDSGTTFTMLPESLYRQVLNKLKSAI----RYSR 56

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPEL---KLHFKGGAEVTLPVENYFAVVGE 416
           ++  EA  GL  C+++P    GSFP L    LHFK    +TLP ENY +++ +
Sbjct: 57  SVKYEAALGLDLCYELP-SAGGSFPVLPTFSLHFKDNVTITLPAENYMSMMSD 108


>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 521

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 106/403 (26%), Positives = 157/403 (38%), Gaps = 100/403 (24%)

Query: 90  SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS----FIPKLSSSSRLL 145
           ++SL+ G+PPQ +  +LDTGS L W  C            K+P+    F P +SSS    
Sbjct: 37  TVSLTVGSPPQRVTMVLDTGSELSWLHC-----------KKLPNLNFIFNPLVSSSYTPT 85

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
            C +P C+       Q RD  + P++   N  ++C       G     G+          
Sbjct: 86  PCTSPICT------TQTRDLIN-PVSCDAN--KLCHIITFFVGGPAQRGMVF-------- 128

Query: 206 RIIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
                   GC      S     +  G+ G   G  S  +Q+ L KFSYC+      +   
Sbjct: 129 --------GCMDTGTSSGDEDSKTTGLMGMDLGSLSFSNQMRLPKFSYCI-----SNKDS 175

Query: 260 TSSLILDN-------GSSHSD---KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           T  L+L+N       G  H     KKTT L Y  F  N  + +++AF             
Sbjct: 176 TGVLVLENIANPPRLGPLHYTPLVKKTTPLPY--FNRNCCLFQKSAF------------- 220

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
                         D  G G T+VDS T FTF+   ++  L +EF    ++ +N    LG
Sbjct: 221 ------------LPDHTGAGQTMVDSATQFTFLRQPVYTALKNEFA---IQTKNILTPLG 265

Query: 370 AEALT---GLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYF----AVVGEGSAVC 421
                    +  CF VP G      P + L F  GAE+ +  E        V    S + 
Sbjct: 266 DPKFVFQGVMDLCFRVPIGSTLPVLPVVTLMFD-GAELRVTGERLLYKVSNVAKSNSWIY 324

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                + +  G  + I+G+   +N ++EYDL N R+GF    C
Sbjct: 325 CFTFGNSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNC 367


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 155/392 (39%), Gaps = 74/392 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSR 143
           Y +++S GTP       +DTGS + W       QCK C      S + P F P  SSS  
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWV------QCKPCPSPPCYSQRDPLFDPTRSSSYS 184

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
            + C    CS +   S  C         +   C      Y+V YG G  T G+  S+TL 
Sbjct: 185 AVPCAAASCSQLALYSNGC---------SGGQC-----GYVVSYGDGSTTTGVYSSDTLT 230

Query: 203 LP-NRIIPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
           L  +  +  FL GC        AG+    G GR   SL SQ +      FSYCL      
Sbjct: 231 LTGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCL------ 284

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGGQ 312
             T+ S   +  G   S   T G + TP +   N+P+         YY V L  I+VGGQ
Sbjct: 285 PPTQNSVGYISLGGPSS---TAGFSTTPLLTASNDPT---------YYIVMLAGISVGGQ 332

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + +             G +VD+GT  T + P  +  L   F + M     Y     A A
Sbjct: 333 PLSIDASVFA------SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAP-YGYPS---APA 382

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
              L  C+D     T + P + + F GGA + L       ++  G         D +AS 
Sbjct: 383 TGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSG---ILTSGCLAFAPTGGDSQAS- 438

Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               ILGN Q +++ V +D     +GF    C
Sbjct: 439 ----ILGNVQQRSFEVRFD--GSTVGFMPASC 464


>gi|24796804|gb|AAN64480.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 161

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 77/136 (56%), Gaps = 4/136 (2%)

Query: 185 VLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS-RQPAGIAGFGRGKTSLPSQLNLD 243
           V+Y SG T  + +S+TL  P R I NF+VGCS++S  +Q +G+ GF  G  S+PSQL L 
Sbjct: 10  VVYSSGSTTRLLISDTLRTPGRTIRNFVVGCSLMSVYQQSSGLTGFSCGVPSVPSQLGLT 69

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
           KF Y LL+ +FDD    S  ++  G+   D     + Y P   + S   R   SVYYY+ 
Sbjct: 70  KFFYFLLARRFDDNATASDELILGGAGGKDDNVR-MQYIPLARSAST--RPLCSVYYYLA 126

Query: 304 LRRITVGGQRVRVWHK 319
           L  ITV  + V++  +
Sbjct: 127 LIAITVRRKSVQLPKR 142


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 105/396 (26%), Positives = 162/396 (40%), Gaps = 57/396 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLL 145
           G + + +S GTPP      +DTGS L W  C      C   +      F P  S++  L+
Sbjct: 73  GKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDPDKSTTYELV 132

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG----LTEGIALSETL 201
           GC +  C+ +    +    C +E        T  C  Y + YGSG     + G   ++ L
Sbjct: 133 GCSSRDCADVQRSLVAPFGCIEE--------TDTC-LYSLRYGSGPSGQYSAGRLGTDKL 183

Query: 202 NLP--NRIIPNFLVGCSVLSSRQ--PAGIAGFGRGKTSLPSQL----NLDKFSYCLLSHK 253
            L   + II  F+ GCS   S +   +G+ GFG    S  +Q+    N   FSYC     
Sbjct: 184 TLASSSSIIDGFIFGCSGDDSFKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCFPGD- 242

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
                 T+   L  G+   D+    L YT  +  P   +R+ +S+        + V G R
Sbjct: 243 -----HTAEGFLSIGAYPKDE----LVYTNLI--PHFGDRSVYSLQQI----DMMVDGNR 287

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           ++V     T         +VDSGT  TF+   +F+  +    S M      +  +G E  
Sbjct: 288 LQVDQSEYTKRM-----MVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTET- 341

Query: 374 TGLRPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYF-AVVGEGSAVCLTVVTDRE 429
                CF   G     +G  P +++ F  G  + LP EN F  ++     +CL    D  
Sbjct: 342 -----CFRPNGGDSVDSGDLPTVEMRFI-GTTLKLPPENVFHDLLPSHDKICLAFKPD-- 393

Query: 430 ASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +G  ++ ILGN    ++ V YDL+    GF+   C
Sbjct: 394 VAGVRNVQILGNKATXSFRVVYDLQAMYFGFQAGAC 429


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 112/445 (25%), Positives = 184/445 (41%), Gaps = 68/445 (15%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           NP+    +    +V +S TR  ++   Q K            + S     + ++ S G P
Sbjct: 50  NPNASVAERAERIVKTSATRIAYLY-AQIKGDIHMNDFELNLLPSTYEPLFLVNFSMGQP 108

Query: 99  --PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
             PQ+   I+DTGS+++W  C     CK C+    P   P  SS+   L C N  C +  
Sbjct: 109 ATPQLA--IMDTGSNILWVRCA---PCKRCTQQNGPLLDPSKSSTYASLPCTNTMCHY-- 161

Query: 157 HESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSGLTE-GIALSETLNLPN-----RIIP 209
                         A S  C ++    Y + Y +GL+  G+  +E L   +       +P
Sbjct: 162 --------------APSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVP 207

Query: 210 NFLVGCS----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLIL 265
           + + GCS        R+  G+ G G+G TS  +++   KFSYCL  +  D     + L+ 
Sbjct: 208 SVVFGCSHENGDYKDRRFTGVFGLGKGITSFVTRMG-SKFSYCL-GNIADPHYGYNQLVF 265

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
              ++     T        VN            +YYV L  I+VG +R+ +     ++ +
Sbjct: 266 GEKANFEGYSTP----LKVVNG-----------HYYVTLEGISVGEKRLDIDSTAFSM-K 309

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
                 ++DSGT  T++A   F  L +E V Q++         G+ A        D+ G 
Sbjct: 310 GNEKSALIDSGTALTWLAESAFRALDNE-VRQLLDGVLMPFWRGSFACYKGTVSQDLIG- 367

Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG-GPSI----ILGN 440
               FP +  HF GGA++ L  E+ F        +C+ V   R+AS  G       ++G 
Sbjct: 368 ----FPVVTFHFSGGADLDLDTESMF-YQATPDILCIAV---RQASAYGNDFKSFSVIGL 419

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLCK 465
              Q Y + YDL + +L F++  C+
Sbjct: 420 MAQQYYNMAYDLNSNKLFFQRIDCQ 444


>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 116/416 (27%), Positives = 168/416 (40%), Gaps = 88/416 (21%)

Query: 77  TTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFI- 135
           +T  ISS  +  Y+ ++  GTP       LDTGS L W PC     C  C++S   +F  
Sbjct: 89  STFRISSLGFLHYT-TVQIGTPGVKFMVALDTGSDLFWVPC----DCTRCAASDSTAFAS 143

Query: 136 --------PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY 187
                   P  SS+S+ + C N  C+   H S QC       L T  NC      Y+V Y
Sbjct: 144 DFDLNVYNPNGSSTSKKVTCNNSLCT---HRS-QC-------LGTFSNC-----PYMVSY 187

Query: 188 GSGL--TEGIALSETLNLPNR------IIPNFLVGC------SVLSSRQPAGIAGFGRGK 233
            S    T GI + + L+L         +  N + GC      S L    P G+ G G  K
Sbjct: 188 VSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEK 247

Query: 234 TSLPSQLNLDKFSYCLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
            S+PS L+ + F+    S  F  D   R S    D GS   D+       TPF  NPS  
Sbjct: 248 ISVPSMLSREGFTADSFSMCFGRDGIGRIS--FGDKGSFDQDE-------TPFNLNPSHP 298

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
                   Y + + ++ VG   + V    L            DSGT+FT++    +  L 
Sbjct: 299 T-------YNITVTQVRVGTTVIDVEFTAL-----------FDSGTSFTYLVDPTYTRLT 340

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
           + F SQ+   R+ +     ++      C+D+ P   T   P + L   GG+     V + 
Sbjct: 341 ESFHSQVQDRRHRS-----DSRIPFEYCYDMSPDANTSLIPSVSLTMGGGSH--FAVYDP 393

Query: 411 FAVVGEGSAV--CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             ++   S +  CL VV   E +     I+G   M  Y V +D     LG+K+  C
Sbjct: 394 IIIISTQSELVYCLAVVKSAELN-----IIGQNFMTGYRVVFDREKLVLGWKKFDC 444


>gi|168065778|ref|XP_001784824.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663621|gb|EDQ50376.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 69/232 (29%), Positives = 110/232 (47%), Gaps = 29/232 (12%)

Query: 241 NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
           +LD F++CL+ +    TT TS+L+     S       GL YTP +   S +       +Y
Sbjct: 180 DLDVFAFCLVPYT-AATTLTSALVF---GSRDATNALGLVYTPLLQGTSPS-------FY 228

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-- 358
           +VG+  ++V G    +             G + DSGT  T+ APE+++PL       +  
Sbjct: 229 WVGMVGVSVAGVDAGIPTALFA----STDGVLFDSGTPLTYFAPEIYDPLHQSIAGAIPY 284

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF----KGGAEV--TLPVENYFA 412
               +   A+ A+ L   R CFD+ G ++   P +  HF      GA V   L +EN + 
Sbjct: 285 PVAPDPVDAVVAKPLN--RLCFDLAGVQSPVLPTMAYHFTDADAAGATVDFDLGLENIY- 341

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +    +  CL +V  R  SG PSI+ GN Q  N+Y+E+D+   R+G+  + C
Sbjct: 342 MNDMNTVWCLAIV--RGESGNPSIV-GNIQQANHYIEHDVALNRIGWTSKDC 390



 Score = 45.4 bits (106), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 40/77 (51%), Gaps = 9/77 (11%)

Query: 78  TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSF 134
           TT I++ S+  Y I L FGTP Q    ++DTGS LVW    PC N Y     ++   P F
Sbjct: 98  TTPITAESFE-YVIPLFFGTPLQPFTGMVDTGSDLVWIQCLPCINCY-----TTHPHPEF 151

Query: 135 IPKLSSSSRLLGCQNPK 151
            P  SSS   + C +P 
Sbjct: 152 DPTTSSSEAYVPCTDPA 168


>gi|291002742|gb|ADD71503.1| xyloglucanase inhibitor 1 [Humulus lupulus]
          Length = 443

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 106/408 (25%), Positives = 165/408 (40%), Gaps = 78/408 (19%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
           TPP  +  +LD G   +W  C   Y+                SS+ R + C +P+C  + 
Sbjct: 56  TPPVQLKVVLDVGGEFLWIDCEKGYK----------------SSTKRPVPCGSPQC--VL 97

Query: 157 HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR----IIPNFL 212
             S  C   ++             P   V     L E I   ++ N  N      +PN L
Sbjct: 98  SGSGACTTSDNPSDVGVCGVMPNNPFSSVGTSGDLFEDILYIQSTNGFNPGKQVSVPNLL 157

Query: 213 VGC---SVLSSRQPA--GIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTTRTSS 262
             C   S+L        G+AGFGR K +LPS  +       KF  CL S           
Sbjct: 158 FSCAPNSLLEGLASGIIGMAGFGRNKVALPSLFSSAFSFPRKFGVCLSSSNGVIFFGKEP 217

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
            +L  G   SD   T LTYTP + NP    S  E N  S  Y++G++ I V G+ +R+  
Sbjct: 218 YVLLPGIDVSDP--TSLTYTPLIQNPRSLVSSFEGNP-SAEYFIGVKSIKVDGKPLRLNT 274

Query: 319 KYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG-----AEA 372
             LT D +G +GGT + +   FT +   +++ +   FV          +ALG      +A
Sbjct: 275 TLLTFDNEGGHGGTKISTVDPFTTLETSIYKAVVGAFV----------KALGPKVPRVKA 324

Query: 373 LTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
           +     CF+   +   + G + P++ L  +     ++   N    VG+   +CL  V   
Sbjct: 325 VAPFGACFNAKYIGNTRVGPAVPQIDLVLRNDKLWSIFGANSMVSVGD-DVLCLGFV--- 380

Query: 429 EASGGP-------------SIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
              GGP             ++++G  Q++N ++ +DL   RLGF   L
Sbjct: 381 --DGGPLNFVDWGVKFTPTAVVIGGHQIENNFLLFDLGASRLGFSSSL 426


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 116/431 (26%), Positives = 171/431 (39%), Gaps = 73/431 (16%)

Query: 55  SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG-----YSISLSFGTPPQIIPFILDTG 109
           SL+  L     ++K   +  + +  +I +H  G      Y +++  GTP      ++DTG
Sbjct: 81  SLSERLRRSRARSKYIMSRASKSNVSIPTHLGGSVDSLEYVVTVGLGTPAVSQVLLIDTG 140

Query: 110 SHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
           S L W       QC  C+S+     K P F P  SS+   + C    C  +  +     D
Sbjct: 141 SDLSWV------QCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRDGYG-SD 193

Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVLS--- 219
           C      TS +       Y + YG G  T G+  +ETL + P   + +F  GC       
Sbjct: 194 C------TSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPGVTVKDFHFGCGHDQDGP 247

Query: 220 SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKT 276
           + +  G+ G G    SL  Q +      FSYCL +          +  L  G+  +D   
Sbjct: 248 NDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPA------ANDQAGFLALGAPVNDA-- 299

Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSG 336
           +G  +TP V      E+  F   Y V +  ITVGG+ + V     +      GG I+DSG
Sbjct: 300 SGFVFTPMVR-----EQQTF---YVVNMTGITVGGEPIDVPPSAFS------GGMIIDSG 345

Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLH 396
           T  T +    +  L   F   M     Y      E    L  C++  G    + P + L 
Sbjct: 346 TVVTELQHTAYAALQAAFRKAMAA---YPLLPNGE----LDTCYNFTGHSNVTVPRVALT 398

Query: 397 FKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS---IILGNFQMQNYYVEYDLR 453
           F GGA V L V +   +       CL     +EA  GP     ILGN   +   V YD+ 
Sbjct: 399 FSGGATVDLDVPDGILLDN-----CLAF---QEA--GPDNQPGILGNVNQRTLEVLYDVG 448

Query: 454 NQRLGFKQQLC 464
           + R+GF    C
Sbjct: 449 HGRVGFGADAC 459


>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
          Length = 435

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/396 (25%), Positives = 151/396 (38%), Gaps = 68/396 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTG---SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           Y +   +G P Q  P   DT    S L   PC     C        P+F P  SSS   +
Sbjct: 88  YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCD-------PAFEPSRSSSFAAI 140

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP- 204
            C +P+C+      ++C   +             CP  +      +  G  + +TL LP 
Sbjct: 141 PCGSPECA------VECTGAS-------------CPFTIQFGNVTVANGTLVRDTLTLPP 181

Query: 205 NRIIPNFLVGCSVLSSRQ-----PAGIAGFGRGKTSLPSQL-------NLDKFSYCLLSH 252
           +     F  GC  + +         G+    R   SL S++       +   FSYCL S 
Sbjct: 182 SATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSS 241

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
               +  +S   L  G+S  +     + Y P  +NP+          Y+V L  I+VGG+
Sbjct: 242 ----SATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS------YFVDLVGISVGGE 291

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + V             GT++++ T FTF+AP  +  L D F   M           A  
Sbjct: 292 DLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRKDMAPYP------AAPP 340

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGE--GSAVCLTVVTDR 428
              L  C+++ G  + + P + L F GG E+ L V    YFA       S  CL      
Sbjct: 341 FRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAP 400

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +   S+I G    ++  V YDLR  R+GF    C
Sbjct: 401 LPAFPVSVI-GTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
          Length = 435

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/396 (25%), Positives = 152/396 (38%), Gaps = 68/396 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTG---SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           Y +   +G P Q  P   DT    S L   PC     C        P+F P  SSS   +
Sbjct: 88  YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCD-------PAFEPSRSSSFAAI 140

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP- 204
            C +P+C+      ++C   +             CP  +      +  G  + +TL LP 
Sbjct: 141 PCGSPECA------VECTGAS-------------CPFTIQFGNVTVANGTLVRDTLTLPP 181

Query: 205 NRIIPNFLVGCSVLSSRQ-----PAGIAGFGRGKTSLPSQL-------NLDKFSYCLLSH 252
           +     F  GC  + +         G+    R   SL S++       +   FSYCL S 
Sbjct: 182 SATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSS 241

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
               +  +S   L  G+S  +     + Y P  +NP+          Y+V L  I+VGG+
Sbjct: 242 ----SATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS------YFVELVGISVGGE 291

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + V             GT++++ T FTF+AP  +  L D F       R+      A  
Sbjct: 292 DLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAF------RRDMAPYPAAPP 340

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGE--GSAVCLTVVTDR 428
              L  C+++ G  + + P + L F GG E+ L V    YFA       S  CL      
Sbjct: 341 FRVLDTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAP 400

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +   S+I G    ++  V YDLR  R+GF    C
Sbjct: 401 LPAFPVSVI-GTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|225432542|ref|XP_002277699.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
          Length = 435

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 98/398 (24%), Positives = 163/398 (40%), Gaps = 72/398 (18%)

Query: 102 IPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ 161
           IP  LD G   +W  C   Y                +SSS R + C + +CS    ++  
Sbjct: 58  IPLTLDLGGQFLWVDCDQGY----------------VSSSYRPVRCGSAQCSLTRSKA-- 99

Query: 162 CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-------PNRIIP--NFL 212
           C +C   P+      T +      + G+  T G    + +++       P R++     L
Sbjct: 100 CGECFSGPVKGCNYSTCVLSPDNTVTGTA-TSGEVGEDAVSIQSTDGSNPGRVVSVRRLL 158

Query: 213 VGCSV------LSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTS 261
             C        L+SR   G+AG GR + +LPSQ +       KFS CL S     T  T 
Sbjct: 159 FTCGSTFLLEGLASRV-KGMAGLGRSRVALPSQFSSAFSFNRKFSICLSS----STKSTG 213

Query: 262 SLILDNGSSHSDKKTTG---LTYTPFVNNPSVAERNAF-----SVYYYVGLRRITVGGQR 313
            +   +G      K      LTYTP + NP V+  +A+     SV Y++G++ I + G+ 
Sbjct: 214 VVFFGDGPYVLLPKVDASQSLTYTPLITNP-VSTASAYFQGEASVEYFIGVKSIKINGKA 272

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           V +    L++D  G GGT + +   +T +   +++ +   F+ ++      TR       
Sbjct: 273 VPLNATLLSIDSQGYGGTKISTVHPYTVLETSIYKAVTQAFLKEL---STITRVASVSPF 329

Query: 374 TGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-------CLTVV 425
                  D+   + G + P + L  +  +        Y+ V G  S V       CL  V
Sbjct: 330 GACFSSKDIGSTRVGPAVPPIDLVLQRQSV-------YWRVFGANSMVQVSDNVLCLGFV 382

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
            D   +   SI++G  Q+++  +++DL   RLGF   L
Sbjct: 383 -DGGVNPRTSIVIGGRQLEDNLLQFDLATSRLGFSSSL 419


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 109/395 (27%), Positives = 154/395 (38%), Gaps = 63/395 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
           Y ++L  GTP      ++DTGS L W       QCK C +      K P F P  SSS  
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSYA 224

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN 202
            + C +  C  +   +     C       S     +C  Y + YG+   T G+  +ETL 
Sbjct: 225 SVPCDSDACRKLAAGAYG-HGCT----GVSGGAAALC-EYGIEYGNRATTTGVYSTETLT 278

Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
           L P  ++ +F  GC         +  G+ G G    SL SQ +      FSYCL      
Sbjct: 279 LKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL------ 332

Query: 256 DTTRTSSLILDNGS---SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
             T   +  L  G+   S S    +GL++TP    PSV        +Y V L  I+VGG 
Sbjct: 333 PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSV------PTFYIVTLTGISVGGA 386

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + +     +       G ++DSGT  T +    +  L   F S M + R    + G   
Sbjct: 387 PLAIPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGV- 439

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV---TDRE 429
              L  C+D  G    + P + L F GGA + L       V G     CL      TD  
Sbjct: 440 ---LDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG-----CLAFAGAGTDNA 491

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                  I+GN   + + V YD     +GF+   C
Sbjct: 492 IG-----IIGNVNQRTFEVLYDSGKGTVGFRAGAC 521


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 96/398 (24%), Positives = 160/398 (40%), Gaps = 56/398 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+PP+     +DTGS ++W  C +   C   S  +I    F P  S ++  
Sbjct: 79  GLYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATP 138

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
           + C + +CSW    S       D   +   N   +C +Y   YG G  T G  +S+ L  
Sbjct: 139 VSCSDQRCSWGIQSS-------DSGCSVQNN---LC-AYTFQYGDGSGTSGFYVSDVLQF 187

Query: 203 --------LPNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
                   +PN   P  + GCS       V S R   GI GFG+   S+ SQL     + 
Sbjct: 188 DMIVGSSLVPNSTAP-VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAP 246

Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
            + SH          +++       +     + +TP V  PS         +Y V L  I
Sbjct: 247 RVFSHCLKGENGGGGILV-----LGEIVEPNMVFTPLV--PS-------QPHYNVNLLSI 292

Query: 308 TVGGQRVRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           +V GQ + +     +     NG GTI+D+GTT  +++   + P  +   + + ++     
Sbjct: 293 SVNGQALPINPSVFSTS---NGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVV 349

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
           + G +       C+ +       FP + L+F GGA + L  ++Y           +  + 
Sbjct: 350 SKGNQ-------CYVIATSVADIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIG 402

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +        ILG+  +++    YDL  QR+G+    C
Sbjct: 403 FQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 103/388 (26%), Positives = 154/388 (39%), Gaps = 72/388 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++  G+P      ++DTGS + W  C         S+  +  F P  S++     C 
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRCN--------STDGLTLFDPSKSTTYAPFSCS 180

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
           +  C+ + +    C         ++  C      Y V YG G  T G   S+TL L  + 
Sbjct: 181 SAACAQLGNNGDGC---------SNSGC-----QYRVQYGDGSNTTGTYSSDTLALSASD 226

Query: 207 IIPNFLVGCS----VLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
            + +F  GCS         +  G+ G G    SL SQ        FSYCL       T R
Sbjct: 227 TVTDFHFGCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCL-----PPTNR 281

Query: 260 TSSLI---LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
           TS  +     NG+S       G   TP +  P           Y V L+ I+VGG  + +
Sbjct: 282 TSGFLTFGAPNGTSG------GFVTTPMLRWPKAP------TLYGVLLQDISVGGTPLGI 329

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
               L+       G+++DSGT  T++    +  L+  F S M + R+      A  L  L
Sbjct: 330 QPSVLS------NGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQR----AAPLGIL 379

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C+D  G    S P + L   GGA V L          +G+ + +       A+ G SI
Sbjct: 380 DTCYDFTGLVNVSIPAVSLVLDGGAVVDL----------DGNGIMIQDCLAFAATSGDSI 429

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I GN Q + + V +D+     GF+   C
Sbjct: 430 I-GNVQQRTFEVLHDVGQGVFGFRSGAC 456


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 96/400 (24%), Positives = 159/400 (39%), Gaps = 57/400 (14%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  GTPP+     +DTGS ++W  C     C   S   +    F P+ SS++  
Sbjct: 39  GLYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASP 98

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
           L C + KC       +     ++    T + C      Y   YG G  T G  +S+  + 
Sbjct: 99  LSCIDSKC-------VSSNQISESVCTTDRYC-----GYSFEYGDGSGTLGYYVSDEFDY 146

Query: 203 -------LPNRIIPNFLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
                  + N        GCS   S       R   GI GFG+   S+ SQLN    +  
Sbjct: 147 NQYVNQYVTNNASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPK 206

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           + SH  +       +++       +    G+ YTP V  PS         +Y + L+ I 
Sbjct: 207 IFSHCLEGADPGGGILV-----LGEITEPGMVYTPIV--PS-------QPHYNLNLQGIA 252

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           V GQ++ +  +          GTI+D GTT  ++A E +EP  +  ++ + ++       
Sbjct: 253 VNGQQLSIDPQVFATTN--TRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLK 310

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV---V 425
           G        PCF         FP + L+F+G      P +     +   S+    +    
Sbjct: 311 G-------NPCFLTVHSIDEIFPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQK 363

Query: 426 TDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + ++A+    + ILG+  +++    YDL NQR+G+    C
Sbjct: 364 SGQQATDSSKMTILGDLVLKDKVFVYDLENQRIGWTSFDC 403


>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 523

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/396 (25%), Positives = 151/396 (38%), Gaps = 68/396 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTG---SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           Y +   +G P Q  P   DT    S L   PC     C        P+F P  SSS   +
Sbjct: 176 YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCD-------PAFEPSRSSSFAAI 228

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP- 204
            C +P+C+      ++C   +             CP  +      +  G  + +TL LP 
Sbjct: 229 PCGSPECA------VECTGAS-------------CPFTIQFGNVTVANGTLVRDTLTLPP 269

Query: 205 NRIIPNFLVGCSVLSSRQ-----PAGIAGFGRGKTSLPSQL-------NLDKFSYCLLSH 252
           +     F  GC  + +         G+    R   SL S++       +   FSYCL S 
Sbjct: 270 SATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSS 329

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
               +  +S   L  G+S  +     + Y P  +NP+          Y+V L  I+VGG+
Sbjct: 330 ----SATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS------YFVDLVGISVGGE 379

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + V             GT++++ T FTF+AP  +  L D F   M           A  
Sbjct: 380 DLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRKDMAPYP------AAPP 428

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGE--GSAVCLTVVTDR 428
              L  C+++ G  + + P + L F GG E+ L V    YFA       S  CL      
Sbjct: 429 FRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAP 488

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +   S+I G    ++  V YDLR  R+GF    C
Sbjct: 489 LPAFPVSVI-GTLAQRSTEVVYDLRGGRVGFIPGRC 523


>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
 gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
 gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
 gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
 gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
 gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
 gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
 gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
 gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
 gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
 gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
 gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
 gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
 gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
 gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
 gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
 gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
 gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
 gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
 gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
 gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
          Length = 472

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 111/435 (25%), Positives = 183/435 (42%), Gaps = 79/435 (18%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
           + N Q +  T++++T    I   S   +   +++S G PP +    +DTGS L W    P
Sbjct: 85  LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
           C  H  C   S+   P F P  S +SR + C + KC  + ++  +Q  +C    +    +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 198

Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
           CT     Y V YG+G   + G  +++TL + +  + + + GCS  V  S   AGI GFG 
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252

Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
                       P  L+    SYCL +    D T+   +IL       D+      YTP 
Sbjct: 253 SSFSFFEQLAGYPDILSYKALSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 304

Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
              +N P+          Y + +  +   GQR+             +   IVDSG   T 
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
           + P  F  L D+ ++Q + +  Y R   A   + +  C+    + +G            +
Sbjct: 345 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
            P L++ F GGA + LP  N F        +C+T   +       S ILGN   +++   
Sbjct: 402 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457

Query: 450 YDLRNQRLGFKQQLC 464
           +D++ ++ GFK  +C
Sbjct: 458 FDIQGKQFGFKYAVC 472


>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
 gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
 gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
          Length = 474

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 111/435 (25%), Positives = 183/435 (42%), Gaps = 79/435 (18%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
           + N Q +  T++++T    I   S   +   +++S G PP +    +DTGS L W    P
Sbjct: 87  LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 146

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
           C  H  C   S+   P F P  S +SR + C + KC  + ++  +Q  +C    +    +
Sbjct: 147 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 200

Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
           CT     Y V YG+G   + G  +++TL + +  + + + GCS  V  S   AGI GFG 
Sbjct: 201 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 254

Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
                       P  L+    SYCL +    D T+   +IL       D+      YTP 
Sbjct: 255 SSFSFFEQLAGYPDILSYKALSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 306

Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
              +N P+          Y + +  +   GQR+             +   IVDSG   T 
Sbjct: 307 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 346

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
           + P  F  L D+ ++Q + +  Y R   A   + +  C+    + +G            +
Sbjct: 347 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 403

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
            P L++ F GGA + LP  N F        +C+T   +       S ILGN   +++   
Sbjct: 404 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 459

Query: 450 YDLRNQRLGFKQQLC 464
           +D++ ++ GFK  +C
Sbjct: 460 FDIQGKQFGFKYAVC 474


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 103/407 (25%), Positives = 166/407 (40%), Gaps = 72/407 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
           G Y   +  GTPP      +DTGS + W    PCT+        S K+ ++ P  SS+  
Sbjct: 35  GLYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDG 94

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETL- 201
            L C++  C             N+    ++  C     +Y   YG G  T+G  + + + 
Sbjct: 95  ALSCRDSNCG-------AALGSNEVSCTSAGYC-----AYSTTYGDGSSTQGYFIQDVMT 142

Query: 202 ------NLPNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNL-----D 243
                 N       +   GC        ++SSR   G+ GFG+   S+PSQL       +
Sbjct: 143 FQEIHNNTQVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGN 202

Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
           +F++CL      D     ++++ + S  +      ++YTP V+      RN    +Y VG
Sbjct: 203 RFAHCLQG----DNQGGGTIVIGSVSEPN------ISYTPIVS------RN----HYAVG 242

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           ++ I V G+ V     + T      GG I+DSGTT  +    L +P   +FV       N
Sbjct: 243 MQNIAVNGRNVTTPASFDTTSTSA-GGVIMDSGTTLAY----LVDPAYTQFV-------N 290

Query: 364 YTRALGAEALTGLRPCFDVPG-EKTGSFPELKLHFKGGAEVTLPVENYF---AVVGEGSA 419
                 +   +    C  +        FP +KL F  GA + L   NY     +    +A
Sbjct: 291 AVSTFESSMFSSHSQCLQLAWCSLQADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAA 350

Query: 420 VCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            C+        +G  S  ILG+  ++++ V YD  N+ +G+K   CK
Sbjct: 351 YCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYDNDNRVVGWKSFDCK 397


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 103/402 (25%), Positives = 165/402 (41%), Gaps = 64/402 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+PP      +DTGS ++W  C++   C + S   I    F    S ++  
Sbjct: 98  GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGS 157

Query: 145 LGCQNPKCSWIHH-ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
           + C +P CS +    + QC + N         C      Y   YG G  T G  +++T  
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENN--------QC-----GYSFRYGDGSGTSGYYMTDTFY 204

Query: 203 ----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
               L   ++ N     + GCS         S +   GI GFG+GK S+ SQL+    + 
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264

Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
            + SH    D +     +L       +    G+ Y+P V  PS         +Y + L  
Sbjct: 265 PVFSHCLKGDGSGGGVFVL------GEILVPGMVYSPLV--PS-------QPHYNLNLLS 309

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I V GQ + +       +     GTIVD+GTT T++  E +    D F++ +    N   
Sbjct: 310 IGVNGQMLPL--DAAVFEASNTRGTIVDTGTTLTYLVKEAY----DLFLNAI---SNSVS 360

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY---FAVVGEGSAVCLT 423
            L    ++    C+ V    +  FP + L+F GGA + L  ++Y   + +    S  C+ 
Sbjct: 361 QLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG 420

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                E       ILG+  +++    YDL  QR+G+    CK
Sbjct: 421 FQKAPEE----QTILGDLVLKDKVFVYDLARQRIGWASYDCK 458


>gi|147801500|emb|CAN61502.1| hypothetical protein VITISV_011733 [Vitis vinifera]
          Length = 415

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 98/396 (24%), Positives = 163/396 (41%), Gaps = 60/396 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  S++  TP   +  ++D G   +W  C  +Y     SSS  P  +          GC 
Sbjct: 44  YVTSINQRTPLVPLQLVVDLGGQFLWVDCEQNY----VSSSYRPGAVQP--------GCN 91

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           N  CS +   ++  R  + + LA      Q         GS     +++S+         
Sbjct: 92  NNTCSVLPDNTV-TRTASSDELAEDAVSVQSTD------GSNPGRSVSVSK--------- 135

Query: 209 PNFLVGCSVLS-----SRQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTT 258
             FL  C+  S     +    G+AG GR + +LPSQ         KF+ CL S     TT
Sbjct: 136 --FLFSCAPTSLLEGLASGAKGMAGLGRTRIALPSQFASAFSFHRKFAICLSS----STT 189

Query: 259 RTSSLILDNGSSH---SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGG 311
               ++L +GS     +   +  L YTP + NP    S   +   S  Y++G++ I +  
Sbjct: 190 ADGVILLGDGSYGLLPNVDASQLLIYTPLILNPVSTASAHSQGEPSAEYFIGVKSIQINE 249

Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
           + V +    L+++  G GGT + +   +T M   ++      F+S    + N TR     
Sbjct: 250 KAVPLNTSLLSINSKGVGGTKISTVNPYTVMETSIYSAFTKAFISA-AASMNITR---VA 305

Query: 372 ALTGLRPCF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
           A+     CF   +V   + G + P + L  +  + V         V   G  +CL  V D
Sbjct: 306 AVAPFSVCFSSKNVYSTRGGAAVPTIGLVLQNNSVVWRIFGANSMVFVNGDVLCLGFV-D 364

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
             A+   SI++G +Q+++  +++DL   RLGF   L
Sbjct: 365 GGANPRTSIVIGGYQLEDNLLQFDLAASRLGFSSSL 400


>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 435

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 99/392 (25%), Positives = 167/392 (42%), Gaps = 64/392 (16%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
            +LD G   +W  C N+Y                +SS+ R   C + +CS    +S  C 
Sbjct: 60  LVLDIGGQFLWVDCDNNY----------------VSSTYRPARCGSAQCSLARSDS--CG 101

Query: 164 DCNDEPLATSKNCT-QICPSYLV---LYGSGLTEGIALSETLN----LPNRIIPNFLVGC 215
           +C   P     N T  + P   V        L + +   ++ N    + N  +  FL  C
Sbjct: 102 NCFSAPKPGCNNNTCGVTPDNTVTGTATSGELAQDVVSLQSTNGFNPIQNATVSRFLFSC 161

Query: 216 SVLSSRQ-----PAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHK----FDDTTRTS 261
           +     Q      +G+AG GR + +LPSQL        KF+ CL S      F D     
Sbjct: 162 APTFLLQGLATGVSGMAGLGRTRIALPSQLASAFSFRRKFAVCLSSSNGVAFFGDGPY-- 219

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS-----VYYYVGLRRITVGGQRVRV 316
            ++L N  +        LT+TP + NP V+  +AFS       Y++G++ I +  + V +
Sbjct: 220 -VLLPNVDASQL-----LTFTPLLINP-VSTASAFSQGEPSAEYFIGVKSIKIDEKTVPL 272

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
               L+++  G GGT + S   +T +   +F+ + + FV +    RN TR     ++   
Sbjct: 273 NTTLLSINSKGVGGTKISSVNPYTVLEDSIFKAVTEAFV-KASSARNITR---VASVAPF 328

Query: 377 RPCF---DVPGEKTG-SFPELKLHFKGGAEV-TLPVENYFAVVGEGSAVCLTVVTDREAS 431
             CF   +V   + G + P ++L  +    V  +   N    V +   +CL  V   E +
Sbjct: 329 EVCFSRENVLATRLGAAVPTIELVLQNQKTVWRIFGANSMVSVSDDKVLCLGFVNGGE-N 387

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
              SI++G +Q+++  +++DL   RLGF   L
Sbjct: 388 PRTSIVIGGYQLEDNLLQFDLATSRLGFSSLL 419


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 105/402 (26%), Positives = 158/402 (39%), Gaps = 73/402 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           Y + +  G+P   +  + DTGS L W    PCT  ++         P F    S + R L
Sbjct: 91  YLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFR------QLPPIFNSTASRTYRDL 144

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN-L 203
            CQ+  C+  +    QCRD           C      Y + Y  G  T G+A  + L   
Sbjct: 145 PCQHQFCTN-NQNVFQCRD---------DKCV-----YRIAYAGGSATAGVAAQDILQSA 189

Query: 204 PNRIIPNFLVGCSVLSSRQPAGIAGF------------GRGKTSLPSQLN---LDKFSYC 248
            N  IP F  GCS    R     + F                 SL  Q+N    ++FSYC
Sbjct: 190 ENDRIP-FYFGCS----RDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYC 244

Query: 249 LLSHKFDDTTRTSSLI-LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           L        +  +SL+   N    S +K      TPFV+   +         Y++ L  +
Sbjct: 245 LNLFDLSSPSHATSLLRFGNDIRKSRRKYLS---TPFVSPRGMPN-------YFLNLIDV 294

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           +V G R+++      L  DG GGTI+DSGT  T+++   + P+   F       +NY   
Sbjct: 295 SVAGNRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAF-------KNYFDQ 347

Query: 368 LGAE----ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
            G +     L+G   C+   G    ++P +  HF+G      P   Y  V   G A C+ 
Sbjct: 348 HGFQRVNIQLSGYI-CYKQQGHTFHNYPSMAFHFQGADFFVEPEYVYLTVQDRG-AFCVA 405

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           +   +  S     I+G     N    YD  N++L F  + C+
Sbjct: 406 L---QPISPQQRTIIGALNQANTQFIYDAANRQLLFTPENCQ 444


>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
          Length = 371

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 99/377 (26%), Positives = 154/377 (40%), Gaps = 60/377 (15%)

Query: 104 FILDTGSHLV---WFP-CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
            + D G  +V   W P   N  QC +C    +P F+P  SS+ +   C    C       
Sbjct: 35  LLADGGGAVVPFHWSPELYNCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVC------- 87

Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL----PNRIIPNFLVGC 215
                   + + T K  + +C    V    G T GI  ++T  +    P R   +   G 
Sbjct: 88  --------KSIPTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPARPPAS---GA 136

Query: 216 SVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
           S  ++  P    +G  G GR   SL +Q+ L +FSYCL  H   DT + S L L      
Sbjct: 137 SWRATSTPWAGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSRLFL----GA 189

Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
           S K   G  +TPFV     +  +  S YY + L  I  G          +T+ R  N   
Sbjct: 190 SAKLAGGGAWTPFVKT---SPNDGMSQYYPIELEEIKAG-------DATITMPRGRNTVL 239

Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
           +  +    + +   +++      ++ +      T  +GA        CF  P       P
Sbjct: 240 VQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTAT-PVGAP----FEVCF--PKAGVSGAP 292

Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT----DREASGGPSIILGNFQMQNYY 447
           +L   F+ GA +T+P  NY   VG  + VCL+V++    +  A  G + ILG+FQ +N +
Sbjct: 293 DLVFTFQAGAALTVPPANYLFDVGNDT-VCLSVMSIALLNITALDGLN-ILGSFQQENVH 350

Query: 448 VEYDLRNQRLGFKQQLC 464
           + +DL    L F+   C
Sbjct: 351 LLFDLDKDMLSFEPADC 367


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 101/400 (25%), Positives = 167/400 (41%), Gaps = 77/400 (19%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ+   I+DTGS + + PC+    C+ C   + P F P+ SS+ + + 
Sbjct: 110 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST---CEQCGRHQDPKFQPESSSTYQPVK 166

Query: 147 CQNPKCSWIHHESIQCRDCNDEPL--ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           C           +I C +C+ + +     +   ++  S  VL    ++ G   +++   P
Sbjct: 167 C-----------TIDC-NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFG---NQSELAP 211

Query: 205 NRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKF 254
            R +     GC       L S+   GI G GRG  S+  QL       D FS C      
Sbjct: 212 QRAV----FGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV 267

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
                  +++L   S  SD     +T+    ++P        S YY + L+ + V G+R+
Sbjct: 268 GG----GAMVLGGISPPSD-----MTFA--YSDPDR------SPYYNIDLKEMHVAGKRL 310

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            +         DG  GT++DSGTT+ ++    F    D  V ++   +          ++
Sbjct: 311 PLNANVF----DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQ---------IS 357

Query: 375 GLRP-----CFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTV 424
           G  P     CF   G    + + SFP + + F  G + +L  ENY F       A CL +
Sbjct: 358 GPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGI 417

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               +     + +LG   ++N  V YD    ++GF +  C
Sbjct: 418 F---QNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNC 454


>gi|357443039|ref|XP_003591797.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
 gi|355480845|gb|AES62048.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
          Length = 436

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 104/434 (23%), Positives = 176/434 (40%), Gaps = 88/434 (20%)

Query: 65  PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
           P T+ T+ +T   TT I               TP   I   +D G    W  C   Y   
Sbjct: 35  PITRDTSASTPQYTTQIKQR------------TPLVPINLTIDLGGGYFWVNCDKSY--- 79

Query: 125 YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWI-HHESIQCRDCNDEP--LATSKNCTQICP 181
                        +SS+ + + C + +CS    H     + C   P  + T  + +    
Sbjct: 80  -------------VSSTLKPILCSSSQCSLFGSHGCSDKKICGRSPYNIVTGVSTSGDIQ 126

Query: 182 SYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPS 238
           S +V   S  T G      +++PN +   F+ G +V+    ++   G+AG GR K SLPS
Sbjct: 127 SDIVSVQS--TNGNYSGRFVSVPNFL---FICGSNVVQNGLAKGVKGMAGLGRTKVSLPS 181

Query: 239 QLN-----LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD-KKTTGLTYTPFVNNPSVAE 292
           Q +      +KF+ CL        T+   L   +G    +  ++  L YTP + NP    
Sbjct: 182 QFSSAFSFKNKFAICL-------GTQNGVLFFGDGPYLFNFDESKNLIYTPLITNPVSTS 234

Query: 293 RNAF----SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
            ++F    SV Y++G++ I V  + V++    L++D++G GGT + +   +T M   +++
Sbjct: 235 PSSFLGEKSVEYFIGVKSIRVSSKNVKLNTTLLSIDQNGFGGTKISTVNPYTIMETSIYK 294

Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRP---CF---DVPGEKTG-SFPELKLHFKGGA 401
            +AD FV          +AL    +  + P   CF    +   + G   P + L  +   
Sbjct: 295 AVADAFV----------KALNVSTVEPVAPFGTCFASQSISSSRMGPDVPSIDLVLQNEN 344

Query: 402 EV-TLPVENYFAVVGEGSAVCLTVVTDRE----------ASGGP----SIILGNFQMQNY 446
            V  +   N    + +   +CL  V                GG     SI +G  Q++N 
Sbjct: 345 VVWNIIGANAMVRINDKDVICLGFVDAGSDFAKTSQVGFVVGGSKPMTSITIGAHQLENN 404

Query: 447 YVEYDLRNQRLGFK 460
            +++DL   RLGF+
Sbjct: 405 LLQFDLATSRLGFR 418


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 107/404 (26%), Positives = 159/404 (39%), Gaps = 70/404 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
           G Y   +  GTPP+     +DTGS ++W  C +  +C  K      +  + PK SSS   
Sbjct: 82  GLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGST 141

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-LTEGIALSETLN 202
           + C    C+  +   +               CT   P  Y V+YG G  T G  +++ L 
Sbjct: 142 VSCDQGFCAATYGGKL-------------PGCTANVPCEYSVMYGDGSSTTGFFVTDALQ 188

Query: 203 L----------PNRIIPNFLVGC----SVLSSRQP-AGIAGFGRGKTSLPSQLNLDK--- 244
                      P      F  G      + SS Q   GI GFG+  TS+ SQL       
Sbjct: 189 FDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVK 248

Query: 245 --FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
             F++CL      DT +   +           KTT          P VA+      +Y V
Sbjct: 249 KIFAHCL------DTIKGGGIFAIGNVVQPKVKTT----------PLVADMP----HYNV 288

Query: 303 GLRRITVGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
            L+ I VGG  +++  H + T +R    GTI+DSGTT T++ PEL       F   M   
Sbjct: 289 NLKSIDVGGTTLQLPAHVFETGERK---GTIIDSGTTLTYL-PELV------FKEVMAAI 338

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAV 420
            N  + +    +     CF  PG     FP +  HF+    + + P E +F    +   V
Sbjct: 339 FNKHQDIVFHNVQDFM-CFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGNDMYCV 397

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                  +   G   +++G+  + N  V YDL NQ +G+    C
Sbjct: 398 GFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYNC 441


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 107/446 (23%), Positives = 175/446 (39%), Gaps = 99/446 (22%)

Query: 39  NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
           NP +   Q ++S+++ S+ R  ++ +     + +        +SS    GY +S S GTP
Sbjct: 43  NPKETQIQRISSILNYSINRVRYLNH---VFSFSPNKIQDVPLSSFMGAGYVMSYSIGTP 99

Query: 99  PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS----- 153
           P  +  ++DTG+  +WF C     CK C +   P F P  SS+ + + C +P C      
Sbjct: 100 PFQLYSLIDTGNDNIWFQCK---PCKPCLNQTSPMFHPSKSSTYKTIPCTSPICKNADGH 156

Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLV 213
           ++  +++     N  P++                                      N ++
Sbjct: 157 YLGVDTLTLNSNNGTPIS------------------------------------FKNIVI 180

Query: 214 GCSVLSSRQP-----AGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
           GC    ++ P     +G  G  RG  S  SQLN     KFSYCL+   F     +S L  
Sbjct: 181 GCG-HRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVP-LFSKENVSSKLHF 238

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
            + S+ S     G   TP      + E N     Y+V L   +VG   +++       + 
Sbjct: 239 GDKSTVSG---LGTVSTP------IKEENG----YFVSLEAFSVGDHIIKLE------NS 279

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK-------NRNYTRALGAEALTGLRP 378
           D  G +I+DSGTT T +  +++  L +  V  MVK       ++ +       + T L  
Sbjct: 280 DNRGNSIIDSGTTMTILPKDVYSRL-ESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTK 338

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
              +             HF  G+EV L   N F  + +   +C   V+    S     I 
Sbjct: 339 VLIITA-----------HFS-GSEVHLNALNTFYPITD-EVICFAFVSGGNFSS--LAIF 383

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN   QN+ V +DL  + + FK   C
Sbjct: 384 GNVVQQNFLVGFDLNKKTISFKPTDC 409


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 97/370 (26%), Positives = 149/370 (40%), Gaps = 63/370 (17%)

Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS 182
           C  C     P F PKLSSS  ++ C +  C+ +  +  +C + +D            C  
Sbjct: 6   CVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQL--DGHRCHEDDD----------GACQY 53

Query: 183 YLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPS 238
                G G+T+G    + L +   +    + GCS  S   PA    G+ G GRG  SL S
Sbjct: 54  TYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVS 113

Query: 239 QLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
           QL++ +F YCL        +RTS  L+L  G+      +  +T T       ++    + 
Sbjct: 114 QLSVHRFMYCLPPP----MSRTSGKLVLGAGADAVRNMSDRVTVT-------MSSSTRYP 162

Query: 298 VYYYVGLRRITVGGQ-------------------RVRVWHKYLTLDRDGNGGTIVDSGTT 338
            YYY+ L  + VG Q                           +        G IVD  +T
Sbjct: 163 SYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVAST 222

Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP---GEKTGSFPELKL 395
            +F+   L++ LAD+   ++       RA  +  L GL  CF +P   G      P + L
Sbjct: 223 ISFLETSLYDELADDLEEEI----RLPRATPSLRL-GLDLCFILPEGVGMDRVYVPTVSL 277

Query: 396 HFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQ 455
            F G     L ++     V +G  +CL +      S     ILGNFQ+QN  V ++LR  
Sbjct: 278 SFDG---RWLELDRDRLFVTDGRMMCLMIGRTSGVS-----ILGNFQLQNMRVLFNLRRG 329

Query: 456 RLGFKQQLCK 465
           ++ F +  C 
Sbjct: 330 KITFAKASCD 339


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 91/395 (23%), Positives = 149/395 (37%), Gaps = 54/395 (13%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y +  + G+PP     I DTGS++VW  C +   C  C   KIP F P  SS+  +  C 
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPI-CTNCYKQKIPLFNPTKSSTYAIRLCG 166

Query: 149 NPKCS---WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
           + +C    W   E + C           K+  Q+C  ++       +EG   ++ +  P 
Sbjct: 167 HRECKQALWGLGEYLGC-----------KSSVQVCRYHISYEDHSFSEGTISTDIITFPE 215

Query: 206 RIIP------NFLVGCSVLSSRQPA---------GIAGFGRGKTSLPSQLNLDKFSYCLL 250
            I            GC   +S  P          G+ G G    SL  QL L +FSYC+ 
Sbjct: 216 HIAEFGNYSLRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQLTLGQFSYCIS 275

Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
           +        T  +     +S S   T                 N    Y +  +  I V 
Sbjct: 276 TPDVQKPNGTIEIRFGLAASISGHST-------------ALANNLEGWYIFQNVDGIYVD 322

Query: 311 GQRVRVWHKYL-TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
             +V+ + +++      G GG I+DSGTT+T    EL+    D  + ++ +         
Sbjct: 323 DTKVKGYPEWVFQFAEGGIGGLIMDSGTTYT----ELYFSALDALIGELKEQIELAPDTQ 378

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-AVCLTVVTDR 428
             + +    C++         P ++L F    E   P     A +  G+   CL +    
Sbjct: 379 DHSNSNYSLCYNAANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMF--- 435

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
             + G SII G +Q ++  + YDL+   + F +  
Sbjct: 436 -GTSGISII-GIYQHRDIKIGYDLKYNLVSFTEMF 468


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 101/401 (25%), Positives = 165/401 (41%), Gaps = 64/401 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+PP      +DTGS ++W  C++   C + S   I    F    S ++  
Sbjct: 98  GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGS 157

Query: 145 LGCQNPKCSWIHH-ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
           + C +P CS +    + QC + N         C      Y   YG G  T G  +++T  
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENN--------QC-----GYSFRYGDGSGTSGYYMTDTFY 204

Query: 203 ----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
               L   ++ N     + GCS         S +   GI GFG+GK S+ SQL+    + 
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264

Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
            + SH    D +     +L       +    G+ Y+P +  PS         +Y + L  
Sbjct: 265 PVFSHCLKGDGSGGGVFVL------GEILVPGMVYSPLL--PS-------QPHYNLNLLS 309

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF---VSQMVKNRN 363
           I V GQ + +       +     GTIVD+GTT T++  E ++P  +     VSQ+V    
Sbjct: 310 IGVNGQILPI--DAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLV---- 363

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
                    ++    C+ V    +  FP + L+F GGA + L  ++Y    G      + 
Sbjct: 364 ------TLIISNGEQCYLVSTSISDMFPPVSLNFAGGASMMLRPQDYLFHYGFYDGASMW 417

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +  ++A      ILG+  +++    YDL  QR+G+    C
Sbjct: 418 CIGFQKAP-EEQTILGDLVLKDKVFVYDLARQRIGWANYDC 457


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 100/391 (25%), Positives = 160/391 (40%), Gaps = 64/391 (16%)

Query: 93  LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
            + GTPPQ    I+D    LVW  C+   +C  C    +P FIP  SS+ R   C    C
Sbjct: 47  FTIGTPPQPASAIIDVAGELVWTQCS---RCSRCFKQDLPLFIPNASSTFRPEPCGTDAC 103

Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
                +S    +C+ + + T ++ T I            T GI  +ET  +      +  
Sbjct: 104 -----KSTPTSNCSGD-VCTYESTTNI------RLDRHTTLGIVGTETFAI-GTATASLA 150

Query: 213 VGCSVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
            GC V S        +G  G GR   SL +Q+ L KFSYCL       T ++S L L + 
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCLSPRG---TGKSSRLFLGSS 207

Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
           +  +  ++T  +  PF+      + +    YY + L  I  G   +             +
Sbjct: 208 AKLAGGEST--STAPFIKTSPDDDSHH---YYLLSLDAIRAGNTTIATAQ---------S 253

Query: 329 GGTIV-DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA----LTGLRPCFDVP 383
           GG +V  + + F+ +    +              +  T A+G  A     T  +P FD+ 
Sbjct: 254 GGILVMHTVSPFSLLVDSAYRAF----------KKAVTEAVGGAAEQPMATPPQP-FDLC 302

Query: 384 GEKTGSF-----PELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT----DREASGG 433
            +K   F     P+L   F+G A +T+P   Y   VG E    C  +++    +R    G
Sbjct: 303 FKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEG 362

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            S +LG+ Q ++ +  YDL+ + L F+   C
Sbjct: 363 VS-VLGSLQQEDVHFLYDLKKETLSFEPADC 392


>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 515

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 115/416 (27%), Positives = 167/416 (40%), Gaps = 88/416 (21%)

Query: 77  TTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFI- 135
           +T  ISS  +  Y+ ++  GTP       LDTGS L W PC     C  C+++   +F  
Sbjct: 85  STFRISSLGFLHYT-TVQIGTPGVKFMVALDTGSDLFWVPC----DCTRCAATDSSAFAS 139

Query: 136 --------PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY 187
                   P  SS+S+ + C N  C    H S QC       L T  NC      Y+V Y
Sbjct: 140 DFDLNVYNPNGSSTSKKVTCNNSLCM---HRS-QC-------LGTLSNC-----PYMVSY 183

Query: 188 GSGL--TEGIALSETLNLPNR------IIPNFLVGC------SVLSSRQPAGIAGFGRGK 233
            S    T GI + + L+L         +  N + GC      S L    P G+ G G  K
Sbjct: 184 VSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEK 243

Query: 234 TSLPSQLNLDKFSYCLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
            S+PS L+ + F+    S  F  D   R S    D GS   D+       TPF  NPS  
Sbjct: 244 ISVPSMLSREGFTADSFSMCFGRDGIGRIS--FGDKGSFDQDE-------TPFNLNPSHP 294

Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
                   Y + + ++ VG   + V    L            DSGT+FT++    +  L 
Sbjct: 295 T-------YNITVTQVRVGTTLIDVEFTAL-----------FDSGTSFTYLVDPTYTRLT 336

Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
           + F SQ+   R+      +++      C+D+ P   T   P + L   GG+     V + 
Sbjct: 337 ESFHSQVQDRRHR-----SDSRIPFEYCYDMSPDANTSLIPSVSLTMGGGSH--FAVYDP 389

Query: 411 FAVVGEGSAV--CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             ++   S +  CL VV   E +     I+G   M  Y V +D     LG+K+  C
Sbjct: 390 IIIISTQSELVYCLAVVKTAELN-----IIGQNFMTGYRVVFDREKLVLGWKKFDC 440


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 107/399 (26%), Positives = 169/399 (42%), Gaps = 74/399 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ+   I+D+GS + + PC++   C+ C   + P F P++SS+ + + 
Sbjct: 91  GYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSD---CEQCGKHQDPKFQPEMSSTYQPVK 147

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C N  C+    +  QC    +   A   +   +    L+ +G         +E+   P R
Sbjct: 148 C-NMDCN-CDDDREQC--VYEREYAEHSSSKGVLGEDLISFG---------NESQLTPQR 194

Query: 207 IIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF-----DD 256
            +     GC       L S++  GI G G+G  SL  QL +DK    L+S+ F       
Sbjct: 195 AV----FGCETVETGDLYSQRADGIIGLGQGDLSLVDQL-VDK---GLISNSFGLCYGGM 246

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
                S+IL      SD           V   S  +R   S YY + L  I V G+++ +
Sbjct: 247 DVGGGSMILGGFDYPSD----------MVFTDSDPDR---SPYYNIDLTGIRVAGKQLSL 293

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
             +      DG  G ++DSGTT+ ++ P+      +E V + V           + + G 
Sbjct: 294 HSRVF----DGEHGAVLDSGTTYAYL-PDAAFAAFEEAVMREVST--------LKQIDGP 340

Query: 377 RP-----CFDVPG-----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV 425
            P     CF V       E +  FP +++ FK G    L  ENY F       A CL V 
Sbjct: 341 DPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVF 400

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + +     + +LG   ++N  V YD  N ++GF +  C
Sbjct: 401 PNGKDH---TTLLGGIVVRNTLVVYDRENSKVGFWRTNC 436


>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
 gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
 gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
 gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
 gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
 gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
 gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
 gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
          Length = 474

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 111/435 (25%), Positives = 182/435 (41%), Gaps = 79/435 (18%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
           + N Q +  T++++T    I   S   +   +++S G PP +    +DTGS L W    P
Sbjct: 87  LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 146

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
           C  H  C   S+   P F P  S +SR + C + KC  + ++  +Q  +C    +    +
Sbjct: 147 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 200

Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
           CT     Y V YG+G   + G  +++TL + +  + + + GCS  V  S   AGI GFG 
Sbjct: 201 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 254

Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
                       P  L+   FSYCL +    D T+   +IL       D+      YTP 
Sbjct: 255 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 306

Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
              +N P+          Y + +  +   GQR+             +   IVDSG   T 
Sbjct: 307 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 346

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
           + P  F  L D+ ++Q + +  Y R   A   + +  C+    + +G            +
Sbjct: 347 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 403

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
            P L++ F GGA + L   N F        +C+T   +       S ILGN   +++   
Sbjct: 404 LPPLEIGFAGGAALALSPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 459

Query: 450 YDLRNQRLGFKQQLC 464
           +D++ ++ GFK   C
Sbjct: 460 FDIQGKQFGFKYAAC 474


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 151/375 (40%), Gaps = 67/375 (17%)

Query: 105 ILDTGSHLVWFPCTNHYQCKY--CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQ 161
           ++DT S + W  C     C    C   K P + P  SS+   + C +P C  +       
Sbjct: 172 VVDTSSDIPWVQC---LPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNG 228

Query: 162 CRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVLS 219
           C    DE           C  Y+V YG G  T G  +++TL + P  ++ +F  GCS   
Sbjct: 229 CSPTTDE-----------C-KYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGCSHAV 276

Query: 220 ----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
               S Q AGI   G G+ SL  Q      + FSYC+          ++  +   G   +
Sbjct: 277 RGSFSNQNAGILALGGGRGSLLEQTADAYGNAFSYCI------PKPSSAGFLSLGGPVEA 330

Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
             K    +YTP + N           +Y V L  I V G+++ V             G +
Sbjct: 331 SLK---FSYTPLIKNKHA------PTFYIVHLEAIIVAGKQLAVPPTAFAT------GAV 375

Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG--AEALTGLRPCFDVPGEKTGSF 390
           +DSG   T + P+++  L   F S M        A G  A  +  L  C+D         
Sbjct: 376 MDSGAVVTQLPPQVYAALRAAFRSAMA-------AYGPLAAPVRNLDTCYDFTRFPDVKV 428

Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT-VVTDREASGGPSIILGNFQMQNYYVE 449
           P++ L F GGA + L      +++ +G   CL    T  E S G    +GN Q Q Y V 
Sbjct: 429 PKVSLVFAGGATLDL---EPASIILDG---CLAFAATPGEESVG---FIGNVQQQTYEVL 479

Query: 450 YDLRNQRLGFKQQLC 464
           YD+   ++GF++  C
Sbjct: 480 YDVGGGKVGFRRGAC 494


>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
 gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
 gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
 gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
 gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
 gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
 gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
 gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
          Length = 357

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 106/404 (26%), Positives = 171/404 (42%), Gaps = 77/404 (19%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           +++S G PP +    +DTGS L W    PC  H  C   S+   P F P  S +SR + C
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58

Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
            + KC  + ++  +Q  +C ++      +CT     Y V YG+G   + G  +++TL + 
Sbjct: 59  SSVKCGELRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109

Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
           +  + + + GCS  V  S   AGI GFG             P  L+   FSYCL +    
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT---- 164

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
           D T+   +IL       D+      YTP    +N P+          Y + +  +   GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+             +   IVDSG   T + P  F  L D+ ++Q + +  Y R   A  
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259

Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
            + +  C+    + +G            + P L++ F GGA + LP  N F        +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVF-YNDPHRGL 316

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           C+T   +       S ILGN   +++   +D++ ++ GFK   C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAAC 357


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 102/401 (25%), Positives = 164/401 (40%), Gaps = 64/401 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+PP      +DTGS ++W  C++   C + S   I    F    S ++  
Sbjct: 98  GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGS 157

Query: 145 LGCQNPKCSWIHH-ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
           + C +P CS +    + QC + N         C      Y   YG G  T G  +++T  
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENN--------QC-----GYSFRYGDGSGTSGYYMTDTFY 204

Query: 203 ----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
               L   ++ N     + GCS         S +   GI GFG+GK S+ SQL+    + 
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264

Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
            + SH    D +     +L       +    G+ Y+P V  PS         +Y + L  
Sbjct: 265 PVFSHCLKGDGSGGGVFVL------GEILVPGMVYSPLV--PS-------QPHYNLNLLS 309

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I V GQ + +       +     GTIVD+GTT T++  E +    D F++ +    N   
Sbjct: 310 IGVNGQMLPL--DAAVFEASNTRGTIVDTGTTLTYLVKEAY----DLFLNAI---SNSVS 360

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY---FAVVGEGSAVCLT 423
            L    ++    C+ V    +  FP + L+F GGA + L  ++Y   + +    S  C+ 
Sbjct: 361 QLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG 420

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                E       ILG+  +++    YDL  QR+G+    C
Sbjct: 421 FQKAPEE----QTILGDLVLKDKVFVYDLARQRIGWASYDC 457


>gi|302783204|ref|XP_002973375.1| hypothetical protein SELMODRAFT_413680 [Selaginella moellendorffii]
 gi|300159128|gb|EFJ25749.1| hypothetical protein SELMODRAFT_413680 [Selaginella moellendorffii]
          Length = 407

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 72/263 (27%), Positives = 112/263 (42%), Gaps = 29/263 (11%)

Query: 213 VGCSVLSSR-----QPAGIAGFGRGKTSLPSQL-NLD---KFSYCLLSHKFDDTTRTSSL 263
           +GC   S+R       +G+ GF +   S   QL  +D   KF YC  S  F     +  +
Sbjct: 127 LGCGRQSTRLLGILSTSGLVGFAKTNKSFIGQLAEMDYTGKFIYCAPSDTF-----SGKI 181

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
           +  N   +     + L+YTP + NP        +  YY+GLR I++      +    L  
Sbjct: 182 VFGN---YKISSNSSLSYTPMIVNP------ISTALYYIGLRSISINDMLTFLVQGILA- 231

Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
             DG GGTI+DS   F++  P+ + PL  + +  +  N     +    AL G   C++V 
Sbjct: 232 --DGTGGTIIDSTFAFSYFTPDSYTPLV-QAIQNLNSNLTKVSSNKTAALLGNDICYNVS 288

Query: 384 GEKTGSFPE-LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
                  P+ L  HF+ G +V            E + VCL  V D +  G    ++G +Q
Sbjct: 289 VNGDTPPPQTLTYHFENGTQVEFRTWFLLDDDAENATVCL-AVGDSQKVGFSLNVIGTYQ 347

Query: 443 MQNYYVEYDLRNQRLGFKQQLCK 465
             +  VE+DL  Q +GF    C 
Sbjct: 348 QLDVAVEFDLEKQEIGFGTAGCN 370


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 104/394 (26%), Positives = 158/394 (40%), Gaps = 58/394 (14%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSS 140
           +S + G Y   L  GTP      ++DTGS L W  C+    C   C     P F P+ S 
Sbjct: 124 ASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCS---PCSVSCHRQAGPVFDPRASG 180

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSE 199
           +   + C + +C  +   ++    C         + + +C  Y   YG S  + G    +
Sbjct: 181 TYAAVQCSSSECGELQAATLNPSAC---------SVSNVC-IYQASYGDSSYSVGYLSKD 230

Query: 200 TLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHK 253
           T++  +   P F  GC   +     + AG+ G  + K SL  QL       FSYCL    
Sbjct: 231 TVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCL---- 286

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
              T+  ++  L  GS +  +     +YTP      +A  +  +  Y+V L  I+V G  
Sbjct: 287 --PTSSAAAGYLSIGSYNPGQ----YSYTP------MASSSLDASLYFVTLSGISVAGAP 334

Query: 314 VRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           + V   +Y +L       TI+DSGT  T + P ++  L+    + M              
Sbjct: 335 LAVPPSEYRSLP------TIIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSI-- 386

Query: 373 LTGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
              L  CF   G   G   P + + F GGA + L   N    V + S  CL       A 
Sbjct: 387 ---LDTCFR--GSAAGLRVPRVDMAFAGGATLALSPGNVLIDV-DDSTTCLAF-----AP 435

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            G + I+GN Q Q + V YD+   R+GF    C 
Sbjct: 436 TGGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 469


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 96/401 (23%), Positives = 176/401 (43%), Gaps = 65/401 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  G P +     +DTGS ++W  C+    C  C  S     ++  F    SSS
Sbjct: 82  GLYFTKVKLGNPAREFNVQIDTGSDILWVTCS---PCDGCPDSSGLGIELNLFDTTKSSS 138

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
           +R+L C +P C+ +   + QC       L  + +C+    S+     SG T G  +++++
Sbjct: 139 ARVLPCTDPICAAVSTTTDQC-------LTQTDHCSY---SFHYRDRSG-TSGFYVTDSM 187

Query: 202 N----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFS 246
           +    L    I N     + GCS+        +++   GI GFG+G+ S+ SQL+    +
Sbjct: 188 HFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGIT 247

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
             + SH          +++       +     + Y+P +  PS         +Y + L+ 
Sbjct: 248 PKVFSHCLKGGENGGGILV-----LGEILEPSIVYSPLI--PS-------QPHYTLKLQS 293

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I + GQ   ++           G TI+DSGTT  ++  E+++ +     S + ++   T 
Sbjct: 294 IALSGQ---LFPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTI 350

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---AVVGEGSAVCLT 423
           + G++       CF V       FP L+ +F+G A + +  E Y    ++V E +  C+ 
Sbjct: 351 SRGSQ-------CFRVSMSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVREPALWCIG 403

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
               ++A  G + ILG+  +++  + YDL  QR+G+    C
Sbjct: 404 F---QKAEDGLN-ILGDLVLKDKIIVYDLARQRIGWANYDC 440


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 103/404 (25%), Positives = 158/404 (39%), Gaps = 70/404 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS--KIPSFIPKLSSSSRL 144
           G Y   +  GTPP+     +DTGS ++W  C    QC + S     +  + PK SS+  +
Sbjct: 84  GLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSM 143

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-------LTEGIA 196
           + C    C+      +               C    P  Y V YG G       +T+ + 
Sbjct: 144 VMCDQAFCAATFGGKL-------------PKCGANVPCEYSVTYGDGSSTIGSFVTDALQ 190

Query: 197 LSETLNLPNRIIPN--FLVGCSV-----LSSRQPA--GIAGFGRGKTSLPSQLNLDK--- 244
             +          N   + GC       L S   A  GI GFG   TS+ SQL       
Sbjct: 191 FDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVK 250

Query: 245 --FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
             F++CL      DT +   +           KTT          P VA++     +Y V
Sbjct: 251 KIFAHCL------DTIKGGGIFSIGDVVQPKVKTT----------PLVADKP----HYNV 290

Query: 303 GLRRITVGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
            L+ I VGG  +++  H +   ++ G   TI+DSGTT T++ PEL       F   M+  
Sbjct: 291 NLKTIDVGGTTLQLPAHIFEPGEKKG---TIIDSGTTLTYL-PELV------FKEVMLAV 340

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAV 420
            N  + +    + G   CF  PG     FP +  HF+    + + P E +FA   +   V
Sbjct: 341 FNKHQDITFHDVQGFL-CFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFANGNDVYCV 399

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                  +   G   +++G+  + N  V YDL N+ +G+    C
Sbjct: 400 GFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNC 443


>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
          Length = 472

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 111/435 (25%), Positives = 182/435 (41%), Gaps = 79/435 (18%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
           + N Q +  T++++T    I   S   +   +++S G PP +    +DTGS L W    P
Sbjct: 85  LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
           C  H  C   S+   P F P  S +SR + C + KC  + ++  +Q  +C    +    +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 198

Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
           CT     Y V YG+G   + G  +++TL + +  + + + GCS  V  S   AGI GFG 
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252

Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
                       P  L+   FSYCL +    D T+   +IL       D+      YTP 
Sbjct: 253 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 304

Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
              +N P+          Y + +  +   GQR+             +   IVDSG   T 
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
           + P  F  L D+ ++Q + +  Y R   A   + +  C+    + +G            +
Sbjct: 345 LWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
            P L++ F GGA + L   N F        +C+T   +       S ILGN   +++   
Sbjct: 402 LPLLEIGFAGGAALALSPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457

Query: 450 YDLRNQRLGFKQQLC 464
           +D++ ++ GFK   C
Sbjct: 458 FDIQGKQFGFKYAAC 472


>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
          Length = 472

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 111/435 (25%), Positives = 182/435 (41%), Gaps = 79/435 (18%)

Query: 62  IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
           + N Q +  T++++T    I   S   +   +++S G PP +    +DTGS L W    P
Sbjct: 85  LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144

Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
           C  H  C   S+   P F P  S +SR + C + KC  + ++  +Q  +C    +    +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 198

Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
           CT     Y V YG+G   + G  +++TL + +  + + + GCS  V  S   AGI GFG 
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252

Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
                       P  L+   FSYCL +    D T+   +IL       D+      YT  
Sbjct: 253 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTSL 304

Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
              +N P+          Y + +  +   GQR+             +   IVDSG   T 
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
           + P  F  L D+ ++Q + +  Y R   A   + +  C+    + +G            +
Sbjct: 345 LWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401

Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
            P L++ F GGA + LP  N F        +C+T   +       S ILGN   +++   
Sbjct: 402 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457

Query: 450 YDLRNQRLGFKQQLC 464
           +D++ ++ GFK   C
Sbjct: 458 FDIQGKQFGFKYAAC 472


>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
          Length = 379

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 79/278 (28%), Positives = 122/278 (43%), Gaps = 34/278 (12%)

Query: 204 PNRIIPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
           PNR      V CS L+      +  G+ G  RG  S  SQ++  KFSYC+    F     
Sbjct: 108 PNRSSSYSPVPCSSLTCTDQDSKNTGLMGMNRGSLSFVSQMDFPKFSYCISDSDF----- 162

Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
            S ++L   ++ S      L YTP +   S        V Y V L  I V  + + +   
Sbjct: 163 -SGVLLLGDANFS--WLMPLNYTPLIQI-STPLPYFDRVAYTVQLEGIKVSSKLLPLPKS 218

Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNYTRALGAEAL 373
               D  G G T+VDSGT FTF+   ++  L +EF++Q      ++++ NY    G +  
Sbjct: 219 VFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDL- 277

Query: 374 TGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVVT 426
                C+ VP  +T     P + L F+ GAE+ +  +         V G  S  C T   
Sbjct: 278 -----CYRVPLSQTSLPWLPTVSLMFR-GAEMKVSGDRLLYRVPGEVRGSDSVYCFT-FG 330

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + +     + ++G+   QN ++E+DL   R+GF Q  C
Sbjct: 331 NSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 368



 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 11/72 (15%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYCSSSKIPSFIPKLSSS 141
           H     ++SL+ GTPPQ +  +LDTGS L W  C  T  +Q          +F P  SSS
Sbjct: 63  HHNVSLTVSLTVGTPPQNVSMVLDTGSELSWLRCNKTQTFQT---------TFDPNRSSS 113

Query: 142 SRLLGCQNPKCS 153
              + C +  C+
Sbjct: 114 YSPVPCSSLTCT 125


>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
 gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
          Length = 165

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/166 (30%), Positives = 74/166 (44%), Gaps = 10/166 (6%)

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
           YYYVGL  I+VGG+ + +      +D  GNGG IVDSGT  T +  +++  + D FV   
Sbjct: 10  YYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDAFV--- 66

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
              +     L    ++    C+D+  + +   P +  HF  G  + LP +NY   V    
Sbjct: 67  ---KGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVDSVG 123

Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             C        +      I+GN Q Q   V +DL N  +GF    C
Sbjct: 124 TFCFAFAPTMSSLS----IIGNIQQQGTRVSFDLANSLVGFSPNRC 165


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 120/471 (25%), Positives = 193/471 (40%), Gaps = 74/471 (15%)

Query: 15  FFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTT 74
           F ++  I  + + +  FS+   H +  +    N +   +  L R    +   + +  + +
Sbjct: 19  FVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDR--FFRRFMSFSEASIS 76

Query: 75  TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
             T     S + G Y + +S GTPP  +  I DTGS L+W  C     C  C   K P F
Sbjct: 77  PNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQC---LPCLSCYKQKNPMF 133

Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ---ICPSYLVLYGSG- 190
            P  S+S + + C          ES QCR      L  + +C+Q   +C  +   YG G 
Sbjct: 134 DPSKSTSFKEVSC----------ESQQCR------LLDTVSCSQPQKLC-DFSYGYGDGS 176

Query: 191 LTEGIALSETLNL------PNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQL 240
           L +G+  +ETL L      P  I+ N + GC   +S        G+ G G    SL SQ+
Sbjct: 177 LAQGVIATETLTLNSNSGQPTSIL-NIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQI 235

Query: 241 -----NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAER 293
                +  KFS CL+  + D +  TS +I       ++   + +  TP V  ++P+    
Sbjct: 236 MSTLGSGRKFSQCLVPFRTDPSI-TSKIIF---GPEAEVSGSDVVSTPLVTKDDPT---- 287

Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
                YY+V L  I+V G ++  +     +   GN    +D+GT  T         L  +
Sbjct: 288 -----YYFVTLDGISV-GDKLFPFSSSSPMATKGN--VFIDAGTPPTL--------LPRD 331

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
           F +++V+       +       L+P            P L  HF  GA+V L   N F  
Sbjct: 332 FYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAHFD-GADVQLKPLNTFIS 390

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             EG   C  +    +   G + I GNF   N+ + +DL  +++ FK   C
Sbjct: 391 PKEG-VYCFAM----QPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 107/397 (26%), Positives = 160/397 (40%), Gaps = 55/397 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  GTPP      +DTGS ++W  C +   C   S   I    F    SSSS L
Sbjct: 77  GLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSL 136

Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
           + C +P C S     + QC       L  S  C     SY   YG G  T G  +SE++ 
Sbjct: 137 VSCSDPICNSAFQTTATQC-------LTQSNQC-----SYTFQYGDGSGTSGYYVSESMY 184

Query: 203 ----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
               +   +I N     + GCS         S     GI GFG G  S+ SQL+    + 
Sbjct: 185 FDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITP 244

Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
            + SH          +++       +    G+ Y+P V  PS    N +       L+ I
Sbjct: 245 KVFSHCLKGEGNGGGILV-----LGEVLEPGIVYSPLV--PSQPHYNLY-------LQSI 290

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           +V GQ + +           N GTI+DSGTT  ++  E + P      + + ++   T +
Sbjct: 291 SVNGQTLPIDPSVFATSI--NRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTIS 348

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
            G +       C+ V       FP + L+F G A + L  E Y   +G      L  +  
Sbjct: 349 KGNQ-------CYLVSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGF 401

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ++   G + ILG+  M++    YDL  QR+G+    C
Sbjct: 402 QKVQEGVT-ILGDLVMKDKIFVYDLARQRIGWASYDC 437


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 101/399 (25%), Positives = 163/399 (40%), Gaps = 64/399 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRLLG 146
           Y   +  G+PP      +DTGS ++W  C++   C + S   I    F    S ++  + 
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164

Query: 147 CQNPKCSWIHH-ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN-- 202
           C +P CS +    + QC + N         C      Y   YG G  T G  +++T    
Sbjct: 165 CSDPICSSVFQTTAAQCSENN--------QC-----GYSFRYGDGSGTSGYYMTDTFYFD 211

Query: 203 --LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
             L   ++ N     + GCS         S +   GI GFG+GK S+ SQL+    +  +
Sbjct: 212 AILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPV 271

Query: 250 LSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
            SH    D +     +L       +    G+ Y+P V  PS         +Y + L  I 
Sbjct: 272 FSHCLKGDGSGGGVFVL------GEILVPGMVYSPLV--PS-------QPHYNLNLLSIG 316

Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
           V GQ + +       +     GTIVD+GTT T++  E +    D F++ +    N    L
Sbjct: 317 VNGQMLPL--DAAVFEASNTRGTIVDTGTTLTYLVKEAY----DLFLNAI---SNSVSQL 367

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY---FAVVGEGSAVCLTVV 425
               ++    C+ V    +  FP + L+F GGA + L  ++Y   + +    S  C+   
Sbjct: 368 VTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQ 427

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              E       ILG+  +++    YDL  QR+G+    C
Sbjct: 428 KAPEE----QTILGDLVLKDKVFVYDLARQRIGWASYDC 462


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 117/273 (42%), Gaps = 22/273 (8%)

Query: 192 TEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNLDKFSYC 248
           T G   ++T       +P  + GCS  S    AG   + G GRG  SL SQL   KFSY 
Sbjct: 129 TSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQ 188

Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
           LL+ +  D     S+I   G     K   G + TP +++        +  +YYV L  + 
Sbjct: 189 LLAPEATDDGSADSVI-RFGDDAVPKTKRGRS-TPLLSS------TLYPDFYYVNLTGVR 240

Query: 309 VGGQRV-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           V G R+  +      L  +G GG I+ S T  T++     E  A + V   V +R    A
Sbjct: 241 VDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYL-----EQAAYDVVRAAVASRIGLPA 295

Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
           +   A   L  C++         P+L L F GGA++ L   NYF +  +    CLT++  
Sbjct: 296 VNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPS 355

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
           +  S     +LG        + YD+   RL F+
Sbjct: 356 QGGS-----VLGTLLQTGTNMIYDVDAGRLTFE 383


>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
          Length = 416

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 106/385 (27%), Positives = 150/385 (38%), Gaps = 73/385 (18%)

Query: 94  SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
           + GTPPQ    I+D        PC+                 P  SS+ R   C    C 
Sbjct: 72  TIGTPPQPASAIIDVAGPA---PCS----------------FPNASSTFRPEPCGTDACK 112

Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLV 213
            I   +     C  E    SK               G T GI  ++T  +      +   
Sbjct: 113 SIPTSNCSSNMCTYEGTINSKL-------------GGHTLGIVATDTFAI-GTATASLGF 158

Query: 214 GCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
           GC V S       P+G+ G GR  +SL SQ+N+ KFSYCL  H   D+ + S L+L  GS
Sbjct: 159 GCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCLTPH---DSGKNSRLLL--GS 213

Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
           S         T TPFV     +  +  S YY + L  I  G          + L   GN 
Sbjct: 214 SAKLAGGGNSTTTPFVK---TSPGDDMSQYYPIQLDGIKAG-------DAAIALPPSGN- 262

Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE-ALTGLRP---CFDVPGE 385
             +V +    +F+    ++ L  E           T+A+GA    T L+P   CF   G 
Sbjct: 263 TVLVQTLAPMSFLVDSAYQALKKEV----------TKAVGAAPTATPLQPFDLCFPKAGL 312

Query: 386 KTGSFPELKLHF-KGGAEVTLPVENYFAVVGEGSA-VCLTVVT----DREASGGPSIILG 439
              S P+L   F +G A +T+P   Y   VGE    VC+ +++    +  A      ILG
Sbjct: 313 SNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILG 372

Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
           + Q +N +   DL  + L F+   C
Sbjct: 373 SLQQENTHFLLDLEKKTLSFEPADC 397


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 103/391 (26%), Positives = 163/391 (41%), Gaps = 68/391 (17%)

Query: 92  SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
           ++S G PP     ++DTGS ++W  CT    C  C +     F P  SS+   L C+ P 
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCT---PCTNCDNDLGLLFDPSKSSTFSPL-CKTP- 158

Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT------EGIALSETLNLPN 205
           C +          C  +P+            + V Y    T          + ET +   
Sbjct: 159 CDF--------EGCRCDPIP-----------FTVTYADNSTASGTFGRDTVVFETTDEGT 199

Query: 206 RIIPNFLVGC--SVLSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
             I + L GC  ++     P   GI G   G  SL ++L   KFSYC+  +  D      
Sbjct: 200 SRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKLG-QKFSYCI-GNLADPYYNYH 257

Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
            LIL  G+      T      PF           ++ +YYV +  I+VG +R+ +  +  
Sbjct: 258 QLILGEGADLEGYST------PF---------EVYNGFYYVTMEGISVGEKRLDIAPETF 302

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP--- 378
            +  +  GG I+D+G+T TF+   + + L+ E        RN       +A     P   
Sbjct: 303 EMKENRAGGVIIDTGSTITFLVDSVHKLLSKEV-------RNLLGWSFRQATIEKSPWMQ 355

Query: 379 CF--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--VTDREASGGP 434
           CF   +  +  G FP +  HF  GA++ L   ++F  + + +  C+TV  V+       P
Sbjct: 356 CFYGSISRDLVG-FPVVTFHFSDGADLALDSGSFFNQLND-NVFCMTVGPVSSLNIKSKP 413

Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           S+I G    Q+Y V YDL NQ + F++  C+
Sbjct: 414 SLI-GLLAQQSYNVGYDLVNQFVYFQRIDCE 443


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 100/394 (25%), Positives = 157/394 (39%), Gaps = 62/394 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G + + +  GTPP  I  ++DTGS L+W  C     C  C     P F P  SS+   + 
Sbjct: 66  GQHLMEIYIGTPPIKITGLVDTGSDLIWIQCA---PCLGCYKQIKPMFDPLKSSTYNNIS 122

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
           C +P C       +    C+ E     K C     +Y   YG + LT+G+   +T    +
Sbjct: 123 CDSPLC-----HKLDTGVCSPE-----KRC-----NYTYGYGDNSLTKGVLAQDTATFTS 167

Query: 206 RI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
                  +  FL GC   ++        G+ G G G TSL SQ+       KFS CL+  
Sbjct: 168 NTGKPVSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPF 227

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
              D   +S +    GS        G+  TP V      E++     Y+V L  I+    
Sbjct: 228 -LTDIKISSRMSFGKGSQVLGN---GVVTTPLVPR----EKDT---SYFVTLLGIS---- 272

Query: 313 RVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
              V   Y  ++   G    +VDSGT    +  +L++ +  E     V+N+   + +  +
Sbjct: 273 ---VEDTYFPMNSTIGKANMLVDSGTPPILLPQQLYDKVFAE-----VRNKVALKPITDD 324

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDREA 430
              G + C+       G  P L  HF G   +  P++ +     +   + CL +     +
Sbjct: 325 PSLGTQLCYRTQTNLKG--PTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNS 382

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             G   + GNF   NY + +DL  Q + FK   C
Sbjct: 383 DPG---VYGNFAQSNYLIGFDLDRQVVSFKPTDC 413


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 112/455 (24%), Positives = 180/455 (39%), Gaps = 65/455 (14%)

Query: 23  PSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNIS 82
           P+   +  FS    H N     +   N+   + L     +        +  T  T+ N  
Sbjct: 22  PTEAYNKGFSFKLIHKNSPNSPFYKSNNFHKNKLRSFYQVPKKSFVQKSPYTRVTSNN-- 79

Query: 83  SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
               G Y + L+ G+PP  I  ++DTGS LVW  CT    C  C   K P F P  S + 
Sbjct: 80  ----GDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCT---PCGGCYRQKSPMFEPLRSKTY 132

Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
             + C++ +CS+  +       C+ +         ++C        S +T+G+   E + 
Sbjct: 133 SPIPCESEQCSFFGYS------CSPQ---------KMCAYSYSYADSSVTKGVLAREAIT 177

Query: 203 LPNR-----IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCL 249
             +      ++ + + GC   +S        GI G G G  SL SQ+       +FS CL
Sbjct: 178 FSSTDGDPVVVGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCL 237

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
           +   F     TS  I  N    SD    G+  TP       +E    S  Y V L  I+V
Sbjct: 238 V--PFHTDAHTSGTI--NFGEESDVSGEGVVTTPL-----ASEEGQTS--YLVTLEGISV 286

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           G   VR ++   TL +   G  ++DSGT  T++  E +E L +E     +K ++    + 
Sbjct: 287 GDTFVR-FNSSETLSK---GNIMIDSGTPATYIPQEFYERLVEE-----LKVQSSLLPIE 337

Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
            +   G + C+       G  P L  HF+G     LP++ +  +  +    C  +    +
Sbjct: 338 DDPDLGTQLCYRSETNLEG--PILTAHFEGADVQLLPIQTF--IPPKDGVFCFAMAGSTD 393

Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                  I GNF   N  + +DL  + + FK   C
Sbjct: 394 G----DYIFGNFAQSNILMGFDLDRKTISFKPTDC 424


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 112/412 (27%), Positives = 169/412 (41%), Gaps = 85/412 (20%)

Query: 77  TTTNISSHSY--------GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS 128
           T+ N+ +H++        G + + ++FGTP   I  ILDTGS + W       QCK C  
Sbjct: 108 TSGNLKNHAHNNNLFDEDGNFLVDVAFGTPXTEIXLILDTGSSITW------TQCKAC-- 159

Query: 129 SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS-----Y 183
                 +  L  S+R                       D   +++ +     PS     Y
Sbjct: 160 ------VNCLQDSNRYF---------------------DSSASSTYSFGSCIPSTVENNY 192

Query: 184 LVLYGSGLTE-GIALSETLNL-PNRIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLP 237
            + YG   T  G    +T+ L P+ +   F  GC   +         G+ G G+G+ S  
Sbjct: 193 NMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVDGMLGLGQGQLSTV 252

Query: 238 SQL--NLDK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
           SQ     +K FSYCL     ++ +  S L  +  +S S    + L +T  VN P   +  
Sbjct: 253 SQTASKFNKVFSYCLP----EEDSIGSLLFGEKATSQS----SSLKFTSLVNGPGTLQE- 303

Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
             S YY+V L  I+VG +R+ +           + GTI+DS T  T +    +  L   F
Sbjct: 304 --SGYYFVNLSDISVGNERLNIPSSVFA-----SPGTIIDSRTVITRLPQRAYSALKAAF 356

Query: 355 VSQMVKN--RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
              M K    N  R  G      L  C+++ G K    PE+ LHF GGA+V L   N   
Sbjct: 357 KKAMAKYPLSNGRRKKGDI----LDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTN-IV 411

Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              + S +CL      E +     I+GN Q  +  V YD++ +R+GF    C
Sbjct: 412 WGSDASRLCLAFAGTSELT-----IIGNRQQLSLTVLYDIQGRRIGFGGNGC 458


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 108/403 (26%), Positives = 168/403 (41%), Gaps = 67/403 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+PP+     +DTGS ++W  C +   C   S   I    F    SS++ L
Sbjct: 64  GLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGL 123

Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN- 202
           + C +P C S +     QC          S    Q   ++    GSG T G  +S+TL  
Sbjct: 124 VHCSDPICTSAVQTTVTQC----------SPQTNQCSYTFQYEDGSG-TSGYYVSDTLYF 172

Query: 203 ---LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDK---- 244
              L   ++ N     + GCS   S       +   GI GFG+G+ S+ SQL+       
Sbjct: 173 DAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPR 232

Query: 245 -FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
            FS+CL              IL+           G+ Y+P V  PS         +Y + 
Sbjct: 233 VFSHCLKGEGIGGGILVLGEILE----------PGMVYSPLV--PS-------QPHYNLN 273

Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
           L+ I V G+ + +           + GTIVDSGTT  ++  E ++P    FVS +  N  
Sbjct: 274 LQSIAVNGKLLPIDPSVFA--TSNSQGTIVDSGTTLAYLVAEAYDP----FVSAV--NVI 325

Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAVC 421
            + ++      G   C+ V    +  FP    +F GGA + L  E+Y    G  +G +V 
Sbjct: 326 VSPSVTPIISKG-NQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSV- 383

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +  +  ++  G    ILG+  +++    YDL  QR+G+    C
Sbjct: 384 MWCIGFQKVQG--VTILGDLVLKDKIFVYDLVRQRIGWANYDC 424


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 112/444 (25%), Positives = 175/444 (39%), Gaps = 76/444 (17%)

Query: 44  SYQNLNSLVSSSLTRAL-HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQII 102
           +Y N   L +SS  R L    NP  +        T         G Y+  L  GTP Q  
Sbjct: 53  AYPNATRLPASSARRGLGDGHNPNARMRLHDDLLTN--------GYYTTRLYIGTPSQEF 104

Query: 103 PFILDTGSHLVWFPCTNHYQCKYCSS-------SKIPSFIPKLSSSSRLLGCQNPKCSWI 155
             I+D+GS + + PC    QC    S       +  P F P LSS+   + C N  C+  
Sbjct: 105 ALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKC-NVDCT-C 162

Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
            +E  QC           +   ++  S  VL    ++ G    E+   P R +     GC
Sbjct: 163 DNERSQC--------TYERQYAEMSSSSGVLGEDIMSFG---KESELKPQRAV----FGC 207

Query: 216 S-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLIL 265
                  L S+   GI G GRG+ S+  QL       D FS C         T    ++L
Sbjct: 208 ENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT----MVL 263

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
               +  D      +++  V +P          YY + L+ I V G+ +R+  K      
Sbjct: 264 GGMPAPPDMV---FSHSNPVRSP----------YYNIELKEIHVAGKALRLDPKIF---- 306

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
           +   GT++DSGTT+ ++  + F    D   +++    N  + +          CF   G 
Sbjct: 307 NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKV----NSLKKIRGPDPNYKDICFAGAGR 362

Query: 386 KTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGN 440
                   FP++ + F  G +++L  ENY F       A CL V  + +    P+ +LG 
Sbjct: 363 NVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGK---DPTTLLGG 419

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
             ++N  V YD  N+++GF +  C
Sbjct: 420 IVVRNTLVTYDRHNEKIGFWKTNC 443


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 111/410 (27%), Positives = 159/410 (38%), Gaps = 72/410 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ----------------CKYCSSSKIP 132
           Y  +++ GTPP     + DTGS LVW  C                             + 
Sbjct: 82  YLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEAVV 141

Query: 133 SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC---TQICPSYLVLYGS 189
            F P  SSS   +GC  P C                 LAT+ +C   +  C         
Sbjct: 142 YFNPFDSSSYSRVGCDGPSC---------------LALATNASCNGDSHACDFRYSYRDG 186

Query: 190 GLTEGIALSETLNLPNRI------IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQL 240
               G+  ++T      I        +   GC+  ++    Q  G+ G G G  SL SQL
Sbjct: 187 ASATGLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLASQL 246

Query: 241 NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
              KFS+CL ++  DD    +S IL+ G + +     G   TP + + S A     + YY
Sbjct: 247 GR-KFSFCLTAYDIDD----ASSILNFG-ARAVVSDPGAATTPLIASSSNA-----AAYY 295

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMA-PELFEPLADEFVSQMV 359
            + +  + V GQ V        +        IVD+GT  TF+    L  PL  E +++++
Sbjct: 296 AISIDSLKVAGQPVPGTTSVSKV--------IVDTGTVLTFLDRAALLAPLT-ESLARVM 346

Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEK--TGSFPE--LKLHFKGGAEVTLPVENYFAVVG 415
                 RA   +    L  C+DV   K   G  P+  L L   GG EV L  E  F +V 
Sbjct: 347 DGAGLPRAPPPDET--LELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVK 404

Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           EG  +CL VVT       P  +LGN  +Q+ +V  DL  +   F    C 
Sbjct: 405 EG-VLCLAVVT-TSPELQPLSVLGNVALQDLHVGIDLDARTATFATANCD 452


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 107/399 (26%), Positives = 169/399 (42%), Gaps = 74/399 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  L  GTPPQ+   I+D+GS + + PC++   C+ C   + P F P+LSS+ + + 
Sbjct: 92  GYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSD---CEQCGKHQDPKFQPELSSTYQPVK 148

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C N  C+    +  QC    +   A   +   +    L+ +G         +E+   P R
Sbjct: 149 C-NMDCN-CDDDKEQC--VYEREYAEHSSSKGVLGEDLISFG---------NESQLTPQR 195

Query: 207 IIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF-----DD 256
            +     GC       L S++  GI G G+G  SL  QL +DK    L+S+ F       
Sbjct: 196 AV----FGCETVETGDLYSQRADGIIGLGQGDLSLVDQL-VDK---GLISNSFGLCYGGM 247

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
                S+IL      SD           +   S  +R   S YY + L  I V G+++ +
Sbjct: 248 DVGGGSMILGGFDYPSD----------MIFTDSDPDR---SPYYNIDLTGIRVAGKKLSL 294

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
             +      DG  G ++DSGTT+ ++ P+      +E V + V           + + G 
Sbjct: 295 NSRVF----DGEHGAVLDSGTTYAYL-PDAAFAAFEEAVMREVS--------PLKQIDGP 341

Query: 377 RP-----CFDVPG-----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV 425
            P     CF V       E +  FP +++ FK G    L  ENY F       A CL V 
Sbjct: 342 DPNFKDTCFLVAASNDVSELSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVF 401

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + +     + +LG   ++N  V YD  N ++GF +  C
Sbjct: 402 PNGKDH---TTLLGGIVVRNTLVVYDRENSKVGFWRTNC 437


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 102/420 (24%), Positives = 157/420 (37%), Gaps = 58/420 (13%)

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
           H +    K  ++  +     ++     G Y   +  GTPP+     +DTGS L+W  C  
Sbjct: 7   HDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHP 66

Query: 120 HYQCKYCSSSKIPSFIP---KLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
              C   S  KIP  +P   K S+SS  + C +P C+ I    I    CND+       C
Sbjct: 67  CIGCPAFSDLKIP-IVPYDVKASASSSKVPCSDPSCTLITQ--ISESGCNDQ-----NQC 118

Query: 177 TQICPSYLVLYGSGL-TEGIALSETLNLPNRIIPNFLVGCSV-------LSSRQPAGIAG 228
                 Y   YG G  T G  + + L+         + GC          S R   GI G
Sbjct: 119 -----GYSFQYGDGSGTLGYLVEDVLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIG 173

Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
           FG    S  SQL     +  + +H  D   R   +++       D     + YTP V   
Sbjct: 174 FGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPD-----IQYTPLV--- 225

Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
                  +  +Y V L+ I+V    + +  K  +   D   GTI DSGTT  ++  E ++
Sbjct: 226 ------PYMYHYNVVLQSISVNNANLTIDPKLFS--NDVMQGTIFDSGTTLAYLPDEAYQ 277

Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVE 408
               + VS +V          +  +  L             FP + L+F+G +    P E
Sbjct: 278 AFT-QAVSLVVAPFLLCDTRLSRFIYKL-------------FPNVVLYFEGASMTLTPAE 323

Query: 409 NYFAVVGEGSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                    +A    +      S    +   I G+  ++N  V YDL   R+G++   CK
Sbjct: 324 YLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDCK 383


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 109/461 (23%), Positives = 191/461 (41%), Gaps = 78/461 (16%)

Query: 34  SRFHTNPSQDSYQNL---NSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY---- 86
           S FH + S     NL   N + S    +  HIK    +         T +I +H      
Sbjct: 20  SVFHLSASPTLVLNLVHSNQIYSLQSPQVSHIKEASVERLEYLKAKATGDIIAHLSPNVP 79

Query: 87  ---GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
                + +++S G+PP      +DT S L+W  C     C  C +  +P F P  S + R
Sbjct: 80  IIPQAFLVNISIGSPPVTQLLHMDTASDLLWLQCR---PCINCYAQSLPIFDPSRSYTHR 136

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
              C+  + S     S++         A +++C      Y + Y  G      L++ + +
Sbjct: 137 NESCRTSQYSM---PSLRFN-------AKTRSC-----EYSMRYMDGTGSKGILAKEMLM 181

Query: 204 PNRI--------IPNFLVGCSVLSSRQP---AGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
            N I        + + + GC   +  +P    GI G G G+ SL  +    KFSYC  S 
Sbjct: 182 FNTIYDESSSAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFGT-KFSYCFGS- 239

Query: 253 KFDDTTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
             DD +   + L+L +  ++    TT L                ++ +YYV +  I+V G
Sbjct: 240 -LDDPSYPHNVLVLGDDGANILGDTTPL--------------EIYNGFYYVTIEAISVDG 284

Query: 312 QRVRV--WHKYLTLDRD---GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
             + +  W      +R+   G GGTI+D+G + T +  E ++PL ++ +    + R    
Sbjct: 285 IILPIDPW----VFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNK-IEDYFEGRFTAA 339

Query: 367 ALGAEALTGLRPCFDVPGEK---TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
            +  + +  +  C++   E+      FP +  HF  GAE++L V++ F  +   +  CL 
Sbjct: 340 DVNQDDMFKVE-CYNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSP-NVFCLA 397

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           V      S      +G    Q+Y + YDL  +++ F++  C
Sbjct: 398 VTPGNMNS------IGATAQQSYNIGYDLEAKKISFERIDC 432


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 101/401 (25%), Positives = 158/401 (39%), Gaps = 73/401 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  +  GTP Q    I+DTGS + + PC++   C +  +   P F P  SSS + + 
Sbjct: 97  GYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVS 156

Query: 147 CQNPKC--SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
           C +P C          QC+   +   A   +   +    L+ +G+G              
Sbjct: 157 CNSPDCITKMCDARVHQCK--YERVYAEMSSSKGVLGKDLLGFGNG-------------- 200

Query: 205 NRIIPN-FLVGCSVLSS-----RQPAGIAGFGRGKTSLPSQL-----NLDKFSYCLLSHK 253
           +R+ P+  L GC    +     +   GI G GRG  S+  QL       D FS C     
Sbjct: 201 SRLQPHPLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGG-- 258

Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
                      +D G         G    P     + ++ N  S YY + L  I V G  
Sbjct: 259 -----------MDEGGG---SMVLGAIPPPPAMVFAKSDPNR-SNYYNLELSEIQVQGVS 303

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
           + V  +      +G  GT++DSGTT+ ++  + F+   D    Q+            +A+
Sbjct: 304 LNVPSEVF----NGRLGTVLDSGTTYAYLPDKAFDAFKDAITQQLGS---------LQAV 350

Query: 374 TGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLT 423
            G  P     CF   G  + +    FP +   F G  +V L  ENY F       A CL 
Sbjct: 351 PGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLG 410

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              +++A    + +LG   ++N  V YD  N ++GF +  C
Sbjct: 411 FFKNQDA----TTLLGGIVVRNTLVTYDRANHQIGFFKTNC 447


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 112/444 (25%), Positives = 175/444 (39%), Gaps = 76/444 (17%)

Query: 44  SYQNLNSLVSSSLTRAL-HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQII 102
           +Y N   L +SS  R L    NP  +        T         G Y+  L  GTP Q  
Sbjct: 54  AYPNATRLPASSARRGLGDGHNPNARMRLHDDLLTN--------GYYTTRLYIGTPSQEF 105

Query: 103 PFILDTGSHLVWFPCTNHYQCKYCSS-------SKIPSFIPKLSSSSRLLGCQNPKCSWI 155
             I+D+GS + + PC    QC    S       +  P F P LSS+   + C N  C+  
Sbjct: 106 ALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKC-NVDCT-C 163

Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
            +E  QC           +   ++  S  VL    ++ G    E+   P R +     GC
Sbjct: 164 DNERSQC--------TYERQYAEMSSSSGVLGEDIMSFG---KESELKPQRAV----FGC 208

Query: 216 S-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLIL 265
                  L S+   GI G GRG+ S+  QL       D FS C         T    ++L
Sbjct: 209 ENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT----MVL 264

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
               +  D      +++  V +P          YY + L+ I V G+ +R+  K      
Sbjct: 265 GGMPAPPDMV---FSHSNPVRSP----------YYNIELKEIHVAGKALRLDPKIF---- 307

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
           +   GT++DSGTT+ ++  + F    D   +++    N  + +          CF   G 
Sbjct: 308 NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKV----NSLKKIRGPDPNYKDICFAGAGR 363

Query: 386 KTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGN 440
                   FP++ + F  G +++L  ENY F       A CL V  + +    P+ +LG 
Sbjct: 364 NVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGK---DPTTLLGG 420

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
             ++N  V YD  N+++GF +  C
Sbjct: 421 IVVRNTLVTYDRHNEKIGFWKTNC 444


>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
 gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 105/404 (25%), Positives = 171/404 (42%), Gaps = 77/404 (19%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           +++S G PP +    +DTGS L W    PC  H  C   S+   P F P  S +SR + C
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58

Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
            + KC  + ++  +Q  +C ++      +CT     Y V YG+G   + G  +++TL + 
Sbjct: 59  SSVKCGELRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109

Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
           +  + + + GCS  V  S   AGI GFG             P  L+    SYCL +    
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPT---- 164

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
           D T+   +IL       D+      YTP    +N P+          Y + +  +   GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+             +   IVDSG   T + P  F  L D+ ++Q + +  Y R   A  
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259

Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
            + +  C+    + +G            + P L++ F GGA + LP  N F        +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVF-YNDPHRGL 316

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           C+T   +       S ILGN   +++   +D++ ++ GFK  +C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAVC 357


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 119/470 (25%), Positives = 192/470 (40%), Gaps = 72/470 (15%)

Query: 15  FFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTT 74
           F ++  I  + + +  FS+   H +  +    N +   +  L R    +   + +  + +
Sbjct: 19  FVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDR--FFRRFMSFSEASIS 76

Query: 75  TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
             T     S + G Y + +S GTPP  +  I DTGS L+W  C     C  C   K P F
Sbjct: 77  PNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQC---LPCLSCYKQKNPMF 133

Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ---ICPSYLVLYGSG- 190
            P  S+S + + C          ES QCR      L  + +C+Q   +C  +   YG G 
Sbjct: 134 DPSKSTSFKEVSC----------ESQQCR------LLDTVSCSQPQKLC-DFSYGYGDGS 176

Query: 191 LTEGIALSETLNLPNR-----IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQL- 240
           L +G+  +ETL L +       I N + GC   +S        G+ G G    SL SQ+ 
Sbjct: 177 LAQGVIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIM 236

Query: 241 ----NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERN 294
               +  KFS CL+  + D +  TS +I       ++   + +  TP V  ++P+     
Sbjct: 237 STLGSGRKFSQCLVPFRTDPSI-TSKIIF---GPEAEVSGSXVVSTPLVTKDDPT----- 287

Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
               YY+V L  I+V G ++  +     +   GN    +D+GT  T         L  +F
Sbjct: 288 ----YYFVTLDGISV-GDKLFPFSSSSPMATKGN--VFIDAGTPPTL--------LPRDF 332

Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
            +++V+       +       L+P            P L  HF  GA+V L   N F   
Sbjct: 333 YNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAHFD-GADVQLKPLNTFISP 391

Query: 415 GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            EG   C  +    +   G + I GNF   N+ + +DL  +++ FK   C
Sbjct: 392 KEG-VYCFAM----QPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436


>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
 gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 106/404 (26%), Positives = 170/404 (42%), Gaps = 77/404 (19%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           +++S G PP +    +DTGS L W    PC  H  C   S+   P F P  S +SR + C
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58

Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
            + KC    ++  +Q  +C ++      +CT     Y V YG+G   + G  +++TL + 
Sbjct: 59  SSVKCGEPRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109

Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
           +  + + + GCS  V  S   AGI GFG             P  L+   FSYCL +    
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT---- 164

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
           D T+   +IL       D+      YTP    +N P+          Y + +  +   GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+             +   IVDSG   T + P  F  L D+ ++Q + +  Y R   A  
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259

Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
            + +  C+    + +G            + P L++ F GGA + LP  N F        +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVF-YNDPHRGL 316

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           C+T   +       S ILGN   +++   +D++ ++ GFK   C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAAC 357


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 102/398 (25%), Positives = 169/398 (42%), Gaps = 67/398 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI--PSFIPKLSSSSRLLG 146
           + ++ S G PP     I+DTGS L+W  C   + CK+CSS+ +  P F P LSS+     
Sbjct: 68  FFVNFSVGQPPVPQFTIMDTGSSLLWIQC---HPCKHCSSNHMIHPVFNPALSSTFVECS 124

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-- 203
           C +  C +                A + +C+     Y  +Y SG  ++G+   E L    
Sbjct: 125 CDDRFCRY----------------APNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTT 168

Query: 204 PN---RIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPSQLNLDKFSYCL--LSHKF 254
           PN    +      GC   +  Q      GI G G   TSL  QL   KFSYC+  L++K 
Sbjct: 169 PNGNTVVTQPIAFGCGHENGEQLESEFTGILGLGAKPTSLAVQLG-SKFSYCIGDLANK- 226

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
                 + L+L   +               + +P+  E    +  YY+ L  I+VG +++
Sbjct: 227 --NYGYNQLVLGEDAD-------------ILGDPTPIEFETENGIYYMNLEGISVGDKQL 271

Query: 315 RVWHKYLTLDRDGN-GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
            +  + +   R G+  G I+D+GT +T++A   +  L +E  S  + +    R    + L
Sbjct: 272 NI--EPVVFKRRGSRTGVILDTGTLYTWLADIAYRELYNEIKS--ILDPKLERFWFRDFL 327

Query: 374 TGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG----SAVCLTVVTDR 428
                C+     E+   FP +  HF GGAE+ +   + F  + E     +  C++V    
Sbjct: 328 -----CYHGRVNEELIGFPVVTFHFAGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTT 382

Query: 429 EASGGPS--IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E  G       +G    Q Y + YDL+ + +  ++  C
Sbjct: 383 EHGGEYKDFTAIGLMAQQYYNIAYDLKERNIYLQRIDC 420


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 103/397 (25%), Positives = 169/397 (42%), Gaps = 61/397 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y   +  G P Q +  I+DTGS ++W  C+    C+ C S +    IP LS  +  L 
Sbjct: 81  GLYYTEIGLGNPVQKLKVIVDTGSDILWVKCS---PCRSCLSKQ--DIIPPLSIYN--LS 133

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIA-----LSETL 201
             +        + +    C  E    S++ +    +Y + Y    T   A     +   L
Sbjct: 134 ASSTSSVSSCSDPL----CTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVL 189

Query: 202 NLPNRIIPNFLVGCSV-LSSRQPA-GIAGFGRGKTSLPSQL----NLDK-FSYCLLSHKF 254
              N    +   GC++ ++   PA GI GFG+   ++P+Q+    N+ + FS+CL   K 
Sbjct: 190 QGGNATTSHIFFGCAINITGSWPADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKH 249

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
                    IL+ G    +  TT + +TP +N          + +Y V L  I+V  + +
Sbjct: 250 GGG------ILEFG---EEPNTTEMVFTPLLN---------VTTHYNVDLLSISVNSKVL 291

Query: 315 RVWHKYLTL--DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            +  K  +   +     G I+DSGT+F  +A +    L  E        +N T A     
Sbjct: 292 PIDSKEFSYVSNSTNETGVIIDSGTSFALLATKANRILFSEI-------KNLTTAKLGPK 344

Query: 373 LTGLRPCFDVPGEKT--GSFPELKLHFKGGAEVTLPVENYFAVV---GEGSAVCLTVVTD 427
           L GL+ CF +    T   SFP + L F GG+ + L  +NY  +V    + +  C      
Sbjct: 345 LEGLQ-CFYLKSGLTVETSFPNVTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAW--- 400

Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +S     I G   +++  V YD+ N+R+G+K Q C
Sbjct: 401 --SSADGLTIFGEIVLKDKLVFYDVENRRIGWKGQNC 435


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 106/398 (26%), Positives = 166/398 (41%), Gaps = 67/398 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI--PSFIPKLSSSSRLLG 146
           + ++ S G PP     I+DTGS L+W  C     CK+CSS  +  P F P LSS+     
Sbjct: 96  FLVNFSVGQPPVPQLTIMDTGSSLLWIQCQ---PCKHCSSDHMIHPVFNPALSSTFVECS 152

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-- 203
           C +            CR   +    +S  C      Y  +Y SG  ++G+   E L    
Sbjct: 153 CDDRF----------CRYAPNGHCGSSNKCV-----YEQVYISGTGSKGVLAKERLTFTT 197

Query: 204 PN---RIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPSQLNLDKFSYCL--LSHKF 254
           PN    +      GC   +  Q      GI G G   TSL  QL   KFSYC+  L++K 
Sbjct: 198 PNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAVQLG-SKFSYCIGDLANK- 255

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
                 + L+L   +               + +P+  E    +  YY+ L  I+VG  ++
Sbjct: 256 --NYGYNQLVLGEDAD-------------ILGDPTPIEFETENSIYYMNLEGISVGDTQL 300

Query: 315 RVWHKYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
            +  + +   R G   G I+DSGT +T++A   +  L +E  S  + +    R    + L
Sbjct: 301 NI--EPVVFKRRGPRTGVILDSGTLYTWLADIAYRELYNEIKS--ILDPKLERFWFRDFL 356

Query: 374 TGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG---SAVCLTVVTDR 428
                C+   V  E  G FP +  HF GGAE+ +   + F  + E    +  C++V   +
Sbjct: 357 -----CYHGRVSEELIG-FPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTK 410

Query: 429 EASG--GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           E  G       +G    Q Y + YDL+ + +  ++  C
Sbjct: 411 EHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDC 448


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 100/411 (24%), Positives = 158/411 (38%), Gaps = 83/411 (20%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  GTPP+     +DTGS ++W  C    QCK C +       +  +  K SSS
Sbjct: 83  GLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCI---QCKECPTRSNLGMDLTLYDIKESSS 139

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
            + + C    C  I           +  L T       CP YL +YG G +      + +
Sbjct: 140 GKFVPCDQEFCKEI-----------NGGLLTGCTANISCP-YLEIYGDGSSTAGYFVKDI 187

Query: 202 NLPNRIIPNF---------LVGC------SVLSSRQPA--GIAGFGRGKTSLPSQLN--- 241
            L +++  +          + GC       + SS + A  GI GFG+  +S+ SQL    
Sbjct: 188 VLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSG 247

Query: 242 --LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN-NPSVAERNAFSV 298
                F++CL                 NG +       G    P VN  P + ++  +SV
Sbjct: 248 KVKKMFAHCL-----------------NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSV 290

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRD-----GNGGTIVDSGTTFTFMAPELFEPLADE 353
                   +T     V+V H +L+L  D        GTI+DSGTT  ++   ++EPL  +
Sbjct: 291 -------NMTA----VQVGHAFLSLSTDTSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYK 339

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
            +SQ          L    L     CF         FP +  +F+ G  + +   +Y   
Sbjct: 340 IISQHPD-------LKVRTLHDEYTCFQYSESVDDGFPAVTFYFENGLSLKVYPHDYLFP 392

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            G+   +       +        +LG+  + N  V YDL NQ +G+ +  C
Sbjct: 393 SGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNC 443


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 102/420 (24%), Positives = 157/420 (37%), Gaps = 58/420 (13%)

Query: 61  HIKNPQTKTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
           H +    K  ++  +     ++     G Y   +  GTPP+     +DTGS L+W  C  
Sbjct: 7   HDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHP 66

Query: 120 HYQCKYCSSSKIPSFIP---KLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
              C   S  KIP  +P   K S+SS  + C +P C+ I    I    CND+       C
Sbjct: 67  CIGCPAFSDLKIP-IVPYDVKASASSSKVPCSDPSCTLITQ--ISESGCNDQ-----NQC 118

Query: 177 TQICPSYLVLYGSGL-TEGIALSETLNLPNRIIPNFLVGCSV-------LSSRQPAGIAG 228
                 Y   YG G  T G  + + L+         + GC          S R   GI G
Sbjct: 119 -----GYSFQYGDGSGTLGYLVEDVLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIG 173

Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
           FG    S  SQL     +  + +H  D   R   +++       D     + YTP V   
Sbjct: 174 FGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPD-----IQYTPLV--- 225

Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
                  +  +Y V L+ I+V    + +  K  +   D   GTI DSGTT  ++  E ++
Sbjct: 226 ------PYMSHYNVVLQSISVNNANLTIDPKLFS--NDVMQGTIFDSGTTLAYLPDEAYQ 277

Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVE 408
               + VS +V          +  +  L             FP + L+F+G +    P E
Sbjct: 278 AFT-QAVSLVVAPFLLCDTRLSRFIYKL-------------FPNVVLYFEGASMTLTPAE 323

Query: 409 NYFAVVGEGSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
                    +A    +      S    +   I G+  ++N  V YDL   R+G++   CK
Sbjct: 324 YLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDCK 383


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 99/380 (26%), Positives = 159/380 (41%), Gaps = 58/380 (15%)

Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIP-SFIPKL-SSSSRLLGCQNPKC-SWIHHESIQC 162
           +DTGS ++W  C     C   S   I  +F   + SS++ L+ C +  C S +   + +C
Sbjct: 85  IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAEC 144

Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRII---------PNFL 212
                     S    Q   SY   YG G  T G  +S+ +   N I+            +
Sbjct: 145 ----------SPRVNQC--SYTFQYGDGSGTSGYYVSDAMYF-NLIMGQPPAVNSTATIV 191

Query: 213 VGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD-DTTRTSSLI 264
            GCS+  S       +   GI GFG G  S+ SQL+    +  + SH    D      L+
Sbjct: 192 FGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGILV 251

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L       +     + Y+P V  PS         +Y + L+ I V GQ + +     ++ 
Sbjct: 252 L------GEILEPSIVYSPLV--PS-------QPHYNLNLQSIAVNGQPLPINPAVFSIS 296

Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
            +  GGTIVD GTT  ++  E ++PL     + + ++   T + G +       C+ V  
Sbjct: 297 NN-RGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ-------CYLVST 348

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
                FP + L+F+GGA + L  E Y    G      +  V  ++   G S ILG+  ++
Sbjct: 349 SIGDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGAS-ILGDLVLK 407

Query: 445 NYYVEYDLRNQRLGFKQQLC 464
           +  V YD+  QR+G+    C
Sbjct: 408 DKIVVYDIAQQRIGWANYDC 427


>gi|116666775|pdb|2B42|A Chain A, Crystal Structure Of The Triticum Xylanse Inhibitor-I In
           Complex With Bacillus Subtilis Xylanase
          Length = 381

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 156/396 (39%), Gaps = 88/396 (22%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL-------GCQNPKCSWIH 156
            +LD    LVW           C   + P+ IP  SS + LL       GC  P C    
Sbjct: 26  LVLDVAGPLVW---------STCKGGQPPAEIP-CSSPTCLLANAYPAPGCPAPSCGSDK 75

Query: 157 HESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
           H+    + C   P    S  C     S+   + +  T+G      +N+        L  C
Sbjct: 76  HD----KPCTAYPYNPVSGACAAGSLSH-TRFVANTTDGSKPVSKVNV------GVLAAC 124

Query: 216 S---VLSS--RQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
           +   +L+S  R   G+AG      +LP+Q+       ++F  CL       T      I 
Sbjct: 125 APSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCL------PTGGPGVAIF 178

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
             G     + T  + YTP V           S  +Y+  R I VG  RV V    L    
Sbjct: 179 GGGPVPWPQFTQSMPYTPLVTK-------GGSPAHYISARSIVVGDTRVPVPEGALA--- 228

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP------- 378
              GG ++ +   +  + P+++ PL D F          T+AL A+   G          
Sbjct: 229 --TGGVMLSTRLPYVLLRPDVYRPLMDAF----------TKALAAQHANGAPVARAVVAV 276

Query: 379 -----CFDVP--GEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
                C+D    G   G +  P ++L   GG++ T+  +N    V +G+A C+  V  + 
Sbjct: 277 APFGVCYDTKTLGNNLGGYAVPNVQLGLDGGSDWTMTGKNSMVDVKQGTA-CVAFVEMKG 335

Query: 430 ASGG----PSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
            + G    P++ILG  QM+++ +++D+  +RLGF +
Sbjct: 336 VAAGDGRAPAVILGGAQMEDFVLDFDMEKKRLGFSR 371


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 95/401 (23%), Positives = 174/401 (43%), Gaps = 62/401 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  G P +     +DTGS ++W  C+    C  C  S     ++  F    SSS
Sbjct: 82  GLYFTKVKLGNPAREFNVQIDTGSDILWVTCS---PCDGCPDSSGLGIELNLFDTTKSSS 138

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
           +R+L C +P C+ +   + QC       L  + +C+    S+     SG T G  +++++
Sbjct: 139 ARVLPCTDPICAAVSTTTDQC-------LTQTDHCSY---SFHYRDRSG-TSGFYVTDSM 187

Query: 202 N----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFS 246
           +    L    I N     + GCS+        +++   GI GFG+G+ S+ SQL+    +
Sbjct: 188 HFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGIT 247

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
             + SH          +++       +     + Y+P +  PS         +Y + L+ 
Sbjct: 248 PKVFSHCLKGGENGGGILV-----LGEILEPSIVYSPLI--PS-------QPHYTLKLQS 293

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I + GQ   ++           G TI+DSGTT  ++  E+++ +     S + ++   T 
Sbjct: 294 IALSGQ---LFPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTI 350

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---AVVGEGSAVCLT 423
           + G++       CF V       FP L+ +F+G A + +  E Y    ++V       L 
Sbjct: 351 SRGSQ-------CFRVSMSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLW 403

Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            +  ++A  G + ILG+  +++  + YDL  QR+G+    C
Sbjct: 404 CIGFQKAEDGLN-ILGDLVLKDKIIVYDLAQQRIGWANYDC 443


>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
 gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
          Length = 410

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 98/394 (24%), Positives = 164/394 (41%), Gaps = 67/394 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G YS+ L+ G PP+   F +DTGS L W  C     CK C+  +   + PK    + L+ 
Sbjct: 52  GYYSVILNIGNPPKAFDFDIDTGSDLTWVQC--DAPCKGCTKPRDKLYKPK----NNLVP 105

Query: 147 CQNPKCSWIH-HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--LNL 203
           C N  C  +   E+  C   +D+           C   +     G + G+ LS++  L L
Sbjct: 106 CSNSLCQAVSTGENYHCDAPDDQ-----------CDYEIEYADLGSSIGVLLSDSFPLRL 154

Query: 204 PNRII--PNFLVGCSV----LSSRQP---AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
            N  +  P    GC      L    P   AGI G GRGK S+ SQL     +  ++ H F
Sbjct: 155 SNGTLLQPKMAFGCGYDQKHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCF 214

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
               R   L   +    S +    +T+TP +       R++    Y  G   +  GG+  
Sbjct: 215 -SRARGGFLFFGDHLFPSSR----ITWTPML-------RSSSDTLYSSGPAELLFGGKPT 262

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
            +  K L L        I DSG+++T+   ++++ + +     +V+     + L      
Sbjct: 263 GI--KGLQL--------IFDSGSSYTYFNAQVYQSILN-----LVRKDLAGKPLKDAPEK 307

Query: 375 GLRPCFDVPGEKTGSFPELKLHFK---------GGAEVTLPVENYFAVVGEGSAVCLTVV 425
            L  C+    +   S  ++K +FK            ++ L  E+Y  +  +G+ VCL ++
Sbjct: 308 ELAVCWKT-AKPIKSILDIKSYFKPLTISFMNAKNVQLQLAPEDYLIITKDGN-VCLGIL 365

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
              E   G   ++G+  MQ+  V YD   Q++G+
Sbjct: 366 NGSEQQLGNFNVIGDIFMQDRVVIYDNEKQQIGW 399


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 90/374 (24%), Positives = 139/374 (37%), Gaps = 91/374 (24%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L  GTPP  +  +LDTGS L+W  C     C +C   K P F P  SS+ +   C 
Sbjct: 65  YLMKLQIGTPPFEVEAVLDTGSELIWTQC---LPCLHCYDQKAPIFDPSKSSTFKETRCN 121

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
            P  S                          CP  LV      T+G   +ET+ + +   
Sbjct: 122 TPDHS--------------------------CPYKLVYDDKSYTQGTLATETVTIHSTSG 155

Query: 207 ---IIPNFLVGCSVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
              ++P  ++GCS  +S        +GI G  RG  SL SQ+                  
Sbjct: 156 VPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQM------------------ 197

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VW 317
                    G+   D           V + ++  + A    YY+ L  ++VG  R+  V 
Sbjct: 198 --------GGAYPGDG----------VVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVG 239

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
             +  L    NG  ++DSGT  T+  P  +  L  + V ++V           + L    
Sbjct: 240 TPFHAL----NGNIVIDSGTPLTYF-PVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYS 294

Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
              ++       FP + +HF GGA++ L   N +  +  G   CL ++ +         I
Sbjct: 295 NTIEI-------FPVITVHFSGGADLVLDKYNMYMELNRGGVFCLAIICNNPTQ---VAI 344

Query: 438 LGNFQMQNYYVEYD 451
            GN    N+ V YD
Sbjct: 345 FGNRAQNNFLVGYD 358



 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 94/383 (24%), Positives = 150/383 (39%), Gaps = 63/383 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L  GTPP  I  ++DTGS + W  C     C +C     P F P  SS+ +   C 
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQC---LPCVHCYKQNAPIFDPSKSSTFKEKRCH 436

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +  C +             E     K  T+         G+  T+ + +  T   P  ++
Sbjct: 437 DHSCPY-------------EVDYFDKTYTK---------GTLATDTVTIHSTSGEP-FVM 473

Query: 209 PNFLVGCSVLSSR-QPA--GIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTRTSS 262
              ++GC   +S  +P+  G  G   G  SL +Q+  +     SYC              
Sbjct: 474 AETIIGCGRNNSWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAG----------- 522

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYL 321
               NG+S  +  T  +     V + ++    A   +YY+ L  ++VG  R+  +   + 
Sbjct: 523 ----NGTSKINFGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFH 578

Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
            L+    G  ++DSGTT T+  PE +  L  + V  +V         G + L     C+ 
Sbjct: 579 ALE----GNIVIDSGTTLTYF-PESYCNLVRQAVEHVVPAVPAADPTGNDLL-----CYY 628

Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
                T  FP + +HF GGA++ L   N F     G   CL ++ +         I GN 
Sbjct: 629 --SNTTEIFPVITMHFSGGADLVLDKYNMFMESYSGGLFCLAIICNNPTQ---EAIFGNR 683

Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
              N+ V YD  +  + FK   C
Sbjct: 684 AQNNFLVGYDSSSLLVSFKPTNC 706


>gi|115463625|ref|NP_001055412.1| Os05g0384300 [Oryza sativa Japonica Group]
 gi|50511407|gb|AAT77330.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578963|dbj|BAF17326.1| Os05g0384300 [Oryza sativa Japonica Group]
 gi|222631434|gb|EEE63566.1| hypothetical protein OsJ_18383 [Oryza sativa Japonica Group]
          Length = 477

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 152/380 (40%), Gaps = 43/380 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF--IPKLSSSSRL 144
           G Y I++  GTPPQ +    D  S  VW PC        C S K   +  +P+      L
Sbjct: 85  GTYLITVGVGTPPQYVYGAFDISSQFVWVPCEECVSPYSCPSDKTGVYKTLPR-----EL 139

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
             C   +C  I  +     DC       +  C   C  Y    G+     + L +   L 
Sbjct: 140 YSCGEQRCRTIVGQP----DCG---APYNGPCKYTC-RYGGAGGTETEGHLGL-QPFTLG 190

Query: 205 NRIIP-NFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
           +  +P N + GC  L      G+ G  RG+ SL SQL L +FSY   + ++DDT   ++ 
Sbjct: 191 DNTMPVNMIFGCG-LEPETNFGVIGLNRGRLSLISQLQLGRFSY-YFAPEYDDTAAGNAS 248

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
            +  G  ++  +T+   YT F +     E  A+S  Y VGL  + VG          L +
Sbjct: 249 FILFG-EYAVPQTSNPRYTQFWSY----ENGAYSYLYLVGLSGMRVGSNN-------LNM 296

Query: 324 DRDGNGG-----TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              G+GG       + +    TF+    ++ L  E VS +  +     AL      GL  
Sbjct: 297 LGAGSGGRDPLVAYLSTSVPITFLEKNAYDLLRRELVSTVGSDTVDGSAL------GLDL 350

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+         FP + L F  GA + L   NY          CLT++    A GG S++ 
Sbjct: 351 CYTSQYLAKAKFPAMALVFWDGAVMELQPRNYLYQDTATGLECLTILPTAVA-GGLSLLG 409

Query: 439 GNFQMQNYYVEYDLRNQRLG 458
              Q   + + YD++ Q  G
Sbjct: 410 SLIQTGTHMMYYDIQIQGRG 429


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 109/398 (27%), Positives = 155/398 (38%), Gaps = 72/398 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
           Y ++L  GTP      ++DTGS L W       QCK C +      K P F P  SSS  
Sbjct: 118 YVVTLGIGTPAVQQIVLIDTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSYA 171

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN 202
            + C +  C  +   +     C       +     +C  Y + YG+   T G+  +ETL 
Sbjct: 172 SVPCDSDACRKLAAGAYG-HGC-------TSGAAALC-EYGIEYGNRATTTGVYSTETLT 222

Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
           L P  ++ +F  GC         +  G+ G G    SL SQ +      FSYCL      
Sbjct: 223 LKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL------ 276

Query: 256 DTTRTSSLILDNGSSHSDKKTT---GLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
             T   +  L  G+ +S   +T   G  +TP    PSV        +Y V L  I+VGG 
Sbjct: 277 PPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSV------PTFYVVTLTGISVGGA 330

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
            + V     +       G ++DSGT  T +    +  L   F S M + R    + GA  
Sbjct: 331 PLAVPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAV- 383

Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP------VENYFAVVGEGSAVCLTVVT 426
              L  C+D  G    + P + L F GGA + L       V+   A  G G        T
Sbjct: 384 ---LDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVLVDGCLAFAGAG--------T 432

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           D         I+GN   + + V YD     +GF+   C
Sbjct: 433 DDTIG-----IIGNVNQRTFEVLYDSGKGTVGFRAGAC 465


>gi|125552158|gb|EAY97867.1| hypothetical protein OsI_19787 [Oryza sativa Indica Group]
          Length = 477

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 152/380 (40%), Gaps = 43/380 (11%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF--IPKLSSSSRL 144
           G Y I++  GTPPQ +    D  S  VW PC        C S K   +  +P+      L
Sbjct: 85  GTYLITVGVGTPPQYVYGAFDISSQFVWVPCEECVSPYSCPSDKTGVYKTLPR-----EL 139

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
             C   +C  I  +     DC       +  C   C  Y    G+     + L +   L 
Sbjct: 140 YSCGEQRCRTIVGQP----DCG---APYNGPCKYTC-RYGGAGGTETEGHLGL-QPFTLG 190

Query: 205 NRIIP-NFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
           +  +P N + GC  L      G+ G  RG+ SL SQL L +FSY   + ++DDT   ++ 
Sbjct: 191 DNTMPVNMIFGCG-LEPETNFGVIGLNRGRLSLISQLQLGRFSY-YFAPEYDDTAAGNAS 248

Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
            +  G  ++  +T+   YT F +     E  A+S  Y VGL  + VG          L +
Sbjct: 249 FILFG-EYAVPQTSNPRYTQFWSY----ENGAYSYLYLVGLSGMRVGSNN-------LNM 296

Query: 324 DRDGNGG-----TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              G+GG       + +    TF+    ++ L  E VS +  +     AL      GL  
Sbjct: 297 LGAGSGGRDPLVAYLSTSVPVTFLEKNAYDLLRRELVSTVGSDTVDGSAL------GLDL 350

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           C+         FP + L F  GA + L   NY          CLT++    A GG S++ 
Sbjct: 351 CYTSQYLAKAKFPAMALVFWDGAVMELQPRNYLYQDTATGLECLTILPTAVA-GGLSLLG 409

Query: 439 GNFQMQNYYVEYDLRNQRLG 458
              Q   + + YD++ Q  G
Sbjct: 410 SLIQTGTHMMYYDIQIQGRG 429


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 147/375 (39%), Gaps = 55/375 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y IS+  G+P      ++DTGS + W  C        C +     F P  SS+     C 
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 167

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLP-NR 206
              C+ +  +S +   C+      +K+  Q    Y+V YG G  T G   S+ L L  + 
Sbjct: 168 AAACAQL-GDSGEANGCD------AKSRCQ----YIVKYGDGSNTTGTYSSDVLTLSGSD 216

Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
           ++  F  GCS          +  G+ G G    S  SQ        F YCL +      T
Sbjct: 217 VVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPA------T 270

Query: 259 RTSS--LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
             SS  L L   +S      +    TP + +  V        YY+  L  I VGG+++ +
Sbjct: 271 PASSGFLTLGAPASGGGGGASRFATTPMLRSKKV------PTYYFAALEDIAVGGKKLGL 324

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                        G++VDSGT  T + P  +  L+  F + M +   Y R   AE L  L
Sbjct: 325 SPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTR---YAR---AEPLGIL 372

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             CF+  G    S P + L F GGA V L      +    G  +      D +A G    
Sbjct: 373 DTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVS----GGCLAFAPTRDDKAFG---- 424

Query: 437 ILGNFQMQNYYVEYD 451
            +GN Q + + V YD
Sbjct: 425 TIGNVQQRTFEVLYD 439


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 104/398 (26%), Positives = 157/398 (39%), Gaps = 65/398 (16%)

Query: 82  SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIP 136
           SS+    Y  ++  GTP      ILDTGS L W       QCK C+SS     ++P F P
Sbjct: 122 SSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWV------QCKPCNSSQCYPQRLPLFDP 175

Query: 137 KLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGI 195
             SSS   + C + +C  +    I    C  +       C     +Y + YGSG T  G 
Sbjct: 176 NTSSSYSPVPCDSQECRAL-AAGIDGDGCTSD---GDWGC-----AYEIHYGSGATPAGE 226

Query: 196 ALSETLNL-PNRIIPNFLVGCSVLSSR----QPAGIAGFGRGKTSLPSQLNLDK----FS 246
             ++ L L P  I+  F  GC     R       G+ G GR   SL  Q +  +    FS
Sbjct: 227 YSTDALTLGPGAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFS 286

Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           +CL        T  S+  L  G+ H    T+   +TP +        +    +Y +    
Sbjct: 287 HCL------PPTGVSTGFLALGAPH---DTSAFVFTPLLT------MDDQPWFYQLMPTA 331

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I+V GQ + +             G I DSGT  + +    +  L   F S M +      
Sbjct: 332 ISVAGQLLDIPPAVF------REGVITDSGTVLSALQETAYTALRTAFRSAMAEYPL--- 382

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
              A  +  L  CF+  G    + P + L F+GGA V L   +   + G     CL   +
Sbjct: 383 ---APPVGHLDTCFNFTGYDNVTVPTVSLTFRGGATVHLDASSGVLMDG-----CLAFWS 434

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +   G   ++G+   +   V YD+  +++GF+   C
Sbjct: 435 SGDEYTG---LIGSVSQRTIEVLYDMPGRKVGFRTGAC 469


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 150/391 (38%), Gaps = 72/391 (18%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y I++S G+P       +DTGS + W  C +              + P  SS+     C 
Sbjct: 131 YVITVSIGSPAVAXTMFIDTGSDVSWLRCKSRL------------YDPGTSSTYAPFSCS 178

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNR- 206
            P C+ +      C        ++   C      Y V YG G  T G   S+TL L    
Sbjct: 179 APACAQLGRRGTGC--------SSGSTCV-----YSVKYGDGSNTTGTYGSDTLTLAGTS 225

Query: 207 --IIPNFLVGCSVL----SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDT 257
             +I  F  GCS +          G+ G G    S  SQ        FSYCL        
Sbjct: 226 EPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCL------PP 279

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           T  SS  L  G+  S           F   P +  + A + +Y + LR I+VGG+ + + 
Sbjct: 280 TWNSSGFLTLGAPSSSTSAA------FSTTPMLRSKQA-ATFYGLLLRGISVGGKTLEIP 332

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
               +       G+IVDSGT  T + P  +  L+  F   M + +    A     L  L 
Sbjct: 333 SSVFS------AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAA--PRGL--LD 382

Query: 378 PCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT-VVTDREASGG 433
            CFD  G   G   + P + L   GGA V L   +   +V +G   CL    TD +   G
Sbjct: 383 TCFDFTGHGEGNNFTVPSVALVLDGGAVVDL---HPNGIVQDG---CLAFAATDDDGRTG 436

Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
              I+GN Q + + V YD+     GF+   C
Sbjct: 437 ---IIGNVQQRTFEVLYDVGQSVFGFRPGAC 464


>gi|449432731|ref|XP_004134152.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
 gi|449527081|ref|XP_004170541.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
          Length = 429

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 104/410 (25%), Positives = 166/410 (40%), Gaps = 59/410 (14%)

Query: 81  ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
           ++ H    Y I +   TP   +   +D G  L+W  C   +                +SS
Sbjct: 36  VTKHPSLQYIIQIHQRTPLVPVNLTVDLGGWLMWVDCDRGF----------------VSS 79

Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN--CTQICPSYLVLYGSG--LTEGIA 196
           S +   C++ +CS    +SI C  C   P     N  C+    + ++   SG  +T  + 
Sbjct: 80  SYKPARCRSAQCSL--AKSISCGKCYLPPHPGCNNYTCSLSARNTIIQLSSGGEVTSDLV 137

Query: 197 LSETLNLPNRI----IPNFLVGCS---VLSSRQPA--GIAGFGRGKTSLPSQLNLD---- 243
              + N  N      +PNFL  CS   +L        G+AGFGR + SLPSQ        
Sbjct: 138 SVSSTNGFNSTRALSVPNFLFICSSTFLLEGLAGGVTGMAGFGRTRISLPSQFAAAFSFS 197

Query: 244 -KFSYCLLSHKFDDTTRTSSLILDN-GSSH---SDKKTTGLTYTPFVNNPSVAERNAFSV 298
            KF+ CL       +T    +I    G  H   +   T  LTYTP + NP V      S 
Sbjct: 198 RKFTMCL-----SGSTGFPGVIFSGYGPYHFLPNIDLTNSLTYTPLLINP-VGFAGEKSS 251

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
            Y++G++ I    + V +    L +D +GNGGT + +   +T +   ++  L   F S++
Sbjct: 252 EYFIGVKSIEFNSKTVPLNTTLLKIDSNGNGGTKISTVNPYTVLETSIYRALVKTFTSEL 311

Query: 359 VKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAEV-TLPVENYFAV 413
               N  R     A+     C+        E   S P + L  +    +  +   N   V
Sbjct: 312 ---GNIPR---VAAVAPFEVCYSSKSFGSTELGPSVPSIDLILQNKKVIWRMFGANSMVV 365

Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
           V E   +CL  V +       ++++G  Q+++  +E+DL   RLGF   L
Sbjct: 366 VTE-EVLCLGFV-EGGVEAETAMVIGGHQIEDNLLEFDLATSRLGFSSTL 413


>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
          Length = 434

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 153/377 (40%), Gaps = 50/377 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y + +  GTP Q++  +LDT +   + P +    C  CS++   +F P  S+S   L 
Sbjct: 96  GNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSG---CIGCSAT---TFSPNASTSYVPLE 149

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C  P+CS +    + C      P   S  C     S+   Y         + ++L L   
Sbjct: 150 CSVPQCSQVR--GLSC------PATGSGAC-----SFNKSYAGSTYSATLVQDSLRLATD 196

Query: 207 IIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
           +IP++  G       S + ++   G+        S    L    FSYCL S  F     +
Sbjct: 197 VIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPS--FKSYYFS 254

Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
            SL L         +TT     P + NP    R +    Y+V L  ITVG   V    + 
Sbjct: 255 GSLKLGPVGQPKSIRTT-----PLLRNP---RRPSL---YFVNLTGITVGKVNVPFPKEL 303

Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
           L  D +   GTI+DSGT  T     ++  + DEF  Q+    +   +LGA        CF
Sbjct: 304 LAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFS---SLGA-----FDTCF 355

Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILG 439
               E     P + LHF    ++ LP+EN       GS  CL +  T +  +     ++ 
Sbjct: 356 VKNYETLA--PAITLHFT-DLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIA 412

Query: 440 NFQMQNYYVEYDLRNQR 456
           N+Q QN  V +D  N +
Sbjct: 413 NYQQQNLRVLFDTVNNK 429


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 106/405 (26%), Positives = 166/405 (40%), Gaps = 70/405 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
           G Y   L  G+PP+     +DTGS ++W  C    +C   S   I    + PK S +S L
Sbjct: 68  GLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSEL 127

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQI-CPSYLVLYGSG-LTEGIALSETL- 201
           + C    CS  +          D P+   K  ++I CP Y + YG G  T G  + + L 
Sbjct: 128 ISCDQEFCSATY----------DGPIPGCK--SEIPCP-YSITYGDGSATTGYYVQDYLT 174

Query: 202 ----NLPNRIIP---NFLVGCSVL------SSRQPA--GIAGFGRGKTSLPSQLNLDK-- 244
               N   R  P   + + GC  +      SS + A  GI GFG+  +S+ SQL      
Sbjct: 175 YNHVNDNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKV 234

Query: 245 ---FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
              FS+CL      D  R   +              G    P V+   +  R A   +Y 
Sbjct: 235 KKIFSHCL------DNIRGGGIF-----------AIGEVVEPKVSTTPLVPRMA---HYN 274

Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
           V L+ I V    +++          GNG GTI+DSGTT  ++   +++ L  + +++  +
Sbjct: 275 VVLKSIEVDTDILQLPSDIFD---SGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQPR 331

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-A 419
            + Y   L  +  +    CF   G     FP +KLHF+    +T+   +Y     +G   
Sbjct: 332 LKLY---LVEQQFS----CFQYTGNVDRGFPVVKLHFEDSLSLTVYPHDYLFQFKDGIWC 384

Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +       +  +G    +LG+  + N  V YDL N  +G+    C
Sbjct: 385 IGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMAIGWTDYNC 429


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 104/424 (24%), Positives = 163/424 (38%), Gaps = 74/424 (17%)

Query: 66  QTKTTTTTTTTTTTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ 122
           +      +++  +  +SS +Y G   Y + L  GTP Q    + DTGS L W  C     
Sbjct: 90  RVAAEVASSSAVSLPMSSGAYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAG--- 146

Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP- 181
               +S     F PK S S   + C +  C              D P   +   +   P 
Sbjct: 147 ----ASPPGRVFRPKTSRSWAPIPCSSDTCKL------------DVPFTLANCSSPASPC 190

Query: 182 --SYLVLYGSGLTEGIALSE--TLNLPNRIIP---NFLVGCSV----LSSRQPAGIAGFG 230
              Y    GS    GI  +E  T+ LP   +    + ++GCS      S R   G+   G
Sbjct: 191 TYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSADGVLSLG 250

Query: 231 RGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
             K S  +Q        FSYCL+ H      R ++  L  G     +  T  T T    +
Sbjct: 251 NAKISFATQAAARFGGSFSYCLVDHL---APRNATGYLAFGPGQVPR--TPATQTKLFLD 305

Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRV----RVWHKYLTLDRDGNGGTIVDSGTTFTFMA 343
           P +        +Y V +  I V G+ +     VW          +GG I+DSG T T +A
Sbjct: 306 PEMP-------FYGVKVDAIHVAGKALDIPAEVWDAK-------SGGVILDSGNTLTVLA 351

Query: 344 PELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGG 400
                P     V+ + K+ +    +   +      C++    + G+    P+L + F G 
Sbjct: 352 ----APAYKAVVAALSKHLDGVPKV---SFPPFEHCYNWTARRPGAPEIIPKLAVQFAGS 404

Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
           A +  P ++Y   V  G   C+ V   +E       ++GN   Q +  E+DL+N ++ FK
Sbjct: 405 ARLEPPAKSYVIDVKPG-VKCIGV---QEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFK 460

Query: 461 QQLC 464
           Q  C
Sbjct: 461 QSNC 464


>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
          Length = 446

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 94/403 (23%), Positives = 161/403 (39%), Gaps = 60/403 (14%)

Query: 86  YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
           YG Y +++  G P +     +D+GS L W  C     C  C+    P +  KL   S L+
Sbjct: 76  YGLYYVTMLVGNPSKPYFLDVDSGSELTWIQCDA--PCISCAKGPHPLY--KLKKGS-LV 130

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN--L 203
             ++P C+ +   S    +         K  +Q C   +     G +EG  + +++   L
Sbjct: 131 PSKDPLCAAVQAGSGHYHN--------HKEASQRCDYDVAYADHGYSEGFLVRDSVRALL 182

Query: 204 PNRII--PNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
            N+ +   N + GC         +S  +  GI G G G  SLPSQ         ++ H  
Sbjct: 183 TNKTVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCI 242

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
               R    +       S   T+ +T+ P +  PS+        +YYVG  ++  G +  
Sbjct: 243 FGAGRDGGYMFFGDDLVS---TSAMTWVPMLGRPSIK-------HYYVGAAQMNFGNK-- 290

Query: 315 RVWHKYLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
                   LD+DG+    GG I DSG+T+T+   + +      F+S + +N +  +    
Sbjct: 291 -------PLDKDGDGKKLGGIIFDSGSTYTYFTNQAY----GAFLSVVKENLSGKQLEQD 339

Query: 371 EALTGLRPC------FDVPGEKTGSFPELKLHFKGGAEVTLPV--ENYFAVVGEGSAVCL 422
            + + L  C      F    E    F  L L F+      + +  E Y  V  +G+ VCL
Sbjct: 340 SSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLKFRSTKTKQMEIFPEGYLVVNKKGN-VCL 398

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
            ++         + +LG+   Q   V YD    ++G+ +  C+
Sbjct: 399 GILNGTAIGIVDTNVLGDISFQGQLVVYDNEKNQIGWARSDCQ 441


>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 105/404 (25%), Positives = 170/404 (42%), Gaps = 77/404 (19%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           +++S G PP +    +DTGS L W    PC  H  C   S+   P F P  S +SR + C
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58

Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
            + KC    ++  +Q  +C ++      +CT     Y V YG+G   + G  +++TL + 
Sbjct: 59  SSVKCGEPRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109

Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
           +  + + + GCS  V  S   AGI GFG             P  L+    SYCL +    
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPT---- 164

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
           D T+   +IL       D+      YTP    +N P+          Y + +  +   GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+             +   IVDSG   T + P  F  L D+ ++Q + +  Y R   A  
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259

Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
            + +  C+    + +G            + P L++ F GGA + LP  N F        +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVF-YNDPHRGL 316

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           C+T   +       S ILGN   +++   +D++ ++ GFK  +C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAVC 357


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 103/418 (24%), Positives = 164/418 (39%), Gaps = 97/418 (23%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
           G Y   +  GTPP+     +DTGS ++W  C    QCK C +       +  +  K SSS
Sbjct: 81  GLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCI---QCKECPTRSSLGMDLTLYDIKESSS 137

Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
            +L+ C    C  I           +  L T       CP YL +YG G +      + +
Sbjct: 138 GKLVPCDQEFCKEI-----------NGGLLTGCTANISCP-YLEIYGDGSSTAGYFVKDI 185

Query: 202 NLPNRIIPNF---------LVGC------SVLSSRQPA--GIAGFGRGKTSLPSQLN--- 241
            L +++  +          + GC       + SS + A  GI GFG+  +S+ SQL    
Sbjct: 186 VLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSG 245

Query: 242 --LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN-NPSVAERNAFSV 298
                F++CL                 NG +       G    P VN  P + ++  +SV
Sbjct: 246 KVKKMFAHCL-----------------NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSV 288

Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNG-----GTIVDSGTTFTFMAPELFEPLADE 353
                   +T     V+V H +L+L  D +      GTI+DSGTT  ++   ++EPL  +
Sbjct: 289 -------NMTA----VQVGHTFLSLSTDTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYK 337

Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-------P 406
            +SQ          L  + L     CF         FP +   F+ G  + +       P
Sbjct: 338 MISQHPD-------LKVQTLHDEYTCFQYSESVDDGFPAVTFFFENGLSLKVYPHDYLFP 390

Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             N++ +  + S         R++      +LG+  + N  V YDL NQ +G+ +  C
Sbjct: 391 SVNFWCIGWQNSG-----TQSRDSKN--MTLLGDLVLSNKLVFYDLENQAIGWAEYNC 441


>gi|343161843|dbj|BAK57511.1| extracellular dermal glycoprotein [Nicotiana benthamiana]
          Length = 440

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 100/409 (24%), Positives = 171/409 (41%), Gaps = 83/409 (20%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
           TP   +   LD G   +W  C   Y                +SSS +   C++ +CS   
Sbjct: 57  TPLVPVSLTLDLGGQFLWVDCDQGY----------------VSSSYKPARCRSAQCSLAR 100

Query: 157 HESIQCRDCNDEPLATSKNCT-QICPSYLVLYGSGLTEGIALSETLNL-------PNRII 208
                C  C   P     N T  + P   V   +  T G   S+T+ +       P R +
Sbjct: 101 AGG--CGQCFSPPKPGCNNDTCGLIPDNTVTQTA--TSGELASDTVQVQSSNGKNPGRNV 156

Query: 209 PN----FLVGCSVLSSRQPAGI---AGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDD 256
            +    F+ G + L  R  +G+   AG GR + SLPSQ + +     KF+ CL S     
Sbjct: 157 VDKDFLFVCGSTFLLKRLASGVKGMAGLGRTRISLPSQFSAEFSFPRKFAVCLSS----- 211

Query: 257 TTRTSSLILDNGSSHS-----DKKTTGLTYTPFVNNPSVAERNAFSV-----YYYVGLRR 306
           +T++  ++L     +S     +      +YTP   NP V+  +AFS       Y++G++ 
Sbjct: 212 STKSKGVVLFGDGPYSFLPNREFANDDFSYTPLFINP-VSTASAFSSGEPSSEYFIGVKS 270

Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
           I +  + V +    L++D  G GGT + +   +T +   ++  + + FV ++V   N TR
Sbjct: 271 IKINQKVVSINTTLLSIDNQGVGGTKISTVNPYTILETSIYNAVTNFFVKELV---NITR 327

Query: 367 ALGAEALTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVENYF-AVVGEGSAV- 420
                ++     CFD   +   + G + P + L  +         EN F  + G  S V 
Sbjct: 328 ---VASVAPFGACFDSRNIVSTRVGPTVPPIDLVLQN--------ENVFWTIFGANSMVQ 376

Query: 421 ------CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
                 CL  V D   +   SI++G + +++  +++DL + RLGF   +
Sbjct: 377 VSENVLCLGFV-DGGVNPRTSIVIGGYTIEDNLLQFDLASSRLGFTSSI 424


>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
 gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
          Length = 426

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 97/398 (24%), Positives = 167/398 (41%), Gaps = 67/398 (16%)

Query: 87  GGYSISLSFGTPPQIIPFILD--TGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
           G Y +SLS G PP+  P+ LD  TGS L W  C     C  C+ +  P + P    ++ L
Sbjct: 65  GYYYVSLSIGQPPK--PYFLDPDTGSDLSWLQCDA--PCVRCTKAPHPLYRP----NNNL 116

Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSET--L 201
           + C++P C+ +H    +C           + C      Y V Y  G +  G+ + +   L
Sbjct: 117 VICKDPMCASLHPPGYKCEH--------PEQC-----DYEVEYADGGSSLGVLVKDVFPL 163

Query: 202 NLPN--RIIPNFLVGCSVL----SSRQP-AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
           N  N  R+ P   +GC        S  P  G+ G G+GK+S+ SQL+       ++ H  
Sbjct: 164 NFTNGLRLAPRLALGCGYDQIPGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCV 223

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             ++R    +      +   +   + +TP + +           +Y  G   + +GG+  
Sbjct: 224 --SSRGGGFLFFGDDLYDSSR---VVWTPMLRDQ--------HTHYSSGYAELILGGKTT 270

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL- 373
              +  +T           DSG+++T++    ++ L    V + +  +    AL  + L 
Sbjct: 271 VFKNLLVTF----------DSGSSYTYLNSLAYQALV-HLVRKELSEKPVREALDDQTLP 319

Query: 374 ---TGLRPCFDVPGEKTGSFPELKLHFKGGA----EVTLPVENYFAVVGEGSAVCLTVVT 426
               G RP   V   K   F  L L F GG     +  +P+E+Y  +  +G+ VCL ++ 
Sbjct: 320 LCWRGKRPFKSVRDVKK-FFKPLALSFPGGGRTKTQYDIPLESYLIISLKGN-VCLGILN 377

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             EA      ++G+  MQ+  V YD    ++G+    C
Sbjct: 378 GTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNC 415


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 100/407 (24%), Positives = 167/407 (41%), Gaps = 70/407 (17%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++L  G+PP++    +DTGS L W  C     C+ C+      + PK    ++++ 
Sbjct: 38  GLYYMALLLGSPPKLYFLDMDTGSDLTWAQC--DAPCRNCAIGPHGLYNPK---KAKVVD 92

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-- 203
           C  P C+ I        +CN +     K C      Y V Y  G  T G+ + +TL +  
Sbjct: 93  CHLPVCAQIQQGG--SYECNSD----VKQC-----DYEVEYADGSSTMGVLVEDTLTVRL 141

Query: 204 --PNRIIPNFLVGCSVLS----SRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
                I    ++GC        ++ PA   G+ G    K +LP+QL        +L H  
Sbjct: 142 TNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCL 201

Query: 255 DDTTRTSSLILDNGSSHSDK--KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
            D +     +        D+   + G+T+TP +  P +       + Y   L+ I  GG 
Sbjct: 202 ADGSNGGGYLF-----FGDELVPSWGMTWTPMMGKPEM-------LGYQARLQSIRYGGD 249

Query: 313 RVRVWHKYLTLDRD---GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
            +      L  D D        + DSGT+FT++ P+ +  +    +S + K     R   
Sbjct: 250 SL-----VLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASV----LSAVTKQSGLLR--- 297

Query: 370 AEALTGLRPCFDVPG------EKTGSFPELKLHFKG------GAEVTLPVENYFAVVGEG 417
            ++ T L  C+  P       +    F  L L F G       + + L  + Y  V  +G
Sbjct: 298 VKSDTTLPYCWRGPSPFQSITDVHQYFKTLTLDFGGRNWFATDSTLDLSPQGYLIVSTQG 357

Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           + VCL ++    AS   + I+G+  M+ Y V YD    R+G+ ++ C
Sbjct: 358 N-VCLGILDASGASLEVTNIIGDVSMRGYLVVYDNVRDRIGWIRRNC 403


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 98/384 (25%), Positives = 156/384 (40%), Gaps = 49/384 (12%)

Query: 93  LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
            + GTPPQ    I+D    LVW  C+   +C  C    +P FIP  SS+ R   C    C
Sbjct: 47  FTIGTPPQPASAIIDVAGELVWTQCS---RCSRCFKQDLPLFIPNASSTFRPEPCGTDAC 103

Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
                +S    +C+ + + T ++ T I            T GI  +ET  +      +  
Sbjct: 104 -----KSTPTSNCSGD-VCTYESTTNI------RLDRHTTLGIVGTETFAI-GTATASLA 150

Query: 213 VGCSVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
            GC V S        +G  G GR   SL +Q+ L KFSYCL       T ++S L L + 
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCLSPRG---TGKSSRLFLGSS 207

Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
           +  +  ++T  +  PF+      + +    YY + L  I  G   +             +
Sbjct: 208 AKLAGGEST--STAPFIKTSPDDDSHH---YYLLSLDAIRAGNTTIATAQ---------S 253

Query: 329 GGTIV-DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF-DVPGEK 386
           GG +V  + + F+ +    +     + V++ V                L  CF    G  
Sbjct: 254 GGILVMHTVSPFSLLVDSAYRAF-KKAVTEAVGGAAAPPMATPPQPFDL--CFKKAAGFS 310

Query: 387 TGSFPELKLHFK-GGAEVTLPVENYFAVVG-EGSAVCLTVVT----DREASGGPSIILGN 440
             + P+L   F+ GGA +T+P   Y   VG E    C  +++    +R    G S +LG+
Sbjct: 311 RATAPDLVFTFQGGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVS-VLGS 369

Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
            Q +N +  YDL+ + L F+   C
Sbjct: 370 LQQENVHFLYDLKKETLSFEPADC 393


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 90/345 (26%), Positives = 147/345 (42%), Gaps = 71/345 (20%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y+  +  GTPPQ    I+DTGS + + PC+    C+ C   + P F P+LSS+ + + 
Sbjct: 88  GYYTTRIWIGTPPQTFALIVDTGSTVTYVPCST---CEQCGRHQDPKFEPELSSTYQPVS 144

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
           C N  C+   +E  QC    +   A   + + +    ++ +G         +++  +P R
Sbjct: 145 C-NIDCT-CDNERKQC--VYERQYAEMSSSSGVLGEDIISFG---------NQSELVPQR 191

Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
            I     GC       L S++  GI G GRG  S+  QL       D FS C        
Sbjct: 192 AI----FGCENQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGG 247

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVR 315
                ++IL   S  S     G+ +         AE +   S YY + L+ I V G+++ 
Sbjct: 248 ----GAMILGGISPPS-----GMVF---------AESDPVRSQYYNIDLKAIHVAGKQLH 289

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK---------NRNYTR 366
           +         DG  GT++DSGTT+ ++    F    D  + ++           N N   
Sbjct: 290 LDPSIF----DGKHGTVLDSGTTYAYLPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDIC 345

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
             GAE+        DV  + + +FP +++ F  G +++L  ENY 
Sbjct: 346 FSGAES--------DV-SQLSNTFPAVEMVFSNGQKLSLSPENYL 381


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 151/375 (40%), Gaps = 65/375 (17%)

Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
            ++DTGS + W  C     C  C   +   F P  S++ + L C +  C  +   S  C 
Sbjct: 3   LLIDTGSDITWIQCD---PCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSC- 58

Query: 164 DCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRI-----IPNFLVGCSV 217
                 L +S N       Y+V YG    T G    ETL L +       +PNF  GC  
Sbjct: 59  ------LNSSCN-------YMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGH 105

Query: 218 LSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
            +       AG+ G G+     P+Q ++     FSYCL S     ++   S IL  G + 
Sbjct: 106 ANKGLFNGAAGLMGLGKSSIGFPAQTSVAFGKVFSYCLPSV----SSTIPSGILHFGEAA 161

Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
                  + +TP V++ S   +      Y+V +  I VG + + +           +   
Sbjct: 162 --MLDYDVRFTPLVDSSSGPSQ------YFVSMTGINVGDELLPI-----------SATV 202

Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
           +VDSGT  +      +E L D F +Q++          A ++     CF V      + P
Sbjct: 203 MVDSGTVISRFEQSAYERLRDAF-TQILPGLQT-----AVSVAPFDTCFRVSTVDDINIP 256

Query: 392 ELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEY 450
            + LHF+  AE+ L PV   + V  +   +C        +S G S+ LGNFQ QN    Y
Sbjct: 257 LITLHFRDDAELRLSPVHILYPV--DDGVMCFAFA---PSSSGRSV-LGNFQQQNLRFVY 310

Query: 451 DLRNQRLGFKQQLCK 465
           D+   RLG     C 
Sbjct: 311 DIPKSRLGISAFECN 325


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 111/457 (24%), Positives = 180/457 (39%), Gaps = 70/457 (15%)

Query: 29  LTFSLSRFHTNPSQDSYQNL---NSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHS 85
           L FS+S  H + S     NL     + S       HIK    +        TT +I +H 
Sbjct: 15  LCFSISVVHLSASPTLVLNLVHSYHIYSRKPPHVYHIKEASVERLEYLKAKTTGDIIAHL 74

Query: 86  Y-------GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKL 138
                     + +++S G+PP      +DT S L+W  C     C  C +  +P F P  
Sbjct: 75  SPNVPIIPQAFLVNISIGSPPITQLLHMDTASDLLWIQCL---PCINCYAQSLPIFDPSR 131

Query: 139 SSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS 198
           S + R   C+  + S               P       T+ C   +       ++GI   
Sbjct: 132 SYTHRNETCRTSQYSM--------------PSLKFNANTRSCEYSMRYVDDTGSKGILAR 177

Query: 199 ETLNLPNRI--------IPNFLVGCSVLSSRQP---AGIAGFGRGKTSLPSQLNLDKFSY 247
           E L L N I        + + + GC   +  +P    GI G G G+ SL  +    KFSY
Sbjct: 178 EML-LFNTIYDESSSAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFG-KKFSY 235

Query: 248 CLLSHKFDDTTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
           C  S   DD +   + L+L +  ++    TT L              N F   YYV +  
Sbjct: 236 CFGS--LDDPSYPHNVLVLGDDGANILGDTTPLEI-----------HNGF---YYVTIEA 279

Query: 307 ITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
           I+V G  + +  +    +   G GGTI+D+G + T +  E ++PL +  +  + + R   
Sbjct: 280 ISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNR-IEDIFEGRFTA 338

Query: 366 RALGAEALTGLRPCFDVPGEK---TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
             +  + +  +  C++   E+      FP +  HF  GAE++L V++ F  +   +  CL
Sbjct: 339 ADVSQDDMIKME-CYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSP-NVFCL 396

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
            V      S      +G    Q+Y + YDL    + F
Sbjct: 397 AVTPGNLNS------IGATAQQSYNIGYDLEAMEVSF 427


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 107/414 (25%), Positives = 164/414 (39%), Gaps = 91/414 (21%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS--------KIPSFIPKL 138
           G Y+  +  GTPP     I+DTGS + + PC++   C +  +S        + P F P+ 
Sbjct: 38  GYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPEN 97

Query: 139 SSSSRLLGCQNPKC--SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGI 195
           SSS + +GC++  C        S QC+                   Y  +Y    T +G+
Sbjct: 98  SSSYQKIGCRSSDCITGLCDSNSHQCK-------------------YERMYAEMSTSKGV 138

Query: 196 ALSETLNL--PNRIIPNFL-VGCSVLSS-----RQPAGIAGFGRGKTSLPSQLN-----L 242
              + L+    +R+    L  GC    S     +   GI G GRG  S+  QL       
Sbjct: 139 LGKDLLDFGPASRLQSQLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIE 198

Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN-AFSVYYY 301
           D FS C                +D G       +  L   P  +    A+ +   S YY 
Sbjct: 199 DSFSLCYGG-------------MDEGGG-----SMVLGAIPAPSGMVFAKSDPRRSNYYN 240

Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
           + L  I V G  +++         +G  GTI+DSGTT+ ++    FE   D  V+Q    
Sbjct: 241 LELTEIQVQGASLKLDSNVF----NGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQ---- 292

Query: 362 RNYTRALGA-EALTGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY- 410
                 LG+ +A+ G  P     C+   G  T      FP +   F    +V+L  ENY 
Sbjct: 293 ------LGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFPLVDFVFAENQKVSLAPENYL 346

Query: 411 FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           F       A CL    +++A    + +LG   ++N  V YD  N ++GF +  C
Sbjct: 347 FKHTKVPGAYCLGFFKNQDA----TTLLGGIIVRNMLVTYDRYNHQIGFLKTNC 396


>gi|413923981|gb|AFW63913.1| hypothetical protein ZEAMMB73_837345 [Zea mays]
          Length = 414

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 58/185 (31%), Positives = 84/185 (45%), Gaps = 35/185 (18%)

Query: 245 FSYCLLSHKFDDTT----RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
           FSY L+ H  D  +    R   L+L    +H +     L YT F    S A+      +Y
Sbjct: 6   FSYRLVEHDSDAVSKVVFREDDLVL----AHPE-----LKYTAFTPTSSPAD-----TFY 51

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
           YV L+ + VGG+ +++      + +DG+GGTI+DSGTT ++      EP+          
Sbjct: 52  YVKLKGVLVGGELLKISSDTWDVGKDGSGGTIIDSGTTLSYFV----EPV---------- 97

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
              Y        L G  PC++V G +    PEL L F  GA    P ENYF  +     +
Sbjct: 98  ---YQAVPSDPGLLGAEPCYNVSGMERPEVPELSLLFPDGAVWDFPAENYFVRLDPDDIM 154

Query: 421 CLTVV 425
           CL V+
Sbjct: 155 CLAVL 159


>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 518

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 124/501 (24%), Positives = 194/501 (38%), Gaps = 106/501 (21%)

Query: 10  LSFIFFFTLLSI--FPSSITSLTFSLSRFH----------------TNPSQDSYQNLNSL 51
           L F+ F  LLS+  FP +     F+    H                  PS+ S++    L
Sbjct: 5   LKFLVFSLLLSVWVFPQNCKGRIFTFKMHHRFSDMLKDLSDSTTSRNFPSKGSFEYYAEL 64

Query: 52  V-SSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGS 110
                + R   + N +     +   +T   ISS  +  Y+ ++  GTP       LDTGS
Sbjct: 65  AHRDQMLRGRKLYNVEAPLAFSDGNSTF-RISSLGFLHYT-TVELGTPGMKFMVALDTGS 122

Query: 111 HLVWFPCTNHYQC------KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
            L W PC +  +C       Y S  ++  + PK SS+S+ + C N  C+   H + +C  
Sbjct: 123 DLFWVPC-DCSKCAPTQGVAYASDFELSIYDPKQSSTSKKVTCNNNLCA---HRN-RC-- 175

Query: 165 CNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLPNR------IIPNFLVGC- 215
                L T  +C      Y+V Y S    T GI + + L+L +       I      GC 
Sbjct: 176 -----LGTFSSC-----PYMVSYVSAQTSTSGILVEDVLHLTSEDSNQESIKAYVTFGCG 225

Query: 216 -----SVLSSRQPAGIAGFGRGKTSLPS-----QLNLDKFSYCLLSHKFDDTTRTSSLIL 265
                S L++  P G+ G G  + S+PS      L  D FS C      D   R      
Sbjct: 226 QVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCF---GHDGVGRI----- 277

Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
               S  DK +     TPF +NPS    N       + + ++ VG   V           
Sbjct: 278 ----SFGDKGSPDQEETPFNSNPSHPSYN-------ISVTQVRVGTTLV----------- 315

Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PG 384
           D +   + DSGT+FT++   ++  +++ F +Q    R        +       C+D+ PG
Sbjct: 316 DVDFTALFDSGTSFTYLINPIYAMVSENFHAQAQDKRR-----PPDPRIPFEYCYDMSPG 370

Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDREASGGPSIILGNFQM 443
             +   P + L  KG    T+  +    +  +   V CL +V   E +     I+G   M
Sbjct: 371 ANSSLIPSMSLTMKGRGHFTV-FDPIIVITTQNELVYCLAIVKSTELN-----IIGQNFM 424

Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
             Y V +D     LG+K+  C
Sbjct: 425 TGYRVVFDREKLVLGWKETDC 445


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 104/395 (26%), Positives = 160/395 (40%), Gaps = 59/395 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y   +  GTP +    ++DTGS L W  C   Y+ +   + ++  F    S S + +GC 
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNC--RYRARGKDNRRV--FRADESKSFKTVGCL 161

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP--- 204
              C        +    N   L T    +  C SY   Y  G   +G+   ET+ +    
Sbjct: 162 TQTC--------KVDLMNLFSLTTCPTPSTPC-SYDYRYADGSAAQGVFAKETITVGLTN 212

Query: 205 NRI--IPNFLVGCSVLSSRQ----PAGIAGFGRGK---TSLPSQLNLDKFSYCLLSHKFD 255
            R+  +P  L+GCS   + Q      G+ G        TS  + L   KFSYCL+ H   
Sbjct: 213 GRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDH-LS 271

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
           +   ++ LI   GSS S K     T TP        +      +Y + +  I++G   + 
Sbjct: 272 NKNVSNYLIF--GSSRSTKTAFRRT-TPL-------DLTRIPPFYAINVIGISLGYDMLD 321

Query: 315 ---RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
              +VW      D    GGTI+DSGT+ T +A   ++ +       +V+     + +  E
Sbjct: 322 IPSQVW------DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVE----LKRVKPE 371

Query: 372 ALTGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
            +  +  CF    G      P+L  H KGGA      ++Y      G   CL  V    +
Sbjct: 372 GVP-IEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPG-VKCLGFV----S 425

Query: 431 SGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +G P+  ++GN   QNY  E+DL    L F    C
Sbjct: 426 AGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460


>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
          Length = 216

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 64/224 (28%), Positives = 97/224 (43%), Gaps = 25/224 (11%)

Query: 245 FSYCLLSHK---FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
           FSYCL S++   F  + R        G++   +    + YTP + NP    R +    YY
Sbjct: 15  FSYCLPSYRSYYFSGSLRL-------GAAGQPRN---VRYTPLLTNP---HRPSL---YY 58

Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
           V +  ++VG   V+V       D     GT++DSGT  T     ++  L +EF  Q+   
Sbjct: 59  VNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAP 118

Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
             YT +LGA        CF+      G  P + LH  GG ++TLP+EN           C
Sbjct: 119 SGYT-SLGA-----FDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLAC 172

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           L +    +       ++ N Q QN  V  D+   R+GF ++ C 
Sbjct: 173 LAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 216


>gi|125561847|gb|EAZ07295.1| hypothetical protein OsI_29543 [Oryza sativa Indica Group]
          Length = 205

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 60/188 (31%), Positives = 96/188 (51%), Gaps = 20/188 (10%)

Query: 226 IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD--NGSSHSDKKTTGLTY-- 281
           + G GRG  SL SQL   +FSYCL S    + +R +  +    NG++ S   ++GL    
Sbjct: 1   MVGLGRGLLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNAS---SSGLPVQS 57

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
           TP V N       A    Y++ L+ I++G +R+ +      ++ DG GG  +DSGT+ T+
Sbjct: 58  TPLVVN------AALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTW 111

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF--PELKLHFKG 399
           +  ++++ +  E VS +   R    A   E   GL  CF  P   T +   P+++LHF G
Sbjct: 112 LQQDVYDAVRRELVSVL---RPLPPANDTE--IGLETCFPWPPPPTVTMTVPDMELHFDG 166

Query: 400 GAEVTLPV 407
           GA +  P+
Sbjct: 167 GANMLHPI 174


>gi|356548993|ref|XP_003542883.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 473

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 94/400 (23%), Positives = 156/400 (39%), Gaps = 70/400 (17%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  S+  GTP      ++D     +W+ C  HY                 SSS R + C 
Sbjct: 87  YYTSVGIGTPRHNFDLVIDLSGENLWYDCDTHYN----------------SSSYRPIACG 130

Query: 149 NPKCSWIHHESIQCRDCND--EPLATSKNCTQICPSYLV--LYGSGLTEGIALSETLNLP 204
           + +C       I C  CN   +P  T+  C     + L   +Y  GL E     + + + 
Sbjct: 131 SKQC-----PEIGCVGCNGPFKPGCTNNTCPANVINQLAKFIYSGGLGE-----DFIFIR 180

Query: 205 NRIIPNFLVGC-------SVLSSRQP--------AGIAGFGRGKTSLPSQLNL-----DK 244
              +   L  C       S      P         GI G  + + +LP QL        K
Sbjct: 181 QNKVSGLLSSCIDTDAFPSFSDDELPLFGLPNNTKGIIGLSKSQLALPIQLASANKVPSK 240

Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF-VNNPS---VAERNAFSVYY 300
           FS CL S      T   +L++  G  H    +  L  TP  VNN S   ++     S  Y
Sbjct: 241 FSLCLPSLNNQGFT---NLLVRAGEEHPQGISKFLKTTPLIVNNVSTGAISVEGVPSKEY 297

Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
           ++ ++ + + G  V +    L +D  GNGGT + + + FT    EL   +   F+   +K
Sbjct: 298 FIDVKAVQIDGNVVNLKPSLLAIDNKGNGGTKLSTMSPFT----ELQTTVYKTFIRDFIK 353

Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVVGE 416
             +  R     ++     C+D    +  S     P + L  +GG + T+   N   V+ +
Sbjct: 354 KASDRRLKRVASVAPFEACYDSTSIRNSSTGLVVPTIDLVLRGGVQWTIYGANSM-VMAK 412

Query: 417 GSAVCLTVVTD----REASGGPSIILGNFQMQNYYVEYDL 452
            +  CL +V      R +    SI++G +Q+++  +E+D+
Sbjct: 413 KNVACLAIVDGGTEPRMSFVKASIVIGGYQLEDNLLEFDV 452


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 102/402 (25%), Positives = 162/402 (40%), Gaps = 64/402 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
           G Y   +  G+PP+     +DTGS ++W  C +   C   S   I    F P  SS++ L
Sbjct: 84  GLYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSL 143

Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
           + C +P C S +   + +C          S  C     SY   YG G  T G  +S+ L 
Sbjct: 144 VSCSHPICTSLVQTTAAECS-------PQSNQC-----SYSFHYGDGSGTTGYYVSDMLY 191

Query: 203 ----LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNL----- 242
               L + +I N     + GCS   S       +   GI GFG+   S+ SQL+      
Sbjct: 192 FDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITP 251

Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
             FS+CL              IL+            + Y+P V + S         +Y +
Sbjct: 252 KVFSHCLKGEGDGGGKLVLGEILE----------PNIIYSPLVPSQS---------HYNL 292

Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
            L+ I+V GQ + +           N GTIVDSGTT T++    ++P      + +  + 
Sbjct: 293 NLQSISVNGQLLPI--DPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSSST 350

Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
               + G +       C+ V       FP + L+F GGA + L    Y   +G      +
Sbjct: 351 TPVLSKGNQ-------CYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLGFSDGAAM 403

Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             +  ++ +     ILG+  +++    YDL +QR+G+    C
Sbjct: 404 WCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQRIGWANYDC 445


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 104/395 (26%), Positives = 159/395 (40%), Gaps = 59/395 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y   +  GTP +    ++DTGS L W  C   Y+ +   + ++  F    S S + +GC 
Sbjct: 84  YFTEIRVGTPAKKFRVVVDTGSELTWVNC--RYRARGKDNRRV--FRADESKSFKTVGCL 139

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP--- 204
              C        +    N   L T    +  C SY   Y  G   +G+   ET+ +    
Sbjct: 140 TQTC--------KVDLMNLFSLTTCPTPSTPC-SYDYRYADGSAAQGVFAKETITVGLTN 190

Query: 205 NRI--IPNFLVGCSVLSSRQ----PAGIAGFGRGK---TSLPSQLNLDKFSYCLLSHKFD 255
            R+  +P  L+GCS   + Q      G+ G        TS  + L   KFSYCL+ H   
Sbjct: 191 GRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHL-- 248

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
            + +  S  L  GSS S K     T TP        +      +Y + +  I++G   + 
Sbjct: 249 -SNKNVSNYLIFGSSRSTKTAFRRT-TPL-------DLTRIPPFYAINVIGISLGYDMLD 299

Query: 315 ---RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
              +VW      D    GGTI+DSGT+ T +A   ++ +       +V+     + +  E
Sbjct: 300 IPSQVW------DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVE----LKRVKPE 349

Query: 372 ALTGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
            +  +  CF    G      P+L  H KGGA      ++Y      G   CL  V    +
Sbjct: 350 GVP-IEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPG-VKCLGFV----S 403

Query: 431 SGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +G P+  ++GN   QNY  E+DL    L F    C
Sbjct: 404 AGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 438


>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
 gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
          Length = 555

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 106/453 (23%), Positives = 170/453 (37%), Gaps = 89/453 (19%)

Query: 63  KNPQTKTTTTT-TTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT--- 118
           K P+  +TT+T      + +++   G Y +S+ FGTP      +LDT + L W  C    
Sbjct: 113 KVPKLMSTTSTFELPMRSALNTAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRR 172

Query: 119 ---NHYQCKYCSSSKIPS-----------------FIPKLSSSSRLLGCQNPKCSWIHHE 158
               HY  +   +  +                   + P  SSS R + C   +C+ + + 
Sbjct: 173 RKGKHYGRQSSKTMSVGGDDDVVAALAKKEARKNWYRPAKSSSWRRIRCSEQQCAHLPYN 232

Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL---PNRI--IPNFLV 213
           + Q           S +  + C  Y       +T GI  +E   +     R+  +P  ++
Sbjct: 233 TCQ-----------SPSKLESCSYYQKTQDGTVTIGIYGNEKATVTVSDGRMAKLPGLVL 281

Query: 214 GCSVLSSRQPA----GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILD 266
           GCSVL +        G+   G G  S      L    +FS+CLLS    +++R +S  L 
Sbjct: 282 GCSVLEAGASVDAHDGVLSLGNGHMSFAIHAVLRFGGRFSFCLLSA---NSSRDASSYLT 338

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR-----RIT---VGGQRVRVWH 318
            G +            P V  P   E     + Y V ++     R+T   VGG+R+ +  
Sbjct: 339 FGPN------------PAVMGPGTMETE---ILYNVDVKAAYGPRVTAVLVGGERLDIPD 383

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
               +D+    G I+D+ T+ T + PE +EPL       +         L  E+  G   
Sbjct: 384 DVWNIDKGLGSGVILDTSTSVTSLVPEAYEPLVAALDRHLAH-------LPRESFAGFEY 436

Query: 379 CF-------DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
           C+        V      + P++ +   GGA +  P      +   G  V           
Sbjct: 437 CYRWTFTGDGVDPAHNVTIPKVTVEMTGGARLE-PEAKSVVMPEVGHGVACLAFRKLPWG 495

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GGP II GN  MQ Y  E D       F++  C
Sbjct: 496 GGPCII-GNVLMQEYIWEIDHSKATFRFRKDKC 527


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 105/389 (26%), Positives = 162/389 (41%), Gaps = 56/389 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
           Y ++L  GTP      ++DTGS L W       QCK C+SS     K P + P  SS+  
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWV------QCKPCNSSSCYPQKDPLYDPTASSTYA 180

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN 202
            + C +  C  +  ++     C      T+ + T +C  Y + YG+   T G+  +ETL 
Sbjct: 181 PVPCDSKACKDLVPDAYD-HGC------TNSSGTSLC-QYGIEYGNRDTTVGVYSTETLT 232

Query: 203 L-PNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
           L P   + +F  GC ++   +     G+ G G    SL SQ        FSYCL      
Sbjct: 233 LSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCL------ 286

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
               +++  L  G+  ++  T G  +TP  + P  A       +Y V L  ++VGG+ + 
Sbjct: 287 PPGNSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQA------TFYLVNLTGVSVGGKPLD 340

Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
           +    L+      GG I+DSGT  T +    +  L   F + M    +    L       
Sbjct: 341 IPPTVLS------GGMIIDSGTIITGLPDTAYSALRTAFRTAM----SAYPLLPPNNDDV 390

Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
           L  C++  G    + P + L F GGA + L V +   +       CL       AS G  
Sbjct: 391 LDTCYNFTGIANVTVPTVALTFDGGATIDLDVPSGVLIQD-----CLAFA--GGASDGDV 443

Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            I+GN   + + V YD     +GF+   C
Sbjct: 444 GIIGNVNQRTFEVLYDSGRGHVGFRPGAC 472


>gi|42407406|dbj|BAD09564.1| nucleoid DNA-binding protein-like [Oryza sativa Japonica Group]
          Length = 205

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 60/188 (31%), Positives = 96/188 (51%), Gaps = 20/188 (10%)

Query: 226 IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD--NGSSHSDKKTTGLTY-- 281
           + G GRG  SL SQL   +FSYCL S    + +R +  +    NG++ S   ++GL    
Sbjct: 1   MVGLGRGLLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNAS---SSGLPVQS 57

Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
           TP V N       A    Y++ L+ I++G +R+ +      ++ DG GG  +DSGT+ T+
Sbjct: 58  TPLVVN------AALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTW 111

Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF--PELKLHFKG 399
           +  ++++ +  E VS +   R    A   E   GL  CF  P   T +   P+++LHF G
Sbjct: 112 LQQDVYDAVRRELVSVL---RPLPPANDTE--IGLETCFPWPPPPTVTMTVPDMELHFDG 166

Query: 400 GAEVTLPV 407
           GA +  P+
Sbjct: 167 GANMLHPI 174


>gi|148907857|gb|ABR17052.1| unknown [Picea sitchensis]
          Length = 422

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/398 (23%), Positives = 157/398 (39%), Gaps = 58/398 (14%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y++ +   TP +I   +LD     +W  C N     Y SS+  P            LGC 
Sbjct: 44  YTVEIRQRTPLRIQRLVLDIEEDYMWVRCDNK---SYISSTYSP------------LGCS 88

Query: 149 NPKCSWIHHESIQCRDC--NDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
              C    +    C  C  +  P   +  C       + + GS   E     + L LP+ 
Sbjct: 89  AQLCKSYQYSG--CGTCYGSRGPGCNNNTCV------VAVQGSRSVE--LAQDVLVLPSS 138

Query: 207 I---------IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL-----DKFSYCL 249
                      P     C + S+R      G+AG      +LPSQL+       KF+ CL
Sbjct: 139 DGSNPGPLARFPQLAFACDLSSNRVISGTVGVAGMTSSTLALPSQLSAAEGFSRKFAMCL 198

Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
            S             L          ++ +  TP + N      + ++  +Y+G++RI V
Sbjct: 199 PSGNAPGALFFGDEPLVFLPPPGRDLSSQIIRTPLIKN------SVYTDVFYLGVQRIEV 252

Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
           GG  V +  + L  D+DG GGT + +   +T +A  ++  L   F S + K  N TR   
Sbjct: 253 GGVNVAIDAEKLRFDKDGRGGTKLSTVVRYTQLASPIYNSLEGVFTS-VAKKMNITR--- 308

Query: 370 AEALTGLRPCFDVPG---EKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
             +++    CFD  G    + G + P + +  +G +  T  +    ++V   + V     
Sbjct: 309 VASVSPFGACFDSSGVGSTRVGPAVPTIDIVLQGNSTTTWRIFGANSMVRVNNKVLCLGF 368

Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
            D   +   SI++G +QMQ+  +++DL    LGF   L
Sbjct: 369 VDGGDNLQQSIVIGTYQMQDNLLQFDLATSTLGFSSSL 406


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 106/394 (26%), Positives = 163/394 (41%), Gaps = 75/394 (19%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSRLL 145
           G Y +++  GTP   +  + DTGS L W  C     C   C S K P F P  SS+ + +
Sbjct: 130 GNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCE---PCLGSCYSQKEPKFNPSSSSTYQNV 186

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLP 204
            C +P              C D    ++ NC      Y ++YG    T+G    E   L 
Sbjct: 187 SCSSPM-------------CEDAESCSASNCV-----YSIVYGDKSFTQGFLAKEKFTLT 228

Query: 205 NR-IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDT 257
           N  ++ +   GC   +       AG+ G G GK SLP+Q      + FSYCL S      
Sbjct: 229 NSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSF----- 283

Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
           T  S+  L  GS+   +    + +TP  + PS     AF+  Y + +  I+VG + + + 
Sbjct: 284 TSNSTGHLTFGSAGISES---VKFTPISSFPS-----AFN--YGIDIIGISVGDKELAIT 333

Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
               + +     G I+DSGT FT +  +++  L   F  +M   ++ T   G        
Sbjct: 334 PNSFSTE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKS-TSGYGL-----FD 382

Query: 378 PCFDVPGEKTGSFPELKLHFKG-------GAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
            C+D  G  T ++P +   F G       G+ ++LP++         S VCL    + + 
Sbjct: 383 TCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKI--------SQVCLAFAGNDDL 434

Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
                 I GN Q     V YD+   R+GF    C
Sbjct: 435 PA----IFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 158/396 (39%), Gaps = 61/396 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++++ G P +     +DTGS L W  C     C+ C+    P + P   +++RL+ 
Sbjct: 51  GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQC--DAPCRSCNKVPHPLYRP---TANRLVP 105

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNLPN 205
           C N  C+ +H        C      + K C      Y + Y  S  ++G+ ++++ +LP 
Sbjct: 106 CANALCTALHSGQGSNNKC-----PSPKQC-----DYQIKYTDSASSQGVLINDSFSLPM 155

Query: 206 R---IIPNFLVGCSVLS------SRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
           R   I P    GC          + Q A  G+ G GRG  SL SQL     +  ++ H  
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             T     L   +    S +    +T+ P     S    +  S   Y   R + V    V
Sbjct: 216 -STNGGGFLFFGDDVVPSSR----VTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV 270

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN----RNYTRALGA 370
                            + DSG+T+T+   + ++ +       + K+     + T  L  
Sbjct: 271 -----------------VFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCW 313

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGSAVCLTVVTDR 428
           +     +  FDV  E    F  + L F     A + +P ENY  V   G+ VCL ++ D 
Sbjct: 314 KGQKAFKSVFDVKNE----FKSMFLSFASAKNAAMEIPPENYLIVTKNGN-VCLGIL-DG 367

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            A+     ++G+  MQ+  V YD    +LG+ +  C
Sbjct: 368 TAAKLSFNVIGDITMQDQMVIYDNEKSQLGWARGAC 403


>gi|326496543|dbj|BAJ94733.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326511583|dbj|BAJ91936.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 101/405 (24%), Positives = 166/405 (40%), Gaps = 76/405 (18%)

Query: 97  TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
           TP   +  +LD G   +W  C   Y                +SSS   + C +  C    
Sbjct: 51  TPQVPVTAVLDLGGASLWVDCDAGY----------------VSSSYAGVPCASKLCRL-- 92

Query: 157 HESIQCR-DCNDEPLATSKNC-TQIC---PSYLVLYGSGLTEGIALSETLNLPNRIIPN- 210
            +S+ C   C  +P   S  C    C   P   V   S  T G  +++ L++P    P  
Sbjct: 93  AKSVACATSCVGKP---SPGCLNDTCSGFPENTVTRVS--TGGNLITDVLSVPTTFRPAP 147

Query: 211 ----------FLVGCSVLSSRQPAGIAGFG---RGKTSLPSQLNLD-----KFSYCLLSH 252
                     F  G + L+    AG  G     R + +LP+QL        KF+ CL S 
Sbjct: 148 GPLATAPAFLFTCGATFLTDGLAAGATGMASLSRARFALPTQLAATFRFSRKFALCLTS- 206

Query: 253 KFDDTTRTSSLILDNGSSHSDKK----TTGLTYTPF-VNNPS---VAERNAFSVYYYVGL 304
                T  + +++   + ++ +     +  LTYTP  VNN S   V+ +   S  Y++G+
Sbjct: 207 -----TSAAGVVVFGDAPYAFQPGVDLSKSLTYTPLLVNNVSTAGVSGQKDKSNEYFIGV 261

Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
             I V G+ V +    L +D+ G GGT + +   +T +   + + + D F ++       
Sbjct: 262 TAIKVNGRAVPLNASLLAIDKQGGGGTKLSTVAPYTVLETSIHKAVTDAFAAETAMIPRV 321

Query: 365 TRALGAEALTGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENYFAVVGEGS 418
                  A+   + C+D  G K GS       P ++L  +  A   +       V  +G 
Sbjct: 322 ------RAVAPFKLCYD--GSKVGSTRVGPAVPTVELVLQNEAASWVVFGANSMVAAKGG 373

Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
           A+CL VV D  A+   S+++G   M++  +E+DL+  RLGF   L
Sbjct: 374 ALCLGVV-DGGAAPRTSVVIGGHTMEDNLLEFDLQRARLGFSSSL 417


>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 421

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 102/398 (25%), Positives = 161/398 (40%), Gaps = 57/398 (14%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
           + +G Y +++S G PP+     +DTGS L W  C     C  C+    P + P   + ++
Sbjct: 53  YPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDA--PCVSCNKVPHPLYRP---TKNK 107

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--L 201
           ++ C +  CS +H        C D P        Q C   +     G + G+ L+++  +
Sbjct: 108 IVPCVDQLCSSLHGGLSGKHKC-DSP-------KQQCDYEIKYADQGSSLGVLLTDSFAV 159

Query: 202 NLPNRII--PNFLVGCS----VLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
            L N  I  P+   GC     V SS + A   G+ G G G  SL SQL     +  ++ H
Sbjct: 160 RLANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGH 219

Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
                        DN   +S       T+ P V       R+AF  YY  G   +  GG+
Sbjct: 220 CLSIRGGGFLFFGDNLVPYSRA-----TWVPMV-------RSAFKNYYSPGTASLYFGGR 267

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR----NYTRAL 368
            + V    + L          DSG++FT+   + ++ L     S + K      + +  L
Sbjct: 268 SLGVRPMEVVL----------DSGSSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPL 317

Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGSAVCLTVVT 426
             +     +   DV  E    F  L L F  G  A + +P ENY  V   G+A CL ++ 
Sbjct: 318 CWKGKKPFKSVLDVKKE----FKSLVLSFSNGKKALMEIPPENYLIVTKFGNA-CLGILN 372

Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
             E       I+G+  MQ+  V YD    ++G+ +  C
Sbjct: 373 GSEIGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPC 410


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 158/396 (39%), Gaps = 61/396 (15%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y ++++ G P +     +DTGS L W  C     C+ C+    P + P   +++RL+ 
Sbjct: 51  GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQC--DAPCRSCNKVPHPLYRP---TANRLVP 105

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNLPN 205
           C N  C+ +H        C      + K C      Y + Y  S  ++G+ ++++ +LP 
Sbjct: 106 CANALCTALHSGQGSNNKC-----PSPKQC-----DYQIKYTDSASSQGVLINDSFSLPM 155

Query: 206 R---IIPNFLVGCSVLS------SRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
           R   I P    GC          + Q A  G+ G GRG  SL SQL     +  ++ H  
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215

Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
             T     L   +    S +    +T+ P     S    +  S   Y   R + V    V
Sbjct: 216 -STNGGGFLFFGDDVVPSSR----VTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV 270

Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN----RNYTRALGA 370
                            + DSG+T+T+   + ++ +       + K+     + T  L  
Sbjct: 271 -----------------VFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCW 313

Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGSAVCLTVVTDR 428
           +     +  FDV  E    F  + L F     A + +P ENY  V   G+ VCL ++ D 
Sbjct: 314 KGQKAFKSVFDVKNE----FKSMFLSFSSAKNAAMEIPPENYLIVTKNGN-VCLGIL-DG 367

Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            A+     ++G+  MQ+  V YD    +LG+ +  C
Sbjct: 368 TAAKLSFNVIGDITMQDQMVIYDNEKSQLGWARGAC 403


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 150/388 (38%), Gaps = 61/388 (15%)

Query: 89  YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           Y ++L FGTP  PQ++  ++DTGS + W  CT     K C   K P F P  SS+   + 
Sbjct: 131 YVVTLGFGTPSVPQVL--LMDTGSDVSWVQCTPCNSTK-CYPQKDPLFDPSKSSTYAPIA 187

Query: 147 CQNPKCSWI-HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL- 203
           C    C  +  H    C          +   TQ    Y V Y  G  + G+  +ETL L 
Sbjct: 188 CNTDACRKLGDHYHNGC----------TSGGTQC--GYSVEYADGSHSRGVYSNETLTLA 235

Query: 204 PNRIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
           P   + +F  GC     R P+    G+ G G    SL  Q +      FSYCL +     
Sbjct: 236 PGITVEDFHFGCG-RDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALN--- 291

Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
            +    L+L  GS  S  K+    +TP  + P       ++ +Y V +  I+VGG+ + +
Sbjct: 292 -SEAGFLVL--GSPPSGNKSA-FVFTPMRHLP------GYATFYMVTMTGISVGGKPLHI 341

Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
                       GG I+DSGT  T +    +  L           R   +A         
Sbjct: 342 PQSAF------RGGMIIDSGTVDTELPETAYNALEAAL-------RKALKAYPLVPSDDF 388

Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
             C++  G    + P +   F GGA + L V N   V       CL         G    
Sbjct: 389 DTCYNFTGYSNITVPRVAFTFSGGATIDLDVPNGILV-----NDCLAFQESGPDDG--LG 441

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           I+GN   +   V YD     +GF+   C
Sbjct: 442 IIGNVNQRTLEVLYDAGRGNVGFRAGAC 469


>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
          Length = 382

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 94/388 (24%), Positives = 162/388 (41%), Gaps = 53/388 (13%)

Query: 92  SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
           S + GTPPQ     +D G  LVW    +      C + ++P F P  SS+ R   C    
Sbjct: 27  SFTIGTPPQPASAFIDVGGLLVW-TQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTAL 85

Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
           C +        R+C+ +  A   + TQ+            T G   ++ + +      + 
Sbjct: 86  CEFF---PASIRNCSGDVCAYEAS-TQLFEH---------TSGKIGTDAVAIGTATAASV 132

Query: 212 LVGCSVLSSRQ-----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
             GC + S  +     P+G  G  R   SL +Q+N+  FS+CL  H      + S L L 
Sbjct: 133 AFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHCLAPHD-GGGGKNSRLFLG 191

Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
             +  +    +    TPFV +   +  +  S+YY + L  I  G + +      +T+ + 
Sbjct: 192 AAAKLAGGGKSAAMTTPFVKS---SPDDIKSLYYLINLEGIKAGDEAI------ITVPQS 242

Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPCFDVP 383
           G    ++ + +  +F+   +++ L           +  T A+G    T     +  FD+ 
Sbjct: 243 GR-TVLLQTFSPVSFLVDGVYQDL----------KKAVTAAVGGPTATPPEQFQSIFDLC 291

Query: 384 GEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR-----EASGGPSI 436
            ++ G    P++ L F+G A +T+P  NY   VG+ + VC+ + +       E +G    
Sbjct: 292 FKRGGVSGAPDVVLTFQGAAALTVPPTNYLLDVGDDT-VCVAIASSARLNSTEVAG--MS 348

Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           ILG  Q QN +  YDL  + L F+   C
Sbjct: 349 ILGGLQQQNVHFLYDLEKETLSFEAADC 376


>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
 gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
          Length = 472

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 100/394 (25%), Positives = 151/394 (38%), Gaps = 75/394 (19%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           ++++L+ GTPP    F +   S   W  C+    C    S+  P F    S+S   + C 
Sbjct: 88  FAMNLNLGTPPVQHNFTMALNSEFFWAACSPCVDCNV--STNDPLFSSASSTSYTRIPCT 145

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
           +P CS           C    + ++        SY   Y S   E  +    +  P +  
Sbjct: 146 SPFCS--TSPGFSTNACGSSAVGSTTCLYNF--SYSTDYSSA-GEMASDVVAMKTPRKTR 200

Query: 209 PN----FLVGC-----SVLSSRQPAGIAGFGRGKTSLPSQL-NLD---KFSYCLLSHKFD 255
            N      +GC     ++L     +G+ GF +   S   QL  +D   KF YC+ S  F 
Sbjct: 201 GNKSLRMSLGCGRESTTLLGILNTSGLVGFAKTDKSFIGQLAEMDYTSKFIYCVPSDTF- 259

Query: 256 DTTRTSSLILDNG--SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
               +  ++L N   SSHS      L+YTP + N +          YY+GLR I++    
Sbjct: 260 ----SGKIVLGNYKISSHSS-----LSYTPMIVNSTA--------LYYIGLRSISITDTL 302

Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE-- 371
                  L    DG GGTI+DS   F++  P+ + PL     +    N N T+    E  
Sbjct: 303 TFPVQGILA---DGTGGTIIDSTFAFSYFTPDSYTPLVQAIQNL---NSNLTKVSSNETA 356

Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
           AL G   C++V      +                          E + VCL  V D E  
Sbjct: 357 ALLGNDICYNVSVNDDDA--------------------------ENATVCL-AVGDSEKV 389

Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
           G    ++G +Q  +  VE+DL  Q +GF    C 
Sbjct: 390 GFSLNVIGTYQQLDVAVEFDLEKQEIGFGTAGCN 423


>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 104/404 (25%), Positives = 170/404 (42%), Gaps = 77/404 (19%)

Query: 91  ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
           +++S G PP +    +DTGS L W    PC  H  C   S+   P F P  S +SR + C
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58

Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
            + KC  + ++  +Q  +C ++      +CT     Y V YG+G   + G  +++TL + 
Sbjct: 59  SSVKCGELRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109

Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
           +  + + + GCS  V  S   AGI GFG             P  L+    SYCL +    
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPT---- 164

Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
           D T+   +IL       D+      YTP    +N P+          Y + +  +   GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210

Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
           R+             +   IVDSG   T + P  F  L D+ ++Q + +  Y R   A  
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259

Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
            + +  C+    + +G            + P L++ F GGA + L   N F        +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALSPRNVF-YNDPHRGL 316

Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           C+T   +       S ILGN   +++   +D++ ++ GFK  +C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAVC 357


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 101/400 (25%), Positives = 166/400 (41%), Gaps = 61/400 (15%)

Query: 84  HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
           + +G Y +++S G PP+     +DTGS L W  C     C  CS    P + P   + ++
Sbjct: 53  YPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDA--PCVSCSKVPHPLYRP---TKNK 107

Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--L 201
           L+ C +  C+ +H        C D P        Q C   +     G + G+ ++++  L
Sbjct: 108 LVPCVDQMCAALHGGLTGRHKC-DSP-------KQQCDYEIKYADQGSSLGVLVTDSFAL 159

Query: 202 NLPNRII--PNFLVGCS----VLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
            L N  I  P    GC     V SS + +   G+ G G G  SL SQL     +  ++ H
Sbjct: 160 RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGH 219

Query: 253 KFDDTTRTSSLIL--DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
               +TR    +   D+   +S       T+ P   + S   RN    YY  G   +  G
Sbjct: 220 CL--STRGGGFLFFGDDIVPYSRA-----TWAPMARSTS---RN----YYSPGSANLYFG 265

Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN----RNYTR 366
           G+ + V    +          + DSG++FT+ + + ++ L D     + KN     +++ 
Sbjct: 266 GRPLGVRPMEV----------VFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSL 315

Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGSAVCLTV 424
            L  +     +   DV  E    F  + L F  G  A + +P ENY  V   G+A CL +
Sbjct: 316 PLCWKGKKPFKSVLDVKKE----FKTVVLSFSNGKKALMEIPPENYLIVTKYGNA-CLGI 370

Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           +   E       I+G+  MQ+  V YD    ++G+ +  C
Sbjct: 371 LNGSEVGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPC 410


>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 94/403 (23%), Positives = 159/403 (39%), Gaps = 74/403 (18%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
           G Y++SL+ G PP++    +DTGS L W  C     CK C+  +   + P       L+ 
Sbjct: 62  GYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCD--APCKGCTLPRNRLYKPH----GDLVK 115

Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
           C +P C+ I          N      ++ C      Y V Y   G + G+ L +  N+P 
Sbjct: 116 CVDPLCAAIQSAP------NHHCAGPNEQC-----DYEVEYADQGSSLGVLLRD--NIPL 162

Query: 206 RII------PNFLVGCSVLSSRQ-------PAGIAGFGRGKTSLPSQLN-----LDKFSY 247
           +        P    GC    +          AG+ G G G+TS+ SQL+      +   +
Sbjct: 163 KFTNGSLARPMLAFGCGYDQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGH 222

Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
           CL             LI            +G+ +TP + + S           +   +  
Sbjct: 223 CLSGRGGGFLFFGDQLI----------PPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTT 272

Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
           +V G  +                 I DSG+++T+   +  + L +  ++  ++ +  +RA
Sbjct: 273 SVKGLEL-----------------IFDSGSSYTYFNSQAHKALVN-LIANDLRGKPLSRA 314

Query: 368 LGAEAL----TGLRPCFDVPGEKTGSFPELKLHF--KGGAEVTLPVENYFAVVGEGSAVC 421
            G  +L     G +P F    + T +F  L L F     + + LP E Y  V   G+ VC
Sbjct: 315 TGDPSLPICWKGPKP-FKSLHDVTSNFKPLLLSFTKSKNSPLQLPPEAYLIVTKHGN-VC 372

Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
           L ++   E   G + I+G+  +Q+  V YD   Q++G+    C
Sbjct: 373 LGILDGTEIGLGNTNIIGDISLQDKLVIYDNEKQQIGWASANC 415


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/383 (23%), Positives = 144/383 (37%), Gaps = 61/383 (15%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y + L  GTPP  I  I+DTGS + W  C     C +C     P F P  SS+ +   C 
Sbjct: 65  YLMKLQVGTPPFEIQAIIDTGSEITWTQC---LPCVHCYEQNAPIFDPSKSSTFKEKRCD 121

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
              C +     +   D                  +    G+  TE I L  T   P  ++
Sbjct: 122 GHSCPY----EVDYFD------------------HTYTMGTLATETITLHSTSGEP-FVM 158

Query: 209 PNFLVGCSVLSSR-QPA--GIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTRTSS 262
           P  ++GC   +S  +P+  G+ G   G +SL +Q+  +     SYC              
Sbjct: 159 PETIIGCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQ---------- 208

Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
                G+S  +     +     V + ++    A   +YY+ L  ++VG  R+       T
Sbjct: 209 -----GTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMG---T 260

Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
                 G  ++DSGTT T+  P  +  L  + V  +V         G + L     C++ 
Sbjct: 261 TFHALEGNIVIDSGTTLTYF-PVSYCNLVRQAVEHVVTAVRAADPTGNDML-----CYN- 313

Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
             +    FP + +HF GG ++ L   N +     G   CL ++ +         I GN  
Sbjct: 314 -SDTIDIFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQ---EAIFGNRA 369

Query: 443 MQNYYVEYDLRNQRLGFKQQLCK 465
             N+ V YD  +  + F    C 
Sbjct: 370 QNNFLVGYDSSSLLVSFSPTNCS 392


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 97/391 (24%), Positives = 156/391 (39%), Gaps = 64/391 (16%)

Query: 89  YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
           Y  +L+ GTPPQ    I+      VW  C+    C+ C    +P F    SS+ R   C 
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCS---PCRRCFKQDLPLFNRSASSTYRPEPCG 84

Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
              C     ES+    C+ +          +C SY V    G T GI  ++T  +     
Sbjct: 85  TALC-----ESVPASTCSGD---------GVC-SYEVETMFGDTSGIGGTDTFAI-GTAT 128

Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
            +   GC++ S+ +     +G+ G GR   SL  Q+N   FSYCL  H      + S+L+
Sbjct: 129 ASLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG--AAGKKSALL 186

Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
           L   +  +  K+     TP VN       +  S  Y + L  I  G          + + 
Sbjct: 187 LGASAKLAGGKSAA--TTPLVNT------SDDSSDYMIHLEGIKFGD---------VIIA 229

Query: 325 RDGNGGTI-VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
              NG  + VD+    +F+    F+ +           +  T A+GA  +      FD+ 
Sbjct: 230 PPPNGSVVLVDTIFGVSFLVDAAFQAI----------KKAVTVAVGAAPMATPTKPFDLC 279

Query: 384 GEKTGS---------FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
             K  +          P++ L F+G A +T+P   Y    G G+ VCL +++    +   
Sbjct: 280 FPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPSKYMYDAGNGT-VCLAMMSSAMLNLTT 338

Query: 435 SI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
            + ILG    +N +  +DL  + L F+   C
Sbjct: 339 ELSILGRLHQENIHFLFDLDKETLSFEPADC 369


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 156/386 (40%), Gaps = 54/386 (13%)

Query: 87  GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
           G Y   +  GTP +    ++DTGS L W  C+    C   C     P F PK SSS   +
Sbjct: 125 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCS---PCVVSCHRQSGPVFNPKASSSYASV 181

Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
            C   +CS +   ++    C+     TS  C      Y   YG S  + G    +T++  
Sbjct: 182 SCSAQQCSDLTTATLNPASCS-----TSNVCI-----YQASYGDSSFSVGYLSKDTVSFG 231

Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
           +  +PNF  GC   +     Q AG+ G  R K SL  QL       FSYCL +     + 
Sbjct: 232 STSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSG 291

Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
             S    + G           +YTP      +A  +     Y++ +  I V G+ + V  
Sbjct: 292 YLSIGSYNPGQ---------YSYTP------MASSSLDDSLYFIKMTGIKVAGKPLSVSS 336

Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
              +        TI+DSGT  T +   ++  L+      M   +   R   A A + L  
Sbjct: 337 SAYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAM---KGTPR---ASAFSILDT 385

Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
           CF     +    PE+ + F GGA + L   N    V + +  CL     R A+     I+
Sbjct: 386 CFQGQAARL-RVPEVTMAFAGGAALKLAARNLLVDV-DSATTCLAFAPARSAA-----II 438

Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
           GN Q Q + V YD++N ++GF    C
Sbjct: 439 GNTQQQTFSVVYDVKNSKIGFAAAGC 464


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.134    0.407 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,464,278,262
Number of Sequences: 23463169
Number of extensions: 323153872
Number of successful extensions: 1304202
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 757
Number of HSP's successfully gapped in prelim test: 2084
Number of HSP's that attempted gapping in prelim test: 1292431
Number of HSP's gapped (non-prelim): 7151
length of query: 465
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 319
effective length of database: 8,933,572,693
effective search space: 2849809689067
effective search space used: 2849809689067
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)