BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 009271
         (538 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score =  652 bits (1681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/503 (63%), Positives = 392/503 (77%), Gaps = 10/503 (1%)

Query: 13  CILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKR 72
           C   + S  ++FSSKL+HRFSDEAK   IS+ GN S  D WPK+ S EY +LLL ND KR
Sbjct: 17  CCQFEASIGLTFSSKLIHRFSDEAKSISISRKGNAS-GDLWPKRYSFEYFQLLLGNDLKR 75

Query: 73  QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSN 132
           Q+ ++       S +NQLLFPS+GSQ  FFGN+  WLHYTWIDIGTPNVSFLVALDAGS+
Sbjct: 76  QRMKL------GSQKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSD 129

Query: 133 LLWVPCQCIQCAPLSASYYT-SLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP 191
           LLWVPC CIQCAPLSASYY  SLDR+LSEY PS SS+S+++SC H LC+  S+CK+ KDP
Sbjct: 130 LLWVPCDCIQCAPLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDP 189

Query: 192 CPYIADYST-EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
           CPYI +Y   E+T+S+G+LV+D LHLAS   H  +  +Q+SV++GCGRKQ GS+ DGAAP
Sbjct: 190 CPYIFNYDDFENTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAP 249

Query: 251 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 310
           DGVMGLG GD+SVPSLLAKAGLIQN FS+CFDENDSG + FGD+G A+QQST FLPI   
Sbjct: 250 DGVMGLGPGDISVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGT 309

Query: 311 YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
           Y AYFVGVESYC+GNSCL +SGF+ALVDSG+SFT+LP+E+Y E+V +FDK V++KRIS Q
Sbjct: 310 YVAYFVGVESYCVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQ 369

Query: 371 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
              W YCYNASS+E+  +P ++L F +NQ+FVV N  +S P ++GFT+FCL++  TDG Y
Sbjct: 370 DGLWDYCYNASSQELHDIPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGSY 429

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQST 490
           GIIGQNFM+G+R+VFD ENLKL WS+S C++  D + VHL PPP  +SPNPLPT EQQS 
Sbjct: 430 GIIGQNFMIGYRMVFDIENLKLGWSNSSCQDTSDSADVHLAPPPDNKSPNPLPTNEQQSI 489

Query: 491 SNGQAAAPPSTAKTAPSKSIAAS 513
               + AP    +T+ S+S AAS
Sbjct: 490 PRTPSVAPAVAGRTS-SESSAAS 511


>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 535

 Score =  640 bits (1651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/526 (59%), Positives = 392/526 (74%), Gaps = 15/526 (2%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
           L  IC    C L + S  ++FSSKL+HRFS+EAK   IS + NVS + +WP KNS +YL+
Sbjct: 7   LFVICF---CFLSNHSIGLTFSSKLIHRFSEEAKSLLISGNDNVS-SQTWPNKNSFQYLQ 62

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LLL ND KRQK ++  Q       NQLLFPS GS T F+GN   WLHYTWIDIGTPNVSF
Sbjct: 63  LLLDNDLKRQKMKLGAQ-------NQLLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSF 115

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
           LVALDAGS+L WVPC CIQCAPLSAS Y  LDR+LSEY PS S++S+++SC+H LC+  S
Sbjct: 116 LVALDAGSDLSWVPCDCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGS 175

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK--HAPQSSVQSSVIIGCGRKQT 241
            CK+LKDPCPYIADY+  +TSSSG+LV+DILHLAS S   ++ Q  VQ+SVI+GCGRKQT
Sbjct: 176 HCKNLKDPCPYIADYADPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQT 235

Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQS 301
           G YLDGAAPDGVMGLG G +SVPSLLAKAGLI+ SFS+CFD N SG++ FGDQG  +Q+S
Sbjct: 236 GGYLDGAAPDGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKS 295

Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
           T  LP    YDAY + VESYC+GNSCL QSGF+ALVDSGASFT+LP ++Y ++V++FDK 
Sbjct: 296 TPLLPTQGNYDAYLIEVESYCVGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQ 355

Query: 362 VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL 421
           V+++RIS QG  W YCYN SS+++  VP MRL F  NQS ++ N  +  P+N+ F VFCL
Sbjct: 356 VNAQRISSQGGPWNYCYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAVFCL 415

Query: 422 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNP 481
           T+  TD +YGIIGQN+M G+R+VFD ENLKL WS S C+++ D++ V L P P  QSPNP
Sbjct: 416 TLQPTDLNYGIIGQNYMTGYRVVFDMENLKLGWSSSNCKDISDETEVTLAPSPNDQSPNP 475

Query: 482 LPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASAQQLDSVLRVACSL 527
           LPT EQQS  N Q  AP    +T+   S+A  +Q +  +L +  S+
Sbjct: 476 LPTNEQQSVPNKQGVAPAVAGRTSSKHSVA--SQHIPCLLHLISSV 519


>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 531

 Score =  632 bits (1629), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 306/512 (59%), Positives = 392/512 (76%), Gaps = 11/512 (2%)

Query: 3   NLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
           +L+ + M +  +++D + AV+FSSKL+HRFSDEAK  ++S++GN+  ADSWPKK S +Y 
Sbjct: 5   SLIPLLMAY-LLVVDAAIAVTFSSKLIHRFSDEAKAFFVSRNGNI-FADSWPKKRSFDYY 62

Query: 63  ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
            LLLS+D KRQK ++        +  QLLFPSEGS   F GN+F WLHYTWIDIGTPNVS
Sbjct: 63  RLLLSSDLKRQKLKL-------GAEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVS 115

Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
           FLVALDAGS+LLWVPC C+QCAPLSASYY  L R+L+EY PS SS+SK +SC+  LC+  
Sbjct: 116 FLVALDAGSDLLWVPCDCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELG 175

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
           S CKS KDPCPY+A Y +E+TSSSG L++D LHLA FS+HA +SSV +SVIIGCGRKQ+G
Sbjct: 176 SDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSG 235

Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQST 302
           ++ DGAAPDG+MGLG GD+SVPSLLAKAGL++N+FSICFD+N SG++ FGDQG  TQ+ST
Sbjct: 236 AFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKST 295

Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
           SF+P+  K+  Y + VE Y +G+S L  +GFQALVDSG SFTFLP EIY ++VV+FDK V
Sbjct: 296 SFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQV 355

Query: 363 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCL 421
           ++ R S +G+ WKYCYN+SS+E+L +P + L+F+ NQSF+V N +     ENE F VFCL
Sbjct: 356 NATRSSFKGSPWKYCYNSSSQELLNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCL 415

Query: 422 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNP 481
            +     ++GIIGQNFM G+R+VFDRENLKL WS S C+++ D   +HL PPP  +SPNP
Sbjct: 416 PIQPIHEEFGIIGQNFMWGYRMVFDRENLKLGWSTSNCQDITDGKIMHLTPPPNDRSPNP 475

Query: 482 LPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 513
           LPT +QQ T +  A AP    +T P+KS A S
Sbjct: 476 LPTNQQQMTPSRHAVAPAVAGRT-PAKSAAVS 506


>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  630 bits (1625), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 304/501 (60%), Positives = 386/501 (77%), Gaps = 10/501 (1%)

Query: 14  ILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQ 73
           +++D + AV+FSSKL+HRFSDEAK  ++S++GN+  ADSWPKK S +Y  LLLS+D KRQ
Sbjct: 5   LVVDAAIAVTFSSKLIHRFSDEAKAFFVSRNGNI-FADSWPKKRSFDYYRLLLSSDLKRQ 63

Query: 74  KTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNL 133
           K ++        +  QLLFPSEGS   F GN+F WLHYTWIDIGTPNVSFLVALDAGS+L
Sbjct: 64  KLKL-------GAEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDL 116

Query: 134 LWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP 193
           LWVPC C+QCAPLSASYY  L R+L+EY PS SS+SK +SC+  LC+  S CKS KDPCP
Sbjct: 117 LWVPCDCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCP 176

Query: 194 YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGV 253
           Y+A Y +E+TSSSG L++D LHLA FS+HA +SSV +SVIIGCGRKQ+G++ DGAAPDG+
Sbjct: 177 YLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGL 236

Query: 254 MGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA 313
           MGLG GD+SVPSLLAKAGL++N+FSICFD+N SG++ FGDQG  TQ+STSF+P+  K+  
Sbjct: 237 MGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVT 296

Query: 314 YFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS 373
           Y + VE Y +G+S L  +GFQALVDSG SFTFLP EIY ++VV+FDK V++ R S +G+ 
Sbjct: 297 YLIEVEGYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSP 356

Query: 374 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGI 432
           WKYCYN+SS+E+L +P + L+F+ NQSF+V N +     ENE F VFCL +     ++GI
Sbjct: 357 WKYCYNSSSQELLNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGI 416

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 492
           IGQNFM G+R+VFDRENLKL WS S C+++ D   +HL PPP  +SPNPLPT +QQ T +
Sbjct: 417 IGQNFMWGYRMVFDRENLKLGWSTSNCQDITDGKIMHLTPPPNDRSPNPLPTNQQQMTPS 476

Query: 493 GQAAAPPSTAKTAPSKSIAAS 513
             A AP    +T P+KS A S
Sbjct: 477 RHAVAPAVAGRT-PAKSAAVS 496


>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
          Length = 530

 Score =  590 bits (1522), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 287/515 (55%), Positives = 371/515 (72%), Gaps = 11/515 (2%)

Query: 1   MVNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVE 60
           M     + M    +L++   A  FS++L+HRFSDE K    ++SG   ++ SWP+  ++E
Sbjct: 1   MAARFLVAMSVVVLLIESCMAAMFSARLIHRFSDEVKAFRAARSG---LSGSWPEWRTME 57

Query: 61  YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
           Y ++L+ +DW+RQK  +        S+ Q LFPSEGS+T  FGN + WLHYTWIDIGTPN
Sbjct: 58  YYKMLVRSDWERQKVML-------GSKYQFLFPSEGSKTMSFGNDYGWLHYTWIDIGTPN 110

Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
           +SFLVALDAGS+LLW+PC CIQCAPLSASYY SLDR+L++Y PS SS+SK++SCSH LC+
Sbjct: 111 ISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLCE 170

Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
           S  +C S K  CPY  +Y +E+TSSSG L++DILHL S    A  SSV++ VIIGCG +Q
Sbjct: 171 SSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAPVIIGCGMRQ 230

Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
           TG YLDG APDG+MGLGLG++SVPS L+KAGL++NSFS+CF+++DSG +FFGDQG ATQQ
Sbjct: 231 TGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSGRIFFGDQGLATQQ 290

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
           +T FLP   KY+ Y VGVE+ CIG+SC+ Q+ F+ALVDSGASFTFLP E Y  VV +FDK
Sbjct: 291 TTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTFLPDESYRNVVDEFDK 350

Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 420
            V++ R S +G  W+YCY +SS+E+LK P + L F+ N SFVV N +F     +G   FC
Sbjct: 351 QVNATRFSFEGYPWEYCYKSSSKELLKNPSVILKFALNNSFVVHNPVFVVHGYQGVVGFC 410

Query: 421 LTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPN 480
           L +   DGD GI+GQNFM G+R+VFDRENLKL WS S C+++ D   + L P P  + PN
Sbjct: 411 LAIQPADGDIGILGQNFMTGYRMVFDRENLKLGWSRSNCQDLTDGERMPLTPSPNDRPPN 470

Query: 481 PLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASAQ 515
           PLP  EQQ+T +G     P+ A  APS   AAS Q
Sbjct: 471 PLPANEQQNTHSGHTIT-PAVAGRAPSNPSAASTQ 504


>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 542

 Score =  589 bits (1518), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 288/511 (56%), Positives = 371/511 (72%), Gaps = 11/511 (2%)

Query: 9   MLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSN 68
           ++   +L+D S  V+FSS+L+HRFSDE K   +S+  ++S   SWP+K S++Y ++L+++
Sbjct: 21  LVMASLLIDKSAEVTFSSRLIHRFSDEVKALRVSRKDSLSY--SWPEKKSMDYYQILVNS 78

Query: 69  DWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALD 128
           D++RQK ++  Q        Q LFPS+GS+T   G+ F WLHYTWIDIGTP+VSFLVALD
Sbjct: 79  DFQRQKMKLGPQY-------QFLFPSQGSKTMSLGDDFGWLHYTWIDIGTPHVSFLVALD 131

Query: 129 AGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSL 188
           AGS+LLWVPC C+QCAPLSASYY+SLDR+L+EY PS SS+SK++SCSH LC+   +C S 
Sbjct: 132 AGSDLLWVPCDCLQCAPLSASYYSSLDRDLNEYSPSHSSTSKHLSCSHQLCELGPNCNSP 191

Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
           K PCPY  DY TE+TSSSG LV+DILHLAS   +A   SV++ V+IGCG KQ+G YLDG 
Sbjct: 192 KQPCPYSMDYYTENTSSSGLLVEDILHLASNGDNALSYSVRAPVVIGCGMKQSGGYLDGV 251

Query: 249 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIG 308
           APDG+MGLGL ++SVPS LAKAGLI+NSFS+CFDE+DSG +FFGDQGP TQQST FL + 
Sbjct: 252 APDGLMGLGLAEISVPSFLAKAGLIRNSFSMCFDEDDSGRIFFGDQGPTTQQSTPFLTLD 311

Query: 309 EKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
             Y  Y VGVE +C+G+SCL Q+ F+ALVD+G SFTFLP  +Y  +  +FD+ V++   S
Sbjct: 312 GNYTTYVVGVEGFCVGSSCLKQTSFRALVDTGTSFTFLPNGVYERITEEFDRQVNATISS 371

Query: 369 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 428
             G  WKYCY +SS  + KVP ++LIF  N SFV+ N +F     +G T FCL +  T+G
Sbjct: 372 FNGYPWKYCYKSSSNHLTKVPSVKLIFPLNNSFVIHNPVFMIYGIQGITGFCLAIQPTEG 431

Query: 429 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQ 488
           D G IGQNFM G+R+VFDREN+KL WSHS CE+  +   + L   P G   NPLPT EQQ
Sbjct: 432 DIGTIGQNFMAGYRVVFDRENMKLGWSHSSCEDRSNDKRMPLT-SPNGTLVNPLPTNEQQ 490

Query: 489 STSNGQAAAPPSTAKTAPSKSIAASAQQLDS 519
           S+  G A + P+ A  APSK  AA+ Q L S
Sbjct: 491 SSPGGHAVS-PAVAGRAPSKPSAAAVQLLPS 520


>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 511

 Score =  587 bits (1513), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 284/495 (57%), Positives = 364/495 (73%), Gaps = 11/495 (2%)

Query: 21  AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
           A  FS++L+HRFSDE K    ++SG   ++ SWP+  ++EY ++L+ +DW+RQK  +   
Sbjct: 2   AAMFSARLIHRFSDEVKAFRAARSG---LSGSWPEWRTMEYYKMLVRSDWERQKVML--- 55

Query: 81  SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
                S+ Q LFPSEGS+T  FGN + WLHYTWIDIGTPN+SFLVALDAGS+LLW+PC C
Sbjct: 56  ----GSKYQFLFPSEGSKTMSFGNDYGWLHYTWIDIGTPNISFLVALDAGSDLLWIPCDC 111

Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
           IQCAPLSASYY SLDR+L++Y PS SS+SK++SCSH LC+S  +C S K  CPY  +Y +
Sbjct: 112 IQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYS 171

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
           E+TSSSG L++DILHL S    A  SSV++ VIIGCG +QTG YLDG APDG+MGLGLG+
Sbjct: 172 ENTSSSGLLIEDILHLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGE 231

Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           +SVPS L+KAGL++NSFS+CF+++DSG +FFGDQG ATQQ+T FLP   KY+ Y VGVE+
Sbjct: 232 ISVPSFLSKAGLVKNSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEA 291

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
            CIG+SC+ Q+ F+ALVDSGASFTFLP E Y  VV +FDK V++ R S +G  W+YCY +
Sbjct: 292 CCIGSSCIKQTSFRALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKS 351

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
           SS+E+LK P + L F+ N SFVV N +F     +G   FCL +   DGD GI+GQNFM G
Sbjct: 352 SSKELLKNPSVILKFALNNSFVVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTG 411

Query: 441 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 500
           +R+VFDRENLKL WS S C+++ D   + L P P  + PNPLP  EQQ+T +G     P+
Sbjct: 412 YRMVFDRENLKLGWSRSNCQDLTDGERMPLTPSPNDRPPNPLPANEQQNTHSGHTIT-PA 470

Query: 501 TAKTAPSKSIAASAQ 515
            A  APS   AAS Q
Sbjct: 471 VAGRAPSNPSAASTQ 485


>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 880

 Score =  586 bits (1510), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 278/497 (55%), Positives = 364/497 (73%), Gaps = 14/497 (2%)

Query: 16  LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVA-DSWPKKNSVEYLELLLSNDWKRQK 74
           ++G+  V+FSS+L+HRFS+EAK    S+  + SV   +WP++NS EY  LLL +D  RQ+
Sbjct: 17  MEGAVGVTFSSRLIHRFSEEAKAHLASRGSDGSVLLQAWPERNSSEYFRLLLRSDVTRQR 76

Query: 75  TRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
            R+        S+ ++L+P EG QT  FGN  YWLHYTWIDIGTPNVSFLVALDAGS++L
Sbjct: 77  MRL-------GSQYEMLYPFEGGQTFLFGNALYWLHYTWIDIGTPNVSFLVALDAGSDML 129

Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
           WVPC CI+CA LSA  Y  LDR+L++Y PS S++S+++ C H LC   S CK  KDPCPY
Sbjct: 130 WVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSVCKGSKDPCPY 189

Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
              YS+ +TSSSGY+ +D LHL S  KHA Q+SVQ+S+I+GCGRKQTG YL GA PDGV+
Sbjct: 190 AVQYSSANTSSSGYVFEDKLHLTSNGKHAEQNSVQASIILGCGRKQTGEYLRGAGPDGVL 249

Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY 314
           GLG G++SVPSLLAKAGLIQNSFSICF+EN+SG + FGDQG  TQ ST FLPI  K++AY
Sbjct: 250 GLGPGNISVPSLLAKAGLIQNSFSICFEENESGRIIFGDQGHVTQHSTPFLPIDGKFNAY 309

Query: 315 FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
            VGVES+C+G+ CL ++ FQAL+DSG+SFTFLP E+Y +VV++FDK V++  I LQ NSW
Sbjct: 310 IVGVESFCVGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVIEFDKQVNATSIVLQ-NSW 368

Query: 375 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 434
           +YCYNASS+E++ +P + L FS+NQ+++++N IF  P ++ +T+FCL V  +D DY  IG
Sbjct: 369 EYCYNASSQELISIPPLNLAFSRNQTYLIQNPIFIDPASQEYTIFCLPVSPSDDDYAAIG 428

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQ 494
           QNF+MG+R+VFDRENL+ +WS   C++    S      P +  SPNPLP  +QQS  N  
Sbjct: 429 QNFLMGYRMVFDRENLRFSWSRWNCQDRASFS-----SPYSVGSPNPLPVDQQQSFPNAH 483

Query: 495 AAAPPSTAKTAPSKSIA 511
              P     T+P  S A
Sbjct: 484 GIPPAIAGHTSPKPSAA 500


>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 525

 Score =  577 bits (1486), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 282/522 (54%), Positives = 369/522 (70%), Gaps = 22/522 (4%)

Query: 16  LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVA-DSWPKKNSVEYLELLLSNDWKRQK 74
           ++G+   +FSS+L+HRFS+EAK    S+    SV   +WP++NS EY  LLL +D  RQ+
Sbjct: 17  MEGAVGATFSSRLIHRFSEEAKAHLASRGNKSSVLLQAWPQRNSSEYFRLLLRSDVARQR 76

Query: 75  TRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
            R+        S+ + L+PSEG QT FFGN  YWLHYTWIDIGTPNVSFLVALDAGS++L
Sbjct: 77  MRL-------GSQYETLYPSEGGQTFFFGNALYWLHYTWIDIGTPNVSFLVALDAGSDML 129

Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
           WVPC CI+CA LSA  Y  LDR+L++Y PS S++S+++ C H LC   S CK  KDPCPY
Sbjct: 130 WVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSFCKGSKDPCPY 189

Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
              Y++ +TSSSGY+ +D LHL S  KHA Q+SVQ+S+I+GCGRKQTG YL GA PDGV+
Sbjct: 190 EVQYASANTSSSGYVFEDKLHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGPDGVL 249

Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY 314
           GLG G++SVPSLLAKAGLIQNSFSIC DEN+SG + FGDQG  TQ ST FLPI     AY
Sbjct: 250 GLGPGNISVPSLLAKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPFLPI----IAY 305

Query: 315 FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
            VGVES+C+G+ CL ++ FQAL+DSG+SFTFLP E+Y +VV +FDK V++ RI LQ +SW
Sbjct: 306 MVGVESFCVGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQ-SSW 364

Query: 375 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMSTDGDYGI 432
           +YCYNASS+E++ +P ++L FS+NQ+F+++N IF  P  + + +T+FCL V  +  DY  
Sbjct: 365 EYCYNASSQELVNIPPLKLAFSRNQTFLIQNPIFYDPASQEQEYTIFCLPVSPSADDYAA 424

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 492
           IGQNF+MG+R+VFDRENL+  WS   C++           P  G SPNPLP  +QQ+  N
Sbjct: 425 IGQNFLMGYRLVFDRENLRFGWSRWNCQD-----RASFTSPSNGGSPNPLPANQQQTVPN 479

Query: 493 GQAAAPPSTAKTAPSKSIAASAQQLDSVLRVACSLLVLMCLL 534
            +   P     T+P  S A     L +  R + + L+L+C L
Sbjct: 480 ARGVPPAIAGHTSPKPSAATPG--LVTTSRHSLASLLLICHL 519


>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 523

 Score =  567 bits (1462), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 273/510 (53%), Positives = 367/510 (71%), Gaps = 22/510 (4%)

Query: 1   MVNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISK-SGNVSVADSWPKKNSV 59
           M N   + +    + ++ S A++ S  LVHRFSDEAK  W S+ +GNVS A  WP  NS+
Sbjct: 1   MANCALLLLFIASLFVNCSLALTLSLNLVHRFSDEAKSLWESRRTGNVS-AKFWPPTNSL 59

Query: 60  EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
           +Y ++L+  D KR++  +        S+  +LFPSEGSQ  FFGN+F WLHYTWID+GTP
Sbjct: 60  KYFQMLMDYDLKRRRLNI-------GSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTP 112

Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
           +V FLVALD GS+LLWVPC CIQCAPLSA+YY+ LDR+LSEY+P+ SS+SK++ C H LC
Sbjct: 113 SVPFLVALDVGSDLLWVPCDCIQCAPLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLC 172

Query: 180 KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
              ++CKS  DPC Y  DY +++TS+SG++++D L L SFSKH   S +Q+SV+ GCGRK
Sbjct: 173 AWSTTCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRK 232

Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
           Q+GSYLDGAAPDGVMGLG G++SVP+LLA+ GL++N+FS+CFD N SG + FGD GPATQ
Sbjct: 233 QSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQ 292

Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFD 359
           Q+T FLP+  ++ AYF+GVES+C+G+SCL +SGFQALVDSG+SFT+LP E+Y ++V +FD
Sbjct: 293 QTTQFLPLFGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFD 352

Query: 360 KL--VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT 417
           K   V++ RI L+   W YCYN S+     +P M+L+F  NQ F + + ++  P N+G+ 
Sbjct: 353 KQVKVNATRIVLRELPWNYCYNISTLVSFNIPSMQLVFPLNQIF-IHDPVYVLPANQGYK 411

Query: 418 VFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP--- 474
           VFCLT+  TD DYG+IGQN M+G+R+VFDRENLKL WS SKC ++   +  H  PP    
Sbjct: 412 VFCLTLEETDEDYGVIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSSTTEHAKPPSNNG 471

Query: 475 AGQSPNPLPTTEQQSTSNGQAAAPPSTAKT 504
             +SP  LP T +Q       A  P+ A+T
Sbjct: 472 NAKSPIALPPTNRQ-------AIAPTAART 494


>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
 gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
          Length = 492

 Score =  566 bits (1459), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 279/488 (57%), Positives = 349/488 (71%), Gaps = 11/488 (2%)

Query: 20  DAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKL 79
           +  +FSS+L+HRFS E KE  +S+ G+V+    WP+K S EY ++L+S+D KRQK ++  
Sbjct: 16  ELATFSSRLIHRFSKEYKEVSVSRGGDVN-GTWWPEKKSKEYYQILVSSDLKRQKLKL-- 72

Query: 80  QSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ 139
                    QLLFPS+GS+T   GN F WLHYTWIDIGTP+VSF+VALD+GS+L WVPC 
Sbjct: 73  -----GPHYQLLFPSQGSKTMSLGNDFGWLHYTWIDIGTPHVSFMVALDSGSDLFWVPCD 127

Query: 140 CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYS 199
           C+QCAPLSAS+Y+SLDR+LSEY PS SS+SK +SCSH LC    +CK+ K  CPY  +Y 
Sbjct: 128 CVQCAPLSASHYSSLDRDLSEYSPSQSSTSKQLSCSHRLCDMGPNCKNPKQSCPYSINYY 187

Query: 200 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 259
           TE TSSSG LV+DI+HLAS       +SV++ VIIGCG KQ+G YLDG APDG++GLGL 
Sbjct: 188 TESTSSSGLLVEDIIHLASGGDDTLNTSVKAPVIIGCGMKQSGGYLDGVAPDGLLGLGLQ 247

Query: 260 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 319
           ++SVPS LAKAGLIQNSFS+CF+E+DSG +FFGDQGPATQQS  FL +   Y  Y VGVE
Sbjct: 248 EISVPSFLAKAGLIQNSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKLNGNYTTYIVGVE 307

Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
             C+G SCL QS F ALVDSG SFTFLP +++  +  +FD  V++ R S +G SWKYCY 
Sbjct: 308 VCCVGTSCLKQSSFSALVDSGTSFTFLPDDVFEMIAEEFDTQVNASRSSFEGYSWKYCYK 367

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 439
            SS+++ K+P +RLIF +N SF+V+N +F     +G   FCL +   DGD G IGQNFMM
Sbjct: 368 TSSQDLPKIPSLRLIFPQNNSFMVQNPVFMIYGIQGVIGFCLAIQPADGDIGTIGQNFMM 427

Query: 440 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 499
           G+R+VFDRENLKL WS S CE         L   P+G   NPLPT EQQST  G A + P
Sbjct: 428 GYRVVFDRENLKLGWSRSNCE--FSGISYTLPLTPSGTPQNPLPTNEQQSTPGGHAVS-P 484

Query: 500 STAKTAPS 507
           + A  APS
Sbjct: 485 AVAVNAPS 492


>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 532

 Score =  565 bits (1456), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 276/494 (55%), Positives = 365/494 (73%), Gaps = 10/494 (2%)

Query: 21  AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
           +++F+S+++HRFS+E K    S S N SV  SWP+K S+EY + L+S D++RQK ++   
Sbjct: 21  SITFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGDFRRQKMKL--- 77

Query: 81  SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
                SR QLLFPSEGS+T   GN F WLHYTWIDIGTP+VSFLVALDAGS+LLWVPC C
Sbjct: 78  ----GSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNC 133

Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
           IQCAPLSASYY SLD++L+EY PSSSS+SK++SCSH LC S  SC+S K  CPY+ DY T
Sbjct: 134 IQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYIT 193

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
           E+TSSSG L+ D+LHL+S  +++   ++Q+ VI+GCG KQ+G YL G APDG+ GLGLG+
Sbjct: 194 ENTSSSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGE 253

Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           +SV S LAK  L+QNSFS+CF+E+ SG +FFGD+GPA+QQ+TSF+P+  KY+ Y VGVE+
Sbjct: 254 ISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEA 313

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYN 379
            CI NSCL Q+ F+AL+DSG SFT+LP E Y  +V++FDK L ++  +S +G  WKYCY 
Sbjct: 314 CCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYK 373

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 439
            S++ M KVP + L+F  N SFVV + +F    ++G   FC  ++  DGD GI+GQN+M 
Sbjct: 374 ISADAMPKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMT 433

Query: 440 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 499
           G+R+VFDR+NLKL WSH+ C+++ ++  + L P      PNPLP  EQQS S G A A P
Sbjct: 434 GYRMVFDRDNLKLGWSHANCQDLSNEKKMPLTPAKE-TPPNPLPADEQQSASGGHAVA-P 491

Query: 500 STAKTAPSKSIAAS 513
           + A  APSK  AA+
Sbjct: 492 AVAGRAPSKPSAAT 505


>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 521

 Score =  540 bits (1391), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 272/498 (54%), Positives = 349/498 (70%), Gaps = 18/498 (3%)

Query: 22  VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQS 81
           ++FS++LVHRF+DE K               WP + S+ Y ++LL+ D  R+K +V    
Sbjct: 22  ITFSARLVHRFADEMKPV-------RPPTGYWPDQRSMRYYQMLLTGDILRRKIKV---- 70

Query: 82  NNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCI 141
               +R QLLFPS GS+T   GN F WLHYTWIDIGTP+ SFLVALDAGS+LLW+PC C+
Sbjct: 71  --GGTRYQLLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCV 128

Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 201
           QCAPLS+SYY++LDR+L+EY PS S SSK++SCSH LC   S+CKS +  CPY+  Y +E
Sbjct: 129 QCAPLSSSYYSNLDRDLNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSE 188

Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
           +TSSSG LV+DILHL S    +  SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+ 
Sbjct: 189 NTSSSGLLVEDILHLQSGGTLS-NSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGES 247

Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
           SVPS LAK+GLI  SFS+CF+E+DSG +FFGDQGP +QQSTSFLP+   Y  Y +GVES 
Sbjct: 248 SVPSFLAKSGLIHYSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESC 307

Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
           CIGNSCL  + F+A VDSG SFTFLP  +Y  +  +FD+ V+  R S +G+ W+YCY  S
Sbjct: 308 CIGNSCLKMTSFKAQVDSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSPWEYCYVPS 367

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
           S+++ KVP   L+F +N SFVV + +F F  NEG   FCL ++ T+GD G IGQNFM G+
Sbjct: 368 SQDLPKVPSFTLMFQRNNSFVVYDPVFVFYGNEGVIGFCLAILPTEGDMGTIGQNFMTGY 427

Query: 442 RIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPST 501
           R+VFDR N KLAWS S C+++     + L P     S NPLPT EQQ T NG A A P+ 
Sbjct: 428 RLVFDRGNKKLAWSRSNCQDLSLGKRMPLSPNET--SSNPLPTDEQQRT-NGHAVA-PAV 483

Query: 502 AKTAPSKSIAASAQQLDS 519
           A  AP K  AAS++ + S
Sbjct: 484 AGRAPHKPSAASSRMISS 501


>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 520

 Score =  535 bits (1377), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 270/498 (54%), Positives = 345/498 (69%), Gaps = 18/498 (3%)

Query: 22  VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQS 81
           ++FS++LVHRF+DE K               WP + S+ Y  +LL+ D  R+K +V    
Sbjct: 21  ITFSARLVHRFADEMKPV-------RPPTGYWPDRWSMGYYRMLLTGDILRRKIKV---- 69

Query: 82  NNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCI 141
               +R QLLFPS GS+T   GN F WLHYTWIDIGTP+ SFLVALDAGS+LLW+PC C+
Sbjct: 70  --GGARYQLLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCV 127

Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 201
           QCAPLS+SYY++LDR+L+EY PS S SSK++SCSH LC   S+CKS +  CPY+  Y +E
Sbjct: 128 QCAPLSSSYYSNLDRDLNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSE 187

Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
           +TSSSG LV+DILHL S    +  SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+ 
Sbjct: 188 NTSSSGLLVEDILHLQSGGSLS-NSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGES 246

Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
           SVPS LAK+GLI +SFS+CF+E+DSG +FFGDQGP  QQSTSFLP+   Y  Y +GVES 
Sbjct: 247 SVPSFLAKSGLIHDSFSLCFNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESC 306

Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
           C+GNSCL  + F+  VDSG SFTFLP  +Y  +  +FD+ V+  R S +G+ W+YCY  S
Sbjct: 307 CVGNSCLKMTSFKVQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSPWEYCYVPS 366

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
           S+E+ KVP + L F +N SFVV + +F F  NEG   FCL +  T+GD G IGQNFM G+
Sbjct: 367 SQELPKVPSLTLTFQQNNSFVVYDPVFVFYGNEGVIGFCLAIQPTEGDMGTIGQNFMTGY 426

Query: 442 RIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPST 501
           R+VFDR N KLAWS S C+++     + L P     S NPLPT EQQ T NG A A P+ 
Sbjct: 427 RLVFDRGNKKLAWSRSNCQDLSLGKRMPLSPNET--SSNPLPTDEQQRT-NGHAVA-PAV 482

Query: 502 AKTAPSKSIAASAQQLDS 519
           A  AP K  AA ++ + S
Sbjct: 483 AGRAPHKPSAAPSRMISS 500


>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
          Length = 632

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 273/517 (52%), Positives = 360/517 (69%), Gaps = 27/517 (5%)

Query: 1   MVNLVAICMLFGCILL-DGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSV 59
           M +  A  +LF   L+ + S A  FSS+L+HRFSDE +        ++    S+P+K S 
Sbjct: 1   MASRSAFILLFILSLVSEKSLASLFSSRLIHRFSDEGR-------ASIKSPGSFPEKRSF 53

Query: 60  EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
           EY  LL S D +RQK        N  ++ Q L PSEGS+T   GN F WLHYTWIDIGTP
Sbjct: 54  EYYRLLTSIDSRRQKM-------NLGAKFQSLVPSEGSKTISPGNYFGWLHYTWIDIGTP 106

Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPL 178
           +VSFLVALD+GS+LLW+PC C+QCAPLS++YY+SL  ++L+E+DPS+S++SK   CSH L
Sbjct: 107 SVSFLVALDSGSDLLWIPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKL 166

Query: 179 CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 238
           C+S  +C+S K+ CPY   Y++E+TSSSG LV+D+LHLA +S +A  SSV++ V++GCG 
Sbjct: 167 CESAPACESPKEQCPYTVTYASENTSSSGLLVEDVLHLA-YSANA-SSSVKARVVVGCGE 224

Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPAT 298
           KQ+G +L G APDGVMGLG G++SVPS LAKAGL++NSFS+CFDE DSG ++FGD GP+T
Sbjct: 225 KQSGEFLKGIAPDGVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPST 284

Query: 299 QQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
           QQST FLP   ++ AYFVGVE  C+GNSCL QS F  L+DSG SFTFLP EIY EV ++ 
Sbjct: 285 QQSTRFLPYKNEFVAYFVGVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEI 344

Query: 359 DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV 418
           D  +++    ++G  W+YCY  S E   KVP ++L FS N +FV+   +F    +EG   
Sbjct: 345 DSHINATVKKIEGGPWEYCYETSFEP--KVPAIKLKFSSNNTFVIHKPLFVLQRSEGLVQ 402

Query: 419 FCLTV-MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK-SHVHLVPPPAG 476
           FCL +  S +G  G+IGQN+M G+RIVFDREN+KL WS SKC+E  DK +      P + 
Sbjct: 403 FCLPISASEEGTGGVIGQNYMAGYRIVFDRENMKLGWSASKCQE--DKIAPPQEASPGST 460

Query: 477 QSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 513
            SPNPLPT EQQS ++   A  P+ A   PSK+ +AS
Sbjct: 461 SSPNPLPTEEQQSRTH---AVSPAIAGKTPSKTSSAS 494


>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
 gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
 gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
 gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
          Length = 528

 Score =  504 bits (1297), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 261/502 (51%), Positives = 347/502 (69%), Gaps = 26/502 (5%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
            +  C+LF  +  + + A  FSS+L+HRFSDE +    + S     +DS P K S+EY  
Sbjct: 7   FLLFCVLF--LATEETLASLFSSRLIHRFSDEGRASIKTPSS----SDSLPNKQSLEYYR 60

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LL  +D++RQ+        N  ++ Q L PSEGS+T   GN F WLHYTWIDIGTP+VSF
Sbjct: 61  LLAESDFRRQRM-------NLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSF 113

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
           LVALD GSNLLW+PC C+QCAPL+++YY+SL  ++L+EY+PSSSS+SK   CSH LC S 
Sbjct: 114 LVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSA 173

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRK 239
           S C+S K+ CPY  +Y + +TSSSG LV+DILHL   + +      SSV++ V+IGCG+K
Sbjct: 174 SDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKK 233

Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
           Q+G YLDG APDG+MGLG  ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ Q
Sbjct: 234 QSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293

Query: 300 QSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
           QST FL +   KY  Y VGVE+ CIGNSCL Q+ F   +DSG SFT+LP EIY +V ++ 
Sbjct: 294 QSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEI 353

Query: 359 DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV 418
           D+ +++   + +G SW+YCY +S+E   KVP ++L FS N +FV+   +F F +++G   
Sbjct: 354 DRHINATSKNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQ 411

Query: 419 FCLTVMSTDGDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAG 476
           FCL + S  G  GI  IGQN+M G+R+VFDREN+KL WS SKC+E  DK       P + 
Sbjct: 412 FCLPI-SPSGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKIEPPQASPGST 468

Query: 477 QSPNPLPTTEQQSTSNGQAAAP 498
            SPNPLPT EQQS   G A +P
Sbjct: 469 SSPNPLPTDEQQS-RGGHAVSP 489


>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 529

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 263/515 (51%), Positives = 340/515 (66%), Gaps = 16/515 (3%)

Query: 22  VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQS 81
            +FS KL HRFS+E K         V   D WP + ++ Y E LL ND+ R K       
Sbjct: 25  TTFSVKLFHRFSEEMKPV------QVQTGD-WPDRRTLHYHEKLLRNDFLRHKI------ 71

Query: 82  NNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCI 141
           N   +R++LLFPS+GS+T  FGN F WLHYTWIDIGTP+ SFLVALDAGS+LLWVPC CI
Sbjct: 72  NLGGARHKLLFPSQGSKTMSFGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWVPCDCI 131

Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP-CPYIADYST 200
            CAPLSAS+Y++LDR+L+EY PS S SSK++SCSH LC   S+CK+ K   CPY  +Y +
Sbjct: 132 HCAPLSASFYSNLDRDLNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLS 191

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
           ++TSSSG LV+DI HL S       SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+
Sbjct: 192 DNTSSSGLLVEDIFHLQSGDGSTSNSSVQAPVVVGCGMKQSGGYLDGTAPDGLIGLGPGE 251

Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
            SVPS LAK+GLI++SFS+CF+E+DSG +FFGDQG   QQST FL +   +  Y VGVE+
Sbjct: 252 SSVPSFLAKSGLIRDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVDGMFSTYIVGVET 311

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
            CIGNSC   + F A  DSG SFTFLP   Y  +  +FDK V++ R + QG+ W+YCY  
Sbjct: 312 CCIGNSCPKVTSFNAQFDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGSPWEYCYVP 371

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
           SS+++ K+P + L+F +N SFVV N +F     +G   FCL +  T+G  G IGQNFM G
Sbjct: 372 SSQQLPKIPTLTLMFQQNNSFVVYNPVFVSYNEQGVDGFCLAIQPTEGGMGTIGQNFMTG 431

Query: 441 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQST-SNGQAAAPP 499
           +R+VFDREN KLAWSHS C+++     + L  PP G S + LP  EQQ T  +  A A  
Sbjct: 432 YRLVFDRENKKLAWSHSNCQDLSLGKRMPL-SPPNGTSSSQLPADEQQRTKGHAVAPAVA 490

Query: 500 STAKTAPSKSIAASAQQLDSVLRVACSLLVLMCLL 534
             A   PS + + ++  +       C  L+L  LL
Sbjct: 491 VRAPQKPSVASSQTSYMISYWRHWHCHWLLLFHLL 525


>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
          Length = 506

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 256/501 (51%), Positives = 345/501 (68%), Gaps = 27/501 (5%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
            +  C+LF  +  + + A  FSS+++HRFSDE +    + S     ++S P+K S+EY  
Sbjct: 7   FILFCVLF--LATEETLASVFSSRMIHRFSDEGRASIRTPSS----SESLPEKQSLEYYR 60

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LL  +D++RQ+        N  ++ Q L PSEGS+T   GN F WLHYTWIDIGTP+VSF
Sbjct: 61  LLAKSDFRRQRM-------NLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSF 113

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
           LVALD GS+LLW+PC C+QCAPL+++YY+SL  ++L+EY+PSSSS+SK   CSH LC S 
Sbjct: 114 LVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSA 173

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRK 239
           S C+S K+ CPY  +Y + +TSSSG LV+DILHL   + +      SSV++ V+IGCG+K
Sbjct: 174 SDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKK 233

Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
           Q+G YLDG APDG+MGLG  ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ Q
Sbjct: 234 QSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293

Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFD 359
           QST FL + E    Y VGVE+ CIGNSCL Q+ F   +DSG SFT+LP EIY +V ++ D
Sbjct: 294 QSTPFLQL-ENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEID 352

Query: 360 KLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF 419
           + +++   S +G SW+YCY +S E   KVP ++L FS N +FV+   +F F +++G   F
Sbjct: 353 RHINATSKSFEGVSWEYCYESSVEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQF 410

Query: 420 CLTVMSTDGDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ 477
           CL + S  G  GI  IGQN+M G+R+VFDREN+KL WS SKC+E  +K       P +  
Sbjct: 411 CLPI-SPSGQEGIGSIGQNYMRGYRMVFDRENMKLRWSASKCQE--EKIEPPQASPGSTS 467

Query: 478 SPNPLPTTEQQSTSNGQAAAP 498
           SP PLPT EQQ  S G A +P
Sbjct: 468 SPYPLPTEEQQ--SRGHAVSP 486


>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 530

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 262/544 (48%), Positives = 363/544 (66%), Gaps = 28/544 (5%)

Query: 2   VNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEY 61
           V ++   +L    +L+   AV+FSS+++HRFSDEAK    +  G      SWPK+ S EY
Sbjct: 3   VGVLLWLLLAKGFVLETVIAVTFSSRIIHRFSDEAKVHLRNNGG--ENVQSWPKRGSSEY 60

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
             LLL++D  RQK ++        S++Q  +PSEGS+T  FGN F WLHYTWIDIGTPNV
Sbjct: 61  FRLLLNSDLTRQKMKL-------GSQDQSFYPSEGSKTLSFGNDFVWLHYTWIDIGTPNV 113

Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS 181
           SFLVALD GS++ WVPC CI+CAPLSA++Y +LDR+L++Y PS SSSS+++ C H LC  
Sbjct: 114 SFLVALDTGSDMFWVPCDCIECAPLSAAFYNALDRDLNQYSPSLSSSSRHLPCGHQLCNQ 173

Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
            S+CK  KD CPYI +Y++++TSSSG+L++D LHLA  S +A ++S+Q+SVI+GCGRKQ+
Sbjct: 174 NSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLA--SNNATKNSIQASVILGCGRKQS 231

Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ-Q 300
           G +L+GAAP+G++GLG G +SVP+LLAKAGLI+NS SIC +E  SG + FGDQG ATQ +
Sbjct: 232 GYFLEGAAPNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSGRILFGDQGHATQRR 291

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
           ST FL    +   YFVGVE +C+G+ C  ++ F+A +D+G SFT+LP  +Y  VV +F+K
Sbjct: 292 STPFLLDDGELLNYFVGVERFCVGSFCYKETEFKAFIDTGTSFTYLPKGVYETVVAEFEK 351

Query: 361 LVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF 419
            V + RI+ Q  S +  CYNASS E    P M+  FSKNQSF+++N   S  + +  T  
Sbjct: 352 QVHATRITSQIQSDFNCCYNASSRESNNFPPMKFTFSKNQSFIIQNPFISMDQED--TTI 409

Query: 420 CLTVMSTDGD-------YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP 472
           CL V+ +D +       Y I  QNF+MG+ +VFDRENL+  W  S C++ + +S  +   
Sbjct: 410 CLAVVQSDDELITIGRKYTIACQNFLMGYDMVFDRENLRFGWFRSNCQDSMGES-ANFTS 468

Query: 473 PPAGQSPNPLPTTEQQSTSNGQAAAPPSTA-KTAPSKSIAASAQQLDSVLRVACSLLVLM 531
           P  G SP+ +P+ +QQ   N   + PP+ A KT+P  S A        +L      L L+
Sbjct: 469 PSIGGSPDSIPSNQQQRVPNNTRSVPPAIAGKTSPKPSAAKPGLNSWHLLNS----LSLI 524

Query: 532 CLLL 535
           CLLL
Sbjct: 525 CLLL 528


>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 529

 Score =  483 bits (1242), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 253/500 (50%), Positives = 342/500 (68%), Gaps = 25/500 (5%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
            +  C+LF  +  +G+ A  FSS+L+HRFSDE +    + S     ++S P+K S+ Y  
Sbjct: 7   FILFCVLF--LATEGTLASVFSSRLIHRFSDEGRASIKTPSS----SESLPEKQSLAYYR 60

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LL  +D++RQ+        N  ++ Q L PSEGS+T   GN F WLHYTWIDIGTP+VSF
Sbjct: 61  LLAKSDFRRQRM-------NLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSF 113

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
           LVALD GS+LLW+PC C+QCAPL+++YY+SL  ++L+EY+PSSSSSSK   CSH LC S 
Sbjct: 114 LVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVFLCSHKLCGSA 173

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRK 239
           S C S K+ C Y   Y + +TSSSG LV+DILHL   + +      SSV++ V++GCG+K
Sbjct: 174 SDCDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVVGCGKK 233

Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
           Q+G YLDG APDG+MGLG  ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ Q
Sbjct: 234 QSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293

Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFD 359
           QS  FL + E    Y VGVE+ CIGNSCL Q+ F   +DSG SFT+LP EIY +V ++ D
Sbjct: 294 QSAPFLQL-ENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEID 352

Query: 360 KLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF 419
           + +++   S +G SW+YCY +S E   KVP ++L FS N +FV+   +F F +++G   F
Sbjct: 353 RHINATSKSFEGVSWEYCYESSVEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQF 410

Query: 420 CLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQS 478
           CL +  ++ +  G IGQN+M G+R+VFDREN+KL WS SKC+E  DK+      P +  S
Sbjct: 411 CLPISPSEQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKTEPPQASPGSTSS 468

Query: 479 PNPLPTTEQQSTSNGQAAAP 498
           P PLPT EQQ  S G A +P
Sbjct: 469 PYPLPTEEQQ--SRGHAVSP 486


>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
 gi|194693730|gb|ACF80949.1| unknown [Zea mays]
 gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
 gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
          Length = 519

 Score =  479 bits (1234), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 242/496 (48%), Positives = 326/496 (65%), Gaps = 22/496 (4%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           +FSS++VHR SDEA+     + G       WP++ S  Y   LL +D +RQK R+     
Sbjct: 26  TFSSRMVHRLSDEARLEAGPRMG------LWPQRGSGGYYRALLRSDLQRQKRRL----- 74

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
             + +NQLL  S+G  T   GN   WL+Y W+D+GTP  SFLVALD GS+L WVPC CIQ
Sbjct: 75  --AGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
           CAPLS SY  +LDR+L  Y P+ S++S+++ CSH LC+  S C + K PC Y  DY +E+
Sbjct: 133 CAPLS-SYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSEN 191

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           T+SSG L++D LHL S   HAP   V +SVIIGCGRKQ+G YLDG APDG++GLG+ D+S
Sbjct: 192 TTSSGLLIEDSLHLNSREGHAP---VNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADIS 248

Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC 322
           VPS LA+AGL++NSFS+CF E+ SG +FFGDQG ++QQST F+P+  K   Y V V+  C
Sbjct: 249 VPSFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSC 308

Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
           IG+ CL  S FQALVDSG SFT LP ++Y     +FDK +++ R+  + ++WKYCY+AS 
Sbjct: 309 IGHKCLEGSSFQALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKYCYSASP 368

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGH 441
            EM  VP + L F+ N+SF   N I  F + +G    FCL V+ +    GIIGQNF++G+
Sbjct: 369 LEMPDVPTIILAFAANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPIGIIGQNFLVGY 428

Query: 442 RIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPST 501
            +VFDRE++KL W  S+C +V + + V L P   G S +PLP+ EQQ++        P+T
Sbjct: 429 HVVFDRESMKLGWYRSECRDVDNSTTVPLGPSQHGSSEDPLPSNEQQTS----PPVTPAT 484

Query: 502 AKTAPSKSIAASAQQL 517
             TAP  S   + Q L
Sbjct: 485 TGTAPPSSATTNRQML 500


>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
 gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
          Length = 520

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 240/494 (48%), Positives = 326/494 (65%), Gaps = 21/494 (4%)

Query: 25  SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
           S+++VHR SDEA+      +        WP++ S +Y   L+ +D +RQK RV       
Sbjct: 29  SARMVHRLSDEAR-----LAAGARGGRRWPRRGSGDYFRALVRSDLQRQKRRV------- 76

Query: 85  SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
             + QLL  S+G      GN   WL+YTW+D+GTPN SFLVALD GS+L WVPC CIQCA
Sbjct: 77  GGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCA 136

Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
           PLS SY+ SLDR+L  Y PS S++S+++ CSH LC   S C + K PCPY  DY +E+T+
Sbjct: 137 PLS-SYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTT 195

Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
           SSG L++D+LHL S   HAP   V +SVIIGCG+KQ+GSYL+G APDG++GLG+ D+SVP
Sbjct: 196 SSGLLIEDMLHLDSREGHAP---VNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVP 252

Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
           S LA+AGL++NSFS+CF ++DSG +FFGDQG  TQQST F+P+  K   Y V V+ YCIG
Sbjct: 253 SFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIG 312

Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
           + C   +GFQALVD+G SFT LP + Y  + ++FDK +++ R S    S++YCY+    E
Sbjct: 313 HKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLE 372

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           M  VP + L F++N+SF   N I  F + +G F VFCL V+ +    GIIGQNFM+G+ +
Sbjct: 373 MPDVPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVGIIGQNFMVGYHV 432

Query: 444 VFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAK 503
           VFDREN+KL W  S+C ++ + + V L P       +PLP+ EQQ++     A  P+ A 
Sbjct: 433 VFDRENMKLGWYRSECHDLDNSTTVSLGPSQHNSPEDPLPSNEQQTS----PAVTPAVAG 488

Query: 504 TAPSKSIAASAQQL 517
            APS   + + Q L
Sbjct: 489 RAPSSGGSTTLQNL 502


>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
          Length = 520

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 240/494 (48%), Positives = 326/494 (65%), Gaps = 21/494 (4%)

Query: 25  SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
           S+++VHR SDEA+      +        WP++ S +Y   L+ +D +RQK RV       
Sbjct: 29  SARMVHRLSDEAR-----LAAGARGGRRWPRRGSGDYFRALVRSDLQRQKRRV------- 76

Query: 85  SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
             + QLL  S+G      GN   WL+YTW+D+GTPN SFLVALD GS+L WVPC CIQCA
Sbjct: 77  GGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCA 136

Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
           PLS SY+ SLDR+L  Y PS S++S+++ CSH LC   S C + K PCPY  DY +E+T+
Sbjct: 137 PLS-SYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTT 195

Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
           SSG L++D+LHL S   HAP   V +SVIIGCG+KQ+GSYL+G APDG++GLG+ D+SVP
Sbjct: 196 SSGLLIEDMLHLDSREGHAP---VNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVP 252

Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
           S LA+AGL++NSFS+CF ++DSG +FFGDQG  TQQST F+P+  K   Y V V+ YCIG
Sbjct: 253 SFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIG 312

Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
           + C   +GFQALVD+G SFT LP + Y  + ++FDK +++ R S    S++YCY+    E
Sbjct: 313 HKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLE 372

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           M  VP + L F++N+SF   N I  F + +G F VFCL V+ +    GIIGQNFM+G+ +
Sbjct: 373 MPDVPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVGIIGQNFMVGYHV 432

Query: 444 VFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAK 503
           VFDREN+KL W  S+C ++ + + V L P       +PLP+ EQQ++     A  P+ A 
Sbjct: 433 VFDRENMKLGWYRSECHDLDNSTMVSLGPSQHNSPEDPLPSNEQQTS----PAVTPAVAG 488

Query: 504 TAPSKSIAASAQQL 517
            APS   + + Q L
Sbjct: 489 RAPSSGGSTTLQNL 502


>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like, partial [Cucumis sativus]
          Length = 408

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 227/389 (58%), Positives = 297/389 (76%), Gaps = 8/389 (2%)

Query: 21  AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
           +++F+S+++HRFS+E K    S S N SV  SWP+K S+EY + L+S D++RQK ++   
Sbjct: 21  SITFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGDFRRQKMKL--- 77

Query: 81  SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
                SR QLLFPSEGS T   GN F WLHYTWIDIGTP+VSFLVALDAGS+LLWVPC C
Sbjct: 78  ----GSRFQLLFPSEGSXTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNC 133

Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
           IQCAPLSASYY SLD++L+EY PSSSS+SK++SCSH LC S  SC+S K  CPY+ DY T
Sbjct: 134 IQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYIT 193

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
           E+TSSSG L+ D+LHL+S  +++   ++Q+ VI+GCG KQ+G YL G APDG+ GLGLG+
Sbjct: 194 ENTSSSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGE 253

Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           +SV S LAK  L+QNSFS+CF+E+ SG +FFGD+GPA+QQ+TSF+P+  KY+ Y VGVE+
Sbjct: 254 ISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEA 313

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYN 379
            CI NSCL Q+ F+AL+DSG SFT+LP E Y  +V++FDK L ++  +S +G  WKYCY 
Sbjct: 314 CCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYK 373

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
            S++ M KVP + L+F  N SFVV + +F
Sbjct: 374 ISADAMPKVPSVTLLFPLNNSFVVHDPVF 402


>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 239/511 (46%), Positives = 322/511 (63%), Gaps = 27/511 (5%)

Query: 21  AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
           + + S+++VHR SDEA+               WP+  S  Y   L+ +D +RQK      
Sbjct: 71  SATLSTRMVHRLSDEARLAAGPHGAR------WPRHGSGGYYRALVRSDLQRQK------ 118

Query: 81  SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
                 ++QLL  SE       GN F WL+YTW+D+GTPN SF+VALD GS+L WVPC C
Sbjct: 119 -----RKHQLLSVSEAGGIFSPGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWVPCDC 173

Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
           I+CAPL A Y  +LDR+L  Y P+ S++S+++ CSH LC   S C S K PCPY  DY  
Sbjct: 174 IECAPL-AGYRETLDRDLGIYKPAESTTSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQ 232

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
           E+T+SSG L++DILHL S   HAP   V++SV+IGCGRKQ+GSYLDG APDG++GLG+ D
Sbjct: 233 ENTTSSGLLIEDILHLDSRESHAP---VKASVVIGCGRKQSGSYLDGIAPDGLLGLGMAD 289

Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           +SVPS LA+AGL++NSFS+CF E DSG +FFGDQG + QQST F+P+  KY  Y V V+ 
Sbjct: 290 ISVPSFLARAGLVRNSFSMCFKE-DSGRIFFGDQGVSIQQSTPFVPLYGKYQTYAVNVDK 348

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
            C+G+ C   + F+ALVDSG SFT LP  +Y  V V+FDK V + RI+ +  S++YCY+A
Sbjct: 349 SCVGHKCFEATSFEALVDSGTSFTALPLNVYKAVAVEFDKQVHAPRITQEDASFEYCYSA 408

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMM 439
           S  +M  VP + L F+ N+SF   N      + EG    FCL +  +    GIIGQNF+ 
Sbjct: 409 SPLKMPDVPTVTLTFAANKSFQAVNPTIVLKDGEGSVAGFCLALQKSPEPIGIIGQNFLT 468

Query: 440 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 499
           G+ IVFD+EN+KL W  S+C +  + + V L P        PLP++EQQ++       PP
Sbjct: 469 GYHIVFDKENMKLGWYRSECHDPDNSTTVPLGPSQHNSPGVPLPSSEQQTSPT---VTPP 525

Query: 500 STAKTAPSKSIAASAQQLDSVLRVACSLLVL 530
           + A  AP+ S +     L  +L   CSLL+L
Sbjct: 526 AVAGKAPTSS-SGPPSNLHRLLANCCSLLLL 555


>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
 gi|194704920|gb|ACF86544.1| unknown [Zea mays]
 gi|223949445|gb|ACN28806.1| unknown [Zea mays]
 gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
          Length = 515

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 234/507 (46%), Positives = 322/507 (63%), Gaps = 29/507 (5%)

Query: 25  SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
           SS++VHR SDEA+     + G       WP++ S EY   L+ +D +RQK R+ + S   
Sbjct: 28  SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79

Query: 85  SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
                    S+G  T   GN   WL+Y W+D+GTP  SFLVALD GS+L WVPC CIQCA
Sbjct: 80  ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130

Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
           PLS  Y  +LDR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+
Sbjct: 131 PLSG-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTT 189

Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
           SSG L++D LHL     H P   V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVP
Sbjct: 190 SSGLLIEDTLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVP 246

Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
           S LA+AGL+QNSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG
Sbjct: 247 SFLARAGLVQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIG 306

Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
           + CL  + F+ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  E
Sbjct: 307 HKCLEGTSFKALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLE 366

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           M  VP + L F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +
Sbjct: 367 MPDVPTITLTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 426

Query: 444 VFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAK 503
           VFDRE++KL W  S+C  V D + V L P       +PLP+ EQQ++     A  P+TA 
Sbjct: 427 VFDRESMKLGWYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTS----PAVTPATAG 482

Query: 504 TAPSKSIAASAQQLDSVLRVACSLLVL 530
           TAP   ++ +   L  +L  +  LL+L
Sbjct: 483 TAP---LSCATTNLQMLLASSYPLLLL 506


>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
          Length = 485

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 232/504 (46%), Positives = 320/504 (63%), Gaps = 29/504 (5%)

Query: 28  LVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSR 87
           +VHR SDEA+     + G       WP++ S EY   L+ +D +RQK R+ + S      
Sbjct: 1   MVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL----- 49

Query: 88  NQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLS 147
                 S+G  T   GN   WL+Y W+D+GTP  SFLVALD GS+L WVPC CIQCAPLS
Sbjct: 50  ------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLS 103

Query: 148 ASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSG 207
             Y  +LDR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+SSG
Sbjct: 104 G-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSG 162

Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
            L++D LHL     H P   V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS L
Sbjct: 163 LLIEDTLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFL 219

Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
           A+AGL+QNSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG+ C
Sbjct: 220 ARAGLVQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKC 279

Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
           L  + F+ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  EM  
Sbjct: 280 LEGTSFKALVDSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPD 339

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           VP + L F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +VFD
Sbjct: 340 VPTITLTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFD 399

Query: 447 RENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAP 506
           RE++KL W  S+C +V D + V L P       +PLP+ EQQ++     A  P+TA TAP
Sbjct: 400 RESMKLGWYRSECHDVEDSTTVPLGPSQRDSPEDPLPSNEQQTS----PAVTPATAGTAP 455

Query: 507 SKSIAASAQQLDSVLRVACSLLVL 530
              ++ +   L  +L  +  LL+L
Sbjct: 456 ---LSCATTNLQMLLASSYPLLLL 476


>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
          Length = 515

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 233/507 (45%), Positives = 321/507 (63%), Gaps = 29/507 (5%)

Query: 25  SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
           SS++VHR SDEA+     + G       WP++ S EY   L+ +D +RQK R+ + S   
Sbjct: 28  SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79

Query: 85  SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
                    S+G  T   GN   WL+Y W+D+GTP  SFLVALD GS+L WVPC CIQCA
Sbjct: 80  ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130

Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
           PLS  Y  +LDR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+
Sbjct: 131 PLSG-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTT 189

Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
           SSG L++D LHL     H P   V +SVIIGCG+KQ+G YLDG APDG++ LG+ D+SVP
Sbjct: 190 SSGLLIEDTLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVP 246

Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
           S LA+AGL+QNSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG
Sbjct: 247 SFLARAGLVQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIG 306

Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
           + CL  + F+ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  E
Sbjct: 307 HKCLEGTSFKALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLE 366

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           M  VP + L F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +
Sbjct: 367 MPDVPTITLTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 426

Query: 444 VFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAK 503
           VFDRE++KL W  S+C  V D + V L P       +PLP+ EQQ++     A  P+TA 
Sbjct: 427 VFDRESMKLGWYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTS----PAVTPATAG 482

Query: 504 TAPSKSIAASAQQLDSVLRVACSLLVL 530
           TAP   ++ +   L  +L  +  LL+L
Sbjct: 483 TAP---LSCATTNLQMLLASSYPLLLL 506


>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 627

 Score =  450 bits (1158), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 229/486 (47%), Positives = 317/486 (65%), Gaps = 22/486 (4%)

Query: 25  SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
           S+++V+R SDEA+    ++         WP++ S +Y   L+ +D +RQK R+       
Sbjct: 135 STRMVYRLSDEARMAAGTRGAR------WPRRGSGDYYRSLVRSDLQRQKRRL------G 182

Query: 85  SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
             ++QLL  S+       GN F WL+YTW+D+GTPN SF+VALD GS+L W+PC CI+CA
Sbjct: 183 GGKHQLLSFSKDGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWIPCDCIECA 242

Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
           PLS  Y+ SLDR+L  Y P+ S++S+++ CSH LC   S C + K PCPY   Y  E+T+
Sbjct: 243 PLSG-YHGSLDRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTT 301

Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
           SSG LV+DILHL S   HAP   V++SVIIGCGRKQ+GSYLDG APDG++GLG+ D+SVP
Sbjct: 302 SSGLLVEDILHLDSRESHAP---VKASVIIGCGRKQSGSYLDGIAPDGLLGLGMADISVP 358

Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
           S LA+AGL++NSFS+CF + DSG +FFGDQG +TQQST F+P+  K   Y V V+  C+G
Sbjct: 359 SFLARAGLVRNSFSMCFTK-DSGRIFFGDQGVSTQQSTPFVPLYGKLQTYTVNVDKSCVG 417

Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
           + C   + FQA+VDSG SFT LP +IY  V ++FDK V++ R+  +  S+ YCY+AS   
Sbjct: 418 HKCFESTSFQAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATSFDYCYSASPLV 477

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRI 443
           M  VP + L F+ N+SF   N  F   + EG    FCL V+ +    GII QNF++G+ +
Sbjct: 478 MPDVPTVTLTFAGNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPIGIIAQNFLLGYHV 537

Query: 444 VFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAK 503
           VFDREN+KL W  S+C ++ + + V L P       +PLP+ EQQ++     A  P+ A 
Sbjct: 538 VFDRENMKLGWYRSECHDLDNSTTVPLGPSQHNSPEDPLPSNEQQTS----PAVTPAVAG 593

Query: 504 TAPSKS 509
            A + S
Sbjct: 594 RARASS 599


>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
          Length = 469

 Score =  427 bits (1099), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 211/437 (48%), Positives = 288/437 (65%), Gaps = 22/437 (5%)

Query: 25  SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
           SS++VHR SDEA+     + G       WP++ S EY   L+ +D +RQK R+ + S   
Sbjct: 28  SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79

Query: 85  SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
                    S+G  T   GN   WL+Y W+D+GTP  SFLVALD GS+L WVPC CIQCA
Sbjct: 80  ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130

Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
           PLS  Y  +LDR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+
Sbjct: 131 PLSG-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTT 189

Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
           SSG L++D LHL     H P   V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVP
Sbjct: 190 SSGLLIEDTLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVP 246

Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
           S LA+AGL+QNSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG
Sbjct: 247 SFLARAGLVQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIG 306

Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
           + CL  + F+ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  E
Sbjct: 307 HKCLEGTSFKALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLE 366

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           M  VP + L F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +
Sbjct: 367 MPDVPTITLTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 426

Query: 444 VFDRENLKLAWSHSKCE 460
           VFDRE++KL W  S+C+
Sbjct: 427 VFDRESMKLGWYRSECK 443


>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
          Length = 378

 Score =  349 bits (895), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 175/377 (46%), Positives = 246/377 (65%), Gaps = 11/377 (2%)

Query: 155 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
           DR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+SSG L++D L
Sbjct: 3   DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 62

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
           HL     H P   V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL+Q
Sbjct: 63  HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 119

Query: 275 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 334
           NSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG+ CL  + F+
Sbjct: 120 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 179

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
           ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  EM  VP + L 
Sbjct: 180 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 239

Query: 395 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
           F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +VFDRE++KL 
Sbjct: 240 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 299

Query: 454 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 513
           W  S+C  V D + V L P       +PLP+ EQQ++     A  P+TA TAP   ++ +
Sbjct: 300 WYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTS----PAVTPATAGTAP---LSCA 352

Query: 514 AQQLDSVLRVACSLLVL 530
              L  +L  +  LL+L
Sbjct: 353 TTNLQMLLASSYPLLLL 369


>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 525

 Score =  309 bits (791), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 182/497 (36%), Positives = 271/497 (54%), Gaps = 16/497 (3%)

Query: 7   ICMLFGCIL--LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLEL 64
           + M+  C+L  L  + A +    L H+FS +A E   S++G +  A  WP + ++E+  +
Sbjct: 11  LVMVHCCVLWMLATTFANALRMDLFHKFSKQAIEAMRSRNG-MDYAQDWPTEGTIEFQTM 69

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           L  +D  R  TR   +    SS +Q +     +    FG     LHY++IDIGTPNV FL
Sbjct: 70  LRDHDVARH-TRTARRILAASSMDQYVLIQGNATEQLFGGG---LHYSYIDIGTPNVQFL 125

Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
           V LD GS+LLW+PC+C  CAPLSA         L+ Y PS SS++K V CS PLC+  S+
Sbjct: 126 VVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVLCSDPLCEMSST 185

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
           C +  D CPY  +Y + +TS+SG L +D ++   F + +  + V+  V +GCG+ QTGS 
Sbjct: 186 CMAPTDQCPYEINYVSANTSTSGALYEDYMY---FMRESGGNPVKLPVYLGCGKVQTGSL 242

Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSF 304
           L GAAP+G+MGLG  D+SVP+ LA  G + +SFS+C     SG++ FGD+GPA Q++T  
Sbjct: 243 LKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTFGDEGPAAQRTTPI 302

Query: 305 LPIG-EKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
           +P      D Y V ++S  +GN+ L  +   AL D+G SFT+L   +Y + V  +D  +S
Sbjct: 303 IPKSVSMLDTYIVEIDSITVGNTNLLMAS-HALFDTGTSFTYLSKTVYPQFVQAYDAQMS 361

Query: 364 -SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCL 421
             K    + + W  CY  S+    +VP + L  S   S  V + + S   +N      C+
Sbjct: 362 LPKWNDPRFSKWDLCYQTSNTN-FQVPVVSLALSGGNSLDVVSGLKSIVDDNNAMIAVCV 420

Query: 422 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPN- 480
           TVM +     IIGQNFM  + I ++R  + + W+ S C   +  S+      PA   P  
Sbjct: 421 TVMDSGAGLSIIGQNFMTNYSITYNRAKMTIGWTPSDCSTDLTLSNSTPGSVPAALPPTA 480

Query: 481 PLPTTEQQSTSNGQAAA 497
           PLP   + ++ N    A
Sbjct: 481 PLPAVPRPASPNSTVTA 497


>gi|110741881|dbj|BAE98882.1| predicted GPI-anchored protein [Arabidopsis thaliana]
          Length = 313

 Score =  298 bits (764), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 148/276 (53%), Positives = 196/276 (71%), Gaps = 9/276 (3%)

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
           SSV++ V+IGCG+KQ+G YLDG APDG+MGLG  ++SVPS L+KAGL++NSFS+CFDE D
Sbjct: 5   SSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEED 64

Query: 286 SGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFT 344
           SG ++FGD GP+ QQST FL +   KY  Y VGVE+ CIGNSCL Q+ F   +DSG SFT
Sbjct: 65  SGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFT 124

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
           +LP EIY +V ++ D+ +++   + +G SW+YCY +S+E   KVP ++L FS N +FV+ 
Sbjct: 125 YLPEEIYRKVALEIDRHINATSKNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIH 182

Query: 405 NHIFSFPENEGFTVFCLTVMSTDGDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
             +F F +++G   FCL + S  G  GI  IGQN+M G+R+VFDREN+KL WS SKC+E 
Sbjct: 183 KPLFVFQQSQGLVQFCLPI-SPSGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE- 240

Query: 463 IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAP 498
            DK       P +  SPNPLPT EQQS   G A +P
Sbjct: 241 -DKIEPPQASPGSTSSPNPLPTDEQQSR-GGHAVSP 274


>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
 gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
          Length = 518

 Score =  293 bits (749), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 182/478 (38%), Positives = 261/478 (54%), Gaps = 24/478 (5%)

Query: 5   VAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLEL 64
           V I +L   +      A  FS ++ HRFS+  K +W   +GN   A +WP K S EY   
Sbjct: 7   VFIVILLSILGFRSCHARIFSFQMHHRFSEPVK-KWSEGAGNGFPAGNWPAKGSFEYYAE 65

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           L   D   +  R+       S  + LL  S+G+ T F  +   +LHYT + +GTP   FL
Sbjct: 66  LAHRDRALRGRRL-------SDIDGLLTFSDGNST-FRISSLGFLHYTTVSLGTPGKKFL 117

Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
           VALD GS+L WVPC C +CAP   + Y S D  LS Y+P  SS+S+ V+C + LC  R+ 
Sbjct: 118 VALDTGSDLFWVPCDCSRCAPTEGTTYAS-DFELSIYNPKGSSTSRKVTCDNSLCAHRNR 176

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
           C      CPY+  Y + +TS+SG LV+D+LHL +  +   Q  V++ V  GCG+ QTGS+
Sbjct: 177 CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSF 234

Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSF 304
           LD AAP+G+ GLGL  +SVPS+L+K G   +SFS+CF  +  G + FGD+G   Q+ T F
Sbjct: 235 LDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPF 294

Query: 305 LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVS 363
             +   +  Y + V    +G + L    F AL DSG SFT+L   IY  V+  F  +   
Sbjct: 295 -NLNALHPTYNITVTQVRVGTT-LIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQD 352

Query: 364 SKRISLQGNSWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT 422
           S+R       +++CY+ S  E    +P M L       F V + I     ++   ++C+ 
Sbjct: 353 SRRPPDSRIPFEFCYDMSPGENTSLIPSMSLTMKGGSQFPVYDPIIII-SSQSELIYCMA 411

Query: 423 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP-----PPA 475
           V+ +  +  IIGQNFM G+RI+FDRE L L W   +C++ I+ S V + P     PPA
Sbjct: 412 VVRS-AELNIIGQNFMTGYRIIFDREKLVLGWKEFECDD-IENSSVPIRPRATSVPPA 467


>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
 gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 543

 Score =  292 bits (747), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 189/528 (35%), Positives = 272/528 (51%), Gaps = 33/528 (6%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
           L+A+ ++    L+  +DA SF   L HRFS   + RW    G    AD WP + + EY  
Sbjct: 14  LLAMAVVVVASLIAAADASSFGFDLHHRFSPVVR-RWAEARGGPLAADQWPARGTPEYYS 72

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
            L  +D  R+            + + LL  + G+ T+  G     L+Y  +++GTPN +F
Sbjct: 73  ALSRHDRARRAL-------AGGADDGLLTFAAGNDTYQSGT----LYYAEVELGTPNATF 121

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDR-NLSEYDPSSSSSSKNVSCSHPLCKSR 182
           LVALD GS+L WVPC C QCA + ++  T  D  +L  Y P  SS+SK V+C +PLC  R
Sbjct: 122 LVALDTGSDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSSTSKQVACDNPLCGQR 181

Query: 183 SSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASF--SKHAPQSSVQSSVIIGCGRK 239
           + C +  +  CPY   Y + +TSSSG LV D+LHL        A   ++Q+ V+ GCG+ 
Sbjct: 182 NGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQV 241

Query: 240 QTGSYLD--GAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGP 296
           QTG++LD  G A DG+MGLG+G VSVPS LA +GL+  +SFS+CF ++  G V FGD G 
Sbjct: 242 QTGAFLDGGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGS 301

Query: 297 ATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVV 356
             Q  T F  +      Y V   S  +G+  +    F A++DSG SFT+L    Y ++  
Sbjct: 302 RGQAETPFT-VRSLNPTYNVSFTSIGVGSESVAAE-FAAVMDSGTSFTYLSDPEYTQLAT 359

Query: 357 KFDKLVSSKRISLQGNS-----WKYCYNASSEEM-LKVPDMRLIFSKNQSFVVRNHIFSF 410
           KF+  VS +R++    S     ++YCY  S  +  + +PD+ L       F V       
Sbjct: 360 KFNSQVSERRVNFSSGSADPFPFEYCYRLSPNQTEVAMPDVSLTAKGGALFPVTQPFIPV 419

Query: 411 PENEGFTV-FCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 467
            +  G  V +CL +M  D   G  IIGQNFM G ++VFDRE   L W    C      + 
Sbjct: 420 GDTTGRAVGYCLAIMRNDMAIGIDIIGQNFMTGLKVVFDRERSVLGWEKFDCYRNARVAD 479

Query: 468 VHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA---KTAPSKSIAA 512
                P    +P   PT      ++G  +  P  A   ++A S++ AA
Sbjct: 480 APDGSPGPSSAPAAGPTKITPRQNDGSGSGYPGAAPLPRSAGSRNAAA 527


>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
 gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
 gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 524

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 169/461 (36%), Positives = 259/461 (56%), Gaps = 20/461 (4%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
           L+ I ML         +   F+ ++ HRFSDE K+ W   +G  +    +P K S EY  
Sbjct: 12  LIPILMLLS---FGSCNGRIFTFEMHHRFSDEVKQ-WSDSTGRFA---KFPPKGSFEYFN 64

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
            L+  DW  +  R  L  + + S + L F S+G+ T    +   +LHYT + +GTP + F
Sbjct: 65  ALVLRDWLIRGRR--LSESESESESSLTF-SDGNSTSRI-SSLGFLHYTTVKLGTPGMRF 120

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
           +VALD GS+L WVPC C +CAP   + Y S +  LS Y+P  S+++K V+C++ LC  R+
Sbjct: 121 MVALDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRN 179

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
            C      CPY+  Y +  TS+SG L++D++HL +  K+  +  V++ V  GCG+ Q+GS
Sbjct: 180 QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGS 237

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS 303
           +LD AAP+G+ GLG+  +SVPS+LA+ GL+ +SFS+CF  +  G + FGD+G + Q+ T 
Sbjct: 238 FLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETP 297

Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
           F  +   +  Y + V    +G + L    F AL D+G SFT+L   +Y  V   F     
Sbjct: 298 F-NLNPSHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQ 355

Query: 364 SKRISLQGN-SWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL 421
            KR S      ++YCY+ S++     +P + L    N  F + + I      EG  V+CL
Sbjct: 356 DKRHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVI-STEGELVYCL 414

Query: 422 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            ++ +  +  IIGQN+M G+R+VFDRE L LAW    C ++
Sbjct: 415 AIVKS-SELNIIGQNYMTGYRVVFDREKLVLAWKKFDCYDI 454


>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
 gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  285 bits (730), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 177/439 (40%), Positives = 247/439 (56%), Gaps = 26/439 (5%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           F+ K+ HRFSD  K  W   + N      WP+K S EY   L   D   Q  R +  S+ 
Sbjct: 26  FTFKMHHRFSDSFKN-WSGLTRN------WPEKGSFEYYAALAHRD---QMLRGRRLSDA 75

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
           ++S   L F S+G+ T F  +   +LHYT +++GTP V F+VALD GS+L WVPC C +C
Sbjct: 76  DAS---LAF-SDGNST-FRISSLGFLHYTTVELGTPGVKFMVALDTGSDLFWVPCDCSRC 130

Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 203
           AP   + Y S D  LS Y+P  SS+SK V+C++ +C  R+ C      CPYI  Y +  T
Sbjct: 131 APTHGASYAS-DFELSIYNPRESSTSKKVTCNNDMCAQRNRCLGTFSSCPYIVSYVSAQT 189

Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
           S+SG LV D+LHL +  +   +  V++ V  GCG+ Q+GS+LD AAP+G+ GLG+  +SV
Sbjct: 190 STSGILVKDVLHLTT--EDGGREFVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISV 247

Query: 264 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
           PS+L++ GLI +SFS+CF  +  G + FGD+G   Q+ T F  +   +  Y V V    +
Sbjct: 248 PSVLSREGLIADSFSMCFGHDGIGRISFGDKGSPDQEETPF-NVNPAHPTYNVTVTQARV 306

Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 382
           G + L    F AL DSG SFT++    Y+ V  KF  L   KR        ++YCY+ S 
Sbjct: 307 G-TMLIDVEFTALFDSGTSFTYMVDPAYSRVSEKFHSLARDKRRPPDPRIPFEYCYDMSP 365

Query: 383 EEMLK-VPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
           +     VP M L     + F V + I     +NE   V+CL V+ +  +  IIGQNFM G
Sbjct: 366 DANASLVPSMSLTMKGGRHFTVYDPIIVISTQNE--IVYCLAVVKST-ELNIIGQNFMTG 422

Query: 441 HRIVFDRENLKLAWSHSKC 459
           +R+VFDRE L L W    C
Sbjct: 423 YRVVFDREKLVLGWKKFDC 441


>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 525

 Score =  284 bits (726), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 172/467 (36%), Positives = 249/467 (53%), Gaps = 32/467 (6%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVK----- 78
           FS K+ HRFSD+ K  W   SG  ++ DSWP K ++EY   L   D   +  R+      
Sbjct: 28  FSFKMHHRFSDQLKN-WSGVSGKFTLPDSWPVKGTIEYYAQLAFRDRFFRGQRLSEFDGP 86

Query: 79  --LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
                 N+S R   L  +                YT + +GTP   F+VALD GS+L WV
Sbjct: 87  LAFSDGNSSFRISSLGFALFDVFF--------FFYTTVQLGTPGTKFMVALDTGSDLFWV 138

Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA 196
           PC C +CAP   S Y S D  LS Y P  SS+SK V C++ LC  R  C      CPY+ 
Sbjct: 139 PCDCSRCAPTEGSPYAS-DFELSVYSPKKSSTSKTVPCNNNLCAQRDQCTEAFGNCPYVV 197

Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
            Y + +TS++G L++D+LHL +  KH+    +Q+ +  GCG+ Q+GS+LD AAP+G+ GL
Sbjct: 198 SYVSAETSTTGILIEDLLHLKTEHKHS--EPIQAYITFGCGQVQSGSFLDVAAPNGLFGL 255

Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
           G+  +SVPS+L++ GL+ NSFS+CF ++  G + FGD+G   Q+ T F  + + +  Y +
Sbjct: 256 GMEQISVPSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPF-NLNQLHPNYNI 314

Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWK 375
            V S  +G + L  +   AL DSG SF++    IY+++   F       R        ++
Sbjct: 315 TVTSIRVGTT-LIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFE 373

Query: 376 YCYNASSEEMLKV-PDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGII 433
           YCYN S +    + P + L       F V + I     +NE   ++CL V+ +  +  II
Sbjct: 374 YCYNMSPDANASLTPGISLTMKGGGPFPVYDPIIVISTQNE--LIYCLAVVKS-AELNII 430

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP-----PPA 475
           GQNFM G+RIVFDRE L L W    C ++ +KS   + P     PPA
Sbjct: 431 GQNFMTGYRIVFDREKLVLGWKKFDCYDIEEKSLFPMKPDVTTVPPA 477


>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 529

 Score =  283 bits (725), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 167/449 (37%), Positives = 250/449 (55%), Gaps = 14/449 (3%)

Query: 54  PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
           P   + EY   L  +D  R+++ + L +             +G+ T+   NQF +LHY  
Sbjct: 54  PSPGTAEYYAALAGHDDLRRRS-LSLAAAPAPGAGGPFAFVDGNDTYRL-NQFGFLHYAV 111

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTPNV+FLVALD GS+L WVPC C++CAPLS+  Y +L  ++  Y P  SS+S+ V 
Sbjct: 112 VALGTPNVTFLVALDTGSDLFWVPCDCLKCAPLSSPDYGNLKFDV--YSPRKSSTSRKVP 169

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           CS  +C  ++ C +  + CPY  +Y +++TSS G LV+D+++LA+ S H+     Q+ + 
Sbjct: 170 CSSNMCDLQTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHS--KITQAPIT 227

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  G+  NSFS+CF E+  G + FGD
Sbjct: 228 FGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFGEDGHGRINFGD 287

Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAE 353
            G A Q  T  L I +    Y + +     G    +   F A+VDSG SFT L   +Y E
Sbjct: 288 TGSADQLETP-LNIYKHNPYYNISIVGAMAGGKTFSTK-FSAVVDSGTSFTALSDPMYTE 345

Query: 354 VVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
           +   FDK V  KR     +  ++YCY  SS+  +  P++ L       F V++ I +  +
Sbjct: 346 ITSAFDKQVKEKRNPADSSLPFEYCYTISSKGAVSPPNISLTAKGGSVFPVKDPIITITD 405

Query: 413 NEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLV 471
                V +CL +M ++G   +IG+NFM G ++VFDRE L L W    C  V   + + + 
Sbjct: 406 ISSSPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERLVLGWKSFNCYSVDHSTKLPVS 464

Query: 472 PPPAGQSPNPLPTTEQQSTSNGQAAAPPS 500
           P  +   P P+       +SN +AA  PS
Sbjct: 465 PNSSAIPPKPV---SGPGSSNPEAAKRPS 490


>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 522

 Score =  281 bits (719), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 162/441 (36%), Positives = 249/441 (56%), Gaps = 19/441 (4%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           F+ ++ HRFSDE K+ W   +G       +P K S EY   L+  DW  +  R+    + 
Sbjct: 29  FTFEMHHRFSDEVKQ-WSDSTGRFV---KFPPKGSFEYFNALVLRDWLIRGRRLSDSESE 84

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
           +S        S+G+ T    +   +LHYT + +GTP + F+VALD GS+L WVPC C +C
Sbjct: 85  SSLTF-----SDGNSTSRI-SSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKC 138

Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 203
           AP   + Y S +  LS Y+P  S+++K V+C++ LC  R+ C      CPY+  Y +  T
Sbjct: 139 APTEGATYAS-EFELSIYNPKISTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQT 197

Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
           S+SG L++D++HL +  K+  +  V++ V  GCG+ Q+GS+LD AAP+G+ GLG+  +SV
Sbjct: 198 STSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISV 255

Query: 264 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
           PS+LA+ GL+ +SFS+CF  +  G + FGD+G + Q+ T F  +   +  Y + V    +
Sbjct: 256 PSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRV 314

Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 382
           G + L    F AL D+G SFT+L   +Y  V   F      KR S      ++YCY+ S+
Sbjct: 315 GTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSN 373

Query: 383 EEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
           +     +P + L    N  F + + I      EG  V+CL ++ +  +  IIGQN+M G+
Sbjct: 374 DANASLIPSLSLTMKGNSHFTINDPIIVI-STEGELVYCLAIVKS-SELNIIGQNYMTGY 431

Query: 442 RIVFDRENLKLAWSHSKCEEV 462
           R+VFDRE L LAW    C ++
Sbjct: 432 RVVFDREKLVLAWKKFDCYDI 452


>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 545

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 182/508 (35%), Positives = 259/508 (50%), Gaps = 33/508 (6%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           F   L HRFS   + RW    G    AD WP + + EY   L  +D  R+          
Sbjct: 36  FGFDLHHRFSPVVR-RWAEARGGPLAADRWPARGTPEYYSALSRHDRARRAL-------A 87

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
             + + LL  + G+ T+  G     L+Y  +++GTPN +FLVALD GS+L WVPC C QC
Sbjct: 88  GGADDGLLTFAAGNDTYQSGT----LYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQC 143

Query: 144 APLSASYYTSLDR-NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTE 201
           A + ++  T  D   L  Y P  SS+S+ V+C +PLC  R+ C +  +  CPY   Y + 
Sbjct: 144 ATIPSANATGPDAPPLRPYSPRRSSTSEQVACDNPLCGRRNGCSAATNGSCPYEVQYVSA 203

Query: 202 DTSSSGYLVDDILHLASF--SKHAPQSSVQSSVIIGCGRKQTGSYLD--GAAPDGVMGLG 257
           +TSSSG LV D+LHL        A   ++Q+ V+ GCG+ QTG++LD  G A DG+MGLG
Sbjct: 204 NTSSSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLG 263

Query: 258 LGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
           +G VSVPS LA +GL+  +SFS+CF ++  G V FGD G   Q  T F  +      Y V
Sbjct: 264 MGKVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFT-VRSLNPTYNV 322

Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--- 373
              S  IG+  +    F A++DSG SFT+L    Y ++  KF+  VS +R++    S   
Sbjct: 323 SFTSIGIGSESVAAE-FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADP 381

Query: 374 --WKYCYNASSEEM-LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGD 429
             ++YCY  S  +  + +PD+ L       F V        +  G  + +CL +M  D  
Sbjct: 382 FPFEYCYRLSPNQTEVAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAIGYCLAIMRNDMA 441

Query: 430 YG--IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQ 487
            G  IIGQNFM G ++VFDRE   L W    C      +      P    +P   PT   
Sbjct: 442 IGIDIIGQNFMTGLKVVFDRERSVLGWEKFDCYRNARVADAPDGSPGPSSAPAAGPTKIT 501

Query: 488 QSTSNGQAAAPPSTA---KTAPSKSIAA 512
              ++G  +  P  A   ++A S++ AA
Sbjct: 502 PRQNDGSGSGYPGAAPLPRSAGSRNAAA 529


>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
          Length = 513

 Score =  280 bits (715), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 170/481 (35%), Positives = 267/481 (55%), Gaps = 19/481 (3%)

Query: 54  PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
           P   + EY   L  +D  R+++   L         +  F ++G+ T+   N F +LHY  
Sbjct: 48  PPHGTAEYYAALAGHDGLRRRS---LGVGGGGGGAEFAF-ADGNDTYRL-NDFGFLHYAV 102

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTPNV+FLVALD GS+L WVPC C++CAPL +  Y SL  ++  Y P+ S++S+ V 
Sbjct: 103 VALGTPNVTFLVALDTGSDLFWVPCDCLKCAPLQSPNYGSLKFDV--YSPAQSTTSRKVP 160

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+L+L S S  A    V + ++
Sbjct: 161 CSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS--AQSKIVTAPIM 218

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL  NSFS+CF ++  G + FGD
Sbjct: 219 FGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGD 278

Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAE 353
            G + Q+ T  L + ++   Y + +    +G+  ++   F A+VDSG SFT L   +Y +
Sbjct: 279 TGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSFTALSDPMYTQ 336

Query: 354 VVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
           +   FD  + S R  L  +  +++CY+ S+  ++  P++ L       F V + I +  +
Sbjct: 337 ITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITITD 395

Query: 413 NEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLV 471
           N    V +CL +M ++G   +IG+NFM G ++VFDRE + L W +  C    + S + + 
Sbjct: 396 NAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLPVN 454

Query: 472 PPPAGQSPNPLPTTEQQSTSNGQAAAPPST-AKTAPSKSIAASAQQLDSVLRVACSLLVL 530
           P P+   P P       +    + A P  T     PS   A+S  Q  SV      L ++
Sbjct: 455 PSPSAVPPKPGLGPSSYTPEAAKGALPNGTQVNVMPS---ASSPLQPQSVFATIVLLFLI 511

Query: 531 M 531
           +
Sbjct: 512 V 512


>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
 gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
          Length = 541

 Score =  278 bits (712), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 187/522 (35%), Positives = 264/522 (50%), Gaps = 51/522 (9%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
           LVA+ ++    L+   DA S    L HRFS   ++ W    G+   A  WP + S EY  
Sbjct: 14  LVAVAIVAVSFLVAAGDASSVGFDLHHRFSPVVRQ-WAEARGHPFAAQDWPARGSPEYYS 72

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGN---QFYW-LHYTWIDIGTP 119
            L  +D      R  L      SR  L   ++G  T   GN   Q+   L+Y  +++GTP
Sbjct: 73  ALSRHD------RAVL------SRRALADGADGLVTFAAGNDTLQYIGSLYYAVVEVGTP 120

Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
           N +FLVALD GS+L WVPC C QCA + A+        L  Y P  SS+SK V+C + LC
Sbjct: 121 NATFLVALDTGSDLFWVPCDCKQCASI-ANVTGQPATALRPYSPRESSTSKQVTCDNALC 179

Query: 180 KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS------VQSSV 232
              + C +  +  CPY   Y + +TS+SG LV D+LHL   ++  P ++      +Q+ V
Sbjct: 180 DRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHL---TRERPGAAAEAGEALQAPV 236

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFF 291
           + GCG+ QTG++LDGAA DG+MGLG  +VSVPS+LA +GL+  +SFS+CF ++  G + F
Sbjct: 237 VFGCGQVQTGTFLDGAAFDGLMGLGRENVSVPSVLASSGLVASDSFSMCFGDDGVGRINF 296

Query: 292 GDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTE 349
           GD G + Q  T F      Y+  F  V VE+  +       + F A++DSG SFT+L   
Sbjct: 297 GDSGSSGQGETPFTGRRTLYNVSFTAVNVETKSVA------AEFAAVIDSGTSFTYLADP 350

Query: 350 IYAEVVVKFDKLVSSKRISLQGNS-----WKYCY--NASSEEMLKVPDMRLIFSKNQSFV 402
            Y E+   F+ LV  +R +    S     ++YCY    +  E L +PD+ L       F 
Sbjct: 351 EYTELATNFNSLVRERRTNFSSGSADPFPFEYCYALGPNQTEAL-IPDVSLTTKGGARFP 409

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTD--GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC- 459
           V   +           +CL +M  D   ++ IIGQNFM G ++VFDRE   L W    C 
Sbjct: 410 VTQPVIGVASGRTVVGYCLAIMKNDLGVNFNIIGQNFMTGLKVVFDREKSVLGWEKFDCY 469

Query: 460 --EEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 499
               V D       P PA   P  +   +   +SNG  AA P
Sbjct: 470 KNARVADAPDGSPSPAPAAD-PTKITPRQNDGSSNGFPAAAP 510


>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 518

 Score =  278 bits (710), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 182/486 (37%), Positives = 261/486 (53%), Gaps = 33/486 (6%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           F+ K+ HRFSD  K+  +S S   + + ++P K S EY   L   D   Q  R +   N 
Sbjct: 28  FTFKMHHRFSDMLKD--LSDS---TTSRNFPSKGSFEYYAELAHRD---QMLRGRKLYNV 79

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
            +    L F S+G+ T F  +   +LHYT +++GTP + F+VALD GS+L WVPC C +C
Sbjct: 80  EAP---LAF-SDGNST-FRISSLGFLHYTTVELGTPGMKFMVALDTGSDLFWVPCDCSKC 134

Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 203
           AP     Y S D  LS YDP  SS+SK V+C++ LC  R+ C      CPY+  Y +  T
Sbjct: 135 APTQGVAYAS-DFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQT 193

Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
           S+SG LV+D+LHL S  + + Q S+++ V  GCG+ Q+GS+L+ AAP+G+ GLG+  +SV
Sbjct: 194 STSGILVEDVLHLTS--EDSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISV 251

Query: 264 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
           PS+L++ GL  +SFS+CF  +  G + FGD+G   Q+ T F      + +Y + V    +
Sbjct: 252 PSILSREGLTADSFSMCFGHDGVGRISFGDKGSPDQEETPFNS-NPSHPSYNISVTQVRV 310

Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS- 381
           G + L    F AL DSG SFT+L   IYA V   F      KR        ++YCY+ S 
Sbjct: 311 GTT-LVDVDFTALFDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSP 369

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
                 +P M L       F V + I     +NE   V+CL ++ +  +  IIGQNFM G
Sbjct: 370 GANSSLIPSMSLTMKGRGHFTVFDPIIVITTQNE--LVYCLAIVKST-ELNIIGQNFMTG 426

Query: 441 HRIVFDRENLKLAWSHSKCEE-----VIDKSHVHLVPPPA----GQSPNPLPTTEQQSTS 491
           +R+VFDRE L L W  + C +        + H   VPP      G   +P  T + +  S
Sbjct: 427 YRVVFDREKLVLGWKETDCYDQEYNSFPTEPHASDVPPAVAAGLGNYSSPHSTNQDRKKS 486

Query: 492 NGQAAA 497
               A+
Sbjct: 487 QSSVAS 492


>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 530

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 172/486 (35%), Positives = 258/486 (53%), Gaps = 20/486 (4%)

Query: 54  PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
           P   + EY   L  +D +R   R    +        L F ++G+ T+   N F +LHY  
Sbjct: 48  PPAGTAEYYAALAGHDLRR---RSLAAAAGGGGAGNLAF-ADGNDTYRL-NDFGFLHYAV 102

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTPNV+FLVALD GS+L WVPC CI+CAPL++  Y  L  ++  Y P  SS+S+ V 
Sbjct: 103 VALGTPNVTFLVALDTGSDLFWVPCDCIKCAPLASPDYGDLKFDM--YSPRKSSTSRKVP 160

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV-QSSV 232
           CS  LC  ++ C +  + CPY   Y +E+TSS G LV+D+L+L + S    QS + Q+ +
Sbjct: 161 CSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESG---QSKITQAPI 217

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
             GCG+ Q+GS+L  AAP+G++GLG+   SVPSLLA  G+  NSFS+CF E+  G + FG
Sbjct: 218 TFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFGEDGHGRINFG 277

Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYA 352
           D G + Q  T  L I ++   Y + +    +G      + F A+VDSG SFT L   +Y 
Sbjct: 278 DTGSSDQLETP-LNIYKQNPYYNISITGAMVGGKSF-DTKFSAVVDSGTSFTALSDPMYT 335

Query: 353 EVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP 411
           E+   F+  V   R  L  +  ++YCY+ S++  +  P++ L       F V   I +  
Sbjct: 336 EITSTFNAQVKESRKHLDASMPFEYCYSISAQGAVNPPNISLTAKGGSIFPVNGPIITIT 395

Query: 412 ENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHL 470
           +     + +CL +M ++G   +IG+NFM G +IVFDRE L L W    C    + S + +
Sbjct: 396 DTSSRPIAYCLAIMKSEG-VNLIGENFMSGLKIVFDRERLVLGWKTFNCYNFDNSSKLPV 454

Query: 471 VPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASAQQL---DSVLRVACSL 527
              P+   P P       +    + A+P  T    P  S ++S  +L    + L    +L
Sbjct: 455 NRNPSADPPKPALGPSSSNPEAAKGASPNITQIDVPHSS-SSSETRLHLSGTFLSATIAL 513

Query: 528 LVLMCL 533
           L L  L
Sbjct: 514 LFLAAL 519


>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 527

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 171/490 (34%), Positives = 259/490 (52%), Gaps = 36/490 (7%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           SF   + HRFSD  K         +   D+ P K S EY   +   D   +  R+     
Sbjct: 38  SFGFDIHHRFSDPVK--------GILGIDNIPDKGSREYYVAMAHRDRVFRGRRLA---- 85

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
           +    +Q L       T +  + F +LH+  + +GTP  S+LVALD GS+L W+PC C +
Sbjct: 86  DGGDVDQKLLTFSPDNTTYQISLFGYLHFANVSVGTPASSYLVALDTGSDLFWLPCNCTK 145

Query: 143 CA-PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYST 200
           C   +  S    +  N+  YD   SS+SKNV+C+  LC+ ++ C S     CPY  +Y +
Sbjct: 146 CVHGIQLSTGQKIAFNI--YDNKESSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLS 203

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
           E+TS++G+LV+D+LHL + +    Q +    +  GCG+ QTG++LDGAAP+G+ GLG+ D
Sbjct: 204 ENTSTTGFLVEDVLHLITDNDDQTQHA-NPLITFGCGQVQTGAFLDGAAPNGLFGLGMSD 262

Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           VSVPS+LAK GL  NSFS+CF  +  G + FGD   +  Q  +   I   +  Y + V  
Sbjct: 263 VSVPSILAKQGLTSNSFSMCFAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQ 322

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYC 377
             +G +      F A+ D+G SFT+L    Y ++   FD  +  +R S   +    ++YC
Sbjct: 323 IIVGGNSADLE-FNAIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDDLPFEYC 381

Query: 378 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIF-SFPENEGFTVFCLTVMSTDGDYGIIGQN 436
           Y+  + + ++VP++ L      ++ V + I  S   N G  V CL V+ ++ +  IIGQN
Sbjct: 382 YDLRTNQTIEVPNINLTMKGGDNYFVMDPIITSGGGNNG--VLCLAVLKSN-NVNIIGQN 438

Query: 437 FMMGHRIVFDRENLKLAWSHSKC--EEV----IDKSHVHLVPPPAGQSPNPLPTTEQQST 490
           FM G+RIVFDREN+ L W  S C  +E+    +++SH   V P    +P       Q + 
Sbjct: 439 FMTGYRIVFDRENMTLGWKESNCYDDELSSLPVNRSHAPAVSPAMAVNPEI-----QSNP 493

Query: 491 SNGQAAAPPS 500
           SNG    P S
Sbjct: 494 SNGPQRLPSS 503


>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 533

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 177/491 (36%), Positives = 264/491 (53%), Gaps = 27/491 (5%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           +F   L HR+SD  K       G +SV D  P+K S+ Y   +   D        KL S+
Sbjct: 40  TFGFDLHHRYSDPVK-------GMLSV-DDLPEKGSLHYYASMAHRDILIHGR--KLVSD 89

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
           N S+   L F S G++T+ F +   +LHY  + IGTP++S+LVALD GS+L W+PC C  
Sbjct: 90  NTST--PLTFFS-GNETYRF-SSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTN 145

Query: 143 CAPLSASYYTSLDR-NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 201
              +    + S ++ + + Y P++SS+S+ + C++ LC  +S C S +  CPY   Y + 
Sbjct: 146 SGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSN 205

Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
            TSS+G LV+D+LHL +    A   ++ + +I GCGR QTGS+LDGAAP+G+ GLG+ ++
Sbjct: 206 GTSSTGVLVEDLLHLTT--DDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNI 263

Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
           SVPS LA+ G   NSFS+CF  +  G + FGD G + Q  T F  + + +  Y V +   
Sbjct: 264 SVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPF-NLRQLHPTYNVSITKI 322

Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNA 380
            +G        F A+ DSG SFT+L    Y  +   F+     KR  S+    ++YCY  
Sbjct: 323 NVGGRDADLE-FSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEM 381

Query: 381 SSEEM-LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 439
           SS +  L++P + L+      F V + I       G +++CL ++ + GD  IIGQNFM 
Sbjct: 382 SSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAIVKS-GDVNIIGQNFMT 440

Query: 440 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSP----NPLPTTEQQSTSNGQA 495
           G+RIVF+RE   L W  S C + +D +   + P   G  P    NP  T    +T+   +
Sbjct: 441 GYRIVFNRERNVLGWKASDCYDDMDTTTFPVDPISPGIPPATAVNPQATAGSGNTTE-VS 499

Query: 496 AAPPSTAKTAP 506
             PP     AP
Sbjct: 500 GTPPPVGNNAP 510


>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
           sativa Japonica Group]
          Length = 732

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 157/430 (36%), Positives = 248/430 (57%), Gaps = 15/430 (3%)

Query: 54  PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
           P   + EY   L  +D  R+++   L         +  F ++G+ T+   N F +LHY  
Sbjct: 48  PPHGTAEYYAALAGHDGLRRRS---LGVGGGGGGAEFAF-ADGNDTYRL-NDFGFLHYAV 102

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTPNV+FLVALD GS+L WVPC C++CAP  +  Y SL  ++  Y P+ S++S+ V 
Sbjct: 103 VALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSPAQSTTSRKVP 160

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+L+L S S  A    V + ++
Sbjct: 161 CSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS--AQSKIVTAPIM 218

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL  NSFS+CF ++  G + FGD
Sbjct: 219 FGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGD 278

Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAE 353
            G + Q+ T  L + ++   Y + +    +G+  ++   F A+VDSG SFT L   +Y +
Sbjct: 279 TGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSFTALSDPMYTQ 336

Query: 354 VVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
           +   FD  + S R  L  +  +++CY+ S+  ++  P++ L       F V + I +  +
Sbjct: 337 ITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITITD 395

Query: 413 NEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLV 471
           N    V +CL +M ++G   +IG+NFM G ++VFDRE + L W +  C    + S + + 
Sbjct: 396 NAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLPVN 454

Query: 472 PPPAGQSPNP 481
           P P+     P
Sbjct: 455 PSPSAVPSKP 464


>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 515

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 182/533 (34%), Positives = 271/533 (50%), Gaps = 46/533 (8%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
           +++   +   + L       ++  + HR S+  ++   S +  +      P+K +VEY  
Sbjct: 1   MLSFVFIIASLFLSLCHGHVYTFTMHHRHSEPVRKWSHSTASGIPAP---PEKGTVEYYA 57

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
            L   D      R+ L+    S  +  L  S+G+ T F  +   +LHYT + IGTP V F
Sbjct: 58  ELADRD------RL-LRGRKLSQIDDGLAFSDGNST-FRISSLGFLHYTTVQIGTPGVKF 109

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
           +VALD GS+L WVPC C +CA   +S + S D +L+ Y+P+ SS+SK V+C++ LC  RS
Sbjct: 110 MVALDTGSDLFWVPCDCTRCAATDSSAFAS-DFDLNVYNPNGSSTSKKVTCNNSLCMHRS 168

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
            C      CPY+  Y + +TS+SG LV+D+LHL     H     V+++VI GCG+ Q+GS
Sbjct: 169 QCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNH--HDLVEANVIFGCGQIQSGS 226

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS 303
           +LD AAP+G+ GLG+  +SVPS+L++ G   +SFS+CF  +  G + FGD+G   Q  T 
Sbjct: 227 FLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETP 286

Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
           F  +   +  Y + V    +G + L    F AL DSG SFT+L    Y  +   F   V 
Sbjct: 287 F-NLNPSHPTYNITVTQVRVGTT-LIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQ 344

Query: 364 SKRISLQGN-SWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL 421
            +R        ++YCY+ S +     +P + L       F V + I      +   V+CL
Sbjct: 345 DRRHRSDSRIPFEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIII-STQSELVYCL 403

Query: 422 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID-------KSHVHLVPPP 474
            V+ T  +  IIGQNFM G+R+VFDRE L L W    C ++ D       + H H   PP
Sbjct: 404 AVVKT-AELNIIGQNFMTGYRVVFDREKLVLGWKKFDCYDIEDHNDAIPTRPHSHADVPP 462

Query: 475 A-----GQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASAQQLDSVLR 522
           A     G  P   PT  ++S  N Q             K +  + Q L S+LR
Sbjct: 463 AVAAGLGNYPATDPT--RKSKYNSQ------------RKWLTNTTQWLRSMLR 501


>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 537

 Score =  272 bits (696), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 174/500 (34%), Positives = 253/500 (50%), Gaps = 41/500 (8%)

Query: 29  VHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNS--- 85
           +H  S     RW    G+   A     + + EY   L  +D      R   + +      
Sbjct: 33  LHHRSSPVVRRWAEARGHPGAAWWAEAEGTPEYYAALHRHDRAHLARRGLAEGDGEGLLT 92

Query: 86  -SRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
            +   L F  EGS           LHY  + +GTPN +FLVALD GS+L WVPC C QCA
Sbjct: 93  FASGNLTFRLEGS-----------LHYAEVAVGTPNATFLVALDTGSDLFWVPCDCKQCA 141

Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTE 201
           P++ +       +L  Y P  SS+SK V+C H LC+  ++C +  +    CPY   Y + 
Sbjct: 142 PIANASDLRGGPDLRPYSPGKSSTSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSA 201

Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
           +TSSSG LV+D+LHL+  +     ++V + V++GCG+ QTG++LDGAA DG++GLG+  V
Sbjct: 202 NTSSSGVLVEDVLHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKV 261

Query: 262 SVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           SVPS+L  AGL+  +SFS+CF  +  G + FGD G   Q  T F  +   +  Y + V +
Sbjct: 262 SVPSVLHAAGLVASDSFSMCFSPDGFGRINFGDSGRRGQAETPFT-VRNTHPTYNISVTA 320

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
             +    +    F A+VDSG SFT+L    Y E+   F+  V  +R +L  +  ++YCY 
Sbjct: 321 MSVSGKEVAAE-FAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEYCYE 379

Query: 380 -ASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPE-NEGFTV---FCLTVMSTDGDYGII 433
               +  L VP++ L       F V R  +  + E ++G  V   +CL V+  D    II
Sbjct: 380 LGRGQTELFVPEVSLTTRGGAVFPVTRPIVVIYGETSDGRIVAAGYCLAVLKNDITIDII 439

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTT----EQQS 489
           GQNFM G ++VFDRE   L W    C + ++   +       G +P P PTT     Q  
Sbjct: 440 GQNFMTGLKVVFDRERSVLGWHEFDCYKDVETEEL-------GAAPGPSPTTRLKPRQSE 492

Query: 490 TSNGQ--AAAPPSTAKTAPS 507
            +NG     A P T + A S
Sbjct: 493 VANGTPYPGAVPVTPRQAGS 512


>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
 gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
          Length = 523

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 163/454 (35%), Positives = 253/454 (55%), Gaps = 20/454 (4%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HR+S   +E W             P   + EY   L  +D +R ++     +       +
Sbjct: 35  HRYSATVRE-WAGH-------HRAPPAGTAEYYAALARHDLRR-RSLAAGPAAGGGGGGE 85

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
           + F ++G+ T+   N+  +LHY  + +GTPNV+FLVALD GS+L WVPC CI CAPL + 
Sbjct: 86  VAF-ADGNDTYRL-NELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLVSP 143

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
            Y   D     Y P  SS+S+ V CS  LC  +S+C+S    CPY  +Y +++TSS+G L
Sbjct: 144 NYR--DLKFDTYSPQKSSTSRKVPCSSNLCDLQSACRSASSSCPYSIEYLSDNTSSTGVL 201

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           V+D+L+L   +++     V + +  GCGR QTGS+L  AAP+G++GLG+  +SVPSLLA 
Sbjct: 202 VEDVLYL--ITEYGQPKIVTAPITFGCGRIQTGSFLGSAAPNGLLGLGMDSISVPSLLAS 259

Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
            G+  NSFS+CF ++  G + FGD G + QQ T  L I ++   Y + +    +G+    
Sbjct: 260 EGVAANSFSMCFGDDGRGRINFGDTGSSDQQETP-LNIYKQNPYYNISITGAMVGSKSF- 317

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKV 388
            + F A+VDSG SFT L   +Y+E+   F+  V  K   L  +  +++CY+ S +  +  
Sbjct: 318 NTNFNAIVDSGTSFTALSDPMYSEITSSFNSQVQDKPTQLDSSLPFEFCYSISPKGSVNP 377

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           P++ L+      F V + I +  ++    + +CL VM ++G   +IG+NFM G ++VFDR
Sbjct: 378 PNISLMAKGGSIFPVNDPIITITDDASNPMAYCLAVMKSEG-VNLIGENFMSGLKVVFDR 436

Query: 448 ENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNP 481
           E   L W    C  V + S++ + P P+G  P P
Sbjct: 437 ERKVLGWKKFNCYSVDNSSNLPVNPNPSGVPPKP 470


>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
           Group]
          Length = 476

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 155/420 (36%), Positives = 243/420 (57%), Gaps = 18/420 (4%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
           N F +LHY  + +GTPNV+FLVALD GS+L WVPC C++CAP  +  Y SL  ++  Y P
Sbjct: 56  NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSP 113

Query: 164 SSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           + S++S+ V CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+L+L S S  A
Sbjct: 114 AQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS--A 171

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
               V + ++ GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL  NSFS+CF +
Sbjct: 172 QSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 231

Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 343
           +  G + FGD G + Q+ T  L + ++   Y + +    +G+  ++   F A+VDSG SF
Sbjct: 232 DGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSF 289

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
           T L   +Y ++   FD  + S R  L  +  +++CY+ S+  ++  P++ L       F 
Sbjct: 290 TALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFP 348

Query: 403 VRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           V + I +  +N    V +CL +M ++G   +IG+NFM G ++VFDRE + L W +  C  
Sbjct: 349 VNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFNCYN 407

Query: 462 VIDKSHVHLVPPPAGQSPNP-------LPTTEQQSTSNG-QAAAPPSTAKTAPSKSIAAS 513
             + S + + P P+     P        P   + +  NG Q    PS +     +S++A+
Sbjct: 408 FDESSRLPVNPSPSAVPSKPGLGPSSYTPEAAKGALPNGTQVNVMPSASSPLQPQSVSAT 467


>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
 gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
          Length = 490

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 154/420 (36%), Positives = 243/420 (57%), Gaps = 18/420 (4%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
           N F +LHY  + +GTPNV+FLVALD GS+L WVPC C++CAP  +  Y SL  ++  Y P
Sbjct: 70  NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSP 127

Query: 164 SSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           + S++S+ V CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+L+L S S  +
Sbjct: 128 AQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS 187

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
               V + ++ GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL  NSFS+CF +
Sbjct: 188 --KIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 245

Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 343
           +  G + FGD G + Q+ T  L + ++   Y + +    +G+  ++   F A+VDSG SF
Sbjct: 246 DGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSF 303

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
           T L   +Y ++   FD  + S R  L  +  +++CY+ S+  ++  P++ L       F 
Sbjct: 304 TALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFP 362

Query: 403 VRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           V + I +  +N    V +CL +M ++G   +IG+NFM G ++VFDRE + L W +  C  
Sbjct: 363 VNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFNCYN 421

Query: 462 VIDKSHVHLVPPPAGQSPNP-------LPTTEQQSTSNG-QAAAPPSTAKTAPSKSIAAS 513
             + S + + P P+     P        P   + +  NG Q    PS +     +S++A+
Sbjct: 422 FDESSRLPVNPSPSAVPSKPGLGPSSYTPEAAKGALPNGTQVNVMPSASSPLQPQSVSAT 481


>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Cucumis sativus]
          Length = 417

 Score =  268 bits (686), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 149/377 (39%), Positives = 217/377 (57%), Gaps = 16/377 (4%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y LHYT + +GTP   F+VALD GS+L WVPC C +CAP   S Y S D  LS Y P  S
Sbjct: 1   YSLHYTTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYAS-DFELSVYSPKKS 59

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S+SK V C++ LC  R  C      CPY+  Y + +TS++G L++D+LHL + +KH+   
Sbjct: 60  STSKTVPCNNSLCAQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKHS--E 117

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
            +Q+ +  GCG+ Q+GS+LD AAP+G+ GLG+  +SVPS+L++ GL+ NSFS+CF ++  
Sbjct: 118 PIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGV 177

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFL 346
           G + FGD+G   Q+ T F  + + +  Y + V S  +G + L  +   AL DSG SF++ 
Sbjct: 178 GRINFGDKGSLEQEETPF-NLNQLHPNYNITVTSIRVGTT-LIDADITALFDSGTSFSYF 235

Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKV-PDMRLIFSKNQSFVVR 404
              IY+++   F       R        ++YCYN S +    + P + L       F V 
Sbjct: 236 TDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSPDANASLTPGISLTMKGGGPFPVY 295

Query: 405 NHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
           + I     +NE   ++CL V+ +  +  IIGQNFM G+RIVFDRE L L W    C ++ 
Sbjct: 296 DPIIVISTQNE--LIYCLAVVKS-AELNIIGQNFMTGYRIVFDREKLVLGWKKFDCYDIE 352

Query: 464 DKSHVHLVP-----PPA 475
           +KS   + P     PPA
Sbjct: 353 EKSLFPMKPDVTTVPPA 369


>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 553

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 175/518 (33%), Positives = 266/518 (51%), Gaps = 57/518 (11%)

Query: 20  DAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKL 79
           +A  F+  + HR+S+  K +W   +   S +  WP+K SVEY   L   D   +  R+  
Sbjct: 22  NAHIFTFTMHHRYSEPVK-KWSHSAP--SPSHRWPEKGSVEYYAELADRDRFLRGRRL-- 76

Query: 80  QSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ 139
                S  +  L  S+G+ T F  +   +LHYT I++GTP V F+VALD GS+L WVPC 
Sbjct: 77  -----SQFDAGLAFSDGNST-FRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCD 130

Query: 140 CIQCAPL---SASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA 196
           C +C+     + +   + D +LS Y+P+ SS+SK V+C++ LC  R+ C      CPY+ 
Sbjct: 131 CTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMV 190

Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
            Y + +TS+SG LV+D+LHL     +     V+++VI GCG+ Q+GS+LD AAP+G+ GL
Sbjct: 191 SYVSAETSTSGILVEDVLHLTQPDDN--HDLVEANVIFGCGQVQSGSFLDVAAPNGLFGL 248

Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
           G+  +SVPS+L++ G   +SFS+CF  +  G + FGD+G   Q  T F  +   +  Y +
Sbjct: 249 GMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPF-NVNPSHPTYNI 307

Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEV---------------------- 354
            +    +G + L    F AL DSG SFT+L    Y+ +                      
Sbjct: 308 TINQVRVGTT-LIDVEFTALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVT 366

Query: 355 ----VVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHIF 408
               +++F   V  +R        + YCY+ S +     +P M L       FVV + I 
Sbjct: 367 IEVFMLQFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPII 426

Query: 409 SFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKS-- 466
                +   V+CL V+ +  +  IIGQNFM G+R+VFDRE L L W  S C ++ D +  
Sbjct: 427 II-STQSELVYCLAVVKS-AELNIIGQNFMTGYRVVFDREKLILGWKKSDCYDIEDHNNA 484

Query: 467 -----HVHLVPPPAGQSPNPLPTTE--QQSTSNGQAAA 497
                H   VPP         PTT+  ++S  N Q ++
Sbjct: 485 IPIGQHSDKVPPAVAAGLGDYPTTDSSRKSKYNSQHSS 522


>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 169/484 (34%), Positives = 254/484 (52%), Gaps = 32/484 (6%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           ++  + HR S+  ++   S +  +      P++ +VEY   L   D         L+   
Sbjct: 25  YTFTMHHRHSEPVRKWSHSAAAGIPAP---PEEGTVEYYAELADRDRL-------LRGRK 74

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
            S  +  L  S+G+ T F  +   +LHYT + IGTP V F+VALD GS+L WVPC C +C
Sbjct: 75  LSQIDAGLAFSDGNST-FRISSLGFLHYTTVQIGTPGVKFMVALDTGSDLFWVPCDCTRC 133

Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 203
           A   ++ + S D +L+ Y+P+ SS+SK V+C++ LC  RS C      CPY+  Y + +T
Sbjct: 134 AASDSTAFAS-DFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFSNCPYMVSYVSAET 192

Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
           S+SG LV+D+LHL     H     V+++VI GCG+ Q+GS+LD AAP+G+ GLG+  +SV
Sbjct: 193 STSGILVEDVLHLTQEDNH--HDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISV 250

Query: 264 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
           PS+L++ G   +SFS+CF  +  G + FGD+G   Q  T F  +   +  Y + V    +
Sbjct: 251 PSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRV 309

Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 382
           G + +    F AL DSG SFT+L    Y  +   F   V  +R        ++YCY+ S 
Sbjct: 310 GTTVIDVE-FTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSP 368

Query: 383 EEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
           +     +P + L       F V + I      +   V+CL V+ +  +  IIGQNFM G+
Sbjct: 369 DANTSLIPSVSLTMGGGSHFAVYDPIIII-STQSELVYCLAVVKS-AELNIIGQNFMTGY 426

Query: 442 RIVFDRENLKLAWSHSKCEEVID---------KSHVHLVPP--PAGQSPNPLPTTEQQST 490
           R+VFDRE L L W    C ++ D         +SH   VPP   AG    P   + ++S 
Sbjct: 427 RVVFDREKLVLGWKKFDCYDIEDHNDAIPTRPRSHAD-VPPAVAAGLGNYPATDSTRKSK 485

Query: 491 SNGQ 494
            N Q
Sbjct: 486 YNSQ 489


>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  264 bits (674), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 168/458 (36%), Positives = 246/458 (53%), Gaps = 28/458 (6%)

Query: 7   ICMLFGCILLDGSDAV-SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELL 65
           I ML    +LD  + +  F  +  HRFSD+           V   D  P ++S +Y  ++
Sbjct: 15  ILMLVSSWVLDRCEGLGEFGFEFHHRFSDQVV--------GVLPGDGLPNRDSSKYYRVM 66

Query: 66  LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
              D   +  R+       S    L+  ++G++T    N   +LHY  + +GTP+  FLV
Sbjct: 67  AHRDRLIRGRRLA------SEDQSLVTFADGNET-IRVNALGFLHYANVTVGTPSDWFLV 119

Query: 126 ALDAGSNLLWVPCQC-IQCA-PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
           ALD GS+L W+PC C   C   L A   +SLD N+  Y P++SS+S  V C+  LC    
Sbjct: 120 ALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNI--YSPNASSTSSKVPCNSTLCTRVD 177

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
            C S    CPY   Y +  TSS+G LV+D+LHL S  K++    +++ + +GCG  QTG 
Sbjct: 178 RCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNS--KPIRARITLGCGLVQTGV 235

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS 303
           + DGAAP+G+ GLGL D+SVPS+LAK G+  NSFS+CF ++ +G + FGD+G   Q+ T 
Sbjct: 236 FHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETP 295

Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
            L I + +  Y V V    +G +      F A+ D+G SFT+L    Y  +   F+ L  
Sbjct: 296 -LNIRQPHPTYNVTVTQISVGGNT-GDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLAL 353

Query: 364 SKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL 421
            KR        ++YCY  S +++  + PD+ L      S+ V + +   P  E   V+CL
Sbjct: 354 DKRYQTDSELPFEYCYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPI-EDTVVYCL 412

Query: 422 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            +M ++ D  IIGQNFM G+R+VFDRE L L W  S C
Sbjct: 413 AIMKSE-DISIIGQNFMTGYRVVFDREKLILGWKESDC 449


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 154/414 (37%), Positives = 224/414 (54%), Gaps = 27/414 (6%)

Query: 108 WLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +L+Y  + +GTP V +LVALD GS+L W+PC C+ C  ++    T    N + Y P++SS
Sbjct: 105 FLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNC--ITGLNTTQGPVNFNIYSPNNSS 162

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           +SK V CS  LC     C S  D CPY   Y +++TSS+GYLV+DILHL +         
Sbjct: 163 TSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTT--NDVQSKP 220

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
           V + + +GCG+ Q+G++L  AAP+G+ GLG+ +VSVPS+LA AGLI NSFS+CF     G
Sbjct: 221 VNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMG 280

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
            + FGD+G   Q  T F  +G ++  Y V +    +G   ++      + DSG SFT+L 
Sbjct: 281 RIEFGDKGSPGQNETPF-NLGRRHPTYNVSITQIGVGGH-ISDLDVAVIFDSGTSFTYLN 338

Query: 348 TEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRN 405
              Y+    KF  +V  K+ ++  +  ++ CY  S ++     P M L       FV+ N
Sbjct: 339 DPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKGGGHFVI-N 397

Query: 406 HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 465
           H       E   +FCL +  +D    IIGQNFM G+ IVFDRE + L W  S C    D+
Sbjct: 398 HPIVLISTESKRLFCLAIARSD-SINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDE 456

Query: 466 SHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA-KTAPSKSIAASAQQLD 518
           +  +L   P G +P P             AAAP +TA K   + +I  + Q ++
Sbjct: 457 NTNNL---PVGPTPTP-------------AAAPGTTAIKPQANSNINNTTQTIE 494


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 154/414 (37%), Positives = 224/414 (54%), Gaps = 27/414 (6%)

Query: 108 WLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +L+Y  + +GTP V +LVALD GS+L W+PC C+ C  ++    T    N + Y P++SS
Sbjct: 128 FLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNC--ITGLNTTQGPVNFNIYSPNNSS 185

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           +SK V CS  LC     C S  D CPY   Y +++TSS+GYLV+DILHL +         
Sbjct: 186 TSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTT--NDVQSKP 243

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
           V + + +GCG+ Q+G++L  AAP+G+ GLG+ +VSVPS+LA AGLI NSFS+CF     G
Sbjct: 244 VNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMG 303

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
            + FGD+G   Q  T F  +G ++  Y V +    +G   ++      + DSG SFT+L 
Sbjct: 304 RIEFGDKGSPGQNETPF-NLGRRHPTYNVSITQIGVGGH-ISDLDVAVIFDSGTSFTYLN 361

Query: 348 TEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRN 405
              Y+    KF  +V  K+ ++  +  ++ CY  S ++     P M L       FV+ N
Sbjct: 362 DPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKGGGHFVI-N 420

Query: 406 HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 465
           H       E   +FCL +  +D    IIGQNFM G+ IVFDRE + L W  S C    D+
Sbjct: 421 HPIVLISTESKRLFCLAIARSD-SINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDE 479

Query: 466 SHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA-KTAPSKSIAASAQQLD 518
           +  +L   P G +P P             AAAP +TA K   + +I  + Q ++
Sbjct: 480 NTNNL---PVGPTPTP-------------AAAPGTTAIKPQANSNINNTTQTIE 517


>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
          Length = 530

 Score =  262 bits (670), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 157/434 (36%), Positives = 239/434 (55%), Gaps = 19/434 (4%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HRFS    +RW    G+V +   WP+  S +Y+  L  +D +R  +           +  
Sbjct: 39  HRFSSPV-QRWAEARGHV-LPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDKPP 96

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
            L  SEG+ T    N   +LHY  + +GTP  +F+VALD GS+L W+PCQC  C P +++
Sbjct: 97  PLTFSEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASA 155

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
              S     S Y PS SS+S+ V C+   C+ R  C +    CPY   Y + DTSSSG+L
Sbjct: 156 ASGSA----SFYIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFL 210

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           V+D+L+L++  + A    +++ ++ GCG+ QTGS+LD AAP+G+ GLG+  +S+PS+LA+
Sbjct: 211 VEDVLYLST--EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQ 268

Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
            GL  NSF++CF  +  G + FGDQG + Q+ T  L +  ++  Y + +    +GNS LT
Sbjct: 269 KGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEMTVGNS-LT 326

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
              F  + D+G SFT+L    Y  +   F   V + R +      ++YCY+ +SSE+ ++
Sbjct: 327 DLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQ 386

Query: 388 VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            P + L       F V     + S  ++E   V+CL ++ +     IIGQNFM G R+VF
Sbjct: 387 TPSISLRTVGGSVFPVIDEGQVISIQQHE--YVYCLAIVKS-AKLNIIGQNFMTGLRVVF 443

Query: 446 DRENLKLAWSHSKC 459
           DRE   L W    C
Sbjct: 444 DRERKILGWKKFNC 457


>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
 gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
          Length = 530

 Score =  262 bits (670), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 157/434 (36%), Positives = 239/434 (55%), Gaps = 19/434 (4%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HRFS    +RW    G+V +   WP+  S +Y+  L  +D +R  +           +  
Sbjct: 39  HRFSSPV-QRWAEARGHV-LPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDKPP 96

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
            L  SEG+ T    N   +LHY  + +GTP  +F+VALD GS+L W+PCQC  C P +++
Sbjct: 97  PLTFSEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASA 155

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
              S     S Y PS SS+S+ V C+   C+ R  C +    CPY   Y + DTSSSG+L
Sbjct: 156 ASGSA----SFYIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFL 210

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           V+D+L+L++  + A    +++ ++ GCG+ QTGS+LD AAP+G+ GLG+  +S+PS+LA+
Sbjct: 211 VEDVLYLST--EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQ 268

Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
            GL  NSF++CF  +  G + FGDQG + Q+ T  L +  ++  Y + +    +GNS LT
Sbjct: 269 KGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEITVGNS-LT 326

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
              F  + D+G SFT+L    Y  +   F   V + R +      ++YCY+ +SSE+ ++
Sbjct: 327 DLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQ 386

Query: 388 VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            P + L       F V     + S  ++E   V+CL ++ +     IIGQNFM G R+VF
Sbjct: 387 TPSISLRTVGGSVFPVIDEGQVISIQQHE--YVYCLAIVKS-AKLNIIGQNFMTGLRVVF 443

Query: 446 DRENLKLAWSHSKC 459
           DRE   L W    C
Sbjct: 444 DRERKILGWKKFNC 457


>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
          Length = 530

 Score =  262 bits (670), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 157/434 (36%), Positives = 239/434 (55%), Gaps = 19/434 (4%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HRFS    +RW    G+V +   WP+  S +Y+  L  +D +R  +           +  
Sbjct: 39  HRFSSPV-QRWAEARGHV-LPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDKPP 96

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
            L  SEG+ T    N   +LHY  + +GTP  +F+VALD GS+L W+PCQC  C P +++
Sbjct: 97  PLTFSEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASA 155

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
              S     S Y PS SS+S+ V C+   C+ R  C +    CPY   Y + DTSSSG+L
Sbjct: 156 ASGSA----SFYIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFL 210

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           V+D+L+L++  + A    +++ ++ GCG+ QTGS+LD AAP+G+ GLG+  +S+PS+LA+
Sbjct: 211 VEDVLYLST--EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQ 268

Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
            GL  NSF++CF  +  G + FGDQG + Q+ T  L +  ++  Y + +    +GNS LT
Sbjct: 269 KGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEITVGNS-LT 326

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
              F  + D+G SFT+L    Y  +   F   V + R +      ++YCY+ +SSE+ ++
Sbjct: 327 DLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQ 386

Query: 388 VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            P + L       F V     + S  ++E   V+CL ++ +     IIGQNFM G R+VF
Sbjct: 387 TPSISLRTVGGSVFPVIDEGQVISIQQHE--YVYCLAIVKS-AKLNIIGQNFMTGLRVVF 443

Query: 446 DRENLKLAWSHSKC 459
           DRE   L W    C
Sbjct: 444 DRERKILGWKKFNC 457


>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
          Length = 585

 Score =  262 bits (669), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 173/476 (36%), Positives = 240/476 (50%), Gaps = 78/476 (16%)

Query: 5   VAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLEL 64
           V I +L   +      A  FS ++ HRFS+  K +W   +GN   A +WP K S EY   
Sbjct: 7   VFIVILLSILGFRSCHARIFSFQMHHRFSEPVK-KWSEGAGNGFPAGNWPAKGSFEYYAE 65

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           L   D   +  R+       S  + LL  S+G+ T F  +   +LHYT + +GTP   FL
Sbjct: 66  LAHRDRALRGRRL-------SDIDGLLTFSDGNST-FRISSLGFLHYTTVSLGTPGKKFL 117

Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
           VALD GS+L WVPC C +CAP   + Y S D  LS Y+P  SS+S+ V+C++ LC  R+ 
Sbjct: 118 VALDTGSDLFWVPCDCSRCAPTEGTTYAS-DFELSIYNPKGSSTSRKVTCNNSLCAHRNR 176

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
           C      CPY+  Y + +TS+SG LV+D+LHL +  +   Q  V++ V  GCG+ QTGS+
Sbjct: 177 CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSF 234

Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSF 304
           LD AAP+G+ GLGL  +SVPS+L+K G   +SFS+CF  +  G + FGD+G   Q+ T F
Sbjct: 235 LDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGGPDQEETPF 294

Query: 305 LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 364
             +   +  Y + V    +G + L    F AL DSG SFT+L   IY  V          
Sbjct: 295 -NLNALHPTYNITVTQVRVGTT-LIDLDFTALFDSGTSFTYLVDPIYTNV---------- 342

Query: 365 KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 424
               L+ +   YC                        VVR+                   
Sbjct: 343 ----LKSSELIYCMA----------------------VVRS------------------- 357

Query: 425 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP-----PPA 475
               +  IIGQNFM G+RI+FDRE L L W   +C++ I+ S V + P     PPA
Sbjct: 358 ---AELNIIGQNFMTGYRIIFDREKLVLGWKEFECDD-IENSSVPIRPRATSVPPA 409


>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
 gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  261 bits (668), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 157/425 (36%), Positives = 219/425 (51%), Gaps = 44/425 (10%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y LHY  + +GTP+VSFLVALD GSNLLW+PC C  C     S   ++D N+  Y P++S
Sbjct: 59  YILHYANVSVGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTVDLNI--YSPNTS 116

Query: 167 SSSKNVSCSHPLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           S+S+ V C+  LC    R  C S +  CPY   Y +  TS++GY+V D+LHL   S  + 
Sbjct: 117 STSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHL--ISDDSQ 174

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
             +V + +  GCG+ QTGS+L G AP+G+ GLG+ ++SVPS LA  G    SFS+CF  N
Sbjct: 175 SKAVDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPN 234

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFT 344
             G + FGD+G   Q  TSF     +   Y + +    IG    +   + A+ DSG SFT
Sbjct: 235 GIGRISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQA-SDLVYSAIFDSGTSFT 293

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS---------------EEMLKVP 389
           +L    Y  +   F+KLV   R S     + YCY+  S               +    +P
Sbjct: 294 YLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCYDIRSFISAQILPFSCAYANQTEPTIP 353

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
            + L+ S    F V + I      +G  V+CL ++ + GD  IIGQNFM GHRIVFDRE 
Sbjct: 354 AVTLVMSGGDYFNVTDPIVLVQLADGSAVYCLGMIKS-GDVNIIGQNFMTGHRIVFDRER 412

Query: 450 LKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKS 509
           + L W  S C + +D + + + P                       A PP+TA    +K 
Sbjct: 413 MILGWKPSNCYDNMDTNTLAVSP---------------------NTAVPPATAVNPEAKQ 451

Query: 510 IAASA 514
           I AS+
Sbjct: 452 IPASS 456


>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 520

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 164/450 (36%), Positives = 242/450 (53%), Gaps = 22/450 (4%)

Query: 18  GSDAVSFSS-KLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR 76
           G DA +  S +  HRFS   + RW+   G  ++   WP   S  Y+  L  +D  R    
Sbjct: 23  GGDASTAPSLEFHHRFSAPLR-RWVEARGR-ALPGGWPAPGSAAYVAALAGHDRHRA--- 77

Query: 77  VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
           V     ++S    L F +EG+ T    N   +LHY  + +GTP  +F+VALD GS+L W+
Sbjct: 78  VSAAGGSSSDAPPLTF-AEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWL 135

Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA 196
           PCQC  C P + +   S       Y P  SS+SK V C+   C  +  C +    CPY  
Sbjct: 136 PCQCDGCTPPATAASGSFQATF--YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKM 192

Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
            Y +  TSSSG+LV+D+L+L++ + H PQ  +++ +++GCG+ QTGS+LD AAP+G+ GL
Sbjct: 193 VYVSAGTSSSGFLVEDVLYLSTENAH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGL 250

Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
           G+ +VSVPS+LA+ GL  NSFS+CF  +  G + FGDQ  + Q+ T  L I  ++  Y +
Sbjct: 251 GIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAI 309

Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWK 375
            +    +GN   T   F  + D+G SFT+L    Y  +   F   V + R +      ++
Sbjct: 310 TISGITVGNK-PTDMDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFE 368

Query: 376 YCYN-ASSEEMLKVPDMRLIFSKNQSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGI 432
           YCY+ +SSE    +PD+ L       F V +   + S  E+E   V+CL ++ +     I
Sbjct: 369 YCYDLSSSEARFPIPDIILRTVTGSMFPVIDPGQVISIQEHE--YVYCLAIVKSM-KLNI 425

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IGQNFM G R+VFDRE   L W    C + 
Sbjct: 426 IGQNFMTGLRVVFDRERKILGWKKFNCYDT 455


>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
 gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 500

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 164/447 (36%), Positives = 243/447 (54%), Gaps = 24/447 (5%)

Query: 18  GSDAVSFSS-KLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR 76
           G DA +  S +  HRFS   + RW+   G  ++   WP   S  Y+  L  +D  R    
Sbjct: 23  GGDASTAPSLEFHHRFSAPLR-RWVEARGR-ALPGGWPAPGSAAYVAALAGHDRHRA--- 77

Query: 77  VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
           V     ++S    L F +EG+ T    N   +LHY  + +GTP  +F+VALD GS+L W+
Sbjct: 78  VSAAGGSSSDAPPLTF-AEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWL 135

Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA 196
           PCQC  C P +    T+   + + Y P  SS+SK V C+   C  +  C +    CPY  
Sbjct: 136 PCQCDGCTPPA----TAASGSATFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKM 190

Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
            Y +  TSSSG+LV+D+L+L++ + H PQ  +++ +++GCG+ QTGS+LD AAP+G+ GL
Sbjct: 191 VYVSAGTSSSGFLVEDVLYLSTENAH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGL 248

Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
           G+ +VSVPS+LA+ GL  NSFS+CF  +  G + FGDQ  + Q+ T  L I  ++  Y +
Sbjct: 249 GIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAI 307

Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWK 375
            +    +GN   T   F  + D+G SFT+L    Y  +   F   V + R +      ++
Sbjct: 308 TISGITVGNK-PTDMDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFE 366

Query: 376 YCYN-ASSEEMLKVPDMRLIFSKNQSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGI 432
           YCY+ +SSE    +PD+ L       F V +   + S  E+E   V+CL ++ +     I
Sbjct: 367 YCYDLSSSEARFPIPDIILRTVTGSMFPVIDPGQVISIQEHE--YVYCLAIVKSM-KLNI 423

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKC 459
           IGQNFM G R+VFDRE   L W    C
Sbjct: 424 IGQNFMTGLRVVFDRERKILGWKKFNC 450


>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Brachypodium distachyon]
          Length = 509

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 174/493 (35%), Positives = 250/493 (50%), Gaps = 42/493 (8%)

Query: 28  LVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSR 87
           L HRFS   K RW    G  + A  WP+  S EY   L ++D  R   RV       S  
Sbjct: 13  LHHRFSPVVK-RWAESRGRPAAAAWWPE-GSPEYYSALSAHDRAR---RVLAGGKGES-- 65

Query: 88  NQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLS 147
             L F    S T   G+    LHY  + +GTPN +F+VALD GS+L WVPC C +CAP++
Sbjct: 66  -LLSFADGNSTTRHAGS----LHYAKVALGTPNATFVVALDTGSDLFWVPCDCKRCAPIA 120

Query: 148 ASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSG 207
                +    L  Y P  SS+SK V+CSH LC   ++C +    CPY   Y + +TSSSG
Sbjct: 121 -----NTSELLKPYSPRQSSTSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSG 175

Query: 208 YLVDDILHLA-------SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
            LV+D+L++        S +      +V + V+ GCG++QTG++LDGAA +G++GLG+  
Sbjct: 176 VLVEDVLYMTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDR 235

Query: 261 VSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 319
           VSVPSLLA AGL+  +SFS+CF  + +G + FG+   A  Q+ +   + +    Y + V 
Sbjct: 236 VSVPSLLAAAGLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTRPTYNISVT 295

Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCY 378
           +  +       + F A+VDSG SFT+L    Y+ +   F+  V  KR +L  +  ++YCY
Sbjct: 296 AVNVKGKGAMAAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIPFEYCY 355

Query: 379 NAS-SEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTV----FCLTVMSTDGDYGI 432
             S  +  + +P++ L       F V R  +    E     V    +CL V  +D    I
Sbjct: 356 ALSRGQTEVLMPEVSLTTRGGAVFPVTRPFVIVAGETTDGQVHAVGYCLAVFKSDIPIDI 415

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKC---EEVIDKSHVHLVPPP-------AGQSPNPL 482
           IGQNFM G ++VFDR+   L W+   C    +V D       P P         QS  P 
Sbjct: 416 IGQNFMTGLKVVFDRQRSVLGWTKFDCYKNMKVEDDGSPAAAPGPMPVTQLRPRQSDTPF 475

Query: 483 PTTEQQSTSNGQA 495
           P   Q  ++ G A
Sbjct: 476 PGAVQPRSAAGHA 488


>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 161/434 (37%), Positives = 237/434 (54%), Gaps = 27/434 (6%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HRFSD+           V   D  P ++S +Y  ++   D   +  R +  +N + S   
Sbjct: 39  HRFSDQVV--------GVLPGDGLPNRDSSKYYRVMAHRD---RLIRGRRLANEDQS--- 84

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA-PLSA 148
           L+  S+G++T    +   +LHY  + +GTP+  F+VALD GS+L W+PC C  C   L A
Sbjct: 85  LVTFSDGNETVRV-DALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKA 143

Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
              +SLD N+  Y P++SS+S  V C+  LC     C S +  CPY   Y +  TSS+G 
Sbjct: 144 PGGSSLDLNI--YSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGV 201

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
           LV+D+LHL S  K +   ++ + V  GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LA
Sbjct: 202 LVEDVLHLVSNDKSS--KAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLA 259

Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
           K G+  NSFS+CF  + +G + FGD+G   Q+ T  L I + +  Y + V    +G +  
Sbjct: 260 KEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT- 317

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNAS-SEEM 385
               F A+ DSG SFT+L    Y  +   F+ L   KR     +   ++YCY  S +++ 
Sbjct: 318 GDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDS 377

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            + P + L      S+ V + +   P  +   V+CL +M  + D  IIGQNFM G+R+VF
Sbjct: 378 FQYPAVNLTMKGGSSYPVYHPLVVIPMKDT-DVYCLAIMKIE-DISIIGQNFMTGYRVVF 435

Query: 446 DRENLKLAWSHSKC 459
           DRE L L W  S C
Sbjct: 436 DREKLILGWKESDC 449


>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
 gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
          Length = 499

 Score =  259 bits (662), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 162/434 (37%), Positives = 236/434 (54%), Gaps = 23/434 (5%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HRFS   + RW    G  ++   WP   S  Y+  L  +D  R    V       S    
Sbjct: 35  HRFSAPLR-RWAEARGR-ALPGGWPAPGSAAYVAALAGHDRHRA---VSAAGGGGSGTPP 89

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
           L F +EG+ T    N   +LHY  + +GTP  +F+VALD GS+L W+PCQC  C P +  
Sbjct: 90  LTF-AEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA-- 145

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
             T+   + + Y P  SS+SK V C+   C  +  C +    CPY   Y +  TSSSG+L
Sbjct: 146 --TAASGSATFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFL 202

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           V+D+L+L++ + H PQ  +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+
Sbjct: 203 VEDVLYLSTENAH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQ 260

Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
            GL  NSFS+CF  +  G + FGDQG + Q+ T  L I +++  Y + +    IGN   T
Sbjct: 261 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETP-LNINQQHPTYAITISGITIGNKP-T 318

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
              F  + D+G SFT+L    Y  +   F   V + R +      ++YCY+ +SSE    
Sbjct: 319 DLDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFP 378

Query: 388 VPDMRLIFSKNQSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
           +PD+ L       F V +   + S  E+E   V+CL ++ +     IIGQNFM G R+VF
Sbjct: 379 IPDIILRTVSGSLFPVIDPGQVISIQEHE--YVYCLAIVKSR-KLNIIGQNFMTGLRVVF 435

Query: 446 DRENLKLAWSHSKC 459
           DRE   L W    C
Sbjct: 436 DRERKILGWKKFNC 449


>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 516

 Score =  259 bits (662), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 149/444 (33%), Positives = 234/444 (52%), Gaps = 24/444 (5%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           +F   + HRFSD+ K       G + + D  P+K + +Y  ++   D   +  R+     
Sbjct: 32  TFGFDIHHRFSDQIK-------GMLGI-DDVPQKGTPQYYAVMAHRDRVFRGRRLA---- 79

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
             +  +  L  + G+ TH   +  + LH+  + +GTP + FLVALD GS+L W+PC CI 
Sbjct: 80  -GADHHSPLTFAAGNDTHQIASSGF-LHFANVSVGTPPLWFLVALDTGSDLFWLPCDCIS 137

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH-PLCKSRSSCKSLKDPCPYIADYSTE 201
           C        T      + YD   SS+S  VSC++   C+ R  C S    C Y  DY + 
Sbjct: 138 CVHGGLRTRTGKILKFNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSN 197

Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
           DTSS G++V+D+LHL +       +  +  +  GCG+ QTG +L+GAAP+G+ GLG+ ++
Sbjct: 198 DTSSRGFVVEDVLHLITDDDQTKDADTR--IAFGCGQVQTGVFLNGAAPNGLFGLGMDNI 255

Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
           SVPS+LA+ GLI NSFS+CF  + +G + FGD G   Q+ T F  + + +  Y + +   
Sbjct: 256 SVPSILAREGLISNSFSMCFGSDSAGRITFGDTGSPDQRKTPF-NVRKLHPTYNITITKI 314

Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----WKYC 377
            + +S +    F A+ DSG SFT++    Y  +   ++  V +KR S Q       + YC
Sbjct: 315 IVEDS-VADLEFHAIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYC 373

Query: 378 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 437
           Y+ S  + ++VP + L       + V + I      E   + CL +  +D    IIGQNF
Sbjct: 374 YDISISQTIEVPFLNLTMKGGDDYYVMDPIIQVSSEEEGDLLCLGIQKSDS-VNIIGQNF 432

Query: 438 MMGHRIVFDRENLKLAWSHSKCEE 461
           M G++IVFDR+N+ L W  + C +
Sbjct: 433 MTGYKIVFDRDNMNLGWKETNCSD 456


>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 544

 Score =  259 bits (661), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 170/488 (34%), Positives = 251/488 (51%), Gaps = 52/488 (10%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           F   + HRFSD   E  I   GN  +    P K + +Y   ++  D      R+      
Sbjct: 39  FGLDIHHRFSDPVTE--ILGIGNDEL---LPHKGTPQYYAAMVHRDRVFHGRRLA----- 88

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
              R+  +  + G++TH     F +LH+  + +GTP + FLVALD GS+L W+PC C  C
Sbjct: 89  -DDRDTPITFAAGNETHQIA-AFGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCNCTSC 146

Query: 144 AP-LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
              L       +D N+ E D   SS+ KNV C+  +CK ++ C S    C Y  +Y + D
Sbjct: 147 VRGLKTQNGKVIDLNIYELD--KSSTRKNVPCNSNMCK-QTQCHSSGSSCRYEVEYLSND 203

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           TSSSG+LV+D+LHL   + +     + + + IGCG+ QTG +L+GAAP+G+ GLG+ +VS
Sbjct: 204 TSSSGFLVEDVLHL--ITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVS 261

Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC 322
           VPS+LA+ GLI +SFS+CF  + SG + FGD G + Q  T F  + E +  Y V +    
Sbjct: 262 VPSILAQKGLISDSFSMCFGSDGSGRITFGDTGSSDQGKTPF-NLRESHPTYNVTITQII 320

Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQGNS---WKYCY 378
           +G        F A+ DSG SFT+L    Y  +  KF+ LV + R S L  +S   ++YCY
Sbjct: 321 VGGYAADHE-FHAIFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCY 379

Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG---- 434
           + S ++ ++VP + L       + V + I          + CL +  +D +  IIG    
Sbjct: 380 DMSPDQTIEVPFLNLTMKGGDDYYVTDPIVPVSSEVEGNLLCLGIQKSD-NLNIIGREYT 438

Query: 435 ------------------QNFMMGHRIVFDRENLKLAWSHSKC-EEVI----DKSHVHLV 471
                             +NFM G+RIVFDREN+ L W  S C EEV+    +KSH   +
Sbjct: 439 TEEEFLHLKHMIIKFFIQKNFMTGYRIVFDRENMNLGWKESNCTEEVLSIPTNKSHSPAI 498

Query: 472 PPPAGQSP 479
            P    +P
Sbjct: 499 SPAIAVNP 506


>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 498

 Score =  258 bits (660), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 163/446 (36%), Positives = 241/446 (54%), Gaps = 24/446 (5%)

Query: 18  GSDAVSFSS-KLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR 76
           G DA +  S +  HRFS   + RW+   G  ++   WP   S  Y+  L  +D  R    
Sbjct: 23  GGDASTAPSLEFHHRFSAPLR-RWVEARGR-ALPGGWPAPGSAAYVAALAGHDRHRA--- 77

Query: 77  VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
           V     ++S    L F +EG+ T    N   +LHY  + +GTP  +F+VALD GS+L W+
Sbjct: 78  VSAAGGSSSDAPPLTF-AEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWL 135

Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA 196
           PCQC  C P +    T+   + + Y P  SS+SK V C+   C  +  C +    CPY  
Sbjct: 136 PCQCDGCTPPA----TAASGSATFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKM 190

Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
            Y +  TSSSG+LV+D+L+L++ + H PQ  +++ +++GCG+ QTGS+LD AAP+G+ GL
Sbjct: 191 VYVSAGTSSSGFLVEDVLYLSTENAH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGL 248

Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
           G+ +VSVPS+LA+ GL  NSFS+CF  +  G + FGDQ  + Q+ T  L I  ++  Y +
Sbjct: 249 GIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAI 307

Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWK 375
            +    +GN   T   F  + D+G SFT+L    Y  +   F   V + R +      ++
Sbjct: 308 TISGITVGNK-PTDMDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFE 366

Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGII 433
           YCY+  SE    +PD+ L       F V +   + S  E+E   V+CL ++ +     II
Sbjct: 367 YCYDL-SEARFPIPDIILRTVTGSMFPVIDPGQVISIQEHE--YVYCLAIVKSM-KLNII 422

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKC 459
           GQNFM G R+VFDRE   L W    C
Sbjct: 423 GQNFMTGLRVVFDRERKILGWKKFNC 448


>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  258 bits (659), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 162/435 (37%), Positives = 240/435 (55%), Gaps = 29/435 (6%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HRFSD+           V   D  P ++S +Y  ++   D   +  R +  +N + S   
Sbjct: 39  HRFSDQVV--------GVLPGDGLPNRDSSKYYRVMAHRD---RLIRGRRLANEDQS--- 84

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA-PLSA 148
           L+  S+G++T    +   +LHY  + +GTP+  FLVALD GS+L W+PC C  C   L A
Sbjct: 85  LVTFSDGNET-IRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKA 143

Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
              +SLD N+  Y P++SS+S  V C+  LC     C S +  CPY   Y +  TSS+G 
Sbjct: 144 PGGSSLDLNI--YSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGV 201

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
           LV+D+LHL S  K +   ++ + V +GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LA
Sbjct: 202 LVEDVLHLVSNDKSS--KAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLA 259

Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSC 327
           K G+  NSFS+CF  + +G + FGD+G   Q+ T  L I + +  Y + V    + GN+ 
Sbjct: 260 KEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVEGNTG 318

Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNAS-SEE 384
             +  F A+ DSG SFT+L    Y  +   F+ L   KR     +   ++YCY  S +++
Sbjct: 319 DLE--FDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKD 376

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 444
             + P + L      S+ V + +   P  +   V+CL ++  + D  IIGQNFM G+R+V
Sbjct: 377 SFQYPAVNLTMKGGSSYPVYHPLVVIPMKDT-DVYCLAILKIE-DISIIGQNFMTGYRVV 434

Query: 445 FDRENLKLAWSHSKC 459
           FDRE L L W  S C
Sbjct: 435 FDREKLILGWKESDC 449


>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
          Length = 829

 Score =  257 bits (657), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 174/532 (32%), Positives = 274/532 (51%), Gaps = 45/532 (8%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           SF   + HRFSD  KE        + V D  P K +  Y  ++   D   +  R  L + 
Sbjct: 29  SFGFDIHHRFSDPVKEI-------LGVHD-LPDKGTRLYYVVMAHRDRIFRGRR--LAAA 78

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
            + S    +  +E  Q   FG    +LH+  + +GTP +SFLVALD GS+L W+PC C +
Sbjct: 79  VHHSPLTFVPANETYQIGAFG----FLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTK 134

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
           C     S    +  N+  YD   SS+S+ V C+  LC+ +  C S    CPY  +Y +  
Sbjct: 135 CVRGVESNGEKIAFNI--YDLKGSSTSQTVLCNSNLCELQRQCPSSDSICPYEVNYLSNG 192

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           TS++G+LV+D+LHL +       +  +  +  GCG+ QTG++LDGAAP+G+ GLG+G+ S
Sbjct: 193 TSTTGFLVEDVLHLITDDDETKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMGNES 250

Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC 322
           VPS+LAK GL  NSFS+CF  +  G + FGD     Q  T F  +   +  Y + V    
Sbjct: 251 VPSILAKEGLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQII 309

Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYN 379
           +G +      F A+ DSG SFT L    Y ++   F+  +  +R S   +    ++YCY+
Sbjct: 310 VGGNAADLE-FHAIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYD 368

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 439
            SS + +++P + L      +++V + I +    EG  + CL V+ ++ +  IIGQNFM 
Sbjct: 369 LSSNKTVELP-INLTMKGGDNYLVTDPIVTI-SGEGVNLLCLGVLKSN-NVNIIGQNFMT 425

Query: 440 GHRIVFDRENLKLAWSHSKC--EEV----IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNG 493
           G+RIVFDREN+ L W  S C  +E+    I++S+   + P    +P      E  + SN 
Sbjct: 426 GYRIVFDRENMILGWRESNCYVDELSTLAINRSNSPAISPAIAVNPE-----ETSNQSND 480

Query: 494 QAAAPPSTAKTAPSKSIAAS--------AQQLDSVLRVACSLLVLMCLLLSS 537
              +P  + K  P+ +   +        + Q+   + VA   L++M  ++S+
Sbjct: 481 PELSPNLSFKIKPTSAFMMALLVPKNHRSTQISMAVMVAFLNLIIMFSVVST 532


>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
           Japonica Group]
 gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
           Group]
 gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
          Length = 551

 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 181/517 (35%), Positives = 268/517 (51%), Gaps = 63/517 (12%)

Query: 28  LVHRFSDEAKERWISKSGNVSVADSWPKKNSV----EYLELLLSNDWKRQKTRVKLQSNN 83
           L HR+S    +RW  + G+  V  SWP    V    EY   L  +D      R   Q + 
Sbjct: 31  LHHRYS-PIVQRWAEERGHAGV--SWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDG 87

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
                 L+  ++G+ T         LHY  + +GTPN +FLVALD GS+L WVPC C QC
Sbjct: 88  ------LVTFADGNITLRLDGS---LHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQC 138

Query: 144 APLSASYYTSLDRN----LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYS 199
           APL     T++D      L +Y PS SS+SK V+C+  LC   ++C +    CPY   Y+
Sbjct: 139 APLGN--LTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYA 196

Query: 200 TEDTSSSGYLVDDILHLA---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
             +TSSSG LV+D+L+L      +  A  ++V++ V+ GCG+ QTGS+LDGAA DG+MGL
Sbjct: 197 MANTSSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGL 256

Query: 257 GLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF 315
           G+  VSVPS+LA  G+++ NSFS+CF ++  G + FGD G A Q  T F+ +   +  Y 
Sbjct: 257 GMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTHSYYN 315

Query: 316 VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-- 373
           + + S  +G+  L   GF A+ DSG SFT+L    Y      F+  +S +R +  G++  
Sbjct: 316 ISITSMSVGDKNLPL-GFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRS 374

Query: 374 ----WKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIFSFPE---NEGFTV--FCLTV 423
               ++YCY+ S ++  +++P + L  +    F V + ++       N    +  +CL V
Sbjct: 375 GPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAV 434

Query: 424 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC---EEVID--------------KS 466
           + +D    IIGQNFM G ++VF+RE   L W    C   E++ D               +
Sbjct: 435 IKSDLPIDIIGQNFMTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTT 494

Query: 467 HVHLVP----PPAGQSPNP--LPTTEQQSTSNGQAAA 497
           HV   P     PAG++P P   P     S + G  A 
Sbjct: 495 HVFPQPQESDSPAGRTPIPGAAPVPRSSSAAAGGRAG 531


>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
          Length = 551

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 181/517 (35%), Positives = 268/517 (51%), Gaps = 63/517 (12%)

Query: 28  LVHRFSDEAKERWISKSGNVSVADSWPKKNSV----EYLELLLSNDWKRQKTRVKLQSNN 83
           L HR+S    +RW  + G+  V  SWP    V    EY   L  +D      R   Q + 
Sbjct: 31  LHHRYS-PIVQRWAEERGHAGV--SWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDG 87

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
                 L+  ++G+ T         LHY  + +GTPN +FLVALD GS+L WVPC C QC
Sbjct: 88  ------LVTFADGNITLRLDGS---LHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQC 138

Query: 144 APLSASYYTSLDRN----LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYS 199
           APL     T++D      L +Y PS SS+SK V+C+  LC   ++C +    CPY   Y+
Sbjct: 139 APLGN--LTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYA 196

Query: 200 TEDTSSSGYLVDDILHLA---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
             +TSSSG LV+D+L+L      +  A  ++V++ V+ GCG+ QTGS+LDGAA DG+MGL
Sbjct: 197 MANTSSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGL 256

Query: 257 GLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF 315
           G+  VSVPS+LA  G+++ NSFS+CF ++  G + FGD G A Q  T F+ +   +  Y 
Sbjct: 257 GMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTHSYYN 315

Query: 316 VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-- 373
           + + S  +G+  L   GF A+ DSG SFT+L    Y      F+  +S +R +  G++  
Sbjct: 316 ISITSMSVGDKNLPL-GFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRS 374

Query: 374 ----WKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIFSFPE---NEGFTV--FCLTV 423
               ++YCY+ S ++  +++P + L  +    F V + ++       N    +  +CL V
Sbjct: 375 GPFPFEYCYSLSPDQTTVELPIVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAV 434

Query: 424 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC---EEVID--------------KS 466
           + +D    IIGQNFM G ++VF+RE   L W    C   E++ D               +
Sbjct: 435 IKSDLPIDIIGQNFMTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTT 494

Query: 467 HVHLVP----PPAGQSPNP--LPTTEQQSTSNGQAAA 497
           HV   P     PAG++P P   P     S + G  A 
Sbjct: 495 HVFPQPQESDSPAGRTPIPGAAPVPRSSSAAAGGRAG 531


>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 508

 Score =  251 bits (642), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 156/461 (33%), Positives = 235/461 (50%), Gaps = 30/461 (6%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
           L+ + + F    L    A SF   + HRFSD  KE + S        +  P+K++  Y  
Sbjct: 12  LLVLSVFFLAGGLRSGHAASFKFTIHHRFSDSIKEIFGS--------EGLPEKHTPGYYA 63

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
            ++  D  R      L + N  +     + +E  +    GN    L+Y  + IGTP + F
Sbjct: 64  AMVHRD--RLLHGRNLATTNGDTPLMFSYGNETYELSGLGN----LYYANVSIGTPGLYF 117

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRN---LSEYDPSSSSSSKNVSCSHPLCK 180
           LVALD GS+L W+PC+C +C     +Y T  D     L+ Y  ++SS+S  V CS  LC+
Sbjct: 118 LVALDTGSDLFWLPCECTKCP----TYLTKRDNGKFWLNHYSSNASSTSIRVPCSSSLCE 173

Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
             + C S K  CPY   Y +E++SS+GYLV DILH+A+    +    V   V +GCG+ Q
Sbjct: 174 LANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMAT--DDSQLKPVDVKVTLGCGKVQ 231

Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
           TG + +  AP+G++GLG+G VSVPS LA  GL  +SFS+CF     G + FGD GP  Q+
Sbjct: 232 TGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTDSFSMCFGYYGYGRIDFGDIGPVGQR 291

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
            T F P    Y+   + +    I  +  T     A++DSGASFT+L    Y+ +    D 
Sbjct: 292 ETPFNPASLSYNVTILQI----IVTNRPTNVHLTAIIDSGASFTYLTDPFYSIITENMDA 347

Query: 361 LVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF 419
            +  +RI    +  ++YCY  S   + + P++       + F V     S   ++G    
Sbjct: 348 AMELERIKSDSDFPFEYCYRLSLATIFQQPNLNFTMEGGRKFDVITSYVSVDTDDG-PAL 406

Query: 420 CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           CL ++ +  D  +IG NF  G+R+VF+RE + L W    C+
Sbjct: 407 CLAIVKST-DINVIGHNFFGGYRVVFNREKMTLGWKEVDCD 446


>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score =  249 bits (635), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 174/510 (34%), Positives = 256/510 (50%), Gaps = 42/510 (8%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HRFS   + RW    G+  +   WP      Y+  L  +D  R  +           R  
Sbjct: 29  HRFSARVR-RWADSRGH-ELPGGWPSPGGFAYVAALAGHDRHRALSAA-------GGRPP 79

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
           L F SEG+ T    N   +LHY  + +GTP  +F+VALD GS+L W+PCQC  C   +  
Sbjct: 80  LTF-SEGNATLKVSN-LGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGC---TPP 134

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
             ++     S Y PS SS+S+ V C+   C  R  C S    CPY   Y + DTSSSG+L
Sbjct: 135 PSSAASAPASFYIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSGFL 193

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           V+D+L+L++   H PQ  +++ ++ GCG  QTGS+LD AAP+G+ GLG+  +SVPS+LA+
Sbjct: 194 VEDVLYLSTEDTH-PQF-LKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251

Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
            GL  NSFS+CF  +  G + FGDQG + Q+ T  L I +K+  Y + +    +GN+ L 
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LM 309

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
                 + D+G SFT+L    Y  +   F   V + R +      ++YCY+ +SSE  ++
Sbjct: 310 DLEVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQ 369

Query: 388 VPDMRLIFSKNQSF--VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            P + L       F  +    + S  ++E   V+CL ++ +     IIGQNFM G R+VF
Sbjct: 370 TPSISLRTVGGSLFPAIDPGQVISIQQHE--YVYCLAIVKST-KLNIIGQNFMTGVRVVF 426

Query: 446 DRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTA 505
           DRE   L W    C +                S NPL    + ST   +  +P  T   A
Sbjct: 427 DRERKILGWKKFNCYDT--------------DSLNPLSINSRNSTP--ENYSPQETKNPA 470

Query: 506 PSKSIAASAQQLDSVLRVACSLLVLMCLLL 535
            +  +   +     V     SLL++M +LL
Sbjct: 471 GASQLGHVSSSPPLVWWHNNSLLLMMFVLL 500


>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score =  248 bits (632), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 174/510 (34%), Positives = 256/510 (50%), Gaps = 42/510 (8%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HRFS   + RW    G+  +   WP      Y+  L  +D  R  +           R  
Sbjct: 29  HRFSARVR-RWADSRGH-ELPGGWPSPGGFAYVAALAGHDRHRALSAA-------GGRPP 79

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
           L F SEG+ T    N   +LHY  + +GTP  +F+VALD GS+L W+PCQC  C   +  
Sbjct: 80  LTF-SEGNATLKVSN-LGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGC---TPP 134

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
             ++     S Y PS SS+S+ V C+   C  R  C S    CPY   Y + DTSSSG+L
Sbjct: 135 PSSAASAPASFYIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSGFL 193

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           V+D+L+L++   H PQ  +++ ++ GCG  QTGS+LD AAP+G+ GLG+  +SVPS+LA+
Sbjct: 194 VEDVLYLSTEDTH-PQF-LKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251

Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
            GL  NSFS+CF  +  G + FGDQG + Q+ T  L I +K+  Y + +    +GN+ L 
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LM 309

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
                 + D+G SFT+L    Y  +   F   V + R +      ++YCY+ +SSE  ++
Sbjct: 310 DLEVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQ 369

Query: 388 VPDMRLIFSKNQSF--VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            P + L       F  +    + S  ++E   V+CL ++ +     IIGQNFM G R+VF
Sbjct: 370 TPSISLRTVGGSLFPAIDPGQVISIQQHE--YVYCLAIVKST-KLNIIGQNFMTGVRVVF 426

Query: 446 DRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTA 505
           DRE   L W    C +                S NPL    + ST   +  +P  T   A
Sbjct: 427 DRERKILGWKKFNCYDT--------------DSLNPLSINSRNSTP--ENYSPQETKNPA 470

Query: 506 PSKSIAASAQQLDSVLRVACSLLVLMCLLL 535
            +  +   +     V     SLL++M +LL
Sbjct: 471 GASQLRHVSSSPPLVWWHNNSLLLMMFVLL 500


>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 508

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 164/497 (32%), Positives = 257/497 (51%), Gaps = 39/497 (7%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           SF   + HRFSD  KE        + V D  P K + +Y   +   D   +  R+    +
Sbjct: 29  SFGFDIHHRFSDPVKEI-------LGVHD-LPDKGTRQYYVAMAHRDRIFRGRRLAAGYH 80

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
              S    +  +E  Q   FG    +LH+  + +GTP +SFLVALD GS+L W+PC C +
Sbjct: 81  ---SPLTFIPSNETYQIEAFG----FLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTK 133

Query: 143 CAP-LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 201
           C   +  S    +  N+  YD   SS+S+ V C+  LC+ +  C S    CPY  +Y + 
Sbjct: 134 CVHGIGLSNGEKIAFNI--YDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSN 191

Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
            TS++G+LV+D+LHL +       +  +  +  GCG+ QTG++LDGAAP+G+ GLG+ + 
Sbjct: 192 GTSTTGFLVEDVLHLITDDDKTKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMSNE 249

Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
           SVPS+LAK GL  NSFS+CF  +  G + FGD     Q  T F  +   +  Y + V   
Sbjct: 250 SVPSILAKEGLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQI 308

Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCY 378
            +G   +    F A+ DSG SFT+L    Y ++   F+  +  +R S   ++   ++YCY
Sbjct: 309 IVGEK-VDDLEFHAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCY 367

Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 438
             S  + +++  + L      +++V + I +    EG  + CL V+ ++ +  IIGQNFM
Sbjct: 368 ELSPNQTVEL-SINLTMKGGDNYLVTDPIVTV-SGEGINLLCLGVLKSN-NVNIIGQNFM 424

Query: 439 MGHRIVFDRENLKLAWSHSKC--EEV----IDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 492
            G+RIVFDREN+ L W  S C  +E+    I++S+   + P    +P       + S SN
Sbjct: 425 TGYRIVFDRENMILGWRESNCYDDELSTLPINRSNTPAISPAIAVNPE-----ARSSQSN 479

Query: 493 GQAAAPPSTAKTAPSKS 509
               +P  + K  P+ +
Sbjct: 480 NPVLSPNLSFKIKPTSA 496


>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
          Length = 473

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 142/364 (39%), Positives = 203/364 (55%), Gaps = 21/364 (5%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA-PLSASYYTSLDRNLSEYDPSSSS 167
           LHY  + +GTP+  F+VALD GS+L W+PC C  C   L A   +SLD N+  Y P++SS
Sbjct: 54  LHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI--YSPNASS 111

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           +S  V C+  LC     C S +  CPY   Y +  TSS+G LV+D+LHL S  K +   +
Sbjct: 112 TSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSS--KA 169

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
           + + V  GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LAK G+  NSFS+CF  + +G
Sbjct: 170 IPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAG 229

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
            + FGD+G   Q+ T  L I + +  Y + V    +G +      F A+ DSG SFT+L 
Sbjct: 230 RISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT-GDLEFDAVFDSGTSFTYLT 287

Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNS--WKYCY----------NASSEEMLKVPDMRLIF 395
              Y  +   F+ L   KR     +   ++YCY          +  +++  + P + L  
Sbjct: 288 DAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNLTM 347

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
               S+ V + +   P  +   V+CL +M  + D  IIGQNFM G+R+VFDRE L L W 
Sbjct: 348 KGGSSYPVYHPLVVIPMKDT-DVYCLAIMKIE-DISIIGQNFMTGYRVVFDREKLILGWK 405

Query: 456 HSKC 459
            S C
Sbjct: 406 ESDC 409


>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
          Length = 671

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 136/352 (38%), Positives = 212/352 (60%), Gaps = 10/352 (2%)

Query: 94  SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTS 153
           ++G+ T+   N F +LHY  + +GTPNV+FLVALD GS+L WVPC C++CAP  +  Y S
Sbjct: 20  ADGNDTYRL-NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS 78

Query: 154 LDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 213
           L  ++  Y P+ S++S+ V CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+
Sbjct: 79  LKFDV--YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDV 136

Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
           L+L S S  A    V + ++ GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL 
Sbjct: 137 LYLTSDS--AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLA 194

Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 333
            NSFS+CF ++  G + FGD G + Q+ T  L + ++   Y + +    +G+  ++   F
Sbjct: 195 ANSFSMCFGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-F 252

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMR 392
            A+VDSG SFT L   +Y ++   FD  + S R  L  +  +++CY+ S+  ++  P++ 
Sbjct: 253 SAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVS 311

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRI 443
           L       F V + I +  +N    V +CL +M ++G   I G NF    R+
Sbjct: 312 LTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIGGYNFDESSRL 363


>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
 gi|219887047|gb|ACL53898.1| unknown [Zea mays]
 gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 416

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 138/358 (38%), Positives = 205/358 (57%), Gaps = 16/358 (4%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           LHY  + +GTP  +F+VALD GS+L W+PCQC  C P +    T+   + + Y P  SS+
Sbjct: 6   LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA----TAASGSATFYIPGMSST 61

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           SK V C+   C  +  C +    CPY   Y +  TSSSG+LV+D+L+L++ + H PQ  +
Sbjct: 62  SKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTENAH-PQI-L 118

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
           ++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL  NSFS+CF  +  G 
Sbjct: 119 KAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGR 178

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPT 348
           + FGDQ  + Q+ T  L I  ++  Y + +    +GN   T   F  + D+G SFT+L  
Sbjct: 179 ISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTDMDFITIFDTGTSFTYLAD 236

Query: 349 EIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKNQSFVVRN- 405
             Y  +   F   V + R +      ++YCY+ +SSE    +PD+ L       F V + 
Sbjct: 237 PAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTGSMFPVIDP 296

Query: 406 -HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
             + S  E+E   V+CL ++ +     IIGQNFM G R+VFDRE   L W    C + 
Sbjct: 297 GQVISIQEHE--YVYCLAIVKS-MKLNIIGQNFMTGLRVVFDRERKILGWKKFNCYDT 351


>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 510

 Score =  238 bits (608), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 178/530 (33%), Positives = 264/530 (49%), Gaps = 51/530 (9%)

Query: 16  LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKT 75
           +DG      S +  HRFS   +  W    G+  +   WP      Y+  L  +D  R   
Sbjct: 20  VDGRRRAPPSLEFHHRFSARLRG-WADARGH-ELPGGWPPPGGAAYVAALAGHDRHRALA 77

Query: 76  RVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLW 135
                    ++ +  L  SEG+ T    N   +LHY  + +GTP  +F+VALD GS+L W
Sbjct: 78  ---------AADHPPLTFSEGNATLKVSN-LGFLHYALVTVGTPGHTFMVALDTGSDLFW 127

Query: 136 VPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYI 195
           +PCQC  C P ++    S     S Y PS SS+S+ V C+   C  R  C S    CPY 
Sbjct: 128 LPCQCDGCPPPASGASGSA----SFYIPSMSSTSQAVPCNSDFCDHRKDC-STTSSCPYK 182

Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
             Y + DTSSSG+LV+D+L+L++   H PQ  +++ ++ GCG+ QTGS+LD AAP+G+ G
Sbjct: 183 MVYVSADTSSSGFLVEDVLYLSTEDNH-PQI-LKAQIMFGCGQVQTGSFLDAAAPNGLFG 240

Query: 256 LGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF 315
           LG+  +SVPS+LA  GL  +SFS+CF  +  G + FGDQG + Q+ T  L I +K+  Y 
Sbjct: 241 LGIDMISVPSILAHKGLTSDSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYA 299

Query: 316 VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SW 374
           + +    +G   +    F  + D+G +FT+L    Y  +   F   V + R +      +
Sbjct: 300 ITITGITVGTEPMDLE-FSTIFDTGTTFTYLADPAYTYITQSFHTQVRANRHAADTRIPF 358

Query: 375 KYCYN-ASSEEMLKVPDMRLIFSKNQSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYG 431
           +YCY+ +SSE  ++ P +         F V +   + S  ++E   V+CL ++ +     
Sbjct: 359 EYCYDLSSSEARIQTPGVSFRTVGGSLFPVIDLGQVISIQQHE--YVYCLAIVKST-KLN 415

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTS 491
           IIGQNFM G R+VFDRE   L W    C +                S NPL    + S+ 
Sbjct: 416 IIGQNFMTGVRVVFDRERKILGWKKFNCYDT--------------DSTNPLSINSRNSS- 460

Query: 492 NGQAAAPPSTAKTAPSKSIAASAQ--QLDSVLRVAC--SLLVLMCLLLSS 537
                  PST     +K+ A + Q   L+S   V    + LVLM LL+ S
Sbjct: 461 ----GFSPSTYSPQETKNPAGATQLRHLNSSPPVMWHNNSLVLMFLLVHS 506


>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
 gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
          Length = 575

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 160/469 (34%), Positives = 232/469 (49%), Gaps = 43/469 (9%)

Query: 17  DGSDAVSFSSKLVHRFSDEAKERWI--SKSGNVSV-ADSW------PKKNSVEYLELLLS 67
           + S  + F+  L HRFS   ++ W+  ++ G   V   SW      P   S EY   LL 
Sbjct: 25  EASGGIGFN--LHHRFSPVVRQ-WMVDARGGGHGVPGSSWLLPEEAPAVGSPEYYSALLR 81

Query: 68  NDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVAL 127
           +D      R  L S  +     L F ++G+ T    + + +LHY  +++GTP+  FLVAL
Sbjct: 82  HDRALFTRRRGLASAADGQSTTLTF-ADGNATRL--DTYEYLHYAEVEVGTPSSKFLVAL 138

Query: 128 DAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKS 187
           D GS+L W+PC+C  CA              + Y PS SS+SK V C HPLC+   +C +
Sbjct: 139 DTGSDLFWLPCECKLCA----------KNGSTMYSPSLSSTSKTVPCGHPLCERPDACAT 188

Query: 188 L---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
                  CPY   Y + +T SSG LV+D+LHL          +VQ+ ++ GCG+ QTG++
Sbjct: 189 AGKSSSSCPYEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQVQTGAF 248

Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTS 303
           L GAA  G+MGLGL  VSVPS LA +GL+  +SFS+CF  +  G + FGD G   Q  T 
Sbjct: 249 LRGAAAGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGDAGSPDQAETP 308

Query: 304 FLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
            +  G    +Y+ + V +  + +  +    F A+VDSG SFT+L    Y  +   F+  V
Sbjct: 309 LIAAGSLQPSYYNISVGAITVDSKAMAVE-FTAVVDSGTSFTYLDDPAYTFLTTNFNSRV 367

Query: 363 S--SKRISLQGNSWKYCYNASSEE--MLKVPDMRLIFSKNQSFVVRNHIFSF--PENEG- 415
           S  S+        +++CY  S  +  M ++P M L       F +   I       N G 
Sbjct: 368 SEASETYGSGYEKFEFCYRLSPGQTSMKRLPAMSLTTKGGAVFPITWPIIPVLASTNGGP 427

Query: 416 --FTVFCLTVMST---DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                +CL ++ T     +   IGQNFM G ++VFDR    L W    C
Sbjct: 428 YHPIGYCLGIIKTSILSTEDATIGQNFMTGLKVVFDRRKSVLGWEKFDC 476


>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 568

 Score =  234 bits (597), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 166/498 (33%), Positives = 252/498 (50%), Gaps = 58/498 (11%)

Query: 1   MVNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVE 60
           M+ ++++ +L G   L   DA SF   + HRFSD  K  + S        +  P+K++  
Sbjct: 11  MLLVLSVFILAGS--LRSGDAASFKFDIHHRFSDSIKGIFHS--------EGLPEKHTPG 60

Query: 61  YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
           Y   ++  D   +  R+     +     QL F + G+ T F  +   +L+Y  + +GTP+
Sbjct: 61  YYATMVHRDRLVRGRRLAASDVDT----QLTF-AYGNDTAFIPD-LGFLYYANVSVGTPS 114

Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRN------LSEYDPSSSSSSKNVSC 174
           + FLVALD GS+L W+PC+C  C       +T L+ +      L+ Y P+ S++S  V C
Sbjct: 115 LDFLVALDTGSDLFWLPCECSSC-------FTYLNTSNGGKFMLNHYSPNDSTTSSTVPC 167

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
           +  LC   + C S ++ CPY   Y + +TSS GYLV+D+LHLA+    +    V++ +  
Sbjct: 168 TSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLAT--DDSLLKPVEAKITF 222

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ 294
           GCG  QTG +   AAP+G++GLG+  +SVPS LA  GL  NSFS+CF  +  G + FGD 
Sbjct: 223 GCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADGYGRIDFGDT 282

Query: 295 GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEV 354
           GPA Q+ T F  + E Y +Y V      +G        F A+ DSG SFT+L    Y+ +
Sbjct: 283 GPADQKQTPFNTMLE-YQSYNVTFNVINVGGEP-NDVPFTAIFDSGTSFTYLTEPAYSTI 340

Query: 355 VVKFDKLVSSKRISLQGNS--WKYCYN-ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP 411
             + D  +  KR SL G +  ++YCY      +  +   +         F   +     P
Sbjct: 341 TKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTLNFTMKGGDEFTPTDIFVFLP 400

Query: 412 EN---------EGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
            +         E   V CL +  STD D  +IGQNFM G+RI F+R+ + L WS S C +
Sbjct: 401 VDVSTMNIIFEETTHVACLAIAKSTDID--LIGQNFMTGYRITFNRDQMVLGWSSSDCYD 458

Query: 462 VIDKSHVHLVPPPAGQSP 479
                  + V  P+G +P
Sbjct: 459 -------NGVGTPSGDTP 469


>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
 gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
          Length = 455

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 140/384 (36%), Positives = 217/384 (56%), Gaps = 21/384 (5%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
           L+ I ML         +   F+ ++ HRFSDE K+ W   +G  +    +P K S EY  
Sbjct: 12  LIPILMLLS---FGSCNGRIFTFEMHHRFSDEVKQ-WSDSTGRFA---KFPPKGSFEYFN 64

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
            L+  DW  +  R  L  + + S + L F S+G+ T    +   +LHYT + +GTP + F
Sbjct: 65  ALVLRDWLIRGRR--LSESESESESSLTF-SDGNSTSRI-SSLGFLHYTTVKLGTPGMRF 120

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
           +VALD GS+L WVPC C +CAP   + Y S +  LS Y+P  S+++K V+C++ LC  R+
Sbjct: 121 MVALDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRN 179

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
            C      CPY+  Y +  TS+SG L++D++HL +  K+  +  V++ V  GCG+ Q+GS
Sbjct: 180 QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGS 237

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS 303
           +LD AAP+G+ GLG+  +SVPS+LA+ GL+ +SFS+CF  +  G + FGD+G + Q+ T 
Sbjct: 238 FLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETP 297

Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
           F  +   +  Y + V    +G + L    F AL D+G SFT+L   +Y  V     +   
Sbjct: 298 F-NLNPSHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTV----SESAQ 351

Query: 364 SKRISLQGN-SWKYCYNASSEEML 386
            KR S      ++YCY+   + +L
Sbjct: 352 DKRHSPDSRIPFEYCYDMREKLVL 375


>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
 gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 529

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 158/446 (35%), Positives = 229/446 (51%), Gaps = 27/446 (6%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           FS ++ H FSD  K+       ++ + D  P+K S+EY ++L   D  R      L SNN
Sbjct: 29  FSFEVHHMFSDRVKQ-------SLGLDDLVPEKGSLEYFKVLAQRD--RLIRGRGLASNN 79

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
             +    +  +        G    +LHY  + +GTP   FLVALD GS+L W+PC C   
Sbjct: 80  EETPITFMRGNRTISIDLLG----FLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGST 135

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
           C         S  R L+ Y P++SS+S ++ CS   C   S C S    CPY   Y ++D
Sbjct: 136 CIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKD 195

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           T ++G L +D+LHL +  +      V++++ +GCG+ QTG     AA +G++GLGL D S
Sbjct: 196 TFTTGTLFEDVLHLVT--EDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYS 253

Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           VPS+LAKA +  NSFS+CF    +  G + FGD+G   Q  T  LP  E    Y V V  
Sbjct: 254 VPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPT-EPSPTYAVSVTE 312

Query: 321 YCIGNSCLTQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYC 377
             +G   +   G Q  AL D+G SFT L    Y  +   FD  V+ KR  +     +++C
Sbjct: 313 VSVGGDAV---GVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFC 369

Query: 378 YNAS-SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQ 435
           Y+ S ++  +  P + + F       +RN +F     +   ++CL ++ S D    IIGQ
Sbjct: 370 YDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQ 429

Query: 436 NFMMGHRIVFDRENLKLAWSHSKCEE 461
           NFM G+RIVFDRE + L W  S C E
Sbjct: 430 NFMSGYRIVFDRERMILGWKRSDCFE 455


>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 152/475 (32%), Positives = 239/475 (50%), Gaps = 47/475 (9%)

Query: 11  FGCIL---LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLS 67
           F CI+   L  S + S S ++ HRFS++ K         V      P+  S++Y + L+ 
Sbjct: 16  FLCIMSLGLASSVSGSLSFEIHHRFSEQVK--------TVLGGHGLPEMGSLDYYKALVH 67

Query: 68  NDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQ------FYWLHYTWIDIGTPNV 121
            D  R     +L SNNN +       +   +   +         F +LHY  + IGTP  
Sbjct: 68  RDRGR-----RLTSNNNQTTISFAQGNSTEEISLYDQNLAPPLFFNYLHYANVTIGTPAQ 122

Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSA------SYYTSLDRNLSEYDPSSSSSSKNVSCS 175
            FLVALD GS+L W+PC C      S       ++  +    L+ Y+PS S+SS  V+C+
Sbjct: 123 WFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQRIRLNIYNPSISTSSSKVTCN 182

Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
             LC  R+ C S    CPY   Y +  + S+G LV+D++H+++    A  + +      G
Sbjct: 183 STLCALRNRCISPLSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEARDARIT----FG 238

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
           C   Q G + +  A +G+MGL + D++VP++L KAG+  +SFS+CF  N  G++ FGD+G
Sbjct: 239 CSETQLGLFQE-VAVNGIMGLAMADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKG 297

Query: 296 PATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAE 353
            + Q  T   P+G      F  V +  + +G   + ++ F A+ DSG + T+L    Y  
Sbjct: 298 SSDQHET---PLGGTISPLFYDVSITKFKVGKVTV-ETKFSAIFDSGTAVTWLLDPYYTA 353

Query: 354 VVVKFDKLVSSKRISLQGNS-WKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFS 409
           +   F   V  +R+    +S +++CY   + S EE  K+P +        ++ V + I  
Sbjct: 354 LTTNFHLSVPDRRLPANVDSTFEFCYIITSTSDEE--KLPSISFEMKGGAAYDVFSPILV 411

Query: 410 FPENEG-FTVFCLTVMSTD-GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           F  ++G F V+CL V+  D  D+ IIGQNFM  +RIV DRE + L W  S C + 
Sbjct: 412 FDTSDGSFQVYCLAVLKQDKADFNIIGQNFMTNYRIVHDRERMILGWKKSNCNDT 466


>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 530

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 156/444 (35%), Positives = 226/444 (50%), Gaps = 23/444 (5%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           FS ++ H FSD  K+        + + D  P+K S+EY ++L   D  R      L SNN
Sbjct: 30  FSFEVHHMFSDRVKQ-------TLGLDDLVPEKGSLEYFKVLAQRD--RLIRGRGLASNN 80

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
             +    +  +      F G    +LHY  + +GTP   FLVALD GSNL W+PC C   
Sbjct: 81  EETPITFMRGNRTVSIDFLG----FLHYANVSVGTPATWFLVALDTGSNLFWLPCNCGST 136

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
           C         S  R L+ Y P++SS+S ++ C+   C   S C S    CPY   Y ++D
Sbjct: 137 CIRDLKDIGLSQSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQCSSPASSCPYQIQYLSKD 196

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           T ++G L +D+LHL +  +      V++++ +GCGR QTG     AA +G++GLG+ D S
Sbjct: 197 TFTTGTLFEDVLHLVT--EDVDLKPVKANITLGCGRNQTGFLQSSAAINGLLGLGMKDYS 254

Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           VPS+LAKA +  NSFS+CF    +  G + FGD+G   Q  T  LP  E    Y V V  
Sbjct: 255 VPSILAKAKITANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPT-EPSPTYAVNVTE 313

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
             +G   +      AL D+G SFT L    Y  +   FD  V+ KR  +     +++CY+
Sbjct: 314 VSVGGDVVGVQ-LLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPEIPFEFCYD 372

Query: 380 AS-SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNF 437
            S +   +  P + + F       +RN +F     +   ++CL ++ S D    IIGQNF
Sbjct: 373 LSPNSTTILFPRVAMTFEGGSLMFLRNPLFIVWNEDNTAMYCLGILKSVDFKINIIGQNF 432

Query: 438 MMGHRIVFDRENLKLAWSHSKCEE 461
           M G+R+VFDRE + L W  S C E
Sbjct: 433 MSGYRVVFDRERMILGWKRSDCFE 456


>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 528

 Score =  224 bits (571), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 148/444 (33%), Positives = 229/444 (51%), Gaps = 24/444 (5%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           F  ++ H FSD  K+       ++ + D  P++ S+EY ++L   D  R      L SNN
Sbjct: 29  FGFEVHHIFSDSVKQ-------SLGLGDLVPEQGSLEYFKVLAHRD--RLIRGRGLASNN 79

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
           + +   + F  +G            L+Y  + +GTP  SFLVALD GS+L W+PC C   
Sbjct: 80  DET--PITF--DGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT 135

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
           C              L+ Y P++S++S ++ CS   C     C S    CPY   YS   
Sbjct: 136 CIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYS-NS 194

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           T + G L+ D+LHLA+  ++   + V+++V +GCG+KQTG +    + +GV+GLG+   S
Sbjct: 195 TGTKGTLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252

Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           VPSLLAKA +  NSFS+CF     + G + FGD+G   Q+ T F+ +     AY V +  
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPS-TAYGVNISG 311

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
             +    +    F A  D+G+SFT L    Y  +   FD+LV  +R  +     +++CY+
Sbjct: 312 VSVAGDPVDIRLF-AKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYD 370

Query: 380 AS-SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNF 437
            S +   ++ P + + F      ++ N  F+    EG  ++CL V+ + G    +IGQNF
Sbjct: 371 LSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNF 430

Query: 438 MMGHRIVFDRENLKLAWSHSKCEE 461
           + G+RIVFDRE + L W  S C E
Sbjct: 431 VAGYRIVFDRERMILGWKQSLCFE 454


>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
          Length = 519

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 155/444 (34%), Positives = 227/444 (51%), Gaps = 33/444 (7%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           FS ++ H FSD  K+       ++ + D  P+K S+EY ++L   D  R      L SNN
Sbjct: 29  FSFEVHHMFSDRVKQ-------SLGLDDLVPEKGSLEYFKVLAQRD--RLIRGRGLASNN 79

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
             +    +  +        G    +LHY  + +GTP   FLVALD GS+L W+PC C   
Sbjct: 80  EETPITFMRGNRTISIDLLG----FLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGST 135

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
           C         S  R L+ Y P++SS+S ++ CS   C   S C S    CPY   Y ++D
Sbjct: 136 CIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKD 195

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           T ++G L +D+LHL +  +      V++++ +GCG+ QTG     AA +G++GLGL D S
Sbjct: 196 TFTTGTLFEDVLHLVT--EDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYS 253

Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           VPS+LAKA +  NSFS+CF    +  G + FGD+G   Q  T  LP         VG ++
Sbjct: 254 VPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSVTEVSVGGDA 313

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
             +G   L      AL D+G SFT L    Y  +   FD  V+ KR  +     +++CY+
Sbjct: 314 --VGVQLL------ALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYD 365

Query: 380 AS-SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNF 437
            S ++  +  P + + F       +RN +F     +   ++CL ++ S D    IIGQNF
Sbjct: 366 LSPNKTTILFPRVAMTFEGGSQMFLRNPLFI----DNSAMYCLGILKSVDFKINIIGQNF 421

Query: 438 MMGHRIVFDRENLKLAWSHSKCEE 461
           M G+RIVFDRE + L W  S C E
Sbjct: 422 MSGYRIVFDRERMILGWKRSDCFE 445


>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 531

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 158/495 (31%), Positives = 241/495 (48%), Gaps = 44/495 (8%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           F  ++ H FSD  K+       ++ + D  P++ S+EY ++L   D  R      L SNN
Sbjct: 29  FGFEVHHIFSDAVKQ-------SLGLDDLVPEQGSLEYFKVLAHRD--RLIRGRGLASNN 79

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
             +        +G            L+Y  + +GTP  SFLVALD GS+L W+PC C   
Sbjct: 80  EDTPVTF----DGGNLTVSIKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT 135

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
           C              L+ Y P++S++S ++ CS   C     C S K  CPY   YS   
Sbjct: 136 CIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISYSNS- 194

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           T ++G L+ D+LHLA+  ++   + V+++V +GCG+KQTG +    + +GV+GLG+   S
Sbjct: 195 TGTTGTLLQDVLHLATEDENL--TPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252

Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           VPSLLAKA +  +SFS+CF     + G + FGD+G   Q+ T F+ +     AY + V  
Sbjct: 253 VPSLLAKANITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAPS-TAYGLNVTG 311

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
             +G   +    F A  D+G+SFT L    Y  +   FD LV  KR  +     +++CY+
Sbjct: 312 VSVGGDPVGTRLF-AKFDTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYD 370

Query: 380 ASSEEM-LKVPDMRLIFSKNQSFVVRNHIFS----FPENEGFTVFCLTVMSTDG-DYGII 433
            S     ++ P + + F      ++ N  F+        EG  ++CL V+ + G    +I
Sbjct: 371 LSPNATSIEFPFVEMTFVGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKSVGLKINVI 430

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKC--EEVIDKSHVH----------LVPPPAGQSP-- 479
           GQNF+ G+RIVFDRE + L W  S C  +E ++ +               PPP    P  
Sbjct: 431 GQNFVAGYRIVFDRERMILGWKPSLCFEDESLESTTPPPEIEAPAPSVTAPPPRSLPPAV 490

Query: 480 --NPLPTTEQQSTSN 492
              P P   + ST N
Sbjct: 491 SSTPPPIDPRNSTGN 505


>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 488

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 153/484 (31%), Positives = 241/484 (49%), Gaps = 34/484 (7%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           S S ++ HRFS++ K         V      P+  S++Y + L+  D  RQ T      +
Sbjct: 21  SLSFEIHHRFSEQVK--------TVLGGHGLPEMGSLDYYKALVHRDRGRQLT------S 66

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
           NN+++  + F ++G+ T     +  +LHY  + IGTP   FLVALD GS+L W+PC C  
Sbjct: 67  NNNNQTTISF-AQGNSTE----EISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNS 121

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
               S          L+ Y+PS S SS  V+C+  LC  R+ C S    CPY   Y +  
Sbjct: 122 TCVRSMETDQGERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPG 181

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           + S+G LV+D++H+++    A  + +      GC   Q G + +  A +G+MGL + D++
Sbjct: 182 SKSTGVLVEDVIHMSTEEGEARDARIT----FGCSESQLGLFKE-VAVNGIMGLAIADIA 236

Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--VGVES 320
           VP++L KAG+  +SFS+CF  N  G++ FGD+G + Q  T   P+       F  V +  
Sbjct: 237 VPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQLET---PLSGTISPMFYDVSITK 293

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY- 378
           + +G   +  + F A  DSG + T+L    Y  +   F   V  +R+S   +S +++CY 
Sbjct: 294 FKVGKVTV-DTEFTATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYI 352

Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMS-TDGDYGIIGQN 436
             S+ +  K+P +        ++ V + I  F  ++G F V+CL V+   + D+ IIGQN
Sbjct: 353 ITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQN 412

Query: 437 FMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAA 496
           FM  +RIV DRE   L W  S C +    +    +  P   +P   P T   S+     A
Sbjct: 413 FMTNYRIVHDRERRILGWKKSNCNDTNGFTGPTALAKPPSMAPTSSPRTINLSSRLNPLA 472

Query: 497 APPS 500
           A  S
Sbjct: 473 AASS 476


>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 530

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 158/454 (34%), Positives = 227/454 (50%), Gaps = 38/454 (8%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           FS ++ H FSD  K+        +   D  P+  S+EY ++L   D  R      L SNN
Sbjct: 30  FSFEVHHMFSDVVKQ-------TLGFDDLVPENGSLEYFKVLAHRD--RFIRGRGLASNN 80

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
             +       S GS      N   +LHY  + +GTP   FLVALD GS+L W+PC C   
Sbjct: 81  EETP----LTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTT 136

Query: 143 CAP--LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
           C      A +  S+  NL  Y P++S++S ++ CS   C     C S +  CPY    S+
Sbjct: 137 CIHDLKDARFSESVPLNL--YTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSS 194

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
            +T ++G L+ D+LHL +  +      V ++V +GCG+ QTG++    A +GV+GL + +
Sbjct: 195 -NTVTTGTLLQDVLHLVT--EDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKE 251

Query: 261 VSVPSLLAKAGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV 318
            SVPSLLAKA +  NSFS+CF    S  G + FGD+G   Q+ T  + + E   AY V V
Sbjct: 252 YSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSL-ETSTAYGVNV 310

Query: 319 ESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYC 377
               +G   +    F AL D+G+SFT L    Y      FD L+  KR  +  +  +++C
Sbjct: 311 TGVSVGGVPVDVPLF-ALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFC 369

Query: 378 YNASSEEMLKVPDMRLIFSK-----NQSFVVR-----NHIFSFPENEGFTVFCLTVMSTD 427
           Y+   E +      R + SK        F  R         S+  NEG  ++CL ++ + 
Sbjct: 370 YDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSY-SNEGTKMYCLGILKSI 428

Query: 428 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
            +  IIGQN M GHRIVFDRE + L W  S C E
Sbjct: 429 -NLNIIGQNLMSGHRIVFDRERMILGWKQSNCFE 461


>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
          Length = 518

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 158/454 (34%), Positives = 227/454 (50%), Gaps = 38/454 (8%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           FS ++ H FSD  K+        +   D  P+  S+EY ++L   D  R      L SNN
Sbjct: 18  FSFEVHHMFSDVVKQ-------TLGFDDLVPENGSLEYFKVLAHRD--RFIRGRGLASNN 68

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
             +       S GS      N   +LHY  + +GTP   FLVALD GS+L W+PC C   
Sbjct: 69  EETP----LTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTT 124

Query: 143 CAP--LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
           C      A +  S+  NL  Y P++S++S ++ CS   C     C S +  CPY    S+
Sbjct: 125 CIHDLKDARFSESVPLNL--YTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSS 182

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
            +T ++G L+ D+LHL +  +      V ++V +GCG+ QTG++    A +GV+GL + +
Sbjct: 183 -NTVTTGTLLQDVLHLVT--EDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKE 239

Query: 261 VSVPSLLAKAGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV 318
            SVPSLLAKA +  NSFS+CF    S  G + FGD+G   Q+ T  + + E   AY V V
Sbjct: 240 YSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSL-ETSTAYGVNV 298

Query: 319 ESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYC 377
               +G   +    F AL D+G+SFT L    Y      FD L+  KR  +  +  +++C
Sbjct: 299 TGVSVGGVPVDVPLF-ALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFC 357

Query: 378 YNASSEEMLKVPDMRLIFSK-----NQSFVVR-----NHIFSFPENEGFTVFCLTVMSTD 427
           Y+   E +      R + SK        F  R         S+  NEG  ++CL ++ + 
Sbjct: 358 YDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSY-SNEGTKMYCLGILKSI 416

Query: 428 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
            +  IIGQN M GHRIVFDRE + L W  S C E
Sbjct: 417 -NLNIIGQNLMSGHRIVFDRERMILGWKQSNCFE 449


>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
          Length = 335

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 124/327 (37%), Positives = 185/327 (56%), Gaps = 26/327 (7%)

Query: 30  HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
           HR+S   +E W             P   + EY   L  +D +R+           +   +
Sbjct: 28  HRYSATVRE-WAGHRA--------PPAGTAEYYAALAGHDLRRRSL---------AGGGE 69

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
           + F ++G+ T+   N+  +LHY  + +GTPNV+FLVALD GS+L WVPC CI CAPL + 
Sbjct: 70  VAF-ADGNDTYRL-NELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLVSP 127

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
            Y   D     Y P  SS+S+ V CS  LC  +S+C+S    CPY   Y +++TSS+G L
Sbjct: 128 NYR--DLKFDTYSPQKSSTSRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVL 185

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           V+D+L+L +     P+  V + +  GCGR QTGS+L  AAP+G++GLG+  +SVPSLLA 
Sbjct: 186 VEDVLYLVTEYGRQPK-IVTAPITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLAS 244

Query: 270 AGL-IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
            G+   NSFS+CF ++  G + FGD G + QQ T  L + ++   Y + +    +G+  +
Sbjct: 245 QGVAAANSFSMCFAQDGHGRINFGDTGSSDQQETP-LNMYKQNPYYNISITGATVGSKSI 303

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVV 355
             + F A+VDSG SFT L   +Y ++ 
Sbjct: 304 -HTKFNAIVDSGTSFTALSDPMYTQIT 329


>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
          Length = 335

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 102/264 (38%), Positives = 158/264 (59%), Gaps = 10/264 (3%)

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
           +VALD GS+L WVPC C +CAP   + Y S +  LS Y+P  S+++K V+C++ LC  R+
Sbjct: 1   MVALDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRN 59

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
            C      CPY+  Y +  TS+SG L++D++HL +  K+  +  V++ V  GCG+ Q+GS
Sbjct: 60  QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGS 117

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS 303
           +LD AAP+G+ GLG+  +SVPS+LA+ GL+ +SFS+CF  +  G + FGD+G + Q+ T 
Sbjct: 118 FLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETP 177

Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
           F  +   +  Y + V    +G + L    F AL D+G SFT+L   +Y  V     +   
Sbjct: 178 F-NLNPSHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTV----SESAQ 231

Query: 364 SKRISLQGN-SWKYCYNASSEEML 386
            KR S      ++YCY+   + +L
Sbjct: 232 DKRHSPDSRIPFEYCYDMREKLVL 255


>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
          Length = 217

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 98/211 (46%), Positives = 130/211 (61%), Gaps = 21/211 (9%)

Query: 25  SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
           SS++VHR SDEA+     + G       WP++ S EY   L+ +D +RQK R+ + S   
Sbjct: 28  SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79

Query: 85  SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
                    S+G  T   GN   WL+Y W+D+GTP  SFLVALD GS+L WVPC CIQCA
Sbjct: 80  ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130

Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
           PLS  Y  +LDR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+
Sbjct: 131 PLSG-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTT 189

Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
           SSG L++D LHL     H P   V +SVIIG
Sbjct: 190 SSGLLIEDTLHLNYREDHVP---VNASVIIG 217


>gi|388505672|gb|AFK40902.1| unknown [Lotus japonicus]
          Length = 207

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 90/183 (49%), Positives = 119/183 (65%), Gaps = 2/183 (1%)

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
           + F+A VDSG SFTFLP   Y  +  +FDK V++ R S +G+ W+YCY +SSE++ KVP 
Sbjct: 2   TSFKAQVDSGTSFTFLPGHAYGAITEEFDKQVNASRSSFEGSPWEYCYPSSSEQLPKVPS 61

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           + L+F +N SFVV N +F+F +N+G   FCL +  T+GD G IGQNFM G+R+VFDREN 
Sbjct: 62  LTLMFQQNNSFVVYNPVFTFYDNQGVVGFCLAIQPTEGDMGTIGQNFMTGYRLVFDRENK 121

Query: 451 KLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSI 510
            LAWS S C+++     + L PP    S  PLPT EQQ T NG A AP    + +P  S 
Sbjct: 122 NLAWSPSNCQDLSLGKRMPLSPPNKTSS-APLPTDEQQRT-NGHAVAPAIAGRASPKPSA 179

Query: 511 AAS 513
           A S
Sbjct: 180 APS 182


>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Cucumis sativus]
          Length = 430

 Score =  179 bits (455), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 173/335 (51%), Gaps = 29/335 (8%)

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
           L+ Y P+ S++S  V C+  LC   + C S ++ CPY   Y + +TSS GYLV+D+LHLA
Sbjct: 3   LNHYSPNDSTTSSTVPCTSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA 59

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
           +    +    V++ +  GCG  QTG +   AAP+G++GLG+  +SVPS LA  GL  NSF
Sbjct: 60  T--DDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSF 117

Query: 278 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 337
           S+CF  +  G + FGD GPA Q+ T F  + E Y +Y V      +G        F A+ 
Sbjct: 118 SMCFGADGYGRIDFGDTGPADQKQTPFNTMLE-YQSYNVTFNVINVGGEP-NDVPFTAIF 175

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYN-ASSEEMLKVPDMRLI 394
           DSG SFT+L    Y+ +  + D  +  KR SL G +  ++YCY      +  +   +   
Sbjct: 176 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTLNFT 235

Query: 395 FSKNQSFVVRNHIFSFPEN---------EGFTVFCLTVM-STDGDYGIIGQNFMMGHRIV 444
                 F   +     P +         E   V CL +  STD D  +IGQNFM G+RI 
Sbjct: 236 MKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDID--LIGQNFMTGYRIT 293

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSP 479
           F+R+ + L WS S C +       + V  P+G +P
Sbjct: 294 FNRDQMVLGWSSSDCYD-------NGVGTPSGDTP 321


>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
          Length = 475

 Score =  175 bits (444), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 132/446 (29%), Positives = 203/446 (45%), Gaps = 81/446 (18%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           F  ++ H FSD  K+       ++ + D  P++ S+EY ++L   D  R      L SNN
Sbjct: 29  FGFEVHHIFSDSVKQ-------SLGLGDLVPEQGSLEYFKVLAHRD--RLIRGRGLASNN 79

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
           + +   + F  +G            L+Y  + +GTP  SFLVALD GS+L W+PC C   
Sbjct: 80  DET--PITF--DGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT 135

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
           C              L+ Y P++S++S ++ CS   C     C S    CPY   YS   
Sbjct: 136 CIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYS-NS 194

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           T + G L+ D+LHLA+  ++   + V+++V +GCG+KQTG +    + +GV+GLG+   S
Sbjct: 195 TGTKGTLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252

Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
           VPSLLAKA +  NSFS+CF     + G + FGD+G   Q+ T F+ +  +          
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPR---------- 302

Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
                        +  VD    F F                               CY+ 
Sbjct: 303 -------------RRPVDPELPFEF-------------------------------CYDL 318

Query: 381 S-SEEMLKVPDMRLIFSKNQSFVVRNHIFS----FPENEGFTVFCLTVMSTDGDYGIIGQ 435
           S +   ++ P + + F      ++ N  F+        EG  ++CL V+ +    G+   
Sbjct: 319 SPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKS---VGLKIN 375

Query: 436 NFMMGHRIVFDRENLKLAWSHSKCEE 461
           NF+ G+RIVFDRE + L W  S C E
Sbjct: 376 NFVAGYRIVFDRERMILGWKQSLCFE 401


>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
           vinifera]
          Length = 294

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 97/278 (34%), Positives = 155/278 (55%), Gaps = 10/278 (3%)

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
           CG+ QTGS+L+GAAP+G+ GLG+G +SVPS+LAK GL+ +SFS+CF  + +G + FGD+G
Sbjct: 1   CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 60

Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 355
            + Q+ T F P   +   Y + +    +G +    + F A+ DSG SFT+L    Y  + 
Sbjct: 61  SSGQEETPFNPSKSQL-LYNISITQISVGGTSADLN-FDAIFDSGTSFTYLNDPAYTSIS 118

Query: 356 VKFDKLVSSKRISLQGN-SWKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIFSFPEN 413
             F+     KR S   +  ++YCY+ S ++  ++ P + L      +F V + I      
Sbjct: 119 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ 178

Query: 414 EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPP 473
            G+ V+CL V+ + GD  IIGQNFM G+RI+FDRE + L W+ S C +  + + + + P 
Sbjct: 179 GGY-VYCLGVVKS-GDINIIGQNFMTGYRIIFDREKMVLGWTKSNCYDTEESNTLPINPA 236

Query: 474 PAGQSPNPLPTTEQQSTSNGQAA----APPSTAKTAPS 507
            +   P  +    + +  NG  +    AP   A  +P+
Sbjct: 237 NSPVVPPTVSVEPEATAGNGNGSHISEAPSPLANGSPT 274


>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
          Length = 306

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 97/278 (34%), Positives = 155/278 (55%), Gaps = 10/278 (3%)

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
           CG+ QTGS+L+GAAP+G+ GLG+G +SVPS+LAK GL+ +SFS+CF  + +G + FGD+G
Sbjct: 13  CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 72

Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 355
            + Q+ T F P   +   Y + +    +G +    + F A+ DSG SFT+L    Y  + 
Sbjct: 73  SSGQEETPFNPSKSQL-LYNISITQISVGGTSADLN-FDAIFDSGTSFTYLNDPAYTSIS 130

Query: 356 VKFDKLVSSKRISLQGN-SWKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIFSFPEN 413
             F+     KR S   +  ++YCY+ S ++  ++ P + L      +F V + I      
Sbjct: 131 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ 190

Query: 414 EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPP 473
            G+ V+CL V+ + GD  IIGQNFM G+RI+FDRE + L W+ S C +  + + + + P 
Sbjct: 191 GGY-VYCLGVVKS-GDINIIGQNFMTGYRIIFDREKMVLGWTKSNCYDTEESNTLPINPA 248

Query: 474 PAGQSPNPLPTTEQQSTSNGQAA----APPSTAKTAPS 507
            +   P  +    + +  NG  +    AP   A  +P+
Sbjct: 249 NSPVVPPTVSVEPEATAGNGNGSHISEAPSPLANGSPT 286


>gi|297819832|ref|XP_002877799.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323637|gb|EFH54058.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 414

 Score =  168 bits (426), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 137/474 (28%), Positives = 210/474 (44%), Gaps = 84/474 (17%)

Query: 4   LVAICMLFGCILLDGSD-AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
            V + +L  C  L   + A  FS ++ H FSD  K+       N+   D  P+K S+EY 
Sbjct: 8   FVLLSVLVACWGLQRCESAGKFSFEVHHMFSDTVKQ-------NLGFGDLVPEKGSLEYF 60

Query: 63  ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
           +LL   D   +  R +  S+NN          E   T   GN+            T ++ 
Sbjct: 61  KLLAQRD---RLIRGRGLSSNNE---------EAPVTFILGNR------------TVSID 96

Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
           FL     GS+L W+PC C           T+  R+L +                 +  S+
Sbjct: 97  FL-----GSDLFWLPCNC----------GTTCIRDLED-----------------IGLSQ 124

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
             C S    CPY   Y    TS+ G L +D+LHL   ++      V++++ +GCG+ QTG
Sbjct: 125 GGCSSPASVCPYQIPYLFNTTSTRGTLFEDVLHLV--TEDEGLEPVKANITLGCGQNQTG 182

Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQ 300
            Y    A +G++GLG+ D SVPS+LAK  +  NSFS+CF    +  G + FGD+G   Q 
Sbjct: 183 LYRKSLAVNGLLGLGMKDYSVPSVLAKENITANSFSMCFGNIIDFIGRISFGDRGHTDQL 242

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
            T  +PI E    Y V V    +G   L +    AL D+G SFT L    Y  +   FD 
Sbjct: 243 QTPLVPI-EPNPTYAVNVTEVTVGGDIL-EIQMLALFDTGTSFTHLLEPAYGLLTKAFDD 300

Query: 361 LVSSKRISLQGN-SWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV 418
            V+ KR  +     +++CY+ S   +  K P + + F       +R+ +F+         
Sbjct: 301 HVTDKRRPIDPEIPFEFCYDTSPNIKSFKFPRVNMTFVGGSKLTLRDPLFTVWNEARHGA 360

Query: 419 FCLTVMSTDGDYG------------IIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           +  ++  +D +              ++ +N M G+RIVFDRE + L W  S C+
Sbjct: 361 WMSSLTFSDREKKKKEYVLNAFHIWVVSENLMSGYRIVFDRERMILGWKRSDCK 414


>gi|374255989|gb|AEZ00856.1| putative peptidase A1 protein, partial [Elaeis guineensis]
          Length = 263

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 91/241 (37%), Positives = 133/241 (55%), Gaps = 6/241 (2%)

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
           V++ ++ GCG+ QTG++LD AAP+G+ GLG+  VSVPS+LA  G   NSFS+CF  +  G
Sbjct: 11  VKAPIVFGCGQVQTGAFLDSAAPNGLFGLGMDKVSVPSVLASKGYASNSFSMCFGSDGMG 70

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
            ++FGD G + Q  T F  +   +  Y + +    +GNS +  +   A+VDSG SFT L 
Sbjct: 71  RIYFGDTGSSDQGETPF-DVNHSHPTYNISLIGMEVGNSSIDVNS-SAIVDSGTSFTCLA 128

Query: 348 TEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRN 405
             +Y ++   F   V   R  S  G  ++YCY  S ++  + +P + L       F + +
Sbjct: 129 DPMYTKLSESFHAQVRENRHESDPGIPFEYCYGLSRNQNSILLPKINLTTKGGSQFPIND 188

Query: 406 HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 465
            I     +E  + +CL ++ +     IIGQNFM G RIVFDRE L L W  S C E  D 
Sbjct: 189 PIIVI-SSEQSSFYCLGIVKSS-QLNIIGQNFMTGLRIVFDRERLVLGWKESDCYEAEDS 246

Query: 466 S 466
           S
Sbjct: 247 S 247


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 108/368 (29%), Positives = 179/368 (48%), Gaps = 29/368 (7%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++  I +GTP+  F V +D GS++LWV C  CI+C P  +         L+ YD  +SS
Sbjct: 84  LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRC-PRKSDLV-----ELTPYDVDASS 137

Query: 168 SSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           ++K+VSCS   C     RS C S    C Y+  Y  + +S++GYLV D++HL   + +  
Sbjct: 138 TAKSVSCSDNFCSYVNQRSECHS-GSTCQYVIMYG-DGSSTNGYLVKDVVHLDLVTGNRQ 195

Query: 225 QSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
             S   ++I GCG KQ+G   +  AA DG+MG G  + S  S LA  G ++ SF+ C D 
Sbjct: 196 TGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDN 255

Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--LTQSGFQA------ 335
           N+ G +F    G          P+  K   Y V + +  +GNS   L+ + F +      
Sbjct: 256 NNGGGIF--AIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGV 313

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           ++DSG +  +LP  +Y  ++ +   L S   ++L      +     ++++ + P +   F
Sbjct: 314 IIDSGTTLVYLPDAVYNPLLNEI--LASHPELTLHTVQESFTCFHYTDKLDRFPTVTFQF 371

Query: 396 SKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENLK 451
            K+ S  V  R ++F   E+     +    + T G     I+G   +    +V+D EN  
Sbjct: 372 DKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQV 431

Query: 452 LAWSHSKC 459
           + W++  C
Sbjct: 432 IGWTNHNC 439


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  142 bits (358), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 108/419 (25%), Positives = 198/419 (47%), Gaps = 41/419 (9%)

Query: 71  KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF---YWLHYTWIDIGTPNVSFLVAL 127
           +RQ +   ++++++S R ++L        +  GN       L++T I +G+P+  + V +
Sbjct: 30  RRQASLTGIKAHDSSRRGRIL---SAVDFNLGGNGLPTVTGLYFTKIGLGSPSKDYYVQV 86

Query: 128 DAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
           D GS++LWV C +C +C   S      +   L+ YDP  S +S+ VSC H  C S    +
Sbjct: 87  DTGSDILWVNCVECTRCPRKS-----DIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGR 141

Query: 187 SL----KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
            L    ++PCPY   Y  + ++++GY V D L     + +   ++  SS+I GCG  Q+G
Sbjct: 142 ILGCKAENPCPYSISYG-DGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCGAAQSG 200

Query: 243 SYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF-FGDQGPATQ 299
           ++   +  A DG++G G  + SV S LA +G ++  FS C D N  G +F  G+      
Sbjct: 201 TFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPKV 260

Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIY 351
           ++T  +P    Y+     +E   +    L        +++G   ++DSG +  +LP  +Y
Sbjct: 261 KTTPLVPNMAHYNVILKNIE---VDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVY 317

Query: 352 AEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
            +++ K   L    R+ +     +Y C+  +       P ++L F  + S  V  H + F
Sbjct: 318 DQLMSKV--LAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLF 375

Query: 411 PENEGFTVFCL------TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
              +G + +C+      +      D  ++G   +    +V+D EN+ + W+   C   I
Sbjct: 376 -NYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSI 433


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 119/411 (28%), Positives = 195/411 (47%), Gaps = 39/411 (9%)

Query: 71  KRQKTRVKLQSNNNSSRNQLL----FPSEG-SQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
           KR+K    L++++    ++LL     P  G SQ    G     L++  I +GTP+  F V
Sbjct: 46  KREKDLGALRAHDVHRHSRLLSAIDLPLGGDSQPESIG-----LYFAKIGLGTPSRDFHV 100

Query: 126 ALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KS 181
            +D GS++LWV C  CI+C P  +         L+ YD  +SS++K+VSCS   C     
Sbjct: 101 QVDTGSDILWVNCAGCIRC-PRKSDLV-----ELTPYDADASSTAKSVSCSDNFCSYVNQ 154

Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
           RS C S    C Y+  Y  + +S++GYLV D++HL   + +    S   ++I GCG KQ+
Sbjct: 155 RSECHS-GSTCQYVILYG-DGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTIIFGCGSKQS 212

Query: 242 GSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
           G   +  AA DG+MG G  + S  S LA  G ++ SF+ C D N+ G +F    G     
Sbjct: 213 GQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIF--AIGEVVSP 270

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQS--GFQA------LVDSGASFTFLPTEIYA 352
                P+  K   Y V + +  +GNS L  S   F +      ++DSG +  +LP  +Y 
Sbjct: 271 KVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTLVYLPDAVYN 330

Query: 353 EVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF 410
            ++ +   L S + ++L      +      + + + P +   F K+ S  V  + ++F  
Sbjct: 331 PLMNQI--LASHQELNLHTVQDSFTCFHYIDRLDRFPTVTFQFDKSVSLAVYPQEYLFQV 388

Query: 411 PENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            E+     +    + T G     I+G   +    +V+D EN  + W++  C
Sbjct: 389 REDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNC 439


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 117/404 (28%), Positives = 184/404 (45%), Gaps = 42/404 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP  S+ V +D GS+++WV C QC QC   S     +L   L+ Y+   S 
Sbjct: 79  LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNIDESD 133

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           S K VSC    C        S CK+    CPY+  Y  + +S++GY V D++   S +  
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKDVVQYDSVAGD 191

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               +   SVI GCG +Q+G  LD +   A DG++G G  + S+ S LA +G ++  F+ 
Sbjct: 192 LKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAH 250

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGF 333
           C D  + G +F    G   Q   +  P+      Y V + +  +G   LT      Q G 
Sbjct: 251 CLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGD 308

Query: 334 Q--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
           +  A++DSG +  +LP  IY  +V K      + ++ +    +K C+  S       P++
Sbjct: 309 RKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNV 367

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQNFMMGHRIVF 445
              F  +    V  H + FP +EG  ++C+      + S D  +  ++G   +    +++
Sbjct: 368 TFHFENSVFLRVYPHDYLFP-HEG--MWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLY 424

Query: 446 DRENLKLAWSHSKCEEVID-----KSHVHLVPPPAGQSPNPLPT 484
           D EN  + W+   C   I         VHLV      S  PL T
Sbjct: 425 DLENQLIGWTEYNCSSSIKVKDEGTGTVHLVGSHFISSALPLDT 468


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 118/439 (26%), Positives = 195/439 (44%), Gaps = 39/439 (8%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           +++  ++P  + VE   L       R + RV+      SS   + F   G+   F     
Sbjct: 31  LTLERAFPTNHGVEIAHL-------RSRDRVRHGRMLQSSGGVIDFSVSGTYDPFL---- 79

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
             L+YT + +G P   F V +D GS++LWV C      P +    + L   L+ +DP SS
Sbjct: 80  VGLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPAT----SGLQIPLNFFDPGSS 135

Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           +++  VSCS  +C      S S+C    + C Y+  Y  + + +SGY V D++HL     
Sbjct: 136 TTASLVSCSDQICALGVQSSDSACFGQSNQCAYVFQYG-DGSGTSGYYVMDMIHLDVVID 194

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
            +  S+  +SV+ GC   QTG       A DG+ G G  D+SV S L+  G+    FS C
Sbjct: 195 SSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHC 254

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
              +DSG       G   + +  + P+      Y + ++S  +    L        T S 
Sbjct: 255 LKGDDSGGGIL-VLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFATSSS 313

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPD 390
              ++DSG +  +L  E Y   VV    +V  S++ + L+GN    CY  SS      P 
Sbjct: 314 QGTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKGNR---CYVTSSSVSDIFPQ 370

Query: 391 MRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDR 447
           + L F+   S V+    +   +N   G TV+C+      G    I  + ++  +I ++D 
Sbjct: 371 VSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYDL 430

Query: 448 ENLKLAWSHSKCEEVIDKS 466
            N ++ W++  C   ++ S
Sbjct: 431 ANQRIGWTNYDCSMSVNVS 449


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 116/404 (28%), Positives = 181/404 (44%), Gaps = 42/404 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP  S+ V +D GS+++WV C QC QC   S     +L   L+ Y+   S 
Sbjct: 79  LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNIDESD 133

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           S K VSC    C        S CK+    CPY+  Y  + +S++GY V D++   S +  
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKDVVQYDSVAGD 191

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               +   SVI GCG +Q+G  LD +   A DG++G G  + S+ S LA +G ++  F+ 
Sbjct: 192 LKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAH 250

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ--- 334
           C D  + G +F    G   Q   +  P+      Y V + +  +G   L      FQ   
Sbjct: 251 CLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGD 308

Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
              A++DSG +  +LP  IY  +V K      + ++ +    +K C+  S       P++
Sbjct: 309 RKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNV 367

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQNFMMGHRIVF 445
              F  +    V  H + FP  EG  ++C+      + S D  +  ++G   +    +++
Sbjct: 368 TFHFENSVFLRVYPHDYLFPY-EG--MWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLY 424

Query: 446 DRENLKLAWSHSKCEEVID-----KSHVHLVPPPAGQSPNPLPT 484
           D EN  + W+   C   I         VHLV      S  PL T
Sbjct: 425 DLENQLIGWTEYNCSSSIKVKDEGTGTVHLVGSHFISSALPLDT 468


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 109/422 (25%), Positives = 198/422 (46%), Gaps = 48/422 (11%)

Query: 71  KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF---YWLHYTWIDIGTPNVSFLVAL 127
           +R+++   +++++   R ++L        +  GN       L++T + +G+P   + V +
Sbjct: 31  RRKRSLNAVKAHDARRRGRIL---SAVDLNLGGNGLPTETGLYFTKLGLGSPPKDYYVQV 87

Query: 128 DAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR---- 182
           D GS++LWV C +C +C   S      L  +L+ YDP  S +S+ +SC    C +     
Sbjct: 88  DTGSDILWVNCVKCSRCPRKS-----DLGIDLTLYDPKGSETSELISCDQEFCSATYDGP 142

Query: 183 -SSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL---HLASFSKHAPQSSVQSSVIIGCGR 238
              CKS + PCPY   Y  + ++++GY V D L   H+    + APQ+S   S+I GCG 
Sbjct: 143 IPGCKS-EIPCPYSITYG-DGSATTGYYVQDYLTYNHVNDNLRTAPQNS---SIIFGCGA 197

Query: 239 KQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGP 296
            Q+G+    +  A DG++G G  + SV S LA +G ++  FS C D    G +F    G 
Sbjct: 198 VQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGGGIF--AIGE 255

Query: 297 ATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPT 348
             +   S  P+  +   Y V ++S  +    L        + +G   ++DSG +  +LP 
Sbjct: 256 VVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGNGKGTIIDSGTTLAYLPA 315

Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 407
            +Y E++ K   +    R+ L     ++ C+  +       P ++L F  + S  V  H 
Sbjct: 316 IVYDELIPKV--MARQPRLKLYLVEQQFSCFQYTGNVDRGFPVVKLHFEDSLSLTVYPHD 373

Query: 408 FSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           + F   +G  ++C+        + +G D  ++G   +    +++D EN+ + W+   C  
Sbjct: 374 YLFQFKDG--IWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMAIGWTDYNCSS 431

Query: 462 VI 463
            I
Sbjct: 432 SI 433


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 161/364 (44%), Gaps = 24/364 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I IGTP V + V LD GS   WV    C QC      + + + R L+ YDP SS 
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 136

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH      +     
Sbjct: 137 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 194

Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
             +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +  FS C D  + 
Sbjct: 195 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 254

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
           G +F    G   +      PI +  + Y  V ++S  +  + L        T       +
Sbjct: 255 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 312

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG++  +LP  IY+E+++          I++       C++       K P +   F  
Sbjct: 313 DSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFEN 370

Query: 398 NQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
           + +  V   +++  +  N+    F    +    D  I+G   +    +V+D E   + W+
Sbjct: 371 DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWT 430

Query: 456 HSKC 459
              C
Sbjct: 431 EHNC 434


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 115/426 (26%), Positives = 182/426 (42%), Gaps = 51/426 (11%)

Query: 60  EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
           E+L  L  +D +R  T V L    N        P++             L++T I IGTP
Sbjct: 56  EHLAALRKHDGRRLLTAVDLPLGGNG------IPTD-----------TGLYFTQIGIGTP 98

Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
           +  + V +D GS++LWV   CI C   S    + L  +L+ YDP++S+SSK V+C    C
Sbjct: 99  SKGYYVQVDTGSDILWV--NCISCD--SCPRKSGLGIDLTLYDPTASASSKTVTCGQEFC 154

Query: 180 KSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
            + +      SC +   PC Y   Y  + +S++G+ V D L     S     +   +SV 
Sbjct: 155 ATATNGGVPPSCAA-NSPCQYSITYG-DGSSTTGFFVADFLQYDQVSGDGQTNLANASVT 212

Query: 234 IGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
            GCG K  G+      A DG++G G  + S+ S L  AG +   FS C D  + G +F  
Sbjct: 213 FGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNGGGIF-- 270

Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT---------QSGFQALVDSGASF 343
             G   Q      P+      Y V +++  +G S L                ++DSG + 
Sbjct: 271 AIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRGTIIDSGTTL 330

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
            +LP  +Y  V+       +   ++L+      C+  S       P++   F  +   VV
Sbjct: 331 AYLPEVVYKAVLSAV--FSNHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFDGDLPLVV 388

Query: 404 RNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
             H + F   E   V+C+      V S DG D  ++G   +    +V+D EN  + W++ 
Sbjct: 389 YPHDYLFQNTE--DVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQVIGWTNY 446

Query: 458 KCEEVI 463
            C   I
Sbjct: 447 NCSSSI 452


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 106/418 (25%), Positives = 192/418 (45%), Gaps = 40/418 (9%)

Query: 71  KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF---YWLHYTWIDIGTPNVSFLVAL 127
           +R+++   +++++   R ++L        +  GN       L++T + +G+P   + V +
Sbjct: 31  RRKRSLSAVRAHDVRRRGRIL---SAVDLNLGGNGLPTETGLYFTKLGLGSPPRDYYVQV 87

Query: 128 DAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR---- 182
           D GS++LWV C +C +C   S      L  +L+ YDP  S +S  VSC    C +     
Sbjct: 88  DTGSDILWVNCVECSRCPRKS-----DLGIDLTLYDPKGSETSDVVSCDQDFCSATFDGP 142

Query: 183 -SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
              CKS + PCPY   Y  + ++++GY V D L     + +   S   SS+I GCG  Q+
Sbjct: 143 IPGCKS-EIPCPYSITYG-DGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFGCGAVQS 200

Query: 242 GSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
           G+    +  A DG++G G  + SV S LA +G ++  FS C D    G +F    G   +
Sbjct: 201 GTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNVRGGGIF--AIGEVVE 258

Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIY 351
              S  P+  +   Y V ++S  +    L        + +G   ++DSG +  +LP  +Y
Sbjct: 259 PKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPDIVY 318

Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP 411
            E++ K        ++ L    ++ C+  +       P ++L F  + S  V  H + F 
Sbjct: 319 DELIQKVLARQPGLKLYLVEQQFR-CFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQ 377

Query: 412 ENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
             +G  ++C+        + +G D  ++G   +    +++D EN+ + W+   C   I
Sbjct: 378 FKDG--IWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMVIGWTDYNCSSSI 433


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 175/382 (45%), Gaps = 44/382 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I +G PN  + V +D GS+ LWV C  C  C   S      L   L+ YDP+SS 
Sbjct: 76  LYYTKIGLG-PN-DYYVQVDTGSDTLWVNCVGCTTCPKKSG-----LGMELTLYDPNSSK 128

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKD-PCPYIADYSTEDTSSSGYLVDDIL--HLASF 219
           +SK V C    C S      S CK  KD  CPY   Y    T+S  Y+ DD+    +   
Sbjct: 129 TSKVVPCDDEFCTSTYDGPISGCK--KDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGD 186

Query: 220 SKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
            +  P ++   SVI GCG KQ+G  S     + DG++G G  + SV S LA AG ++  F
Sbjct: 187 LRTVPDNT---SVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVF 243

Query: 278 SICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------T 329
           S C D  + G +F  G+      ++T  +P    Y+     +E    G+          +
Sbjct: 244 SHCLDTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIE--VAGDPIQLPTDIFDS 301

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-- 387
            SG   ++DSG +  +LP  IY +++ K     S   + L  + +  C++ S E+ L   
Sbjct: 302 TSGRGTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFT-CFHYSDEKSLDDA 360

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGH 441
            P ++  F +  +     H + FP  E   ++C+     T  + DG D  ++G   +   
Sbjct: 361 FPTVKFTFEEGLTLTAYPHDYLFPFKE--DMWCIGWQKSTAQTKDGKDLILLGDLVLTNK 418

Query: 442 RIVFDRENLKLAWSHSKCEEVI 463
             ++D +N+ + W+   C   I
Sbjct: 419 LFIYDLDNMSIGWTDYNCSSSI 440


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 181/394 (45%), Gaps = 46/394 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP+  + + +D G++++WV C QC +C   S     +L  +L+ Y+   SS
Sbjct: 72  LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRS-----NLGMDLTLYNIKESS 126

Query: 168 SSKNVSCSHPLCKS-----RSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           S K V C   LCK       + C S   D CPY+  Y  + +S++GY V D++     S 
Sbjct: 127 SGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYG-DGSSTAGYFVKDVVLFDQVSG 185

Query: 222 HAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               +S   SVI GCG +Q+G  SY +  A DG++G G  + S+ S L+ +G ++  F+ 
Sbjct: 186 DLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAH 245

Query: 280 CFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQ- 334
           C +  + G +F  G     T  +T  LP    Y      ++   +G++ L   T +  Q 
Sbjct: 246 CLNGVNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQ---VGHTFLNLSTDASEQR 302

Query: 335 ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVP 389
                ++DSG +  +LP  IY  +V K   L     + +Q    +Y C+  S       P
Sbjct: 303 DSKGTIIDSGTTLAYLPDGIYQPLVYKI--LSQQPNLKVQTLHDEYTCFQYSGSVDDGFP 360

Query: 390 DMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGHRI 443
           ++   F    S  V  H + F  EN    ++C+        S D     +  + ++ +++
Sbjct: 361 NVTFYFENGLSLKVYPHDYLFLSEN----LWCIGWQNSGAQSRDSKNMTLLGDLVLSNKL 416

Query: 444 VF-DRENLKLAWSHSKCEEVI-----DKSHVHLV 471
           VF D EN  + W+   C   I         VHLV
Sbjct: 417 VFYDLENQVIGWTEYNCSSSIKVRDEKTGTVHLV 450


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 164/367 (44%), Gaps = 25/367 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I IGTP V + V LD GS   WV    C QC      + + + R L+ YDP SS 
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 112

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH      +     
Sbjct: 113 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 170

Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
             +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +  FS C D  + 
Sbjct: 171 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 230

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
           G +F    G   +      PI +  + Y  V ++S  +  + L        T       +
Sbjct: 231 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 288

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG++  +LP  IY+E+++          I++       C++       K P +   F  
Sbjct: 289 DSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFEN 346

Query: 398 NQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
           + +  V   +++  +  N+    F    +    D  I+G   +    +V+D E   + W+
Sbjct: 347 DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWT 406

Query: 456 -HSKCEE 461
            H+  EE
Sbjct: 407 EHNSVEE 413


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 163/376 (43%), Gaps = 34/376 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I IGTP   + V +D GS++LWV C  C +C   S      L   L+ YDP  SS
Sbjct: 88  LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSS 142

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +   VSC    C +        C +   PC Y   Y  + +S++GY V D+L     S  
Sbjct: 143 TGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSDLLQFDQVSGD 200

Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
                  S+V  GCG +Q G       A DG++G G  + S+ S L+ AG ++  F+ C 
Sbjct: 201 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 260

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
           D  + G +F    G   Q      P+      Y V ++S  +G + L        T    
Sbjct: 261 DTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKK 318

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG + T+LP  +Y E+++        K I+        C+          P +  
Sbjct: 319 GTIIDSGTTLTYLPEIVYKEIMLAV--FAKHKDITFHNVQEFLCFQYVGRVDDDFPKITF 376

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGHR-IVFDR 447
            F  +    V  H + F EN G  ++C+      + S DG   ++  + ++ ++ +V+D 
Sbjct: 377 HFENDLPLNVYPHDYFF-EN-GDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDL 434

Query: 448 ENLKLAWSHSKCEEVI 463
           EN  + W+   C   I
Sbjct: 435 ENQVIGWTEYNCSSSI 450


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 163/376 (43%), Gaps = 34/376 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I IGTP   + V +D GS++LWV C  C +C   S      L   L+ YDP  SS
Sbjct: 3   LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSS 57

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +   VSC    C +        C +   PC Y   Y  + +S++GY V D+L     S  
Sbjct: 58  TGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSDLLQFDQVSGD 115

Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
                  S+V  GCG +Q G       A DG++G G  + S+ S L+ AG ++  F+ C 
Sbjct: 116 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 175

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
           D  + G +F    G   Q      P+      Y V ++S  +G + L        T    
Sbjct: 176 DTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKK 233

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG + T+LP  +Y E+++        K I+        C+          P +  
Sbjct: 234 GTIIDSGTTLTYLPEIVYKEIMLAV--FAKHKDITFHNVQEFLCFQYVGRVDDDFPKITF 291

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGHR-IVFDR 447
            F  +    V  H + F EN G  ++C+      + S DG   ++  + ++ ++ +V+D 
Sbjct: 292 HFENDLPLNVYPHDYFF-EN-GDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDL 349

Query: 448 ENLKLAWSHSKCEEVI 463
           EN  + W+   C   I
Sbjct: 350 ENQVIGWTEYNCSSSI 365


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 112/375 (29%), Positives = 165/375 (44%), Gaps = 32/375 (8%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I+IGTP   + V +D GS++LWV C  C +C   S      L  +L  YDP  SS
Sbjct: 82  LYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKS-----DLGIDLRLYDPKGSS 136

Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           S   VSC    C +    K    +   PC Y   Y  + +S++GY V D L     S   
Sbjct: 137 SGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYG-DGSSTTGYFVSDSLQYNQVSGDG 195

Query: 224 PQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
                 +SVI GCG +Q G       A DG++G G  + S+ S LA AG ++  FS C D
Sbjct: 196 QTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLD 255

Query: 283 ENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
               G +F  GD      +ST  +P    Y+   V +ES  +G + L        T    
Sbjct: 256 TIKGGGIFAIGDVVQPKVKSTPLVPDMPHYN---VNLESINVGGTTLQLPSHMFETGEKK 312

Query: 334 QALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             ++DSG + T+LP  +Y +V+   F K   +   S+Q +     Y  S ++    P + 
Sbjct: 313 GTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQ-DFLCIQYFQSVDDGF--PKIT 369

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGHRIVFDRE 448
             F  +    V  H + F   +    F      + S DG D  ++G   +    +V+D E
Sbjct: 370 FHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSNKVVVYDLE 429

Query: 449 NLKLAWSHSKCEEVI 463
           N  + W+   C   I
Sbjct: 430 NQVVGWTDYNCSSSI 444


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 119/439 (27%), Positives = 198/439 (45%), Gaps = 39/439 (8%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           +++  ++P  ++VE L  L + D  R   R  LQS+N      + F  +G+   F     
Sbjct: 23  LTLERAFPTNHTVE-LSQLRARDALRH--RRMLQSSNGV----VDFSVQGTFDPFQ---- 71

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
             L+YT + +GTP V F V +D GS++LWV C  C  C   S      L   L+ +DP S
Sbjct: 72  VGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSG-----LQIQLNFFDPGS 126

Query: 166 SSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           SS+S  ++CS   C      S ++C S  + C Y   Y  + + +SGY V D++HL +  
Sbjct: 127 SSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSDMMHLNTIF 185

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
           + +  ++  + V+ GC  +QTG       A DG+ G G  ++SV S L+  G+    FS 
Sbjct: 186 EGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSH 245

Query: 280 CF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGF 333
           C   D +  G +  G+        TS +P    Y+    +  V  ++  I +S    S  
Sbjct: 246 CLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNS 305

Query: 334 QA-LVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPD 390
           +  +VDSG +  +L  E Y   V      +  S   +  +GN    CY  +S      P 
Sbjct: 306 RGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTVVSRGNQ---CYLITSSVTEVFPQ 362

Query: 391 MRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDR 447
           + L F+   S ++R   +   +N   G  V+C+      G    I+G   +    +V+D 
Sbjct: 363 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDL 422

Query: 448 ENLKLAWSHSKCEEVIDKS 466
              ++ W++  C   ++ S
Sbjct: 423 AGQRIGWANYDCSLSVNVS 441


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 164/369 (44%), Gaps = 25/369 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I IGTP V + V LD GS   WV    C QC      + + + R L+ YDP SS 
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 112

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH      +     
Sbjct: 113 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 170

Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
             +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +  FS C D  + 
Sbjct: 171 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 230

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
           G +F    G   +      PI +  + Y  V ++S  +  + L        T       +
Sbjct: 231 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 288

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG++  +LP  IY+E+++          I++       C++       K P +   F  
Sbjct: 289 DSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFEN 346

Query: 398 NQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
           + +  V   +++  +  N+    F    +    D  I+G   +    +V+D E   + W+
Sbjct: 347 DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWT 406

Query: 456 -HSKCEEVI 463
            H+    ++
Sbjct: 407 EHNSMARIV 415


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 167/376 (44%), Gaps = 34/376 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT + +GTP   F V +D GS++LWV C  C QC      + + L  +L+ YDP +SS
Sbjct: 87  LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCP-----HKSGLGLDLTLYDPKASS 141

Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           +   V C    C      +    S   PC Y   Y  + +S+ G  V+D L     +   
Sbjct: 142 TGSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTYG-DGSSTVGSFVNDALQFDQVTGDG 200

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
                 +SVI GCG +Q G     + A DG++G G  + S+ S LA AG ++  F+ C D
Sbjct: 201 QTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLD 260

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ-- 334
               G +F    G   Q      P+      Y V +++  +G + L       + G +  
Sbjct: 261 TIKGGGIF--AIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRG 318

Query: 335 ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
            ++DSG + T+LP  ++ +V++  F+K    + I+        C+  S       P +  
Sbjct: 319 TIIDSGTTLTYLPELVFKKVMLAVFNK---HQDITFHDVQDFLCFEYSGSVDDGFPTLTF 375

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFDR 447
            F  + +  V  H + FP   G  V+C+      + S DG D  ++G   +    +V+D 
Sbjct: 376 HFEDDLALHVYPHEYFFP--NGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDL 433

Query: 448 ENLKLAWSHSKCEEVI 463
           EN  + W+   C   I
Sbjct: 434 ENRVIGWTDYNCSSSI 449


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 114/438 (26%), Positives = 196/438 (44%), Gaps = 37/438 (8%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           +++  ++P  + VE L  L + D  R +  ++      SS   + F  +G+   F     
Sbjct: 26  LTLERAFPTNHGVE-LSQLRARDELRHRRMLQ------SSSGVVDFSVQGTFDPFQ---- 74

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
             L+YT + +GTP V F V +D GS++LWV C      P +    + L   L+ +DP SS
Sbjct: 75  VGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQT----SGLQIQLNFFDPGSS 130

Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           S+S  ++CS   C      S ++C S  + C Y   Y  + + +SGY V D++HL +  +
Sbjct: 131 STSSMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSDMMHLNTIFE 189

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
            +  ++  + V+ GC  +QTG       A DG+ G G  ++SV S L+  G+    FS C
Sbjct: 190 GSMTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHC 249

Query: 281 F--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIGNSCLTQSGFQ 334
              D +  G +  G+        TS +P    Y+       V  ++  I +S    S  +
Sbjct: 250 LKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSR 309

Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPDM 391
             +VDSG +  +L  E Y   V      +  S + +  +GN    CY  +S      P +
Sbjct: 310 GTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRGNQ---CYLITSSVTDVFPQV 366

Query: 392 RLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRE 448
            L F+   S ++R   +   +N   G  V+C+      G    I+G   +    +V+D  
Sbjct: 367 SLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLA 426

Query: 449 NLKLAWSHSKCEEVIDKS 466
             ++ W++  C   ++ S
Sbjct: 427 GQRIGWANYDCSLSVNVS 444


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 160/360 (44%), Gaps = 24/360 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I IGTP V + V LD GS   WV    C QC      + + + R L+ YDP SS 
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 136

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH      +     
Sbjct: 137 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 194

Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
             +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +  FS C D  + 
Sbjct: 195 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 254

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
           G +F    G   +      PI +  + Y  V ++S  +  + L        T       +
Sbjct: 255 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 312

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG++  +LP  IY+E+++          I++       C++       K P +   F  
Sbjct: 313 DSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFEN 370

Query: 398 NQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
           + +  V   +++  +  N+    F    +    D  I+G   +    +V+D E   + W+
Sbjct: 371 DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWT 430


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  125 bits (314), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 102/370 (27%), Positives = 171/370 (46%), Gaps = 24/370 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T I +G+P   + V +D GS++LWV C  C +C P+     T L   LS YD  +SS
Sbjct: 76  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVK----TDLGIPLSLYDSKASS 130

Query: 168 SSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +SKNV C    C    +S     K PC Y   Y  + ++S G  V D + L   + +   
Sbjct: 131 TSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYG-DGSTSDGDFVKDNITLDQVTGNLRT 189

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           + +   V+ GCG+ Q+G      +A DG+MG G  + SV S LA  G ++  FS C D  
Sbjct: 190 APLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNM 249

Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGV----ESYCIGNSCLTQSG-FQALVD 338
           + G +F  G+      ++T  +P    Y+    G+    E   +  S  + +G    ++D
Sbjct: 250 NGGGIFAIGEVESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIID 309

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSK 397
           SG +  +LP  +Y  ++   +K+ + +++ L      + C++ +S      P + L F  
Sbjct: 310 SGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFED 366

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGHRIVFDRENLKLA 453
           +    V  H + F   E    F      + + DG D  ++G   +    +V+D EN  + 
Sbjct: 367 SLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIG 426

Query: 454 WSHSKCEEVI 463
           W+   C   I
Sbjct: 427 WADHNCSSSI 436


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 116/416 (27%), Positives = 187/416 (44%), Gaps = 68/416 (16%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           T + IGTP   F + +D+GS + +VPC  C QC           +     + P  SSS  
Sbjct: 90  TRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCG----------NHQDPRFQPDLSSSYS 139

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            V C+        +C S K  C Y   Y+ E +SSSG L +DI+     S+  PQ +   
Sbjct: 140 PVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRESELKPQHA--- 190

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
             I GC   +TG      A DG+MGLG G +S+   L + G+I +SFS+C+   D G   
Sbjct: 191 --IFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGA 247

Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
            V  G   P     ++  P+   Y  Y + ++   +    L        S    ++DSG 
Sbjct: 248 MVLGGMLAPPDMIFSNSDPLRSPY--YNIELKEIHVAGKALRVESRIFNSKHGTVLDSGT 305

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKY---CYNASSEEMLKV----PDM 391
           ++ +LP + +    V F + V+SK  SL   +G    Y   C+  +   + K+    PD+
Sbjct: 306 TYAYLPEQAF----VAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDV 361

Query: 392 RLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIV 444
            ++F   Q  S    N++F   + +G   +CL V     D      GII +N +    + 
Sbjct: 362 DMVFGNGQKLSLTPENYLFRHSKVDG--AYCLGVFQNGKDPTTLLGGIIVRNTL----VT 415

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 500
           +DR N K+ +  + C E+ ++ H+       G +P+P P+++  S  +   A  PS
Sbjct: 416 YDRHNEKIGFWKTNCSELWERLHI-------GDTPSPAPSSDTSSEHDMSPAPAPS 464


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score =  125 bits (313), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 166/379 (43%), Gaps = 40/379 (10%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +YT I+IGTP   F V +D GS++LWV C  C +C   S      L  +L+ YDP  SSS
Sbjct: 87  YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSG-----LGIDLALYDPKGSSS 141

Query: 169 SKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
              VSC +  C +          C + K PC Y A+Y  + +S++G  V D L     S 
Sbjct: 142 GSAVSCDNKFCAATYGSGEKLPGCTAGK-PCEYRAEYG-DGSSTAGSFVSDSLQYNQLSG 199

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
           +A     +++VI GCG +Q G       A DG++G G  + S  S LA AG ++  FS C
Sbjct: 200 NAQTRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHC 259

Query: 281 FDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQS 331
            D    G +F  G+      +ST  LP    Y+   V ++S  +  + L        T  
Sbjct: 260 LDTIKGGGIFAIGEVVQPKVKSTPLLPNMSHYN---VNLQSIDVAGNALQLPPHIFETSE 316

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
               ++DSG + T+LP  +Y +++   F K       ++QG     C+  S       P 
Sbjct: 317 KRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQG---FLCFEYSESVDDGFPK 373

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD------GDYGIIGQNFMMGHRIV 444
           +   F  +    V  H + F    G  ++CL   +         D  ++G   +    +V
Sbjct: 374 ITFHFEDDLGLNVYPHDYFF--QNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVV 431

Query: 445 FDRENLKLAWSHSKCEEVI 463
           +D E   + W+   C   I
Sbjct: 432 YDLEKQVIGWTDYNCSSSI 450


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 114/417 (27%), Positives = 186/417 (44%), Gaps = 68/417 (16%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           T + IGTP   F + +D+GS + +VPC  C QC           +     + P  SSS  
Sbjct: 91  TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCG----------NHQDPRFQPDLSSSYS 140

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            V C+        +C S K  C Y   Y+ E +SSSG L +DI+     S+  PQ +V  
Sbjct: 141 PVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRESELKPQRAV-- 192

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
               GC   +TG      A DG+MGLG G +S+   L + G+I +SFS+C+   D G   
Sbjct: 193 ---FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGA 248

Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
            V  G   P+    +   P+   Y  Y + ++   +    L        S    ++DSG 
Sbjct: 249 MVLGGVPAPSDMVFSHSDPLRSPY--YNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGT 306

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDM 391
           ++ +LP + +    V F   V+SK  SL+       N    C+  +   + K+    PD+
Sbjct: 307 TYAYLPEQAF----VAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDV 362

Query: 392 RLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIV 444
            ++F   Q  S    N++F   + +G   +CL V     D      GII +N +    + 
Sbjct: 363 DMVFGNGQKLSLTPENYLFRHSKVDG--AYCLGVFQNGKDPTTLLGGIIVRNTL----VT 416

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPST 501
           +DR N K+ +  + C E+ ++ H+         +P+P P+++  S ++   A  PS+
Sbjct: 417 YDRHNEKIGFWKTNCSELWERLHI-------SDAPSPAPSSDTNSETDMSPAPAPSS 466


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 106/390 (27%), Positives = 172/390 (44%), Gaps = 40/390 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP  ++ + +D GS+++WV C QC +C   S     SL  +L+ YD   SS
Sbjct: 82  LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRS-----SLGMDLTLYDIKESS 136

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           S K V C    CK       + C +    CPY+  Y  + +S++GY V DI+     S  
Sbjct: 137 SGKLVPCDQEFCKEINGGLLTGCTA-NISCPYLEIYG-DGSSTAGYFVKDIVLYDQVSGD 194

Query: 223 APQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
               S   S++ GCG +Q+G  S  +  A DG++G G  + S+ S LA +G ++  F+ C
Sbjct: 195 LKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHC 254

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQA-- 335
            +  + G +F    G   Q   +  P+      Y V + +  +G++ L   T +  Q   
Sbjct: 255 LNGVNGGGIF--AIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDR 312

Query: 336 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG +  +LP  IY  +V K        ++    + +  C+  S       P + 
Sbjct: 313 KGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYT-CFQYSESVDDGFPAVT 371

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGHRIVF-D 446
             F    S  V  H + FP       +C+        S D     +  + ++ +++VF D
Sbjct: 372 FFFENGLSLKVYPHDYLFPS---VNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYD 428

Query: 447 RENLKLAWSHSKCEEVID-----KSHVHLV 471
            EN  + W+   C   I         VHLV
Sbjct: 429 LENQAIGWAEYNCSSSIKVRDERTGTVHLV 458


>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
 gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
          Length = 388

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/259 (33%), Positives = 126/259 (48%), Gaps = 20/259 (7%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I IGTP V + V LD GS   WV    C QC      + + + R L+ YDP SS 
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 136

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH      +     
Sbjct: 137 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 194

Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
             +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +  FS C D  + 
Sbjct: 195 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 254

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
           G +F    G   +      PI +  + Y  V ++S  +  + L        T       +
Sbjct: 255 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 312

Query: 338 DSGASFTFLPTEIYAEVVV 356
           DSG++  +LP  IY+E+++
Sbjct: 313 DSGSTLVYLPEIIYSELIL 331


>gi|115469998|ref|NP_001058598.1| Os06g0717900 [Oryza sativa Japonica Group]
 gi|54291047|dbj|BAD61724.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
           Group]
 gi|113596638|dbj|BAF20512.1| Os06g0717900 [Oryza sativa Japonica Group]
          Length = 307

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 139/281 (49%), Gaps = 38/281 (13%)

Query: 253 VMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY 311
           +MGLG+  VSVPS+LA  G+++ NSFS+CF ++  G + FGD G A Q  T F+ +   +
Sbjct: 9   LMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTH 67

Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
             Y + + S  +G+  L   GF A+ DSG SFT+L    Y      F+  +S +R +  G
Sbjct: 68  SYYNISITSMSVGDKNLPL-GFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSG 126

Query: 372 NS------WKYCYNASSEEM-LKVPDMRLIFSKNQSFVVRNHIFSFPE---NEGFTV--F 419
           ++      ++YCY+ S ++  +++P + L  +    F V + ++       N    +  +
Sbjct: 127 STRSGPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGY 186

Query: 420 CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC---EEVID------------ 464
           CL V+ +D    IIGQNFM G ++VF+RE   L W    C   E++ D            
Sbjct: 187 CLAVIKSDLPIDIIGQNFMTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSP 246

Query: 465 --KSHVHLVP----PPAGQSPNP--LPTTEQQSTSNGQAAA 497
              +HV   P     PAG++P P   P     S + G  A 
Sbjct: 247 GPTTHVFPQPQESDSPAGRTPIPGAAPVPRSSSAAAGGRAG 287


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 176/389 (45%), Gaps = 38/389 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP   + V +D GS+++WV C QC +C   S     SL  +L+ Y+ + S 
Sbjct: 77  LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTS-----SLGIDLTLYNINESD 131

Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           + K V C    C   +  +    +    CPY+  Y  + +S++GY V D++  A  S   
Sbjct: 132 TGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEIYG-DGSSTAGYFVKDVVQYARVSGDL 190

Query: 224 PQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
             ++   SVI GCG +Q+G     +  A DG++G G  + S+ S LA  G ++  F+ C 
Sbjct: 191 KTTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL 250

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ- 334
           D  + G +F    G   Q   +  P+      Y V + +  +G+  L+      ++G + 
Sbjct: 251 DGTNGGGIFV--IGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRK 308

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
            A++DSG +  +LP  +Y  +V K        ++    + +  C+  S       P++  
Sbjct: 309 GAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEYT-CFQYSDSLDDGFPNVTF 367

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQNFMMGHRIVFDR 447
            F  +    V  H + FP  EG  ++C+      V S D  +  ++G   +    +++D 
Sbjct: 368 HFENSVILKVYPHEYLFPF-EG--LWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDL 424

Query: 448 ENLKLAWSHSKCEEVID-----KSHVHLV 471
           EN  + W+   C   I         VHLV
Sbjct: 425 ENQAIGWTEYNCSSSIQVQDERTGTVHLV 453


>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
           ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
           from this gene [Arabidopsis thaliana]
          Length = 388

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 98/321 (30%), Positives = 150/321 (46%), Gaps = 32/321 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP  S+ V +D GS+++WV C QC QC   S     +L   L+ Y+   S 
Sbjct: 79  LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNIDESD 133

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           S K VSC    C        S CK+    CPY+  Y  + +S++GY V D++   S +  
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKDVVQYDSVAGD 191

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               +   SVI GCG +Q+G  LD +   A DG++G G  + S+ S LA +G ++  F+ 
Sbjct: 192 LKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAH 250

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGF 333
           C D  + G +F    G   Q   +  P+      Y V + +  +G   LT      Q G 
Sbjct: 251 CLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGD 308

Query: 334 Q--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
           +  A++DSG +  +LP  IY  +V K   L    ++ +    +K C+  S       P++
Sbjct: 309 RKGAIIDSGTTLAYLPEIIYEPLVKKEPAL----KVHIVDKDYK-CFQYSGRVDEGFPNV 363

Query: 392 RLIFSKNQSFVVRNHIFSFPE 412
              F  +    V  H + FP 
Sbjct: 364 TFHFENSVFLRVYPHDYLFPH 384


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 162/376 (43%), Gaps = 34/376 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T I +GTP   + V +D GS++LWV C  C +C   S      L  +L+ YDP +SS
Sbjct: 83  LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSG-----LGLDLTFYDPKASS 137

Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           S   VSC    C +    K    +   PC Y   Y  + +S++G+ V D L     +   
Sbjct: 138 SGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYG-DGSSTTGFFVTDALQFDQVTGDG 196

Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
                 ++V  GCG +Q G       A DG++G G  + S+ S LA AG ++  F+ C D
Sbjct: 197 QTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLD 256

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQ 334
               G +F    G   Q      P+      Y V ++S  +G + L        T     
Sbjct: 257 TIKGGGIF--AIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKG 314

Query: 335 ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
            ++DSG + T+LP  ++ EV+   F+K    + I         C+          P +  
Sbjct: 315 TIIDSGTTLTYLPELVFKEVMAAIFNK---HQDIVFHNVQDFMCFQYPGSVDDGFPTITF 371

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFDR 447
            F  + +  V  H + FP   G  ++C+      + S DG D  ++G   +    +++D 
Sbjct: 372 HFEDDLALHVYPHEYFFP--NGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDL 429

Query: 448 ENLKLAWSHSKCEEVI 463
           EN  + W+   C   I
Sbjct: 430 ENQVIGWTDYNCSSSI 445


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 105/377 (27%), Positives = 171/377 (45%), Gaps = 35/377 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT + +G+P   F V +D GS++LWV C  C  C   S      L  +L+ YDP+ S 
Sbjct: 71  LYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSG-----LGMDLTLYDPNGSK 125

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +S  V C    C        S CK     CPY   Y  + +++SG  V+D L     S +
Sbjct: 126 TSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITYG-DGSTTSGSFVNDSLTFDEVSGN 183

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
                  SSVI GCG KQ+GS    +  A DG++G G  + SV S LA +G ++  FS C
Sbjct: 184 LHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHC 243

Query: 281 FDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------TQSG 332
            D +  G +F  G        +T  +P    Y+     ++    G   L       + SG
Sbjct: 244 LDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMD--VDGEPILLPLYLFDSGSG 301

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG +  +LP  IY +++ K        ++ +  + +  C++ S +     P ++
Sbjct: 302 RGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFT-CFHYSDKLDEGFPVVK 360

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFD 446
             F +  S  V  H + F   E   ++C+     +  + +G D  +IG   +    +V+D
Sbjct: 361 FHF-EGLSLTVHPHDYLFLYKE--DIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYD 417

Query: 447 RENLKLAWSHSKCEEVI 463
            EN+ + W++  C   I
Sbjct: 418 LENMVIGWTNFNCSSSI 434


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 105/389 (26%), Positives = 172/389 (44%), Gaps = 40/389 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           L++T I IGTP   + V +D GS++LWV C      P      ++L   L+ YDP  S S
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRK----SNLGIELTMYDPRGSQS 144

Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
            + V+C    C +       SC S   PC Y   Y  + +S++G+ V D L     S   
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTS-TSPCEYSISYG-DGSSTAGFFVTDFLQYNQVSGDG 202

Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
             +   +SV  GCG K  G       A DG++G G  + S+ S LA AG ++  F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 283 ENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
             + G +F  G+      ++T  +P    Y+    G++   +G + L        + +  
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGLPTNIFDSGNSK 319

Query: 334 QALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             ++DSG +  ++P  +Y A   + FDK    + IS+Q      C+  S       P++ 
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYSGSVDDGFPEVT 376

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFD 446
             F  + S +V  H + F    G  ++C+      V + DG D  ++G   +    +++D
Sbjct: 377 FHFEGDVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYD 434

Query: 447 RENLKLAWSHSKCEEVI----DKSHVHLV 471
            EN  + W+   C   I    DK   + V
Sbjct: 435 LENQAIGWADYNCSSSIKISDDKGSTYTV 463


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  122 bits (306), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 114/428 (26%), Positives = 187/428 (43%), Gaps = 57/428 (13%)

Query: 58  SVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIG 117
           S EY E L ++D +R    V              FP  G    F       L+YT I +G
Sbjct: 2   SREYFETLKAHDRRRLAAVVD-------------FPLTGDDDPFVTG----LYYTKIYLG 44

Query: 118 TPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
           TP V + V +D GS++ W+ C  C  C  ++ +   S+   L+ YDPS SS+   +SC  
Sbjct: 45  TPPVGYYVQVDTGSDVTWLNCAPCTSC--VTETQLPSI--KLTTYDPSRSSTDGALSCRD 100

Query: 177 PLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
             C +       SC S    C Y   Y  + +S+ GY + D++       +  Q +  +S
Sbjct: 101 SNCGAALGSNEVSCTS-AGYCAYSTTYG-DGSSTQGYFIQDVMTFQEIHNNT-QVNGTAS 157

Query: 232 VIIGCGRKQTGSYL-DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGS 288
           V  GCG  Q+G+ L    A DG++G G   VS+PS LA  G + N F+ C   D    G+
Sbjct: 158 VYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGGGT 217

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCLTQSGFQA--------LVDS 339
           +  G     ++ + S+ PI  + + Y VG+++  + G +  T + F          ++DS
Sbjct: 218 IVIGS---VSEPNISYTPIVSR-NHYAVGMQNIAVNGRNVTTPASFDTTSTSAGGVIMDS 273

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN- 398
           G +  +L    Y + V      VS+   S+  +  +    A        P ++L F    
Sbjct: 274 GTTLAYLVDPAYTQFV----NAVSTFESSMFSSHSQCLQLAWCSLQADFPTVKLFFDAGA 329

Query: 399 -QSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             +   RN+++S P   G   +C+     T  +    Y I+G   +  H +V+D +N  +
Sbjct: 330 VMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYDNDNRVV 389

Query: 453 AWSHSKCE 460
            W    C+
Sbjct: 390 GWKSFDCK 397


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  122 bits (305), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 167/372 (44%), Gaps = 28/372 (7%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T I +G+P   + V +D GS++LWV C  C +C P+     T L   LS YD  +SS
Sbjct: 77  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVK----TDLGIPLSLYDSKTSS 131

Query: 168 SSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +SKNV C    C    +S     K PC Y   Y    TS   ++ D+I  L   + +   
Sbjct: 132 TSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT-LEQVTGNLRT 190

Query: 226 SSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           + +   V+ GCG+ Q+G      +A DG+MG G  + S+ S LA  G  +  FS C D  
Sbjct: 191 APLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNM 250

Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS-------CLTQSGFQAL 336
           + G +F  G+      ++T  +P    Y+    G++    G+          T      +
Sbjct: 251 NGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMD--VDGDPIDLPPSLASTNGDGGTI 308

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIF 395
           +DSG +  +LP  +Y  ++   +K+ + +++ L      + C++ +S      P + L F
Sbjct: 309 IDSGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHF 365

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGHRIVFDRENLK 451
             +    V  H + F   E    F      + + DG D  ++G   +    +V+D EN  
Sbjct: 366 EDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 425

Query: 452 LAWSHSKCEEVI 463
           + W+   C   I
Sbjct: 426 IGWADHNCSSSI 437


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  122 bits (305), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 167/372 (44%), Gaps = 28/372 (7%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T I +G+P   + V +D GS++LWV C  C +C P+     T L   LS YD  +SS
Sbjct: 73  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVK----TDLGIPLSLYDSKTSS 127

Query: 168 SSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +SKNV C    C    +S     K PC Y   Y    TS   ++ D+I  L   + +   
Sbjct: 128 TSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT-LEQVTGNLRT 186

Query: 226 SSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           + +   V+ GCG+ Q+G      +A DG+MG G  + S+ S LA  G  +  FS C D  
Sbjct: 187 APLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNM 246

Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS-------CLTQSGFQAL 336
           + G +F  G+      ++T  +P    Y+    G++    G+          T      +
Sbjct: 247 NGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMD--VDGDPIDLPPSLASTNGDGGTI 304

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIF 395
           +DSG +  +LP  +Y  ++   +K+ + +++ L      + C++ +S      P + L F
Sbjct: 305 IDSGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHF 361

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGHRIVFDRENLK 451
             +    V  H + F   E    F      + + DG D  ++G   +    +V+D EN  
Sbjct: 362 EDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 421

Query: 452 LAWSHSKCEEVI 463
           + W+   C   I
Sbjct: 422 IGWADHNCSSSI 433


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 115/461 (24%), Positives = 196/461 (42%), Gaps = 55/461 (11%)

Query: 58  SVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIG 117
           S EY   L  +D +R +  +            + FP  G    F       L+YT I +G
Sbjct: 9   SSEYYRTLREHDQRRLRRILP---------EVVAFPISGDDDTFTTG----LYYTRIYLG 55

Query: 118 TPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
           TP   F V +D GS++ WV C  C  C   S     ++   +S +DP  S+S  ++SC+ 
Sbjct: 56  TPPQQFYVHVDTGSDVAWVNCVPCTNCKRAS-----NVALPISIFDPEKSTSKTSISCTD 110

Query: 177 PLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF-SKHAPQSSVQSSVI 233
             C   S S C      CPY   Y  + +S++GYL++D+L      S ++  +S  + + 
Sbjct: 111 EECYLASNSKCSFNSMSCPYSTLYG-DGSSTAGYLINDVLSFNQVPSGNSTATSGTARLT 169

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFF 291
            GCG  QTG++L     DG++G G  +VS+PS L+K  +  N F+ C   D   SG++  
Sbjct: 170 FGCGSNQTGTWLT----DGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVI 225

Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ------ALVDSGASFTF 345
           G         T  +P    Y+   + +     G +  T + F        ++DSG + T+
Sbjct: 226 GHIREPGLVYTPIVPKQSHYNVELLNIG--VSGTNVTTPTAFDLSNSGGVIMDSGTTLTY 283

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRN 405
           L    Y +   K    + S  + +      + +  + E     P++ L F+   + ++  
Sbjct: 284 LVQPAYDQFQAKVRDCMRSGVLPV-----AFQFFCTIEGYF--PNVTLYFAGGAAMLLSP 336

Query: 406 HIFSFPE--NEGFTVFCLTVMSTDGDYG-----IIGQNFMMGHRIVFDRENLKLAWSHSK 458
             + + E    G + +C + + +   YG     I G N +    +V+D  N ++ W +  
Sbjct: 337 SSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFD 396

Query: 459 CEEVIDKSHVHLVPP----PAGQSPNPLPTTEQQSTSNGQA 495
           C + I  S      P    P+   P     T   + SNG +
Sbjct: 397 CTKEISVSSTATSMPVTVFPSKAGPPGAFVTTNNAHSNGAS 437


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  121 bits (304), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 160/376 (42%), Gaps = 34/376 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I +GTP   + V +D GS++LWV C  C QC      + + L  +L+ YDP +SS
Sbjct: 85  LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCP-----HKSGLGLDLTLYDPKASS 139

Query: 168 SSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           +   V C    C +    K  K     PC Y   Y  + +S+ G  V D L     ++  
Sbjct: 140 TGSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYG-DGSSTIGSFVTDALQFDQVTRDG 198

Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
                 +SVI GCG +Q G       A DG++G G  + S+ S L  AG ++  F+ C D
Sbjct: 199 QTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLD 258

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF--------Q 334
               G +F    G   Q      P+      Y V +++  +G + L              
Sbjct: 259 TIKGGGIF--SIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKG 316

Query: 335 ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
            ++DSG + T+LP  ++ EV++  F+K    + I+        C+          P +  
Sbjct: 317 TIIDSGTTLTYLPELVFKEVMLAVFNK---HQDITFHDVQGFLCFQYPGSVDDGFPTITF 373

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLTV-----MSTDG-DYGIIGQNFMMGHRIVFDR 447
            F  + +  V  H + F    G  V+C+        S DG D  ++G   +    +++D 
Sbjct: 374 HFEDDLALHVYPHEYFFA--NGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDL 431

Query: 448 ENLKLAWSHSKCEEVI 463
           EN  + W+   C   I
Sbjct: 432 ENRVIGWTDYNCSSSI 447


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 176/388 (45%), Gaps = 51/388 (13%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           T + IGTP+  F + +D+GS + +VPC  C QC    +     ++ +   + P  SS+  
Sbjct: 94  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 153

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            V C+        +C + +  C Y   Y+ E +SSSG L +DI+     S+  PQ +V  
Sbjct: 154 PVKCN-----VDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-- 205

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
               GC   +TG      A DG+MGLG G +S+   L + G+I +SFS+C+   D G   
Sbjct: 206 ---FGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 261

Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
            V  G   P     +   P+   Y  Y + ++   +    L        S    ++DSG 
Sbjct: 262 MVLGGMPAPPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGT 319

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDM 391
           ++ +LP + +    V F   V++K  SL+       N    C+  +   + ++    PD+
Sbjct: 320 TYAYLPEQAF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDV 375

Query: 392 RLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIV 444
            ++F   Q  S    N++F   + EG   +CL V     D      GI+ +N +    + 
Sbjct: 376 DMVFGNGQKLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VT 429

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVP 472
           +DR N K+ +  + C E+ ++ H+  VP
Sbjct: 430 YDRHNEKIGFWKTNCSELWERLHISEVP 457


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 176/388 (45%), Gaps = 51/388 (13%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           T + IGTP+  F + +D+GS + +VPC  C QC    +     ++ +   + P  SS+  
Sbjct: 93  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 152

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            V C+        +C + +  C Y   Y+ E +SSSG L +DI+     S+  PQ +V  
Sbjct: 153 PVKCN-----VDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-- 204

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
               GC   +TG      A DG+MGLG G +S+   L + G+I +SFS+C+   D G   
Sbjct: 205 ---FGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 260

Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
            V  G   P     +   P+   Y  Y + ++   +    L        S    ++DSG 
Sbjct: 261 MVLGGMPAPPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGT 318

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDM 391
           ++ +LP + +    V F   V++K  SL+       N    C+  +   + ++    PD+
Sbjct: 319 TYAYLPEQAF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDV 374

Query: 392 RLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIV 444
            ++F   Q  S    N++F   + EG   +CL V     D      GI+ +N +    + 
Sbjct: 375 DMVFGNGQKLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VT 428

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVP 472
           +DR N K+ +  + C E+ ++ H+  VP
Sbjct: 429 YDRHNEKIGFWKTNCSELWERLHISEVP 456


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 111/389 (28%), Positives = 181/389 (46%), Gaps = 48/389 (12%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D+GS + +VPC  C QC                ++ P  
Sbjct: 93  YYTTRLWI--GTPPQMFALIVDSGSTVTYVPCSDCEQCG----------KHQDPKFQPEL 140

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + V C+        +C   K+ C Y  +Y+ E +SS G L +D++   + S+  PQ
Sbjct: 141 SSTYQPVKCNM-----DCNCDDDKEQCVYEREYA-EHSSSKGVLGEDLISFGNESQLTPQ 194

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG++GLG GD+S+   L   GLI NSF +C+   D
Sbjct: 195 RAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD 248

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIG-NSCLTQSGFQALVD 338
            G    +  G   P+    T   P    Y   D   + V    +  NS +      A++D
Sbjct: 249 VGGGSMILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLD 308

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK---YCYNASSE--EMLKV-PDM 391
           SG ++ +LP   +A       + VS  K+I     ++K   +   AS++  E+ K+ P +
Sbjct: 309 SGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSV 368

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIVFD 446
            +IF   QS+++    + F  ++    +CL V     D+     GI+ +N +    +V+D
Sbjct: 369 EMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTL----VVYD 424

Query: 447 RENLKLAWSHSKCEEVIDKSHVHLVPPPA 475
           REN K+ +  + C E+ D+ H+   PPPA
Sbjct: 425 RENSKVGFWRTNCSELSDRLHIDGAPPPA 453


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 180/391 (46%), Gaps = 52/391 (13%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D+GS + +VPC  C QC                ++ P  
Sbjct: 92  YYTTRLWI--GTPPQMFALIVDSGSTVTYVPCSDCEQCG----------KHQDPKFQPEM 139

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + V C+        +C   ++ C Y  +Y+ E +SS G L +D++   + S+  PQ
Sbjct: 140 SSTYQPVKCNM-----DCNCDDDREQCVYEREYA-EHSSSKGVLGEDLISFGNESQLTPQ 193

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG++GLG GD+S+   L   GLI NSF +C+   D
Sbjct: 194 RAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD 247

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
            G    +  G   P+    T   P    Y  Y + +    +    L+           A+
Sbjct: 248 VGGGSMILGGFDYPSDMVFTDSDPDRSPY--YNIDLTGIRVAGKQLSLHSRVFDGEHGAV 305

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK-YCYNASSE----EMLKV-P 389
           +DSG ++ +LP   +A       + VS+ K+I     ++K  C+  ++     E+ K+ P
Sbjct: 306 LDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFP 365

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIV 444
            + ++F   QS+++    + F  ++    +CL V     D+     GI+ +N +    +V
Sbjct: 366 SVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTL----VV 421

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 475
           +DREN K+ +  + C E+ D+ H+   PPPA
Sbjct: 422 YDRENSKVGFWRTNCSELSDRLHIDGAPPPA 452


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 105/388 (27%), Positives = 168/388 (43%), Gaps = 38/388 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           L++T I IGTP   + V +D GS++LWV C      P      ++L   L+ YDP  S S
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRK----SNLGIELTMYDPRGSQS 144

Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
            + V+C    C +       SC S   PC Y   Y  + +S++G+ V D L     S   
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTS-TSPCEYSISYG-DGSSTAGFFVTDFLQYNQVSGDG 202

Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
             +   +SV  GCG K  G       A DG++G G  + S+ S LA AG ++  F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQ 334
             + G +F    G   Q      P+      Y V ++   +G + L        + +   
Sbjct: 263 TVNGGGIFA--IGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKG 320

Query: 335 ALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
            ++DSG +  ++P  +Y A   + FDK    + IS+Q      C+  S       P++  
Sbjct: 321 TIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYSGSVDDGFPEVTF 377

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFDR 447
            F  + S +V  H + F    G  ++C+      V + DG D  ++G   +    +++D 
Sbjct: 378 HFEGDVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDL 435

Query: 448 ENLKLAWSHSKCEEVI----DKSHVHLV 471
           EN  + W+   C   I    DK   + V
Sbjct: 436 ENQAIGWADYNCSSSIKISDDKGSTYTV 463


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/383 (27%), Positives = 174/383 (45%), Gaps = 46/383 (12%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I +G  +  + V +D GS+ LWV C  C  C   S      L  +L+ YDP+ S 
Sbjct: 75  LYYTKIGLGPKD--YYVQVDTGSDTLWVNCVGCTACPKKSG-----LGMDLTLYDPNLSK 127

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL--HLASFS 220
           +SK V C    C S      S C      CPY   Y    T+S  Y+ DD+    +    
Sbjct: 128 TSKAVPCDDEFCTSTYDGQISGCTK-GMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDL 186

Query: 221 KHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
           +  P ++   SVI GCG KQ+G  S     + DG++G G  + SV S LA AG ++  FS
Sbjct: 187 RTVPDNT---SVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFS 243

Query: 279 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQ 330
            C D    G +F    G   Q      P+ +    Y V ++   +    +        + 
Sbjct: 244 HCLDSISGGGIF--AIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSS 301

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK--V 388
           SG   ++DSG +  +LP  IY +++ K     S  ++ L  + +  C++ S EE +    
Sbjct: 302 SGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQFT-CFHYSDEESVDDLF 360

Query: 389 PDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGH 441
           P ++  F +  +     R+++F F E+    ++C+        + DG   I+  + ++ +
Sbjct: 361 PTVKFTFEEGLTLTTYPRDYLFLFKED----MWCVGWQKSMAQTKDGKELILLGDLVLAN 416

Query: 442 R-IVFDRENLKLAWSHSKCEEVI 463
           + +V+D +N+ + W+   C   I
Sbjct: 417 KLVVYDLDNMAIGWADYNCSSSI 439


>gi|195658449|gb|ACG48692.1| hypothetical protein [Zea mays]
 gi|413938915|gb|AFW73466.1| hypothetical protein ZEAMMB73_105703 [Zea mays]
          Length = 149

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 61/126 (48%), Positives = 79/126 (62%), Gaps = 13/126 (10%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           +FSS++VHR SDEA+     + G       WP++ S  Y   LL +D +RQK R+     
Sbjct: 26  TFSSRMVHRLSDEARLEAGPRMG------LWPQRGSGGYYRALLRSDLQRQKRRL----- 74

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
             + +NQLL  S+G  T   GN   WL+Y W+D+GTP  SFLVALD GS+L WVPC CIQ
Sbjct: 75  --AGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132

Query: 143 CAPLSA 148
           CAPLS+
Sbjct: 133 CAPLSS 138


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/390 (26%), Positives = 172/390 (44%), Gaps = 40/390 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP  ++ + +D GS+++WV C QC +C   S     +L  +L+ YD   SS
Sbjct: 84  LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRS-----NLGMDLTLYDIKESS 138

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           S K V C    CK       + C +    CPY+  Y  + +S++GY V DI+     S  
Sbjct: 139 SGKFVPCDQEFCKEINGGLLTGCTA-NISCPYLEIYG-DGSSTAGYFVKDIVLYDQVSGD 196

Query: 223 APQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
               S   S++ GCG +Q+G  S  +  A  G++G G  + S+ S LA +G ++  F+ C
Sbjct: 197 LKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHC 256

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQA-- 335
            +  + G +F    G   Q   +  P+      Y V + +  +G++ L   T +  Q   
Sbjct: 257 LNGVNGGGIF--AIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDR 314

Query: 336 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG +  +LP  IY  +V K        ++    + +  C+  S       P + 
Sbjct: 315 KGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYT-CFQYSESVDDGFPAVT 373

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGHRIVF-D 446
             F    S  V  H + FP  +    +C+        S D     +  + ++ +++VF D
Sbjct: 374 FYFENGLSLKVYPHDYLFPSGD---FWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYD 430

Query: 447 RENLKLAWSHSKCEEVID-----KSHVHLV 471
            EN  + W+   C   I         VHLV
Sbjct: 431 LENQVIGWTEYNCSSSIKVRDERTGTVHLV 460


>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
          Length = 746

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 109/382 (28%), Positives = 171/382 (44%), Gaps = 47/382 (12%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ--CAPLSASYYTSLDRNLSEYDPS 164
           Y   Y  + +GTP   F V +D GS + +VPC      C P         +   + +DP 
Sbjct: 75  YGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGP---------NHQDAAFDPE 125

Query: 165 SSSSSKNVSCSHPLCKSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           +SS++  +SC+ P C   S  C      C Y   Y+ E +SSSG L++D+L L      A
Sbjct: 126 ASSTASRISCTSPKCSCGSPRCGCSTQQCTYTRSYA-EQSSSSGILLEDVLALHDGLPGA 184

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD- 282
           P       +I GC  ++TG      A DG+ GLG  D SV + L KAG+I + FS+CF  
Sbjct: 185 P-------IIFGCETRETGEIFRQRA-DGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGM 236

Query: 283 -ENDSGSVFFGDQ---GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS------G 332
            E D G++  GD    G  + Q T  L        Y V + S  +    L  S      G
Sbjct: 237 VEGD-GALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQG 295

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSS---KRISLQGNSW-KYCYN-ASSEEMLK 387
           +  ++DSG +FT++P+ ++       +K   S   KR+      +   C+  A S + L+
Sbjct: 296 YGTVLDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLE 355

Query: 388 V-----PDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
                 P M + F +  S V+   N++F    N G   +CL V        ++G      
Sbjct: 356 ALSSVFPSMEVQFDQGTSLVLGPLNYLFVHTFNSG--KYCLGVFDNGRAGTLLGGITFRN 413

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             + +DR N ++ +  + C+E+
Sbjct: 414 VLVRYDRANQRVGFGPALCKEL 435


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 101/381 (26%), Positives = 170/381 (44%), Gaps = 43/381 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++  I +G P   + V +D GS++LWV C  C +C   S      L   L+ YDP SS+
Sbjct: 81  LYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKS-----DLGVKLTLYDPQSST 135

Query: 168 SSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           S+  + C    C +      +   K L  PC Y   Y  + +S++G+ V D L     + 
Sbjct: 136 SATRIYCDDDFCAATYNGVLQGCTKDL--PCQYSVVYG-DGSSTAGFFVKDNLQFDRVTG 192

Query: 222 HAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
           +   SS   SVI GCG KQ+G       A DG++G G  + S+ S LA AG ++  F+ C
Sbjct: 193 NLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHC 252

Query: 281 FDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQS 331
            D    G +F  G+       +T  +P    Y+     +E   +G + L        T  
Sbjct: 253 LDNVKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIE---VGGNVLELPTDIFDTGD 309

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---YCYNASSEEMLKV 388
               ++DSG +  +LP  +Y  ++ K    + S++  L+ ++ +    C+  +       
Sbjct: 310 RRGTIIDSGTTLAYLPEVVYESMMTK----IVSEQPGLKLHTVEEQFTCFQYTGNVNEGF 365

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHR 442
           P ++  F+ + S  V  H + F  +E   V+C       + S DG D  ++G   +    
Sbjct: 366 PVVKFHFNGSLSLTVNPHDYLFQIHE--EVWCFGWQNSGMQSKDGRDMTLLGDLVLSNKL 423

Query: 443 IVFDRENLKLAWSHSKCEEVI 463
           +++D EN  + W+   C   I
Sbjct: 424 VLYDLENQAIGWTDYNCSSSI 444


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 117/427 (27%), Positives = 188/427 (44%), Gaps = 35/427 (8%)

Query: 63  ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
           +L LS   +R   R +    + +S   + FP +G+   F       L++T + +G+P   
Sbjct: 41  KLELSQLKERDSFRHRRILQSTTSGGVVDFPVQGTFNPFL----VGLYFTRVQLGSPPKD 96

Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-- 180
           F V +D GS++LWV C      P+++     L   L+ +DP SS+++  VSCS   C   
Sbjct: 97  FYVQIDTGSDVLWVSCSSCNGCPVTS----GLQIPLTFFDPGSSTTAALVSCSDQRCTAG 152

Query: 181 ---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ-----SSVQSSV 232
              S S C S  + C Y   Y  + + +SGY V D++HL +    + +      +  SSV
Sbjct: 153 IQSSDSLCSSRTNQCGYTFQYG-DGSGTSGYYVADLMHLDTLLLSSGELSQICQTYDSSV 211

Query: 233 IIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--V 289
              C   QTG       A DG+ G G  ++SV S LA  G+    FS C   +DSG   +
Sbjct: 212 SFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKGDDSGGGVL 271

Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIGNSCLTQSGFQA-LVDSGASFT 344
             G+        T  +P    Y+ Y     V  ++  I  S    S  Q  +VDSG +  
Sbjct: 272 VLGEIVEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGASSNQGTIVDSGTTLA 331

Query: 345 FLPTEIYAEVVVKFDKLVS-SKRISL-QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
           +L    Y   V     +VS + R  L +GN    CY  +S      P + L F+   S +
Sbjct: 332 YLAEGAYDPFVSAITSVVSLNARTYLSKGNQ---CYLVTSSVNDVFPQVSLNFAGGASLI 388

Query: 403 VRNHIFSFPENE--GFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDRENLKLAWSHSKC 459
           +    +   +N   G  V+C+    T G    I  + ++  +I V+D  N ++ W++  C
Sbjct: 389 LNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLKDKIFVYDIANQRVGWTNYDC 448

Query: 460 EEVIDKS 466
              ++ S
Sbjct: 449 SMSVNVS 455


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 163/360 (45%), Gaps = 36/360 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +GTP   F    D GS+L+WV  + C  C+              + +DP  SS+ + +
Sbjct: 59  ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCS------------GGTIFDPRQSSTFREM 106

Query: 173 SCSHPLCKSR-SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
            CS  LC     SC+     C Y  +Y + +T   G    D + L + S  + +     S
Sbjct: 107 DCSSQLCAELPGSCEPGSSTCSYSYEYGSGETE--GEFARDTISLGTTSDGSQKFP---S 161

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
             +GCG   +G   DG   DG++GLG G VS+ S L+ A  I + FS C      +++S 
Sbjct: 162 FAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSYCLVDINSQSESS 215

Query: 288 SVFFGDQGP---ATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQALVDSGASF 343
            + FG          QST   P  + Y  Y++  V    +    +   G   ++DSG + 
Sbjct: 216 PLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPG-TTIIDSGTTL 274

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           T++P+ +Y  V+ + + +V+  R+         CY+ SS    K P + +  +       
Sbjct: 275 TYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPP 334

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            ++ F   ++ G TV CL + S  G    IIG     G+ I++DR + +L++  +KCE +
Sbjct: 335 SSNYFLVVDDSGDTV-CLAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 104/410 (25%), Positives = 179/410 (43%), Gaps = 27/410 (6%)

Query: 72  RQKTRVKLQSNNNSSRNQLL-FPSEGS----QTHFFGNQFYWLHYTWIDIGTPNVSFLVA 126
           +++ RV+      SS   ++ FP +G+       F+   F  L+YT + +G+P   F V 
Sbjct: 47  KERDRVRHSRMLQSSGGGVVDFPVQGTFDPFLVGFYFGSFCRLYYTRLQLGSPPRDFYVQ 106

Query: 127 LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KS 181
           +D GS++LWV C      P+S+  +  L+     +DP SS ++  +SCS   C      S
Sbjct: 107 IDTGSDVLWVSCSSCNGCPVSSGLHIPLNF----FDPGSSPTASLISCSDQRCSLGLQSS 162

Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
            S C +  + C Y   Y  + + +SGY V D+LH  +    +   +  + ++ GC   QT
Sbjct: 163 DSVCAAQNNQCGYTFQYG-DGSGTSGYYVSDLLHFDTILGGSVMKNSSAPIVFGCSTLQT 221

Query: 242 GSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--VFFGDQGPAT 298
           G       A DG+ G G  D+SV S LA  G+    FS C   +DSG   +  G+     
Sbjct: 222 GDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVEPN 281

Query: 299 QQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGFQA-LVDSGASFTFLPTEIYAE 353
              T  +P    Y+    + +V  ++  I  S    S  Q  ++DSG +  +L    Y  
Sbjct: 282 IVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSGTTLAYLTEAAYDP 341

Query: 354 VVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFP 411
            +      V S  +S   +    CY  SS      P + L F+   S ++  ++++    
Sbjct: 342 FISAITSTV-SPSVSPYLSKGNQCYLTSSSINDVFPQVSLNFAGGTSMILIPQDYLIQQS 400

Query: 412 ENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
              G  ++C+      G +  I+G   +     V+D    ++ W++  C+
Sbjct: 401 SINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYDIAGQRIGWANYDCK 450


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 163/360 (45%), Gaps = 36/360 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +GTP   F    D GS+L+WV  + C  C+              + +DP  SS+ + +
Sbjct: 59  ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCS------------GGTIFDPRQSSTFREM 106

Query: 173 SCSHPLCKSR-SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
            CS  LC     SC+     C Y  +Y + +T   G    D + L + S  + +     S
Sbjct: 107 DCSSQLCTELPGSCEPGSSACSYSYEYGSGETE--GEFARDTISLGTTSGGSQKFP---S 161

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
             +GCG   +G   DG   DG++GLG G VS+ S L+ A  I + FS C      +++S 
Sbjct: 162 FAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSYCLVDINSQSESS 215

Query: 288 SVFFGDQGP---ATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQALVDSGASF 343
            + FG          QST   P  + Y  Y++  V    +    +   G   ++DSG + 
Sbjct: 216 PLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPG-TTIIDSGTTL 274

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           T++P+ +Y  V+ + + +V+  R+         CY+ SS    K P + +  +       
Sbjct: 275 TYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPP 334

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            ++ F   ++ G TV CL + S  G    IIG     G+ I++DR + +L++  +KCE +
Sbjct: 335 SSNYFLVVDDSGDTV-CLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 112/425 (26%), Positives = 189/425 (44%), Gaps = 37/425 (8%)

Query: 63  ELLLSNDWKRQKTR--VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
           +L LS   +R + R    LQS   S    + FP +G+   F       L+YT + +GTP 
Sbjct: 10  KLKLSKLKERDRVRHGRMLQS---SGVGVVDFPVQGTFDPFL----VGLYYTRLQLGTPP 62

Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC- 179
             F V +D GS++LWV C      P+++  +  L+     +DP SS ++  +SCS   C 
Sbjct: 63  RDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNF----FDPGSSPTASLISCSDQRCS 118

Query: 180 ----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
                S S C +  + C Y   Y  + + +SGY V D+LH  +    +  ++  + ++ G
Sbjct: 119 LGLQSSDSVCSAQNNLCGYNFQYG-DGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFG 177

Query: 236 CGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ 294
           C   QTG       A DG+ G G  D+SV S LA  G+   +FS C   +DSG       
Sbjct: 178 CSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGIL-VL 236

Query: 295 GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFL 346
           G   + +  + P+      Y + ++S  +    L        T S    ++DSG +  +L
Sbjct: 237 GEIVEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYL 296

Query: 347 PTEIYAEVVVKFDKLVS-SKRISL-QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV- 403
               Y   +     +VS S R  L +GN   +CY  SS      P + L F+   S ++ 
Sbjct: 297 AEAAYDPFISAITSIVSPSVRPYLSKGN---HCYLISSSINDIFPQVSLNFAGGASMILI 353

Query: 404 -RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDRENLKLAWSHSKCEE 461
            ++++       G  ++C+      G    I  + ++  +I V+D  N ++ W++  C  
Sbjct: 354 PQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYDIANQRIGWANYDCSM 413

Query: 462 VIDKS 466
            ++ S
Sbjct: 414 SVNVS 418


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  118 bits (295), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 108/393 (27%), Positives = 173/393 (44%), Gaps = 61/393 (15%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           + T + IGTP   F + +D+GS + +VPC  C QC           +     + P  SS+
Sbjct: 85  YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCG----------NHQDPRFQPDLSST 134

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
              V CS     +  +C S K  C Y   Y+ E +SSSG L +DI+   + S+  PQ +V
Sbjct: 135 YSPVKCS-----ADCTCDSDKSQCTYERQYA-EMSSSSGVLGEDIVSFGTESELKPQRAV 188

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
                 GC   +TG      A DG+MGLG G +S+   L   G+I +SFS+C+   D G 
Sbjct: 189 -----FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGG 242

Query: 289 ---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDS 339
              V      P     +   P+   Y  Y + ++   +    L        S    ++DS
Sbjct: 243 GAMVLGAMPAPPDMVFSRSDPVRSPY--YNIELKEIHVAGKALRLDPRIFDSKHGTVLDS 300

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----P 389
           G ++ +LP + +    V F   V+SK   L+       N    C+  +   + ++    P
Sbjct: 301 GTTYAYLPEQAF----VAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFP 356

Query: 390 DMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHR 442
           D+ ++F   Q  S    N++F   + EG   +CL V     D      GI+ +N +    
Sbjct: 357 DVDMVFGDGQKLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL---- 410

Query: 443 IVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 475
           + +DR N K+ +  + C E+ ++ HV   P PA
Sbjct: 411 VTYDRHNEKIGFWKTNCSELWERLHVSGAPSPA 443


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/383 (27%), Positives = 173/383 (45%), Gaps = 51/383 (13%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y  + T I IGTP  +F + +D GS L +VPC  C QC           D N   + P  
Sbjct: 89  YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCG-------KHQDPN---FQPDW 138

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + + CS        +C S    C Y   Y+ E +SSSG L +DI+     S+  PQ
Sbjct: 139 SSTYQPLKCS-----MECTCDSEMMHCVYDRQYA-EMSSSSGVLGEDIVSFGKQSELKPQ 192

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG+MGLG GD+S+   L + G+I NSFS+C+   D
Sbjct: 193 RTV-----FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMD 246

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
            G    V  G   PA    T   P    Y  Y + ++   I       N  +    +  +
Sbjct: 247 VGGGAMVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPINPMVFDGKYGTI 304

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV----P 389
           +DSG ++ +LP   +        K ++S ++ +QG    Y   C++    ++ ++    P
Sbjct: 305 LDSGTTYAYLPEPAFKAFKDAIMKELNSLKL-IQGPDRNYNDICFSGVGSDVSQLSKTFP 363

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIV 444
            + L+FS      +    + F  ++    +CL +   + D      GII +N +    ++
Sbjct: 364 AVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTL----VM 419

Query: 445 FDRENLKLAWSHSKCEEVIDKSH 467
           +DRE+LK+ +  + C E+ +  H
Sbjct: 420 YDREHLKIGFWKTNCSEIWEILH 442


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/383 (27%), Positives = 173/383 (45%), Gaps = 51/383 (13%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y  + T I IGTP  +F + +D GS L +VPC  C QC           D N   + P  
Sbjct: 89  YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCG-------KHQDPN---FQPDW 138

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + + CS        +C S    C Y   Y+ E +SSSG L +DI+     S+  PQ
Sbjct: 139 SSTYQPLKCS-----MECTCDSEMMHCVYDRQYA-EMSSSSGVLGEDIVSFGKQSELKPQ 192

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG+MGLG GD+S+   L + G+I NSFS+C+   D
Sbjct: 193 RTV-----FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMD 246

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
            G    V  G   PA    T   P    Y  Y + ++   I       N  +    +  +
Sbjct: 247 VGGGAMVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPINPMVFDGKYGTI 304

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV----P 389
           +DSG ++ +LP   +        K ++S ++ +QG    Y   C++    ++ ++    P
Sbjct: 305 LDSGTTYAYLPEPAFKAFKDAIMKELNSLKL-IQGPDRNYNDICFSGVGSDVSQLSKTFP 363

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIV 444
            + L+FS      +    + F  ++    +CL +   + D      GII +N +    ++
Sbjct: 364 AVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTL----VM 419

Query: 445 FDRENLKLAWSHSKCEEVIDKSH 467
           +DRE+LK+ +  + C E+ +  H
Sbjct: 420 YDREHLKIGFWKTNCSEIWEILH 442


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 104/390 (26%), Positives = 174/390 (44%), Gaps = 61/390 (15%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           + T + IGTP+  F + +D+GS + +VPC  C QC           +     + P  SS+
Sbjct: 91  YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCG----------NHQDPRFQPDLSST 140

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
              V C+        +C + +  C Y   Y+ E +SSSG L +DI+     S+  PQ +V
Sbjct: 141 YSPVKCN-----VDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV 194

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
                 GC   +TG      A DG+MGLG G +S+   L + G+I +SFS+C+   D G 
Sbjct: 195 -----FGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGG 248

Query: 289 ---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDS 339
              V  G   P     +   P+   Y  Y + ++   +    L        S    ++DS
Sbjct: 249 GTMVLGGMPAPPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDS 306

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----P 389
           G ++ +LP + +    V F   V++K  SL+       N    C+  +   + ++    P
Sbjct: 307 GTTYAYLPEQAF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFP 362

Query: 390 DMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHR 442
           D+ ++F   Q  S    N++F   + EG   +CL V     D      GI+ +N +    
Sbjct: 363 DVDMVFGNGQKLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL---- 416

Query: 443 IVFDRENLKLAWSHSKCEEVIDKSHVHLVP 472
           + +DR N K+ +  + C E+ ++ H+  VP
Sbjct: 417 VTYDRHNEKIGFWKTNCSELWERLHISEVP 446


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 100/378 (26%), Positives = 169/378 (44%), Gaps = 38/378 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I+IG+P+  + V +D GS++LWV C +C  C   S      L   L++YDP+ S 
Sbjct: 84  LYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSG-----LGIELTQYDPAGSG 138

Query: 168 SSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           ++  V C    C + S      +C S   PC +   Y  + +S++G+ V D +     S 
Sbjct: 139 TT--VGCDQEFCVANSPNGLPPACPSTSSPCQFRIAYG-DGSSTTGFYVSDSVQYNQVSG 195

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
           +   +   +S+  GCG +  G     + A DG++G G  D S+ S LA A  ++  F+ C
Sbjct: 196 NGQTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHC 255

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--- 335
            D    G +F    G   Q      P+ +    Y V ++   +G + L    S F +   
Sbjct: 256 LDTVHGGGIF--AIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGDS 313

Query: 336 ---LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
              ++DSG +  +LP E+Y  ++   FDK    + ++L       C+  S       P +
Sbjct: 314 KGTIIDSGTTLAYLPREVYRTLLTAVFDKY---QDLALHNYQDFVCFQFSGSIDDGFPVV 370

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVF 445
              F    +  V  H + F +NE   ++C+      V + DG D  ++G   +    +V+
Sbjct: 371 TFSFEGEITLNVYPHDYLF-QNEN-DLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVY 428

Query: 446 DRENLKLAWSHSKCEEVI 463
           D E   + W+   C   I
Sbjct: 429 DLEKQVIGWADYNCSSSI 446


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/376 (23%), Positives = 162/376 (43%), Gaps = 45/376 (11%)

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           YT + +GTP  +F V +D GS + ++PC+ C  C   +A ++          DP  S+++
Sbjct: 14  YTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWF----------DPDKSTTA 63

Query: 170 KNVSCSHPLCKSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           K ++C  PLC   + SC    D C Y   Y+ E +SS G++++D           P S  
Sbjct: 64  KKLACGDPLCNCGTPSCTCNNDRCYYSRTYA-ERSSSEGWMIEDTFGF-------PDSDS 115

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
              ++ GC   +TG      A DG+MG+G    +  S L +  +I++ FS+CF     G 
Sbjct: 116 PVRLVFGCENGETGEIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGI 174

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG--------NSCLTQSGFQALVDSG 340
           +  GD       +T + P+      ++  V+   I         ++ +   G+  ++DSG
Sbjct: 175 LLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSG 234

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKY---CYNASSEEMLKV----PDMR 392
            +FT+LPT+ +  +       V  K + S  G   +Y   C+  + ++   +    P   
Sbjct: 235 TTFTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAE 294

Query: 393 LIFSKNQSFV---VRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
            +F          +R    S P       +CL +        ++G   +    + +DR N
Sbjct: 295 FVFGGGAKLTLPPLRYLFLSKPAE-----YCLGIFDNGNSGALVGGVSVRDVVVTYDRRN 349

Query: 450 LKLAWSHSKCEEVIDK 465
            K+ ++   C +V  K
Sbjct: 350 SKVGFTTMACADVARK 365


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 101/391 (25%), Positives = 183/391 (46%), Gaps = 55/391 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C QC                ++ P S
Sbjct: 83  YYTTRLWI--GTPPQMFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFQPES 130

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + V C+        +C S +  C Y   Y+ E ++SSG L +D++   + S+ APQ
Sbjct: 131 SSTYQPVKCT-----IDCNCDSDRMQCVYERQYA-EMSTSSGVLGEDLISFGNQSELAPQ 184

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG+MGLG GD+S+   L    +I +SFS+C+   D
Sbjct: 185 RAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMD 238

Query: 286 --SGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
              G++  G   P +  + ++  P+   Y  Y + ++   +       N+ +       +
Sbjct: 239 VGGGAMVLGGISPPSDMAFAYSDPVRSPY--YNIDLKEIHVAGKRLPLNANVFDGKHGTV 296

Query: 337 VDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV---- 388
           +DSG ++ +LP      + + +VK  +L S K+IS    ++   C++ +  ++ ++    
Sbjct: 297 LDSGTTYAYLPEAAFLAFKDAIVK--ELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSF 354

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRI 443
           P + ++F   Q + +    + F  ++    +CL V     D      GII +N +    +
Sbjct: 355 PVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTL----V 410

Query: 444 VFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 474
           V+DRE  K+ +  + C E+ ++  + + PPP
Sbjct: 411 VYDREQTKIGFWKTNCAELWERLQISVAPPP 441


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 110/441 (24%), Positives = 183/441 (41%), Gaps = 47/441 (10%)

Query: 44  SGNVSVADSWPKKN-SVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFF 102
           +G   V   +P+ + S ++L  L ++D +R    +    +     N L  P+E       
Sbjct: 27  TGVFEVRRKFPRHDGSGKHLANLRAHDARRHGRSLAAAVDLPLGGNGL--PTETG----- 79

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
                 L++T I IGTP  S+ V +D GS++LWV C      P      + L   L+ YD
Sbjct: 80  ------LYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRK----SGLGIELTLYD 129

Query: 163 PSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
           PS SSS   V+C    C +       SC     PC Y   Y  + +S++G+ V D L   
Sbjct: 130 PSGSSSGTGVTCGQDFCVATHGGVIPSCVPAA-PCQYSISYG-DGSSTTGFFVTDFLQYN 187

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNS 276
             S ++  +   +S+  GCG K  G     + A DG++G G  + S+ S LA AG ++  
Sbjct: 188 QVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKV 247

Query: 277 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
           F+ C D  + G +F    G   Q   S  P+      Y V +E+  +G   L        
Sbjct: 248 FAHCLDTINGGGIF--AIGDVVQPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFD 305

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
                  ++DSG +  +LP  +Y  ++ K         + L+ +    C+  S       
Sbjct: 306 IGESKGTIIDSGTTLAYLPGVVYNAIMSKV--FAQYGDMPLKNDQDFQCFRYSGSVDDGF 363

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGHRI 443
           P +   F       +  H + F   E   ++C+      + + DG   ++  +    +R+
Sbjct: 364 PIITFHFEGGLPLNIHPHDYLFQNGE---LYCMGFQTGGLQTKDGKDMVLLGDLAFSNRL 420

Query: 444 V-FDRENLKLAWSHSKCEEVI 463
           V +D EN  + W+   C   I
Sbjct: 421 VLYDLENQVIGWTDYNCSSSI 441


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 113/436 (25%), Positives = 185/436 (42%), Gaps = 65/436 (14%)

Query: 74  KTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWL----------HYTWIDIGTPNVSF 123
           K  + +Q  NN + N  +   E S+  F GN    L          ++  + +GTP    
Sbjct: 126 KESITIQQQNNLA-NAFVASLESSKGEFSGNIMATLESGASLGTGEYFLDMFVGTPPKHV 184

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
            + LD GS+L W     IQC P     Y   ++N S Y P  SS+ +N+SC  P C+  S
Sbjct: 185 WLILDTGSDLSW-----IQCDPC----YDCFEQNGSHYYPKDSSTYRNISCYDPRCQLVS 235

Query: 184 S------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
           S      CK+    CPY  DY+    ++  +  +      ++     +      V+ GCG
Sbjct: 236 SSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMFGCG 295

Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGSVFFG 292
               G +  GA+  G++GLG G +S PS +    +  +SFS C  +     + S  + FG
Sbjct: 296 HWNKG-FFYGAS--GLLGLGRGPISFPSQIQ--SIYGHSFSYCLTDLFSNTSVSSKLIFG 350

Query: 293 DQGPATQQS----TSFLPIGEKYDA--YFVGVESYCIGNSCLTQS--------------- 331
           +            T+ L   E  D   Y++ ++S  +G   L  S               
Sbjct: 351 EDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADA 410

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM-LKVPD 390
           G   ++DSG++ TF P   Y  +   F+K +  ++I+        CYN S   M +++PD
Sbjct: 411 GGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQVELPD 470

Query: 391 MRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFD 446
             + F+     +F   N+ + +  +E   V CL +M T       IIG        I++D
Sbjct: 471 FGIHFADGGVWNFPAENYFYQYEPDE---VICLAIMKTPNHSHLTIIGNLLQQNFHILYD 527

Query: 447 RENLKLAWSHSKCEEV 462
            +  +L +S  +C EV
Sbjct: 528 VKRSRLGYSPRRCAEV 543


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 112/441 (25%), Positives = 198/441 (44%), Gaps = 42/441 (9%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           +++  ++P  + VE  EL   +  + ++    LQS N      + FP +G+    F    
Sbjct: 25  LTLERAFPSNDGVELSELRARDSLRHRRM---LQSTNYV----VDFPVKGT----FDPSQ 73

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
             L+YT + +GTP     V +D GS++LWV C      P +    + L   L+ +DP SS
Sbjct: 74  VGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQT----SGLQIQLNYFDPGSS 129

Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           S+S  +SC    C+     S +SC    + C Y   Y  + + +SGY V D++H AS  +
Sbjct: 130 STSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYG-DGSGTSGYYVSDLMHFASIFE 188

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
               ++  +SV+ GC   QTG       A DG+ G G   +SV S L+  G+    FS C
Sbjct: 189 GTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHC 248

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
              ++SG       G   + +  + P+      Y + ++S  +    +        T + 
Sbjct: 249 LKGDNSGGGVL-VLGEIVEPNIVYSPLVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNN 307

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKV-P 389
              +VDSG +  +L  E Y   V+    ++  S + +  +GN    CY  ++   + + P
Sbjct: 308 RGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQ---CYLITTSSNVDIFP 364

Query: 390 DMRLIFSKNQSFVVRNHIFSFPEN---EGFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VF 445
            + L F+   S V+R   +   +N   EG +V+C+      G    I  + ++  +I V+
Sbjct: 365 QVSLNFAGGASLVLRPQDYLMQQNFIGEG-SVWCIGFQKISGQSITILGDLVLKDKIFVY 423

Query: 446 DRENLKLAWSHSKCEEVIDKS 466
           D    ++ W++  C   ++ S
Sbjct: 424 DLAGQRIGWANYDCSLPVNVS 444


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 106/392 (27%), Positives = 170/392 (43%), Gaps = 44/392 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP+  + V +D GS+++WV C QC +C   S     SL   L+ YD   S+
Sbjct: 86  LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTS-----SLGMELTPYDLEEST 140

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           + K VSC    C        S C +    CPY+  Y  + +S++GY V D +     S  
Sbjct: 141 TGKLVSCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYG-DGSSTAGYFVKDYVQYNRVSGD 198

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
              ++   S+  GCG +Q+G        A DG++G G  + S+ S LA    ++  F+ C
Sbjct: 199 LETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHC 258

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA--- 335
            D  + G +F    G   Q   +  P+      Y V +    +G+  L  S   F+A   
Sbjct: 259 LDGTNGGGIF--AMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDR 316

Query: 336 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDM 391
              ++DSG +  +LP  IY  +V K   L     + +Q    +Y C+  S       P +
Sbjct: 317 KGTIIDSGTTLAYLPELIYEPLVAKI--LSQQHNLEVQTIHGEYKCFQYSERVDDGFPPV 374

Query: 392 RLIFSKNQSFVVRNHIFSFP-ENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGHRIV- 444
              F  +    V  H + F  EN    ++C+      + S D     +  + ++ +++V 
Sbjct: 375 IFHFENSLLLKVYPHEYLFQYEN----LWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVL 430

Query: 445 FDRENLKLAWSHSKCEEVI-----DKSHVHLV 471
           +D EN  + W+   C   I         VHLV
Sbjct: 431 YDLENQTIGWTEYNCSSSIKVQDEQTGTVHLV 462


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 172/389 (44%), Gaps = 40/389 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           L++T I IGTP   + V +D GS++LWV C      P      ++L   L+ YDP  S S
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRK----SNLGIELTMYDPRGSQS 144

Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
            + V+C    C +       SC S   PC Y   Y  + +S++G+ V D L     S   
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTDFLQYNQVSGDG 202

Query: 224 PQSSVQSSVIIGCGRKQTGSYL-DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
             +   +SV  GCG K  G       A DG++G G  + S+ S LA AG ++  F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 283 ENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
             + G +F  G+      ++T  +P    Y+    G++   +G + L        + +  
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGLPTNIFDSGNSK 319

Query: 334 QALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             ++DSG +  ++P  +Y A   + FDK    + IS+Q      C+  S       P++ 
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYSGSVDDGFPEVT 376

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-----STDGDYGIIGQNFMMGHRIV-FD 446
             F  + S +V  H + F    G  ++C+        + DG    +  + ++ +++V +D
Sbjct: 377 FHFEGDVSLIVSPHDYLF--QNGKNLYCMGFQNGGGKTKDGKDLGLLGDLVLSNKLVLYD 434

Query: 447 RENLKLAWSHSKCEEVI----DKSHVHLV 471
            EN  + W+   C   I    DK   + V
Sbjct: 435 LENQAIGWADYNCSSSIKISDDKGSTYTV 463


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 159/368 (43%), Gaps = 25/368 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T + +GTP   F V +D GS++LWV C  C  C   S      L   L+ +D +SSS
Sbjct: 80  LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSG-----LGIQLNYFDTTSSS 134

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +++ V CSHP+C S+     + C    + C Y   Y  + + +SGY V D  +  +    
Sbjct: 135 TARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYG-DGSGTSGYYVSDTFYFDAVLGE 193

Query: 223 APQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
           +  ++  ++++ GC   Q+G       A DG+ G G G++SV S L+  G+    FS C 
Sbjct: 194 SLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCL 253

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
              DSG       G   +    + P+      Y + ++S  +    L        T S  
Sbjct: 254 KGEDSGGGIL-VLGEILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSNR 312

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++D+G +  +L  E Y   V      V S+  +   N    CY  S+      P +  
Sbjct: 313 GTIIDTGTTLAYLVEEAYDPFVSAITAAV-SQLATPTINKGNQCYLVSNSVSEVFPPVSF 371

Query: 394 IFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
            F+   + +++   ++       G  ++C+      G   I+G   +     V+D  + +
Sbjct: 372 NFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQR 431

Query: 452 LAWSHSKC 459
           + W++  C
Sbjct: 432 IGWANYDC 439


>gi|6562288|emb|CAB62658.1| putative protein [Arabidopsis thaliana]
          Length = 426

 Score =  115 bits (287), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 78/262 (29%), Positives = 136/262 (51%), Gaps = 17/262 (6%)

Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
           +++ C S    CPY   Y +  + S+G LV+D++H+++    A  +       I  G  Q
Sbjct: 124 TKARCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEARDAR------ITFGESQ 177

Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
            G + +  A +G+MGL + D++VP++L KAG+  +SFS+CF  N  G++ FGD+G + Q 
Sbjct: 178 LGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQL 236

Query: 301 STSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
            T   P+       F  V +  + +G   +  + F A  DSG + T+L    Y  +   F
Sbjct: 237 ET---PLSGTISPMFYDVSITKFKVGKVTV-DTEFTATFDSGTAVTWLIEPYYTALTTNF 292

Query: 359 DKLVSSKRISLQGNS-WKYCY-NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG- 415
              V  +R+S   +S +++CY   S+ +  K+P +        ++ V + I  F  ++G 
Sbjct: 293 HLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGS 352

Query: 416 FTVFCLTVMS-TDGDYGIIGQN 436
           F V+CL V+   + D+ IIG+N
Sbjct: 353 FQVYCLAVLKQVNADFSIIGRN 374


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/390 (26%), Positives = 171/390 (43%), Gaps = 40/390 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  + IGTP+  + V +D GS+++WV C QC +C   S     SL   L+ Y+   S 
Sbjct: 85  LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTS-----SLGMELTLYNIKDSV 139

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           S K V C    C        S C +    CPY+  Y  + +S++GY V D++     S  
Sbjct: 140 SGKLVPCDEEFCYEVNGGPLSGCTA-NMSCPYLEIYG-DGSSTAGYFVKDVVQYDRVSGD 197

Query: 223 APQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
              +S   SVI GCG +Q+G        A DG++G G  + S+ S LA    ++  F+ C
Sbjct: 198 LQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHC 257

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ 334
            D  + G +F    G   Q   +  P+      Y V + +  +G   L       ++G +
Sbjct: 258 LDGINGGGIF--AIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDR 315

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             A++DSG +  +LP  +Y  +V K        ++ +  + +  C+  S       P++ 
Sbjct: 316 KGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYT-CFQYSGSVDDGFPNVT 374

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQNFMMGHRIVFD 446
             F  +    V  H + FP  EG  ++C+      + S D  +  ++G   +    +++D
Sbjct: 375 FHFENSVFLKVHPHEYLFP-FEG--LWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYD 431

Query: 447 RENLKLAWSHSKCEEVID-----KSHVHLV 471
            EN  + W+   C   I         VHLV
Sbjct: 432 LENQAIGWTEYNCSSSIKVQDERTGTVHLV 461


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 102/390 (26%), Positives = 164/390 (42%), Gaps = 46/390 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T I +GTP   + V +D GS++LWV C  C +C   S      L  +L+ YDP +SS
Sbjct: 86  LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSG-----LGLDLTFYDPKASS 140

Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           S   VSC    C +    K    +   PC Y   Y  + +S++G+ + D L     +   
Sbjct: 141 SGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYG-DGSSTTGFFITDALQFDQVTGDG 199

Query: 224 PQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
                 +++  GCG +Q G   +   A DG++G G  + S+ S LA AG  +  F+ C D
Sbjct: 200 QTQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLD 259

Query: 283 ENDSGS--------------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
               G               VFF   G         + I      Y V ++S  +G + L
Sbjct: 260 TIKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTL 319

Query: 329 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYN 379
                   T      ++DSG + T+LP  ++ +V+   D + S  R I+        C+ 
Sbjct: 320 QLPAHVFETGEKKGTIIDSGTTLTYLPELVFKQVM---DVVFSKHRDIAFHNLQDFLCFQ 376

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGII 433
            S       P +   F  + +  V  H + FP   G  ++C+      + S DG D  ++
Sbjct: 377 YSGSVDDGFPTITFHFEDDLALHVYPHEYFFP--NGNDIYCVGFQNGALQSKDGKDIVLM 434

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
           G   +    +V+D EN  + W+   C   I
Sbjct: 435 GDLVLSNKLVVYDLENQVIGWTDYNCSSSI 464


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/263 (30%), Positives = 121/263 (46%), Gaps = 24/263 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I IGTP   + V +D GS++LWV C  C +C   S      L   L+ YDP  SS
Sbjct: 32  LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSS 86

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +   VSC    C +        C +   PC Y   Y  + +S++GY V D+L     S  
Sbjct: 87  TGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSDLLQFDQVSGD 144

Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
                  S+V  GCG +Q G       A DG++G G  + S+ S L+ AG ++  F+ C 
Sbjct: 145 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 204

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
           D  + G +F    G   Q      P+      Y V ++S  +G + L        T    
Sbjct: 205 DTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKK 262

Query: 334 QALVDSGASFTFLPTEIYAEVVV 356
             ++DSG + T+LP  +Y E+++
Sbjct: 263 GTIIDSGTTLTYLPEIVYKEIML 285


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 116/441 (26%), Positives = 200/441 (45%), Gaps = 42/441 (9%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           +++  ++P  + VE  EL   +  + ++    LQS N      + FP +G+    F    
Sbjct: 25  LTLERAFPSNDGVELSELRARDSLRHRRM---LQSTNYV----VDFPVKGT----FDPSQ 73

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
             L+YT + +GTP   F V +D GS++LWV C      P +    + L   L+ +DP SS
Sbjct: 74  VGLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQT----SGLQIQLNYFDPRSS 129

Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           S+S  +SCS   C+     S +SC S  + C Y   Y  + + +SGY V D++H A   +
Sbjct: 130 STSSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYG-DGSGTSGYYVSDLMHFAGIFE 188

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
               ++  +SV+ GC   QTG       A DG+ G G   +SV S L+  G+    FS C
Sbjct: 189 GTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHC 248

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
              ++SG       G   + +  + P+ +    Y + ++S  +    +        T + 
Sbjct: 249 LKGDNSGGGVL-VLGEIVEPNIVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNN 307

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKV-P 389
              +VDSG +  +L  E Y   V     LV  S + +  +GN    CY  ++   + + P
Sbjct: 308 RGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRGNQ---CYLITTSSNVDIFP 364

Query: 390 DMRLIFSKNQSFVVRNHIFSFPEN---EGFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VF 445
            + L F+   S V+R   +   +N   EG +V+C+      G    I  + ++  +I V+
Sbjct: 365 QVSLNFAGGASLVLRPQDYLMQQNYIGEG-SVWCIGFQRIPGQSITILGDLVLKDKIFVY 423

Query: 446 DRENLKLAWSHSKCEEVIDKS 466
           D    ++ W++  C   ++ S
Sbjct: 424 DLAGQRIGWANYDCSLPVNVS 444


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 97/369 (26%), Positives = 166/369 (44%), Gaps = 25/369 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T I +G+P   + V +D GS++LW+ C+ C +C        T+L+  LS +D ++SS
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPT-----KTNLNFRLSLFDMNASS 127

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +SK V C    C   S   S +    C Y   Y+ E TS  G  + D+L L   +     
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDMLTLEQVTGDLKT 186

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
             +   V+ GCG  Q+G   +G +A DGVMG G  + SV S LA  G  +  FS C D  
Sbjct: 187 GPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246

Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCLTQSGFQALVDS 339
             G +F  G       ++T  +P    Y+   +G++    S  +  S +   G   +VDS
Sbjct: 247 KGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGG--TIVDS 304

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKN 398
           G +  + P  +Y  ++   + +++ + + L      + C++ S+      P +   F  +
Sbjct: 305 GTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDS 361

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTV--MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAW 454
               V  H + F   E    F      ++TD   +  ++G   +    +V+D +N  + W
Sbjct: 362 VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGW 421

Query: 455 SHSKCEEVI 463
           +   C   I
Sbjct: 422 ADHNCSSSI 430


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 173/383 (45%), Gaps = 51/383 (13%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C QC                ++DP S
Sbjct: 82  YYTTRLWI--GTPPQQFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFDPES 129

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ K + C+         C S    C Y   Y+ E ++SSG L +D++   + S+  PQ
Sbjct: 130 SSTYKPIKCNIDCI-----CDSDGVQCVYERQYA-EMSTSSGVLGEDVISFGNQSELIPQ 183

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG+MGLG GD+S+   L + G I +SFS+C+   D
Sbjct: 184 RAV-----FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD 237

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGV-ESYCIGNSCLTQSG-----FQAL 336
            G    V  G   P+    T   P+   Y  Y V + E +  G      SG     + A+
Sbjct: 238 IGGGAMVLGGISPPSDMIFTYSDPVRSPY--YNVDLKEIHVAGKKLPLSSGIFDGRYGAV 295

Query: 337 VDSGASFTFLPTEIYAEVV-VKFDKLVSSKRISLQGNSWK-YCYNASSEEML----KVPD 390
           +DSG ++ +LP E ++       D++ S K+I     ++K  C++ +  +      K P 
Sbjct: 296 LDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIVF 445
           + ++F   Q   +    + F  ++    +CL +     D      GI+ +N +    +++
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTL----VMY 411

Query: 446 DRENLKLAWSHSKCEEVIDKSHV 468
           DR N K+ +  + C E+ ++  +
Sbjct: 412 DRANSKIGFWKTNCSELWERLRI 434


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 173/383 (45%), Gaps = 51/383 (13%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C QC                ++DP S
Sbjct: 82  YYTTRLWI--GTPPQQFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFDPES 129

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ K + C+         C S    C Y   Y+ E ++SSG L +D++   + S+  PQ
Sbjct: 130 SSTYKPIKCNIDCI-----CDSDGVQCVYERQYA-EMSTSSGVLGEDVISFGNQSELIPQ 183

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG+MGLG GD+S+   L + G I +SFS+C+   D
Sbjct: 184 RAV-----FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD 237

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGV-ESYCIGNSCLTQSG-----FQAL 336
            G    V  G   P+    T   P+   Y  Y V + E +  G      SG     + A+
Sbjct: 238 IGGGAMVLGGISPPSDMIFTYSDPVRSPY--YNVDLKEIHVAGKKLPLSSGIFDGRYGAV 295

Query: 337 VDSGASFTFLPTEIYAEVV-VKFDKLVSSKRISLQGNSWK-YCYNASSEEML----KVPD 390
           +DSG ++ +LP E ++       D++ S K+I     ++K  C++ +  +      K P 
Sbjct: 296 LDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIVF 445
           + ++F   Q   +    + F  ++    +CL +     D      GI+ +N +    +++
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTL----VMY 411

Query: 446 DRENLKLAWSHSKCEEVIDKSHV 468
           DR N K+ +  + C E+ ++  +
Sbjct: 412 DRANSKIGFWKTNCSELWERLRI 434


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 166/376 (44%), Gaps = 34/376 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++  I IGTP+  + V +D GS++LWV C  C +C   S      L  +L+ YD  +S+
Sbjct: 73  LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 127

Query: 168 SSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +S  V C    C         CK  L+  C Y   Y  + +S++GY V D +     S +
Sbjct: 128 TSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQDFVQYNRISGN 184

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              +    +V+ GCG KQ+G     + A DG++G G  + S+ S LA +G ++  FS C 
Sbjct: 185 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 244

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ- 334
           D  D G +F    G   +   +  P+ +    Y V ++   +G   L       +SG + 
Sbjct: 245 DNVDGGGIF--AIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRK 302

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG +  + P E+Y  ++ K        R+     ++  C++ +       P + L
Sbjct: 303 GTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDDGFPTVTL 361

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFDR 447
            F K+ S  V  H + F   E    +C+        + DG D  ++G   +    +V+D 
Sbjct: 362 HFDKSISLTVYPHEYLFQVKE--FEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 419

Query: 448 ENLKLAWSHSKCEEVI 463
           E   + W    C   I
Sbjct: 420 EKQGIGWVEYNCSSSI 435


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 166/376 (44%), Gaps = 34/376 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++  I IGTP+  + V +D GS++LWV C  C +C   S      L  +L+ YD  +S+
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 208

Query: 168 SSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +S  V C    C         CK  L+  C Y   Y  + +S++GY V D +     S +
Sbjct: 209 TSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQDFVQYNRISGN 265

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              +    +V+ GCG KQ+G     + A DG++G G  + S+ S LA +G ++  FS C 
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ- 334
           D  D G +F    G   +   +  P+ +    Y V ++   +G   L       +SG + 
Sbjct: 326 DNVDGGGIF--AIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRK 383

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG +  + P E+Y  ++ K        R+     ++  C++ +       P + L
Sbjct: 384 GTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDDGFPTVTL 442

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFDR 447
            F K+ S  V  H + F   E    +C+        + DG D  ++G   +    +V+D 
Sbjct: 443 HFDKSISLTVYPHEYLFQVKE--FEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 500

Query: 448 ENLKLAWSHSKCEEVI 463
           E   + W    C   I
Sbjct: 501 EKQGIGWVEYNCSSSI 516


>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
 gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 432

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 167/391 (42%), Gaps = 53/391 (13%)

Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
            +G+ + + L+Y  ++IG P   + + +D+GS+L W+ C   C  C  +    Y      
Sbjct: 54  LYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY------ 107

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS--------RSSCKSLKDPCPYIADYSTEDTSSSGYL 209
                    + SK V C H LC S        +  C+S  + C Y+  Y+ +  SS+G L
Sbjct: 108 -------RPTKSKLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYA-DQGSSTGVL 159

Query: 210 VDDILHLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLL 267
           V+D     SF+      SV + SV  GCG  Q     D ++P DGV+GLG G VS+ S L
Sbjct: 160 VND-----SFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQL 214

Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGN 325
            + G+ +N    C      G +FFGD     Q++T + P+      + Y  G  S   G+
Sbjct: 215 KQRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGD 273

Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
             L     + + DSG+SFT+   + Y  +V      +S         S   C+    E  
Sbjct: 274 RSLGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPF 332

Query: 386 LKVPDMRLIFSKNQSFVV-----RNHIFSFPENEGFTVF-----CLTVMSTD----GDYG 431
             V D+R  F   +S V+     +  +   P      V      CL +++       D  
Sbjct: 333 KSVLDVRKEF---KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLS 389

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IIG   M  H +++D E  K+ W  + C+  
Sbjct: 390 IIGDITMQDHMVIYDNEKGKIGWIRAPCDRA 420


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/382 (25%), Positives = 170/382 (44%), Gaps = 32/382 (8%)

Query: 106 FYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
           F  L++T + +G+P   F V +D GS++LW+   CI C+  +  + + L   L  +D + 
Sbjct: 79  FVGLYFTKVKLGSPAKEFYVQIDTGSDILWI--NCITCS--NCPHSSGLGIELDFFDTAG 134

Query: 166 SSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS-F 219
           SS++  VSC  P+C      + S C S  + C Y   Y  + + ++GY V D ++  +  
Sbjct: 135 SSTAALVSCGDPICSYAVQTATSECSSQANQCSYTFQYG-DGSGTTGYYVSDTMYFDTVL 193

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
              +  ++  S++I GC   Q+G       A DG+ G G G +SV S L+  G+    FS
Sbjct: 194 LGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFS 253

Query: 279 ICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
            C    EN  G +  G+     + S  + P+      Y + ++S  +    L        
Sbjct: 254 HCLKGGENGGGVLVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFA 310

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCYNASSEEML 386
           T +    +VDSG +  +L  E Y   V      VS  SK I  +GN    CY  S+    
Sbjct: 311 TTNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ---CYLVSNSVGD 367

Query: 387 KVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 444
             P + L F    S V+   +++  +   +G  ++C+     +  + I+G   +     V
Sbjct: 368 IFPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFV 427

Query: 445 FDRENLKLAWSHSKCEEVIDKS 466
           +D  N ++ W+   C   ++ S
Sbjct: 428 YDLANQRIGWADYDCSLSVNVS 449


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 93/329 (28%), Positives = 151/329 (45%), Gaps = 23/329 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT + +GTP V F V +D GS++LWV C  C  C   S      L   L+ +DP SSS
Sbjct: 24  LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSG-----LQIQLNFFDPGSSS 78

Query: 168 SSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +S  ++CS   C      S ++C S  + C Y   Y  + + +SGY V D++HL +  + 
Sbjct: 79  TSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSDMMHLNTIFEG 137

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
           +  ++  + V+ GC  +QTG       A DG+ G G  ++SV S L+  G+    FS C 
Sbjct: 138 SVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL 197

Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGFQA 335
             D +  G +  G+        TS +P    Y+    +  V  ++  I +S    S  + 
Sbjct: 198 KGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRG 257

Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
            +VDSG +  +L  E Y   V      +  + +    +    CY  +S      P + L 
Sbjct: 258 TIVDSGTTLAYLAEEAYDPFVSAITASI-PQSVHTAVSRGNQCYLITSSVTEVFPQVSLN 316

Query: 395 FSKNQSFVVRNHIFSFPENE--GFTVFCL 421
           F+   S ++R   +   +N   G  V+C+
Sbjct: 317 FAGGASMILRPQDYLIQQNSIGGAAVWCI 345


>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
 gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
          Length = 433

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 166/390 (42%), Gaps = 52/390 (13%)

Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
            +G+ + + L+Y  ++IG P   + + +D+GS+L W+ C   C  C  +    Y      
Sbjct: 56  LYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY------ 109

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 210
                    + SK V C H LC S       +  C S  + C Y+  Y+ +  SS+G L+
Sbjct: 110 -------RPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLI 161

Query: 211 DDILHLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
           +D     SF+      SV + SV  GCG  Q     D ++P DGV+GLG G VS+ S L 
Sbjct: 162 ND-----SFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLK 216

Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNS 326
           + G+ +N    C      G +FFGD     Q++T + P+      + Y  G  S   G+ 
Sbjct: 217 QRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDR 275

Query: 327 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
            L     + + DSG+SFT+   + Y  +V      +S         S   C+    E   
Sbjct: 276 SLGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFK 334

Query: 387 KVPDMRLIFSKNQSFVV-----RNHIFSFPENEGFTVF-----CLTVMSTD----GDYGI 432
            V D+R  F   +S V+     +  +   P      V      CL +++       D  I
Sbjct: 335 SVLDVRKEF---KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSI 391

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IG   M  H +++D E  K+ W  + C+  
Sbjct: 392 IGDITMQDHMVIYDNEKGKIGWIRAPCDRA 421


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 115/423 (27%), Positives = 183/423 (43%), Gaps = 64/423 (15%)

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
           ++LL ++D  R    VKL+S+  S       P EG    +       L++T + +GTP  
Sbjct: 1   MQLLKAHDRGRM---VKLKSSAVS------LPVEGVADPYIAG----LYFTQVQLGTPPR 47

Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
           ++ + +D GS+LLWV C  CI C   S      L   +  YD  +S+SS  V CS P C 
Sbjct: 48  TYNLQVDTGSDLLWVNCHPCIGCPAFS-----DLKIPIVPYDVKASASSSKVPCSDPSCT 102

Query: 181 -----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
                S S C   ++ C Y   Y  + + + GYLV+D+LH           +  ++VI G
Sbjct: 103 LITQISESGCND-QNQCGYSFQYG-DGSGTLGYLVEDVLHY--------MVNATATVIFG 152

Query: 236 CGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 292
           CG KQ+G       A DG++G G  D+S  S LAK G   N F+ C D  E   G +  G
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLG 212

Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-------QSGFQALV-DSGASFT 344
           +      Q T  +P    Y+   V ++S  + N+ LT           Q  + DSG +  
Sbjct: 213 NVIEPDIQYTPLVPYMYHYN---VVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLA 269

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV-PDMRLIF-SKNQSFV 402
           +LP E Y            ++ +SL    +  C    S  + K+ P++ L F   + +  
Sbjct: 270 YLPDEAYQAF---------TQAVSLVVAPFLLCDTRLSRFIYKLFPNVVLYFEGASMTLT 320

Query: 403 VRNHIFSFPENEGFTVFCLTVMS-----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
              ++          ++C+   S     ++  Y I G   +    +V+D E  ++ W   
Sbjct: 321 PAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPF 380

Query: 458 KCE 460
            C+
Sbjct: 381 DCK 383


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 115/423 (27%), Positives = 183/423 (43%), Gaps = 64/423 (15%)

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
           ++LL ++D  R    VKL+S+  S       P EG    +       L++T + +GTP  
Sbjct: 1   MQLLKAHDRGRM---VKLKSSAVS------LPVEGVADPYIAG----LYFTQVQLGTPPR 47

Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
           ++ + +D GS+LLWV C  CI C   S      L   +  YD  +S+SS  V CS P C 
Sbjct: 48  TYNLQVDTGSDLLWVNCHPCIGCPAFS-----DLKIPIVPYDVKASASSSKVPCSDPSCT 102

Query: 181 -----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
                S S C   ++ C Y   Y  + + + GYLV+D+LH           +  ++VI G
Sbjct: 103 LITQISESGCND-QNQCGYSFQYG-DGSGTLGYLVEDVLHY--------MVNATATVIFG 152

Query: 236 CGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 292
           CG KQ+G       A DG++G G  D+S  S LAK G   N F+ C D  E   G +  G
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLG 212

Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-------QSGFQALV-DSGASFT 344
           +      Q T  +P    Y+   V ++S  + N+ LT           Q  + DSG +  
Sbjct: 213 NVIEPDIQYTPLVPYMSHYN---VVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLA 269

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV-PDMRLIF-SKNQSFV 402
           +LP E Y            ++ +SL    +  C    S  + K+ P++ L F   + +  
Sbjct: 270 YLPDEAYQAF---------TQAVSLVVAPFLLCDTRLSRFIYKLFPNVVLYFEGASMTLT 320

Query: 403 VRNHIFSFPENEGFTVFCLTVMS-----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
              ++          ++C+   S     ++  Y I G   +    +V+D E  ++ W   
Sbjct: 321 PAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPF 380

Query: 458 KCE 460
            C+
Sbjct: 381 DCK 383


>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
 gi|194692946|gb|ACF80557.1| unknown [Zea mays]
          Length = 424

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 166/390 (42%), Gaps = 52/390 (13%)

Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
            +G+ + + L+Y  ++IG P   + + +D+GS+L W+ C   C  C  +    Y      
Sbjct: 47  LYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY------ 100

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 210
                    + SK V C H LC S       +  C S  + C Y+  Y+ +  SS+G L+
Sbjct: 101 -------RPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLI 152

Query: 211 DDILHLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
           +D     SF+      SV + SV  GCG  Q     D ++P DGV+GLG G VS+ S L 
Sbjct: 153 ND-----SFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLK 207

Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNS 326
           + G+ +N    C      G +FFGD     Q++T + P+      + Y  G  S   G+ 
Sbjct: 208 QRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDR 266

Query: 327 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
            L     + + DSG+SFT+   + Y  +V      +S         S   C+    E   
Sbjct: 267 SLGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFK 325

Query: 387 KVPDMRLIFSKNQSFVV-----RNHIFSFPENEGFTVF-----CLTVMSTD----GDYGI 432
            V D+R  F   +S V+     +  +   P      V      CL +++       D  I
Sbjct: 326 SVLDVRKEF---KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSI 382

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IG   M  H +++D E  K+ W  + C+  
Sbjct: 383 IGDITMQDHMVIYDNEKGKIGWIRAPCDRA 412


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score =  112 bits (281), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 95/387 (24%), Positives = 172/387 (44%), Gaps = 45/387 (11%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS++ +VPC  C QC                ++ P  
Sbjct: 12  YYTTRLWI--GTPPQRFALIVDTGSSVTYVPCSSCEQCG----------RHQDPKFQPDL 59

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ ++V C+        +C   K  C Y   Y+ E ++SSG L +DI+   + S  APQ
Sbjct: 60  SSTYQSVKCN-----IDCNCDDEKQQCVYERQYA-EMSTSSGVLGEDIISFGNLSALAPQ 113

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---D 282
            +V      GC   +TG      A DG+MG+G GD+S+   L   G+I +SFS+C+    
Sbjct: 114 RAV-----FGCENMETGDLYSQHA-DGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMG 167

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
                 V  G   P+    +   P+   Y  Y + ++   +       N  +       +
Sbjct: 168 IGGGAMVLGGISPPSNMVFSQSDPVRSPY--YNIDLKEIHVAGKPLPLNPTVFDGKHGTI 225

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV----P 389
           +DSG ++ +LP   +        K + S +  ++G    Y   C++ +  ++ ++    P
Sbjct: 226 LDSGTTYAYLPEAAFVSFKDAIMKELHSLK-PIRGPDPNYNDICFSGAGSDISQLSSSFP 284

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRE 448
            + ++F   Q  ++    + F  ++    +CL +     D   ++G   +    +++DRE
Sbjct: 285 AVEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRE 344

Query: 449 NLKLAWSHSKCEEVIDKSHVHLVPPPA 475
           N K+ +  + C E+ ++ +V   PPPA
Sbjct: 345 NSKIGFWKTNCSELWERLNVDGAPPPA 371


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  112 bits (280), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 165/376 (43%), Gaps = 35/376 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++  I IGTP+  + V +D GS++LWV C  C +C   S      L  +L+ YD  +S+
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 208

Query: 168 SSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +S  V C    C         CK  L+  C Y   Y  + +S++GY V D +     S +
Sbjct: 209 TSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQDFVQYNRISGN 265

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              +    +V+ GCG KQ+G     + A DG++G G  + S+ S LA +G ++  FS C 
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ- 334
           D  D G +F    G   +   +  P+ +    Y V ++   +G   L       +SG + 
Sbjct: 326 DNVDGGGIF--AIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRK 383

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG +  + P E+Y  ++ K        R+     ++  C++ +       P + L
Sbjct: 384 GTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDDGFPTVTL 442

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVFDR 447
            F K+ S  V  H + F        +C+        + DG D  ++G   +    +V+D 
Sbjct: 443 HFDKSISLTVYPHEYLFQHE---FEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 499

Query: 448 ENLKLAWSHSKCEEVI 463
           E   + W    C   I
Sbjct: 500 EKQGIGWVEYNCSSSI 515


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score =  111 bits (278), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 165/379 (43%), Gaps = 40/379 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I+IG+P   + V +D GS++LWV C +C  C   S      L   L++YDP+ S 
Sbjct: 83  LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSG-----LGIELTQYDPAGSG 137

Query: 168 SSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           ++  V C    C + S      +C S   PC +   Y  + ++++G+ V D +     S 
Sbjct: 138 TT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVTDFVQYNQVSG 194

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
           +   ++  +S+  GCG  Q G  L  +  A DG++G G  D S+ S LA A  ++  F+ 
Sbjct: 195 NGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAH 253

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA-- 335
           C D    G +F    G   Q      P+      Y V ++   +G + L    S F +  
Sbjct: 254 CLDTVRGGGIF--AIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGD 311

Query: 336 ----LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
               ++DSG +  +LP E+Y  ++   FDK    + + L       C+  S       P 
Sbjct: 312 SKGTIIDSGTTLAYLPREVYRTLLAAVFDKY---QDLPLHNYQDFVCFQFSGSIDDGFPV 368

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIV 444
           +   F  + +  V    + F       ++C+      V + DG D  ++G   +    +V
Sbjct: 369 ITFSFEGDLTLNVYPDDYLFQNRN--DLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVV 426

Query: 445 FDRENLKLAWSHSKCEEVI 463
           +D E   + W+   C   I
Sbjct: 427 YDLEKEVIGWTDYNCSSSI 445


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score =  111 bits (278), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 165/379 (43%), Gaps = 40/379 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I+IG+P   + V +D GS++LWV C +C  C   S      L   L++YDP+ S 
Sbjct: 83  LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSG-----LGIELTQYDPAGSG 137

Query: 168 SSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           ++  V C    C + S      +C S   PC +   Y  + ++++G+ V D +     S 
Sbjct: 138 TT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVTDFVQYNQVSG 194

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
           +   ++  +S+  GCG  Q G  L  +  A DG++G G  D S+ S LA A  ++  F+ 
Sbjct: 195 NGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAH 253

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA-- 335
           C D    G +F    G   Q      P+      Y V ++   +G + L    S F +  
Sbjct: 254 CLDTVRGGGIF--AIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGD 311

Query: 336 ----LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
               ++DSG +  +LP E+Y  ++   FDK    + + L       C+  S       P 
Sbjct: 312 SKGTIIDSGTTLAYLPREVYRTLLAAVFDKY---QDLPLHNYQDFVCFQFSGSIDDGFPV 368

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIV 444
           +   F  + +  V    + F       ++C+      V + DG D  ++G   +    +V
Sbjct: 369 ITFSFKGDLTLNVYPDDYLFQNRN--DLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVV 426

Query: 445 FDRENLKLAWSHSKCEEVI 463
           +D E   + W+   C   I
Sbjct: 427 YDLEKEVIGWTDYNCSSSI 445


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 96/382 (25%), Positives = 170/382 (44%), Gaps = 32/382 (8%)

Query: 106 FYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
           F  L++T + +G+P   F V +D GS++LW+   CI C+  +  + + L   L  +D + 
Sbjct: 79  FVGLYFTKVKLGSPAKDFYVQIDTGSDILWI--NCITCS--NCPHSSGLGIELDFFDTAG 134

Query: 166 SSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS-F 219
           SS++  VSC+ P+C      + S C S  + C Y   Y  + + ++GY V D ++  +  
Sbjct: 135 SSTAALVSCADPICSYAVQTATSGCSSQANQCSYTFQYG-DGSGTTGYYVSDTMYFDTVL 193

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
              +  ++  S+++ GC   Q+G       A DG+ G G G +SV S L+  G+    FS
Sbjct: 194 LGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFS 253

Query: 279 ICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
            C    EN  G +  G+     + S  + P+      Y + ++S  +    L        
Sbjct: 254 HCLKGGENGGGVLVLGE---ILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFA 310

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCYNASSEEML 386
           T +    +VDSG +  +L  E Y   V      VS  SK I  +GN    CY  S+    
Sbjct: 311 TTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ---CYLVSNSVGD 367

Query: 387 KVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 444
             P + L F    S V+   +++  +   +   ++C+     +  + I+G   +     V
Sbjct: 368 IFPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFV 427

Query: 445 FDRENLKLAWSHSKCEEVIDKS 466
           +D  N ++ W+   C   ++ S
Sbjct: 428 YDLANQRIGWADYNCSLAVNVS 449


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/377 (25%), Positives = 163/377 (43%), Gaps = 30/377 (7%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T + +G+P   F V +D GS++LWV C  C  C   S      L   L+ +D SSSS
Sbjct: 65  LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSG-----LGIQLNFFDSSSSS 119

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           ++  V CS P+C S      + C S  D C Y   Y  + + +SGY V D L+  +    
Sbjct: 120 TAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYG-DGSGTSGYYVSDTLYFDAILGQ 178

Query: 223 APQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
           +   +  + ++ GC   Q+G       A DG+ G G G++SV S L+  G+    FS C 
Sbjct: 179 SLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL 238

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
             + SG       G   +    + P+      Y + + S  +    L        T +  
Sbjct: 239 KGDGSGGGIL-VLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNSQ 297

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR--ISLQGNSWKYCYNASSEEMLKVPDM 391
             +VDSG +  +L  E Y   V   + +VS     I+ +GN    CY  S+      P  
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSKGNQ---CYLVSTSVSQMFPLA 354

Query: 392 RLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
              F+   S V++  +++  F  + G  ++C+      G   I+G   +     V+D   
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQG-VTILGDLVLKDKIFVYDLVR 413

Query: 450 LKLAWSHSKCEEVIDKS 466
            ++ W++  C   ++ S
Sbjct: 414 QRIGWANYDCSLSVNVS 430


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/383 (25%), Positives = 168/383 (43%), Gaps = 37/383 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IG+P   F V +D GS++LWV C  C  C   S      +  +L  Y+P SSS
Sbjct: 72  LYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKS-----DIGVDLQLYNPKSSS 126

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +S  ++C  P C +        CK     C Y   Y  + ++++GY V+D + L     +
Sbjct: 127 TSTLITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYG-DGSATAGYFVNDYIQLQRAVGN 184

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              S    S++ GCG KQ+G     + A DG++G G  + S+ S LA  G ++  F+ C 
Sbjct: 185 HKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL 244

Query: 282 DENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
           D    G +F  G+      ++T  +P    Y+    GV+   +G++ L        T   
Sbjct: 245 DSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVK---VGDTALDLPLGLFETSYK 301

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDM 391
             A++DSG +  +LP  IY  ++ K   L +   + L+    ++ C+          P +
Sbjct: 302 RGAIIDSGTTLAYLPDSIYLPLMEKI--LGAQPDLKLRTVDDQFTCFVFDKNVDDGFPTV 359

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVF 445
              F ++    +  H + F   +   V+C+        S DG +  ++G   +    + +
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRD--DVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYY 417

Query: 446 DRENLKLAWSHSKCEEVIDKSHV 468
           + EN  + W+   C   I    V
Sbjct: 418 NLENQTIGWTEYNCSSGIKLKDV 440


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 105/420 (25%), Positives = 181/420 (43%), Gaps = 60/420 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C  C                ++ P  
Sbjct: 88  YYTTRLWI--GTPPQRFALIVDTGSTVTYVPCSTCEHCG----------RHQDPKFQPDL 135

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           S + + V C+ P C    +C    + C Y   Y+ E +SSSG L +D++   + S+ APQ
Sbjct: 136 SETYQPVKCT-PDC----NCDGDTNQCMYDRQYA-EMSSSSGVLGEDVVSFGNLSELAPQ 189

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG+MGLG GD+S+   L    +I +SFS+C+   D
Sbjct: 190 RAV-----FGCENDETGDLYSQRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMD 243

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
            G    +  G   P     T   P    Y  Y + ++   +       N  +       +
Sbjct: 244 VGGGAMILGGISPPEDMVFTHSDPDRSPY--YNINLKEMHVAGKKLQLNPKVFDGKHGTV 301

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV-- 388
           +DSG ++ +LP   +    + F + +  +R SL+       N    C+  +  ++ ++  
Sbjct: 302 LDSGTTYAYLPETAF----LAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAK 357

Query: 389 --PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVF 445
             P + ++F       +    + F  ++    +CL V S   D   ++G  F+    +++
Sbjct: 358 SFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMY 417

Query: 446 DRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTA 505
           DREN K+ +  + C E+ +  H          +P+PLP+  +   +N   A  PS A +A
Sbjct: 418 DRENSKIGFWKTNCSELWETLHT-------SDAPSPLPSNSE--VTNLTKAFAPSVAPSA 468


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 96/361 (26%), Positives = 163/361 (45%), Gaps = 25/361 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T I +G+P   + V +D GS++LW+ C+ C +C        T+L+  LS +D ++SS
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPT-----KTNLNFRLSLFDMNASS 127

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +SK V C    C   S   S +    C Y   Y+ E TS  G  + D+L L   +     
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDMLTLEQVTGDLKT 186

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
             +   V+ GCG  Q+G   +G +A DGVMG G  + SV S LA  G  +  FS C D  
Sbjct: 187 GPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246

Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCLTQSGFQALVDS 339
             G +F  G       ++T  +P    Y+   +G++    S  +  S +   G   +VDS
Sbjct: 247 KGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGG--TIVDS 304

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKN 398
           G +  + P  +Y  ++   + +++ + + L      + C++ S+      P +   F  +
Sbjct: 305 GTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDS 361

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGII--GQNFMMGHRIVFDRENLKLAW 454
               V  H + F   E    F      ++TD    +I  G   +    +V+D +N  + W
Sbjct: 362 VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGW 421

Query: 455 S 455
           +
Sbjct: 422 A 422


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/424 (25%), Positives = 177/424 (41%), Gaps = 35/424 (8%)

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
           +E L   D  R   R  L          + FP EGS   F       L++T + +G+P  
Sbjct: 47  VEHLRERDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFM----VGLYFTRVKLGSPPK 102

Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC- 179
            + V +D GS++LWV C  C  C   S      L+  L  ++P +SS+S  + CS   C 
Sbjct: 103 EYFVQIDTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCT 157

Query: 180 ----KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
                S + C++  + PC Y   Y  + + +SGY V D ++  S   +   ++  +S++ 
Sbjct: 158 AALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVSDTMYFDSVMGNEQTANSSASIVF 216

Query: 235 GCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
           GC   Q+G       A DG+ G G   +SV S L   G+    FS C   +D+G      
Sbjct: 217 GCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGIL-V 275

Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTF 345
            G   +    + P+      Y + +ES         I +S  T S  Q  +VDSG +  +
Sbjct: 276 LGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAY 335

Query: 346 LPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           L    Y   V      VS    SL  +GN    C+  SS      P + L F    +  V
Sbjct: 336 LADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVTSSSVDSSFPTVSLYFMGGVAMTV 392

Query: 404 R--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDRENLKLAWSHSKCE 460
           +  N++      +   ++C+      G    I  + ++  +I V+D  N+++ W+   C 
Sbjct: 393 KPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDCS 452

Query: 461 EVID 464
             ++
Sbjct: 453 TSVN 456


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/381 (25%), Positives = 168/381 (44%), Gaps = 42/381 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I+IG+P   + V +D GS++LWV    C  C   S      L   L++YDP+ S 
Sbjct: 84  LYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSG-----LGIELTQYDPAGSG 138

Query: 168 SSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           ++  V C    C + S       +C S   PC +   Y  + +S++G+ V D +     S
Sbjct: 139 TT--VGCEQEFCVANSAASGVPPACPSAASPCQFRITYG-DGSSTTGFYVTDFVQYNQVS 195

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
            +   +    S+  GCG +  G     + A DG++G G  D S+ S LA A  ++  F+ 
Sbjct: 196 GNGQTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAH 255

Query: 280 CFDENDSGSVF-FGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 335
           C D    G +F  G+   P   ++T  +P    Y+    G+    +G + L    S F +
Sbjct: 256 CLDTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGIS---VGGATLQLPTSTFDS 312

Query: 336 ------LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
                 ++DSG +  +LP E+Y  ++   FDK      ++++      C+  S     + 
Sbjct: 313 GDSKGTIIDSGTTLAYLPREVYRTLLTAVFDK---HPDLAVRNYEDFICFQFSGSLDEEF 369

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHR 442
           P +   F  + +  V  H + F    G  ++C+      V + DG D  ++G   +    
Sbjct: 370 PVITFSFEGDLTLNVYPHDYLF--QNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKL 427

Query: 443 IVFDRENLKLAWSHSKCEEVI 463
           +V+D E   + W+   C   I
Sbjct: 428 VVYDLEKQVIGWTDYNCSSSI 448


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 104/433 (24%), Positives = 188/433 (43%), Gaps = 28/433 (6%)

Query: 52  SWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGS-QTHFFGNQFYWL 109
           ++P    VE  EL   +  + +  R+ L     SS   ++ FP +GS   +  G++   L
Sbjct: 47  AFPLDELVELSELRARD--RVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTML 104

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + +G+P   F V +D GS++LWV C      P S    + L  +L  +D   S ++
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHS----SGLGIDLHFFDAPGSLTA 160

Query: 170 KNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
            +V+CS P+C S      + C S  + C Y   Y  + + +SGY + D  +  +    + 
Sbjct: 161 GSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMTDTFYFDAILGESL 218

Query: 225 QSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
            ++  + ++ GC   Q+G       A DG+ G G G +SV S L+  G+    FS C   
Sbjct: 219 VANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKG 278

Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA------ 335
           + SG   F   G        + P+      Y + + S  +    L      F+A      
Sbjct: 279 DGSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGT 337

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           +VD+G + T+L  E Y   +      VS     +  N  + CY  S+      P + L F
Sbjct: 338 IVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-CYLVSTSISDMFPSVSLNF 396

Query: 396 SKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
           +   S ++R  +++F +   +G +++C+       +  I+G   +     V+D    ++ 
Sbjct: 397 AGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIG 456

Query: 454 WSHSKCEEVIDKS 466
           W+   C   ++ S
Sbjct: 457 WASYDCSMSVNVS 469


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/383 (25%), Positives = 167/383 (43%), Gaps = 37/383 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IG+P   F V +D GS++LWV C  C  C   S      +  +L  Y+P SSS
Sbjct: 72  LYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKS-----DIGVDLQLYNPKSSS 126

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +S  ++C  P C +        CK     C Y   Y  + ++++GY V+D + L     +
Sbjct: 127 TSTLITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYG-DGSATAGYFVNDYIQLQRAVGN 184

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              S    S++ GCG KQ+G     + A DG++G G  + S+ S LA  G ++  F+ C 
Sbjct: 185 HKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL 244

Query: 282 DENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
           D    G +F  G+       +T  +P    Y+    GV+   +G++ L        T   
Sbjct: 245 DSISGGGIFAIGEVVEPKLXNTPVVPNQAHYNVVLNGVK---VGDTALDLPLGLFETSYK 301

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDM 391
             A++DSG +  +LP  IY  ++ K   L +   + L+    ++ C+          P +
Sbjct: 302 RGAIIDSGTTLAYLPESIYLPLMEKI--LGAQPDLKLRTVDDQFTCFVFDKNVDDGFPTV 359

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQNFMMGHRIVF 445
              F ++    +  H + F   +   V+C+        S DG +  ++G   +    + +
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRD--DVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYY 417

Query: 446 DRENLKLAWSHSKCEEVIDKSHV 468
           + EN  + W+   C   I    V
Sbjct: 418 NLENQTIGWTEYNCSSGIKLKDV 440


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 105/421 (24%), Positives = 177/421 (42%), Gaps = 41/421 (9%)

Query: 73  QKTRVKLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           ++ R +  + +  SR +LL        FP EGS   +       L++T + +G P   F 
Sbjct: 48  EELRRRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYM----VGLYFTRVKLGNPAKEFF 103

Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-- 181
           V +D GS++LWV C  C  C P S    + L+  L  ++P SSS++  ++CS   C +  
Sbjct: 104 VQIDTGSDILWVTCSPCTGC-PTS----SGLNIQLESFNPDSSSTASRITCSDDRCTAGF 158

Query: 182 ---RSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
               + C+   S   PC Y   Y  + + +SGY V D +   +   +   ++  +S++ G
Sbjct: 159 QTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFG 217

Query: 236 CGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ 294
           C   Q+G       A DG+ G G   +SV S L   G+    FS C   +D+G       
Sbjct: 218 CSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGIL-VL 276

Query: 295 GPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTFL 346
           G   +    + P+      Y + +ES         I +S  T S  Q  +VDSG +  +L
Sbjct: 277 GEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYL 336

Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-- 404
               Y   V      VS    SL     + C+  SS      P + L F    +  V+  
Sbjct: 337 ADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFITSSSVDSSFPTVTLYFMGGVAMSVKPE 395

Query: 405 NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
           N++      +   ++C+      G +  I+G   +     V+D  N+++ W+   C   +
Sbjct: 396 NYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDCSMSV 455

Query: 464 D 464
           +
Sbjct: 456 N 456


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 105/421 (24%), Positives = 177/421 (42%), Gaps = 41/421 (9%)

Query: 73  QKTRVKLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           ++ R +  + +  SR +LL        FP EGS   +       L++T + +G P   F 
Sbjct: 50  EELRRRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYM----VGLYFTRVKLGNPAKEFF 105

Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-- 181
           V +D GS++LWV C  C  C P S    + L+  L  ++P SSS++  ++CS   C +  
Sbjct: 106 VQIDTGSDILWVTCSPCTGC-PTS----SGLNIQLESFNPDSSSTASRITCSDDRCTAGF 160

Query: 182 ---RSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
               + C+   S   PC Y   Y  + + +SGY V D +   +   +   ++  +S++ G
Sbjct: 161 QTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFG 219

Query: 236 CGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ 294
           C   Q+G       A DG+ G G   +SV S L   G+    FS C   +D+G       
Sbjct: 220 CSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGIL-VL 278

Query: 295 GPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTFL 346
           G   +    + P+      Y + +ES         I +S  T S  Q  +VDSG +  +L
Sbjct: 279 GEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYL 338

Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-- 404
               Y   V      VS    SL     + C+  SS      P + L F    +  V+  
Sbjct: 339 ADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFITSSSVDSSFPTVTLYFMGGVAMSVKPE 397

Query: 405 NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
           N++      +   ++C+      G +  I+G   +     V+D  N+++ W+   C   +
Sbjct: 398 NYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDCSMSV 457

Query: 464 D 464
           +
Sbjct: 458 N 458


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/390 (26%), Positives = 170/390 (43%), Gaps = 42/390 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP   + V +D GS+++WV C QC +C   S     SL   L+ YD   S 
Sbjct: 97  LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKS-----SLGMELTLYDIKESL 151

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           + K VSC    C +      S C +    C Y   Y+ + +SS GY V DI+     S  
Sbjct: 152 TGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEIYA-DGSSSFGYFVRDIVQYDQVSGD 209

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
              +S   SVI GC   Q+G      A DG++G G  + S+ S LA +G ++  F+ C D
Sbjct: 210 LETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLD 269

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT----------QSG 332
             + G +F    G   Q   +  P+      Y V +++  +G   L           + G
Sbjct: 270 GLNGGGIF--AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKG 327

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG +  +LP  +Y +++ K     S  ++    + +  C+  S       P + 
Sbjct: 328 --TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT-CFQYSESLDDGFPAVT 384

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQNFMMGHRIVFD 446
             F  +    V  H + F  +    ++C+      + S D  +  ++G   +    +++D
Sbjct: 385 FHFENSLYLKVHPHEYLFSYD---GLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYD 441

Query: 447 RENLKLAWSHSKCE---EVIDK--SHVHLV 471
            EN  + W+   C    +V+D+    VHLV
Sbjct: 442 LENQVIGWTEYNCSSSIKVVDEQSGTVHLV 471


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 107/424 (25%), Positives = 177/424 (41%), Gaps = 35/424 (8%)

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
           +E L   D  R   R  L          + FP EGS   F       L++T + +G+P  
Sbjct: 47  VEHLRERDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFM----VGLYFTRVKLGSPPK 102

Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC- 179
            + V +D GS++LWV C  C  C   S      L+  L  ++P +SS+S  + CS   C 
Sbjct: 103 EYFVQIDTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCT 157

Query: 180 ----KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
                S + C++  + PC Y   Y  + + +SGY V D ++  +   +   ++  +S++ 
Sbjct: 158 AALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVF 216

Query: 235 GCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
           GC   Q+G       A DG+ G G   +SV S L   G+    FS C   +D+G      
Sbjct: 217 GCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGIL-V 275

Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTF 345
            G   +    + P+      Y + +ES         I +S  T S  Q  +VDSG +  +
Sbjct: 276 LGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAY 335

Query: 346 LPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           L    Y   V      VS    SL  +GN    C+  SS      P + L F    +  V
Sbjct: 336 LADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVTSSSVDSSFPTVSLYFMGGVAMTV 392

Query: 404 R--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDRENLKLAWSHSKCE 460
           +  N++      +   ++C+      G    I  + ++  +I V+D  N+++ W+   C 
Sbjct: 393 KPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDCS 452

Query: 461 EVID 464
             ++
Sbjct: 453 TSVN 456


>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
           [Cucumis sativus]
          Length = 420

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 90/319 (28%), Positives = 140/319 (43%), Gaps = 28/319 (8%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP+  + V +D GS+++WV C QC +C   S     SL   L+ YD   S+
Sbjct: 86  LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTS-----SLGMELTPYDLEEST 140

Query: 168 SSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           + K VSC    C        S C +    CPY+  Y  + +S++GY V D +     S  
Sbjct: 141 TGKLVSCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYG-DGSSTAGYFVKDYVQYNRVSGD 198

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
              ++   S+  GCG +Q+G        A DG++G G  + S+ S LA    ++  F+ C
Sbjct: 199 LETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHC 258

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA--- 335
            D  + G +F    G   Q   +  P+      Y V +    +G+  L  S   F+A   
Sbjct: 259 LDGTNGGGIF--AMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDR 316

Query: 336 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDM 391
              ++DSG +  +LP  IY  +V K   L     + +Q    +Y C+  S       P +
Sbjct: 317 KGTIIDSGTTLAYLPELIYEPLVAKI--LSQQHNLEVQTIHGEYKCFQYSERVDDGFPPV 374

Query: 392 RLIFSKNQSFVVRNHIFSF 410
              F  +    V  H + F
Sbjct: 375 IFHFENSLLLKVYPHEYLF 393


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 100/421 (23%), Positives = 188/421 (44%), Gaps = 64/421 (15%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C QC                ++ P S
Sbjct: 111 YYTTRLWI--GTPPQMFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFQPES 158

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + V C+        +C   +  C Y   Y+ E ++SSG L +D++   + S+ APQ
Sbjct: 159 SSTYQPVKCT-----IDCNCDGDRMQCVYERQYA-EMSTSSGVLGEDVISFGNQSELAPQ 212

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG+MGLG GD+S+   L    +I +SFS+C+   D
Sbjct: 213 RAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMD 266

Query: 286 --SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQALV 337
              G++  G   P +  + ++    ++   Y + ++   +       N+ +       ++
Sbjct: 267 VGGGAMVLGGISPPSDMTFAY-SDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVL 325

Query: 338 DSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV----P 389
           DSG ++ +LP      + + +VK  +L S K+IS    ++   C++ +  ++ ++    P
Sbjct: 326 DSGTTYAYLPEAAFLAFKDAIVK--ELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFP 383

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIV 444
            + ++F     + +    + F  ++    +CL +     D      GII +N +    ++
Sbjct: 384 VVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTL----VM 439

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVPPP-----------AGQSPNPLPTTEQQSTSNG 493
           +DRE  K+ +  + C E+ ++    + PPP               P+  P+  Q + S G
Sbjct: 440 YDREQTKIGFWKTNCAELWERLQTSIAPPPLPPNSGVRNSSEALEPSVAPSVSQHNASPG 499

Query: 494 Q 494
           +
Sbjct: 500 E 500


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  109 bits (272), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 182/436 (41%), Gaps = 63/436 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C QC                ++ P  
Sbjct: 79  YYTTRLWI--GTPPQEFALIVDTGSTVTYVPCSTCKQCG----------KHQDPKFQPEL 126

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SSS K + C +P C    +C      C Y   Y+ E +SSSG L +D++   + S+  PQ
Sbjct: 127 SSSYKALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNESQLTPQ 180

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--E 283
            +V      GC   +TG      A DG+MGLG G +SV   L   G+I++ FS+C+   E
Sbjct: 181 RAV-----FGCENVETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 234

Query: 284 NDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
              G++  G   P      S   P    Y  Y + ++   +    L             +
Sbjct: 235 VGGGAMVLGKISPPAGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFNGKHGTV 292

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY---CYNASSEEMLKV---- 388
           +DSG ++ + P E +  +     K + S KRI   G    Y   C++ +  ++ ++    
Sbjct: 293 LDSGTTYAYFPKEAFIAIKDAIIKEIPSLKRI--HGPDPNYDDVCFSGAGRDVAEIHNFF 350

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 448
           P++ + F   Q  ++    + F   +    +CL +        ++G   +    + +DRE
Sbjct: 351 PEIDMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 410

Query: 449 NLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSK 508
           N KL +  + C ++  +    L  P   +SP P     Q  +SN      PS AK+    
Sbjct: 411 NDKLGFLKTNCSDLWRR----LAAP---ESPAPTSPISQNKSSN----ISPSPAKS---- 455

Query: 509 SIAASAQQLDSVLRVA 524
              +    L  VLRV 
Sbjct: 456 --ESPTTDLPGVLRVG 469


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  108 bits (271), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 108/416 (25%), Positives = 181/416 (43%), Gaps = 68/416 (16%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           T + IGTP   F + +D+GS + +VPC  C QC           +     + P  SS+  
Sbjct: 90  TRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCG----------NHQDPRFQPDLSSTYS 139

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            V C+        +C S K+ C Y   Y+ E +SSSG L +DI+   + S+  PQ +V  
Sbjct: 140 PVKCN-----VDCTCDSDKNQCTYERQYA-EMSSSSGVLGEDIVSFGTESELKPQRAV-- 191

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
               GC   +TG      A DG+MGLG G +S+   L   G+I +SFS+C+   D G   
Sbjct: 192 ---FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGA 247

Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
            V      P     T    +   Y  Y + ++   +    L             ++DSG 
Sbjct: 248 MVLGAMPAPPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGT 305

Query: 342 SFTFLPTEIYAEVVVKFDKLVSS-----KRISLQGNSWK-YCYNASSEEMLKV----PDM 391
           ++ +LP + +    V F   VSS     K+I    +++K  C+  +   + ++    P +
Sbjct: 306 TYAYLPEQAF----VAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKV 361

Query: 392 RLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIV 444
            ++F   Q  S    N++F   + EG   +CL V     D      GI+ +N +    + 
Sbjct: 362 DMVFGNGQKLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VT 415

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 500
           +DR N K+ +  + C E+ ++         +G +P+P P+ +    ++   A  PS
Sbjct: 416 YDRHNEKIGFWKTNCSELWERLQ-------SGGAPSPAPSNDPGPQADLSPAPAPS 464


>gi|413924528|gb|AFW64460.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
          Length = 146

 Score =  108 bits (271), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 57/124 (45%), Positives = 73/124 (58%), Gaps = 17/124 (13%)

Query: 25  SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
           SS++VHR SDEA+     + G       WP++ S EY   L+ +D +RQK R+ + S   
Sbjct: 28  SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79

Query: 85  SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
                    S+G  T   GN   WL+Y W+D+GTP  SFLVALD GS+L WVPC CIQCA
Sbjct: 80  ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130

Query: 145 PLSA 148
           PLS 
Sbjct: 131 PLSG 134


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 104/434 (23%), Positives = 193/434 (44%), Gaps = 35/434 (8%)

Query: 52  SWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGSQTHFFGNQFYWLH 110
           ++P    VE  EL   +  + +  R+ L     SS   ++ FP +GS   +       L+
Sbjct: 47  AFPLDEPVELSELRARD--RVRHARILLGGGRQSSVGGVVDFPVQGSSDPYL----VGLY 100

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           +T + +G+P   F V +D GS++LWV C      P S    + L  +L  +D   S ++ 
Sbjct: 101 FTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHS----SGLGIDLHFFDAPGSFTAG 156

Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +V+CS P+C S      + C S  + C Y   Y  + + +SGY + D  +  +    +  
Sbjct: 157 SVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMTDTFYFDAILGESLV 214

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           ++  + ++ GC   Q+G       A DG+ G G G +SV S L+  G+    FS C   +
Sbjct: 215 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274

Query: 285 DSGSVFF--GDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQA----- 335
            SG   F  G+        +  LP    Y+     +GV    +    +  + F+A     
Sbjct: 275 GSGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILP---IDAAVFEASNTRG 331

Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
            +VD+G + T+L  E Y   +      V S+ ++L  ++ + CY  S+      P + L 
Sbjct: 332 TIVDTGTTLTYLVKEAYDPFLNAISNSV-SQLVTLIISNGEQCYLVSTSISDMFPPVSLN 390

Query: 395 FSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
           F+   S ++R  +++F +   +G +++C+       +  I+G   +     V+D    ++
Sbjct: 391 FAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRI 450

Query: 453 AWSHSKCEEVIDKS 466
            W++  C   ++ S
Sbjct: 451 GWANYDCSMSVNVS 464


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 109/432 (25%), Positives = 183/432 (42%), Gaps = 53/432 (12%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           + V +     N+   +E+LL +  +      +L S+      Q   P +   +   G+  
Sbjct: 75  IQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGVFQEKQATLPVQSGASIGSGD-- 132

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
              +   + +GTP   F +  D GS+L W      QC P + + Y   +  L   DP+ S
Sbjct: 133 ---YAVTVGLGTPKKEFTLIFDTGSDLTWT-----QCEPCAKTCYKQKEPRL---DPTKS 181

Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           +S KN+SCS   CK        SC S    C Y   Y  + + S G+   + L L+S   
Sbjct: 182 TSYKNISCSSAFCKLLDTEGGESCSS--PTCLYQVQYG-DGSYSIGFFATETLTLSS--- 235

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
               S+V  + + GCG++ +G +  GAA  G++GLG   +S+PS  A+    +  FS C 
Sbjct: 236 ----SNVFKNFLFGCGQQNSGLF-RGAA--GLLGLGRTKLSLPSQTAQK--YKKLFSYCL 286

Query: 282 DENDS--GSVFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSCLT 329
             + S  G + FG Q     ++  F P+ E + +            VG     I  S  +
Sbjct: 287 PASSSSKGYLSFGGQ---VSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFS 343

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
            SG   ++DSG   T LP+  Y+ +   F KL++    +   + +  CY+ S  E +K+P
Sbjct: 344 TSG--TVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKIP 401

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIGQNFMMGHRIVFDR 447
            + + F       +      +P N G    CL       D    I G      +++V+D 
Sbjct: 402 KVGVSFKGGVEMDIDVSGILYPVN-GLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDD 460

Query: 448 ENLKLAWSHSKC 459
              ++ ++ S C
Sbjct: 461 AKGRVGFAPSGC 472


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  108 bits (269), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 115/458 (25%), Positives = 192/458 (41%), Gaps = 67/458 (14%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
           FS  L+HR      E  +S   N S+  S   KN+V          + R K R++L  N+
Sbjct: 29  FSINLIHR------ESPLSPFYNPSLTPSERIKNTV-------LRSFARSKRRLRLSQND 75

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQ 142
           + S   +  P E    +    +FY        IGTP V      D GS+L+WV C  C +
Sbjct: 76  DRSPGTITIPDEPITEYLM--RFY--------IGTPPVERFAIADTGSDLIWVQCAPCEK 125

Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADY 198
           C P          +N   +DP  SS+ K V C    C     S+ +C      C Y   Y
Sbjct: 126 CVP----------QNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIY 175

Query: 199 STEDTSSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
                     LV  IL   S +  +  ++++   +  GC      +  +     G++GLG
Sbjct: 176 GDHT------LVSGILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLG 229

Query: 258 LGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQ----QSTSFL--PIG 308
           +G +S+ S L     I   FS CF     N +  + FG+     Q     ST  +   IG
Sbjct: 230 VGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIG 287

Query: 309 EKYDAYFVGVESYCIGNSCLTQSGFQA----LVDSGASFTFLPTEIYAEVVVKFDKLVSS 364
             Y  Y++ +E   IGN  +  S  Q     L+DSG SFT L    Y + V    ++   
Sbjct: 288 PSY--YYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGV 345

Query: 365 KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 424
           + + +    + +C+    +   + PD+  +F+  +  V  +++F   E E   + C+  +
Sbjct: 346 EAVKIPPLVYNFCFENKGKRK-RFPDVVFLFTGAKVRVDASNLF---EAEDNNLLCMVAL 401

Query: 425 ST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
            T D D  I G +  +G+++ +D +   ++++ + C +
Sbjct: 402 PTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPADCAK 439


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 97/374 (25%), Positives = 162/374 (43%), Gaps = 37/374 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  I IGTP   + V +D GS+++WV C QC +C   S     SL   L+ YD   S 
Sbjct: 97  LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKS-----SLGMELTLYDIKESL 151

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           + K VSC    C +      S C +    C Y   Y+ + +SS GY V DI+     S  
Sbjct: 152 TGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEIYA-DGSSSFGYFVRDIVQYDQVSGD 209

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
              +S   SVI GC   Q+G      A DG++G G  + S+ S LA +G ++  F+ C D
Sbjct: 210 LETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLD 269

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT----------QSG 332
             + G +F    G   Q   +  P+      Y V +++  +G   L           + G
Sbjct: 270 GLNGGGIF--AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKG 327

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG +  +LP  +Y +++ K     S  ++    + +  C+  S       P + 
Sbjct: 328 --TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT-CFQYSESLDDGFPAVT 384

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQNFMMGHRIVFD 446
             F  +    V  H + F  +    ++C+      + S D  +  ++G   +    +++D
Sbjct: 385 FHFENSLYLKVHPHEYLFSYD---GLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYD 441

Query: 447 RENLKLAWSHSKCE 460
            EN  + W+   C+
Sbjct: 442 LENQVIGWTEYNCK 455


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 104/433 (24%), Positives = 189/433 (43%), Gaps = 56/433 (12%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C QC                ++ P  
Sbjct: 80  YYTTRLWI--GTPPQMFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFQPDL 127

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + V C+        +C + +  C Y   Y+ E ++SSG L +D++   + S+ APQ
Sbjct: 128 SSTYQPVKCT-----LDCNCDNDRMQCVYERQYA-EMSTSSGVLGEDVVSFGNQSELAPQ 181

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG+MGLG GD+S+   L    ++ +SFS+C+   D
Sbjct: 182 RAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD 235

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
            G    V  G   P+        P+   Y  Y + ++   +       N  +      ++
Sbjct: 236 VGGGAMVLGGISPPSDMVFAQSDPVRSPY--YNIDLKEIHVAGKRLPLNPSVFDGKHGSV 293

Query: 337 VDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV---- 388
           +DSG ++ +LP E    + E +VK  +L S  +IS    ++   C++ +  ++ ++    
Sbjct: 294 LDSGTTYAYLPEEAFLAFKEAIVK--ELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTF 351

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDR 447
           P + +IF     + +    + F  ++    +CL +     D   ++G   +    +++DR
Sbjct: 352 PVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDR 411

Query: 448 ENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPS 507
           E  K+ +  + C E+ ++  +   PPP        P TE    +N   +  PS A +   
Sbjct: 412 EQTKIGFWKTNCAELWERLQISSAPPPMP------PNTE---ATNSTKSVDPSVAPSVSQ 462

Query: 508 KSIAASAQQLDSV 520
            +I     Q+  +
Sbjct: 463 HNIPRGEFQIAQI 475


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/394 (27%), Positives = 167/394 (42%), Gaps = 63/394 (15%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCI--QCAPLSASYYTSLDRNLSEYDPS 164
           Y   Y  + +GTP   F V +D GS + +VPC      C P             + +DP+
Sbjct: 59  YGYFYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGP---------HHKDAAFDPA 109

Query: 165 SSSSSKNVSCSHPLCK-SRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           SSSSS  + C    C   R  C  S K  C Y   Y+ E +SS+G LV D L L   +  
Sbjct: 110 SSSSSAVIGCDSDKCICGRPPCGCSEKRECTYQRTYA-EQSSSAGLLVSDQLQLRDGAVE 168

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
                    V+ GC  K+TG   +  A DG++GLG  +VS+ + LA +G+I + F++CF 
Sbjct: 169 ---------VVFGCETKETGEIYNQEA-DGILGLGNSEVSLVNQLAGSGVIDDVFALCFG 218

Query: 283 --ENDSGSVFFGDQGPA----TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 330
             E D G++  GD   A      Q T+ L        Y V +E+  +G   L       +
Sbjct: 219 SVEGD-GALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYE 277

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK--------------Y 376
            G+  ++DSG +FT+LP+E +      F + VS+  +    NS K               
Sbjct: 278 EGYGTVLDSGTTFTYLPSEAFQ----LFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDI 333

Query: 377 CY-------NASSEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 428
           C+       +A   ++ KV P   L F+           + F        +CL V     
Sbjct: 334 CFGGAPHAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVFDNGA 393

Query: 429 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
              ++G        + +DR N ++ +  + C+E+
Sbjct: 394 SGTLLGGISFRNILVQYDRRNRRVGFGAASCQEI 427


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 102/406 (25%), Positives = 176/406 (43%), Gaps = 57/406 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C QC                ++ P  
Sbjct: 75  YYTTRLWI--GTPPQEFALIVDTGSTVTYVPCSTCKQCG----------KHQDPKFQPEL 122

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           S+S + + C +P C    +C      C Y   Y+ E +SSSG L +D++   + S+ +PQ
Sbjct: 123 STSYQALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNESQLSPQ 176

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--E 283
            +V      GC  ++TG      A DG+MGLG G +SV   L   G+I++ FS+C+   E
Sbjct: 177 RAV-----FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 230

Query: 284 NDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
              G++  G   P      S   P    Y  Y + ++   +    L             +
Sbjct: 231 VGGGAMVLGKISPPPGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFNGKHGTV 288

Query: 337 VDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV-- 388
           +DSG ++ + P E +  +   V+K  ++ S KRI   G    Y   C++ +  ++ ++  
Sbjct: 289 LDSGTTYAYFPKEAFIAIKDAVIK--EIPSLKRI--HGPDPNYDDVCFSGAGRDVAEIHN 344

Query: 389 --PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
             P++ + F   Q  ++    + F   +    +CL +        ++G   +    + +D
Sbjct: 345 FFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYD 404

Query: 447 RENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 492
           REN KL +  + C ++  +    L  P   +SP P     Q  +SN
Sbjct: 405 RENDKLGFLKTNCSDIWRR----LAAP---ESPAPTSPISQNKSSN 443


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 99/387 (25%), Positives = 169/387 (43%), Gaps = 64/387 (16%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
           ++  I++G P    LV +D GS+L+W     +QC P    Y     R ++  YDP SSS+
Sbjct: 88  YFAVINVGDPPTRALVVIDTGSDLIW-----LQCVPCRHCY-----RQVTPLYDPRSSST 137

Query: 169 SKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
            + + C+ P C+       C +    C Y+  Y  + ++SSG L  D L         P 
Sbjct: 138 HRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYG-DGSASSGDLATDRLVF-------PD 189

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--- 282
            +   +V +GCG    G  L+ AA  G++G+G G +S P+ LA A    + FS C     
Sbjct: 190 DTHVHNVTLGCGHDNVG-LLESAA--GLLGVGRGQLSFPTQLAPA--YGHVFSYCLGDRL 244

Query: 283 ---ENDSGSVFFGDQGPATQQSTSFLPIG---EKYDAYFVGVESYCIGNSCLTQSGFQ-- 334
              +N S  + FG        ST+F P+     +   Y+V +  + +G   +T  GF   
Sbjct: 245 SRAQNGSSYLVFGRT--PEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVT--GFSNA 300

Query: 335 ------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS----KRISLQGNSWKYCY 378
                        +VDSG + +    + YA V   FD   ++    ++++ + + +  CY
Sbjct: 301 SLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACY 360

Query: 379 ----NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG--FTVFCLTVMSTDGDYGI 432
               N +    ++VP + L F+      +    +  P   G   T FCL + + D    +
Sbjct: 361 DLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNV 420

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +G     G  +VFD E  ++ ++ + C
Sbjct: 421 LGNVQQQGFGLVFDVERGRIGFTPNGC 447


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 108/416 (25%), Positives = 180/416 (43%), Gaps = 68/416 (16%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           T + IGTP   F + +D+GS + +VPC  C QC           +     + P  SS+  
Sbjct: 90  TRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCG----------NHQDPRFQPDLSSTYS 139

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            V C+        +C S K+ C Y   Y+ E +SSSG L +DI+   + S+  PQ +V  
Sbjct: 140 PVKCN-----VDCTCDSDKNQCTYERQYA-EMSSSSGVLGEDIVSFGTESELKPQRAV-- 191

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
               GC   +TG      A DG+MGLG G +S+   L   G+I +SFS+C+   D G   
Sbjct: 192 ---FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGA 247

Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
            V      P     T    +   Y  Y + ++   +    L             ++DSG 
Sbjct: 248 MVLGAMPAPPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGT 305

Query: 342 SFTFLPTEIYAEVVVKFDKLVSS-----KRISLQGNSWK-YCYNASSEEMLKV----PDM 391
           ++ +LP + +    V F   VSS     K+I     ++K  C+  +   + ++    P +
Sbjct: 306 TYAYLPEQAF----VAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKV 361

Query: 392 RLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIV 444
            ++F   Q  S    N++F   + EG   +CL V     D      GI+ +N +    + 
Sbjct: 362 DMVFGNGQKLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VT 415

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 500
           +DR N K+ +  + C E+ ++         +G +P+P P+ +    ++   A  PS
Sbjct: 416 YDRHNEKIGFWKTNCSELWERLQ-------SGGAPSPAPSNDPGPQADLSPAPAPS 464


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 105/427 (24%), Positives = 183/427 (42%), Gaps = 60/427 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C QC                ++ P  
Sbjct: 75  YYTTRLWI--GTPPQEFALIVDTGSTVTYVPCSTCKQCG----------KHQDPKFQPEL 122

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           S+S + + C +P C    +C      C Y   Y+ E +SSSG L +D++   + S+ +PQ
Sbjct: 123 STSYQALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNESQLSPQ 176

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--E 283
            +V      GC  ++TG      A DG+MGLG G +SV   L   G+I++ FS+C+   E
Sbjct: 177 RAV-----FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 230

Query: 284 NDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
              G++  G   P      S   P    Y  Y + ++   +    L             +
Sbjct: 231 VGGGAMVLGKISPPPGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFNGKHGTV 288

Query: 337 VDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV-- 388
           +DSG ++ + P E +    + V+K  ++ S KRI   G    Y   C++ +  ++ ++  
Sbjct: 289 LDSGTTYAYFPKEAFIAIKDAVIK--EIPSLKRI--HGPDPNYDDVCFSGAGRDVAEIHN 344

Query: 389 --PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
             P++ + F   Q  ++    + F   +    +CL +        ++G   +    + +D
Sbjct: 345 FFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYD 404

Query: 447 RENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAP 506
           REN KL +  + C ++  +    L  P   +SP P     Q  +SN    +P      +P
Sbjct: 405 RENDKLGFLKTNCSDIWRR----LAAP---ESPAPTSPISQNKSSN---ISPSPATSESP 454

Query: 507 SKSIAAS 513
           +  +  S
Sbjct: 455 TSHLPGS 461


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/417 (24%), Positives = 172/417 (41%), Gaps = 42/417 (10%)

Query: 78  KLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
           +L++ + +   +LL        FP +G+   F       L+YT I +G+P   F V +D 
Sbjct: 45  QLKARDKARHGRLLQSLGGVIDFPVDGTFDPFV----VGLYYTKIRLGSPPRDFYVQVDT 100

Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSS 184
           GS++LWV C      P +    + L   L+ +DP SS ++  VSCS   C      S S 
Sbjct: 101 GSDVLWVSCASCNGCPQT----SGLQIQLNFFDPGSSVTATPVSCSDQRCSWGIQSSDSG 156

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
           C    + C Y   Y  + + +SG+ V D+L        +   +  + V+ GC   QTG  
Sbjct: 157 CSVQNNLCAYTFQYG-DGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDL 215

Query: 245 LDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-ENDSGSVFFGDQGPATQQST 302
           +    A DG+ G G   +SV S LA  GL    FS C   EN  G +     G   + + 
Sbjct: 216 VKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGENGGGGILV--LGEIVEPNM 273

Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEV 354
            F P+      Y V + S  +    L        T +G   ++D+G +  +L    Y   
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333

Query: 355 VVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
           V      VS   + +  +GN    CY  ++      P + L F+   S  +    +   +
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ---CYVIATSVADIFPPVSLNFAGGASMFLNPQDYLIQQ 390

Query: 413 NE--GFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDRENLKLAWSHSKCEEVIDKS 466
           N   G  V+C+           I  + ++  +I V+D    ++ W++  C   ++ S
Sbjct: 391 NNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCSMSVNVS 447


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/372 (26%), Positives = 166/372 (44%), Gaps = 31/372 (8%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T I +G+P   + V +D GS++LWV C+ C +C        T+L+ +LS +D ++SS
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPS-----KTNLNFHLSLFDVNASS 127

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +SK V C    C   S   S +    C Y   Y+ E TS  G  + D L L   +     
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSE-GNFIRDKLTLEQVTGDLQT 186

Query: 226 SSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
             +   V+ GCG  Q+G      +A DGVMG G  + SV S LA  G  +  FS C D  
Sbjct: 187 GPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246

Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCLTQSGFQALVDS 339
             G +F  G       ++T  +P    Y+   +G++    +  +  S +   G   +VDS
Sbjct: 247 KGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPPSIMRNGG--TIVDS 304

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKN 398
           G +  + P  +Y  ++   + +++ + + L      + C++ S    +  P +   F  +
Sbjct: 305 GTTLAYFPKVLYDSLI---ETILARQPVKLHIVEDTFQCFSFSENVDVAFPPVSFEFEDS 361

Query: 399 QSFVVRNHIFSFP-ENEGFTVFCLTVMS---TDGDYG---IIGQNFMMGHRIVFDRENLK 451
               V  H + F  E E   ++C    +   T G+     ++G   +    +V+D EN  
Sbjct: 362 VKLTVYPHDYLFTLEKE---LYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLENEV 418

Query: 452 LAWSHSKCEEVI 463
           + W+   C   I
Sbjct: 419 IGWADHNCSSSI 430


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 109/434 (25%), Positives = 189/434 (43%), Gaps = 59/434 (13%)

Query: 70  WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWL----------HYTWIDIGTP 119
           WK++   + +Q  NN + N ++   + S+  F GN    L          ++  + +GTP
Sbjct: 121 WKQEVKVITIQQQNNLA-NAVVASLKSSKDEFSGNIMATLESGASLGTGEYFIDMFVGTP 179

Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
                + LD GS+L W     IQC P     Y   ++N   Y+P+ SSS +N+SC  P C
Sbjct: 180 PKHVWLILDTGSDLSW-----IQCDPC----YDCFEQNGPHYNPNESSSYRNISCYDPRC 230

Query: 180 KSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           +  SS      CK+    CPY  DY+    ++  + ++      ++     +      V+
Sbjct: 231 QLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVM 290

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGS-- 288
            GCG    G +        ++GLG G +S PS L    +  +SFS C  +   N S S  
Sbjct: 291 FGCGHWNKGFFHGAGG---LLGLGRGPLSFPSQLQ--SIYGHSFSYCLTDLFSNTSVSSK 345

Query: 289 -VFFGDQGPATQQSTSF--LPIGEKY---DAYFVGVESYCIGNSCL----------TQSG 332
            +F  D+      + +F  L  GE+      Y++ ++S  +G   L          ++  
Sbjct: 346 LIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGV 405

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG++ TF P   Y  +   F+K +  ++I+        CYN S    +++PD  
Sbjct: 406 GGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYG 465

Query: 393 LIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDRE 448
           + F+     +F   N+ + +  +E   V CL ++ T       IIG        I++D +
Sbjct: 466 IHFADGAVWNFPAENYFYQYEPDE---VICLAILKTPNHSHLTIIGNLLQQNFHILYDVK 522

Query: 449 NLKLAWSHSKCEEV 462
             +L +S  +C EV
Sbjct: 523 RSRLGYSPRRCAEV 536


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score =  106 bits (264), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 103/432 (23%), Positives = 185/432 (42%), Gaps = 31/432 (7%)

Query: 52  SWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGSQTHFFGNQFYWLH 110
           ++P    VE  EL   +  + +  R+ L     SS   ++ FP +GS   +       L+
Sbjct: 47  AFPLDELVELSELRARD--RVRHARILLGGGRQSSVGGVVDFPVQGSSDPYL----VGLY 100

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           +T + +G+P   F V +D GS++LWV C      P S    + L  +L  +D   S ++ 
Sbjct: 101 FTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHS----SGLGIDLHFFDAPGSLTAG 156

Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +V+CS P+C S      + C S  + C Y   Y  + + +SGY + D  +  +    +  
Sbjct: 157 SVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMTDTFYFDAILGESLV 214

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           ++  + ++ GC   Q+G       A DG+ G G G +SV S L+  G+    FS C   +
Sbjct: 215 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA------L 336
            SG   F   G        + P+      Y + + S  +    L      F+A      +
Sbjct: 275 GSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTI 333

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           VD+G + T+L  E Y   +      VS     +  N  + CY  S+      P + L F+
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-CYLVSTSISDMFPSVSLNFA 392

Query: 397 KNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
              S ++R  +++F +   +G +++C+       +  I+G   +     V+D    ++ W
Sbjct: 393 GGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGW 452

Query: 455 SHSKCEEVIDKS 466
           +   C   ++ S
Sbjct: 453 ASYDCSMSVNVS 464


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 98/375 (26%), Positives = 169/375 (45%), Gaps = 40/375 (10%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP ++F   +D GS+L W      QCAP + + +    +    YDP+ SS+ 
Sbjct: 96  YHMILSVGTPPLAFPAIIDTGSDLTWT-----QCAPCTTACFA---QPTPLYDPARSSTF 147

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             + C+ PLC++  S     +    + DY      ++GYL  D L +         SS  
Sbjct: 148 SKLPCASPLCQALPSAFRACNATGCVYDYRYAVGFTAGYLAADTLAIGDGDGDGDASSSF 207

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGS 288
           + V  GC     G  +DGA+  G++GLG    S  SLL++ G+    FS C   + D+G+
Sbjct: 208 AGVAFGCSTANGGD-MDGAS--GIVGLGR---SALSLLSQIGV--GRFSYCLRSDADAGA 259

Query: 289 --VFFGDQGPATQ---QSTSFL--PIGEKYDA--YFVGVESYCIGNSCLTQS----GFQA 335
             + FG     T    QST+ L  P+  +  A  Y+V +    +G++ L  +    GF A
Sbjct: 260 SPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTA 319

Query: 336 ------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY--CYNASSEEMLK 387
                 +VDSG +FT+L    Y  +   F    +     + G  + +  C+ A + +   
Sbjct: 320 AGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEAGAADT-P 378

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           VP +   F+    + V    +    +EG  V CL V+ T G   +IG    M   +++D 
Sbjct: 379 VPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRG-VSVIGNVMQMDLHVLYDL 437

Query: 448 ENLKLAWSHSKCEEV 462
           +    +++ + C  +
Sbjct: 438 DGATFSFAPADCASL 452


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 102/426 (23%), Positives = 183/426 (42%), Gaps = 31/426 (7%)

Query: 52  SWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGSQTHFFGNQFYWLH 110
           ++P    VE  EL   +  + +  R+ L     SS   ++ FP +GS   +       L+
Sbjct: 47  AFPLDELVELSELRARD--RVRHARILLGGGRQSSVGGVVDFPVQGSSDPYL----VGLY 100

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           +T + +G+P   F V +D GS++LWV C      P S    + L  +L  +D   S ++ 
Sbjct: 101 FTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHS----SGLGIDLHFFDAPGSLTAG 156

Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +V+CS P+C S      + C S  + C Y   Y  + + +SGY + D  +  +    +  
Sbjct: 157 SVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMTDTFYFDAILGESLV 214

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           ++  + ++ GC   Q+G       A DG+ G G G +SV S L+  G+    FS C   +
Sbjct: 215 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA------L 336
            SG   F   G        + P+      Y + + S  +    L      F+A      +
Sbjct: 275 GSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTI 333

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           VD+G + T+L  E Y   +      VS     +  N  + CY  S+      P + L F+
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-CYLVSTSISDMFPSVSLNFA 392

Query: 397 KNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
              S ++R  +++F +   +G +++C+       +  I+G   +     V+D    ++ W
Sbjct: 393 GGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGW 452

Query: 455 SHSKCE 460
           +   C+
Sbjct: 453 ASYDCK 458


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 102/417 (24%), Positives = 172/417 (41%), Gaps = 42/417 (10%)

Query: 78  KLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
           +L++ + +   +LL        FP +G+   F       L+YT + +GTP   F V +D 
Sbjct: 45  QLKARDEARHGRLLQSLGGVIDFPVDGTFDPFV----VGLYYTKLRLGTPPRDFYVQVDT 100

Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSS 184
           GS++LWV C      P +    + L   L+ +DP SS ++  +SCS   C      S S 
Sbjct: 101 GSDVLWVSCASCNGCPQT----SGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
           C    + C Y   Y  + + +SG+ V D+L        +   +  + V+ GC   QTG  
Sbjct: 157 CSVQNNLCAYTFQYG-DGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDL 215

Query: 245 LDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-ENDSGSVFFGDQGPATQQST 302
           +    A DG+ G G   +SV S LA  G+    FS C   EN  G +     G   + + 
Sbjct: 216 VKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILV--LGEIVEPNM 273

Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEV 354
            F P+      Y V + S  +    L        T +G   ++D+G +  +L    Y   
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333

Query: 355 VVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
           V      VS   + +  +GN    CY  ++      P + L F+   S  +    +   +
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ---CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQ 390

Query: 413 NE--GFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDRENLKLAWSHSKCEEVIDKS 466
           N   G  V+C+           I  + ++  +I V+D    ++ W++  C   ++ S
Sbjct: 391 NNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNVS 447


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 102/428 (23%), Positives = 175/428 (40%), Gaps = 65/428 (15%)

Query: 73  QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGS 131
           +  R  L     +  +  +FP        +G+ + + L+Y  + IG P   + + +D GS
Sbjct: 27  RPARGGLSVTAGAEESSAVFP-------LYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGS 79

Query: 132 NLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------R 182
           +L W+ C   C+ C+ +    Y               + +K V C   +C +       R
Sbjct: 80  DLTWLQCDAPCVSCSKVPHPLY-------------RPTKNKLVPCVDQMCAALHGGLTGR 126

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR-KQT 241
             C S K  C Y   Y+ +  SS G LV D   L    + A  S V+  +  GCG  +Q 
Sbjct: 127 HKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL----RLANSSIVRPGLAFGCGYDQQV 181

Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQ 300
           GS  + +A DGV+GLG G VS+ S L + G+ +N    C      G +FFGD   P ++ 
Sbjct: 182 GSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVPYSRA 241

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
           + + +      + Y  G  +   G   L     + + DSG+SFT+   + Y  +V     
Sbjct: 242 TWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALVDAIKG 301

Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF---------- 410
            +S     +  +S   C+    +    V D++  F        R  + SF          
Sbjct: 302 DLSKNLKEVPDHSLPLCWKG-KKPFKSVLDVKKEF--------RTVVLSFSNGKKALMEI 352

Query: 411 -PEN----EGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
            PEN      +   CL +++       D  I+G   M    +++D E  ++ W  + C+ 
Sbjct: 353 PPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPCDR 412

Query: 462 VIDKSHVH 469
           + + + +H
Sbjct: 413 IPNDNTIH 420


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  105 bits (262), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 102/417 (24%), Positives = 172/417 (41%), Gaps = 42/417 (10%)

Query: 78  KLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
           +L++ + +   +LL        FP +G+   F       L+YT + +GTP   F V +D 
Sbjct: 45  QLKARDEARHGRLLQSLGGVIDFPVDGTFDPFV----VGLYYTKLRLGTPPRDFYVQVDT 100

Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSS 184
           GS++LWV C      P +    + L   L+ +DP SS ++  +SCS   C      S S 
Sbjct: 101 GSDVLWVSCASCNGCPQT----SGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
           C    + C Y   Y  + + +SG+ V D+L        +   +  + V+ GC   QTG  
Sbjct: 157 CSVQNNLCAYTFQYG-DGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDL 215

Query: 245 LDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-ENDSGSVFFGDQGPATQQST 302
           +    A DG+ G G   +SV S LA  G+    FS C   EN  G +     G   + + 
Sbjct: 216 VKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILV--LGEIVEPNM 273

Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEV 354
            F P+      Y V + S  +    L        T +G   ++D+G +  +L    Y   
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333

Query: 355 VVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
           V      VS   + +  +GN    CY  ++      P + L F+   S  +    +   +
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ---CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQ 390

Query: 413 NE--GFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDRENLKLAWSHSKCEEVIDKS 466
           N   G  V+C+           I  + ++  +I V+D    ++ W++  C   ++ S
Sbjct: 391 NNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNVS 447


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 97/391 (24%), Positives = 173/391 (44%), Gaps = 55/391 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C  C                ++ P  
Sbjct: 92  YYTARLWI--GTPPQRFALIVDTGSTVTYVPCSTCRHCG----------SHQDPKFRPED 139

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           S + + V C+      + +C + +  C Y   Y+ E ++SSG L +D++   + ++ +PQ
Sbjct: 140 SETYQPVKCTW-----QCNCDNDRKQCTYERRYA-EMSTSSGALGEDVVSFGNQTELSPQ 193

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---D 282
            +     I GC   +TG   +  A DG+MGLG GD+S+   L +  +I +SFS+C+    
Sbjct: 194 RA-----IFGCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMG 247

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
                 V  G   PA    T   P+   Y  Y + ++   +       N  +       +
Sbjct: 248 VGGGAMVLGGISPPADMVFTRSDPVRSPY--YNIDLKEIHVAGKRLHLNPKVFDGKHGTV 305

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY---CYNASSEEMLKV---- 388
           +DSG ++ +LP   +        K   S KRIS  G   +Y   C++ +  ++ ++    
Sbjct: 306 LDSGTTYAYLPESAFLAFKHAIMKETHSLKRIS--GPDPRYNDICFSGAEIDVSQISKSF 363

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRI 443
           P + ++F       +    + F  ++    +CL V S   D      GI+ +N +    +
Sbjct: 364 PVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTL----V 419

Query: 444 VFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 474
           ++DRE+ K+ +  + C E+ ++ HV   PPP
Sbjct: 420 MYDREHTKIGFWKTNCSELWERLHVSDAPPP 450


>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 421

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 99/394 (25%), Positives = 165/394 (41%), Gaps = 43/394 (10%)

Query: 93  PSEGSQT-HFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSA 148
           P E S     +G+ + + L+Y  + IG P   + + +D GS+L W+ C   C+ C  +  
Sbjct: 39  PEESSAVFQLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPH 98

Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTE 201
             Y               + +K V C   LC S       +  C S K  C Y   Y+ +
Sbjct: 99  PLY-------------RPTKNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYA-D 144

Query: 202 DTSSSGYLVDDILHLASFSKHAPQSS-VQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLG 259
             SS G L+ D     SF+     SS V+ S+  GCG  +Q GS  + A  DGV+GLG G
Sbjct: 145 QGSSLGVLLTD-----SFAVRLANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSG 199

Query: 260 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGV 318
            +S+ S L + G+ +N    C      G +FFGD      ++T    +   +  Y+  G 
Sbjct: 200 SISLLSQLKQHGITKNVVGHCLSIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGT 259

Query: 319 ESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 378
            S   G   L     + ++DSG+SFT+   + Y  +V      +S     +   S   C+
Sbjct: 260 ASLYFGGRSLGVRPMEVVLDSGSSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPLCW 319

Query: 379 NASS--EEMLKVPD----MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----G 428
                 + +L V      + L FS  +  ++     ++     F   CL +++       
Sbjct: 320 KGKKPFKSVLDVKKEFKSLVLSFSNGKKALMEIPPENYLIVTKFGNACLGILNGSEIGLK 379

Query: 429 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           D  I+G   M    +++D E  ++ W  + C+ +
Sbjct: 380 DLNIVGDITMQDQMVIYDNERGQIGWIRAPCDRI 413


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 103/392 (26%), Positives = 170/392 (43%), Gaps = 31/392 (7%)

Query: 92  FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYY 151
           FP +G+   F       L+YT + +GTP   F V +D GS++LWV C      P +    
Sbjct: 70  FPVDGASDPFL----VGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKT---- 121

Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGY 208
           + L   LS +DP  SSS+  VSCS   C S    +S   P   C Y   Y  + + +SG+
Sbjct: 122 SELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGF 180

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLL 267
            + D +   +        +  +  + GC   QTG       A DG+ GLG G +SV S L
Sbjct: 181 YISDFMSFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQL 240

Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
           A  GL    FS C   + SG       G   +  T + P+      Y V ++S  +    
Sbjct: 241 AVQGLAPRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQI 299

Query: 328 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
           L          +G   ++D+G +  +LP E Y+  +      VS     +   S++ C+ 
Sbjct: 300 LPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ-CFE 358

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNH----IFSFPENEGFTVFCLTVMS-TDGDYGIIG 434
            ++ ++   P++ L F+   S V+R H    IFS   + G +++C+     +     I+G
Sbjct: 359 ITAGDVDVFPEVSLSFAGGASMVLRPHAYLQIFS---SSGSSIWCIGFQRMSHRRITILG 415

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKCEEVIDKS 466
              +    +V+D    ++ W+   C   ++ S
Sbjct: 416 DLVLKDKVVVYDLVRQRIGWAEYDCSLEVNVS 447


>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/395 (25%), Positives = 163/395 (41%), Gaps = 60/395 (15%)

Query: 103 GNQFYWLH-YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQCAPLSA-SYYTSLD 155
           GN +   H Y  ++IG P   + + +D GSNL W+ C      C  C P     YYT  D
Sbjct: 30  GNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPAD 89

Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCKSL-----KDP--CPYIADYSTEDTSSSG 207
            NL             V C  PLC + R     +      DP  C Y   Y T    S G
Sbjct: 90  GNL------------KVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEG 135

Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSL 266
            L  DI+ +    K          +  GCG KQ        +P DG++GLG+G   + + 
Sbjct: 136 DLATDIISVNGRDK--------KRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQ 187

Query: 267 LAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
           L    +I +N    C      G ++ GD  P T+  T + P+ E    Y  G+    I  
Sbjct: 188 LKGHKMIKENVIGHCLSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDK 246

Query: 326 SCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS- 382
             +     F+A+ DSG+++T +P +IY E+V K    +S   +  ++G +   C+     
Sbjct: 247 QPIRGNPTFEAVFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEVKGRALPLCWKGKKP 306

Query: 383 -------EEMLKVPDMRLIFSKNQSFV-VRNHIFSFPENEGFTVFCLTVMSTDGD----- 429
                  +   K   +++  ++  S + +    + F + +G T  CL ++    D     
Sbjct: 307 FGSVNDVKNQFKALSLKITHARGTSNLDIPPQNYLFVKEDGET--CLAILDASLDPVLKE 364

Query: 430 --YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
             + +IG   M    +++D E  +L W  ++C+ V
Sbjct: 365 LNFILIGAVTMQDLFVIYDNEKKQLGWVRAQCDRV 399


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 92/361 (25%), Positives = 153/361 (42%), Gaps = 38/361 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +  GTP  ++ V  D GS++ W     IQC P S   Y   D     +DP+ S++   V 
Sbjct: 139 VGFGTPAQTYTVIFDTGSDVSW-----IQCLPCSGHCYKQHD---PIFDPTKSATYSVVP 190

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C HP C +    K     C Y  +Y  + +SS+G L  + L L S       +       
Sbjct: 191 CGHPQCAAADGSKCSNGTCLYKVEYG-DGSSSAGVLSHETLSLTS-------TRALPGFA 242

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFF 291
            GCG+   G + D    DG++GLG G +S+ S  A +     +FS C   D    G +  
Sbjct: 243 FGCGQTNLGDFGD---VDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPSDNTTHGYLTI 297

Query: 292 GDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-------TQSGFQALVDSGA 341
           G   PA+     +  + +K D    YFV + S  IG   L       T  G    +DSG 
Sbjct: 298 GPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDG--TFLDSGT 355

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
             T+LP E Y  +  +F   ++  + +   + +  CY+ + +  + +P +   FS    F
Sbjct: 356 ILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVF 415

Query: 402 VVRNH-IFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
            +    I  FP++    + CL  ++      + I+G        +++D    K+ ++ + 
Sbjct: 416 DLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASAS 475

Query: 459 C 459
           C
Sbjct: 476 C 476


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 101/369 (27%), Positives = 169/369 (45%), Gaps = 48/369 (13%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y + Y+   +GTP       +D GSN++W+ CQ C  C           ++    ++PS 
Sbjct: 89  YLISYS---VGTPPFKVYGFMDTGSNIVWLQCQPCNTC----------FNQTSPIFNPSK 135

Query: 166 SSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           SSS KN+ C+   CK    +  SC +  D C Y   Y   D  S G L +D L L S S 
Sbjct: 136 SSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGG-DAKSQGDLSNDSLTLDSTSG 194

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
               S +  +++IGCG        D +   GV+G+G G +S+   +  +  + + FS C 
Sbjct: 195 ---SSVLFPNIVIGCGHINV--LQDNSQSSGVVGMGRGPMSLIKQVGSSS-VGSKFSYCL 248

Query: 282 -----DENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
                D N S  + FG+    + +   ST  + +  + + YF+ +E++ +GN+ +     
Sbjct: 249 IPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGER 308

Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
              S    L+DSG   T LP    +++V    + V   RI    +    CYN + ++ L 
Sbjct: 309 SNASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTGKQ-LN 367

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVF 445
           VPD+   F+     +  N  F FP  +G  + C   +S++G   +G I QN ++   I +
Sbjct: 368 VPDITAHFNGADVKLNSNGTF-FPFEDG--IMCFGFISSNGLEIFGNIAQNNLL---IDY 421

Query: 446 DRENLKLAW 454
           D E   +++
Sbjct: 422 DLEKEIISF 430


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 104/437 (23%), Positives = 178/437 (40%), Gaps = 58/437 (13%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C  C                 + P  
Sbjct: 87  YYTTRLWI--GTPPQEFALIVDTGSTVTYVPCSDCEHCG----------KHQDPRFQPDE 134

Query: 166 SSSSKNVSCSHPL-CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           SS+       HP+ C    +C      C Y   Y+ E +SSSG L +DI+   + S+  P
Sbjct: 135 SSTY------HPVKCNMDCNCDHDGVNCVYERRYA-EMSSSSGVLGEDIISFGNQSEVVP 187

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-- 282
           Q +V      GC   +TG      A DG+MGLG G +S+   L    +I +SFS+C+   
Sbjct: 188 QRAV-----FGCENVETGDLYSQRA-DGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGM 241

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE---SYCIGNSC-LTQSGFQ---- 334
               G++  G   P      S     + Y + +  +E    +  G    L+ S F     
Sbjct: 242 HVGGGAMVLGGIPPPPDMVFSR---SDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHG 298

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV 388
            ++DSG ++ +LP E +    V F   +  K  +L+       N    C++ +  ++ ++
Sbjct: 299 TVLDSGTTYAYLPEEAF----VAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQL 354

Query: 389 ----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 444
               P++ ++FS  Q   +    + F   +    +CL +        ++G   +    + 
Sbjct: 355 SKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVT 414

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNP----LPTTEQQSTSNGQAAAPPS 500
           +DREN K+ +  + C E+  + H+   P  A   P P     P       +N     PP+
Sbjct: 415 YDRENEKIGFWKTNCSELWKRLHIPGAPAAAPIVPTPKSVSAPAPVVSYNNNTTVGMPPT 474

Query: 501 TAKTAPSKSIAASAQQL 517
            A +   + +     Q+
Sbjct: 475 VAPSGLPQEVLPGEFQV 491


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 91/377 (24%), Positives = 161/377 (42%), Gaps = 28/377 (7%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           L+YT + +GTP   F V +D GS++LWV C      P S    + L   L+ +D   SS+
Sbjct: 77  LYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQS----SQLGIELNFFDTVGSST 132

Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           +  + CS P+C SR     + C    + C Y   Y  + + +SGY V D ++ +      
Sbjct: 133 AALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYG-DGSGTSGYYVSDAMYFSLIMGQP 191

Query: 224 PQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
           P  +  ++++ GC   Q+G       A DG+ G G G +SV S L+  G+    FS C  
Sbjct: 192 PAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL- 250

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGF 333
           + D         G   + S  + P+      Y + ++S  +    L         + +  
Sbjct: 251 KGDGDGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNRG 310

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPDM 391
             +VD G +  +L  E Y  +V   +  V  S+++ + +GN    CY  S+      P +
Sbjct: 311 GTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ---CYLVSTSIGDIFPSV 367

Query: 392 RLIFSKNQSFVVRNHIFSFPEN--EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
            L F    S V++   +       +G  ++C+          I+G   +    +V+D   
Sbjct: 368 SLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQ 427

Query: 450 LKLAWSHSKCEEVIDKS 466
            ++ W++  C   ++ S
Sbjct: 428 QRIGWANYDCSLSVNVS 444


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 95/377 (25%), Positives = 160/377 (42%), Gaps = 29/377 (7%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T + +G P   F V +D GS++LWV C  C  C P S+     L+  L  ++P SSS
Sbjct: 4   LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC-PTSS----GLNIQLESFNPDSSS 58

Query: 168 SSKNVSCSHPLCKS-----RSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           ++  ++CS   C +      + C+   S   PC Y   Y  + + +SGY V D +   + 
Sbjct: 59  TASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYYVSDTMFFETV 117

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
             +   ++  +S++ GC   Q+G       A DG+ G G   +SV S L   G+    FS
Sbjct: 118 MGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 177

Query: 279 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQS 331
            C   +D+G       G   +    + P+      Y + +ES         I +S  T S
Sbjct: 178 HCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 236

Query: 332 GFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
             Q  +VDSG +  +L    Y   V      VS    SL     + C+  SS      P 
Sbjct: 237 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFITSSSVDSSFPT 295

Query: 391 MRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDR 447
           + L F    +  V+  N++      +   ++C+      G +  I+G   +     V+D 
Sbjct: 296 VTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDL 355

Query: 448 ENLKLAWSHSKCEEVID 464
            N+++ W+   C   ++
Sbjct: 356 ANMRMGWADYDCSMSVN 372


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 161/379 (42%), Gaps = 51/379 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           + T I +GTP   F V  D GS+L+W     IQC P  A +    ++    +DP  SSS 
Sbjct: 40  YVTTISLGTPAKVFSVIADTGSDLIW-----IQCKPCQACF----NQKDPIFDPEGSSSY 90

Query: 170 KNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             +SC   LC S  R SC      C Y   Y  + + + G L  + + L S      +  
Sbjct: 91  TTMSCGDTLCDSLPRKSCSP---DCDYSYGYG-DGSGTRGTLSSETVTLTSTQG---EKL 143

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----D 282
              ++  GCG    GS+ D +   G++GLG G++S  S L    L  + FS C       
Sbjct: 144 AAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDA 198

Query: 283 ENDSGSVFFGDQGPATQQST----SFLPIGEK---YDAYFVGVESYCIGNSCL------- 328
            + +  +FFGD+  +         +F P+         Y+V ++   I    L       
Sbjct: 199 PSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSF 258

Query: 329 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
                G   ++ DSG + T LP   Y  V+      +S  +I         CY+ S  + 
Sbjct: 259 DIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKA 318

Query: 386 ---LKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
              +K+P M   F   +    V N+  +   N+  T+ CL ++S++ D GI G       
Sbjct: 319 SYKMKIPAMVFHFEGADYQLPVENYFIA--ANDAGTIVCLAMVSSNMDIGIYGNMMQQNF 376

Query: 442 RIVFDRENLKLAWSHSKCE 460
           R+++D  + K+ W+ S+C+
Sbjct: 377 RVMYDIGSSKIGWAPSQCD 395


>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 100/397 (25%), Positives = 161/397 (40%), Gaps = 64/397 (16%)

Query: 103 GNQFYWLH-YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQCAPLSA-SYYTSLD 155
           GN +   H Y  ++IG P   + + +D GSNL W+ C      C  C P     YYT  D
Sbjct: 30  GNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPAD 89

Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCKSL-----KDP--CPYIADYSTEDTSSSG 207
            NL             V C  PLC + R     +      DP  C Y   Y T    S G
Sbjct: 90  GNLK------------VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEG 135

Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSL 266
            L  DI+ +    K          +  GCG KQ        +P DG++GLG+G     + 
Sbjct: 136 DLATDIISVNGRDK--------KRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQ 187

Query: 267 LAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
           L    +I +N    C      G ++ GD  P T+  T + P+ E    Y  G+    I  
Sbjct: 188 LKGHKMIKENVIGHCLSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDK 246

Query: 326 SCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS- 382
             +     F+A+ DSG+++T +P +IY E+V K    +S   +  ++G +   C+     
Sbjct: 247 QPIRGNPTFEAVFDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCWKGKKP 306

Query: 383 -------EEMLKVPDMRLIFSK---NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--- 429
                  +   K   +++  ++   N     +N++F   + E     CL ++    D   
Sbjct: 307 FGSVNDVKNQFKALSLKITHARGTNNLDIPPQNYLFVKEDGE----TCLAILDASLDPVL 362

Query: 430 ----YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
               + +IG   M    +++D E  +L W  ++C+ V
Sbjct: 363 KELNFILIGAVTMQDLFVIYDNEKKQLGWVRAQCDRV 399


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 91/378 (24%), Positives = 160/378 (42%), Gaps = 37/378 (9%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
           N F  L++T + +G P   F V +D GS++LWV C      P S    + L   L+ +D 
Sbjct: 78  NPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDS----SGLGIELNLFDT 133

Query: 164 SSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           + SSS++ + C+ P+C + S+    C +  D C Y   Y  + + +SG+ V D +H    
Sbjct: 134 TKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYR-DRSGTSGFYVTDSMHFDIL 192

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
              +  ++  ++++ GC   Q G       A DG+ G G G+ SV S L+  G+    FS
Sbjct: 193 LGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFS 252

Query: 279 ICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--------L 328
            C    EN  G +  G+     + S  + P+      Y + ++S  +            +
Sbjct: 253 HCLKGGENGGGILVLGE---ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPI 309

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
           + +G + ++DSG +  +L  E+Y  +V      VS           + C+  S       
Sbjct: 310 SNAG-ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADIF 367

Query: 389 PDMRLIFSKNQSFVVR-------NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
           P +R  F    S VV        + I   P      ++C+     +    I+G   +   
Sbjct: 368 PVLRFNFEGIASMVVTPEEYLQFDSIVREP-----ALWCIGFQKAEDGLNILGDLVLKDK 422

Query: 442 RIVFDRENLKLAWSHSKC 459
            IV+D    ++ W++  C
Sbjct: 423 IIVYDLARQRIGWANYDC 440


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 171/385 (44%), Gaps = 52/385 (13%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP   F + +D GS + +VPC  C QC           +    ++ P  S +   V C
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSCDQCG----------NHQDPKFQPDLSDTYHPVKC 51

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
            +P C    +C +  D C Y   Y+ E +SSSG L +D++   + S+  PQ +V      
Sbjct: 52  -NPDC----TCDTENDQCTYERQYA-EMSSSSGILGEDLVSFGNMSELKPQRAV-----F 100

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 292
           GC   +TG      A DG+MGLG GD+S+   L + G+I +SFS+C+   E   G++  G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159

Query: 293 DQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQALVDSGASFTF 345
              P +    S   P    Y  Y + +    +       N  +       ++DSG ++ +
Sbjct: 160 QISPPSDMVFSHSDPDRSPY--YNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAY 217

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY---CYNASSEEMLKV----PDMRLIF 395
           LP   +    + F + ++S+   L+   G    Y   C++ +  E+ ++    P + ++F
Sbjct: 218 LPEAAF----LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVF 273

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLAW 454
              + + +    + F  ++    +CL V     D   ++G   +    + +DRE+ K+ +
Sbjct: 274 DNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGF 333

Query: 455 SHSKCE---EVIDKSHVHLVPPPAG 476
             + C    E ++ S +   P P G
Sbjct: 334 WKTNCSVLWERLNASSISPAPAPLG 358


>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 421

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 101/421 (23%), Positives = 171/421 (40%), Gaps = 65/421 (15%)

Query: 73  QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGS 131
           +  R  L     +  +  +FP        +G+ + + L+Y  + IG P   + + +D GS
Sbjct: 27  RPARGGLSVTAGAEESSAVFP-------LYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGS 79

Query: 132 NLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------R 182
           +L W+ C   C+ C+ +    Y               + +K V C   +C +       R
Sbjct: 80  DLTWLQCDAPCVSCSKVPHPLY-------------RPTKNKLVPCVDQMCAALHGGLTGR 126

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR-KQT 241
             C S K  C Y   Y+ +  SS G LV D   L    + A  S V+  +  GCG  +Q 
Sbjct: 127 HKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL----RLANSSIVRPGLAFGCGYDQQV 181

Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQ 300
           GS  + +A DGV+GLG G VS+ S L + G+ +N    C      G +FFGD   P ++ 
Sbjct: 182 GSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVPYSRA 241

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
           + + +      + Y  G  +   G   L     + + DSG+SFT+   + Y  +V     
Sbjct: 242 TWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALVDAIKG 301

Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF---------- 410
            +S     +  +S   C+    +    V D++  F        R  + SF          
Sbjct: 302 DLSKNLKEVPDHSLPLCWKG-KKPFKSVLDVKKEF--------RTVVLSFSNGKKALMEI 352

Query: 411 -PEN----EGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
            PEN      +   CL +++       D  I+G   M    +++D E  ++ W  + C+ 
Sbjct: 353 PPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPCDR 412

Query: 462 V 462
           +
Sbjct: 413 I 413


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 95/385 (24%), Positives = 170/385 (44%), Gaps = 52/385 (13%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP   F + +D GS + +VPC  C QC           +    ++ P  S +   V C
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSCDQCG----------NHQDPKFQPDLSDTYHPVKC 51

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
            +P C    +C +  D C Y   Y+ E +SSSG L +D++   + S+  PQ +V      
Sbjct: 52  -NPDC----TCDTENDQCTYERQYA-EMSSSSGILGEDLVSFGNMSELKPQRAV-----F 100

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 292
           GC   +TG      A DG+MGLG GD+S+   L + G+I +SFS+C+   E   G++  G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159

Query: 293 DQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQALVDSGASFTF 345
              P +    S   P    Y  Y + +    +       N  +       ++DSG ++ +
Sbjct: 160 QISPPSDMVFSHSDPDRSPY--YNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAY 217

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQ------GNSWKYCYNASSEEMLKV----PDMRLIF 395
           LP   +    + F + ++S+   L+       N    C++ +  E+ ++    P + ++F
Sbjct: 218 LPEAAF----LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVF 273

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLAW 454
              + + +    + F  ++    +CL V     D   ++G   +    + +DRE+ K+ +
Sbjct: 274 DNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGF 333

Query: 455 SHSKCE---EVIDKSHVHLVPPPAG 476
             + C    E ++ S +   P P G
Sbjct: 334 WKTNCSVLWERLNASSISPAPAPLG 358


>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
          Length = 573

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 101/379 (26%), Positives = 165/379 (43%), Gaps = 46/379 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +YT I +G P   + + +D GS+L W+ C   C  CA      Y      +         
Sbjct: 203 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV-------- 254

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             K++ C   L  +++ C++ K  C Y  +Y+ + +SS G L  D +H+ + +       
Sbjct: 255 PPKDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DRSSSMGVLARDDMHIITTNG----GR 307

Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
            +   + GC   Q G  L   A  DG++GL    +S+PS LA  G+I N F  C   D N
Sbjct: 308 EKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPN 367

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSG-----FQALVD 338
             G +F GD        TS  PI    D  F    +    G+  L+  G      Q + D
Sbjct: 368 GGGYMFLGDDYVPRWGMTS-TPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIFD 426

Query: 339 SGASFTFLPTEIYAEVVV-------KFDKLVSSKRISL-QGNSWKYCYNASSEEMLKVPD 390
           SG+S+T+LP EIY  ++         F +  S + + L     +   Y    +++ K   
Sbjct: 427 SGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLFK--P 484

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHR 442
           + L F K + FV+       P+N          CL  ++  D D+G   I+G N + G  
Sbjct: 485 LNLHFGK-RWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKL 543

Query: 443 IVFDRENLKLAWSHSKCEE 461
           +V+D +  ++ W++S C +
Sbjct: 544 VVYDNQQRQIGWTNSDCTK 562


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 94/376 (25%), Positives = 162/376 (43%), Gaps = 31/376 (8%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++T + +G+P   + V +D GS++LWV C  C  C   S      L+  L  ++P +SS+
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSST 171

Query: 169 SKNVSCSHPLC-----KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           S  + CS   C      S + C++  + PC Y   Y  + + +SGY V D ++  +   +
Sbjct: 172 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVSDTMYFDTVMGN 230

Query: 223 APQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              ++  +S++ GC   Q+G       A DG+ G G   +SV S L   G+    FS C 
Sbjct: 231 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 290

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQ 334
             +D+G       G   +    + P+      Y + +ES         I +S  T S  Q
Sbjct: 291 KGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQ 349

Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPDM 391
             +VDSG +  +L    Y   V      VS    SL  +GN    C+  SS      P +
Sbjct: 350 GTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVTSSSVDSSFPTV 406

Query: 392 RLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDRE 448
            L F    +  V+  N++      +   ++C+      G    I  + ++  +I V+D  
Sbjct: 407 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 466

Query: 449 NLKLAWSHSKCEEVID 464
           N+++ W+   C   ++
Sbjct: 467 NMRMGWTDYDCSTSVN 482


>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
          Length = 574

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 101/379 (26%), Positives = 165/379 (43%), Gaps = 46/379 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +YT I +G P   + + +D GS+L W+ C   C  CA      Y      +         
Sbjct: 204 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV-------- 255

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             K++ C   L  +++ C++ K  C Y  +Y+ + +SS G L  D +H+ + +       
Sbjct: 256 PPKDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DRSSSMGVLARDDMHIITTNG----GR 308

Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
            +   + GC   Q G  L   A  DG++GL    +S+PS LA  G+I N F  C   D N
Sbjct: 309 EKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPN 368

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSG-----FQALVD 338
             G +F GD        TS  PI    D  F    +    G+  L+  G      Q + D
Sbjct: 369 GGGYMFLGDDYVPRWGMTS-TPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIFD 427

Query: 339 SGASFTFLPTEIYAEVVV-------KFDKLVSSKRISL-QGNSWKYCYNASSEEMLKVPD 390
           SG+S+T+LP EIY  ++         F +  S + + L     +   Y    +++ K   
Sbjct: 428 SGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLFK--P 485

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHR 442
           + L F K + FV+       P+N          CL  ++  D D+G   I+G N + G  
Sbjct: 486 LNLHFGK-RWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKL 544

Query: 443 IVFDRENLKLAWSHSKCEE 461
           +V+D +  ++ W++S C +
Sbjct: 545 VVYDNQQRQIGWTNSDCTK 563


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 101/417 (24%), Positives = 168/417 (40%), Gaps = 31/417 (7%)

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
           +E L   D      R  L     +    + FP EGS   +       L++T + +G P  
Sbjct: 45  VEHLKERDGAHHARRRGLLGGAPAVAGVVDFPVEGSANPYM----VGLYFTRVKLGNPAK 100

Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS 181
            + V +D GS++LWV C      P S    + L+  L  ++P SSS+S  + CS   C +
Sbjct: 101 EYFVQIDTGSDILWVACSPCTGCPTS----SGLNIQLEFFNPDSSSTSSRIPCSDDRCTA 156

Query: 182 R--------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
                     S  S   PC Y   Y  + + +SG+ V D ++  +   +   ++  +SV+
Sbjct: 157 ALQTGEAVCQSSDSPSSPCGYTFTYG-DGSGTSGFYVSDTMYFDTVMGNEQTANSSASVV 215

Query: 234 IGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
            GC   Q+G  +    A DG+ G G   +SV S L   G+   +FS C   +D+G     
Sbjct: 216 FGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKGSDNGGGIL- 274

Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFT 344
             G   +    F P+      Y + +ES         I +S    S  Q  +VDSG +  
Sbjct: 275 VLGEIVEPGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNTQGTIVDSGTTLV 334

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
           +L    Y   +      VS    S+     + C+  +S      P   L F    S  V+
Sbjct: 335 YLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVTTSSVDSSFPTATLYFKGGVSMTVK 393

Query: 405 --NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             N++      +   ++C+    + G   I+G   +     V+D  N+++ W+   C
Sbjct: 394 PENYLLQQGSVDNNVLWCIGWQRSQG-ITILGDLVLKDKIFVYDLANMRMGWADYDC 449


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 91/376 (24%), Positives = 161/376 (42%), Gaps = 30/376 (7%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
           N F  L++T + +G P   F V +D GS++LWV C      P S    + L   L+ +D 
Sbjct: 78  NPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDS----SGLGIELNLFDT 133

Query: 164 SSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           + SSS++ + C+ P+C + S+    C +  D C Y   Y  + + +SG+ V D +H    
Sbjct: 134 TKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYR-DRSGTSGFYVTDSMHFDIL 192

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
              +  ++  ++++ GC   Q G       A DG+ G G G+ SV S L+  G+    FS
Sbjct: 193 LGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFS 252

Query: 279 ICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--------L 328
            C    EN  G +  G+     + S  + P+      Y + ++S  +            +
Sbjct: 253 HCLKGGENGGGILVLGE---ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPI 309

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
           + +G + ++DSG +  +L  E+Y  +V      VS           + C+  S       
Sbjct: 310 SNAG-ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADIF 367

Query: 389 PDMRLIFSKNQSFVVRNHIF----SFPENEGF-TVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           P +R  F    S VV    +    S      F +++C+     +    I+G   +    I
Sbjct: 368 PVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDLVLKDKII 427

Query: 444 VFDRENLKLAWSHSKC 459
           V+D    ++ W++  C
Sbjct: 428 VYDLAQQRIGWANYDC 443


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 104/377 (27%), Positives = 163/377 (43%), Gaps = 50/377 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP  + L+ LD GS+++W     +QCAP    Y  S       +DP  S S 
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVW-----LQCAPCRHCYAQSG----RVFDPRRSRSY 172

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             V C  P+C+   S  C   ++ C Y   Y  + + ++G    + L  A  ++      
Sbjct: 173 AAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR------ 225

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------ 281
           VQ  V IGCG    G ++   A  G++GLG G +S PS +A++     SFS C       
Sbjct: 226 VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSS 279

Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLTQSG 332
                  S +V FG    A     SF P+G        Y+V +  + +G +    ++QS 
Sbjct: 280 VRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSD 339

Query: 333 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASS 382
            +          ++DSG S T L   +Y  V   F       R+S  G S +  CYN S 
Sbjct: 340 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSG 399

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
             ++KVP + +  +   S  +    +  P +   T FC  +  TDG   IIG     G R
Sbjct: 400 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFR 458

Query: 443 IVFDRENLKLAWSHSKC 459
           +VFD +  ++ +    C
Sbjct: 459 VVFDGDAQRVGFVPKSC 475


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 109/432 (25%), Positives = 174/432 (40%), Gaps = 57/432 (13%)

Query: 57  NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ---LLFPSEGSQTHFFGNQFYWLHYTW 113
            + E L   L  D KR           N +R     ++ P         G      ++T 
Sbjct: 91  TAAELLGHRLQRDGKRAARISAAAGAANGTRRTGSGVVAPVVSGLAQGSGE-----YFTK 145

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP    L+ LD GS+++W     +QCAP    Y    D++   +DP  S S   V 
Sbjct: 146 IGVGTPATPALMVLDTGSDVVW-----LQCAPCRRCY----DQSGQVFDPRRSRSYGAVG 196

Query: 174 CSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           CS PLC+   S  C   +  C Y   Y  + + ++G    + L  A  ++ A        
Sbjct: 197 CSAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATETLTFAGGARVA-------R 248

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------ 285
           + +GCG    G ++  A    ++GLG G +S P+ +++      SFS C  +        
Sbjct: 249 IALGCGHDNEGLFVAAAG---LLGLGRGSLSFPAQISR--RYGRSFSYCLVDRTSSANPA 303

Query: 286 --SGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGVESYCIGNSCLTQSGFQ--- 334
             S +V FG     +  + SF P+ +       Y    VG+       S +  S  +   
Sbjct: 304 SHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDP 363

Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLK 387
                  +VDSG S T L    Y+ +   F    +  R+S  G S +  CY+ S  +++K
Sbjct: 364 SSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVK 423

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           VP + + F+      +    +  P +   T FC     TDG   IIG     G R+VFD 
Sbjct: 424 VPTVSMHFAGGAEAALPPENYLIPVDSKGT-FCFAFAGTDGGVSIIGNIQQQGFRVVFDG 482

Query: 448 ENLKLAWSHSKC 459
           +  ++ +    C
Sbjct: 483 DGQRVGFVPKGC 494


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  102 bits (254), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 104/377 (27%), Positives = 163/377 (43%), Gaps = 50/377 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP  + L+ LD GS+++W     +QCAP    Y  S       +DP  S S 
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVW-----LQCAPCRHCYAQSG----RVFDPRRSRSY 178

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             V C  P+C+   S  C   ++ C Y   Y  + + ++G    + L  A  ++      
Sbjct: 179 AAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR------ 231

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------ 281
           VQ  V IGCG    G ++   A  G++GLG G +S PS +A++     SFS C       
Sbjct: 232 VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSS 285

Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLTQSG 332
                  S +V FG    A     SF P+G        Y+V +  + +G +    ++QS 
Sbjct: 286 VRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSD 345

Query: 333 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASS 382
            +          ++DSG S T L   +Y  V   F       R+S  G S +  CYN S 
Sbjct: 346 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSG 405

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
             ++KVP + +  +   S  +    +  P +   T FC  +  TDG   IIG     G R
Sbjct: 406 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFR 464

Query: 443 IVFDRENLKLAWSHSKC 459
           +VFD +  ++ +    C
Sbjct: 465 VVFDGDAQRVGFVPKSC 481


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 97/412 (23%), Positives = 171/412 (41%), Gaps = 47/412 (11%)

Query: 73  QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGS 131
           +  R  L     +  +  +FP        +G+ + + L+Y  + IG P   + + +D GS
Sbjct: 27  RPARGGLSVTAGAEESSAVFP-------LYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGS 79

Query: 132 NLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------R 182
           +L W+ C   C+ C+ +    Y               + +K V C   +C +       R
Sbjct: 80  DLTWLQCDAPCVSCSKVPHPLY-------------RPTKNKLVPCVDQMCAALHGGLTGR 126

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR-KQT 241
             C S K  C Y   Y+ +  SS G LV D   L    + A  S V+  +  GCG  +Q 
Sbjct: 127 HKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL----RLANSSIVRPGLAFGCGYDQQV 181

Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQ 300
           GS  + +A DGV+GLG G VS+ S L + G+ +N    C      G +FFGD   P ++ 
Sbjct: 182 GSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVPYSRA 241

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
           + + +      + Y  G  +   G   L     + + DSG+SFT+   + Y  +V     
Sbjct: 242 TWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALVDAIKG 301

Query: 361 LVSSKRISLQGNSWKYCYNASS--EEMLKVPD----MRLIFSKNQSFVVRNHIFSFPENE 414
            +S     +  +S   C+      + +L V      + L FS  +  ++     ++    
Sbjct: 302 DLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFKTVVLSFSNGKKALMEIPPENYLIVT 361

Query: 415 GFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            +   CL +++       D  I+G   M    +++D E  ++ W  + C+ +
Sbjct: 362 KYGNACLGILNGSEVGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPCDRI 413


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 101/379 (26%), Positives = 159/379 (41%), Gaps = 51/379 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           + T I +GTP   F V  D GS+L+W     IQC P  A +    ++    +DP  SSS 
Sbjct: 40  YVTTISLGTPAKVFSVIADTGSDLIW-----IQCKPCQACF----NQKDPIFDPEGSSSY 90

Query: 170 KNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             +SC   LC S  R SC      C Y   Y  + + + G L  + + L S      +  
Sbjct: 91  TTMSCGDTLCDSLPRKSCSP---NCDYSYGYG-DGSGTRGTLSSETVTLTSTQG---EKL 143

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----D 282
              ++  GCG    GS+ D +   G++GLG G++S  S L    L  + FS C       
Sbjct: 144 AAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDA 198

Query: 283 ENDSGSVFFGDQGPATQQST----SFLPIGEK---YDAYFVGVESYCIGNSCL------- 328
            + +  +FFGD+  +         +F P+         Y+V ++   I    L       
Sbjct: 199 PSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSF 258

Query: 329 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
                G   ++ DSG + T LP   Y  V+      VS   I         CY+ S  + 
Sbjct: 259 DIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKA 318

Query: 386 ---LKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
               K+P M   F   +    V N+  +   N+  T+ CL ++S++ D GI G       
Sbjct: 319 SYKKKIPAMVFHFEGADHQLPVENYFIA--ANDAGTIVCLAMVSSNMDIGIYGNMMQQNF 376

Query: 442 RIVFDRENLKLAWSHSKCE 460
           R+++D  + K+ W+ S+C+
Sbjct: 377 RVMYDIGSSKIGWAPSQCD 395


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 102/415 (24%), Positives = 179/415 (43%), Gaps = 56/415 (13%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           T + IGTP   F + +D GS + +VPC  C QC                 + P SSS+ K
Sbjct: 90  TRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCG----------KHQDPRFQPESSSTYK 139

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            + C+ P C    +C      C Y   Y+ E +SSSG L +D+L   + S+  PQ +   
Sbjct: 140 PMQCN-PSC----NCDDEGKQCTYERRYA-EMSSSSGLLAEDVLSFGNESELTPQRA--- 190

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--SGS 288
             I GC   +TG      A DG+MGLG G +SV   L    ++ NSFS+C+   D   G+
Sbjct: 191 --IFGCETVETGELFSQRA-DGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGA 247

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGV---ESYCIG-----NSCLTQSGFQALVDSG 340
           +  G+  P      +     + Y + +  +   E +  G     N  +       ++DSG
Sbjct: 248 MVLGNIPPPPDMVFAH---SDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSG 304

Query: 341 ASFTFLPTEIYA---EVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV----PDMR 392
            ++ +LP E +    + ++K  K +  K+I     S+   C++ +  ++ ++    P++ 
Sbjct: 305 TTYAYLPEEAFVAFKDAIIKEIKFL--KQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVN 362

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLK 451
           ++F   Q   +    + F   +    +CL +     D   ++G   +    + +DR+N K
Sbjct: 363 MVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDK 422

Query: 452 LAWSHSKCEEVIDKSHVHLVPPPAGQSPN-PLPTTEQQSTSNGQAAAPPSTAKTA 505
           + +  + C E+  +           QSP  P P     S+ N   +  P+ A + 
Sbjct: 423 IGFWKTNCSELWKRLQ--------SQSPGIPAPPPVVFSSGNKSESIAPTQAPSG 469


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 91/380 (23%), Positives = 160/380 (42%), Gaps = 29/380 (7%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y L+ T + +GTP   F V +D GS++LW+ C      P S+     L   L+ +D   S
Sbjct: 81  YGLYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSS----GLGIELNFFDTVGS 136

Query: 167 SSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           S++  V CS P+C S      + C    + C Y   Y  + + +SG  V D ++      
Sbjct: 137 STAALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYE-DGSGTSGVYVSDAMYFDMILG 195

Query: 222 HAPQSSVQSS--VIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
            +  ++V SS  ++ GC   Q+G       A DG++G G G++SV S L+  G+    FS
Sbjct: 196 QSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFS 255

Query: 279 ICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
            C   D N  G +  G+     + S  + P+      Y + ++S  +    L        
Sbjct: 256 HCLKGDGNGGGILVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFA 312

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
           T      ++DSG + ++L  E Y  +V   D  VS    S      + CY   +      
Sbjct: 313 TSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ-CYLVLTSIDDSF 371

Query: 389 PDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           P +   F    S  ++   ++ +    +G  ++C+          I+G   +    +V+D
Sbjct: 372 PTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYD 431

Query: 447 RENLKLAWSHSKCEEVIDKS 466
               ++ W++  C   ++ S
Sbjct: 432 LARQQIGWTNYDCSMSVNVS 451


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 102/376 (27%), Positives = 165/376 (43%), Gaps = 49/376 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP    L+ LD GS+++W     +QCAP    Y    +++   +DP  S S 
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVW-----LQCAPCRRCY----EQSGQVFDPRRSRSY 190

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             V C+ PLC+   S  C   +  C Y   Y  + + ++G    + L  A  ++ A    
Sbjct: 191 NAVGCAAPLCRRLDSGGCDLRRSACLYQVAYG-DGSVTAGDFATETLTFAGGARVA---- 245

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
               V +GCG    G ++  A    ++GLG G +S P+ +++      SFS C  +  S 
Sbjct: 246 ---RVALGCGHDNEGLFVAAAG---LLGLGRGSLSFPTQISR--RYGRSFSYCLVDRTSS 297

Query: 288 --------SVFFGDQGPATQQSTSFLPIGEKYDA---YFV--------GVESYCIGNSCL 328
                   +V FG     +  ++SF P+ +       Y+V        G     + NS L
Sbjct: 298 ANTASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDL 357

Query: 329 T---QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSE 383
                SG    +VDSG S T L    Y+ +   F    +  R+S  G S +  CY+ S  
Sbjct: 358 RLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGR 417

Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           +++KVP + + F+      +    +  P +   T FC     TDG   IIG     G R+
Sbjct: 418 KVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGT-FCFAFAGTDGGVSIIGNIQQQGFRV 476

Query: 444 VFDRENLKLAWSHSKC 459
           VFD +  ++A++   C
Sbjct: 477 VFDGDGQRVAFTPKGC 492


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  101 bits (252), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 103/377 (27%), Positives = 163/377 (43%), Gaps = 50/377 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP  + L+ LD GS+++W     +QCAP    Y  S       +DP  S S 
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVW-----LQCAPCRHCYAQSG----RVFDPRRSRSY 172

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             V C  P+C+   S  C   ++ C Y   Y  + + ++G    + L  A  ++      
Sbjct: 173 AAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR------ 225

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------ 281
           VQ  V IGCG    G ++   A  G++GLG G +S P+ +A++     SFS C       
Sbjct: 226 VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPTQIARS--FGRSFSYCLVDRTSS 279

Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLTQSG 332
                  S +V FG    A     SF P+G        Y+V +  + +G +    ++QS 
Sbjct: 280 VRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSD 339

Query: 333 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASS 382
            +          ++DSG S T L   +Y  V   F       R+S  G S +  CYN S 
Sbjct: 340 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSG 399

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
             ++KVP + +  +   S  +    +  P +   T FC  +  TDG   IIG     G R
Sbjct: 400 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFR 458

Query: 443 IVFDRENLKLAWSHSKC 459
           +VFD +  ++ +    C
Sbjct: 459 VVFDGDAQRVGFVPKSC 475


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  101 bits (252), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 105/438 (23%), Positives = 184/438 (42%), Gaps = 36/438 (8%)

Query: 46  NVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQ 105
            +++  ++P    VE  EL       + + RV+      SS   + FP EG+   +    
Sbjct: 15  TLTLERAFPLNQRVELDEL-------KARDRVRHGRFLQSSVGVVDFPVEGTYDPYR--- 64

Query: 106 FYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
              L++T + +G+P   F V +D GS++LWV C      P S+  +  L+     +DP S
Sbjct: 65  -VGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNF----FDPGS 119

Query: 166 SSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           SS++  +SCS   C      S + C S  + C Y   Y  + + +SGY V D+L+  +  
Sbjct: 120 SSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYG-DGSGTSGYYVSDLLNFDAIV 178

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
             +  +S  +S++ GC   QTG       A DG+ G G  D+SV S ++  G+    FS 
Sbjct: 179 GSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSH 237

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQS 331
           C   +          G   ++   + P+      Y + ++S  +    L        T +
Sbjct: 238 CLKGDGG-GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATST 296

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
               +VDSG +  +L  E Y   V    + VS     L     + CY  +S      P +
Sbjct: 297 NRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-CYLITSSVKGIFPTV 355

Query: 392 RLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDRE 448
            L F+   S  ++   +   +N      V+C+      G    I  + ++  +I V+D  
Sbjct: 356 SLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLA 415

Query: 449 NLKLAWSHSKCEEVIDKS 466
             ++ W++  C   ++ S
Sbjct: 416 GQRIGWANYDCSMSVNVS 433


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 161/378 (42%), Gaps = 63/378 (16%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++  I +G P+  + V +D GS++LWV C  C +C   S      L   L+ YDP+SS 
Sbjct: 26  LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKS-----DLGIKLTLYDPASSV 80

Query: 168 SSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           S+  VSC    C S  +     CK  + PC Y   Y  + +S++GY V D +     + +
Sbjct: 81  SATRVSCDDDFCTSTYNGLLPDCKK-ELPCQYNVVYG-DGSSTAGYFVSDAVQFERVTGN 138

Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
                   +V  GCG +Q+G     G A DG++G                    +F+ C 
Sbjct: 139 LQTGLSNGTVTFGCGAQQSGGLGTSGEALDGILG--------------------AFAHCL 178

Query: 282 DENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ 334
           D  + G +F  G+       +T  +P    Y+ Y   +E   +G + L        SG +
Sbjct: 179 DNVNGGGIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIE---VGGTVLELPTDVFDSGDR 235

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-----ISLQGNSWKY-CYNASSEEML 386
              ++DSG +  +LP  +Y       D +++  R     +SL     ++ C+  S     
Sbjct: 236 RGTIIDSGTTLAYLPEVVY-------DSMMNEIRSQQPGLSLHTVEEQFICFKYSGNVDD 288

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGHR 442
             PD++  F  + +  V  H + F  +E    F      + S DG D  ++G   +    
Sbjct: 289 GFPDIKFHFKDSLTLTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKL 348

Query: 443 IVFDRENLKLAWSHSKCE 460
           +++D EN  + W+   C+
Sbjct: 349 VLYDIENQAIGWTEYNCK 366


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 170/389 (43%), Gaps = 51/389 (13%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D GS + +VPC  C  C                ++ P +
Sbjct: 92  YYTTRLWI--GTPPQRFALIVDTGSTVTYVPCSTCKHCG----------SHQDPKFRPEA 139

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           S + + V C+      + +C   +  C Y   Y+ E ++SSG L +D++   + S+ +PQ
Sbjct: 140 SETYQPVKCT-----WQCNCDDDRKQCTYERRYA-EMSTSSGVLGEDVVSFGNQSELSPQ 193

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---D 282
            +     I GC   +TG   +  A DG+MGLG GD+S+   L +  +I ++FS+C+    
Sbjct: 194 RA-----IFGCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMG 247

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
                 V  G   PA    T   P+   Y  Y + ++   +       N  +       +
Sbjct: 248 VGGGAMVLGGISPPADMVFTHSDPVRSPY--YNIDLKEIHVAGKRLHLNPKVFDGKHGTV 305

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK-YCYNASSEEMLKV----PD 390
           +DSG ++ +LP   +        K   S KRIS     +   C++ +   + ++    P 
Sbjct: 306 LDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPV 365

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIVF 445
           + ++F       +    + F  ++    +CL V S   D      GI+ +N +    +++
Sbjct: 366 VEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTL----VMY 421

Query: 446 DRENLKLAWSHSKCEEVIDKSHVHLVPPP 474
           DRE+ K+ +  + C E+ ++ HV   PPP
Sbjct: 422 DREHSKIGFWKTNCSELWERLHVSNAPPP 450


>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 410

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 93/385 (24%), Positives = 162/385 (42%), Gaps = 48/385 (12%)

Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN F   +Y+  + IG P  +F   +D GS++ WV C   C  C         +L   L 
Sbjct: 46  GNVFPLGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGC---------NLPPKL- 95

Query: 160 EYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
           +Y P  ++    V CS P+C      +   C + K+ C Y  +Y+ + +S    ++D   
Sbjct: 96  QYKPKGNT----VPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFP 151

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAG 271
                 K    S++Q  +  GCG  Q  SY     P    GV+GLG G + + + L  AG
Sbjct: 152 F-----KLLNGSAMQPRLAFGCGYDQ--SYPSAHPPPATAGVLGLGRGKIGLLTQLVSAG 204

Query: 272 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 331
           L +N    C      G +FFGD         ++ P+    + Y  G              
Sbjct: 205 LTRNVVGHCLSSKGGGYLFFGDTL-IPSLGVAWTPLLPPDNHYTTGPAELLFNGKPTGLK 263

Query: 332 GFQALVDSGASFTFLPTEIYAEVV--VKFDKLVSSKRISLQGNSWKYCYNASS--EEMLK 387
           G + + D+G+S+T+  ++ Y  +V  +  D  VS  +++ +  +   C+  +   + +L+
Sbjct: 264 GLKLIFDTGSSYTYFNSKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLE 323

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF------CLTVMSTD----GDYGIIGQNF 437
           V +     + N +   RN     P  E + +       CL +++       +  +IG   
Sbjct: 324 VKNFFKTITINFTNARRNTQLQIPP-ESYLIISKTGNACLGLLNGSEVGLQNSNVIGDIS 382

Query: 438 MMGHRIVFDRENLKLAWSHSKCEEV 462
           M G  I++D E  +L W  S C ++
Sbjct: 383 MQGLLIIYDNEKQQLGWVSSNCNKL 407


>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 105/391 (26%), Positives = 171/391 (43%), Gaps = 48/391 (12%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEY-DPSS 165
           L+YT+I +G P   + + +D GS+L WV C   C  C    +  Y     N+  + D   
Sbjct: 198 LYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSPLYKPRRENVVSFKDSLC 257

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
               +N       C   ++C+     C Y   Y+ + +SS G LV D   L    + +  
Sbjct: 258 MEVQRNYDGDQ--C---AACQQ----CNYEVQYADQ-SSSLGVLVKDEFTL----RFSNG 303

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--D 282
           S  + + I GC   Q G  L+  +  DG++GL    VS+PS LA  G+I N    C   D
Sbjct: 304 SLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLTGD 363

Query: 283 ENDSGSVFFGDQGPATQQSTSFL-----PIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 335
               G +F GD     Q   +++     P  + Y    V ++   I  S  T   S  Q 
Sbjct: 364 PAGGGYLFLGDDF-VPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQV 422

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           + DSG+S+T+   E Y ++V   ++ VS+  + LQ +S   C+  + + +  V D++  F
Sbjct: 423 VFDSGSSYTYFTKEAYYQLVANLEE-VSAFGLILQDSSDTICWK-TEQSIRSVKDVKHFF 480

Query: 396 SK------NQSFVVRNHIFSFPEN------EGFTVFCLTVMST----DGDYGIIGQNFMM 439
                   ++ ++V   +   PEN      EG    CL ++      DG   I+G N + 
Sbjct: 481 KPLTLQFGSRFWLVSTKLVILPENYLLINKEGNV--CLGILDGSQVHDGSTIILGDNALR 538

Query: 440 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHL 470
           G  +V+D  N ++ W+ S C       H+ L
Sbjct: 539 GKLVVYDNVNQRIGWTSSDCHNPRKIKHLPL 569


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 112/461 (24%), Positives = 202/461 (43%), Gaps = 58/461 (12%)

Query: 19  SDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKK-NSVEYLELLLSNDWKRQKTRV 77
           S+  S S+++++R S     + ++K G        PK  N     E LL +  + +  +V
Sbjct: 55  SNVCSQSTRVLNRASSL---KVVNKYGPCIPVTGAPKTINVPSTAEFLLQDQLRVKSFQV 111

Query: 78  KLQSNNNSS---RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
           +L  N +S      Q   P+    T          +   + +GTP   F ++ D GS+L 
Sbjct: 112 RLSMNPSSGVFKEMQTTIPASIVPTG-------GAYVVTVGLGTPKKDFTLSFDTGSDLT 164

Query: 135 WVPCQ-CIQ-CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKS 187
           W  C+ C+  C P          +N  ++DP++S+S KNVSCS   CK     +  +   
Sbjct: 165 WTQCEPCLGGCFP----------QNQPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDC 214

Query: 188 LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG 247
           + + C Y   Y +  T   G+L  + L +AS       S V  + + GC  +  G++ +G
Sbjct: 215 ISNTCLYGIQYGSGYTI--GFLATETLAIAS-------SDVFKNFLFGCSEESRGTF-NG 264

Query: 248 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFL 305
               G++GLG   +++PS        +N FS C   + S  G + FG +     +ST   
Sbjct: 265 TT--GLLGLGRSPIALPSQTTNK--YKNLFSYCLPASPSSTGHLSFGVEVSQAAKSTPIS 320

Query: 306 P-IGEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLV 362
           P + + Y    VG+    +    L  +G   + ++DSG +FTFLP+  Y+ +   F +++
Sbjct: 321 PKLKQLYGLNTVGIS---VRGRELPINGSISRTIIDSGTTFTFLPSPTYSALGSAFREMM 377

Query: 363 SSKRISLQGNSWKYCYNASS--EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 420
           ++  ++   +S++ CY+ S+     L +P + + F       +       P N G    C
Sbjct: 378 ANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVN-GLKEVC 436

Query: 421 LTVMST--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           L    T  D D+ I G      + +++D     + ++   C
Sbjct: 437 LAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 106/439 (24%), Positives = 185/439 (42%), Gaps = 32/439 (7%)

Query: 48  SVADSWPKKNSVEY---LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGN 104
           +VA  +P   ++E    L   +  D  + + RV+      SS   + FP EG+   +   
Sbjct: 22  AVASGFPATLTLERAFPLNQRVELDELKARDRVRHGRFLQSSVGVVDFPVEGTYDPYR-- 79

Query: 105 QFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPS 164
               L++T + +G+P   F V +D GS++LWV C      P S+  +  L+     +DP 
Sbjct: 80  --VGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNF----FDPG 133

Query: 165 SSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           SSS++  +SCS   C      S + C S  + C Y   Y  + + +SGY V D+L+  + 
Sbjct: 134 SSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYG-DGSGTSGYYVSDLLNFDAI 192

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
              +  +S  +S++ GC   QTG       A DG+ G G  D+SV S ++  G+    FS
Sbjct: 193 VGSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFS 251

Query: 279 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQ 330
            C   +          G   ++   + P+      Y + ++S  +    L        T 
Sbjct: 252 HCLKGDGG-GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATS 310

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
           +    +VDSG +  +L  E Y   V    + VS     L     + CY  +S      P 
Sbjct: 311 TNRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-CYLITSSVKGIFPT 369

Query: 391 MRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGDYGIIGQNFMMGHRI-VFDR 447
           + L F+   S  ++   +   +N      V+C+      G    I  + ++  +I V+D 
Sbjct: 370 VSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDL 429

Query: 448 ENLKLAWSHSKCEEVIDKS 466
              ++ W++  C   ++ S
Sbjct: 430 AGQRIGWANYDCSMSVNVS 448


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 166/378 (43%), Gaps = 35/378 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T + +G P   ++V +D GS++LWV C+ C  C   SA     L+  L+ YDP  SS
Sbjct: 1   LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSA-----LNIPLTMYDPRESS 55

Query: 168 SSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           ++  VSCS PLC      + + C    + C YI  Y  + ++S GY V D +     S +
Sbjct: 56  TTSLVSCSDPLCVRGRRFAEAQCSQATNNCEYIFSYG-DGSTSEGYYVRDAMQYNVISSN 114

Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              ++  S V+ GC  +QTG       A DG++G G  ++SVP+ LA    I   FS C 
Sbjct: 115 G-LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL 173

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
            E +         G   +   ++ P+      Y V +    + ++ L        + +  
Sbjct: 174 -EGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDT 232

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG +  + P+  Y   V    +  S+  + +QG   + C+  S       P++ L
Sbjct: 233 GVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQ-CFLVSGRLSDLFPNVTL 291

Query: 394 IFSKNQSFVVRNH--IFSFPENEGFT-VFCLTVMSTDGDYG--------IIGQNFMMGHR 442
            F      +  ++  ++      G T V+C+   S+    G        I+G   +    
Sbjct: 292 NFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKL 351

Query: 443 IVFDRENLKLAWSHSKCE 460
           +V+D +N ++ W    C+
Sbjct: 352 VVYDLDNSRIGWMSYNCK 369


>gi|351722911|ref|NP_001237772.1| uncharacterized protein LOC100500675 [Glycine max]
 gi|255630909|gb|ACU15817.1| unknown [Glycine max]
          Length = 244

 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/234 (27%), Positives = 113/234 (48%), Gaps = 9/234 (3%)

Query: 279 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVD 338
           +CF  + +G + FGD G   Q+ T F  + + +  Y + +    + +S +    F A+ D
Sbjct: 1   MCFGPDGAGRITFGDTGSPDQRKTPFN-VRKLHPTYNITITQIVVEDS-VADLEFHAIFD 58

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----WKYCYNASSEEMLKVPDMRLI 394
           SG SFT++    Y  +   ++  V + R S Q       ++YCY+ S  + ++VP + L 
Sbjct: 59  SGTSFTYINDPAYTRLGEMYNSKVKANRHSSQSPDSNIPFEYCYDISINQTIEVPFLNLT 118

Query: 395 FSKNQSFVVRNHIFS-FPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
                 + V + I   F E EG  + CL +  +D    IIGQNFM+G++IVFDR+N+ L 
Sbjct: 119 MKGGDDYYVMDPIVQVFSEEEG-DLLCLGIQKSDS-VNIIGQNFMIGYKIVFDRDNMNLG 176

Query: 454 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPS 507
           W  + C + +  +   +  P    + +P       +TSN     P  + +  P+
Sbjct: 177 WKETNCSDDVLSNTSPINTPSPSPAVSPAIAVNPVATSNPSINPPNRSFRIKPT 230


>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
 gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 405

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 100/382 (26%), Positives = 166/382 (43%), Gaps = 42/382 (10%)

Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
           GN F   +Y+  + IG+P  +F   +D GS+L WV C     AP S     +L  NL +Y
Sbjct: 41  GNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCD----APCSG---CTLPPNL-QY 92

Query: 162 DPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
            P  +     + CS+P+C +     +  C + ++ C Y   Y+ +  SS G LV D   L
Sbjct: 93  KPKGNI----IPCSNPICTALHWPNKPHCPNPQEQCDYEVKYA-DQGSSMGALVTDQFPL 147

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAGLI 273
               K    S +Q  V  GCG  Q  SY     P    GV+GLG G + + + L  AGL 
Sbjct: 148 ----KLVNGSFMQPPVAFGCGYDQ--SYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLT 201

Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 333
           +N    C      G +FFGD         ++ P+  + + Y  G              G 
Sbjct: 202 RNVVGHCLSSKGGGFLFFGDN-LVPSIGVAWTPLLSQDNHYTTGPADLLFNGKPTGLKGL 260

Query: 334 QALVDSGASFTFLPTEIYAEVV--VKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKVP 389
           + + D+G+S+T+  ++ Y  ++  +  D  VS  +++ +  +   C+  +   + +L+V 
Sbjct: 261 KLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVK 320

Query: 390 DMRLIFSKNQSFVVRN-HIFSFPE------NEGFTVFCLTVMSTDG--DYGIIGQNFMMG 440
           +     + N +   RN  ++  PE        G     L   S  G  +  +IG   M G
Sbjct: 321 NFFKTITINFTNGRRNTQLYLAPELYLIVSKTGNVCLGLLNGSEVGLQNSNVIGDISMQG 380

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             +++D E  +L W  S C ++
Sbjct: 381 LMMIYDNEKQQLGWVSSDCNKL 402


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/373 (26%), Positives = 165/373 (44%), Gaps = 21/373 (5%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           L++T + +GTP + F V +D GS++LWV C      P S    + L   L+ +D SSSSS
Sbjct: 78  LYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRS----SGLGIQLNFFDASSSSS 133

Query: 169 SKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           S  VSCS P+C S      + C +  + C Y   Y  + + +SGY V + ++       +
Sbjct: 134 SSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYG-DGSGTSGYYVSESMYFDMVMGQS 192

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
             ++  +SV+ GC   Q+G       A DG+ G G GD+SV S L+  G+    FS C  
Sbjct: 193 MIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLK 252

Query: 282 -DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIGNSCLTQS-GFQA 335
            + N  G +  G+        +  +P    Y+ Y     V  ++  I  S    S     
Sbjct: 253 GEGNGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFATSINRGT 312

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           ++DSG +  +L  E Y   V      V S+ ++   +    CY  S+      P + L F
Sbjct: 313 IIDSGTTLAYLVEEAYTPFVSAITAAV-SQSVTPTISKGNQCYLVSTSVGEIFPLVSLNF 371

Query: 396 SKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
           + + S V++   ++      +G  ++C+          I+G   M     V+D    ++ 
Sbjct: 372 AGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQRIG 431

Query: 454 WSHSKCEEVIDKS 466
           W+   C + ++ S
Sbjct: 432 WASYDCSQAVNVS 444


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 153/376 (40%), Gaps = 48/376 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +Y  ++IG P   + + +D GS+L W+ C   C  C  +    Y      L         
Sbjct: 57  YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKL--------- 107

Query: 168 SSKNVSCSHPLCKSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
               V C++ +C +  S  S       +  C Y   Y T+  SS G LV D   L   +K
Sbjct: 108 ----VPCANSICTALHSGSSPNKKCTTQQQCDYQIKY-TDKASSLGVLVTDSFSLPLRNK 162

Query: 222 HAPQSSVQSSVIIGCGR-KQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNSF 277
               S+V+ S+  GCG  +Q G   +GAAP   DG++GLG G VS+ S L + G+ +N  
Sbjct: 163 ----SNVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVL 216

Query: 278 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQA 335
             C   +  G +FFGD    T + T ++P+        Y  G  +       L+    + 
Sbjct: 217 GHCLSTSGGGFLFFGDDMVPTSRVT-WVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV 275

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           + DSG+++T+   + Y   +      +S     +   S   C+    +    V D++  F
Sbjct: 276 VFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKG-QKAFKSVSDVKKDF 334

Query: 396 SKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIVF 445
              Q    +N +   P      V      CL ++  DG      + IIG   M    +++
Sbjct: 335 KSLQFIFGKNAVMEIPPENYLIVTKNGNVCLGIL--DGSAAKLSFSIIGDITMQDQMVIY 392

Query: 446 DRENLKLAWSHSKCEE 461
           D E  +L W    C  
Sbjct: 393 DNEKAQLGWIRGSCSR 408


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 107/421 (25%), Positives = 186/421 (44%), Gaps = 51/421 (12%)

Query: 66  LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
           + N  +    R+      +S RN ++  S+ ++   F N   +L    I +GTP  S + 
Sbjct: 41  MYNSSETHFDRIVNALRRSSHRNTVVLESDTAEAPIFNNGGEYL--VEISVGTPPFSIVA 98

Query: 126 ALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSC 185
             D GS+++W      QC P S  Y     +N   +DPS S++ KNV+CS P+C      
Sbjct: 99  VADTGSDVIWT-----QCKPCSNCY----QQNAPMFDPSKSTTYKNVACSSPVCSYSGDG 149

Query: 186 KSLKD--PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
            S  D   C Y   Y  +D+ S G L  D + + S S   P +  ++  +IGCG    G+
Sbjct: 150 SSCSDDSECLYSIAYG-DDSHSQGNLAVDTVTMQSTSGR-PVAFPRT--VIGCGHDNAGT 205

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDSGSVFFGDQGPA 297
           +   A   G++GLG G  S+ + L  A      FS C         NDS  + FG     
Sbjct: 206 F--NANVSGIVGLGRGPASLVTQLGPA--TGGKFSYCLIPIGTGSTNDSTKLNFGSNANV 261

Query: 298 TQQSTSFLPI--GEKYDAYF-VGVESYCIGNSCL------TQSGFQA--LVDSGASFTFL 346
           +   T   PI    +Y  ++ + +E+  +G++        ++ G ++  ++DSG + T+L
Sbjct: 262 SGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGESNIIIDSGTTLTYL 321

Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH 406
           P+ +         + +S            YC+ A++ +  ++P + + F      + R +
Sbjct: 322 PSALLNSFGSAISQSMSLPHAQDPSEFLDYCF-ATTTDDYEMPPVTMHFEGADVPLQREN 380

Query: 407 IFSFPENEGFTVFCLTVMSTDGD----YGIIGQ-NFMMGHRIVFDRENLKLAWSHSKCEE 461
           +F    ++     CL   S   D    YG I Q NF++G    +D +NL +++  + C  
Sbjct: 381 LFVRLSDD---TICLAFGSFPDDNIFIYGNIAQSNFLVG----YDIKNLAVSFQPAHCGA 433

Query: 462 V 462
           V
Sbjct: 434 V 434


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 164/367 (44%), Gaps = 49/367 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP   F   +D GS+L WV     QCAP +  +    ++    + P +SSS  N S
Sbjct: 12  ISLGTPPQQFSAIVDTGSDLCWV-----QCAPCARCF----EQPDPLFIPLASSSYSNAS 62

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C+  LC +  R +C S+++ C Y   Y     +   +         +F       S  + 
Sbjct: 63  CTDSLCDALPRPTC-SMRNTCTYSYSYGDGSNTRGDF---------AFETVTLNGSTLAR 112

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGS-- 288
           +  GCG  Q G++   A  DG++GLG G +S+PS L  +    + FS C  D++ +G+  
Sbjct: 113 IGFGCGHNQEGTF---AGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFS 167

Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQ--SGFQ-------- 334
            + FG+   A     SF P+ +  D    Y+VGVES  +GN  +    S F+        
Sbjct: 168 PITFGNA--AENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGG 225

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS--SEEMLKVPDMR 392
            ++DSG + T+     +  ++ +  + +S             CY+ S  S   L +P M 
Sbjct: 226 VILDSGTTITYWRLAAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMT 285

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
           +  +     +  ++++   +N G TV   T MST   + IIG      + IV D  N ++
Sbjct: 286 VHLTNVDFEIPVSNLWVLVDNFGETV--CTAMSTSDQFSIIGNVQQQNNLIVTDVANSRV 343

Query: 453 AWSHSKC 459
            +  + C
Sbjct: 344 GFLATDC 350


>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
 gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
          Length = 429

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 95/384 (24%), Positives = 160/384 (41%), Gaps = 41/384 (10%)

Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
            +G+ + + L+Y  ++IG P   + + +D GS+L W+ C   C  C  +    Y      
Sbjct: 56  LYGDVYPHGLYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY------ 109

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 210
                    + +K V C   LC S       +  C S  + C Y+  Y+ +  SS+G LV
Sbjct: 110 -------RPTKNKLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYA-DQGSSTGVLV 161

Query: 211 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 270
           +D   L    + A  S V+ S+  GCG  Q  S  + +  DGV+GLG G VS+ S   + 
Sbjct: 162 NDSFAL----RLANGSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQH 217

Query: 271 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLT 329
           G+ +N    C      G +FFGD     Q+ T    +      Y+  G  S   G+  L 
Sbjct: 218 GVTKNVVGHCLSLRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLR 277

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
               + + DSG+SFT+   + Y  +V      +S     +   S   C+    +    V 
Sbjct: 278 VKLTEVVFDSGSSFTYFAAQPYQALVTALKGDLSRTLKEVSDPSLPLCWKG-KKPFKSVL 336

Query: 390 DMRLIF-SKNQSFVVRNHIFSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFM 438
           D++  F S   +F   N  F     + + +       CL +++       D  I+G   M
Sbjct: 337 DVKKEFKSLVLNFGNGNKAFMEIPPQNYLIVTKYGNACLGILNGSEVGLKDLSILGDITM 396

Query: 439 MGHRIVFDRENLKLAWSHSKCEEV 462
               +++D E  ++ W  + C+ +
Sbjct: 397 QDQMVIYDNEKGQIGWIRAPCDRI 420


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 104/408 (25%), Positives = 168/408 (41%), Gaps = 57/408 (13%)

Query: 100 HFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDR 156
           H  GN +   L+Y  + +G+P   + + +D GS+L W  C   C  CA      Y     
Sbjct: 29  HVGGNIYPDGLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLY----- 83

Query: 157 NLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
                   +   +K V C  P+C          C S    C Y  +Y+ + +S+ G LV+
Sbjct: 84  --------NPKKAKVVDCHLPVCAQIQQGGSYECNSDVKQCDYEVEYA-DGSSTMGVLVE 134

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 270
           D L +    +    + +Q+  IIGCG  Q G+     A+ DGV+GL    V++P+ LA+ 
Sbjct: 135 DTLTV----RLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEK 190

Query: 271 GLIQNSFSICFDE--NDSGSVFFGDQGPATQQST--------SFLPIGEKYDAYFVGVES 320
           G+I+N    C  +  N  G +FFGD+   +   T          L    +  +   G +S
Sbjct: 191 GIIKNVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDS 250

Query: 321 YCIGN-SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
             + N   LT+S    + DSG SFT+L  + YA V+    K     R+     +  YC+ 
Sbjct: 251 LVLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQSGLLRVK-SDTTLPYCWR 309

Query: 380 ASS--EEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTV------FCLTVMSTD 427
             S  + +  V      + L F     F   + +   P  +G+ +       CL ++   
Sbjct: 310 GPSPFQSITDVHQYFKTLTLDFGGRNWFATDSTLDLSP--QGYLIVSTQGNVCLGILDAS 367

Query: 428 GD----YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLV 471
           G       IIG   M G+ +V+D    ++ W    C     K+    V
Sbjct: 368 GASLEVTNIIGDVSMRGYLVVYDNVRDRIGWIRRNCHSRPTKTSSQFV 415


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 159/360 (44%), Gaps = 44/360 (12%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP V +L   D GS+L W  C  C++C       Y  L      ++P  S+S  +V C
Sbjct: 86  IGTPPVDYLGIADTGSDLTWAQCLPCLKC-------YQQLR---PIFNPLKSTSFSHVPC 135

Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           +   C +       ++  C Y   Y  + T S G L  + + + S       SSV+S  +
Sbjct: 136 NTQTCHAVDDGHCGVQGVCDYSYTYG-DRTYSKGDLGFEKITIGS-------SSVKS--V 185

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD---ENDSGSVF 290
           IGCG   +G +       GV+GLG G +S+ S +++   I   FS C      + +G + 
Sbjct: 186 IGCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKIN 242

Query: 291 FGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS---CLTQSGFQALVDSGASFTF 345
           FG     +       P+  K     Y++ +E+  IGN       + G   ++DSG + +F
Sbjct: 243 FGQNAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQG-NVIIDSGTTLSF 301

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY----NASSEEMLKVPDMRLIFSKNQSF 401
           LP E+Y  VV    K+V +KR+   GN W  C+    N ++   + +   +     N + 
Sbjct: 302 LPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNL 361

Query: 402 VVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +  N       N    V CLT+   S   ++GIIG   +    I +D E  +L++  + C
Sbjct: 362 LPVNTFQKVANN----VNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 417


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 95/377 (25%), Positives = 165/377 (43%), Gaps = 35/377 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T + +G P   ++V +D GS++LWV C+ C  C   SA     L+  L+ YDP  SS
Sbjct: 28  LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSA-----LNIPLTMYDPRESS 82

Query: 168 SSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           ++  VSCS PLC      + + C    + C YI  Y  + ++S GY V D +     S +
Sbjct: 83  TTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYG-DGSTSEGYYVRDAMQYNVISSN 141

Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              ++  S V+ GC  +QTG       A DG++G G  ++SVP+ LA    I   FS C 
Sbjct: 142 G-LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL 200

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
            E +         G   +   ++ P+      Y V +    + ++ L        + +  
Sbjct: 201 -EGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDT 259

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG +  + P+  Y   V    +  S+  + +QG   + C+  S       P++ L
Sbjct: 260 GVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQ-CFLVSGRLSDLFPNVTL 318

Query: 394 IFSKNQSFVVRNH--IFSFPENEGFT-VFCLTVMSTDGDYG--------IIGQNFMMGHR 442
            F      +  ++  ++      G T V+C+   S+    G        I+G   +    
Sbjct: 319 NFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKL 378

Query: 443 IVFDRENLKLAWSHSKC 459
           +V+D +N ++ W    C
Sbjct: 379 VVYDLDNSRIGWMSYNC 395


>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 125/483 (25%), Positives = 199/483 (41%), Gaps = 80/483 (16%)

Query: 31  RFSDEA-KERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR----VKLQSNNNS 85
           R SDE  ++   S+     V   + K  + E+ E +L+ D   +  +    + L+  N  
Sbjct: 107 RVSDERNRDDDSSRETTSFVFPVYHKLRAREFHERILAEDLGLENGKFVESMDLELVNPV 166

Query: 86  SRNQLLFPSEGS---QTHFF--GNQFY--WLHYTWIDIGTPNVS--FLVALDAGSNLLWV 136
             N +L  S GS    T  F  G   Y   L+YT I +G P     + + +D GS+L W+
Sbjct: 167 KVNDVLSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWI 226

Query: 137 PCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC------KSRSSCKSL 188
            C   C  CA  +   Y     NL             V  S P C      +    C+S 
Sbjct: 227 QCDAPCTSCAKGANQLYKPRKDNL-------------VRSSEPFCVEVQRNQLTEHCESC 273

Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
              C Y  +Y+ + + S G L  D  HL    K    S  +S ++ GCG  Q G  L+  
Sbjct: 274 HQ-CDYEIEYA-DHSYSMGVLTKDKFHL----KLHNGSLAESDIVFGCGYDQQGLLLNTL 327

Query: 249 -APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFG-DQGPATQQSTSF 304
              DG++GL    +S+PS LA  G+I N    C   D N  G +F G D  P+     ++
Sbjct: 328 LKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPS--HGMTW 385

Query: 305 LPI--GEKYDAYFVGVESYCIGNSCLTQSG-----FQALVDSGASFTFLPTEIYAEVVVK 357
           +P+      + Y + V     GN+ L+  G      + L D+G+S+T+ P + Y+++V  
Sbjct: 386 VPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTS 445

Query: 358 FDKLVSSKRISLQGN--SWKYCYNASSEE---------------MLKVPDMRLIFSKNQS 400
             + VS   ++   +  +   C+ A +                  L++    LI SK   
Sbjct: 446 LQE-VSDLELTRDDSDEALPICWRAKTNSPISSLSDVKKFFRPITLQIGSKWLIISKK-- 502

Query: 401 FVVRNHIFSFPENEGFTVFCLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 456
            +++   +    N+G    CL ++      DG   IIG   M G  IV+D    ++ W  
Sbjct: 503 LLIQPEDYLIISNKGNV--CLGILDGSNVHDGSTIIIGDISMRGRLIVYDNVKQRIGWMK 560

Query: 457 SKC 459
           S C
Sbjct: 561 SDC 563


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 93/384 (24%), Positives = 168/384 (43%), Gaps = 55/384 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  G+P   F + +D GS + +VPC  C+QC           +     + P  
Sbjct: 88  YYTTRLWI--GSPPQEFALIVDTGSTVTYVPCSNCVQCG----------NHQDPRFQPEL 135

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + V C+     +  +C      C Y   Y+ E ++SSG L +D++     S+  PQ
Sbjct: 136 SSTYQPVKCN-----ADCNCDENGVQCTYERRYA-EMSTSSGVLAEDVMSFGKESELVPQ 189

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   ++G      A DG+MGLG G +SV   L   G++ NSFS+C+   D
Sbjct: 190 RAV-----FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMD 243

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
            G    V  G   P     +   P    Y  Y + ++   +    L          + A+
Sbjct: 244 VGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIHVAGKPLKLNPRTFDGKYGAI 301

Query: 337 VDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV---- 388
           +DSG ++ + P + Y    + ++K  K+   K+IS    ++K  C++ +  ++ ++    
Sbjct: 302 LDSGTTYAYFPEKAYYAFKDAIMK--KISFLKQISGPDPNFKDICFSGAGRDVTELPKVF 359

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRI 443
           P++ ++F+  Q   +    + F   +    +CL +     D      GII +N +    +
Sbjct: 360 PEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTL----V 415

Query: 444 VFDRENLKLAWSHSKCEEVIDKSH 467
            ++REN  + +  + C E+    H
Sbjct: 416 TYNRENSTIGFWKTNCSELWKNLH 439


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 163/380 (42%), Gaps = 53/380 (13%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP   F + LD GS+L W+  QC+ C       Y   ++N   YDP  SSS KN++C 
Sbjct: 201 VGTPPKHFSLILDTGSDLNWI--QCVPC-------YACFEQNGPYYDPKDSSSFKNITCH 251

Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
            P C+  SS      CK     CPY   Y     ++  + ++      +  +  P+  + 
Sbjct: 252 DPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIV 311

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGS 288
            +V+ GCG    G +   A    ++GLG G +S  + L    L  +SFS C  D N + S
Sbjct: 312 ENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFATQL--QSLYGHSFSYCLVDRNSNSS 366

Query: 289 V----FFGDQGPATQQS----TSFLPIGEKYDA----YFVGVESYCIGNSCL-------- 328
           V     FG+            TSF  +G K +     Y+V ++S  +G   L        
Sbjct: 367 VSSKLIFGEDKELLSHPNLNFTSF--VGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWH 424

Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
              Q G   ++DSG + T+     Y  +   F + +    +       K CYN S  E +
Sbjct: 425 LSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKM 484

Query: 387 KVPDMRLIFSKNQ--SFVVRNHIFSF-PENEGFTVFCLTVMST-DGDYGIIGQNFMMGHR 442
           ++P+  ++F+      F V N+     PE+    V CL ++ T      IIG        
Sbjct: 485 ELPEFAILFADGAMWDFPVENYFIQIEPED----VVCLAILGTPRSALSIIGNYQQQNFH 540

Query: 443 IVFDRENLKLAWSHSKCEEV 462
           I++D +  +L ++  KC +V
Sbjct: 541 ILYDLKKSRLGYAPMKCADV 560


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 109/415 (26%), Positives = 179/415 (43%), Gaps = 61/415 (14%)

Query: 66  LSNDWKRQKTRVKLQSNNNSSRNQL-----LFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
           L+N ++R  +R     N  ++   L     L P  G             +   + IGTP 
Sbjct: 55  LTNAFRRSLSRSATLLNRAATNGALDLQAPLTPGSGE------------YLMSVSIGTPP 102

Query: 121 VSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
           V ++   D GS+L+W  C  C++C   S   +          DP  S+S  +V C+   C
Sbjct: 103 VDYIGMADTGSDLMWAQCLPCLKCYKQSRPIF----------DPLKSTSFSHVPCNSQNC 152

Query: 180 KS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
           K+   S C + +  C Y   Y  + T + G L  + + + S       SSV+S  +IGCG
Sbjct: 153 KAIDDSHCGA-QGVCDYSYTYG-DQTYTKGDLGFEKITIGS-------SSVKS--VIGCG 201

Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD---ENDSGSVFFGDQ 294
            +  G +   +    V+GLG G +S+ S +++   I   FS C      + +G + FG  
Sbjct: 202 HESGGGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQN 258

Query: 295 GPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEI 350
              +       P+  K     Y+V +E+  IGN     S  Q   ++DSG + +FLP E+
Sbjct: 259 AVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPKEL 318

Query: 351 YAEVVVKFDKLVSSKRISLQGNSWKYCY----NASSEEMLKVPDMRLIFSKNQSFVVRNH 406
           Y  VV    K+V +KR+   GN W  C+    N ++   + +   +     N + +  N 
Sbjct: 319 YDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNT 378

Query: 407 IFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                 N    V CLT+   S   ++GIIG   +    I +D E  +L++  + C
Sbjct: 379 FQKVANN----VNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 429


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 166/362 (45%), Gaps = 46/362 (12%)

Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
           +   +Y + Y+   IGTP       +D GS+ +W   QC  C P        L++    +
Sbjct: 85  YAGSYYVMSYS---IGTPPFQLYGVVDTGSDGIWF--QCKPCKPC-------LNQTSPIF 132

Query: 162 DPSSSSSSKNVSCSHPLCK--SRSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
           +PS SS+ KN+ CS P+CK   ++ C S  K  C Y   Y  + + S G +  D L L S
Sbjct: 133 NPSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITY-LDRSGSQGDISKDTLTLNS 191

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
            +  +P S  +  ++IGCG K + +  +G A  G++G G G+ S+ S L  +  I   FS
Sbjct: 192 -NDGSPISFPK--IVIGCGHKNSLT-TEGLA-SGIIGFGRGNFSIVSQLGSS--IGGKFS 244

Query: 279 ICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGN------ 325
            C        N S  ++FGD    +       P+ + +    YF  +E++ +G+      
Sbjct: 245 YCLASLFSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLK 304

Query: 326 --SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
             S +  +   A++DSG++ T LP ++Y+++      +V  KR+         CY  +  
Sbjct: 305 DSSLIPDNEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTT-- 362

Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMM 439
             LK  ++ +I +  +   V+ + F+        V C    S+   + + G    QNF++
Sbjct: 363 --LKKYEVPIITAHFRGADVKLNAFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFLV 420

Query: 440 GH 441
           G+
Sbjct: 421 GY 422


>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 95/399 (23%), Positives = 170/399 (42%), Gaps = 50/399 (12%)

Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
           ++ + +D GS   +VPC+ C +C   +  YY   DR++         +S    C   +  
Sbjct: 50  TYDLIVDTGSARTYVPCKGCARCGEHAHGYY-DYDRSMEFERLDCGEASDATLCEETM-- 106

Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
            + +C+S    C Y+  Y+ E +SS GY+V D + L        + ++ + +  GC   +
Sbjct: 107 -KGTCQS-DGRCSYVVSYA-EGSSSRGYVVRDRVRLG-------EGTLSAMLAFGCEEAE 156

Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDS----GSVFFGD 293
           T +  +  A DG+ G G G  +V + LA AGLI+N FS C   F  N      G   FG 
Sbjct: 157 TNAIYEQKA-DGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLGRFDFGA 215

Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ-SGFQALVDSGASFTFLPTEIYA 352
             PA  + T  +        + V   S+ +G+S +   + +   +DSG +FTF+P  ++ 
Sbjct: 216 DAPALAR-TPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFVPRSVWV 274

Query: 353 EVVVKFDKLVSSKRISL-QGNSWKY---CYNASSEEMLKV----------PDMRLIFSKN 398
               + D   +   + +  G   +Y   CY  S+  M             P + + +   
Sbjct: 275 SFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLTIAYEGG 334

Query: 399 QSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 456
            S  +   N++F+   N     FC+ + +   +  ++GQ  M    + FD  N ++  + 
Sbjct: 335 VSLTLGPENYLFAHETNS--AAFCVGIFANPNNQILLGQITMRDTLMEFDVANSRVGMAP 392

Query: 457 SKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQA 495
           + C  + +K + H        SP P P+     +  G A
Sbjct: 393 ANCRRLREK-YTH-------DSPEPTPSNSSTPSGGGDA 423


>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 97/377 (25%), Positives = 165/377 (43%), Gaps = 42/377 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +YT I +G P   + + +D GS+L W+ C   C  CA      Y      +         
Sbjct: 194 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV-------- 245

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             +++ C   L   ++ C + K  C Y  +Y+ + +SS G L  D +H+ + +       
Sbjct: 246 PPRDLLCQE-LQGDQNYCATCKQ-CDYEIEYA-DRSSSMGVLAKDDMHMIATNG----GR 298

Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
            +   + GC   Q G  L   A  DG++GL    +S+PS LA  G+I N F  C   + N
Sbjct: 299 EKLDFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPN 358

Query: 285 DSGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSG-----FQALVD 338
             G +F GD     +   ++ PI G   + Y    +    G+  L   G      Q + D
Sbjct: 359 GGGYMFLGDDY-VPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFD 417

Query: 339 SGASFTFLPTEIYAEVV--VKFD--KLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMR 392
           SG+S+T+LP EIY ++V  +K+D    V     +     WK  ++    E +K     + 
Sbjct: 418 SGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTTLPLCWKADFDVRYLEDVKQFFKPLN 477

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIV 444
           L F  N+ FV+       P++          CL +++  + D+    I+G   + G  +V
Sbjct: 478 LHFG-NRWFVIPRTFTILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGDVSLRGKLVV 536

Query: 445 FDRENLKLAWSHSKCEE 461
           +D E  ++ W+ S+C +
Sbjct: 537 YDNERRQIGWADSECTK 553


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 89/329 (27%), Positives = 145/329 (44%), Gaps = 34/329 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++  I IGTP+  + V +D GS++LWV C  C +C   S      L  +L+ YD  +S+
Sbjct: 77  LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 131

Query: 168 SSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           +S  V C    C         CK  L+  C Y   Y  + +S++GY V D +     S +
Sbjct: 132 TSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQDFVQYNRISGN 188

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              +    +V+ GCG KQ+G     + A DG++G G  + S+ S LA +G ++  FS C 
Sbjct: 189 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 248

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGE--------KYDAYFVGVESYCIGNSCLT---- 329
           D  D G +F    G   +    FL +              Y V ++   +G   L     
Sbjct: 249 DNVDGGGIFA--IGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSD 306

Query: 330 --QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
             +SG +   ++DSG +  + P E+Y  ++ K        R+     ++  C++ +    
Sbjct: 307 AFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVD 365

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
              P + L F K+ S  V  H + F   E
Sbjct: 366 DGFPTVTLHFDKSISLTVYPHEYLFQVKE 394


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 93/384 (24%), Positives = 168/384 (43%), Gaps = 55/384 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  G+P   F + +D GS + +VPC  C+QC           +     + P  
Sbjct: 88  YYTTRLWI--GSPPQEFALIVDTGSTVTYVPCSNCVQCG----------NHQDPRFQPEL 135

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + V C+     +  +C      C Y   Y+ E ++SSG L +D++     S+  PQ
Sbjct: 136 SSTYQPVKCN-----ADCNCDENGVQCTYERRYA-EMSTSSGVLAEDVMSFGKESELVPQ 189

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   ++G      A DG+MGLG G +SV   L   G++ NSFS+C+   D
Sbjct: 190 RAV-----FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMD 243

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
            G    V  G   P     +   P    Y  Y + ++   +    L          + A+
Sbjct: 244 VGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIHVAGKPLKLNPRTFDGKYGAI 301

Query: 337 VDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV---- 388
           +DSG ++ + P + Y    + ++K  K+   K+IS    ++K  C++ +  ++ ++    
Sbjct: 302 LDSGTTYAYFPEKAYYAFKDAIMK--KISFLKQISGPDPNFKDICFSGAGRDVTELPKVF 359

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRI 443
           P++ ++F+  Q   +    + F   +    +CL +     D      GII +N +    +
Sbjct: 360 PEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTL----V 415

Query: 444 VFDRENLKLAWSHSKCEEVIDKSH 467
            ++REN  + +  + C E+    H
Sbjct: 416 TYNRENSTIGFWKTNCSELWKNLH 439


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 101/385 (26%), Positives = 174/385 (45%), Gaps = 39/385 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++  I +GTP   + V +D GS++LWV C  C  C   S      L   LS Y PSSSS
Sbjct: 73  LYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKS-----DLGIELSLYSPSSSS 127

Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           +S  V+C+   C S         + +  C Y   Y  + +S++GY V D + L   + + 
Sbjct: 128 TSNRVTCNQDFCTSTYDGPIPGCTPELLCEYRVAYG-DGSSTAGYFVRDHVVLDRVTGNF 186

Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
             +S   S++ GCG +Q+G      AA DG++G G  + S+ S LA +G ++  F+ C D
Sbjct: 187 QTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLD 246

Query: 283 ENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
             + G +F  G+      ++T  +P    Y+ +   +E   + N  L        T    
Sbjct: 247 NINGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIE---VDNEVLNLPTDVFDTDLRK 303

Query: 334 QALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             ++DSG +  + P  IY  ++ K F +  + K  +++     + Y+ + ++    P + 
Sbjct: 304 GTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTCFEYDGNVDDGF--PTVT 361

Query: 393 LIFSKNQSFVVRNH--IFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFMMGHRIV- 444
             F  + S  V  H  +F    N+    +C+        S DG   I+  + ++ +R+V 
Sbjct: 362 FHFEDSLSLTVYPHEYLFDIDSNK----WCVGWQNSGAQSRDGKDMILLGDLVLQNRLVM 417

Query: 445 FDRENLKLAWSHSKCEEVIDKSHVH 469
           +D EN  + W+   C   I     H
Sbjct: 418 YDLENQTIGWTEYNCSSSIKVRDEH 442


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 103/378 (27%), Positives = 165/378 (43%), Gaps = 52/378 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP    L+ LD GS+++W     +QCAP    Y    D++   +DP +S S 
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVW-----LQCAPCRRCY----DQSGQMFDPRASHSY 197

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             V C+ PLC+   S  C   +  C Y   Y  + + ++G    + L  AS ++  P+  
Sbjct: 198 GAVDCAAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATETLTFASGAR-VPR-- 253

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---- 283
               V +GCG    G ++  A    ++GLG G +S PS +++      SFS C  +    
Sbjct: 254 ----VALGCGHDNEGLFVAAAG---LLGLGRGSLSFPSQISR--RFGRSFSYCLVDRTSS 304

Query: 284 -----NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL------- 328
                + S +V FG        + SF P+ +       Y+V +    +G + +       
Sbjct: 305 SASATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSD 364

Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNAS 381
                 T  G   +VDSG S T L    YA +   F    +  R+S  G S +  CY+ S
Sbjct: 365 LRLDPSTGRG-GVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLS 423

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
             +++KVP + + F+      +    +  P +   T FC     TDG   IIG     G 
Sbjct: 424 GLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGGVSIIGNIQQQGF 482

Query: 442 RIVFDRENLKLAWSHSKC 459
           R+VFD +  +L +    C
Sbjct: 483 RVVFDGDGQRLGFVPKGC 500


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 109/432 (25%), Positives = 187/432 (43%), Gaps = 57/432 (13%)

Query: 53  WPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY- 111
           W KK        LL ++ + Q  ++++++  +S+  Q +  ++   T   G +   L+Y 
Sbjct: 86  WGKK----MRRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTS--GIKLETLNYI 139

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
             +++G  N+S +V  D GS+L WV     QC P  + Y    ++    YDPS SSS K 
Sbjct: 140 VTVELGGKNMSLIV--DTGSDLTWV-----QCQPCRSCY----NQQGPLYDPSVSSSYKT 188

Query: 172 VSCSHPLCKSRSSCKS-----------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           V C+   C+   +              +K  C Y+  Y  + + + G L  + + L    
Sbjct: 189 VFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYG-DGSYTRGDLASESIVLG--- 244

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
                 +   +++ GCGR   G +       G+MGLG   VS+ S   K       FS C
Sbjct: 245 -----DTKLENLVFGCGRNNKGLF---GGASGLMGLGRSSVSLVSQTLKT--FNGVFSYC 294

Query: 281 ---FDENDSGSVFFGDQGPATQQSTS--FLPIGEK---YDAYFVGVESYCIGNSCLTQSG 332
               ++  SG++ FG+     + STS  + P+ +       Y + +    IG   L    
Sbjct: 295 LPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLS 354

Query: 333 F--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
           F    L+DSG   T LP  IY  V  +F K  S    +   +    C+N +S E + +P 
Sbjct: 355 FGRGILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPT 414

Query: 391 MRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFDR 447
           +++IF  N    V    +F F + +  ++ CL +  +S + + GIIG       R+++D 
Sbjct: 415 IKMIFEGNAELEVDVTGVFYFVKPDA-SLVCLALASLSYENEVGIIGNYQQKNQRVIYDT 473

Query: 448 ENLKLAWSHSKC 459
              +L  +   C
Sbjct: 474 TQERLGIAGENC 485


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 162/378 (42%), Gaps = 31/378 (8%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T + +G+P   F V +D GS++LWV C  C  C   S      L   L+ +D SSSS
Sbjct: 65  LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSG-----LGIQLNFFDSSSSS 119

Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           ++  V CS P+C S      + C    + C Y   Y  + + +SGY V D L+  +    
Sbjct: 120 TAGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYE-DGSGTSGYYVSDTLYFDAILGE 178

Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
           +   +  + ++ GC   Q+G   +   A DG+ G G G++SV S L+  G+    FS C 
Sbjct: 179 SLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCL 238

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
            + +         G   +    + P+      Y + ++S  +    L        T +  
Sbjct: 239 -KGEGIGGGILVLGEILEPGMVYSPLVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQ 297

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPDM 391
             +VDSG +  +L  E Y   V   + +VS     +  +GN    CY  S+      P  
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQ---CYLVSTSVSQMFPLA 354

Query: 392 RLIFSKNQSFVVR--NHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 448
              F+   S V++  +++  F P   G  ++C+      G   I+G   +     V+D  
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQG-VTILGDLVLKDKIFVYDLV 413

Query: 449 NLKLAWSHSKCEEVIDKS 466
             ++ W++  C   ++ S
Sbjct: 414 RQRIGWANYDCSLSVNVS 431


>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1336

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 98/378 (25%), Positives = 161/378 (42%), Gaps = 42/378 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNL-SEYDPSS 165
           L++T + +G P  S+ + +D GS+L W+ C   C  C   +   Y     N+ S  D   
Sbjct: 193 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQYKPTRSNVVSSVDSLC 252

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
               KN    H         +SL   C Y   Y+ + +SS G LV D LHL + +     
Sbjct: 253 LDVQKNQKNGH-------HDESLLQ-CDYEIQYA-DHSSSLGVLVRDELHLVTTNG---- 299

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           S  + +V+ GCG  Q G  L+  A  DG+MGL    VS+P  LA  GLI+N    C   +
Sbjct: 300 SKTKLNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSND 359

Query: 285 DSGS--VFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSG----FQAL 336
            +G   +F GD         +++P+      D Y   +     GN  L   G     +  
Sbjct: 360 GAGGGYMFLGDDF-VPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQSKVGKVF 418

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCYNASSEEMLKVPDM 391
            DSG+S+T+ P E Y ++V   +++     +    ++     W+  +   S + +K    
Sbjct: 419 FDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQANFQIRSIKDVKDYFK 478

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVF------CLTVMS----TDGDYGIIGQNFMMGH 441
            L       + + + +F  P  EG+ +       CL ++      DG   I+G   + G+
Sbjct: 479 TLTLRFGSKWWILSTLFQIPP-EGYLIISNKGHVCLGILDGSKVNDGSSIILGDISLRGY 537

Query: 442 RIVFDRENLKLAWSHSKC 459
            +V+D    K+ W  + C
Sbjct: 538 SVVYDNVKQKIGWKRADC 555


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 106/375 (28%), Positives = 162/375 (43%), Gaps = 53/375 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP V F+   D GS+L W  CQ C  C P          ++   YDPS+SS+   V
Sbjct: 70  LAIGTPPVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPV 119

Query: 173 SCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            CS   C    +SR +C +   PC YI  YS +   S G L  + L + S     P  +V
Sbjct: 120 PCSSATCLPTWRSR-NCSNPSSPCRYIYSYS-DGAYSVGILGTETLTIGS---SVPGQTV 174

Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDE 283
              SV  GCG    G  L+     G +GLG G +   SLLA+ G+    FS C    F+ 
Sbjct: 175 SVGSVAFGCGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNS 226

Query: 284 NDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------- 328
                 F G       GP T QST  L        YFV ++   +G+  L          
Sbjct: 227 TMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLR 286

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
                  +VDSG +FT L    + EVV +  +L+    ++        C+ +   E   +
Sbjct: 287 ADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSP-CFPSPDGEPF-M 344

Query: 389 PDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           PD+ L F+      + R++  S+  NE  + FCL ++ +   +  +G       +++FD 
Sbjct: 345 PDLVLHFAGGADMRLHRDNYMSY--NEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDM 402

Query: 448 ENLKLAWSHSKCEEV 462
              +L++  + C ++
Sbjct: 403 TVGQLSFLPTDCSKL 417


>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 508

 Score = 99.4 bits (246), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 101/391 (25%), Positives = 166/391 (42%), Gaps = 47/391 (12%)

Query: 95  EGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYT 152
           EGS       + Y   YT I+IG P   + + +D GS L W+ C   C  C       Y 
Sbjct: 117 EGSTAAVLPERQY---YTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYK 173

Query: 153 SLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 212
               N+    P   S  + +  +   C    +CK     C Y   Y+ + +SS+G L  D
Sbjct: 174 PAKENIV---PPRDSHCQELQGNQNYC---DTCKQ----CDYEIAYA-DRSSSAGVLARD 222

Query: 213 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 271
            + L +    A        ++ GC   Q G  L   A+ DG++GL  G +S+P+ LAK G
Sbjct: 223 NMELIT----ADGERENMDLVFGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQG 278

Query: 272 LIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIG----EKYDAYFVGVESYCIGN 325
           +I N F  C   + SGS  +F GD     +   +++P+     + Y      V   C   
Sbjct: 279 IISNVFGHCIATDPSGSAYMFLGDDY-VPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQEL 337

Query: 326 SCLTQSG--FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC----YN 379
           +   Q+G   Q + DSG+S+T+ P EIY  ++   + +           +  +C    + 
Sbjct: 338 NVREQAGKLTQVIFDSGSSYTYFPHEIYTSLITSLEAVSPGFVRDESDQTLPFCMKPNFP 397

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PEN----EGFTVFCLTVMSTDG-DYG-- 431
             S + +K     L+   +++++V    F   PEN     G    CL V+  DG + G  
Sbjct: 398 VRSVDDVKQLHKPLLLHFSKTWLVIPRTFEISPENYLIISGKGNVCLGVL--DGTEIGHS 455

Query: 432 ---IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              +IG   + G  + +D +  ++ W+ S C
Sbjct: 456 STIVIGDVSLRGKLVAYDNDANQIGWAQSDC 486


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score = 99.0 bits (245), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 112/489 (22%), Positives = 194/489 (39%), Gaps = 89/489 (18%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           +S+  + P  + VE L  L + D  R   R+ LQ   +     L F  +G+   +     
Sbjct: 17  LSLERTIPLNHQVE-LTTLKARDRARHGGRI-LQ---DGGGGILDFSVQGTSDPYL---- 67

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
             L++T + +G+P   F V +D GS++LW+ C      P S    + L  +L+ +D +SS
Sbjct: 68  VGLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKS----SGLGIDLNYFDTASS 123

Query: 167 SSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           S++  VSCS P+C      + S C S  + C Y   Y  + + +SGY V D ++      
Sbjct: 124 STAALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYG-DGSGTSGYYVYDAMYFDVIMG 182

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
            +  S+  S+V+ GC   Q+G       A DG+ G G G +SV S ++  G+    FS C
Sbjct: 183 QSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHC 242

Query: 281 FDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQ 330
                SG   +  G+        T  +P+   Y+   + ++S  +    L        T 
Sbjct: 243 LKGQGSGGGILVLGEILEPNIVYTPLVPLQPHYN---LNLQSIAVNGQILPIDQDVFATG 299

Query: 331 SGFQALVDSGASFTFLPTEIY--------------------------------------- 351
           +    +VDSG +  +L  E Y                                       
Sbjct: 300 NNRGTIVDSGTTLAYLVQEAYDPFLNAGSPCHFFTHFNEPTNNIKYEDGNNNHQSRVKRH 359

Query: 352 ------AEVVVKFDKLVS------SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
                   +V+K   +++      SK I  +GN    CY   +      P + L F    
Sbjct: 360 YYDEVTLRLVLKHSAIITTTVSQFSKPIISKGNQ---CYLVPTSLGDIFPLVSLNFMGGA 416

Query: 400 SFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
           S V++   ++  +   +G  ++C+        Y I+G   +     V+D  N ++ W+  
Sbjct: 417 SMVLKPEQYLIHYGFLDGAAMWCIGFQKVQKGYTILGDLVLKDKIFVYDLANQRIGWTDY 476

Query: 458 KCEEVIDKS 466
            C   ++ S
Sbjct: 477 DCSLAVNVS 485


>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
 gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
          Length = 418

 Score = 98.6 bits (244), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 93/375 (24%), Positives = 151/375 (40%), Gaps = 46/375 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +Y  ++IG P   + + +D GS+L W+ C   C  C  +    Y      L         
Sbjct: 57  YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKL--------- 107

Query: 168 SSKNVSCSHPLCKSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
               V C++ +C +  S  S       +  C Y   Y T+  SS G LV D   L   +K
Sbjct: 108 ----VPCANSICTALHSGSSPNKKCTTQQQCDYQIKY-TDKASSLGVLVMDSFSLPLRNK 162

Query: 222 HAPQSSVQSSVIIGCGR-KQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNSF 277
               S+V+ S+  GCG  +Q G   +GAAP   DG++GLG G VS+ S L + G+ +N  
Sbjct: 163 ----SNVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVL 216

Query: 278 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQAL 336
             C   +  G +FFGD    T + T    +      Y+  G  +       L+    + +
Sbjct: 217 GHCLSTSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVV 276

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
            DSG+++T+   + Y   +      +S     +   S   C+    +    V D++  F 
Sbjct: 277 FDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKG-QKAFKSVSDVKKDFK 335

Query: 397 KNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIVFD 446
             Q    +N +   P      +      CL ++  DG      + IIG   M    +++D
Sbjct: 336 SLQFIFGKNAVMDIPPENYLIITKNGNVCLGIL--DGSAAKLSFSIIGDITMQDQMVIYD 393

Query: 447 RENLKLAWSHSKCEE 461
            E  +L W    C  
Sbjct: 394 NEKAQLGWIRGSCSR 408


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score = 98.6 bits (244), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 93/367 (25%), Positives = 160/367 (43%), Gaps = 36/367 (9%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +YT I +G P   + + +D GS+L W+ C   C  CA      Y      +    P   S
Sbjct: 191 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV---PPRDS 247

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             + +      C++   CK     C Y  +Y+ + +SS G L  D +HL + +       
Sbjct: 248 LCQELQGDQNYCET---CKQ----CDYEIEYA-DRSSSMGVLAKDDMHLIATNG----GR 295

Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
            +   + GC   Q G  L   A  DG++GL    +S+PS LA  G+I N F  C   + N
Sbjct: 296 EKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRETN 355

Query: 285 DSGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCL-TQSGFQALVDSGAS 342
             G +F GD     +   ++ PI G   + Y    +    G+  L   +  Q + DSG+S
Sbjct: 356 GGGYMFLGDDY-VPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSS 414

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
           +T+LP E+Y  ++    +   S        +   C+ A          + L F + + FV
Sbjct: 415 YTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGR-RWFV 473

Query: 403 VRNHIFSFPENEGFTVF------CLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLKL 452
           V       P++  + +       CL +++ T+ ++G   I+G   + G  +V+D E  ++
Sbjct: 474 VPKTFTIVPDD--YLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERRQI 531

Query: 453 AWSHSKC 459
            W++S+C
Sbjct: 532 GWANSEC 538


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score = 98.6 bits (244), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 96/382 (25%), Positives = 160/382 (41%), Gaps = 49/382 (12%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           T + IGTP   F + +D GS + +VPC  C QC                 + P  SS+ +
Sbjct: 79  TRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCG----------KHQDPRFQPDLSSTYR 128

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            V C+ P C    +C      C Y   Y+ E +SSSG + +D++   + S+  PQ +V  
Sbjct: 129 PVKCN-PSC----NCDDEGKQCTYERRYA-EMSSSSGVIAEDVVSFGNESELKPQRAV-- 180

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--SGS 288
               GC   +TG      A DG+MGLG G +SV   L   G+I +SFS+C+   D   G+
Sbjct: 181 ---FGCENVETGDLYSQRA-DGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGA 236

Query: 289 VFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
           +  G   P      S   P    Y  Y + ++   +    L             ++DSG 
Sbjct: 237 MVLGQISPPPNMVFSHSNPYRSPY--YNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGT 294

Query: 342 SFTFLPTEIYAEVVVKFDKLVSS-KRI-SLQGNSWKYCYNASSEEMLKV----PDMRLIF 395
           ++ + P   +  +     K +   K+I     N    C++ +  E+  +    P++ ++F
Sbjct: 295 TYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVF 354

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIVFDRENL 450
              Q   +    + F   +    +CL +     D      GI+ +N +    + +DREN 
Sbjct: 355 GSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTL----VTYDREND 410

Query: 451 KLAWSHSKCEEVIDKSHVHLVP 472
           K+ +  + C E+     V  VP
Sbjct: 411 KIGFWKTNCSELWKSLQVPGVP 432


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score = 98.6 bits (244), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 102/414 (24%), Positives = 176/414 (42%), Gaps = 49/414 (11%)

Query: 63  ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
           E+L  +  + +  R K   N++++       +    THF G      +   + +GTP   
Sbjct: 90  EILRRDQLRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGG-----YAVTVGLGTPKKD 144

Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-- 180
           F +  D GS+L W      QC P S   +    +N  ++DP+ S+S KN+SCS   CK  
Sbjct: 145 FSLLFDTGSDLTWT-----QCEPCSGGCF---PQNDEKFDPTKSTSYKNLSCSSEPCKSI 196

Query: 181 ---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
              S   C S  + C Y   Y T  T   G+L  + L +         S V  + +IGCG
Sbjct: 197 GKESAQGCSS-SNSCLYGVKYGTGYT--VGFLATETLTIT-------PSDVFENFVIGCG 246

Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVFFGDQG 295
            +  G +   A   G++GLG   V++PS  +     +N FS C   + S  G + FG   
Sbjct: 247 ERNGGRFSGTA---GLLGLGRSPVALPSQTSST--YKNLFSYCLPASSSSTGHLSFGG-- 299

Query: 296 PATQQSTSFLPIGEKY-DAYFVGVESYCIGNSCL--TQSGFQ---ALVDSGASFTFLPTE 349
               Q+  F PI  K  + Y + V    +G   L    S F+    ++DSG + T+LP+ 
Sbjct: 300 -GVSQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPST 358

Query: 350 IYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS--SEEMLKVPDMRLIFSKNQSFVVRNHI 407
            ++ +   F +++++  ++   +  + CY+ S  + + + +P + + F       + +  
Sbjct: 359 AHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSG 418

Query: 408 FSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                N G    CL       D D  I G      + +V+D     + ++   C
Sbjct: 419 IFIAAN-GLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471


>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
          Length = 446

 Score = 98.6 bits (244), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 99/392 (25%), Positives = 162/392 (41%), Gaps = 60/392 (15%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPS 164
           Y L+Y  + +G P+  + + +D+GS L W+ C   CI CA      Y     +L      
Sbjct: 76  YGLYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSL------ 129

Query: 165 SSSSSKNVSCSHPLCKSRSSC-------KSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
                  V    PLC +  +        K     C Y   Y+ +   S G+LV D +   
Sbjct: 130 -------VPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVAYA-DHGYSEGFLVRDSVRAL 181

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
             +K    + + ++ + GCG  Q  S  +  A  DG++GLG G  S+PS  AK GLI+N 
Sbjct: 182 LTNK----TVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNV 237

Query: 277 FSICF--DENDSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLTQSG- 332
              C      D G +FFGD   +T   T    +G      Y+VG      GN  L + G 
Sbjct: 238 IGHCIFGAGRDGGYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGD 297

Query: 333 ----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI------SLQGNSW--KYCYNA 380
                  + DSG+++T+   + Y   +    + +S K++      S     W  K  + +
Sbjct: 298 GKKLGGIIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCWRRKEGFRS 357

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV------FCLTVMSTDG----DY 430
            +E       + L F   ++      +  FP  EG+ V       CL +++       D 
Sbjct: 358 VAEAAAYFKPLTLKFRSTKT----KQMEIFP--EGYLVVNKKGNVCLGILNGTAIGIVDT 411

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            ++G     G  +V+D E  ++ W+ S C+E+
Sbjct: 412 NVLGDISFQGQLVVYDNEKNQIGWARSDCQEI 443


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 118/475 (24%), Positives = 199/475 (41%), Gaps = 80/475 (16%)

Query: 13  CILLDGSDAVS--FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDW 70
           C +   S A+S  FS +L+HR  D  K  +             P +N  ++      +  
Sbjct: 15  CFIASFSHALSNGFSVELIHR--DSPKSPYYK-----------PTENKYQHF----VDAA 57

Query: 71  KRQKTRVK--LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALD 128
           +R   R     + ++ S+    + P  G          Y + Y+   +GTP        D
Sbjct: 58  RRSINRANHFFKDSDTSTPESTVIPDRGG---------YLMTYS---VGTPPTKIYGIAD 105

Query: 129 AGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCK 186
            GS+++W+ C+ C QC           ++    ++PS SSS KN+ CS  LC S R +  
Sbjct: 106 TGSDIVWLQCEPCEQC----------YNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSC 155

Query: 187 SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD 246
           S ++ C Y   Y  + + S G L  D L L S S  +P S  +  ++IGCG    G++  
Sbjct: 156 SDQNSCQYKISYG-DSSHSQGDLSVDTLSLESTSG-SPVSFPK--IVIGCGTDNAGTF-- 209

Query: 247 GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDSGSVFFGDQGPATQQ 300
           G A  G++GLG G VS+ + L  +  I   FS C       + N S  + FGD    +  
Sbjct: 210 GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGD 267

Query: 301 STSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG--------FQALVDSGASFTFLPTEIY 351
                P+ +K    YF+ ++++ +GN  +   G           ++DSG + T +P+++Y
Sbjct: 268 GVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVY 327

Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFS-- 409
             +      LV   R+      +  CY+  S E     D  +I    +   V  H  S  
Sbjct: 328 TNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEY----DFPIITVHFKGADVELHSISTF 383

Query: 410 FPENEGFTVFCLTVMSTDGD-YGIIG-QNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            P  +G   F        G  +G +  QN ++G    +D +   +++  + C +V
Sbjct: 384 VPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVG----YDLQQKTVSFKPTDCTKV 434


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score = 98.2 bits (243), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 169/379 (44%), Gaps = 32/379 (8%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L++T + +G+P   F V +D GS++LWV C  C  C   S      L   LS +DPSSSS
Sbjct: 85  LYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSG-----LGIELSFFDPSSSS 139

Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           ++  VSCSHP+C S      + C    + C Y   Y  + + ++GY V D+L+  +    
Sbjct: 140 TTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYG-DGSGTTGYYVSDMLYFDTVLGD 198

Query: 223 APQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
           +  ++  +S++ GC   Q+G       A DG+ G G  D+SV S L+  G+    FS C 
Sbjct: 199 SLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCL 258

Query: 282 D-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
             E D G       G   + +  + P+      Y + ++S  +    L        T + 
Sbjct: 259 KGEGDGGGKLV--LGEILEPNIIYSPLVPSQSHYNLNLQSISVNGQLLPIDPAVFATSNN 316

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPD 390
              +VDSG + T+L    Y   V      VSS    +  +GN    CY  S+      P 
Sbjct: 317 QGTIVDSGTTLTYLVETAYDPFVSAITATVSSSTTPVLSKGNQ---CYLVSTSVDEIFPP 373

Query: 391 MRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDR 447
           + L F+   S V++   ++     ++G  ++C+      +    I+G   +     V+D 
Sbjct: 374 VSLNFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPGITILGDLVLKDKIFVYDL 433

Query: 448 ENLKLAWSHSKCEEVIDKS 466
            + ++ W++  C   ++ S
Sbjct: 434 AHQRIGWANYDCSLSVNVS 452


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score = 98.2 bits (243), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 151/369 (40%), Gaps = 39/369 (10%)

Query: 78  KLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
           +L++ + +   +LL        FP +G+   F       L+YT + +GTP   F V +D 
Sbjct: 45  QLKARDEARHGRLLQSLGGVIDFPVDGTFDPFV----VGLYYTKLRLGTPPRDFYVQVDT 100

Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSS 184
           GS++LWV C      P +    + L   L+ +DP SS ++  +SCS   C      S S 
Sbjct: 101 GSDVLWVSCASCNGCPQT----SGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
           C    + C Y   Y  + + +SG+ V D+L        +   +  + V+ GC   QTG  
Sbjct: 157 CSVQNNLCAYTFQYG-DGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDL 215

Query: 245 LDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-ENDSGSVFFGDQGPATQQST 302
           +    A DG+ G G   +SV S LA  G+    FS C   EN  G +     G   + + 
Sbjct: 216 VKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILV--LGEIVEPNM 273

Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEV 354
            F P+      Y V + S  +    L        T +G   ++D+G +  +L    Y   
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333

Query: 355 VVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
           V      VS   + +  +GN    CY  ++      P + L F+   S  +    +   +
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ---CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQ 390

Query: 413 NEGFTVFCL 421
           N   +  C 
Sbjct: 391 NNVASALCF 399


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score = 98.2 bits (243), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 86/367 (23%), Positives = 156/367 (42%), Gaps = 41/367 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + +GTP    LV LD GS+  W     IQC P    Y    +++ + +DPS SS+ 
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSW-----IQCKPCPDCY----EQHEALFDPSKSSTY 184

Query: 170 KNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
            +++CS   C+      + +C S K  CPY   Y+ +D+ + G L  D L L      +P
Sbjct: 185 SDITCSSRECQELGSSHKHNCSSDKK-CPYEITYA-DDSYTVGNLARDTLTL------SP 236

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
             +V    + GCG    GS+      DG++GLG G  S+ S +  A      FS C   +
Sbjct: 237 TDAVP-GFVFGCGHNNAGSF---GEIDGLLGLGRGKASLSSQV--AARYGAGFSYCLPSS 290

Query: 285 DSGSVFFGDQG-----PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGF 333
            S + +    G     P   Q T  +  G+    Y++ +    +    +        +  
Sbjct: 291 PSATGYLSFSGAAAAAPTNAQFTEMV-AGQHPSFYYLNLTGITVAGRAIKVPPSVFATAA 349

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG +F+ LP   YA +       +   + +     +  CY+ +  E +++P + L
Sbjct: 350 GTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVAL 409

Query: 394 IFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
           +F+   +  +  + +     N   T         D   G++G        +++D +N K+
Sbjct: 410 VFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKV 469

Query: 453 AWSHSKC 459
            +  + C
Sbjct: 470 GFGANGC 476


>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 154/375 (41%), Gaps = 54/375 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +Y  ++IG P   + + +D GS+L W+ C   C  C  +   +Y               +
Sbjct: 73  YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWY-------------KPT 119

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
            +K V C+  LC S +  K    P  C Y   Y T+  SS G L+ D   L+  +     
Sbjct: 120 KNKIVPCAASLCTSLTPNKKCAVPQQCDYQIKY-TDKASSLGVLIADNFTLSLRN----S 174

Query: 226 SSVQSSVIIGCGR-KQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
           S+V++++  GCG  +Q G   +GA   A DG++GLG G VS+ S L + G+ +N    CF
Sbjct: 175 STVRANLTFGCGYDQQVGK--NGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCF 232

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVDS 339
             N  G +FFGD    T + T ++P+        Y  G  +       L     + + DS
Sbjct: 233 STNGGGFLFFGDDIVPTSRVT-WVPMARTTSGNYYSPGSGTLYFDRRSLGMKPMEVVFDS 291

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS------SEEMLKVPDMRL 393
           G+++ +   E Y   V      +S     +   S   C+         SE       + L
Sbjct: 292 GSTYAYFAAEPYQATVSALKAGLSKSLKEVSDVSLPLCWKGQKVFKSVSEVKNDFKSLFL 351

Query: 394 IFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTDG-----DYGIIGQNFMMGHRIV 444
            F KN    +       PEN      +   CL ++  DG      + IIG   M    I+
Sbjct: 352 SFGKNSVMEIP------PENYLIVTKYGNVCLGIL--DGTTAKLKFNIIGDITMQDQMII 403

Query: 445 FDRENLKLAWSHSKC 459
           +D E  +L W    C
Sbjct: 404 YDNEKGQLGWIRGSC 418


>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
 gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
 gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
 gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
 gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
          Length = 583

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 118/452 (26%), Positives = 190/452 (42%), Gaps = 63/452 (13%)

Query: 53  WPKKNSVEYLELLLSNDWKRQKTR----VKLQSNNNSSRNQLLFPSEGS---QTHFF--G 103
           + K  + E+ E +L  D   +       + L+  N    N +L  S GS    T  F  G
Sbjct: 135 YHKLRAREFHERILEEDLGLENENFVESMDLELVNPVKVNDVLSTSAGSIDSSTTIFPVG 194

Query: 104 NQFY--WLHYTWIDIGTPNVS--FLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
              Y   L+YT I +G P     + + +D GS L W+ C   C  CA  +   Y     N
Sbjct: 195 GNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDN 254

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
           L     SS +    V  +  L +   +C      C Y  +Y+ + + S G L  D  HL 
Sbjct: 255 LVR---SSEAFCVEVQRNQ-LTEHCENCHQ----CDYEIEYA-DHSYSMGVLTKDKFHL- 304

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNS 276
              K    S  +S ++ GCG  Q G  L+     DG++GL    +S+PS LA  G+I N 
Sbjct: 305 ---KLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNV 361

Query: 277 FSICF--DENDSGSVFFG-DQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQS 331
              C   D N  G +F G D  P+     +++P+    + DAY + V     G   L+  
Sbjct: 362 VGHCLASDLNGEGYIFMGSDLVPS--HGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLD 419

Query: 332 G-----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNASSE- 383
           G      + L D+G+S+T+ P + Y+++V    + VS   ++   +  +   C+ A +  
Sbjct: 420 GENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQE-VSGLELTRDDSDETLPICWRAKTNF 478

Query: 384 EMLKVPDMRLIFSK------NQSFVVRNHIFSFPE------NEGFTVFCLTVMS----TD 427
               + D++  F        ++  ++   +   PE      N+G    CL ++      D
Sbjct: 479 PFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNV--CLGILDGSSVHD 536

Query: 428 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           G   I+G   M GH IV+D    ++ W  S C
Sbjct: 537 GSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 568


>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1388

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 98/387 (25%), Positives = 166/387 (42%), Gaps = 45/387 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNL-SEYDPSS 165
           L++T + +G P  S+ + +D GS+L W+ C   CI C   +   Y     N+ S  D   
Sbjct: 191 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLYKPTRSNVVSSVDALC 250

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
               KN    H         +SL   C Y   Y+ + +SS G LV D LHL + +     
Sbjct: 251 LDVQKNQKNGH-------HDESLLQ-CDYEIQYA-DHSSSLGVLVRDELHLVTTNG---- 297

Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           S  + +V+ GCG  Q G  L+     DG+MGL    VS+P  LA  GLI+N    C   +
Sbjct: 298 SKTKLNVVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSND 357

Query: 285 DSGS--VFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSG----FQAL 336
            +G   +F GD         +++P+      D Y   +     GN  L   G     + +
Sbjct: 358 GAGGGYMFLGDDF-VPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQSKVGKMV 416

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCYNASSEEMLKVPDM 391
            DSG+S+T+ P E Y ++V   +++     +    ++     W+  +   S + +K    
Sbjct: 417 FDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQANFPIKSVKDVKDYFK 476

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVF------CLTVMS----TDGDYGIIGQNFMMGH 441
            L       + + + +F     EG+ +       CL ++      DG   I+G   + G+
Sbjct: 477 TLTLRFGSKWWILSTLFQISP-EGYLIISNKGHVCLGILDGSNVNDGSSIILGDISLRGY 535

Query: 442 RIVFDRENLKLAWSHSKCEEVIDKSHV 468
            +V+D    K+ W  + C   +D+ ++
Sbjct: 536 SVVYDNVKQKIGWKRADC---VDRCYI 559


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 100/390 (25%), Positives = 168/390 (43%), Gaps = 27/390 (6%)

Query: 92  FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYY 151
           FP +G+   F       L+YT + +GTP   F V +D GS++LWV C      P +    
Sbjct: 70  FPVDGASDPFL----VGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKT---- 121

Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGY 208
           + L   LS +DP  SSS+  VSCS   C S    +S   P   C Y   Y  + + +SGY
Sbjct: 122 SELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGY 180

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLL 267
            + D +   +        +  +  + GC   Q+G       A DG+ GLG G +SV S L
Sbjct: 181 YISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQL 240

Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
           A  GL    FS C   + SG       G   +  T + P+      Y V ++S  +    
Sbjct: 241 AVQGLAPRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQI 299

Query: 328 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
           L          +G   ++D+G +  +LP E Y+  +      VS     +   S++ C+ 
Sbjct: 300 LPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQ-CFE 358

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQN 436
            ++ ++   P + L F+   S V+  R ++  F  + G +++C+     +     I+G  
Sbjct: 359 ITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIF-SSSGSSIWCIGFQRMSHRRITILGDL 417

Query: 437 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 466
            +    +V+D    ++ W+   C   ++ S
Sbjct: 418 VLKDKVVVYDLVRQRIGWAEYDCSLEVNVS 447


>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 381

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 112/408 (27%), Positives = 173/408 (42%), Gaps = 62/408 (15%)

Query: 84  NSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--C 140
           N+ +N  +F      +   GN +   L+Y  + IG P   + + +D GS+L W+ C   C
Sbjct: 2   NADKNATVF------SQLRGNIYPDGLYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPC 55

Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYI 195
             CA        S    L  YDP  +   + V C  PLC         +C      C Y 
Sbjct: 56  RSCA--------SGPHGL--YDPKKA---RLVDCRVPLCALVQQGGSYACGGPVRQCDYD 102

Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVM 254
            +Y+ + +S+ G L++D + L     +  +S  +++ IIGCG  Q G+     A+ DGVM
Sbjct: 103 VEYA-DGSSTMGVLMEDTITL--LLTNGTRS--KTTAIIGCGYDQQGTLAQTPASTDGVM 157

Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ-GPATQQSTSFLPIGEKY 311
           GL    +S+PS LAK G+++N    C     N  G +FFGD   PA     ++ PI  K 
Sbjct: 158 GLSSAKISLPSQLAKKGIVRNVIGHCLAGGSNGGGYLFFGDSLVPAL--GMTWTPIMGKS 215

Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK---RIS 368
               +G +S    +      G   + DSG SFT+L  E Y  V+   +  V      RI 
Sbjct: 216 ITGNIGGKSGDADDKTGDIGG--VMFDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIK 273

Query: 369 LQGNSWKYCYNASS--EEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTV---- 418
              N+  +C+   S  E +  V      + L F K   +     +   P  EG+ +    
Sbjct: 274 TD-NTLPFCWRGPSPFESVADVQRYFKTVTLDFGKRNWYSASRVLELSP--EGYLIVSTQ 330

Query: 419 --FCLTVMSTDGD----YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
              CL ++   G       IIG   M G+ +V+D    ++ W    C 
Sbjct: 331 GNVCLGILDASGASLEVTNIIGDVSMRGYLVVYDNARNQIGWVRRNCH 378


>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
 gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
          Length = 410

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 167/389 (42%), Gaps = 52/389 (13%)

Query: 105 QFYWLHYTWIDIGTPNVS--FLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSE 160
           Q   L+YT I +G P     + + +D GS L W+ C   C  CA  +   Y     NL  
Sbjct: 25  QMGMLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVR 84

Query: 161 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
              SS +    V   + L +   +C      C Y  +Y+ + + S G L  D  HL    
Sbjct: 85  ---SSEAFCVEVQ-RNQLTEHCENCHQ----CDYEIEYA-DHSYSMGVLTKDKFHL---- 131

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
           K    S  +S ++ GCG  Q G  L+     DG++GL    +S+PS LA  G+I N    
Sbjct: 132 KLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGH 191

Query: 280 CF--DENDSGSVFFG-DQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG-- 332
           C   D N  G +F G D  P+     +++P+    + DAY + V     G   L+  G  
Sbjct: 192 CLASDLNGEGYIFMGSDLVPS--HGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGEN 249

Query: 333 ---FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNASSE-EML 386
               + L D+G+S+T+ P + Y+++V    + VS   ++   +  +   C+ A +     
Sbjct: 250 GRVGKVLFDTGSSYTYFPNQAYSQLVTSLQE-VSGLELTRDDSDETLPICWRAKTNFPFS 308

Query: 387 KVPDMRLIFS------KNQSFVVRNHIFSFPE------NEGFTVFCLTVMST----DGDY 430
            + D++  F        ++  ++   +   PE      N+G    CL ++      DG  
Sbjct: 309 SLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNV--CLGILDGSSVHDGST 366

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            I+G   M GH IV+D    ++ W  S C
Sbjct: 367 IILGDISMRGHLIVYDNVKRRIGWMKSDC 395


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/376 (24%), Positives = 156/376 (41%), Gaps = 47/376 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +Y  ++IG P   + + +D GS+L W+ C   C  C  +    Y               +
Sbjct: 53  YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY-------------RPT 99

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFSK 221
           +++ V C++ LC +  S +   + CP      Y   Y T+  SS G L++D     SFS 
Sbjct: 100 ANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----SFSL 153

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               S+++  +  GCG  Q         AA DG++GLG G VS+ S L + G+ +N    
Sbjct: 154 PMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGH 213

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALV 337
           C   N  G +FFGD    + + T ++P+ ++     Y  G  +       L     + + 
Sbjct: 214 CLSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVF 272

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF-S 396
           DSG+++T+   + Y  VV      +S     +   +   C+    +    V D++  F S
Sbjct: 273 DSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKG-QKAFKSVFDVKNEFKS 331

Query: 397 KNQSFV-VRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIVF 445
              SF   +N     P      V      CL ++  DG      + +IG   M    +++
Sbjct: 332 MFLSFASAKNAAMEIPPENYLIVTKNGNVCLGIL--DGTAAKLSFNVIGDITMQDQMVIY 389

Query: 446 DRENLKLAWSHSKCEE 461
           D E  +L W+   C  
Sbjct: 390 DNEKSQLGWARGACTR 405


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 93/385 (24%), Positives = 160/385 (41%), Gaps = 62/385 (16%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
           ++  I +G P    LV +D GS+L+W     +QC P    Y     R ++  YDP +S +
Sbjct: 92  YFAVIGVGDPPTHALVVIDTGSDLIW-----LQCLPCRRCY-----RQVTPLYDPRNSKT 141

Query: 169 SKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
            + + C+ P C+       C +    C Y+  Y  + ++SSG L  D L L       P 
Sbjct: 142 HRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYG-DGSASSGDLATDTLVL-------PD 193

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-- 283
            +   +V +GCG    G     A   G++G G G +S P+ LA A    + FS C  +  
Sbjct: 194 DTRVHNVTLGCGHDNEGLLASAA---GLLGAGRGQLSFPTQLAPA--YGHVFSYCLGDRM 248

Query: 284 ----NDSGSVFFGDQGPATQQSTSFLPIG---EKYDAYFVGVESYCIGNSCLTQSGFQ-- 334
               N S  + FG        ST+F P+     +   Y+V +  + +G   +  +GF   
Sbjct: 249 SRARNSSSYLVFGRT--PELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERV--AGFSNA 304

Query: 335 ------------ALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKYCYN 379
                        +VDSG + +    + YA V    V        +R+  + + +  CY+
Sbjct: 305 SLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYD 364

Query: 380 ASSE---EMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 434
                    ++VP + L F+      +   N++      +  T FCL + + D    ++G
Sbjct: 365 VHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLG 424

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKC 459
                G  +VFD E  ++ ++ + C
Sbjct: 425 NVQQQGFGVVFDVERGRIGFTPNGC 449


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 154/367 (41%), Gaps = 49/367 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  + +G+P     + +D+GS+++WV C+ C QC       Y   D     +DP++SSS
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLASFSKHAPQ 225
              VSC   +C++ S             DYS    + + + G L  + L L         
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGG------- 232

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
           ++VQ  V IGCG + +G ++  A   G++GLG G +S+   L   G     FS C     
Sbjct: 233 TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVFSYCLASRG 286

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSC---------LTQSGFQA 335
           +G       G      T  +P G +  + Y+VG+    +G            LT+ G   
Sbjct: 287 AGGA-----GSLVLGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGG 341

Query: 336 LV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
           +V D+G + T LP E YA +   FD  + +   S   +    CY+ S    ++VP +   
Sbjct: 342 VVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFY 401

Query: 395 FSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
           F +     +  RN +       G  VFCL    +     I+G     G +I  D  N  +
Sbjct: 402 FDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYV 457

Query: 453 AWSHSKC 459
            +  + C
Sbjct: 458 GFGPNTC 464


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/380 (26%), Positives = 165/380 (43%), Gaps = 56/380 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP+   L+ LD GS+++W     +QCAP    Y    D++   +DP  SSS 
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVW-----LQCAPCRRCY----DQSGPVFDPRRSSSY 190

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             V C+ PLC+   S  C   +  C Y   Y  + + ++G    + L  A  ++ A    
Sbjct: 191 GAVDCAAPLCRRLDSGGCDLRRRACLYQVAYG-DGSVTAGDFATETLTFAGGARVA---- 245

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND-- 285
               V +GCG    G ++  A    ++GLG G +S P+ +++      SFS C  +    
Sbjct: 246 ---RVALGCGHDNEGLFVAAAG---LLGLGRGSLSFPTQISR--RYGKSFSYCLVDRTSS 297

Query: 286 ----------SGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLT 329
                     S +V F   GP +  + SF P+         Y+V +    +G +    + 
Sbjct: 298 SSSGAASRSRSSTVTF---GPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVA 354

Query: 330 QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYN 379
           +S  +          +VDSG S T L    Y+ +   F    +  R+S  G S +  CY+
Sbjct: 355 ESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYD 414

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 439
               +++KVP + + F+      +    +  P +   T FC     TDG   IIG     
Sbjct: 415 LGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGGVSIIGNIQQQ 473

Query: 440 GHRIVFDRENLKLAWSHSKC 459
           G R+VFD +  ++ ++   C
Sbjct: 474 GFRVVFDGDGQRVGFAPKGC 493


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 157/378 (41%), Gaps = 56/378 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSE-YDPSSSS 167
           ++  + +GTP    L+ +D GS+++W+ C+ C+ C            R LS  YDP  SS
Sbjct: 99  YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCY-----------RQLSPLYDPRGSS 147

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           +     CS P C++  +C      C Y   Y  + +S+SG L  D L  ++       +S
Sbjct: 148 TYAQTPCSPPQCRNPQTCDGTTGGCGYRIVYG-DASSTSGNLATDRLVFSN------DTS 200

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
           V  +V +GCG    G +   A   G++G+  G+ S  + +A +      F+ C  D   S
Sbjct: 201 V-GNVTLGCGHDNEGLFGSAA---GLLGVARGNNSFATQVADS--YGRYFAYCLGDRTRS 254

Query: 287 GS----VFFGDQGPATQQSTSFLPIG---EKYDAYFVGVESYCIGNSCLTQSGFQ----- 334
           GS    + FG   P    S  F P+     +   Y+V +  + +G   +T  GF      
Sbjct: 255 GSSSSYLVFGRTAPEPPSSV-FTPLRSNPRRPSLYYVDMVGFSVGGEPVT--GFSNASLS 311

Query: 335 ---------ALVDSGASFTFLPTEIYAEVVVKFDKL---VSSKRISLQGNSWKYCYNASS 382
                     +VDSG S T    + Y  +   FD     V  +++    + +  CY+   
Sbjct: 312 LDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRG 371

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGH 441
             +   P + L F+      +    +  PE  G +  F L     DG   +IG       
Sbjct: 372 VAVADAPGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDG-LSVIGNVLQQRF 430

Query: 442 RIVFDRENLKLAWSHSKC 459
           R+VFD EN ++ +  + C
Sbjct: 431 RVVFDVENERVGFEPNGC 448


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 123/488 (25%), Positives = 212/488 (43%), Gaps = 83/488 (17%)

Query: 2   VNLVAICMLFGCILLDGSDAVS--FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSV 59
           V+ + +   F C  +  S AVS  FS +L+HR  D +K  +             P +N  
Sbjct: 4   VSFLTLSFFFLCFSISFSQAVSNGFSIELIHR--DSSKSPFYK-----------PTQNKY 50

Query: 60  EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
           +++     +   R   RV     N+S++N L    E +   + G+  Y + Y+   +GTP
Sbjct: 51  QHV----VDAVHRSINRV-----NHSNKNSLASTPESTVISYEGD--YIMSYS---VGTP 96

Query: 120 NVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
            +     +D GS+++W+ C+ C QC           ++   +++PS SSS KN+SCS  L
Sbjct: 97  PIKSYGIVDTGSDIVWLQCEPCEQC----------YNQTTPKFNPSKSSSYKNISCSSKL 146

Query: 179 CKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
           C+S   +SC   K+ C Y  +Y  + + S G L  + L L S +   P S  ++  +IGC
Sbjct: 147 CQSVRDTSCNDKKN-CEYSINYGNQ-SHSQGDLSLETLTLES-TTGRPVSFPKT--VIGC 201

Query: 237 GRKQTGSY--------LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
           G    GS+          G  P  ++   LG    PS+  K        SI       GS
Sbjct: 202 GTNNIGSFKRVSSGVVGLGGGPASLI-TQLG----PSIGGKFSYCLVRMSITLKNMSMGS 256

Query: 289 --VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGF-------QALV 337
             + FGD    +  +    PI +K  +  Y++ +E++ +G+  +  +G          ++
Sbjct: 257 SKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVEEGNIII 316

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DS    TF+P+++Y ++      LV+ +R+      +  CYN SS+E    P M   F  
Sbjct: 317 DSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSDEEYDFPYMTAHFKG 376

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGHRIVFDRENLKLA 453
               +   + F     +   V C     ++G   I G    Q+FM+G    +D +   ++
Sbjct: 377 ADILLYATNTFVEVARD---VLCFAFAPSNGG-AIFGSFSQQDFMVG----YDLQQKTVS 428

Query: 454 WSHSKCEE 461
           +    C E
Sbjct: 429 FKSVDCTE 436


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 165/384 (42%), Gaps = 73/384 (19%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           I IGTP +     LD GS+L+W  C   C +C P  A  Y           P+ S++  N
Sbjct: 96  IAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYA----------PARSATYAN 145

Query: 172 VSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           VSC  P+C++     S C      C Y   Y  + TS+ G L  +   L S        +
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTLGS-------DT 197

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
               V  GCG +  GS  + +   G++G+G G +   SL+++ G+ +  FS C   F+  
Sbjct: 198 AVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTR--FSYCFTPFNAT 249

Query: 285 DSGSVFFGDQG--PATQQSTSFLP-----IGEKYDAYFVGVESYCIGNSC---------L 328
            +  +F G      +  ++T F+P        +   Y++ +E   +G++          L
Sbjct: 250 AASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRL 309

Query: 329 TQSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEE 384
           T  G    ++DSG +FT L    +   V     L S  R+ L   +      C+ A+S E
Sbjct: 310 TPMGDGGVIIDSGTTFTALEERAF---VALARALASRVRLPLASGAHLGLSLCFAAASPE 366

Query: 385 MLKVPDMRLIFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 438
            ++VP + L F       + +S+VV        E+    V CL ++S  G   ++G    
Sbjct: 367 AVEVPRLVLHFDGADMELRRESYVV--------EDRSAGVACLGMVSARG-MSVLGSMQQ 417

Query: 439 MGHRIVFDRENLKLAWSHSKCEEV 462
               I++D E   L++  +KC E+
Sbjct: 418 QNTHILYDLERGILSFEPAKCGEL 441


>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 425

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 91/384 (23%), Positives = 163/384 (42%), Gaps = 52/384 (13%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           L+   I+IG P   + + +D GS+L WV C      P +     ++ ++   Y P+    
Sbjct: 61  LYTVSINIGNPPKPYELDIDTGSDLTWVQCD----GPDAPCKGCTMPKD-KLYKPNGK-- 113

Query: 169 SKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
            + V CS P+C +  S       C     PC Y   Y+ +  S+ G LV D +H+ S   
Sbjct: 114 -QVVKCSDPICVATQSTHVLGQICSKQSPPCVYNVQYA-DHASTLGVLVRDYMHIGS--- 168

Query: 222 HAPQSSVQSSVI-IGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
             P SS +  ++  GCG +Q  +G     + P G++GLG G  S+ S L   G I N   
Sbjct: 169 --PSSSTKDPLVAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLG 226

Query: 279 ICFDENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
            C      G +F GD+          P  Q S       EK+  Y  G            
Sbjct: 227 HCLSAEGGGYLFLGDKFVPSSGIVWTPIIQSSL------EKH--YNTGPVDLFFNGKPTP 278

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQGNSWKYC------YNASS 382
             G Q + DSG+S+T+  + +Y  V    +  +  K +S ++  S   C      + + +
Sbjct: 279 AKGLQIIFDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRVKDPSLPICWKGVKPFKSLN 338

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFM 438
           E       + L F+K+++   +    ++     +   CL +++ +    G+  ++G   +
Sbjct: 339 EVNNYFKPLTLSFTKSKNLQFQLPPVAYLIITKYGNVCLGILNGNEAGLGNRNVVGDISL 398

Query: 439 MGHRIVFDRENLKLAWSHSKCEEV 462
               +V+D E  ++ W+ + C+++
Sbjct: 399 QDKVVVYDNEKQQIGWASANCKQI 422


>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
          Length = 383

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 136/309 (44%), Gaps = 35/309 (11%)

Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
            +G+ + + L+Y  ++IG P   + + +D+GS+L W+ C   C  C  +    Y      
Sbjct: 56  LYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY------ 109

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 210
                    + SK V C H LC S       +  C S  + C Y+  Y+ +  SS+G L+
Sbjct: 110 -------RPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLI 161

Query: 211 DDILHLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
           +D     SF+      SV + SV  GCG  Q     D ++P DGV+GLG G VS+ S L 
Sbjct: 162 ND-----SFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLK 216

Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNS 326
           + G+ +N    C      G +FFGD     Q++T + P+      + Y  G  S   G+ 
Sbjct: 217 QRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDR 275

Query: 327 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
            L     + + DSG+SFT+   + Y  +V      +S         S   C+    E   
Sbjct: 276 SLGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFK 334

Query: 387 KVPDMRLIF 395
            V D+R  F
Sbjct: 335 SVLDVRKEF 343


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 92/376 (24%), Positives = 156/376 (41%), Gaps = 47/376 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +Y  ++IG P   + + +D GS+L W+ C   C  C  +    Y               +
Sbjct: 53  YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY-------------RPT 99

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFSK 221
           +++ V C++ LC +  S +   + CP      Y   Y T+  SS G L++D     SFS 
Sbjct: 100 ANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----SFSL 153

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               S+++  +  GCG  Q         AA DG++GLG G VS+ S L + G+ +N    
Sbjct: 154 PMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGH 213

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALV 337
           C   N  G +FFGD    + + T ++P+ ++     Y  G  +       L     + + 
Sbjct: 214 CLSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVF 272

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF-S 396
           DSG+++T+   + Y  VV      +S     +   +   C+    +    V D++  F S
Sbjct: 273 DSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKG-QKAFKSVFDVKNEFKS 331

Query: 397 KNQSF-VVRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIVF 445
              SF   +N     P      V      CL ++  DG      + +IG   M    +++
Sbjct: 332 MFLSFSSAKNAAMEIPPENYLIVTKNGNVCLGIL--DGTAAKLSFNVIGDITMQDQMVIY 389

Query: 446 DRENLKLAWSHSKCEE 461
           D E  +L W+   C  
Sbjct: 390 DNEKSQLGWARGACTR 405


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score = 96.3 bits (238), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 95/377 (25%), Positives = 166/377 (44%), Gaps = 51/377 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + +GTP   F + +D GS+L W+ C  C+ C           +++   +DP++S S +NV
Sbjct: 153 VYLGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FEQSGPIFDPAASISYRNV 202

Query: 173 SCSHPLCK---------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           +C    C+          R   +   DPCPY   Y  +  ++        L L +F+ + 
Sbjct: 203 TCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGD------LALEAFTVNL 256

Query: 224 PQSSVQS--SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
            QS  +    V  GCG +  G +   A    ++GLG G +S  S L +     ++FS C 
Sbjct: 257 TQSGTRRVDGVAFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL-RGVYGGHAFSYCL 312

Query: 282 DENDSGS---VFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL-----T 329
            E+ S +   + FG             T+F P  +    Y++ ++S  +G   +     T
Sbjct: 313 VEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDT 372

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
            S    ++DSG + ++ P   Y  +   F D++  S  + L       CYN S  E ++V
Sbjct: 373 LSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEV 432

Query: 389 PDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVF 445
           P++ L+F+   +  F   N+     E EG  + CL V+ T      IIG        +++
Sbjct: 433 PELSLVFADGAAWEFPAENYFIRL-EPEG--IMCLAVLGTPRSGMSIIGNYQQQNFHVLY 489

Query: 446 DRENLKLAWSHSKCEEV 462
           D E+ +L ++  +C +V
Sbjct: 490 DLEHNRLGFAPRRCADV 506


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score = 96.3 bits (238), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 165/384 (42%), Gaps = 73/384 (19%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           I IGTP +     LD GS+L+W  C   C +C P  A  Y           P+ S++  N
Sbjct: 96  IAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYA----------PARSATYAN 145

Query: 172 VSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           VSC  P+C++     S C      C Y   Y  + TS+ G L  +   L S        +
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTLGS-------DT 197

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
               V  GCG +  GS  + +   G++G+G G +   SL+++ G+ +  FS C   F+  
Sbjct: 198 AVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTR--FSYCFTPFNAT 249

Query: 285 DSGSVFFGDQG--PATQQSTSFLP-----IGEKYDAYFVGVESYCIGNSC---------L 328
            +  +F G      +  ++T F+P        +   Y++ +E   +G++          L
Sbjct: 250 AASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRL 309

Query: 329 TQSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEE 384
           T  G    ++DSG +FT L    +   V     L S  R+ L   +      C+ A+S E
Sbjct: 310 TPMGDGGVIIDSGTTFTALEESAF---VALARALASRVRLPLASGAHLGLSLCFAAASPE 366

Query: 385 MLKVPDMRLIFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 438
            ++VP + L F       + +S+VV        E+    V CL ++S  G   ++G    
Sbjct: 367 AVEVPRLVLHFDGADMELRRESYVV--------EDRSAGVACLGMVSARG-MSVLGSMQQ 417

Query: 439 MGHRIVFDRENLKLAWSHSKCEEV 462
               I++D E   L++  +KC E+
Sbjct: 418 QNTHILYDLERGILSFEPAKCGEL 441


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score = 96.3 bits (238), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 95/340 (27%), Positives = 149/340 (43%), Gaps = 52/340 (15%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           T + IGTP   F + +D+GS + +VPC  C QC           +     + P  SSS  
Sbjct: 91  TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCG----------NHQDPRFQPDLSSSYS 140

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            V C+        +C S K  C Y   Y+ E +SSSG L +DI+     S+   Q +V  
Sbjct: 141 PVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRESELKAQRAV-- 192

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
               GC   +TG      A DG+MGLG G +S+   L + G+I +SFS+C+   D G   
Sbjct: 193 ---FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGA 248

Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
            V  G   P+    +   P+   Y  Y + ++   +    L        S    ++DSG 
Sbjct: 249 MVLGGVPTPSDMVFSRSDPLRSPY--YNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGT 306

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKY---CYNASSEEMLKV----PDM 391
           ++ +LP + +    + F   V+SK  SL   +G    Y   C+  +   + K+    PD+
Sbjct: 307 TYAYLPEQAF----MAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDV 362

Query: 392 RLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD 429
            ++F   Q  S    N++F   + +G   +CL V     D
Sbjct: 363 DMVFGNGQKLSLTPENYLFRHSKVDG--AYCLGVFQNGKD 400


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 162/375 (43%), Gaps = 60/375 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP    +   D GS+++W      QC P +  Y     ++L  ++PS S++ + VS
Sbjct: 89  LSVGTPPFPIIAVADTGSDIIWT-----QCVPCTNCY----QQDLPMFNPSKSTTYRKVS 139

Query: 174 CSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSS 227
           CS P+C      +SC S K  C Y   Y  +++ S G    D L + S S      P+++
Sbjct: 140 CSSPVCSFTGEDNSC-SFKPDCTYSISYG-DNSHSQGDFAVDTLTMGSTSGRVVAFPRTA 197

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----D 282
                 IGCG    GS+   A   G++GLGLG  S+   +  A  +   FS C      D
Sbjct: 198 ------IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGND 247

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF--------VGVES--YCIGNSCLTQ 330
           +  S  + FG     +       PI   +K+ +++        VG  +  Y   NS L  
Sbjct: 248 DGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGG 307

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
                ++DSG + T LP ++Y          ++ +R        +YC+  ++++  KVP 
Sbjct: 308 KA-NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDD-YKVPF 365

Query: 391 MRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGD---YGIIGQ-NFMMGHRIV 444
           + + F   N      N +    +N    V CL    + D D   YG I Q NF++G    
Sbjct: 366 IAMHFEGANLRLQRENVLIRVSDN----VICLAFAGAQDNDISIYGNIAQINFLVG---- 417

Query: 445 FDRENLKLAWSHSKC 459
           +D  N+ L++    C
Sbjct: 418 YDVTNMSLSFKPMNC 432


>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 413

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 96/374 (25%), Positives = 151/374 (40%), Gaps = 44/374 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +Y  ++IG P   + + +D GS+L W+ C   C  C  +    Y               +
Sbjct: 52  YYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLY-------------KPT 98

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPC--PYIADYS---TEDTSSSGYLVDDILHLASFSKH 222
            +K V C+  +C +  S +S    C  P   DY    T+  SS G LV D   L   +  
Sbjct: 99  KNKLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRN-- 156

Query: 223 APQSSVQSSVIIGCGRKQT--GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
              SSV+ S   GCG  Q    + +  A  DG++GLG G VS+ S L   G+ +N    C
Sbjct: 157 --SSSVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHC 214

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVD 338
              N  G +FFGD    T ++T ++P+        Y  G  +       L     + + D
Sbjct: 215 LSTNGGGFLFFGDNVVPTSRAT-WVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVFD 273

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSK 397
           SG+++T+   + Y   V      +S     +   S   C+    +++ K V D++  F  
Sbjct: 274 SGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWKG--QKVFKSVSDVKNDFKS 331

Query: 398 NQSFVVRNHIFSFPENEGFTVF-----CLTVMSTDGD-----YGIIGQNFMMGHRIVFDR 447
                V+N +   P      V      CL ++  DG      + IIG   M    I++D 
Sbjct: 332 LFLSFVKNSVLEIPPENYLIVTKNGNACLGIL--DGSAAKLTFNIIGDITMQDQLIIYDN 389

Query: 448 ENLKLAWSHSKCEE 461
           E  +L W    C  
Sbjct: 390 ERGQLGWIRGSCSR 403


>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 101/410 (24%), Positives = 162/410 (39%), Gaps = 75/410 (18%)

Query: 92  FPSEGSQ---THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQC 143
           FP EG+     HF         Y  ++IG P   + + +D GSNL W+ C      C  C
Sbjct: 26  FPLEGNVYPVGHF---------YATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGC 76

Query: 144 APLSAS-YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCKSL-----KDP--CPY 194
            P     YYT  D  L             V C  PLC + R     +      DP  C Y
Sbjct: 77  HPRPPHPYYTPADGKLK------------VVCGSPLCVAVRRDVPGIPECSRNDPHRCHY 124

Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGV 253
              Y T    S G L  DI+ +    K          +  GCG KQ        +P +G+
Sbjct: 125 EIQYVT--GKSEGDLATDIISVNGRDK--------KRIAFGCGYKQEEPPDSPPSPVNGI 174

Query: 254 MGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYD 312
           +GLG+G     + L    +I +N    C      G ++ GD  P T+  T + P+ E   
Sbjct: 175 LGLGMGKAGFAAQLKGLKMIKENVIGHCLSSKGKGVLYVGDFNPPTRGVT-WAPMRESLF 233

Query: 313 AYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQ 370
            Y  G+    I    +     F+A+ DSG+++T +P +IY E+V K     S   +  ++
Sbjct: 234 YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEVK 293

Query: 371 GNSWKYCYNASS--------EEMLKVPDMRLIFSK---NQSFVVRNHIFSFPENEGFTVF 419
           G +   C+            +   K   +++  ++   N     +N++F   + E     
Sbjct: 294 GRALPLCWKGKKPFGSVNDVKNQFKALSLKITHARGTNNLDIPPQNYLFVKEDGE----T 349

Query: 420 CLTVMSTDGD-------YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           CL ++    D       + +IG   M    +++D E  +L W  ++C+ V
Sbjct: 350 CLAILDASLDPVLKELNFILIGAVTMQDLFVIYDNEKKQLGWVRAQCDRV 399


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 90/366 (24%), Positives = 151/366 (41%), Gaps = 39/366 (10%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  + +G+P   + + +D GS+L W     +QC P     +   D     +DPS+S + 
Sbjct: 13  YYVKVGLGSPARYYSMIVDTGSSLSW-----LQCKPCVVYCHVQAD---PLFDPSASKTY 64

Query: 170 KNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           K++SC+   C S          C++  + C Y A Y  + + S GYL  D+L LA     
Sbjct: 65  KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYG-DSSYSMGYLSQDLLTLAP---- 119

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
              S      + GCG+   G +   A   G++GLG   +S+   ++       +FS C  
Sbjct: 120 ---SQTLPGFVYGCGQDSEGLFGRAA---GILGLGRNKLSMLGQVSSK--FGYAFSYCLP 171

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCLTQSGFQ----A 335
               G      +      +  F P+         YF+ + +  +G   L  +  Q     
Sbjct: 172 TRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT 231

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLI 394
           ++DSG   T LP  +Y      F K++SSK     G S    C+  + ++M  VP++RLI
Sbjct: 232 IIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLI 291

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
           F       +R        +EG T  CL     +G   IIG +     ++  D    ++ +
Sbjct: 292 FQGGADLNLRPVNVLLQVDEGLT--CLAFAGNNG-VAIIGNHQQQTFKVAHDISTARIGF 348

Query: 455 SHSKCE 460
           +   C 
Sbjct: 349 ATGGCN 354


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 162/375 (43%), Gaps = 60/375 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP    +   D GS+++W      QC P +  Y     ++L  ++PS S++ + VS
Sbjct: 89  LSVGTPPFPIIAVADTGSDIIWT-----QCEPCTNCY----QQDLPMFNPSKSTTYRKVS 139

Query: 174 CSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSS 227
           CS P+C      +SC S K  C Y   Y  +++ S G    D L + S S      P+++
Sbjct: 140 CSSPVCSFTGEDNSC-SFKPDCTYSISYG-DNSHSQGDFAVDTLTMGSTSGRVVAFPRTA 197

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----D 282
                 IGCG    GS+   A   G++GLGLG  S+   +  A  +   FS C      D
Sbjct: 198 ------IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGND 247

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF--------VGVES--YCIGNSCLTQ 330
           +  S  + FG     +       PI   +K+ +++        VG  +  Y   NS L  
Sbjct: 248 DGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGG 307

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
                ++DSG + T LP ++Y          ++ +R        +YC+  ++++  KVP 
Sbjct: 308 KA-NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDD-YKVPF 365

Query: 391 MRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGD---YGIIGQ-NFMMGHRIV 444
           + + F   N      N +    +N    V CL    + D D   YG I Q NF++G    
Sbjct: 366 IAMHFEGANLRLQRENVLIRVSDN----VICLAFAGAQDNDISIYGNIAQINFLVG---- 417

Query: 445 FDRENLKLAWSHSKC 459
           +D  N+ L++    C
Sbjct: 418 YDVTNMSLSFKPMNC 432


>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 406

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 148/333 (44%), Gaps = 29/333 (8%)

Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS 206
           + L  +L+ YDP+ S +S  V C    C        S CK     CPY   Y  + +++S
Sbjct: 40  SGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITYG-DGSTTS 97

Query: 207 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVP 264
           G  V+D L     S +       SSVI GCG KQ+GS    +  A DG++G G  + SV 
Sbjct: 98  GSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVL 157

Query: 265 SLLAKAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
           S LA +G ++  FS C D +  G +F  G        +T  +P    Y+     ++    
Sbjct: 158 SQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMD--VD 215

Query: 324 GNSCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
           G   L       + SG   ++DSG +  +LP  IY +++ K        ++ +  + +  
Sbjct: 216 GEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFT- 274

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DY 430
           C++ S +     P ++  F +  S  V  H + F   E   ++C+     +  + +G D 
Sbjct: 275 CFHYSDKLDEGFPVVKFHF-EGLSLTVHPHDYLFLYKE--DIYCIGWQKSSTQTKEGRDL 331

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
            +IG   +    +V+D EN+ + W++  C   I
Sbjct: 332 ILIGDLVLSNKLVVYDLENMVIGWTNFNCSSSI 364


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 116/475 (24%), Positives = 199/475 (41%), Gaps = 80/475 (16%)

Query: 13  CILLDGSDAVS--FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDW 70
           C +   S A+S  FS +L+HR  D  K  +             P +N  ++      +  
Sbjct: 15  CFIASFSHALSNGFSVELIHR--DSPKSPYYK-----------PTENKYQHF----VDAA 57

Query: 71  KRQKTRVK--LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALD 128
           +R   R     + ++ S+    + P  G          Y + Y+   +GTP        D
Sbjct: 58  RRSINRANHFFKDSDTSTPESTVIPDRGG---------YLMTYS---VGTPPTKIYGIAD 105

Query: 129 AGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCK 186
            GS+++W+ C+ C QC           ++    ++PS SSS KN+ C   LC S R +  
Sbjct: 106 TGSDIVWLQCEPCEQC----------YNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSC 155

Query: 187 SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD 246
           S ++ C Y   Y  + + S G L  D L L S S  +P S  ++  +IGCG    G++  
Sbjct: 156 SDQNSCQYKISYG-DSSHSQGDLSVDTLSLESTSG-SPVSFPKT--VIGCGTDNAGTF-- 209

Query: 247 GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDSGSVFFGDQGPATQQ 300
           G A  G++GLG G VS+ + L  +  I   FS C       + N S  + FGD    +  
Sbjct: 210 GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGD 267

Query: 301 STSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG--------FQALVDSGASFTFLPTEIY 351
                P+ +K    YF+ ++++ +GN  +   G           ++DSG + T +P+++Y
Sbjct: 268 GVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVY 327

Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFS-- 409
             +      LV   R+      +  CY+  S E     D  +I +  +   +  H  S  
Sbjct: 328 TNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEY----DFPIITAHFKGADIELHSISTF 383

Query: 410 FPENEGFTVFCLTVMSTDGD-YGIIG-QNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            P  +G   F        G  +G +  QN ++G    +D +   +++  + C +V
Sbjct: 384 VPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVG----YDLQQKTVSFKPTDCTKV 434


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 95/373 (25%), Positives = 166/373 (44%), Gaps = 43/373 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP     + LD GS+L+W      QCAP    +    D++L   DP++SS+   + 
Sbjct: 88  LAVGTPRRPVALTLDTGSDLVWT-----QCAPCRDCF----DQDLPVLDPAASSTYAALP 138

Query: 174 CSHPLCKSR--SSC--KSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSV 228
           C    C++   +SC  ++L +    I  Y   D S + G +  D       S  + +S  
Sbjct: 139 CGAARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGD-SGGSGESLH 197

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD---END 285
              +  GCG    G +       G+ G G G  S+PS L        SFS CF    E+ 
Sbjct: 198 TRRLTFGCGHLNKGVFQSNET--GIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFESK 250

Query: 286 SGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 335
           S  V  G    A          ++T  L    +   YF+ ++   +G + L   ++ F++
Sbjct: 251 SSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRS 310

Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK---VPDM 391
            ++DSGAS T LP E+Y  V  +F   V      ++G++   C+      + +   VP +
Sbjct: 311 TIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSL 370

Query: 392 RLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
            L        + R N++F   E+ G  V C+ + +  G+  +IG        +V+D EN 
Sbjct: 371 TLHLEGADWELPRSNYVF---EDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLEND 427

Query: 451 KLAWSHSKCEEVI 463
           +L+++ ++C+ ++
Sbjct: 428 RLSFAPARCDRLV 440


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 95.1 bits (235), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 167/366 (45%), Gaps = 52/366 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP V +L   D GS+L W  C  C++C       Y  L R +  ++P  S+S  +V
Sbjct: 96  VSIGTPPVDYLGIADTGSDLTWAQCLPCLKC-------YQQL-RPI--FNPLKSTSFSHV 145

Query: 173 SCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
            C+   C +       ++  C Y   Y  + T S G L  + + + S       SSV+S 
Sbjct: 146 PCNTQTCHAVDDGHCGVQGVCDYSYTYG-DRTYSKGDLGFEKITIGS-------SSVKS- 196

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD---ENDSGS 288
            +IGCG   +G +       GV+GLG G +S+ S +++   I   FS C      + +G 
Sbjct: 197 -VIGCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK 252

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS---CLTQSGFQALVDSGASF 343
           + FG+    +       P+  K     Y++ +E+  IGN       + G   ++DSG + 
Sbjct: 253 INFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQG-NVIIDSGTTL 311

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN--ASSEEMLKVPDMRLIFS--KNQ 399
           T LP E+Y  VV    K+V +KR+     S   C++   ++   L +P +   FS   N 
Sbjct: 312 TILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGANV 371

Query: 400 SFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQ----NFMMGHRIVFDRENLKLA 453
           + +  N      +N    V CLT+   S   ++GIIG     NF++G    +D E  +L+
Sbjct: 372 NLLPINTFRKVADN----VNCLTLKAASPTTEFGIIGNLAQANFLIG----YDLEAKRLS 423

Query: 454 WSHSKC 459
           +  + C
Sbjct: 424 FKPTVC 429


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 95.1 bits (235), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 160/376 (42%), Gaps = 40/376 (10%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
           N +   H   I IGTP +     +D GS+L+W     IQCAP    Y     +    +DP
Sbjct: 62  NAYIGQHLMEIYIGTPPIKITGLVDTGSDLIW-----IQCAPCLGCY----KQIKPMFDP 112

Query: 164 SSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
             SS+  N+SC  PLC K  +   S +  C Y   Y  +++ + G L  D    A+F+ +
Sbjct: 113 LKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYG-DNSLTKGVLAQDT---ATFTSN 168

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSFSIC 280
             +    S  + GCG   TG + D     G++GLG G     SL+++ G +     FS C
Sbjct: 169 TGKPVSLSRFLFGCGHNNTGGFNDHEM--GLIGLGGGPT---SLISQIGPLFGGKKFSQC 223

Query: 281 F-----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGV------ESYCIGNSC 327
                 D   S  + FG             P+   EK  +YFV +      ++Y   NS 
Sbjct: 224 LVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNST 283

Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEML 386
           + ++    LVDSG     LP ++Y +V  +    V+ K I+   +   + CY   +   L
Sbjct: 284 IGKA--NMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTN--L 339

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGHRIVF 445
           K P +   F      +     F  P  +   +FCL + + T+ D G+ G      + I F
Sbjct: 340 KGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGF 399

Query: 446 DRENLKLAWSHSKCEE 461
           D +   +++  + C +
Sbjct: 400 DLDRQVVSFKPTDCTK 415


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 95.1 bits (235), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 166/377 (44%), Gaps = 40/377 (10%)

Query: 110 HYT-WIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYT-SLDRNLSEYDPSSS 166
           +YT  + IGTP   F + +D GS + +VPC  C  C    AS+ T  L      + P +S
Sbjct: 39  YYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENS 98

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           SS + + C    C +   C S    C Y   Y+ E ++S G L  D+L         P S
Sbjct: 99  SSYQKIGCRSSDCIT-GLCDSNSHQCKYERMYA-EMSTSKGVLGKDLLDFG------PAS 150

Query: 227 SVQSSVI-IGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
            +QS ++  GC   ++G  YL  A  DG+MGLG G +S+   L   G I++SFS+C+   
Sbjct: 151 RLQSQLLSFGCETAESGDLYLQVA--DGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGM 208

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIG-NSCLTQSGFQALV 337
           DE     V      P+        P    Y   +   + V+   +  +S +    F  ++
Sbjct: 209 DEGGGSMVLGAIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLKLDSNVFNGKFGTIL 268

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNAS---SEEMLK- 387
           DSG ++ +LP   +      F   V ++  SLQ       N    CY  +   ++E+ K 
Sbjct: 269 DSGTTYAYLPDRAFE----AFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKH 324

Query: 388 VPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            P +  +F++NQ  S    N++F   +  G   +CL          ++G   +    + +
Sbjct: 325 FPLVDFVFAENQKVSLAPENYLFKHTKVPG--AYCLGFFKNQDATTLLGGIIVRNMLVTY 382

Query: 446 DRENLKLAWSHSKCEEV 462
           DR N ++ +  + C E+
Sbjct: 383 DRYNHQIGFLKTNCTEL 399


>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 320

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 72/272 (26%), Positives = 128/272 (47%), Gaps = 17/272 (6%)

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLG 259
           + +S++GYLV D++HL   + +    S   ++I GCG KQ+G   +  AA DG+MG G  
Sbjct: 4   DGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQS 63

Query: 260 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 319
           + S  S LA  G ++ SF+ C D N+ G +F    G          P+  K   Y V + 
Sbjct: 64  NSSFISQLASQGKVKRSFAHCLDNNNGGGIF--AIGEVVSPKVKTTPMLSKSAHYSVNLN 121

Query: 320 SYCIGNSC--LTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
           +  +GNS   L+ + F +      ++DSG +  +LP  +Y  ++ +   L S   ++L  
Sbjct: 122 AIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEI--LASHPELTLHT 179

Query: 372 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG- 428
               +     ++++ + P +   F K+ S  V  R ++F   E+     +    + T G 
Sbjct: 180 VQESFTCFHYTDKLDRFPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGG 239

Query: 429 -DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               I+G   +    +V+D EN  + W++  C
Sbjct: 240 ASLTILGDMALSNKLVVYDIENQVIGWTNHNC 271


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 157/371 (42%), Gaps = 44/371 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  + +GTP   + + LD GS+L W     +QC P +   +   D     YDPS S + 
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSW-----LQCQPCAVYCHAQAD---PLYDPSVSKTY 176

Query: 170 KNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTS-SSGYLVDDILHLASFS 220
           K +SC+   C SR    +L DP        C Y A Y   DTS S GYL  D+L L S S
Sbjct: 177 KKLSCASVEC-SRLKAATLNDPLCETDSNACLYTASYG--DTSFSIGYLSQDLLTLTS-S 232

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSI 279
           +  PQ         GCG+   G +   A   G++GL    +S+ + L+ K G   ++FS 
Sbjct: 233 QTLPQ------FTYGCGQDNQGLFGRAA---GIIGLARDKLSMLAQLSTKYG---HAFSY 280

Query: 280 CFDENDSGSVFFGDQ-----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG-- 332
           C    +SGS   G        P + + T  L   +    YF+ + +  +    L  +   
Sbjct: 281 CLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAM 340

Query: 333 --FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVP 389
                L+DSG   T LP  +YA +   F K++S+K       S    C+  S + +  VP
Sbjct: 341 YRVPTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVP 400

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
           ++++IF       +R        ++G T       S      IIG      + I +D   
Sbjct: 401 EIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVST 460

Query: 450 LKLAWSHSKCE 460
            ++ ++   C 
Sbjct: 461 SRIGFAPGSCH 471


>gi|15010764|gb|AAK74041.1| AT3g51330/F24M12_370 [Arabidopsis thaliana]
 gi|23505835|gb|AAN28777.1| At3g51330/F24M12_370 [Arabidopsis thaliana]
          Length = 260

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 88/180 (48%), Gaps = 9/180 (5%)

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ--ALVDSGASFT 344
           G + FGD+G   Q  T  LP  E    Y V V    +G   +   G Q  AL D+G SFT
Sbjct: 11  GRISFGDKGYTDQMETPLLPT-EPSPTYAVSVTEVSVGGDAV---GVQLLALFDTGTSFT 66

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLIFSKNQSFV 402
            L    Y  +   FD  V+ KR  +     +++CY+ S ++  +  P + + F       
Sbjct: 67  HLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMF 126

Query: 403 VRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           +RN +F     +   ++CL ++ S D    IIGQNFM G+RIVFDRE + L W  S C E
Sbjct: 127 LRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRERMILGWKRSDCFE 186


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 92/373 (24%), Positives = 156/373 (41%), Gaps = 44/373 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP     + +D GS++ W     +QCAP +  Y     +  + ++PSSSSS 
Sbjct: 16  YFAVVGVGTPRRDMYLVVDTGSDITW-----LQCAPCTNCY----KQKDALFNPSSSSSF 66

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           K + CS  LC +      L + C Y ADY     +    + D+++   +F    P   V 
Sbjct: 67  KVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAF---GPGQVVL 123

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
           +++ +GCG    G++   A   G++GLG G +S P+ L  +   +N FS C      D N
Sbjct: 124 TNIPLGCGHDNEGTFGTAA---GILGLGRGPLSFPNNLDAS--TRNIFSYCLPDRESDPN 178

Query: 285 DSGSVFFGDQG-PATQQ-STSFLPIGEKYDA---YFVGVESYCIGNSCLTQ---SGFQ-- 334
              ++ FGD   P T   S  F+P          Y+V +    +G + LT    S FQ  
Sbjct: 179 HKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLD 238

Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
                  + DSG + T L    Y  V   F         +     +  CY+ +    + V
Sbjct: 239 SHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGMNSISV 298

Query: 389 PDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           P +   F  +    +   N+I     N    +FC    ++ G   +IG       R+++D
Sbjct: 299 PTVTFHFQGDVDMRLPPSNYIVPVSNNN---IFCFAFAASMGP-SVIGNVQQQSFRVIYD 354

Query: 447 RENLKLAWSHSKC 459
             + ++     +C
Sbjct: 355 NVHKQIGLLPDQC 367


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 173/378 (45%), Gaps = 55/378 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y + Y+   +GTP  +    +D GS+++W+ C+ C QC            +    ++PS 
Sbjct: 87  YLMTYS---VGTPPFNVYGVVDTGSDIVWLQCKPCEQC----------YKQTTPIFNPSK 133

Query: 166 SSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA- 223
           SSS KN+ CS  LC+S R +  + ++ C Y  ++S + + S G L  + L L S + H+ 
Sbjct: 134 SSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFS-DQSYSQGELSVETLTLDSTTGHSV 192

Query: 224 --PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
             P++      +IGCG    G +       G++GLG+G VS+ + L  +  I   FS C 
Sbjct: 193 SFPKT------VIGCGHNNRGMF--QGETSGIVGLGIGPVSLTTQLKSS--IGGKFSYCL 242

Query: 282 -----DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCL------ 328
                D N +  + FGD    +       P  +K     Y++ +E++ +GN  +      
Sbjct: 243 LPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLD 302

Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
            ++ G   ++DSG + T LP+ +Y  +     +LV   R+         CY+ +S++   
Sbjct: 303 DSEEG-NIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQY-- 359

Query: 388 VPDMRLIFSKNQSFVVR-NHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHRIV 444
             D  +I +  +   ++ N I +F    +G      T   T   +G + Q N ++G    
Sbjct: 360 --DFPIITAHFKGADIKLNPISTFAHVADGVVCLAFTSSQTGPIFGNLAQLNLLVG---- 413

Query: 445 FDRENLKLAWSHSKCEEV 462
           +D +   +++  S C +V
Sbjct: 414 YDLQQNIVSFKPSDCIKV 431


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 103/377 (27%), Positives = 160/377 (42%), Gaps = 54/377 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP   + + LD GS+L W+ C  CI C   S  YY          DP  SSS +N++C
Sbjct: 198 IGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYY----------DPKESSSFENITC 247

Query: 175 SHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSS 227
             P CK  SS      CK     CPY   Y     ++  + ++   ++L + +  + Q  
Sbjct: 248 HDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
           V++ V+ GCG    G +   A    ++GLG G +S  S L    +  +SFS C  D N  
Sbjct: 308 VEN-VMFGCGHWNRGLFHGAAG---LLGLGRGPLSFASQLQ--SIYGHSFSYCLVDRNSD 361

Query: 287 GSV----FFGDQGPATQQS----TSFLPIGEKYDA---YFVGVESYCIGNSCLT------ 329
            SV     FG+            TSF+  GE+      Y+VG++S  +    L       
Sbjct: 362 TSVSSKLIFGEDKELLSHPNLNFTSFVG-GEENSVDTFYYVGIKSIMVDGEVLKIPEETW 420

Query: 330 ----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
               + G   ++DSG + T+     Y  +   F K +    +       K CYN S  E 
Sbjct: 421 HLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYNVSGIEK 480

Query: 386 LKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHR 442
           +++PD  ++FS      F V N+      +    + CL ++ T      IIG        
Sbjct: 481 MELPDFGILFSDGAMWDFPVENYFIQIEPD----LVCLAILGTPKSALSIIGNYQQQNFH 536

Query: 443 IVFDRENLKLAWSHSKC 459
           I++D +  +L ++  KC
Sbjct: 537 ILYDMKKSRLGYAPMKC 553


>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 395

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 98/390 (25%), Positives = 159/390 (40%), Gaps = 54/390 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +YT I+IG P   + + +D GS+  W+ C   C  C       Y               +
Sbjct: 16  YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVY-------------KPT 62

Query: 168 SSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
             K V    PLC+    +++ C++ K  C Y   Y+ + +SS G L  D + L +    A
Sbjct: 63  EGKIVHPRDPLCEELQGNQNYCETCKQ-CDYEITYA-DRSSSKGVLARDNMQLTT----A 116

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
                    + GC   Q G  LD   + DG++GL  G +S+ + LA +G+I N F  C  
Sbjct: 117 DGEMKNVDFVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMA 176

Query: 282 -DENDSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLTQSG-----FQ 334
            D +  G +F GD     +   +++PI     + Y   V     G   L   G      Q
Sbjct: 177 TDPSSGGYMFLGDDY-VPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQ 235

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
            + DSG+S+T+ P EIY  ++   +             +  +C   +   +  V D+  +
Sbjct: 236 VIFDSGSSYTYFPHEIYTNLIALLEDASPGFVRDESDQTLPFCMKPNV-PVRSVGDVEQL 294

Query: 395 FS------KNQSFVVRNHIFSFPENEGFTV----FCLTVMSTDG-DYG-----IIGQNFM 438
           F+      + + FV+       PEN          CL V+  DG + G     IIG   +
Sbjct: 295 FNPLILQLRKRWFVIPTTFAISPENYLIISDKGNVCLGVL--DGTEIGHSSTIIIGDASL 352

Query: 439 MGHRIVFDRENLKLAWSHSKCEEVIDKSHV 468
            G  +V+D +  ++ W  S C     +S V
Sbjct: 353 RGKFVVYDNDENRIGWVQSDCTRPQKQSRV 382


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 94/376 (25%), Positives = 154/376 (40%), Gaps = 58/376 (15%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  + +G+P     + +D+GS+++WV C+ C QC       Y   D     +DP++SSS
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLASFSKHAPQ 225
              VSC   +C++ S             DYS    + + + G L  + L L         
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGG------- 232

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
           ++VQ  V IGCG + +G ++  A   G++GLG G +S+   L   G     FS C     
Sbjct: 233 TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVFSYCLASRG 286

Query: 286 SGSVFFGDQGPATQQSTSFLPIG----------EKYDAYFVGVESYCIGNSC-------- 327
           +G       G      T  +P+G          +    Y+VG+    +G           
Sbjct: 287 AGGA-----GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLF 341

Query: 328 -LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
            LT+ G   +V D+G + T LP E YA +   FD  + +   S   +    CY+ S    
Sbjct: 342 QLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYAS 401

Query: 386 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           ++VP +   F +     +  RN +       G  VFCL    +     I+G     G +I
Sbjct: 402 VRVPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGNIQQEGIQI 457

Query: 444 VFDRENLKLAWSHSKC 459
             D  N  + +  + C
Sbjct: 458 TVDSANGYVGFGPNTC 473


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/384 (24%), Positives = 158/384 (41%), Gaps = 48/384 (12%)

Query: 103 GNQFYWLHYTWI-DIGTPNVSFLVALDAGSNLLWVPCQ--CIQCA-PLSASYYTSLDRNL 158
           GN +   HY+ I +IG P  +F + +D GS+L WV C   C  C  PL   Y    +R  
Sbjct: 60  GNVYPTGHYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDKLYKPKNNR-- 117

Query: 159 SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
                        V C+  LC++   ++C    + C Y  +Y+ +  SS G L+ D   L
Sbjct: 118 -------------VPCASSLCQAIQNNNCDIPTEQCDYEVEYA-DLGSSLGVLLSDYFPL 163

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAGLI 273
               +    S +Q  +  GCG  Q   YL   +P    G++GLG G  S+ S L   G+ 
Sbjct: 164 ----RLNNGSLLQPRIAFGCGYDQ--KYLGPHSPPDTAGILGLGRGKASILSQLRTLGIT 217

Query: 274 QNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 332
           QN    CF     G +FFGD   P +  + + +        Y  G      G       G
Sbjct: 218 QNVVGHCFSRVTGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKG 277

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS-------- 382
            Q + DSG+S+T+   ++Y  ++    K +S   +  + +  +   C+  +         
Sbjct: 278 LQLIFDSGSSYTYFNAQVYQSILNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDI 337

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFM 438
           +   K   +  I +KN    +    +     +G    CL +++      G+  +IG  FM
Sbjct: 338 KSFFKPLTINFIKAKNVQLQLAPEDYLIITKDGNV--CLGILNGGEQGLGNLNVIGDIFM 395

Query: 439 MGHRIVFDRENLKLAWSHSKCEEV 462
               +V+D E  ++ W  + C  +
Sbjct: 396 QDRVVVYDNERQQIGWFPTNCNRL 419


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 143/340 (42%), Gaps = 32/340 (9%)

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
           +E L   D  R   R  L          + FP EGS   F       L++T + +G+P  
Sbjct: 47  VEHLRERDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFM----VGLYFTRVKLGSPPK 102

Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC- 179
            + V +D GS++LWV C  C  C   S      L+  L  ++P +SS+S  + CS   C 
Sbjct: 103 EYFVQIDTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCT 157

Query: 180 ----KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
                S + C++  + PC Y   Y  + + +SGY V D ++  +   +   ++  +S++ 
Sbjct: 158 AALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVF 216

Query: 235 GCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
           GC   Q+G       A DG+ G G   +SV S L   G+    FS C   +D+G      
Sbjct: 217 GCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGIL-V 275

Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTF 345
            G   +    + P+      Y + +ES         I +S  T S  Q  +VDSG +  +
Sbjct: 276 LGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAY 335

Query: 346 LPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSE 383
           L    Y   V      VS    SL  +GN    C+  SS 
Sbjct: 336 LADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVTSSR 372


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 94/376 (25%), Positives = 154/376 (40%), Gaps = 58/376 (15%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  + +G+P     + +D+GS+++WV C+ C QC       Y   D     +DP++SSS
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLASFSKHAPQ 225
              VSC   +C++ S             DYS    + + + G L  + L L         
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGG------- 232

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
           ++VQ  V IGCG + +G ++  A   G++GLG G +S+   L   G     FS C     
Sbjct: 233 TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLIGQL--GGAAGGVFSYCLASRG 286

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKY----------DAYFVGVESYCIGNSC-------- 327
           +G       G      T  +P+G  +            Y+VG+    +G           
Sbjct: 287 AGGA-----GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLF 341

Query: 328 -LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
            LT+ G   +V D+G + T LP E YA +   FD  + +   S   +    CY+ S    
Sbjct: 342 QLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYAS 401

Query: 386 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           ++VP +   F +     +  RN +       G  VFCL    +     I+G     G +I
Sbjct: 402 VRVPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGNIQQEGIQI 457

Query: 444 VFDRENLKLAWSHSKC 459
             D  N  + +  + C
Sbjct: 458 TVDSANGYVGFGPNTC 473


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 168/390 (43%), Gaps = 48/390 (12%)

Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
            G  F  L Y   I IGTP  +F V  D GS+L WV  QC+ C P S+ Y     +    
Sbjct: 113 LGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWV--QCLPC-PDSSCY----PQQEPL 165

Query: 161 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
           +DPS SS+  +V CS P C      ++ C +    C Y   Y  E + + G L ++   L
Sbjct: 166 FDPSKSSTYVDVPCSAPECHIGGVQQTRCGATS--CEYSVKYGDE-SETHGSLAEETFTL 222

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
           +  S  AP +   + V+ GC  +    + D G    G++GLG GD S+   L++     N
Sbjct: 223 SPPSPLAPAA---TGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSI---LSQTRRSIN 276

Query: 276 S----FSICFDENDS--GSVFFGDQGPATQQ---STSFLP----IGEKYDAYFVGVESYC 322
           S    FS C     S  G +  G    A QQ   + SF P    I +   AY V +    
Sbjct: 277 SGGGVFSYCLPPRGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVS 336

Query: 323 IGNSC--LTQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKY 376
           +  +   +  S F   A++DSG   T +P   Y  +  +F   + S ++  +G+      
Sbjct: 337 VNGAAVDIPASAFSLGAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDT 396

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEG----FTVFCLTVMSTD-GD 429
           CY+ + ++++  P + L F       V     +   P  +G     T+ CL  + T+   
Sbjct: 397 CYDVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAG 456

Query: 430 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             I+G      + +VFD +  ++ +  + C
Sbjct: 457 LVIVGNMQQRAYNVVFDVDGGRIGFGPNGC 486


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 110/482 (22%), Positives = 197/482 (40%), Gaps = 79/482 (16%)

Query: 3   NLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
           N+V +  LF  + +  +    FS  L+HR S  +     SK+    + D++ +  S    
Sbjct: 11  NVVVVGFLFHLLEVGLASGGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFHRSAS---- 66

Query: 63  ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
                     +  R +  +  +      L PS G          Y ++   + IGTP V 
Sbjct: 67  ----------RVGRFRQSAMTSDGIQSRLVPSAGE---------YIMN---LSIGTPPVP 104

Query: 123 FLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS 181
            +  +D GS+L W  C+ C  C      ++          DP +SS+ ++ SC    C +
Sbjct: 105 VIAIVDTGSDLTWTQCRPCTHCYKQVVPFF----------DPKNSSTYRDSSCGTSFCLA 154

Query: 182 RSSCKSLKD--PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
             + +S ++   C ++  Y+ + + + G L  + L +AS    A +         GC  +
Sbjct: 155 LGNDRSCRNGKKCTFMYSYA-DGSFTGGNLAVETLTVAS---TAGKPVSFPGFAFGCVHR 210

Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQ 294
             G + + ++  G++GLG+ ++S+ S L     I   FS C      D + S  + FG  
Sbjct: 211 SGGIFDEHSS--GIVGLGVAELSMISQLKST--INGRFSYCLLPVFTDSSMSSRINFGRS 266

Query: 295 GPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQSGF---------QALVDSGAS 342
           G  +   T   P+   G     Y + +E + +G   L+  GF           +VDSG +
Sbjct: 267 GIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDSGTT 326

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS-KNQSF 401
           +T+LP E Y ++       +  KR+         CYN + ++ +  P +   F   N   
Sbjct: 327 YTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQ-IDAPIITAHFKDANVEL 385

Query: 402 VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ----NFMMGHRIVFDRENLKLAWSHS 457
              N      E+    + C TV+ T  D GI+G     NF++G    FD    ++++  +
Sbjct: 386 QPWNTFLRMQED----LVCFTVLPTS-DIGILGNLAQVNFLVG----FDLRKKRVSFKAA 436

Query: 458 KC 459
            C
Sbjct: 437 DC 438


>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
          Length = 290

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 75/247 (30%), Positives = 119/247 (48%), Gaps = 22/247 (8%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           +++  ++P  + VE  EL   +  + ++    LQS N      + FP +G+    F    
Sbjct: 25  LTLERAFPSNDGVELSELRARDSLRHRRM---LQSTNYV----VDFPVKGT----FDPSQ 73

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
             L+YT + +GTP     V +D GS++LWV C      P ++     L   L+ +DP SS
Sbjct: 74  VGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTS----GLQIQLNYFDPGSS 129

Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           S+S  +SC    C+     S +SC    + C Y   Y  + + +SGY V D++H AS  +
Sbjct: 130 STSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYG-DGSGTSGYYVSDLMHFASIFE 188

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
               ++  +SV+ GC   QTG       A DG+ G G   +SV S L+  G+    FS C
Sbjct: 189 GTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHC 248

Query: 281 FDENDSG 287
              ++SG
Sbjct: 249 LKGDNSG 255


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 83/323 (25%), Positives = 148/323 (45%), Gaps = 46/323 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP  +F + +D GS + +VPC  C QC                +++P  
Sbjct: 89  YYTTRIWI--GTPPQTFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFEPEL 136

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + VSC+        +C + +  C Y   Y+ E +SSSG L +DI+   + S+  PQ
Sbjct: 137 SSTYQPVSCN-----IDCTCDNERKQCVYERQYA-EMSSSSGVLGEDIISFGNQSELVPQ 190

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +     I GC  ++TG      A DG+MGLG GD+S+   L + G+I +SFS+C+   D
Sbjct: 191 RA-----IFGCENQETGDLYSQRA-DGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMD 244

Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
            G    +  G   P+        P+  +Y  Y + +++  +    L             +
Sbjct: 245 IGGGAMILGGISPPSGMVFAESDPVRSQY--YNIDLKAIHVAGKQLHLDPSIFDGKHGTV 302

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV----P 389
           +DSG ++ +LP   +        K ++S +  + G    Y   C++ +  ++ ++    P
Sbjct: 303 LDSGTTYAYLPEAAFTAFKDAMMKELTSLK-QIHGPDPNYNDICFSGAESDVSQLSNTFP 361

Query: 390 DMRLIFSKNQSFVV--RNHIFSF 410
            + ++FS  Q   +   N++F +
Sbjct: 362 AVEMVFSNGQKLSLSPENYLFQY 384


>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
 gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
          Length = 583

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/379 (25%), Positives = 159/379 (41%), Gaps = 44/379 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSS 166
           L++T+I +G P   + + +D  S+L W+ C   C  CA  + + Y     N+        
Sbjct: 207 LYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKPRRDNIV------- 259

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
            + K+  C       ++        C Y  +Y+ + +SS G L  D LHL      A  S
Sbjct: 260 -TPKDSLCVELHRNQKAGYCETCQQCDYEIEYA-DHSSSMGVLARDELHLT----MANGS 313

Query: 227 SVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DE 283
           S       GC   Q G  L+     DG++GL    VS+PS LA  G+I N    C   D 
Sbjct: 314 STNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLANDV 373

Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGE--KYDAYFVGVESYCIGNSCLTQSGFQALV---- 337
              G +F GD     +   S++P+ +    D+Y   +     G+  L+  G +  V    
Sbjct: 374 VGGGYMFLGDDF-VPRWGMSWVPMLDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRIV 432

Query: 338 -DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIF 395
            DSG+S+T+   E Y+E+V    ++     I    + +  +C+ A    +  V D++  F
Sbjct: 433 FDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPFCWRAKF-PIRSVIDVKQYF 491

Query: 396 SK-----NQSFVVRNHIFSFPENEGFTVF------CLTVMS----TDGDYGIIGQNFMMG 440
                     + + +  F  P  EG+ +       CL ++      DG   I+G   + G
Sbjct: 492 KTLTLQFGSKWWIISTKFRIPP-EGYLIISNKGNVCLGILDGSDVHDGSSIILGDISLRG 550

Query: 441 HRIVFDRENLKLAWSHSKC 459
             I++D  N K+ W+ S C
Sbjct: 551 QLIIYDNVNNKIGWTQSDC 569


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 100/387 (25%), Positives = 168/387 (43%), Gaps = 81/387 (20%)

Query: 13  CILLDGSDAVS--FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDW 70
           C ++  S A++  F+ +L+HR  D +K  +   + N     +   + S+  +     N +
Sbjct: 16  CFIISLSHALNNGFTLELIHR--DSSKSPFYQPTQNKYERIANAVRRSINRV-----NHF 68

Query: 71  KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
            +       QS  NS + +                 Y + Y+   IGTP       +D G
Sbjct: 69  YKYSLTSTPQSTVNSDKGE-----------------YLMSYS---IGTPPFKVFGFVDTG 108

Query: 131 SNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKS 187
           S+L+W+ C+ C QC P          +    +DPS SSS +N+ C    C S   +SC  
Sbjct: 109 SDLVWLQCEPCKQCYP----------QITPIFDPSLSSSYQNIPCLSDTCHSMRTTSC-- 156

Query: 188 LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG 247
             D   Y++  +    S++GY V       SF K           +IGCG + TG++   
Sbjct: 157 --DVRGYLSVETLTLDSTTGYSV-------SFPK----------TMIGCGYRNTGTFHGP 197

Query: 248 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFGDQGPATQQSTSF 304
           ++  G++GLG G +S+PS L  +  I   FS C      N +  + FGD           
Sbjct: 198 SS--GIVGLGSGPMSLPSQLGTS--IGGKFSYCLGPWLPNSTSKLNFGDAAIVYGDGAMT 253

Query: 305 LPIGEKYDA---YFVGVESYCIGNSCLTQSG-------FQALVDSGASFTFLPTEIYAEV 354
            PI +K DA   Y++ +E++ +GN  +   G          L+DSG +FTFLP ++Y   
Sbjct: 254 TPIVKK-DAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFTFLPYDVYYRF 312

Query: 355 VVKFDKLVSSKRISLQGNSWKYCYNAS 381
                + ++ + +     ++K CYN +
Sbjct: 313 ESAVAEYINLEHVEDPNGTFKLCYNVA 339


>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
 gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 105/425 (24%), Positives = 185/425 (43%), Gaps = 55/425 (12%)

Query: 66  LSNDWKRQKTRVKLQSNNNSSRNQL-LFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           + + W++ + ++++     +  N   L P +G+   F   Q+Y    T I +G P   + 
Sbjct: 148 IDDGWRKARNKMEVAKAAAAGTNSTALLPIKGNV--FPDGQYY----TSIFVGNPPRPYF 201

Query: 125 VALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
           + +D GS+L W+ C   C  CA      Y      +           +++ C   L  ++
Sbjct: 202 LDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKIV--------PPRDLLCQE-LQGNQ 252

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
           + C++ K  C Y  +Y+ + +SS G L  D +HL + +        +   + GC   Q G
Sbjct: 253 NYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHLIATNG----GREKLDFVFGCAYDQQG 306

Query: 243 SYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQ 299
             L   A  DG++GL    +S+PS LA  G+I N F  C   ++   G +F GD     +
Sbjct: 307 QLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQGGGGYMFLGDDY-VPR 365

Query: 300 QSTSFLPIGEKYD-AYFVGVESYCIGNSCLT---QSG--FQALVDSGASFTFLPTEIYAE 353
              ++  I    D  Y         G+  L    Q+G   Q + DSG+S+T+LP EIY  
Sbjct: 366 WGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVIFDSGSSYTYLPDEIYEN 425

Query: 354 VVVK-------FDKLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQSFVVR 404
           +V         F +  S + + L    WK  +     E +K     + L F K   F+ +
Sbjct: 426 LVAAIKYASPGFVQDSSDRTLPL---CWKADFPVRYLEDVKQFFKPLNLHFGKKWLFMSK 482

Query: 405 NHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLKLAWSH 456
               S PE+          CL +++ T+ ++G   I+G   + G  +V+D +  ++ W++
Sbjct: 483 TFTIS-PEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRRQIGWTN 541

Query: 457 SKCEE 461
           S C +
Sbjct: 542 SDCTK 546


>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 164/387 (42%), Gaps = 54/387 (13%)

Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN +   +Y+  I+IG  + +F   +D+GS+L WV C   C  C       Y   +  L+
Sbjct: 47  GNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALN 106

Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD--ILHLA 217
            ++P  +S        HP+  +   CKS  D C Y  +Y+ +  SS G LV+D   L L 
Sbjct: 107 CFEPLCTSL-------HPI--TNHHCKSADDQCQYEIEYA-DHGSSLGVLVNDHVPLKLT 156

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGLIQNS 276
           + S  AP+      +  GCG     S  D + P  GV+GLG G+VS  S L+  G+++N 
Sbjct: 157 NGSLAAPR------IAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNV 210

Query: 277 FSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 332
              C  + + G +FFGD+       T  S S   IG  Y +   G      G        
Sbjct: 211 VGHCLSD-EGGFLFFGDEFVPSSGVTWTSMSHESIGSYYSS---GPAEVYFGGKATGIKD 266

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS-------- 382
              + DSG+S+T+  ++ Y  ++      +  K +  + +  S   C+  +         
Sbjct: 267 LTLVFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDV 326

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTD----GDYGIIG 434
           ++   +  +R   +KN    +       PEN      +   C  +++      GD  IIG
Sbjct: 327 KKYFNLLALRFTKTKNAQIQLP------PENYLIITKYGNVCFGILNGTEVGLGDLNIIG 380

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKCEE 461
              +    +++D E  ++ W  + C +
Sbjct: 381 DISLKDKMVIYDNERRRIGWFPTNCNK 407


>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
 gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
          Length = 407

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 95/382 (24%), Positives = 161/382 (42%), Gaps = 56/382 (14%)

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y  ++IG P   + + +D GSNL W+ C      C  C  +    Y              
Sbjct: 41  YVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLY-------------- 86

Query: 166 SSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
               K V C+ PLC        +   C+   D C Y  +Y+ + T+S G L+ D   L +
Sbjct: 87  -RPKKLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYA-DGTTSLGVLLLDKFSLPT 144

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLI- 273
            S          ++  GCG  Q       A      DG++GLG G V + S L  +G + 
Sbjct: 145 GS--------ARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVS 196

Query: 274 QNSFSICFDENDSGSVFFGDQG-PATQQSTSFLP-IGEKYDAYFVGVESYCIGNSCLTQS 331
           +N    C      G +F G++  P++     ++  I  + + Y  G  +  +G + +   
Sbjct: 197 KNVIGHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTK 256

Query: 332 GFQALVDSGASFTFLPTEIYAEVV--VKFDKLVSS-KRISLQGNSWKYCYNASSEEMLKV 388
            F+A+ DSG+++T+LP  ++A++V  +K   + SS K +S        C+    +    V
Sbjct: 257 PFKAIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKG-PKPFKTV 315

Query: 389 PDMRLIFSKNQSFVVRNHIFSF---PEN----EGFTVFCLTVMSTDG-DYGIIGQNFMMG 440
            D+   F K+   +  +H  +    PEN     G    C  ++   G D  +IG   M  
Sbjct: 316 HDLPKEF-KSLVTLKFDHGVTMTIPPENYLIITGHGNACFGILELPGYDLFVIGGISMQE 374

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             ++ D E  +LAW  S C+++
Sbjct: 375 QLVIHDNEKGRLAWMPSPCDKM 396


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 98/390 (25%), Positives = 163/390 (41%), Gaps = 59/390 (15%)

Query: 107 YWLHYTWIDIGTPNVSFL-VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H+   +IGTP    + + +D GS+L+W      QC P    +    D+    +DPS 
Sbjct: 87  YLIHF---NIGTPRPQRVALTMDTGSDLVWT-----QCTPCPVCF----DQPFPLFDPSV 134

Query: 166 SSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           SS+ + V+C  P+C+     S S+C      C Y+  Y  + + ++GY+  D     S +
Sbjct: 135 SSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYG-DKSITAGYIFKDTFTFMSPN 193

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
                    S +  GCG   TG +    +  G+ G G G +S+PS L + G     FS C
Sbjct: 194 GEGAPPVAVSGLAFGCGDYNTGVFASNES--GIAGFGRGPLSLPSQL-RVG----RFSYC 246

Query: 281 F------DENDSGSVFFG---------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
                  + N + +VF G           GP   +ST  +        Y++ +E   +G 
Sbjct: 247 LTSHDETESNKTSAVFLGTPPNGLRAHSSGPF--RSTPIIHSPSFPTFYYLSLEGITVGK 304

Query: 326 SCL-TQSGFQAL---------VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 375
           + L   S   AL         +DSG   T  P  ++ ++  +F   +   R         
Sbjct: 305 TRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGN 364

Query: 376 YCYNASSEEMLKVPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 432
                  +   +VP  +LIF   S +      N+I   PE+    V CL +   + D  +
Sbjct: 365 LLCFQRPKGGKQVPVPKLIFHLASADMDLPRENYI---PEDTDSGVMCLMINGAEVDMVL 421

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IG        IV+D EN KL ++ ++C+++
Sbjct: 422 IGNFQQQNMHIVYDVENSKLLFASAQCDKM 451


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 120/490 (24%), Positives = 200/490 (40%), Gaps = 93/490 (18%)

Query: 3   NLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
           N+V +  LF  + +  +    FS  L+HR  D     +             P K   E L
Sbjct: 11  NVVVVGFLFQLLEVALARGGGFSVDLIHR--DSPHSPFFD-----------PSKTQAERL 57

Query: 63  ELLLSNDWKRQKTRV------KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDI 116
               ++ ++R  +RV       + S+   SR   + PS G          Y ++   + I
Sbjct: 58  ----TDAFRRSVSRVGRFRPTAMTSDGIQSR---IVPSAGE---------YLMN---LYI 98

Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
           GTP V  +  +D GS+L W      QC P +  Y     + +  +DP +SS+ ++ SC  
Sbjct: 99  GTPPVPVIAIVDTGSDLTWT-----QCRPCTHCY----KQVVPLFDPKNSSTYRDSSCGT 149

Query: 177 PLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
             C    K RS  K  K  C +   Y+ + + + G L  + L + S    A +       
Sbjct: 150 SFCLALGKDRSCSKEKK--CTFRYSYA-DGSFTGGNLASETLTVDS---TAGKPVSFPGF 203

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSG 287
             GCG    G +   ++  G++GLG G++S+ S L     I   FS C      D + S 
Sbjct: 204 AFGCGHSSGGIFDKSSS--GIVGLGGGELSLISQLKST--INGLFSYCLLPVSTDSSISS 259

Query: 288 SVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGF---------QAL 336
            + FG  G  +   T   P+ +K     Y++ +E   +G   L   G+           +
Sbjct: 260 RINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVEEGNII 319

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           VDSG ++TFLP E Y+++       +  KR+      +  CYN ++E  +  P +   F 
Sbjct: 320 VDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAE--INAPIITAHFK 377

Query: 397 -KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ----NFMMGHRIVFDRENLK 451
             N      N      E+    + C TV  T  D G++G     NF++G    FD    +
Sbjct: 378 DANVELQPLNTFMRMQED----LVCFTVAPTS-DIGVLGNLAQVNFLVG----FDLRKKR 428

Query: 452 LAWSHSKCEE 461
           +++  + C +
Sbjct: 429 VSFKAADCTQ 438


>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
          Length = 654

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 94/374 (25%), Positives = 164/374 (43%), Gaps = 48/374 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           HYTW+  GTP     V  D GS L+  PC  C  C   +   + +           +SS+
Sbjct: 65  HYTWVYAGTPPQRASVIADTGSGLMAFPCSGCDGCGSHTDQPFQA----------DNSST 114

Query: 169 SKNVSC----SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---ASFSK 221
             +V+C    SH  CK    C    D C     Y  E +S    +V+D+++L   +SF  
Sbjct: 115 LIHVTCSQQQSHFQCK---ECTEKSDTCAISQSY-MEGSSWKASVVEDVVYLGGESSFHD 170

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 280
            A +    +    GC   +TG ++   A DG+MGL   D  + + L +   I  N FS+C
Sbjct: 171 EAMRDRYGTHFQFGCQSSETGLFVTQVA-DGIMGLSNSDTHIVAKLHRENKIPSNLFSLC 229

Query: 281 FDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQ-----S 331
           F EN  G++  G+    A +   S+  + +   A   Y V ++   IG   +       +
Sbjct: 230 FTEN-GGTMSVGEPNTKAHRGEISYAKVIKDRSAGHFYNVNMKDIRIGGKSINAKEEAYT 288

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
               +VDSG + ++LP  +  E +  F ++    R    G S   C+  ++E++  +P +
Sbjct: 289 RGHYIVDSGTTDSYLPRAMKNEFLQVFKEVAG--RDYQVGTS---CHGYTNEDLASLPKI 343

Query: 392 RLIFSKNQSFVVRNH--IFSFPENEGF----TVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
           +L+    +++   N   I   P  +        +C ++  ++   G+IG N MM   ++F
Sbjct: 344 QLVM---EAYGDENGEVIIDIPPEQYLLHNDNSYCGSIYLSENAGGVIGANLMMNRDVIF 400

Query: 446 DRENLKLAWSHSKC 459
           D  N ++ +  + C
Sbjct: 401 DNGNQRVGFVDADC 414


>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
          Length = 357

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/370 (24%), Positives = 152/370 (41%), Gaps = 47/370 (12%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           IG P   + + +D GS+L W+ C   C  C  +    Y               ++++ V 
Sbjct: 1   IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY-------------RPTANRLVP 47

Query: 174 CSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           C++ LC +  S +   + CP      Y   Y T+  SS G L++D     SFS     S+
Sbjct: 48  CANALCTALHSGQGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----SFSLPMRSSN 101

Query: 228 VQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
           ++  +  GCG  Q         AA DG++GLG G VS+ S L + G+ +N    C   N 
Sbjct: 102 IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNG 161

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVDSGASF 343
            G +FFGD    + + T ++P+ ++     Y  G  +       L     + + DSG+++
Sbjct: 162 GGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTY 220

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF-SKNQSFV 402
           T+   + Y  VV      +S     +   +   C+    +    V D++  F S   SF 
Sbjct: 221 TYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKG-QKAFKSVFDVKNEFKSMFLSFA 279

Query: 403 -VRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIVFDRENLK 451
             +N     P      V      CL ++  DG      + +IG   M    +++D E  +
Sbjct: 280 SAKNAAMEIPPENYLIVTKNGNVCLGIL--DGTAAKLSFNVIGDITMQDQMVIYDNEKSQ 337

Query: 452 LAWSHSKCEE 461
           L W+   C  
Sbjct: 338 LGWARGACTR 347


>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
 gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
          Length = 410

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 96/386 (24%), Positives = 162/386 (41%), Gaps = 50/386 (12%)

Query: 103 GNQFYWLHYTWI-DIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN +   +Y+ I +IG P  +F   +D GS+L WV C   C  C       Y     NL 
Sbjct: 46  GNVYPTGYYSVILNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRDKLYKP-KNNL- 103

Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
                       V CS+ LC++ S+     C +  D C Y  +Y+ +  SS G L+ D  
Sbjct: 104 ------------VPCSNSLCQAVSTGENYHCDAPDDQCDYEIEYA-DLGSSIGVLLSDSF 150

Query: 215 HLASFSKHAPQSSVQSSVIIGCG--RKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKA 270
            L    + +  + +Q  +  GCG  +K  G +     PD  G++GLG G VS+ S L   
Sbjct: 151 PL----RLSNGTLLQPKMAFGCGYDQKHLGPH---PPPDTAGILGLGRGKVSILSQLRTL 203

Query: 271 GLIQNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
           G+ QN    CF     G +FFGD   P+++ + + +        Y  G      G     
Sbjct: 204 GITQNVVGHCFSRARGGFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTG 263

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCYNASS-- 382
             G Q + DSG+S+T+   ++Y  ++    K ++ K +           WK      S  
Sbjct: 264 IKGLQLIFDSGSSYTYFNAQVYQSILNLVRKDLAGKPLKDAPEKELAVCWKTAKPIKSIL 323

Query: 383 --EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQN 436
             +   K   +  + +KN    +    +     +G    CL +++      G++ +IG  
Sbjct: 324 DIKSYFKPLTISFMNAKNVQLQLAPEDYLIITKDGNV--CLGILNGSEQQLGNFNVIGDI 381

Query: 437 FMMGHRIVFDRENLKLAWSHSKCEEV 462
           FM    +++D E  ++ W  + C+ +
Sbjct: 382 FMQDRVVIYDNEKQQIGWFPANCDRL 407


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/361 (25%), Positives = 152/361 (42%), Gaps = 47/361 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP     +  D GS L+W      QC P  A Y       +  +DP+ S+S K + 
Sbjct: 136 VGIGTPKKEMPLIFDTGSGLIWT-----QCKPCKACY-----PKVPVFDPTKSASFKGLP 185

Query: 174 CSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           CS  LC+S R  C S K  C Y+  Y  +++SS+G L  + +  +             ++
Sbjct: 186 CSSKLCQSIRQGCSSPK--CTYLTAY-VDNSSSTGTLATETISFSHLKYDF------KNI 236

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN--DSGSVF 290
           +IGC  + +G  L      G+MGL    +S+ S    A +    FS C       +G + 
Sbjct: 237 LIGCSDQVSGESL---GESGIMGLNRSPISLAS--QTANIYDKLFSYCIPSTPGSTGHLT 291

Query: 291 FGDQGPATQQSTSFLPIGEK-----YDAYFVGVESYCIGNSCLT--QSGFQ--ALVDSGA 341
           FG + P       F P+ +      YD    G+    +G   L    S F+  + +DSGA
Sbjct: 292 FGGKVP---NDVRFSPVSKTAPSSDYDIKMTGIS---VGGRKLLIDASAFKIASTIDSGA 345

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK--NQ 399
             T LP + Y+ +   F +++    +  Q +    CY+ S+   + +P + + F      
Sbjct: 346 VLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEM 405

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              V   ++  P   G  V+CL     D +  I G      + +VFD    ++ ++   C
Sbjct: 406 DIDVSGIMWQVP---GSKVYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462

Query: 460 E 460
           +
Sbjct: 463 D 463


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score = 92.4 bits (228), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 100/377 (26%), Positives = 154/377 (40%), Gaps = 51/377 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +GTP V  L+ALD  S+L W+ CQ C +C P S             +DP  S+S   +
Sbjct: 138 IAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPV----------FDPRHSTSYGEM 187

Query: 173 SCSHPLCKS--RSSCKSLK-DPCPYIADYSTEDTSSS---GYLVDDILHLASFSKHAPQS 226
           +   P C++  RS     K   C Y   Y     S+S   G LV++ L  A   +     
Sbjct: 188 NYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVR----- 242

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
             Q+ + IGCG    G  L GA   G++GLG G +S+P  +A  G    SFS C  +  S
Sbjct: 243 --QAYLSIGCGHDNKG--LFGAPAAGILGLGRGQISIPHQIAFLGY-NASFSYCLVDFIS 297

Query: 287 G------SVFFGDQGPATQQSTSFLP------IGEKYDAYFVGVESYCIGNSCLTQSGFQ 334
           G      ++ FG     T    SF P      +   Y    +GV    +    +T+   Q
Sbjct: 298 GPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQ 357

Query: 335 ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY--CYNASS 382
                     ++DSG + T L    Y      F    +S  ++S  G S  +  CY    
Sbjct: 358 LDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGG 417

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
              +KVP + + F+      ++   +  P +   TV      + D    +IG     G R
Sbjct: 418 RAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFR 477

Query: 443 IVFDRENLKLAWSHSKC 459
           +V+D    ++ ++ + C
Sbjct: 478 VVYDLAGQRVGFAPNNC 494


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score = 92.4 bits (228), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 152/366 (41%), Gaps = 60/366 (16%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  + +G+P     + +D+GS+++WV C+ C QC       Y   D     +DP++SSS
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLASFSKHAPQ 225
              VSC   +C++ S             DYS    + + + G L  + L L         
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGG------- 232

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
           ++VQ  V IGCG + +G ++  A   G++GLG G +S+   L   G     FS C     
Sbjct: 233 TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVFSYCLASRG 286

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC---------LTQSGFQAL 336
           +G         A   ++SF         Y+VG+    +G            LT+ G   +
Sbjct: 287 AGG--------AGSLASSF---------YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGV 329

Query: 337 V-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           V D+G + T LP E YA +   FD  + +   S   +    CY+ S    ++VP +   F
Sbjct: 330 VMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYF 389

Query: 396 SKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            +     +  RN +       G  VFCL    +     I+G     G +I  D  N  + 
Sbjct: 390 DQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVG 445

Query: 454 WSHSKC 459
           +  + C
Sbjct: 446 FGPNTC 451


>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 467

 Score = 92.4 bits (228), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 99/411 (24%), Positives = 163/411 (39%), Gaps = 53/411 (12%)

Query: 76  RVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLW 135
           +VKLQ  N    + ++FP  G+  +  G      +Y  ++IG P   F + +D GS+L W
Sbjct: 42  QVKLQ--NRRLGSSVVFPVSGN-VYPLG-----YYYVLLNIGNPPKLFDLDIDTGSDLTW 93

Query: 136 VPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSL 188
           V C   C  C    A           +Y P+ ++    + CSH LC          C   
Sbjct: 94  VQCDAPCNGCTKPRAK----------QYKPNHNT----LPCSHLLCSGLDLTQNRPCDDP 139

Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDG 247
           +D C Y   YS +  SS G LV D   L    K A  S +   +  GCG  +Q       
Sbjct: 140 EDQCDYEIGYS-DHASSIGALVTDEFPL----KLANGSIMNPHLTFGCGYDQQNPGPHPP 194

Query: 248 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLP 306
               G++GLG G V + + L   G+ +N    C      G +  GD+  P++  + + L 
Sbjct: 195 PPTAGILGLGRGKVGISTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLA 254

Query: 307 IGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 366
                  Y  G       +      G   + DSG+S+T+   E Y  ++    K ++ K 
Sbjct: 255 TNSASKNYMTGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKP 314

Query: 367 I--SLQGNSWKYCYNASS--------EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEG 415
           +  +    S   C+            ++  K   +R  + KN Q F V    +     +G
Sbjct: 315 LTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGYQKNGQLFQVPPESYLIITEKG 374

Query: 416 FTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
               CL +++        Y I+G     G  +++D E  ++ W  S C+++
Sbjct: 375 NV--CLGILNGTEVGLDSYNIVGDISFQGIMVIYDNEKQRIGWISSDCDKI 423


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score = 92.4 bits (228), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 95/382 (24%), Positives = 165/382 (43%), Gaps = 40/382 (10%)

Query: 96  GSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD 155
           G ++      F +L Y  +++GTP    L   D GS+L+WV C         A    ++ 
Sbjct: 91  GVESKIITRSFEYLMY--VNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNV- 147

Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDD 212
                + P+ SS+   +SC    C+  S++SC +  + C Y   YS  D S + G L  +
Sbjct: 148 ----VFQPTRSSTYSQLSCQSNACQALSQASCDADSE-CQY--QYSYGDGSRTIGVLSTE 200

Query: 213 ILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
                SF     +  V+   V  GC     G++      DG++GLG G  S+ S L    
Sbjct: 201 TF---SFVDGGGKGQVRVPRVNFGCSTASAGTFRS----DGLVGLGAGAFSLVSQLGATT 253

Query: 272 LIQNSFSIC----FDENDSGSVFFGDQGPATQQSTSFLP-IGEKYDAYF-VGVESYCIGN 325
            I    S C    +D N S ++ FG +   ++   +  P +    D+Y+ V +ES  +G 
Sbjct: 254 HIDRKLSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGG 313

Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA---SS 382
             +     + +VDSG + TFL   +   +V + ++ +  +R+       + CY+    S 
Sbjct: 314 QEVATHDSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSE 373

Query: 383 EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNF 437
            +   +PD+ L F    +  +R  + FS  + EG     L  +S      I+G    QNF
Sbjct: 374 TDNFGIPDVTLRFGGGAAVTLRPENTFSLLQ-EGTLCLVLVPVSESQPVSILGNIAQQNF 432

Query: 438 MMGHRIVFDRENLKLAWSHSKC 459
            +G    +D +   + ++ + C
Sbjct: 433 HVG----YDLDARTVTFAAADC 450


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 98/387 (25%), Positives = 162/387 (41%), Gaps = 61/387 (15%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + IGTP     + LD GS+L+W  CQ C  C           D+ L  +DPS+
Sbjct: 35  YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC----------FDQALPYFDPST 81

Query: 166 SSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           SS+    SC   LC+    +SC S K      C Y   Y  + + ++G+L  D       
Sbjct: 82  SSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDKFTFVGA 140

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               P       V  GCG    G +       G+ G G G +S+PS L K G    +FS 
Sbjct: 141 GASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSH 187

Query: 280 CFDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIGN 325
           CF             +    +F   QG    Q+T  +   +       Y++ ++   +G+
Sbjct: 188 CFTTITGAIPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLYYLSLKGITVGS 245

Query: 326 S---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
           +          LT      ++DSG S T LP ++Y  V  +F   +    +         
Sbjct: 246 TRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYT 305

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 435
           C++A S+    VP + L F      + R N++F  P++ G ++ CL +   D +  IIG 
Sbjct: 306 CFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIGN 364

Query: 436 NFMMGHRIVFDRENLKLAWSHSKCEEV 462
                  +++D +N  L++  ++C+++
Sbjct: 365 FQQQNMHVLYDLQNNMLSFVAAQCDKL 391


>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
 gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 466

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 103/420 (24%), Positives = 168/420 (40%), Gaps = 51/420 (12%)

Query: 71  KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
           K    +VKLQ+   SS   ++FP  G+  +  G      +Y  ++IG P   F + +D G
Sbjct: 36  KDSSAQVKLQNRRLSS--TVVFPVSGN-VYPLG-----YYYVLLNIGNPPKLFDLDIDTG 87

Query: 131 SNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RS 183
           S+L WV C   C  C    A           +Y P+ ++    + CSH LC         
Sbjct: 88  SDLTWVQCDAPCNGCTKPRAK----------QYKPNHNT----LPCSHILCSGLDLPQDR 133

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQTG 242
            C   +D C Y   YS +  SS G LV D + L    K A  S +   +  GCG  +Q  
Sbjct: 134 PCADPEDQCDYEIGYS-DHASSIGALVTDEVPL----KLANGSIMNLRLTFGCGYDQQNP 188

Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQS 301
                    G++GLG G V + + L   G+ +N    C      G +  GD+  P++  +
Sbjct: 189 GPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVT 248

Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
            + L        Y  G       +      G   + DSG+S+T+   E Y  ++    K 
Sbjct: 249 WTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKD 308

Query: 362 VSSKRIS--LQGNSWKYCYNASS--------EEMLKVPDMRLIFSKN-QSFVVRNHIFSF 410
           ++ K ++      S   C+            ++  K   +R    KN Q F V    +  
Sbjct: 309 LNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLI 368

Query: 411 PENEG---FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 467
              +G     +   T +  +G Y IIG     G  +++D E  ++ W  S C+++ + +H
Sbjct: 369 ITEKGRVCLGILNGTEIGLEG-YNIIGDISFQGIMVIYDNEKQRIGWISSDCDKLPNVNH 427


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 97/380 (25%), Positives = 163/380 (42%), Gaps = 48/380 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +YT I IG P   + + +D GS+L W+ C   C  CA      Y      +         
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV-------- 238

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             +++ C   L  +++ C++ K  C Y  +Y+ + +SS G L  D +H+ + +       
Sbjct: 239 PPRDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHMIATNG----GR 291

Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
            +   + GC   Q G  L   A  DG++GL    +S PS LA  G+I N F  C   ++ 
Sbjct: 292 EKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQG 351

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLTQ-----SGFQALVD 338
             G +F GD     +   ++  I    D  Y         G+  L +     S  Q + D
Sbjct: 352 GGGYMFLGDDY-VPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410

Query: 339 SGASFTFLPTEIYAEVVVK-------FDKLVSSKRISLQGNSWKYCYNASSEEMLK--VP 389
           SG+S+T+LP EIY  +V         F +  S + + L    WK  +     E +K    
Sbjct: 411 SGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPL---CWKADFPVRYLEDVKQFFE 467

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGH 441
            + L F K   F+ +    S PE+          CL +++ T+ ++G   I+G   + G 
Sbjct: 468 PLNLHFGKKWLFMSKTFTIS-PEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGK 526

Query: 442 RIVFDRENLKLAWSHSKCEE 461
            +V+D +  ++ W+ S C +
Sbjct: 527 LVVYDNQRKQIGWADSDCTK 546


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 93/387 (24%), Positives = 162/387 (41%), Gaps = 62/387 (16%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + IGTP     + LD GS+L+W  C+ C+ C           D+ L  +D S 
Sbjct: 35  YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCKPCVSC----------FDQPLPYFDTSR 81

Query: 166 SSSSKNVSCSHPLCK---SRSSCKSLK---DPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           SS++  + C    CK   + + C  L      C Y   Y  +++ + G L  D     + 
Sbjct: 82  SSTNALLPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSYG-DNSVTIGLLAADKFTFVA- 139

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
                  +    V  GCG   TG +   +   G+ G G G +S+PS L K G    +FS 
Sbjct: 140 ------GTSLPGVTFGCGLNNTGVF--NSNETGIAGFGRGPLSLPSQL-KVG----NFSH 186

Query: 280 CFDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIGN 325
           CF             +    +F   QG    Q+T  +   +       Y++ ++   +G+
Sbjct: 187 CFTTITGAIPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLYYLSLKGITVGS 244

Query: 326 S---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
           +          LT      ++DSG S T LP ++Y  V  +F   +    +         
Sbjct: 245 TRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYT 304

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 435
           C++A S+    VP + L F      + R N++F  P++ G ++ CL +   D +  IIG 
Sbjct: 305 CFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIGN 363

Query: 436 NFMMGHRIVFDRENLKLAWSHSKCEEV 462
                  +++D +N  L++  ++C+++
Sbjct: 364 FQQQNMHVLYDLQNNMLSFVAAQCDKL 390


>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 418

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 93/389 (23%), Positives = 158/389 (40%), Gaps = 50/389 (12%)

Query: 98  QTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLD 155
           Q + + N FY +    + +G P   + +  D GS+L W+ C   C QC            
Sbjct: 48  QGNVYPNGFYNVT---LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCT----------- 93

Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVD 211
                  P    S+  V C  PLC S  S    +    D C Y  +Y+ +  SS G LV 
Sbjct: 94  ---ETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYA-DGGSSLGVLVR 149

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
           D+  L + +   P   ++  + +GCG  Q          DG++GLG G VS+ S L   G
Sbjct: 150 DVFPL-NLTNGDP---IRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQG 205

Query: 272 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQ 330
           +++N    CF+    G +FFGD G        + P+   Y  ++  G             
Sbjct: 206 IVRNVVGHCFNSKGGGYLFFGD-GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL 264

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKV 388
                + DSG+S+T+   + Y  +    ++ ++ K  R ++  ++   C+    + +  +
Sbjct: 265 RNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG-RKPIKSL 323

Query: 389 PDMRLIFSK----NQSFVVRNHIFSFPENEGFTVF------CLTVMSTDGDYG-----II 433
            D+R  F        S      +F  P  EG+ +       CL +++   D G     II
Sbjct: 324 RDVRKYFKPLALSFSSGGRSKAVFEIP-TEGYMIISSMGNVCLGILNGT-DVGLENSNII 381

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           G   M    +V++ E   + W+ + C+ V
Sbjct: 382 GDISMQDKMVVYNNEKQAIGWATANCDRV 410


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 88/360 (24%), Positives = 145/360 (40%), Gaps = 40/360 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P     Y   ++    +DP+ SS+  NVS
Sbjct: 183 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQREK---LFDPARSSTYANVS 234

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C    +       C Y   Y  + + S G+   D L L+S+              
Sbjct: 235 CAAPACSDLDTRGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 286

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
            GCG +  G + + A   G++GLG G  S+P     K G +   F+ C     +G+ +  
Sbjct: 287 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLD 340

Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ---ALVDSGASFTF 345
           FG   PA + +T+ + +      Y+VG+    +G   L   QS F     +VDSG   T 
Sbjct: 341 FGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITR 400

Query: 346 LPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
           LP   Y+ +   F   +S++       +SL       CY+ +    + +P + L+F    
Sbjct: 401 LPPAAYSSLRSAFAAAMSARGYKKAPAVSL----LDTCYDFAGMSQVAIPTVSLLFQGGA 456

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              V      +  +              GD GI+G   +    + +D     +++S   C
Sbjct: 457 RLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 160/378 (42%), Gaps = 49/378 (12%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IGTP   F + LD GS+L W+  QC+ C       Y    +N   YDP  SSS KN+ C 
Sbjct: 198 IGTPPRHFSLILDTGSDLNWI--QCVPC-------YDCFVQNGPYYDPKESSSFKNIGCH 248

Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
            P C   SS      CK+    CPY   Y     ++  + ++   ++L S +  +    V
Sbjct: 249 DPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRV 308

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DE 283
           + +V+ GCG    G +   A    ++GLG G +S  S L    L  +SFS C      D 
Sbjct: 309 E-NVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDT 362

Query: 284 NDSGSVFFG-DQGPATQQSTSF--LPIGEKYDA---YFVGVESYCIGNSCLT-------- 329
           N S  + FG D+        +F  L  G++      Y+V ++S  +G   L         
Sbjct: 363 NVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHL 422

Query: 330 --QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
             +     +VDSG + ++     Y  +   F K V    +         CYN S  E ++
Sbjct: 423 SPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCYNVSGVEKME 482

Query: 388 VPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIV 444
           +P+ R++F      +F V N+       E   + CL ++ T      IIG        I+
Sbjct: 483 LPEFRILFEDGAVWNFPVENYFIKLEPEE---IVCLAILGTPRSALSIIGNYQQQNFHIL 539

Query: 445 FDRENLKLAWSHSKCEEV 462
           +D +  +L ++  KC +V
Sbjct: 540 YDTKKSRLGYAPMKCADV 557


>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
 gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
          Length = 424

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 99/387 (25%), Positives = 159/387 (41%), Gaps = 49/387 (12%)

Query: 102 FGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCA-PLSASYYTSLDRN 157
           FGN +   +Y+  + IG P   F + +D GS+L WV C   C  C  PL   Y      N
Sbjct: 58  FGNVYPLGYYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLHHLYKPR--NN 115

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDD 212
           L             +SC  PLC +  +     C+S  D C Y   Y+ E  SS G LV D
Sbjct: 116 L-------------LSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEG-SSLGVLVTD 161

Query: 213 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAG 271
              L   +     S ++  +  GCG  Q         P  GV+GLG G  S+ S L   G
Sbjct: 162 YFPLRLMNG----SFLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALG 217

Query: 272 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDAYFV-GVESYCIGNSCLT 329
           ++ N    C      G +FFG Q P      S+ P+ +K  D Y+  G      G     
Sbjct: 218 VMGNVIGHCLSRKGGGFLFFG-QDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTG 276

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNAS------ 381
               + + DSG+S+T+   ++Y   +    K +S K  R + +  +   C+  +      
Sbjct: 277 TKAEEFIFDSGSSYTYFNAQVYQSTLNLIRKELSGKPLRDAPEEKALAICWKGTKRFKSV 336

Query: 382 SEEMLKVPDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQ 435
           +E         L F+K +S    +    +    N+G    CL +++      G++ +IG 
Sbjct: 337 NEVKSYFKPFALSFTKAKSVQLQIPPEDYLIVTNDGNV--CLGILNGSEVGLGNFNVIGD 394

Query: 436 NFMMGHRIVFDRENLKLAWSHSKCEEV 462
           N      +++D +  ++ W  + C+ +
Sbjct: 395 NLFQDKLVIYDSDKHQIGWIPANCDRL 421


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 91/378 (24%), Positives = 164/378 (43%), Gaps = 61/378 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + +G+P     + +D+GS+++WV C+ C++C       Y   D     +DP++S++   V
Sbjct: 175 VSVGSPPTEQYLVVDSGSDVMWVQCKPCLEC-------YVQAD---PLFDPATSATFSGV 224

Query: 173 SCSHPLCK--SRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           SC   +C+    S+C   +   C Y   Y+ + + + G L  + L L          +  
Sbjct: 225 SCGSAICRILPTSACGDGELGGCEYEVSYA-DGSYTKGALALETLTLG--------GTAV 275

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-------- 281
             V+IGCG +  G ++  A   G+MGLG G +S+   L   G +  +FS C         
Sbjct: 276 EGVVIGCGHRNRGLFVGAA---GLMGLGWGPMSLVGQL--GGEVGGAFSYCLASRGGYGS 330

Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-TQSG-FQ 334
              ++D+G +  G +  A  +   ++P+     A   Y+VG+    +G+  L  Q+G FQ
Sbjct: 331 GAADDDAGWLVLG-RSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQ 389

Query: 335 --------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSE 383
                    ++D+G + T LP E YA +   F   ++      QG S      CY+ S  
Sbjct: 390 LTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGY 449

Query: 384 EMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
             ++VP +   F  +   ++  RN +          ++CL    +     I+G     G 
Sbjct: 450 ASVRVPTVSFCFDGDARLILAARNVLLEVD----MGIYCLAFAPSSSGLSIMGNTQQAGI 505

Query: 442 RIVFDRENLKLAWSHSKC 459
           +I  D  N  + +  + C
Sbjct: 506 QITVDSANGYIGFGPANC 523


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 99/377 (26%), Positives = 158/377 (41%), Gaps = 47/377 (12%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP   F + LD GS+L W+  QC+ C       Y   ++N   YDP  SSS +N+ C 
Sbjct: 187 VGTPPKHFSLILDTGSDLNWI--QCVPC-------YECFEQNGPHYDPGQSSSYRNIGCH 237

Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
              C   SS      CK+    CPY   Y     ++  + ++      + S   P+    
Sbjct: 238 DSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRV 297

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
            +V+ GCG    G +   A    ++GLG G +S  S L    L  +SFS C      D N
Sbjct: 298 ENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDAN 352

Query: 285 DSGSVFFG-DQGPATQQSTSF--LPIGEKYDA---YFVGVESYCIGNSCL---------- 328
            S  + FG D+   +    +F  L  G++      Y+V ++S  +G   +          
Sbjct: 353 VSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIA 412

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
           T      ++DSG + ++     Y  +   F   V    +       + CYN +  E   +
Sbjct: 413 TDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDL 472

Query: 389 PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVF 445
           PD  ++FS     +F V N+   F E E   V CL ++ T      IIG        I++
Sbjct: 473 PDFGIVFSDGAVWNFPVENY---FIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILY 529

Query: 446 DRENLKLAWSHSKCEEV 462
           D +  +L ++ +KC +V
Sbjct: 530 DTKKSRLGFAPTKCADV 546


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 155/365 (42%), Gaps = 43/365 (11%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP   F + +D GS + +VPC  C  C    A +          + P +SSS + VSC
Sbjct: 105 IGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFD-------PRFKPDNSSSYQTVSC 157

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
           + P C ++  C +    C Y   Y+ E +SS G L  D+L   + S+  P       ++ 
Sbjct: 158 NSPDCITKM-CDARVHQCKYERVYA-EMSSSKGVLGKDLLGFGNGSRLQPHP-----LLF 210

Query: 235 GCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--SGSVFF 291
           GC   +TG  YL  A  DG+MGLG G +S+   L   G +++SFS+C+   D   GS+  
Sbjct: 211 GCETAETGDLYLQHA--DGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMVL 268

Query: 292 GDQGPATQQSTSFLPIGEKYDAYF------VGVESYCIGNSCLTQSG-FQALVDSGASFT 344
           G   P    +  F         Y+      + V+   +       +G    ++DSG ++ 
Sbjct: 269 GAIPPPP--AMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLGTVLDSGTTYA 326

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY---CYNASSEEMLKV----PDMRLI 394
           +LP + +      F   ++ +  SLQ   G    Y   C+  +  +   +    P +  +
Sbjct: 327 YLPDKAFD----AFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFV 382

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
           FS NQ   +    + F   +    +CL          ++G   +    + +DR N ++ +
Sbjct: 383 FSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQIGF 442

Query: 455 SHSKC 459
             + C
Sbjct: 443 FKTNC 447


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score = 91.7 bits (226), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 74/259 (28%), Positives = 126/259 (48%), Gaps = 27/259 (10%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTS-LDRNLSEYDPSSSS 167
           L+YT I +GTP   F V +D GSN+ WV     +CAP +   ++  +   +S +DP  S+
Sbjct: 40  LYYTRISLGTPPQQFYVDVDTGSNVAWV-----KCAPCTGCEHSGDVPVPMSTFDPRKST 94

Query: 168 SSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF-SKHA 223
           +  ++SC+   C     +  C   +  CPY   Y  + +S++GY ++D+       S ++
Sbjct: 95  TKISISCTDAECGVLNKKLQCSPERLSCPYSLLYG-DGSSTAGYYLNDVFTFNQVPSDNS 153

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-- 281
              S  + ++ GCG  QTGS+    + DG++G G   VS+P+ LA+  +  N F+ C   
Sbjct: 154 TAKSGTARLVFGCGGTQTGSW----SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQG 209

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCLTQSGFQ------ 334
           D +  GS+  G      +    + P+    D Y V + +  I G +  T + F       
Sbjct: 210 DVSGRGSLVIGT---IREPDLVYTPMVFGEDHYNVQLLNIGISGRNVTTPASFDLEYTGG 266

Query: 335 ALVDSGASFTFLPTEIYAE 353
            ++DSG + T+L    Y E
Sbjct: 267 VIIDSGTTLTYLVQPAYDE 285


>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
          Length = 426

 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 96/381 (25%), Positives = 163/381 (42%), Gaps = 59/381 (15%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           ++   +IG P   + +  D GS+L W+ C   CIQC P     Y   +  +   DP  +S
Sbjct: 67  YHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDLVVCKDPICAS 126

Query: 168 -SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAP 224
               N  C  P            D C Y  +Y+ +  SS G LV+D+  ++L S  +  P
Sbjct: 127 LHPDNYRCDDP------------DQCDYEVEYA-DGGSSIGVLVNDLFPVNLTSGMRARP 173

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
           +      + IGCG  Q    L G A    DGV+GLG G  S+ + L+  GL++N    CF
Sbjct: 174 R------LTIGCGYDQ----LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCF 223

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---D 338
                G +FFGD          + P+   Y  ++    +  I N     SG + L+   D
Sbjct: 224 SRRGGGYLFFGDD-IYDSSKVIWTPMSRDYLKHYTPGFAELILNG--RSSGLKNLLVVFD 280

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIF- 395
           SG+S+T+  T+ Y  ++    K +  K +  +++ ++   C+    +    + D +  F 
Sbjct: 281 SGSSYTYFNTQTYQTLLSFIKKDLHGKPLKEAVEDDTLPVCWRG-KKPFKSIRDAKKYFK 339

Query: 396 -----------SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMG 440
                      +K+Q F ++   +    ++G    CL +++       +Y IIG   M  
Sbjct: 340 PLALSFGSGWKTKSQ-FEIQQESYLIISSKGSV--CLGILNGTEVGLQNYNIIGDISMQE 396

Query: 441 HRIVFDRENLKLAWSHSKCEE 461
             +++D E   + W  S C+ 
Sbjct: 397 KLVIYDNEKQVIGWQPSNCDR 417


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 102/420 (24%), Positives = 176/420 (41%), Gaps = 53/420 (12%)

Query: 59  VEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGT 118
           V+Y++  LS +  R+ T   L S           P+E       G+  Y +    + +GT
Sbjct: 8   VKYIQSRLSKNLGRENTVKDLDSTT--------LPAESGS--LIGSANYVV---VVGLGT 54

Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
           P     +  D GS+L W      QC P + S Y   D   + +DPS SSS  N++C+  L
Sbjct: 55  PKRDLSLVFDTGSDLTWT-----QCEPCAGSCYKQQD---AIFDPSKSSSYTNITCTSSL 106

Query: 179 CKS------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C        +S C S  D  C Y A Y  ++++S G+L  + L + +       + +   
Sbjct: 107 CTQLTSDGIKSECSSSTDASCIYDAKYG-DNSTSVGFLSQERLTITA-------TDIVDD 158

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSV 289
            + GCG+   G + +G+A  G+MGLG   +S+  +   +      FS C     S  G +
Sbjct: 159 FLFGCGQDNEGLF-NGSA--GLMGLGRHPISI--VQQTSSNYNKIFSYCLPATSSSLGHL 213

Query: 290 FFGDQGPATQQSTSFLPIGE-KYDAYFVGVE--SYCIGNSCL------TQSGFQALVDSG 340
            FG    AT  S  + P+     D  F G++  S  +G + L      T S   +++DSG
Sbjct: 214 TFG-ASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSG 272

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
              T L   +YA +   F + +    ++ +      CY+ S  + + VP +   FS   +
Sbjct: 273 TVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVT 332

Query: 401 FVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
             + +      E+E           +D D  + G        +V+D +  ++ +  + C+
Sbjct: 333 VELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGCK 392


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 102/431 (23%), Positives = 172/431 (39%), Gaps = 48/431 (11%)

Query: 60  EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGSQTHFFGNQFYWLHYTWIDIGT 118
           E+ E+L ++D  R             S N ++ F  +G+   +       L+YT I++GT
Sbjct: 4   EHFEMLKAHDRARH----------GRSLNTIVDFTLQGTADPYVAG----LYYTRIELGT 49

Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
           P   F V +D GS++LWV C+     PL++     L   L+ +DP  SS++  +SC    
Sbjct: 50  PPRPFYVQIDTGSDILWVNCKPCNACPLTS----GLGVALNFFDPRGSSTASPLSCIDSK 105

Query: 179 CK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C      S S C + +  C Y  +Y  + + + GY V D      +      ++  + + 
Sbjct: 106 CVSSNQISESVCTTDRY-CGYSFEYG-DGSGTLGYYVSDEFDYNQYVNQYVTNNASAKIT 163

Query: 234 IGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
            GC   Q+G       A DG+ G G  D+SV S L   GL    FS C +  D G     
Sbjct: 164 FGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGGGIL- 222

Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFT 344
             G  T+    + PI      Y + ++   +    L        T +    ++D G +  
Sbjct: 223 VLGEITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGTTLA 282

Query: 345 FLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF-SKNQSF 401
           +L  E Y   V      V  S++   L+GN    C+          P + L F       
Sbjct: 283 YLAEEAYEPFVNTIIAAVSQSTQPFMLKGNP---CFLTVHSIDEIFPSVTLYFEGAPMDL 339

Query: 402 VVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQNFMMGHRIVFDRENLKLAWS 455
             ++++      +   V+C+        +TD     I+G   +     V+D EN ++ W+
Sbjct: 340 KPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDKVFVYDLENQRIGWT 399

Query: 456 HSKCEEVIDKS 466
              C   ++ S
Sbjct: 400 SFDCSSTVNVS 410


>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 90/374 (24%), Positives = 153/374 (40%), Gaps = 48/374 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           I+IG P   + + LD GS+L W+ C   C++C              L    P    SS  
Sbjct: 64  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 109

Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV D+  +     +    
Sbjct: 110 IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDVFSM----NYTKGL 163

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
            +   + +GCG  Q          DGV+GLG G VS+ S L   G ++N    C      
Sbjct: 164 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 223

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSCLTQSGFQALVDSGASF 343
           G +FFGD    + +  S+ P+  +Y  ++   +G E    G           + DSG+S+
Sbjct: 224 GILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGRTTGLKNLLTVFDSGSSY 281

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS----SEEMLKVPDMRLIFSK 397
           T+  ++ Y  V     + +S K +  +   ++   C+       S E +K     L  S 
Sbjct: 282 TYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSF 341

Query: 398 NQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIVFDRE 448
              +  +  +F  P      +      CL +++       +  +IG   M    I++D E
Sbjct: 342 KTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNE 400

Query: 449 NLKLAWSHSKCEEV 462
              + W  + C+E+
Sbjct: 401 KQSIGWMPADCDEL 414


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 111/431 (25%), Positives = 181/431 (41%), Gaps = 73/431 (16%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSF 123
           L+ ++ + Q  ++K+++  +S+  Q +  ++   T   G +   L+Y   +++G  N+S 
Sbjct: 43  LVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTS--GIKLESLNYIVTVELGGKNMSL 100

Query: 124 LVALDAGSNLLWVPCQ-CIQC----APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
           +V  D GS+L WV CQ C  C     PL              YDPS SSS K V C+   
Sbjct: 101 IV--DTGSDLTWVQCQPCRSCYNQQGPL--------------YDPSVSSSYKTVFCNSST 144

Query: 179 CKSRSSCKS-----------LKDPCPYI-----ADYSTEDTSSSGYLVDDILHLASFSKH 222
           C+   +  S           +K PC Y+       Y+  D +S   L+ D   L +F   
Sbjct: 145 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENF--- 200

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC-- 280
                     + GCGR   G +   +   G+       VS+ S   K       FS C  
Sbjct: 201 ----------VFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLP 245

Query: 281 -FDENDSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFVGVESYCIGNSCLTQSGFQ 334
             ++  SGS+ FG+       ST  S+ P+ +       Y + +    IG   L  S F 
Sbjct: 246 SLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFG 305

Query: 335 A--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              L+DSG   T LP  IY  V ++F K  S    +   +    C+N +S E + +P ++
Sbjct: 306 RGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIK 365

Query: 393 LIFSKNQSFVVR-NHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFDREN 449
           +IF  N    V    +F F + +  ++ CL +  +S + + GIIG       R+++D   
Sbjct: 366 MIFQGNAELEVDVTGVFYFVKPDA-SLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQ 424

Query: 450 LKLAWSHSKCE 460
            +L      C 
Sbjct: 425 ERLGIVGENCR 435


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 165/388 (42%), Gaps = 70/388 (18%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I++G+P   F   +D GS+L+W+ C+ C QC       Y+  D     YDPS+SS+    
Sbjct: 8   IELGSPPKKFNAIVDTGSDLVWIQCKPCSQC-------YSQSD---PIYDPSASSTFAKT 57

Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           SCS   C+S   S C S    C Y   Y  + +S+ G    + L L S       S    
Sbjct: 58  SCSTSSCQSLPASGCSSSAKTCIYGYQYG-DSSSTQGDFALETLTLRS---SGGSSKAFP 113

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSG 287
           +   GCGR  +GS+  GAA  G++GLG G +S+ + L  A  I N FS C   FD++ S 
Sbjct: 114 NFQFGCGRLNSGSF-GGAA--GIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSSK 168

Query: 288 S--VFFGDQGPATQQ--STSFLPIGEKYDAYFVGVESYCIGNSCLT-------------- 329
           +  + FG          ST  +P   +   YFVG+E   +G   L+              
Sbjct: 169 TSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSK 228

Query: 330 ----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
                      SG   + DSG + T L   +Y++V   F   VS   +    + +  CY+
Sbjct: 229 KKLRVRALEVNSG-GTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYD 287

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF-------TVFCLTVMSTDGDYGI 432
            S  +  K P + L F        +   FS P+   F       TV CL +  +      
Sbjct: 288 VSKSKNFKFPALTLAF--------KGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLG 339

Query: 433 IGQNFM-MGHRIVFDRENLKLAWSHSKC 459
           I  N M   + +V+DR    ++ S ++C
Sbjct: 340 IIGNLMQQNYHVVYDRGTSTISMSPAQC 367


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 109/426 (25%), Positives = 182/426 (42%), Gaps = 63/426 (14%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSF 123
           L+ ++ + Q  ++K+++  +S+  Q +  ++   T   G +   L+Y   +++G  N+S 
Sbjct: 91  LVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTS--GIKLESLNYIVTVELGGKNMSL 148

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
           +V  D GS+L WV     QC P  + Y    ++    YDPS SSS K V C+   C+   
Sbjct: 149 IV--DTGSDLTWV-----QCQPCRSCY----NQQGPLYDPSVSSSYKTVFCNSSTCQDLV 197

Query: 184 SCKS-----------LKDPCPYI-----ADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           +  S           +K PC Y+       Y+  D +S   L+ D   L +F        
Sbjct: 198 AATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENF-------- 248

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
                + GCGR   G +   +   G+       VS+ S   K       FS C    ++ 
Sbjct: 249 -----VFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLPSLEDG 298

Query: 285 DSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFVGVESYCIGNSCLTQSGFQA--LV 337
            SGS+ FG+       ST  S+ P+ +       Y + +    IG   L  S F    L+
Sbjct: 299 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILI 358

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG   T LP  IY  V ++F K  S    +   +    C+N +S E + +P +++IF  
Sbjct: 359 DSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQG 418

Query: 398 NQSFVVR-NHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
           N    V    +F F + +  ++ CL +  +S + + GIIG       R+++D    +L  
Sbjct: 419 NAELEVDVTGVFYFVKPDA-SLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGI 477

Query: 455 SHSKCE 460
               C 
Sbjct: 478 VGENCR 483


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 112/464 (24%), Positives = 191/464 (41%), Gaps = 68/464 (14%)

Query: 4   LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
           +V +C+      L+  D   FS +++HR S  +                 P     E   
Sbjct: 12  IVLLCLYINISFLNALDGGGFSVEIIHRDSSRS-----------------PYYRPTETQF 54

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVS 122
             ++N  +R   R      N+ ++  L+  +  +++    +Q  Y + Y+   +GTP   
Sbjct: 55  QRVANALRRSINRA-----NHFNKPNLVASTNTAESTVIASQGEYLMSYS---VGTPPFQ 106

Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS- 181
            L  +D GS+++W     +QC P    Y    ++    +DPS S + K + CS  +C+S 
Sbjct: 107 ILGIVDTGSDIIW-----LQCQPCEDCY----NQTTPIFDPSQSKTYKTLPCSSNICQSV 157

Query: 182 --RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGR 238
              +SC S  D C Y   Y  +++ S G L  + L L S       SSVQ    +IGCG 
Sbjct: 158 QSAASCSSNNDECEYTITYG-DNSHSQGDLSVETLTLGS----TDGSSVQFPKTVIGCGH 212

Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGD 293
              G++      +G   +GLG   V  +   +  I   FS C        N S  + FGD
Sbjct: 213 NNKGTF----QREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGD 268

Query: 294 QGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGF---------QALVDSGAS 342
           +   + + T   PI  K     YF+ +E++ +G++ +                ++DSG +
Sbjct: 269 EAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIIDSGTT 328

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
            T LP + Y  +       +  +R+       + CY  +S + L VP +   F      V
Sbjct: 329 LTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRTTSSDELNVPVITAHFKGAD--V 386

Query: 403 VRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIG-QNFMMGHRIV 444
             N I +F E +EG   F          +G +  QN ++G+ +V
Sbjct: 387 ELNPISTFIEVDEGVVCFAFRSSKIGPIFGNLAQQNLLVGYDLV 430


>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
 gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
          Length = 297

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 90/188 (47%), Gaps = 12/188 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           L++T I IGTP   + V +D GS++LWV C      P      ++L   L+ YDP  S S
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRK----SNLGIELTMYDPRGSQS 144

Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
            + V+C    C +       SC S   PC Y   Y  + +S++G+ V D L     S   
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTS-TSPCEYSISYG-DGSSTAGFFVTDFLQYNQVSGDG 202

Query: 224 PQSSVQSSVIIGCGRKQTGSYL-DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
             +   +SV  GCG K  G       A DG++G G  + S+ S LA AG ++  F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 283 ENDSGSVF 290
             + G +F
Sbjct: 263 TVNGGGIF 270


>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
          Length = 395

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 82/320 (25%), Positives = 135/320 (42%), Gaps = 37/320 (11%)

Query: 73  QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGS 131
           +  R  L     +  +  +FP        +G+ + + L+Y  + IG P   + + +D GS
Sbjct: 27  RPARGGLSVTAGAEESSAVFP-------LYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGS 79

Query: 132 NLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------R 182
           +L W+ C   C+ C+ +    Y               + +K V C   +C +       R
Sbjct: 80  DLTWLQCDAPCVSCSKVPHPLY-------------RPTKNKLVPCVDQMCAALHGGLTGR 126

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR-KQT 241
             C S K  C Y   Y+ +  SS G LV D   L    + A  S V+  +  GCG  +Q 
Sbjct: 127 HKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL----RLANSSIVRPGLAFGCGYDQQV 181

Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQ 300
           GS  + +A DGV+GLG G VS+ S L + G+ +N    C      G +FFGD   P ++ 
Sbjct: 182 GSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVPYSRA 241

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
           + + +      + Y  G  +   G   L     + + DSG+SFT+   + Y  +V     
Sbjct: 242 TWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALVDAIKG 301

Query: 361 LVSSKRISLQGNSWKYCYNA 380
            +S     +  +S   C+  
Sbjct: 302 DLSKNLKEVPDHSLPLCWKG 321


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 109/426 (25%), Positives = 182/426 (42%), Gaps = 63/426 (14%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSF 123
           L+ ++ + Q  ++K+++  +S+  Q +  ++   T   G +   L+Y   +++G  N+S 
Sbjct: 91  LVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTS--GIKLESLNYIVTVELGGKNMSL 148

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
           +V  D GS+L WV     QC P  + Y    ++    YDPS SSS K V C+   C+   
Sbjct: 149 IV--DTGSDLTWV-----QCQPCRSCY----NQQGPLYDPSVSSSYKTVFCNSSTCQDLV 197

Query: 184 SCKS-----------LKDPCPYI-----ADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           +  S           +K PC Y+       Y+  D +S   L+ D   L +F        
Sbjct: 198 AATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENF-------- 248

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
                + GCGR   G +   +   G+       VS+ S   K       FS C    ++ 
Sbjct: 249 -----VFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLPSLEDG 298

Query: 285 DSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFVGVESYCIGNSCLTQSGFQA--LV 337
            SGS+ FG+       ST  S+ P+ +       Y + +    IG   L  S F    L+
Sbjct: 299 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILI 358

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG   T LP  IY  V ++F K  S    +   +    C+N +S E + +P +++IF  
Sbjct: 359 DSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQG 418

Query: 398 NQSFVVR-NHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
           N    V    +F F + +  ++ CL +  +S + + GIIG       R+++D    +L  
Sbjct: 419 NAELEVDVTGVFYFVKPDA-SLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGI 477

Query: 455 SHSKCE 460
               C 
Sbjct: 478 VGENCR 483


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 95/367 (25%), Positives = 154/367 (41%), Gaps = 36/367 (9%)

Query: 96  GSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD 155
           G ++      F +L Y  +++GTP    L   D GS+L+WV C        S        
Sbjct: 88  GVESKIITRSFEYLMY--VNVGTPPAQMLAIADTGSDLVWVNCS-------SNGGGGGAS 138

Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDD 212
                + PS S++   +SC    C+  S++SC +  + C Y   Y+  D S + G L  +
Sbjct: 139 DGAVVFHPSRSTTYSLLSCQSAACQALSQASCDADSE-CQY--QYAYGDGSRTIGVLSTE 195

Query: 213 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 272
               A+             V  GC     GS+      DG++GLG G +S+ S L  A  
Sbjct: 196 TFSFAAAGGGGEGQVRVPRVSFGCSTGSAGSFRS----DGLVGLGAGALSLVSQLGAAAR 251

Query: 273 IQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLP-IGEKYDAYF-VGVESYCI-G 324
           I   FS C        N S ++ FG +   +    +  P +  + D+Y+ V +ES  + G
Sbjct: 252 IARRFSYCLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAG 311

Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA---S 381
               + +  + +VDSG + TFL   +   +V + ++ +   R        + CY+    S
Sbjct: 312 QDVASANSSRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKS 371

Query: 382 SEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QN 436
             E   +PD+ L F    S  +R  + FS  E EG     L  +S      I+G    QN
Sbjct: 372 QAEDFGIPDVTLRFGGGASVTLRPENTFSLLE-EGTLCLVLVPVSESQPVSILGNIAQQN 430

Query: 437 FMMGHRI 443
           F +G+ +
Sbjct: 431 FHVGYDL 437


>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
          Length = 320

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/261 (27%), Positives = 119/261 (45%), Gaps = 35/261 (13%)

Query: 42  SKSGNVSVADSWPK---KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQ 98
           S +G   V   +P+   +   E+L  L  +D  R     +L    + +   +  P++   
Sbjct: 27  SATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHG---RLLGAVDLALGGVGLPTD--- 80

Query: 99  THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRN 157
                     L+YT I+IG+P   + V +D GS++LWV C +C  C   S      L   
Sbjct: 81  --------TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSG-----LGIE 127

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVD 211
           L++YDP+ S ++  V C    C + S      +C S   PC +   Y  + ++++G+ V 
Sbjct: 128 LTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVT 184

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 269
           D +     S +   ++  +S+  GCG  Q G  L  +  A DG++G G  D S+ S LA 
Sbjct: 185 DFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAA 243

Query: 270 AGLIQNSFSICFDENDSGSVF 290
           A  ++  F+ C D    G +F
Sbjct: 244 ARRVRKIFAHCLDTVRGGGIF 264


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 91/357 (25%), Positives = 154/357 (43%), Gaps = 47/357 (13%)

Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
           V +D GS+L WV CQ C +C       Y   D     ++PS+S S + V CS P C+S  
Sbjct: 148 VIVDTGSDLSWVQCQPCKRC-------YNQQD---PVFNPSTSPSYRTVLCSSPTCQSLQ 197

Query: 184 S-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
           S       C S    C Y+ +Y  + + + G L  + L L +       S+  ++ I GC
Sbjct: 198 SATGNLGVCGSNPPSCNYVVNYG-DGSYTRGELGTEHLDLGN-------STAVNNFIFGC 249

Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGD 293
           GR   G +       G++GLG   +S+ S    + +    FS C    +   SGS+  G 
Sbjct: 250 GRNNQGLF---GGASGLVGLGRSSLSLIS--QTSAMFGGVFSYCLPITETEASGSLVMGG 304

Query: 294 QGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA---LVDSGASFTF 345
                + +     T  +P   +   YF+ +    +G+  +    F     ++DSG   T 
Sbjct: 305 NSSVYKNTTPISYTRMIP-NPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMIDSGTVITR 363

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR- 404
           LP  IY  +  +F K  S    +        C+N S  + +++P++++ F  N    V  
Sbjct: 364 LPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDV 423

Query: 405 NHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             +F F + +   V CL +  +S + + GIIG       R+++D +   L ++   C
Sbjct: 424 TGVFYFVKTDASQV-CLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEAC 479


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 90/366 (24%), Positives = 154/366 (42%), Gaps = 41/366 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP +      D GS+L+W      QC P +  Y     +    +DP SSSS  N++
Sbjct: 64  LSIGTPPIKIYAEADTGSDLVW-----FQCIPCTKCY----KQQNPMFDPRSSSSYTNIT 114

Query: 174 CSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C    C    S  C + +  C Y   Y+ +++ + G L  + L L S +    +      
Sbjct: 115 CGTESCNKLDSSLCSTDQKTCNYTYSYA-DNSITQGVLAQETLTLTSTTG---EPVAFQG 170

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA-GLIQNSFSICF-----DEND 285
           +I GCG   +G + D     G++GLG G +S+ S +  + G   N FS C      D + 
Sbjct: 171 IIFGCGHNNSG-FNDREM--GLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSI 227

Query: 286 SGSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVE------SYCIGNSCLTQSGFQA 335
           +  + FG         T   P+    G  Y A  +G+        +  G+S  T +    
Sbjct: 228 TSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGSSLGTITKGNI 287

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           L+DSG + T+LP E Y  ++ +    V+ +   + G  ++ CY   +   L  P + + F
Sbjct: 288 LIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDG--YELCYQTPTN--LNGPTLTIHF 343

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
                 +    +F   +++    FC  V  T+ +Y   G      + I FD E   +++ 
Sbjct: 344 EGGDVLLTPAQMFIPVQDDN---FCFAVFDTNEEYVTYGNYAQSNYLIGFDLERQVVSFK 400

Query: 456 HSKCEE 461
            + C +
Sbjct: 401 ATDCTK 406


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 108/415 (26%), Positives = 174/415 (41%), Gaps = 47/415 (11%)

Query: 59  VEYLELLLSNDWKRQKTRVKLQSNNN---SSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
           V++ E++  +  + +    KL  N+    S       P++   T   GN     +   I 
Sbjct: 83  VDHDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGITLGSGN-----YIVTIG 137

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IGTP     +  D GS+L W      QC P   S Y+   +   +++PSSSS+ +NVSCS
Sbjct: 138 IGTPKHDLSLVFDTGSDLTWT-----QCEPCLGSCYS---QKEPKFNPSSSSTYQNVSCS 189

Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
            P+C+   SC +    C Y   Y  + + + G+L  +   L         S V   V  G
Sbjct: 190 SPMCEDAESCSASN--CVYSIGYG-DKSFTQGFLAKEKFTLT-------NSDVLEDVYFG 239

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSIC---FDENDSGSVFF 291
           CG    G +      DGV GL        SL A+     N+ FS C   F  N +G + F
Sbjct: 240 CGENNQGLF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTF 293

Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVE--SYCIGNS--CLTQSGFQ---ALVDSGASFT 344
           G  G    +S  F PI     A+  G++     +G+    +T + F    A++DSG  FT
Sbjct: 294 GSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFT 351

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
            LPT++YAE+   F + +SS + +     +  CY+ +  + +  P +   F+      + 
Sbjct: 352 RLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVELD 411

Query: 405 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               S P     +  CL     D    I G        +V+D    ++ ++ + C
Sbjct: 412 GSGISLPIK--ISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
          Length = 413

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 93/375 (24%), Positives = 154/375 (41%), Gaps = 50/375 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           I+IG P   + + LD GS+L W+ C   C++C              L    P    SS  
Sbjct: 52  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 97

Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV D+     FS +  Q 
Sbjct: 98  IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDV-----FSMNYTQG 150

Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
             +   + +GCG  Q          DGV+GLG G VS+ S L   G ++N    C     
Sbjct: 151 LRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLG 210

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSCLTQSGFQALVDSGAS 342
            G +FFGD    + +  S+ P+  +Y  ++   +G E    G           + DSG+S
Sbjct: 211 GGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGRTTGLKNLLTVFDSGSS 268

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS----SEEMLKVPDMRLIFS 396
           +T+  ++ Y  V     + +S K +  +   ++   C+       S E +K     L  S
Sbjct: 269 YTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALS 328

Query: 397 KNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIVFDR 447
               +  +  +F  P      +      CL +++       +  +IG   M    I++D 
Sbjct: 329 FKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDN 387

Query: 448 ENLKLAWSHSKCEEV 462
           E   + W    C+E+
Sbjct: 388 EKQSIGWMPVDCDEL 402


>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 93/375 (24%), Positives = 154/375 (41%), Gaps = 50/375 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           I+IG P   + + LD GS+L W+ C   C++C              L    P    SS  
Sbjct: 64  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 109

Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV D+     FS +  Q 
Sbjct: 110 IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDV-----FSMNYTQG 162

Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
             +   + +GCG  Q          DGV+GLG G VS+ S L   G ++N    C     
Sbjct: 163 LRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLG 222

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSCLTQSGFQALVDSGAS 342
            G +FFGD    + +  S+ P+  +Y  ++   +G E    G           + DSG+S
Sbjct: 223 GGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGRTTGLKNLLTVFDSGSS 280

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS----SEEMLKVPDMRLIFS 396
           +T+  ++ Y  V     + +S K +  +   ++   C+       S E +K     L  S
Sbjct: 281 YTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALS 340

Query: 397 KNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIVFDR 447
               +  +  +F  P      +      CL +++       +  +IG   M    I++D 
Sbjct: 341 FKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDN 399

Query: 448 ENLKLAWSHSKCEEV 462
           E   + W    C+E+
Sbjct: 400 EKQSIGWMPVDCDEL 414


>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 432

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 102/415 (24%), Positives = 165/415 (39%), Gaps = 51/415 (12%)

Query: 71  KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
           K    +VKLQ+   SS   ++FP  G+  +  G      +Y  ++IG P   F + +D G
Sbjct: 36  KDSSAQVKLQNRRLSS--TVVFPVSGN-VYPLG-----YYYVLLNIGNPPKLFDLDIDTG 87

Query: 131 SNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RS 183
           S+L WV C   C  C    A           +Y P+ ++    + CSH LC         
Sbjct: 88  SDLTWVQCDAPCNGCTKPRA----------KQYKPNHNT----LPCSHILCSGLDLPQDR 133

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQTG 242
            C   +D C Y   YS +  SS G LV D + L    K A  S +   +  GCG  +Q  
Sbjct: 134 PCADPEDQCDYEIGYS-DHASSIGALVTDEVPL----KLANGSIMNLRLTFGCGYDQQNP 188

Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQS 301
                    G++GLG G V + + L   G+ +N    C      G +  GD+  P++  +
Sbjct: 189 GPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVT 248

Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
            + L        Y  G       +      G   + DSG+S+T+   E Y  ++    K 
Sbjct: 249 WTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKD 308

Query: 362 VSSKRI--SLQGNSWKYCYNASS--------EEMLKVPDMRLIFSKN-QSFVVRNHIFSF 410
           ++ K +  +    S   C+            ++  K   +R    KN Q F V    +  
Sbjct: 309 LNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLI 368

Query: 411 PENEGFTVFCL---TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
              +G     +   T +  +G Y IIG     G  +++D E  ++ W  S C+++
Sbjct: 369 ITEKGRVCLGILNGTEIGLEG-YNIIGDISFQGIMVIYDNEKQRIGWISSDCDKL 422


>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 105/412 (25%), Positives = 176/412 (42%), Gaps = 55/412 (13%)

Query: 78  KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
           KL ++ ++  +  +FP  G     + N  Y+ H   I +G+P   + + +D GS+L W+ 
Sbjct: 288 KLATSVSAFDSSTIFPVRGD---VYPNGLYFTH---IFVGSPPRRYFLDMDTGSDLTWIQ 341

Query: 138 CQ--CIQCAPLSASYYTSLDRNLSEY-DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
           C   C  CA      Y     NL    D       +N+   +  C+   +C+     C Y
Sbjct: 342 CDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGY--CE---TCEQ----CDY 392

Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGV 253
             +Y+ + +SS G L  D LHL      A  S  +  ++ GC   Q G  L+  A  DG+
Sbjct: 393 EIEYA-DHSSSMGVLASDDLHLM----LANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGI 447

Query: 254 MGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPI---- 307
           +GL    VS+PS LA   +I N    C   D    G +F GD         +++P+    
Sbjct: 448 LGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDF-VPYWGMAWVPMLNSH 506

Query: 308 GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
              Y +  + +       S   Q G   + + D+G+S+T+ P E Y  +V    K VS +
Sbjct: 507 SPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDVSDE 565

Query: 366 RISLQGN--SWKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENEGFT 417
            +   G+  +   C+ A    +  V D++  F       +++ ++V    F  P  EG+ 
Sbjct: 566 GLIQDGSDPTLPVCWRAKF-PIRSVIDVKQFFQPLTLQFRSKWWIVSTK-FRIPP-EGYL 622

Query: 418 VF------CLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +       CL ++      DG   I+G   + G  +V+D  N K+ W+ S C
Sbjct: 623 IISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTC 674


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 115/445 (25%), Positives = 176/445 (39%), Gaps = 95/445 (21%)

Query: 70  WKRQKTRVKLQSNNNSSRNQLLFPS-----EGSQTH------------FFGNQFYWLHYT 112
           +++Q+ R KL  N+N + N    P      EG  +H              G+  Y++ + 
Sbjct: 11  FRKQRGRHKLSDNDNGAHNSANPPVITAVIEGPPSHDHDFQSPVVSGSTLGSGQYFVDFF 70

Query: 113 WIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
              +GTP   F + +D+GS+LLWV C  C+QC            ++   Y PS+SS+   
Sbjct: 71  ---LGTPPQKFSLIVDSGSDLLWVQCAPCLQC----------YAQDTPLYAPSNSSTFNP 117

Query: 172 VSCSHPLCKSRSSCKSLKDPCPY------IADYSTEDTS-SSGYL------VDDILHLAS 218
           V C  P C    + +    PC +        +Y   DTS S G        VDD+     
Sbjct: 118 VPCLSPECLLIPATEGF--PCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDVR---- 171

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
                        V  GCGR   GS+   AA  GV+GLG G +S  S +  A    N F+
Sbjct: 172 ----------IDKVAFGCGRDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFA 216

Query: 279 ICF-DENDSGSV----FFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQ 330
            C  +  D  SV     FGD+  +T     F PI         Y+V +E   +G   L  
Sbjct: 217 YCLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPI 276

Query: 331 S----------GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYN 379
           S             ++ DSG + T+     Y  ++  FDK V   R  S+QG     C +
Sbjct: 277 SHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQG--LDLCVD 334

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVM---STDGDYGIIG 434
            +  +    P   ++      F  +  N+      N    V CL +    S+ G +  IG
Sbjct: 335 VTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAPN----VQCLAMAGLPSSVGGFNTIG 390

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKC 459
                   + +DRE  ++ ++ +KC
Sbjct: 391 NLLQQNFLVQYDREENRIGFAPAKC 415


>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 101/392 (25%), Positives = 168/392 (42%), Gaps = 64/392 (16%)

Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN +   +Y+  I+IG  + +F   +D+GS+L WV C   C  C       Y   +  L+
Sbjct: 47  GNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALN 106

Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD--ILHLA 217
            ++P  +S        HP+  +   CKS  D C Y  +Y+ +  SS G LV+D   L L 
Sbjct: 107 CFEPLCTSL-------HPI--TNHHCKSADDQCQYEIEYA-DHGSSLGVLVNDHVPLKLT 156

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGLIQNS 276
           + S  AP+      +  GCG     S  D + P  GV+GLG G+VS  S L+  G+++N 
Sbjct: 157 NGSLAAPR------IAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNV 210

Query: 277 FSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKY-----DAYFVGVESYCIGNSC 327
              C  + + G +FFGD+       T  S S   IG  Y     + YF G  +   G   
Sbjct: 211 VGHCLSD-EGGFLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKAT---GIKD 266

Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS--- 382
           LT      + DSG+S+T+  ++ Y  ++      +  K +  + +  S   C+  +    
Sbjct: 267 LT-----LVFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFK 321

Query: 383 -----EEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTD----GD 429
                ++      +R   +KN    +       PEN      +   C  +++      GD
Sbjct: 322 SLRDVKKYFNPLALRFTKTKNAQIQLP------PENYLIITKYGNVCFGILNGTEVGLGD 375

Query: 430 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
             IIG   +    +++D E  ++ W  + C +
Sbjct: 376 LNIIGDISLKDKMVIYDNERRRIGWFPTNCNK 407


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/379 (25%), Positives = 161/379 (42%), Gaps = 48/379 (12%)

Query: 103 GNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSE 160
           G +   L+Y   ++IG  N++ +V  D GS+L WV CQ C  C       Y   D     
Sbjct: 59  GVRLQTLNYIVTVEIGGRNMTVIV--DTGSDLTWVQCQPCRLC-------YNQQD---PL 106

Query: 161 YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 213
           ++PS S S + + C+   C+S          C S    C Y+ +Y  + + + G L  + 
Sbjct: 107 FNPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYG-DGSYTRGDLGMEQ 165

Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
           L+L   + H       S+ I GCGR   G +       G+MGLG  D+S+ S    + + 
Sbjct: 166 LNLG--TTHV------SNFIFGCGRNNKGLF---GGASGLMGLGKSDLSLVS--QTSAIF 212

Query: 274 QNSFSICFDE---NDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN 325
           +  FS C      + SGS+  G      + +     T  +   +    YF+ +    IG 
Sbjct: 213 EGVFSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGG 272

Query: 326 SCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
             L    ++    L+DSG   T LP  +Y ++  +F K  S    +   +    C+N + 
Sbjct: 273 VALQAPNYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNG 332

Query: 383 EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMG 440
            + + +P +R+ F  N    V    IF F + +   V   L  +S D +  IIG      
Sbjct: 333 YDEVDIPTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRN 392

Query: 441 HRIVFDRENLKLAWSHSKC 459
            R++++ +  KL ++   C
Sbjct: 393 QRVIYNTKESKLGFAAEAC 411


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 158/376 (42%), Gaps = 45/376 (11%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
           GN  Y +  T   +G+P  SF V +D GS+L WV     QC P    Y     +   ++D
Sbjct: 35  GNGEYLMTLT---LGSPPQSFDVIVDTGSDLNWV-----QCLPCRVCY----QQPGPKFD 82

Query: 163 PSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
           PS S S +  +C+  LC   +    +C +  + C Y   Y  +  ++     + I    S
Sbjct: 83  PSKSRSFRKAACTDNLCNVSALPLKACAA--NVCQYQYTYGDQSNTNGDLAFETI----S 136

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
            +  A   SV  +   GCG +  G++   A   G++GLG G +S+ S L+      N FS
Sbjct: 137 LNNGAGTQSV-PNFAFGCGTQNLGTF---AGAAGLVGLGQGPLSLNSQLSHT--FANKFS 190

Query: 279 ICFDENDSGS---VFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLT----- 329
            C    +S S   + FG    A   Q TS +        Y+V + S  +G   L      
Sbjct: 191 YCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSV 250

Query: 330 ----QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
               QS  +   ++DSG + T L    Y+ V+  ++  V+  R+         C+N +  
Sbjct: 251 FAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGV 310

Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
               VPDM   F +   F +R        +   T  CL +  + G + IIG      H +
Sbjct: 311 SNPSVPDMVFKF-QGADFQMRGENLFVLVDTSATTLCLAMGGSQG-FSIIGNIQQQNHLV 368

Query: 444 VFDRENLKLAWSHSKC 459
           V+D E  K+ ++ + C
Sbjct: 369 VYDLEAKKIGFATADC 384


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 97/389 (24%), Positives = 161/389 (41%), Gaps = 62/389 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +  GTP    L+  D GS+L+W+ C      P          R    +  S S++   V 
Sbjct: 58  MAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATLSVVP 115

Query: 174 CSHPLC--------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           CS   C           S   +   PC Y  DY+ + +S++G+L  D    A+ S     
Sbjct: 116 CSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYA-DGSSTTGFLARDT---ATISNGTSG 171

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDEN 284
            +    V  GCG +  G    G    GV+GLG G +S P   A++G L   +FS C  + 
Sbjct: 172 GAAVRGVAFGCGTRNQGGSFSGTG--GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDL 226

Query: 285 DSGS-------VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQ 334
           + G        +F G   P  + + ++ P+     A   Y+VGV +  +GN  L   G +
Sbjct: 227 EGGRRGRSSSFLFLGR--PERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSE 284

Query: 335 ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-----LQGNSWKYCYN 379
                      ++DSG++ T+L    Y  +V  F   V   RI       QG   + CYN
Sbjct: 285 WAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG--LELCYN 342

Query: 380 ASSEEMLK-----VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYG- 431
            SS   L       P + + F++  S  +   N++    ++    V CL +  T   +  
Sbjct: 343 VSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADD----VKCLAIRPTLSPFAF 398

Query: 432 -IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            ++G     G+ + FDR + ++ ++ ++C
Sbjct: 399 NVLGNLMQQGYHVEFDRASARIGFARTEC 427


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/379 (25%), Positives = 155/379 (40%), Gaps = 52/379 (13%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP   F + LD GS+L W+ C  CI C   S  YY          DP  SSS +N+SC
Sbjct: 201 VGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYY----------DPKDSSSFRNISC 250

Query: 175 SHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSS 227
             P C+  SS      CK+    CPY   Y     ++  + ++   ++L + +  +    
Sbjct: 251 HDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKH 310

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
           V+ +V+ GCG    G +   A   G+    L   S         L   SFS C  D N +
Sbjct: 311 VE-NVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLYGQSFSYCLVDRNSN 364

Query: 287 GSV----FFG-DQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL-------- 328
            SV     FG D+   +  + +F   G   D      Y+V + S  + +  L        
Sbjct: 365 ASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWH 424

Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
             ++     ++DSG + T+     Y  +   F + +    +       K CYN S  E +
Sbjct: 425 LSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKM 484

Query: 387 KVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRI 443
           ++PD  ++F+     +F V N+      +    V CL ++        IIG        I
Sbjct: 485 ELPDFGILFADGAVWNFPVENYFIQIDPD----VVCLAILGNPRSALSIIGNYQQQNFHI 540

Query: 444 VFDRENLKLAWSHSKCEEV 462
           ++D +  +L ++  KC +V
Sbjct: 541 LYDMKKSRLGYAPMKCADV 559


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 97/380 (25%), Positives = 160/380 (42%), Gaps = 54/380 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP   F + LD GS+L W+ C  CI C   S  YY          DP  SSS +N+SC
Sbjct: 203 VGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYY----------DPKDSSSFRNISC 252

Query: 175 SHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSS 227
             P C+  S+      CK+    CPY   Y     ++  + ++   ++L + +  +    
Sbjct: 253 HDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKH 312

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
           V+ +V+ GCG    G +   A   G+    L   S         L   SFS C  D N +
Sbjct: 313 VE-NVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLYGQSFSYCLVDRNSN 366

Query: 287 GSV----FFG-DQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL-------- 328
            SV     FG D+   +  + +F   G   D      Y+V ++S  + +  L        
Sbjct: 367 ASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWH 426

Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
             ++     ++DSG + T+     Y  +   F + +   ++       K CYN S  E +
Sbjct: 427 LSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKM 486

Query: 387 KVPDMRLIFSKNQ--SFVVRNH-IFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHR 442
           ++PD  ++F+     +F V N+ I+  PE     V CL ++        IIG        
Sbjct: 487 ELPDFGILFADEAVWNFPVENYFIWIDPE-----VVCLAILGNPRSALSIIGNYQQQNFH 541

Query: 443 IVFDRENLKLAWSHSKCEEV 462
           I++D +  +L ++  KC +V
Sbjct: 542 ILYDMKKSRLGYAPMKCADV 561


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 108/415 (26%), Positives = 175/415 (42%), Gaps = 47/415 (11%)

Query: 59  VEYLELLLSNDWKRQKTRVKLQSNNN---SSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
           V++ E++  +  + +    KL  N+    S       P++   T   GN     +   I 
Sbjct: 83  VDHDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGITLGSGN-----YIVTIG 137

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IGTP     +  D GS+L W      QC P   S Y+   +   +++PSSSS+ +NVSCS
Sbjct: 138 IGTPKHDLSLVFDTGSDLTWT-----QCEPCLGSCYS---QKEPKFNPSSSSTYQNVSCS 189

Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
            P+C+   SC +    C Y   Y  + + + G+L  +   L         S V   V  G
Sbjct: 190 SPMCEDAESCSA--SNCVYSIVYG-DKSFTQGFLAKEKFTLT-------NSDVLEDVYFG 239

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSIC---FDENDSGSVFF 291
           CG    G +      DGV GL        SL A+     N+ FS C   F  N +G + F
Sbjct: 240 CGENNQGLF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTF 293

Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVE--SYCIGNS--CLTQSGFQ---ALVDSGASFT 344
           G  G    +S  F PI     A+  G++     +G+    +T + F    A++DSG  FT
Sbjct: 294 GSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFT 351

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
            LPT++YAE+   F + +SS + +     +  CY+ +  + +  P +   F+ +    + 
Sbjct: 352 RLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELD 411

Query: 405 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               S P     +  CL     D    I G        +V+D    ++ ++ + C
Sbjct: 412 GSGISLPIK--ISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 96/367 (26%), Positives = 150/367 (40%), Gaps = 57/367 (15%)

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +IGTP    LVALD  ++  WVPC  C+ CA            +   +DPS SSSS+N+ 
Sbjct: 96  NIGTPAQPMLVALDTSNDAAWVPCSGCVGCA------------SSVLFDPSKSSSSRNLQ 143

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C  P CK     +C + K  C +   Y      +S  L  D L LA        + V  S
Sbjct: 144 CDAPQCKQAPNPTCTAGKS-CGFNMTYGGSTIEAS--LTQDTLTLA--------NDVIKS 192

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
              GC  K TG+ L      G+MGLG G +S+ S      L  ++FS C       N SG
Sbjct: 193 YTFGCISKATGTSLPA---QGLMGLGRGPLSLIS--QTQNLYMSTFSYCLPNSKSSNFSG 247

Query: 288 SVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
           S+  G +  P   ++T  L    +   Y+V +    +GN  +            +G   +
Sbjct: 248 SLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTI 307

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
            DSG  FT L    Y  V  +F + + +   +  G  +  CY+ S    +  P +  +F+
Sbjct: 308 FDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLG-GFDTCYSGS----VVYPSVTFMFA 362

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDRENLKL 452
                +  +++     +   +  CL + +   +      +I       HR++ D  N +L
Sbjct: 363 GMNVTLPPDNLLI--HSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRL 420

Query: 453 AWSHSKC 459
             S   C
Sbjct: 421 GISRETC 427


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 160/363 (44%), Gaps = 64/363 (17%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IGTP       +D G++ +W   QC  C P        L++    + PS SS+ K + C+
Sbjct: 96  IGTPPFQLYSLIDTGNDNIWF--QCKPCKP-------CLNQTSPMFHPSKSSTYKTIPCT 146

Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
            P+CK+                      +   YL  D L L S +   P S    +++IG
Sbjct: 147 SPICKN----------------------ADGHYLGVDTLTLNS-NNGTPISF--KNIVIG 181

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVF 290
           CG +  G  L+G    G +GL  G +S  S L  +  I   FS C       EN S  + 
Sbjct: 182 CGHRNQGP-LEGYV-SGNIGLARGPLSFISQLNSS--IGGKFSYCLVPLFSKENVSSKLH 237

Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----TQSGFQALVDSGASFTFL 346
           FGD+   +   T   PI E+ + YFV +E++ +G+  +    + +   +++DSG + T L
Sbjct: 238 FGDKSTVSGLGTVSTPIKEE-NGYFVSLEAFSVGDHIIKLENSDNRGNSIIDSGTTMTIL 296

Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML-KVPDMRLIFSKNQSFVVRN 405
           P ++Y+ +      +V  KR+      +  CY  +S  +L KV  +   FS ++  +   
Sbjct: 297 PKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHFSGSEVHLNAL 356

Query: 406 HIFSFPENEGFTVFCLTVMSTDGDY-------GIIGQNFMMGHRIVFDRENLKLAWSHSK 458
           + F +P  +   V C   +S  G++        ++ QNF++G    FD     +++  + 
Sbjct: 357 NTF-YPITD--EVICFAFVS-GGNFSSLAIFGNVVQQNFLVG----FDLNKKTISFKPTD 408

Query: 459 CEE 461
           C +
Sbjct: 409 CTK 411


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 91/379 (24%), Positives = 156/379 (41%), Gaps = 52/379 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + +GTP   F + +D GS+L W+ C  C+ C           D+    +DP +S+S +NV
Sbjct: 154 VYVGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FDQRGPVFDPMASTSYRNV 203

Query: 173 SCSHPLC------KSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           +C    C       +  +C+S + DPCPY   Y  +  ++     D  L   + +  A  
Sbjct: 204 TCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTG----DLALEAFTVNLTASS 259

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
           S     V++GCG +  G +   A   G+    L   S   L A  G   ++FS C  ++ 
Sbjct: 260 SRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HAFSYCLVDHG 314

Query: 286 SG---SVFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL---------- 328
           S     + FGD            T+F P   +   Y+V ++   +G   L          
Sbjct: 315 SAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVS 374

Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEML 386
                   ++DSG + ++ P   Y  +   F D++  +  +         CYN S  E +
Sbjct: 375 KEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERV 434

Query: 387 KVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRI 443
           +VP+  L+F+      F   N+     + EG  + CL V+ T      IIG        +
Sbjct: 435 EVPEFSLLFADGAVWDFPAENYFIRL-DTEG--IMCLAVLGTPRSAMSIIGNYQQQNFHV 491

Query: 444 VFDRENLKLAWSHSKCEEV 462
           ++D  + +L ++  +C EV
Sbjct: 492 LYDLHHNRLGFAPRRCAEV 510


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 100/403 (24%), Positives = 160/403 (39%), Gaps = 47/403 (11%)

Query: 63  ELLLSNDWKRQKT---RVKLQ---SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDI 116
           E +L+ D  R K+   RV      S     RN+   P+        GN     +   I +
Sbjct: 113 EEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSLPASSGSALGTGN-----YVVTIGL 167

Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
           GTP   + V  D GS+  WV     QC P     Y   ++    +DP+ SS+  N+SC+ 
Sbjct: 168 GTPAGRYTVVFDTGSDTTWV-----QCEPCVVVCYKQQEK---LFDPARSSTYANISCAA 219

Query: 177 PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
           P C            C Y   Y  + + S G+   D L L+S+               GC
Sbjct: 220 PACSDLYIKGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAIKGFRFGC 271

Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
           G +  G Y + A   G++GLG G  S+P     K G +   F+ CF    SG+ +  D G
Sbjct: 272 GERNEGLYGEAA---GLLGLGRGKTSLPVQAYDKYGGV---FAHCFPARSSGTGYL-DFG 324

Query: 296 PA-----TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGASFTF 345
           P      + + T+ + +      Y+VG+    +G   L+  QS F     +VDSG   T 
Sbjct: 325 PGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVITR 384

Query: 346 LPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           LP   Y+ +   F   ++ +  + +   +    CY+ +    + +P + L+F    S  V
Sbjct: 385 LPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDV 444

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
                 +  +             D D GI+G   +    +V+D
Sbjct: 445 HASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYD 487


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 86/371 (23%), Positives = 153/371 (41%), Gaps = 32/371 (8%)

Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
           G     F V +D GS++LWV C      P S    + L   L+ +D   SS++  + CS 
Sbjct: 75  GXXXXXFNVQIDTGSDILWVNCNTCSNCPQS----SQLGIELNFFDTVGSSTAALIPCSD 130

Query: 177 PLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
            +C S      + C    + C Y   Y  + + +SGY V D ++        P  +  ++
Sbjct: 131 LICTSGVQGAAAECSPRVNQCSYTFQYG-DGSGTSGYYVSDAMYFNLIMGQPPAVNSTAT 189

Query: 232 VIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGS 288
           ++ GC   Q+G       A DG+ G G G +SV S L+  G+    FS C   D N  G 
Sbjct: 190 IVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGI 249

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGFQALVDS 339
           +  G+     + S  + P+      Y + ++S  +    L         + +    +VD 
Sbjct: 250 LVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDC 306

Query: 340 GASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           G +  +L  E Y  +V   +  V  S+++ + +GN    CY  S+      P + L F  
Sbjct: 307 GTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ---CYLVSTSIGDIFPLVSLNFEG 363

Query: 398 NQSFVVRNHIFSFPEN--EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
             S V++   +       +G  ++C+          I+G   +    +V+D    ++ W+
Sbjct: 364 GASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGASILGDLVLKDKIVVYDIAQQRIGWA 423

Query: 456 HSKCEEVIDKS 466
           +  C   ++ S
Sbjct: 424 NYDCSLSVNVS 434


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 101/416 (24%), Positives = 171/416 (41%), Gaps = 51/416 (12%)

Query: 60  EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
           E L  +      R    +  Q  +   R+     + G+    F    Y +H   +  GTP
Sbjct: 41  ELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVH---LAAGTP 97

Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
                + LD GS++ W   QC +C P SA +    ++ L  +DPS+SSS  ++ CS P C
Sbjct: 98  PQEVQLTLDTGSDITWT--QCKRC-PASACF----NQTLPLFDPSASSSFASLPCSSPAC 150

Query: 180 KSRSSCKSLKD----PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
           ++   C    D    PC Y   Y  + + S G +  ++   AS +     ++V   ++ G
Sbjct: 151 ETTPPCGGGNDATSRPCNYSISYG-DGSVSRGEIGREVFTFASGTGEGSSAAV-PGLVFG 208

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFG 292
           CG    G +       G+ G G G +S+PS L K G    +FS CF     + + +V  G
Sbjct: 209 CGHANRGVFTSNET--GIAGFGRGSLSLPSQL-KVG----NFSHCFTTITGSKTSAVLLG 261

Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYA 352
             G A   ++   P+G +  +Y            C +        +SG S T LP   Y 
Sbjct: 262 LPGVAPPSAS---PLGRRRGSY-----------RCRSTPRSS---NSGTSITSLPPRTYR 304

Query: 353 EVVVKFDKLVSSKRISLQGNSWKYCYNAS-SEEMLKVPDMRLIF-SKNQSFVVRNHIFSF 410
            V  +F   V    +         C++A        VP M L F          N++F  
Sbjct: 305 AVREEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFEGATMRLPQENYVFEV 364

Query: 411 PENEGF----TVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            +++       + CL V+  +G   I+G        +++D +N KL++  ++C+++
Sbjct: 365 VDDDDAGNSSRIICLAVI--EGGEIILGNIQQQNMHVLYDLQNSKLSFVPAQCDQL 418


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 94/377 (24%), Positives = 159/377 (42%), Gaps = 49/377 (12%)

Query: 110 HYTWIDIGTPNVSFLVAL-DAGSNLLWVPCQ-CIQCAPLSASYYTS---LDRNLSEYDPS 164
           +Y  I +G P V FL A+ D GS++LW  C+ C  C+        S   +   ++ YDP 
Sbjct: 88  YYAQIGVGHP-VQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIMQGPITLYDPE 146

Query: 165 SSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHA 223
            S ++   +CS PLC    SC+   + C Y  D S EDTSSS G    D++HL       
Sbjct: 147 LSITASPATCSDPLCSEGGSCRGNNNSCAY--DISYEDTSSSTGIYFRDVVHLGH----- 199

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD- 282
            ++S+ +++ +GC    +G +      DG+MG G   VSVP+ LA      N F  C   
Sbjct: 200 -KASLNTTMFLGCATSISGLW----PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSG 254

Query: 283 ENDSGSVFF---GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------- 328
           E + G +      D+ P       + P+      Y V + S  + +  L           
Sbjct: 255 EKEGGGILVLGKNDEFP----EMVYTPMLANDIVYNVKLVSLSVNSKALPIEASEFEYNA 310

Query: 329 TQSGFQALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
           T      ++DSG S    P++   ++ + V KF   + +  +   G+      +  +   
Sbjct: 311 TVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIPTAPLESSGSPCFISISDRNSVE 370

Query: 386 LKVPDMRLIFSKNQSFVVRNHIF-------SFPENEGFTVFCLTVMS-TDGDYGIIGQNF 437
           +  P++ L F    +  +  H +          E+  F    L  +S + G+  I+G   
Sbjct: 371 VDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVCISWSVGNSTILGDAI 430

Query: 438 MMGHRIVFDRENLKLAW 454
           +    +V+D E  ++ W
Sbjct: 431 LKDKVVVYDMEKSRIGW 447


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score = 89.7 bits (221), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 89/367 (24%), Positives = 159/367 (43%), Gaps = 46/367 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  + IG+P     + +D+GS+++WV C+ C++C       Y   D     +DP++S++
Sbjct: 127 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLEC-------YAQAD---PLFDPATSAT 176

Query: 169 SKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
              V C   +C++ R+S       C Y   Y  + + + G L  + L L          +
Sbjct: 177 FSAVPCGSAVCRTLRTSGCGDSGGCDYEVSYG-DGSYTKGALALETLTLG--------GT 227

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
               V IGCG +  G ++  A   G++GLG G +S+   L  A     +FS C     +G
Sbjct: 228 AVEGVAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGAG 282

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC---------LTQSGFQA 335
           S+  G +  A  +   ++P+     A   Y+VG+    +G+           LT+ G   
Sbjct: 283 SLVLG-RSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGG 341

Query: 336 LV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
           +V D+G + T LP E YA +   F   V +   +   +    CY+ S    ++VP +   
Sbjct: 342 VVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFY 401

Query: 395 FSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
           F    +  +  RN +    E +G  ++CL    +     I+G     G +I  D  N  +
Sbjct: 402 FDGAATLTLPARNLLL---EVDG-GIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYI 457

Query: 453 AWSHSKC 459
            +  + C
Sbjct: 458 GFGPTTC 464


>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 418

 Score = 89.7 bits (221), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 91/379 (24%), Positives = 155/379 (40%), Gaps = 58/379 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I+IG P   + + +D GS+L WV C      P +     +L ++   Y P+ +   + V 
Sbjct: 66  INIGNPPNPYELDIDTGSDLTWVQCD----GPDAPCKGCTLPKD-KLYKPNGN---QLVK 117

Query: 174 CSHPLCKS--------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           CS P+C +           C     PC Y  +Y+ ++  S+G L  D +H+ S     P 
Sbjct: 118 CSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEYA-DNAESTGALARDYMHIGS-----PS 171

Query: 226 SSVQSSVIIGCGRKQT-GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
            S    V+ GCG +Q         +  GV+GLG G +S+ S L   G I N    C    
Sbjct: 172 GSNVPLVVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLSAE 231

Query: 285 DSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 335
             G +F GD+          P  Q S       EK+  Y  G              G Q 
Sbjct: 232 GGGYLFLGDKFIPSSGIFWTPIIQSSL------EKH--YSTGPVDLFFNGKPTPAKGLQI 283

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYC--YNASSEEMLK 387
           + DSG+S+T+    +Y  V    +  +  K +  +         WK    + + +E    
Sbjct: 284 IFDSGSSYTYFSPRVYTIVANMVNNDLKGKPLRRETKDPSLPICWKGVKPFKSLNEVNNY 343

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRI 443
              + L F+K+     +N  F  P  + F   CL +++ +    G+  ++G   +    +
Sbjct: 344 FKPLTLSFTKS-----KNLQFQLPPVK-FGNVCLGILNGNEAGLGNRNVVGDISLQDKVV 397

Query: 444 VFDRENLKLAWSHSKCEEV 462
           V+D E  ++ W+ + C+++
Sbjct: 398 VYDNEKQQIGWASANCKQI 416


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 95/397 (23%), Positives = 162/397 (40%), Gaps = 62/397 (15%)

Query: 97  SQTHFFGNQFYWLHYTWIDIGTPNVS-FLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSL 154
           S +H  G   Y +H+    IGTP      + +D GS+++W  C+ C  C           
Sbjct: 82  SGSHVVGYTEYLIHF---GIGTPRPQQVALEVDTGSDVVWTQCRPCFDC----------F 128

Query: 155 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
            + L  +D S+S +   V C+ P+C++          C Y  +Y  +++ + G L  D  
Sbjct: 129 TQPLPRFDTSASDTVHGVLCTDPICRALRPHACFLGGCTYQVNYG-DNSVTIGQLAKDSF 187

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
              +F            ++ GCG+  TG++       G+ G G G +S+P  L  +    
Sbjct: 188 ---TFDGKGGGKVTVPDLVFGCGQYNTGNFHSNET--GIAGFGRGPLSLPRQLGVS---- 238

Query: 275 NSFSICFD---ENDSGSVFFGD----------QGPATQQSTSFLPIGEKYDAYFVGVESY 321
            SFS CF    E+ S  VF G            GP    ST FLP   +Y  Y++ ++  
Sbjct: 239 -SFSYCFTTIFESKSTPVFLGGAPADGLRAHATGPIL--STPFLPNHPEY--YYLSLKGI 293

Query: 322 CIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ- 370
            +G + L   +S F          ++DSG + T  P  ++  +   F   V     S   
Sbjct: 294 TVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYND 353

Query: 371 -GNSWKYCYNASS---EEMLKVPDMRL-IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 425
            G     C++  S      + VP M L +   +      N++  +P+++     C+ V++
Sbjct: 354 TGEPTLQCFSTESVPDASKVPVPKMTLHLEGADWELPRENYMAEYPDSD---QLCVVVLA 410

Query: 426 TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            D D  +IG        IV D    KL    ++C+++
Sbjct: 411 GDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPAQCDKM 447


>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 105/412 (25%), Positives = 176/412 (42%), Gaps = 55/412 (13%)

Query: 78  KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
           KL ++ ++  +  +FP  G     + N  Y+ H   I +G+P   + + +D GS+L W+ 
Sbjct: 75  KLATSVSAFDSSTIFPVRGD---VYPNGLYFTH---IFVGSPPRRYFLDMDTGSDLTWIQ 128

Query: 138 CQ--CIQCAPLSASYYTSLDRNLSEY-DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
           C   C  CA      Y     NL    D       +N+   +  C+   +C+     C Y
Sbjct: 129 CDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGY--CE---TCEQ----CDY 179

Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGV 253
             +Y+ + +SS G L  D LHL      A  S  +  ++ GC   Q G  L+  A  DG+
Sbjct: 180 EIEYA-DHSSSMGVLASDDLHLM----LANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGI 234

Query: 254 MGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPI---- 307
           +GL    VS+PS LA   +I N    C   D    G +F GD         +++P+    
Sbjct: 235 LGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDF-VPYWGMAWVPMLNSH 293

Query: 308 GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
              Y +  + +       S   Q G   + + D+G+S+T+ P E Y  +V    K VS +
Sbjct: 294 SPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDVSDE 352

Query: 366 RISLQGN--SWKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENEGFT 417
            +   G+  +   C+ A    +  V D++  F       +++ ++V    F  P  EG+ 
Sbjct: 353 GLIQDGSDPTLPVCWRAKF-PIRSVIDVKQFFQPLTLQFRSKWWIVSTK-FRIPP-EGYL 409

Query: 418 VF------CLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +       CL ++      DG   I+G   + G  +V+D  N K+ W+ S C
Sbjct: 410 IISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTC 461


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 100/381 (26%), Positives = 159/381 (41%), Gaps = 55/381 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP    L+ LD GS+++WV     QCAP    Y    +++   +DP  SSS 
Sbjct: 129 YFTKIGVGTPATQALMVLDTGSDVVWV-----QCAPCRRCY----EQSGPVFDPRRSSSY 179

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             V C   LC+   S  C   +  C Y   Y  + + ++G  V + L  A  ++ A    
Sbjct: 180 GAVGCGAALCRRLDSGGCDLRRGACMYQVAYG-DGSVTAGDFVTETLTFAGGARVA---- 234

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
               V +GCG    G ++  A   G+     G +S P+ +++      SFS C  D   S
Sbjct: 235 ---RVALGCGHDNEGLFVAAAGLLGLG---RGGLSFPTQISR--RYGRSFSYCLVDRTSS 286

Query: 287 G-----------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLT 329
           G           +V FG  G     S SF P+         Y+V +    +G +    + 
Sbjct: 287 GAGAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVA 345

Query: 330 QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-RISLQGNS-WKYCY 378
           +S  +          +VDSG S T L    Y+ +   F    +   R+S  G S +  CY
Sbjct: 346 ESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCY 405

Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 438
           +     ++KVP + + F+      +    +  P +   T FC     TDG   IIG    
Sbjct: 406 DLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGGVSIIGNIQQ 464

Query: 439 MGHRIVFDRENLKLAWSHSKC 459
            G R+VFD +  ++ ++   C
Sbjct: 465 QGFRVVFDGDGQRVGFAPKGC 485


>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
          Length = 427

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 101/413 (24%), Positives = 166/413 (40%), Gaps = 52/413 (12%)

Query: 71  KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
           K    +VKLQ+   SS   ++FP  G+  +  G      +Y  ++IG P   F + +D G
Sbjct: 36  KDSSAQVKLQNRRLSS--TVVFPVSGN-VYPLG-----YYYVLLNIGNPPKLFDLDIDTG 87

Query: 131 SNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RSSC 185
           S+L WV C     AP +           ++Y P+ ++    + CSH LC          C
Sbjct: 88  SDLTWVQCD----APCNGC---------TKYKPNHNT----LPCSHILCSGLDLPQDRPC 130

Query: 186 KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQTGSY 244
              +D C Y   YS +  SS G LV D + L    K A  S +   +  GCG  +Q    
Sbjct: 131 ADPEDQCDYEIGYS-DHASSIGALVTDEVPL----KLANGSIMNLRLTFGCGYDQQNPGP 185

Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQSTS 303
                  G++GLG G V + + L   G+ +N    C      G +  GD+  P++  + +
Sbjct: 186 HPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWT 245

Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
            L        Y  G       +      G   + DSG+S+T+   E Y  ++    K ++
Sbjct: 246 SLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLN 305

Query: 364 SKRI--SLQGNSWKYCYNASS--------EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPE 412
            K +  +    S   C+            ++  K   +R    KN Q F V    +    
Sbjct: 306 GKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIIT 365

Query: 413 NEGFTVFCL---TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            +G     +   T +  +G Y IIG     G  +++D E  ++ W  S C+++
Sbjct: 366 EKGRVCLGILNGTEIGLEG-YNIIGDISFQGIMVIYDNEKQRIGWISSDCDKL 417


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 87/360 (24%), Positives = 148/360 (41%), Gaps = 35/360 (9%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +  GTP  ++ +  D GS++ W     IQC P S   Y   D     +DP+ S++   V 
Sbjct: 124 VGFGTPAQTYTLMFDTGSDVSW-----IQCLPCSGHCYKQHD---PIFDPTKSATYSAVP 175

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C HP C +     S    C Y   Y  + +S++G L  + L L S       +       
Sbjct: 176 CGHPQCAAAGGKCSSNGTCLYKVQYG-DGSSTAGVLSHETLSLTS-------ARALPGFA 227

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GCG    G + D    DG++GLG G +S+ S  A +     S+ +       G +  G 
Sbjct: 228 FGCGETNLGDFGDV---DGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGT 284

Query: 294 QGPAT-QQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-------TQSGFQALVDSGAS 342
             PA+      +  + +K D    YFV + S  +G   L       T+ G   L+DSG  
Sbjct: 285 TTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDG--TLLDSGTV 342

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
            T+LP E Y  +  +F   ++  + +   + +  CY+ + +  + +P +   FS   SF 
Sbjct: 343 LTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFD 402

Query: 403 VRNH-IFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +    +  FP++      CL  +       + I+G        +++D    K+ +    C
Sbjct: 403 LSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462


>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 435

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 96/417 (23%), Positives = 169/417 (40%), Gaps = 45/417 (10%)

Query: 71  KRQKTRVK-LQSNNNSSRNQLLFPSEGSQTHF--FGNQF-YWLHYTWIDIGTPNVSFLVA 126
           KR+  R   L     SSR  L+  + GS   F  +GN +    +   ++IG P   + + 
Sbjct: 31  KRKSGRNSILPGEAMSSRPSLMNHAAGSSIVFPIYGNVYPVGFYNVTLNIGQPPRPYFLD 90

Query: 127 LDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
           +D GS L W+ C   C QC+      Y                S+  + C  PLC S   
Sbjct: 91  VDTGSELTWLQCDAPCSQCSETPHPLY--------------KPSNDFIPCKDPLCASLQP 136

Query: 185 CK--SLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
               + +DP  C Y   Y+ +  S+ G L++D+ +L +F+       ++  + +GCG  Q
Sbjct: 137 TDDYTCEDPNQCDYEIKYA-DQYSTLGVLLNDV-YLLNFTNGV---QLKVRMALGCGYDQ 191

Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
             S       DG++GLG G  S+ S L   GL++N    C      G +FFG+   +++ 
Sbjct: 192 IFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGGYIFFGNVYDSSRM 251

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
           S + +   +    Y  G      G           + D+G+S+T+  ++ Y  ++   +K
Sbjct: 252 SWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTYFNSQAYQAMISLLNK 311

Query: 361 LVSSKRISLQGN--SWKYCYNASSEEMLKVPDMRLIFSK-NQSFVVRNHI---FSFPENE 414
            +  K I    +  +   C++        + +++  F     SF     +   F  P   
Sbjct: 312 ELHRKPIKAAPDDQTLPMCWHG-KRPFRSINEVKKYFKPLTLSFTNGGRVKPQFEIPPEA 370

Query: 415 GFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
              +      CL +++      G+  +IG   M+   +VFD E   + W  + C  V
Sbjct: 371 YLIISNMGNVCLGILNGPEVGLGELNLIGDISMLDKVMVFDNEKQLIGWGPADCNSV 427


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 87/361 (24%), Positives = 146/361 (40%), Gaps = 40/361 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP   + V  D GS+  WV     QC P     Y   ++    +DP+ SS+  N+S
Sbjct: 190 IGLGTPAGRYTVVFDTGSDTTWV-----QCEPCVVVCYEQQEK---LFDPARSSTDANIS 241

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C    +       C Y   Y  + + S G+   D L L+S+              
Sbjct: 242 CAAPACSDLYTKGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAIKGFR 293

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFFG 292
            GCG +  G + + A   G++GLG G  S+P     K G +   F+ CF    SG+ +  
Sbjct: 294 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQAYDKYGGV---FAHCFPARSSGTGYL- 346

Query: 293 DQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSG 340
           D GP +  +     T+ + +      Y+VG+    +G        S  T +G   +VDSG
Sbjct: 347 DFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAG--TIVDSG 404

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
              T LP   Y+ +   F   ++++  + +   +    CY+ +    + +P + L+F   
Sbjct: 405 TVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGG 464

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
            S  V      +  +             D D GI+G   +    +V+D     + +S   
Sbjct: 465 ASLDVDASGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGA 524

Query: 459 C 459
           C
Sbjct: 525 C 525


>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 158/387 (40%), Gaps = 53/387 (13%)

Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN +   +YT  + IG P   + + +D GS+L WV C   C  C         ++ RN  
Sbjct: 56  GNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGC---------TIPRN-R 105

Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
            Y P+ +     V C  PLCK+  S     C    + C Y  +Y+ +  SS G L+ D +
Sbjct: 106 LYKPNGNL----VKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYA-DQGSSLGVLLRDNI 160

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
            L    K    S  +  +  GCG  Q    +   A+  GV+GLG G  S+ S L   GLI
Sbjct: 161 PL----KFTNGSLARPILAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLI 216

Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQS 331
           +N    C  E   G +FFGDQ    Q    + P+ +      Y  G           +  
Sbjct: 217 RNVVGHCLSERGGGFLFFGDQ-LVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVK 275

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNASS------E 383
           G Q + DSG+S+T+  ++ +  +V      +  K +S   + +S   C+          +
Sbjct: 276 GLQLIFDSGSSYTYFNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLHD 335

Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIG 434
                  + L F+K+     +N +   P      V      CL ++       G+  IIG
Sbjct: 336 VTSNFKPLLLSFTKS-----KNSLLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIG 390

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKCEE 461
              +    +++D E  ++ W+ + C+ 
Sbjct: 391 DISLQDKLVIYDNEKQQIGWASANCDR 417


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score = 88.6 bits (218), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 94/364 (25%), Positives = 149/364 (40%), Gaps = 41/364 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  + IG P+ +F + +D GS++ W+ C+ C  C       Y  +D     +DP+SSSS
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDC-------YQQVD---PIFDPASSSS 209

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
              + C  P C++        D C Y   Y         Y V D    A+ +     S  
Sbjct: 210 FSRLGCQTPQCRNLDVFACRNDSCLYQVSYG-----DGSYTVGD---FATETVSFGNSGS 261

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DEND 285
              V IGCG    G ++  A   G+ G  L      SL ++  +  +SFS C    D  D
Sbjct: 262 VDKVAIGCGHDNEGLFVGAAGLIGLGGGPL------SLTSQ--IKASSFSYCLVNRDSVD 313

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA-------- 335
           S ++ F    P+   +       +    Y+VG+    +G   L    S F+         
Sbjct: 314 SSTLEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGI 373

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           +VD G + T L T+ Y  +   F KL      +     +  CYN SS   ++VP +  +F
Sbjct: 374 IVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLF 433

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
              +S  +    +  P +   T FCL    T     IIG     G R+ +D  N ++++S
Sbjct: 434 DGGKSLPLPPSNYLIPVDSAGT-FCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFS 492

Query: 456 HSKC 459
             KC
Sbjct: 493 SRKC 496


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 90/383 (23%), Positives = 159/383 (41%), Gaps = 57/383 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP+   ++ +D GS+L+W     +QC+P    Y     +    +DP  SS+ 
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVW-----LQCSPCRRCY----AQRGQVFDPRRSSTY 136

Query: 170 KNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           + V CS P C++       S  +    C Y+  Y  + +SS+G L  D L  A+      
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYG-DGSSSTGDLATDKLAFAN------ 189

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
             +  ++V +GCGR   G + D AA  G++G+G G +S+ + +A A    + F  C  + 
Sbjct: 190 -DTYVNNVTLGCGRDNEGLF-DSAA--GLLGVGRGKISISTQVAPA--YGSVFEYCLGDR 243

Query: 285 DSGS------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ---- 334
            S S      VF     P +   T+ L    +   Y+V +  + +G   +T  GF     
Sbjct: 244 TSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVT--GFSNASL 301

Query: 335 ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKYCYNAS 381
                      +VDSG + +    + YA +   FD    +  +     + + +  CY+  
Sbjct: 302 ALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLR 361

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF-----CLTVMSTDGDYGIIGQN 436
                  P + L F+      +    +  P + G         CL   + D    +IG  
Sbjct: 362 GRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNV 421

Query: 437 FMMGHRIVFDRENLKLAWSHSKC 459
              G R+VFD E  ++ ++   C
Sbjct: 422 QQQGFRVVFDVEKERIGFAPKGC 444


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 162/376 (43%), Gaps = 52/376 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP V F+   D GS+L W  CQ C  C P          ++   YDPS+SS+   V
Sbjct: 81  LAIGTPPVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPV 130

Query: 173 SCSH----PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            CS     P+ +SR +C +    C Y   YS +   S+G L  + L L S     P  +V
Sbjct: 131 PCSSATCLPVLRSR-NCSTPSSLCRYGYSYS-DGAYSAGILGTETLTLGS---SVPGQAV 185

Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDE 283
             S V  GCG    G  L+     G +GLG G +   SLLA+ G+    FS C    F+ 
Sbjct: 186 SVSDVAFGCGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNS 237

Query: 284 NDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------- 328
                   G       GP   QST  L        Y V ++   +G+  L          
Sbjct: 238 TLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLH 297

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWKYCYNASSEEMLK 387
             S    +VDSG +F+ LP   +  VV    +++    ++    +S  +   A   ++  
Sbjct: 298 ANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSPCFPAPAGERQLPF 357

Query: 388 VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           +PD+ L F+      + R++  S+  N+  + FCL ++ T   + ++G       +++FD
Sbjct: 358 MPDLVLHFAGGADMRLHRDNYMSY--NQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFD 415

Query: 447 RENLKLAWSHSKCEEV 462
               +L++  + C ++
Sbjct: 416 MTVGQLSFLPTDCSKL 431


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 111/462 (24%), Positives = 175/462 (37%), Gaps = 90/462 (19%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           VS    W   N +  L L  ++  K  KT   L           LFP             
Sbjct: 38  VSSKKPWGSLNHLASLSLSRAHHIKSPKTNFSLIKTP-------LFPRS----------- 79

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYD 162
           Y  +   ++ GTP  +    +D GS+L+W PC     C +C     ++       +  + 
Sbjct: 80  YGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSEC-----NFPNIKKTGIPTFL 134

Query: 163 PSSSSSSKNVSCSHPLC--------KSR-SSCKSLKDPC-----PYIADYSTEDTSSSGY 208
           P  SSSSK + C +P C        +S+   C S    C     PY+  Y +   S++G 
Sbjct: 135 PKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSG--STAGL 192

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
           L+ + L         P        ++GC      S      P+G+ G G    S+PS L 
Sbjct: 193 LLSETLDF-------PNKKTIPDFLVGC------SIFSIKQPEGIAGFGRSPESLPSQLG 239

Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQG-------PATQQSTSFL--PIGEKYDAYFVGVE 319
                    S  FD+  + S    D G        A    T FL  P     D Y+V + 
Sbjct: 240 LKKFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLR 299

Query: 320 SYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 369
           +  IG++ +          T      +VDSG +FTF+   +Y  V  +F+K ++   ++ 
Sbjct: 300 NIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVAT 359

Query: 370 QGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMS 425
           +  +    + CYN S E+ L VPD+   F       +  ++ FS  ++    V CLT++S
Sbjct: 360 EIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDSG---VICLTIVS 416

Query: 426 TDGDYG--------IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            +            I+G        + FD EN K  +    C
Sbjct: 417 DNVAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 103/378 (27%), Positives = 162/378 (42%), Gaps = 49/378 (12%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IGTP   + + LD GS+L W+  QC+ C       +   ++N   YDP  SSS +N+ C 
Sbjct: 96  IGTPPKHYSLILDTGSDLNWI--QCVPC-------HDCFEQNGPYYDPKESSSFRNIGCH 146

Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
            P C   SS      CK+    CPY   Y     ++  +  +   ++L S +  +    V
Sbjct: 147 DPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRV 206

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DE 283
           + +V+ GCG    G +  GA+  G++GLG G +S  S L    L  +SFS C      D 
Sbjct: 207 E-NVMFGCGHWNRGLF-HGAS--GLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDT 260

Query: 284 NDSGSVFFG-DQGPATQQSTSFLP-IGEKYDA----YFVGVESYCIGNSCL--------- 328
           N S  + FG D+        +F   +G K +     Y+V ++S  +G   L         
Sbjct: 261 NVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNM 320

Query: 329 TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
           T  G    +VDSG + ++     Y  +   F K V    I         CYN S  E + 
Sbjct: 321 TSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDPCYNVSGVEKID 380

Query: 388 VPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIV 444
           +PD  ++F+     +F V N+       E   V CL ++ T      IIG        ++
Sbjct: 381 LPDFGILFADGAVWNFPVENYFIRLDPEE---VVCLAILGTPRSALSIIGNYQQQNFHVL 437

Query: 445 FDRENLKLAWSHSKCEEV 462
           +D +  +L ++   C +V
Sbjct: 438 YDTKKSRLGYAPMNCADV 455


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/341 (25%), Positives = 140/341 (41%), Gaps = 36/341 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I  G+P     + +D GS+L W      QC P S  Y   +     +Y P++S + ++  
Sbjct: 62  IHFGSPQKKQFLHMDTGSSLTWT-----QCFPCSDCYAQKI---YPKYRPAASITYRDAM 113

Query: 174 C--SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C  SHP      +   L   C Y   Y  ++T+  G L  +++   +   H         
Sbjct: 114 CEDSHPKSNPHFAFDPLTRICTYQQHY-LDETNIKGTLAQEMI---TVDTHDGGFKRVHG 169

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSG 287
           V  GC     GSY  G    G++GLG+G  S+       G   + FS C  E      S 
Sbjct: 170 VYFGCNTLSDGSYFTGT---GILGLGVGKYSI------IGEFGSKFSFCLGEISEPKASH 220

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
           ++  GD G   Q   + + I E +  +   +ES  +G         Q  VD+G++ + L 
Sbjct: 221 NLILGD-GANVQGHPTVINITEGHTIF--QLESIIVGEEITLDDPVQVFVDTGSTLSHLS 277

Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 407
           T +Y + V  FD L+ S+ +S +      CY A + E L+  D+   F       V  H 
Sbjct: 278 TNLYYKFVDAFDDLIGSRPLSYEPT---LCYKADTIERLEKMDVGFKFDVGAELSVNIHN 334

Query: 408 FSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFD 446
             F +     + CL + +    +   IIG   M G+ + +D
Sbjct: 335 I-FIQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYD 374


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/365 (25%), Positives = 150/365 (41%), Gaps = 53/365 (14%)

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +IGTP    LVALD  ++  W+PC  C+ C   S+S           +DPS SSSS+ + 
Sbjct: 93  NIGTPAQPMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQ 140

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C  P CK     SC ++   C +   Y    ++   YL  D L LA        S V  +
Sbjct: 141 CEAPQCKQAPNPSC-TVSKSCGFNMTYG--GSTIEAYLTQDTLTLA--------SDVIPN 189

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
              GC  K +G+ L      G+MGLG G +S+ S      L Q++FS C       N SG
Sbjct: 190 YTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSG 244

Query: 288 SVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
           S+  G +  P   ++T  L    +   Y+V +    +GN  +            +G   +
Sbjct: 245 SLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTI 304

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
            DSG  +T L    Y  V  +F + V +   +  G  +  CY+ S    +  P +  +F+
Sbjct: 305 FDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG-FDTCYSGS----VVFPSVTFMFA 359

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
                +  +++         +   +    ++ +    +I       HR++ D  N +L  
Sbjct: 360 GMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGI 419

Query: 455 SHSKC 459
           S   C
Sbjct: 420 SRETC 424


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/365 (25%), Positives = 150/365 (41%), Gaps = 53/365 (14%)

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +IGTP    LVALD  ++  W+PC  C+ C   S+S           +DPS SSSS+ + 
Sbjct: 93  NIGTPAQPMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQ 140

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C  P CK     SC ++   C +   Y    ++   YL  D L LA        S V  +
Sbjct: 141 CEAPQCKQAPNPSC-TVSKSCGFNMTYG--GSTIEAYLTQDTLTLA--------SDVIPN 189

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
              GC  K +G+ L      G+MGLG G +S+ S      L Q++FS C       N SG
Sbjct: 190 YTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSG 244

Query: 288 SVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
           S+  G +  P   ++T  L    +   Y+V +    +GN  +            +G   +
Sbjct: 245 SLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTI 304

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
            DSG  +T L    Y  V  +F + V +   +  G  +  CY+ S    +  P +  +F+
Sbjct: 305 FDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG-FDTCYSGS----VVFPSVTFMFA 359

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
                +  +++         +   +    ++ +    +I       HR++ D  N +L  
Sbjct: 360 GMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGI 419

Query: 455 SHSKC 459
           S   C
Sbjct: 420 SRETC 424


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 161/378 (42%), Gaps = 48/378 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +YT I IG P   + + +D GS+L W+ C   C   A      Y      +         
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKIV-------- 238

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             +++ C   L  +++ C++ K  C Y  +Y+ + +SS G L  D +H+ + +       
Sbjct: 239 PPRDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHMIATNG----GR 291

Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
            +   + GC   Q G  L   A  DG++GL    +S PS LA  G+I N F  C   ++ 
Sbjct: 292 EKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQG 351

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLTQ-----SGFQALVD 338
             G +F GD     +   ++  I    D  Y         G+  L +     S  Q + D
Sbjct: 352 GGGYMFLGDDY-VPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410

Query: 339 SGASFTFLPTEIYAEVVVK-------FDKLVSSKRISLQGNSWKYCYNASSEEMLK--VP 389
           SG+S+T+LP EIY  +V         F +  S + + L    WK  +     E +K    
Sbjct: 411 SGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPL---CWKADFPVRYLEDVKQFFE 467

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGH 441
            + L F K   F+ +    S PE+          CL +++ T+ ++G   I+G   + G 
Sbjct: 468 PLNLHFGKKWLFMSKTFTIS-PEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGK 526

Query: 442 RIVFDRENLKLAWSHSKC 459
            +V+D +  ++ W+ S C
Sbjct: 527 LVVYDNQRKQIGWADSDC 544


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 160/367 (43%), Gaps = 56/367 (15%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +G P V  LV +D GS+LLWV C+ C  C   S             +DPS SS+  ++S 
Sbjct: 65  VGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPI----------FDPSKSSTYVDLSY 114

Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
             P+C +    K +  + C Y A Y+   TSS     +DI+    F      +   SSV+
Sbjct: 115 DSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----FETSDQGTVTVSSVV 170

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDENDSGSV 289
            GCG    G + DG    G++GL  GD S+ S L       + FS C    FD + + + 
Sbjct: 171 FGCGHSNRGRF-DGQQS-GILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQ 222

Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGFQALV-DS 339
                G   + S++  P       Y+V +E   +G + L         T+SG   +V DS
Sbjct: 223 LVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDS 280

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIFS 396
           G + TFL  + +  +  +  +LV    +++  +      CY     E L+  P++   F+
Sbjct: 281 GTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFA 340

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNF------MMGHRIVF 445
           +    V+  +     +N+   VFCL V+ ++        GI+ Q        ++G R+ F
Sbjct: 341 EGADLVLDANSLFVQKNQ--DVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYF 398

Query: 446 DRENLKL 452
            R + +L
Sbjct: 399 QRTDCEL 405


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/355 (24%), Positives = 142/355 (40%), Gaps = 31/355 (8%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P   + Y   ++    +DP+SSS+  NVS
Sbjct: 183 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVACYEQREK---LFDPASSSTYANVS 234

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C            C Y   Y  + + S G+   D L L+S+              
Sbjct: 235 CAAPACSDLDVSGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 286

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--F 291
            GCG +  G + + A   G++GLG G  S+P  +   G     F+ C     +G+ +  F
Sbjct: 287 FGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDF 341

Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA---LVDSGASFTFL 346
           G   P    +T  L  G     Y+VG+    +G   L    S F A   +VDSG   T L
Sbjct: 342 GAGSPPATTTTPML-TGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRL 400

Query: 347 PTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
           P   Y+ +   F   ++++  R +   +    CY+ +    + +P + L+F    +  V 
Sbjct: 401 PPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVD 460

Query: 405 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                +  +              GD GI+G   +    + +D     + +S   C
Sbjct: 461 ASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 160/367 (43%), Gaps = 56/367 (15%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +G P V  LV +D GS+LLWV C+ C  C   S             +DPS SS+  ++S 
Sbjct: 97  VGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPI----------FDPSKSSTYVDLSY 146

Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
             P+C +    K +  + C Y A Y+   TSS     +DI+    F      +   SSV+
Sbjct: 147 DSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----FETSDQGTVTVSSVV 202

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDENDSGSV 289
            GCG    G + DG    G++GL  GD S+ S L       + FS C    FD + + + 
Sbjct: 203 FGCGHSNRGRF-DGQQS-GILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQ 254

Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGFQALV-DS 339
                G   + S++  P       Y+V +E   +G + L         T+SG   +V DS
Sbjct: 255 LVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDS 312

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIFS 396
           G + TFL  + +  +  +  +LV    +++  +      CY     E L+  P++   F+
Sbjct: 313 GTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFA 372

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNF------MMGHRIVF 445
           +    V+  +     +N+   VFCL V+ ++        GI+ Q        ++G R+ F
Sbjct: 373 EGADLVLDANSLFVQKNQ--DVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYF 430

Query: 446 DRENLKL 452
            R + +L
Sbjct: 431 QRTDCEL 437


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 145/360 (40%), Gaps = 38/360 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP   + V  D GS+  WV     QC P     Y   ++    +DP+ SS+  NVS
Sbjct: 186 IGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYKQQEK---LFDPARSSTYANVS 237

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C    +       C Y   Y  + + S G+   D L L+S+              
Sbjct: 238 CAAPACSDLYTRGCSGGHCLYSVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 289

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
            GCG +  G + + A   G++GLG G  S+P     K G +   F+ C     SG+ +  
Sbjct: 290 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSSGTGYLD 343

Query: 291 FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGA 341
           FG   PA   +    P+    G  +  Y+VG+    +G   L+  QS F     +VDSG 
Sbjct: 344 FGPGSPAAVGARQTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQSVFSTAGTIVDSGT 401

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
             T LP   Y+ +   F   ++++  + +   +    CY+ +    + +P + L+F    
Sbjct: 402 VITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGA 461

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              V      +  +             D D GI+G   +    +V+D     + +S   C
Sbjct: 462 YLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 92/365 (25%), Positives = 151/365 (41%), Gaps = 53/365 (14%)

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +IGTP  + LVALD  ++  W+PC  C+ C   S+S           +DPS SSSS+ + 
Sbjct: 93  NIGTPAQAMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQ 140

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C  P CK     SC ++   C +   Y    ++   YL  D L LA        + V  +
Sbjct: 141 CEAPQCKQAPNPSC-TVSKSCGFNMTYG--GSAIEAYLTQDTLTLA--------TDVIPN 189

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
              GC  K +G+ L      G+MGLG G +S+ S      L Q++FS C       N SG
Sbjct: 190 YTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSG 244

Query: 288 SVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
           S+  G +  P   ++T  L    +   Y+V +    +GN  +            +G   +
Sbjct: 245 SLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTI 304

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
            DSG  +T L    Y  +  +F + V +   +  G  +  CY+ S    +  P +  +F+
Sbjct: 305 FDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSLGG-FDTCYSGS----VVFPSVTFMFA 359

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFDRENLKLAW 454
                +  +++         +   +    T+ +    +I       HR++ D  N +L  
Sbjct: 360 GMNVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGI 419

Query: 455 SHSKC 459
           S   C
Sbjct: 420 SRETC 424


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 100/389 (25%), Positives = 153/389 (39%), Gaps = 62/389 (15%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSS 166
           L+Y  + IG P   + + +D GS+L W+ C   C  CA      Y          DP  +
Sbjct: 30  LYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLY----------DPKRA 79

Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
              + V C  P C       + +C      C Y  DY  + +S+ G LV+D + L   + 
Sbjct: 80  ---RVVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDY-VDGSSTMGILVEDTITLVLTNG 135

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
               +  Q+  +IGCG  Q G+     A  DGV+GL    +S+PS LA  G+  N    C
Sbjct: 136 ----TRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHC 191

Query: 281 F--DENDSGSVFFGDQ-GPATQQSTSFL---PIGEKYDAYFVGVESYCIGNSCLTQSGFQ 334
                N  G +FFGD   PA   + + +   P+ E Y A    ++    G   L   G  
Sbjct: 192 LAGGSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIK---YGGEVLELEGTT 248

Query: 335 -----ALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
                A+ DSG SFT+L    Y  V   VV+  +    +RI     +  +C+   S    
Sbjct: 249 DDVGGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTD-TTLPFCWRGPS-PFE 306

Query: 387 KVPDMRLIFSK------NQSFVVRNHIFSFPENEGFTV------FCLTVMSTDGD----Y 430
            V D+   F          ++     +      EG+ +       CL V+          
Sbjct: 307 SVADVSAYFKTVTLDFGGSTWWSSGKLLEL-SPEGYLIVSTQGNVCLGVLDASVASLEVT 365

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            I+G   M G+ +V+D    ++ W    C
Sbjct: 366 NILGDISMRGYLVVYDNMREQIGWVRRNC 394


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 160/367 (43%), Gaps = 56/367 (15%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +G P V  LV +D GS+LLWV C+ C  C   S             +DPS SS+  ++S 
Sbjct: 65  VGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPI----------FDPSKSSTYVDLSY 114

Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
             P+C +    K +  + C Y A Y+   TSS     +DI+    F      +   SSV+
Sbjct: 115 DSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----FETSDQGTVTVSSVV 170

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDENDSGSV 289
            GCG    G + DG    G++GL  GD S+ S L       + FS C    FD + + + 
Sbjct: 171 FGCGHSNRGRF-DGQQS-GILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQ 222

Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGFQALV-DS 339
                G   + S++  P       Y+V +E   +G + L         T+SG   +V DS
Sbjct: 223 LVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDS 280

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIFS 396
           G + TFL  + +  +  +  +LV    +++  +      CY     E L+  P++   F+
Sbjct: 281 GTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFA 340

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNF------MMGHRIVF 445
           +    V+  +     +N+   VFCL V+ ++        GI+ Q        ++G R+ F
Sbjct: 341 EGADLVLDANSLFVQKNQ--DVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYF 398

Query: 446 DRENLKL 452
            R + +L
Sbjct: 399 QRTDCEL 405


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/355 (24%), Positives = 142/355 (40%), Gaps = 31/355 (8%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P   + Y   ++    +DP+SSS+  NVS
Sbjct: 184 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVACYEQREK---LFDPASSSTYANVS 235

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C            C Y   Y  + + S G+   D L L+S+              
Sbjct: 236 CAAPACSDLDVSGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 287

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--F 291
            GCG +  G + + A   G++GLG G  S+P  +   G     F+ C     +G+ +  F
Sbjct: 288 FGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPPRSTGTGYLDF 342

Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA---LVDSGASFTFL 346
           G   P    +T  L  G     Y+VG+    +G   L    S F A   +VDSG   T L
Sbjct: 343 GAGSPPATTTTPML-TGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRL 401

Query: 347 PTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
           P   Y+ +   F   ++++  R +   +    CY+ +    + +P + L+F    +  V 
Sbjct: 402 PPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVD 461

Query: 405 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                +  +              GD GI+G   +    + +D     + +S   C
Sbjct: 462 ASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/355 (24%), Positives = 142/355 (40%), Gaps = 31/355 (8%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P   + Y   ++    +DP+SSS+  NVS
Sbjct: 187 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVACYEQREK---LFDPASSSTYANVS 238

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C            C Y   Y  + + S G+   D L L+S+              
Sbjct: 239 CAAPACSDLDVSGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 290

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--F 291
            GCG +  G + + A   G++GLG G  S+P  +   G     F+ C     +G+ +  F
Sbjct: 291 FGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDF 345

Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA---LVDSGASFTFL 346
           G   P    +T  L  G     Y+VG+    +G   L    S F A   +VDSG   T L
Sbjct: 346 GAGSPPATTTTPML-TGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRL 404

Query: 347 PTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
           P   Y+ +   F   ++++  R +   +    CY+ +    + +P + L+F    +  V 
Sbjct: 405 PPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVD 464

Query: 405 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                +  +              GD GI+G   +    + +D     + +S   C
Sbjct: 465 ASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 100/425 (23%), Positives = 176/425 (41%), Gaps = 63/425 (14%)

Query: 56  KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFF-GNQFYWLHYTWI 114
           KN  +Y   L+    KR + R++       S N +L  S G +T  + G+  Y ++   +
Sbjct: 53  KNLTKYE--LIKRAIKRGERRMR-------SINAMLQSSSGIETPVYAGDGEYLMN---V 100

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
            IGTP+ SF   +D GS+L+W  C+ C QC            +    ++P  SSS   + 
Sbjct: 101 AIGTPDSSFSAIMDTGSDLIWTQCEPCTQC----------FSQPTPIFNPQDSSSFSTLP 150

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C    C+   S     + C Y   Y  + +++ GY+  +            ++S   ++ 
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYG-DGSTTQGYMATETFTF--------ETSSVPNIA 201

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VF 290
            GCG    G      A  G++G+G G +S+PS L         FS C     S S   + 
Sbjct: 202 FGCGEDNQGFGQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYCMTSYGSSSPSTLA 254

Query: 291 FGDQG---PATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSGFQ--------ALV 337
            G      P    ST+ +        Y++ ++   +G  N  +  S FQ         ++
Sbjct: 255 LGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMII 314

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFS 396
           DSG + T+LP + Y  V   F   ++   +    +    C+   S+   ++VP++ + F 
Sbjct: 315 DSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFD 374

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAW 454
                +   +I   P  EG  V CL  M +    G  I G       ++++D +NL +++
Sbjct: 375 GGVLNLGEQNILISPA-EG--VICL-AMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSF 430

Query: 455 SHSKC 459
             ++C
Sbjct: 431 VPTQC 435


>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 535

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/414 (22%), Positives = 174/414 (42%), Gaps = 50/414 (12%)

Query: 80  QSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
           +  ++  +N  LFP         GN F   L+YT I +G+P   + + +D GS+  WV C
Sbjct: 134 RGGDDWPQNSTLFPHS-----LAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQC 188

Query: 139 QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 198
               CA  +   +         Y P+ ++ +  +  S PLC+   +     + C Y   Y
Sbjct: 189 DAPPCASCAKGAHPL-------YRPARTADA--LPASDPLCE--GAQHENPNQCDYEISY 237

Query: 199 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLG 257
           +   +S   Y+ D +  +    +        + ++ GCG  Q G  L+     DGV+GL 
Sbjct: 238 ADGSSSMGVYVRDSMQFVGEDGERE-----NADIVFGCGYDQQGVLLNALETTDGVLGLT 292

Query: 258 LGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPI--GEKYD 312
              +S+P+ LA  G+I N+F  C   + SG+   +F GD     +   +++PI  G   D
Sbjct: 293 NKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDY-IPRWGMTWVPIRDGPADD 351

Query: 313 AYFVGVESYCIGNSCLTQSG--FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
                V+    G+  L   G   Q + D+G+++T+ P E    ++    +  S + +   
Sbjct: 352 VRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDD 411

Query: 371 GN-SWKYCYNASSEEMLKVPDMRLIFSK-----------NQSFVVRNHIFSFPENEGFTV 418
            + +  +C   S   +  V D++  F             +++F +R   +    ++G   
Sbjct: 412 SDKTLPFCMK-SDFPVRSVEDVKHFFKPLSLQFEKRFFFSRTFNIRPEHYLVISDKGNV- 469

Query: 419 FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHV 468
            CL V++ T   Y    I+G   + G  + +D +  ++ W    C     +S +
Sbjct: 470 -CLGVLNGTTIGYDSVVIVGDVSLRGKLVAYDNDKNEVGWVDFDCTNPRKRSRI 522


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/360 (26%), Positives = 147/360 (40%), Gaps = 40/360 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP     V  D GS+L WV     QC P S  Y    ++    +DP+ SS+   V 
Sbjct: 150 MGLGTPARDMTVVFDTGSDLSWV-----QCTPCSDCY----EQKDPLFDPARSSTYSAVP 200

Query: 174 CSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           C+ P C+   SRS  +  K  C Y   Y  + + + G L  D L L        QS V  
Sbjct: 201 CASPECQGLDSRSCSRDKK--CRYEVVYG-DQSQTDGALARDTLTLT-------QSDVLP 250

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
             + GCG + TG  L G A DG++GLG   VS+ S  A        FS C   + S + +
Sbjct: 251 GFVFGCGEQDTG--LFGRA-DGLVGLGREKVSLSSQAASK--YGAGFSYCLPSSPSAAGY 305

Query: 291 FGDQGPATQQSTSFLPIGEKYDA---YF-----VGVESYCIGNSCLTQSGFQALVDSGAS 342
               GPA   +  F  +  ++D+   Y+     V V    +  S +  S    ++DSG  
Sbjct: 306 LSLGGPAPANA-RFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTV 364

Query: 343 FTFLPTEIYAEVVVKFDKLVSS---KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
            T LP  +YA +   F + +     KR     +    CY+ +    +++P + L+F+   
Sbjct: 365 ITRLPPRVYAALRSAFARSMGRYGYKRAPAL-SILDTCYDFTGHTTVRIPSVALVFAGGA 423

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +  +      +                  D GIIG        +V+D    K+ +  + C
Sbjct: 424 AVGLDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGC 483


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 162/377 (42%), Gaps = 48/377 (12%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y LH   + IGTP     + LD GS+L+W  CQ C  C           +++L  YD S 
Sbjct: 91  YLLH---LAIGTPPQPVQLTLDTGSDLVWTQCQPCAVC----------FNQSLPYYDASR 137

Query: 166 SSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           SS+    SC    CK   S + C       C +   YS  D S++   +D  +   SF  
Sbjct: 138 SSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAF--SYSYGDKSATIGFLD--VETVSFVA 193

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
            A   SV   V+ GCG   TG +       G+ G G G +S+PS L K G   + F+   
Sbjct: 194 GA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVS 246

Query: 282 DENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ 334
               S  +F         G  T Q+T  +        Y++ ++   +G++ L   +S F 
Sbjct: 247 GRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFA 306

Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS-EEML 386
                   ++DSG +FT LP  +Y  V  +F   V    +         C++A    +  
Sbjct: 307 LKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAP 366

Query: 387 KVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            VP + L F      + R N++F   ++ G    CL ++  +G+  IIG        +++
Sbjct: 367 HVPKLVLHFEGATMHLPRENYVFE-AKDGGNCSICLAII--EGEMTIIGNFQQQNMHVLY 423

Query: 446 DRENLKLAWSHSKCEEV 462
           D +N KL++  +KC+++
Sbjct: 424 DLKNSKLSFVRAKCDKL 440


>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
 gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
          Length = 420

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 161/391 (41%), Gaps = 65/391 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           I+IG P   + + LD GS+L W+ C   C++C              L    P    SS  
Sbjct: 42  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 87

Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV D+     FS +  Q 
Sbjct: 88  IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDV-----FSMNYTQG 140

Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
             +   + +GCG  Q          DGV+GLG G VS+ S L   G ++N    C     
Sbjct: 141 LRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLG 200

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSCLTQSGFQALVDSGAS 342
            G +FFGD    + +  S+ P+  +Y  ++   +G E    G           + DSG+S
Sbjct: 201 GGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGRTTGLKNLLTVFDSGSS 258

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNA-----SSEEMLK-VPDMRLI 394
           +T+  ++ Y  V     + +S K +  +   ++   C+       S EE+ K    + L 
Sbjct: 259 YTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALS 318

Query: 395 FSKN-----------QSFVVRNHIFSFPENEGFTV--------FCLTVMSTD----GDYG 431
           F              +++++ +  FS    +G  +         CL +++       +  
Sbjct: 319 FKTGWRSKTLFEIPPEAYLIISVWFSHTMLKGRFIKMLQMKGNVCLGILNGTEIGLQNLN 378

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           +IG   M    I++D E   + W    C+E+
Sbjct: 379 LIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 409


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 100/382 (26%), Positives = 160/382 (41%), Gaps = 55/382 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + IGTP     + LD GS+L+W  CQ C  C           D+ L  +DPS+
Sbjct: 82  YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC----------FDQALPYFDPST 128

Query: 166 SSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           SS+    SC   LC+    +SC S K      C Y   Y  + + ++G+L  D       
Sbjct: 129 SSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDKFTFVGA 187

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               P       V  GCG    G +       G+ G G G +S+PS L K G    +FS 
Sbjct: 188 GASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSH 234

Query: 280 CFDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 329
           CF   +     +V           G    QST  +        Y++ ++   +G++ L  
Sbjct: 235 CFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPV 294

Query: 330 -QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
            +S F         ++DSG + T LPT +Y  V   F   V    +S       +C +A 
Sbjct: 295 PESEFALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAP 354

Query: 382 SEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
                 VP + L F      + R N++F   E+ G ++ CL ++   G+   IG      
Sbjct: 355 LRAKPYVPKLVLHFEGATMDLPRENYVFEV-EDAGSSILCLAIIE-GGEVTTIGNFQQQN 412

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             +++D +N KL++  ++C+++
Sbjct: 413 MHVLYDLQNSKLSFVPAQCDKL 434


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 100/382 (26%), Positives = 160/382 (41%), Gaps = 55/382 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + IGTP     + LD GS+L+W  CQ C  C           D+ L  +DPS+
Sbjct: 82  YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC----------FDQALPYFDPST 128

Query: 166 SSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           SS+    SC   LC+    +SC S K      C Y   Y  + + ++G+L  D       
Sbjct: 129 SSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDKFTFVGA 187

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               P       V  GCG    G +       G+ G G G +S+PS L K G    +FS 
Sbjct: 188 GASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSH 234

Query: 280 CFDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 329
           CF   +     +V           G    QST  +        Y++ ++   +G++ L  
Sbjct: 235 CFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPV 294

Query: 330 -QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
            +S F         ++DSG + T LPT +Y  V   F   V    +S       +C +A 
Sbjct: 295 PESEFTLKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAP 354

Query: 382 SEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
                 VP + L F      + R N++F   E+ G ++ CL ++   G+   IG      
Sbjct: 355 LRAKPYVPKLVLHFEGATMDLPRENYVFEV-EDAGSSILCLAIIE-GGEVTTIGNFQQQN 412

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             +++D +N KL++  ++C+++
Sbjct: 413 MHVLYDLQNSKLSFVPAQCDKL 434


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 93/367 (25%), Positives = 158/367 (43%), Gaps = 42/367 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  + +G+P   + + LD GS+L W     +QC P     ++ +D     ++PS+S++ 
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSW-----LQCKPCVVYCHSQVD---PLFEPSASNTY 171

Query: 170 KNVSCSHPLCKSRSSCKSLKDP-------CPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           + + CS   C S     +L DP       C Y A Y  + + S GYL  D+L L      
Sbjct: 172 RPLYCSSSEC-SLLKAATLNDPLCTASGVCVYTASYG-DASYSMGYLSRDLLTLT----- 224

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICF 281
              S    S   GCG+   G +   A   G++GL    +S+ + L+ K G    +FS C 
Sbjct: 225 --PSQTLPSFTYGCGQDNEGLFGKAA---GIVGLARDKLSMLAQLSPKYGY---AFSYCL 276

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNS--CLTQSGFQA- 335
             + S    F   G  +  S  F P+    +    YF+ + +  +      +  +G+Q  
Sbjct: 277 PTSTSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVP 336

Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRL 393
            ++DSG   T LP  IYA +   F K++S +       S    C+  S + M   P++R+
Sbjct: 337 TIIDSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRM 396

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
           IF       +R        ++G  + CL   S++    IIG +    + I +D    K+ 
Sbjct: 397 IFQGGADLSLRAPNILIEADKG--IACLAFASSN-QIAIIGNHQQQTYNIAYDVSASKIG 453

Query: 454 WSHSKCE 460
           ++   C 
Sbjct: 454 FAPGGCR 460


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 161/377 (42%), Gaps = 48/377 (12%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y LH   + IGTP     + LD GS L+W  CQ C  C           +++L  YD S 
Sbjct: 91  YLLH---LAIGTPPQPVQLTLDTGSVLVWTQCQPCAVC----------FNQSLPYYDASR 137

Query: 166 SSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           SS+    SC    CK   S + C       C Y   YS  D S++   +D  +   SF  
Sbjct: 138 SSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAY--SYSYGDKSATIGFLD--VETVSFVA 193

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
            A   SV   V+ GCG   TG +       G+ G G G +S+PS L K G   + F+   
Sbjct: 194 GA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVS 246

Query: 282 DENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ 334
               S  +F         G  T Q+T  +        Y++ ++   +G++ L   +S F 
Sbjct: 247 GRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFA 306

Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS-EEML 386
                   ++DSG +FT LP  +Y  V  +F   V    +         C++A    +  
Sbjct: 307 LKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAP 366

Query: 387 KVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            VP + L F      + R N++F   ++ G    CL ++  +G+  IIG        +++
Sbjct: 367 HVPKLVLHFEGATMHLPRENYVFE-AKDGGNCSICLAII--EGEMTIIGNFQQQNMHVLY 423

Query: 446 DRENLKLAWSHSKCEEV 462
           D +N KL++  +KC+++
Sbjct: 424 DLKNSKLSFVRAKCDKL 440


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/390 (23%), Positives = 158/390 (40%), Gaps = 56/390 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y +H   + +GTP     + LD GS+L+W      QCAP    ++    + L   DP++S
Sbjct: 92  YLVH---LAVGTPPRPVALTLDTGSDLVWT-----QCAPCRDCFH----QGLPLLDPAAS 139

Query: 167 SSSKNVSCSHPLCKS----------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
           S+   + C  P C++          RSS  +    C YI  Y  + + + G +  D    
Sbjct: 140 STYAALPCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYG-DKSVTVGEIATDRFTF 198

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
              +           +  GCG    G +       G+ G G G  S+PS L        +
Sbjct: 199 GGDNGDGDSRLPTRRLTFGCGHFNKGVFQSNET--GIAGFGRGRWSLPSQLNV-----TT 251

Query: 277 FSICFD---ENDSGSVFFGDQGPATQ------------QSTSFLPIGEKYDAYFVGVESY 321
           FS CF    E+ S  V  G    A              ++T  L    +   YF+ ++  
Sbjct: 252 FSYCFTSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGI 311

Query: 322 CIGNSCLTQSGFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL-QGNSWKYC 377
            +G + L     +    ++DSGAS T LP  +Y  V  +F   V      + +G++   C
Sbjct: 312 SVGKTRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLC 371

Query: 378 YNASSEEMLK---VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGII 433
           +      + +   VP + L        + R N++F   E+    V C+ + +  GD  +I
Sbjct: 372 FALPVTALWRRPPVPSLTLHLDGADWELPRGNYVF---EDLAARVMCVVLDAAPGDQTVI 428

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
           G        +V+D EN  L+++ ++C+ ++
Sbjct: 429 GNFQQQNTHVVYDLENDWLSFAPARCDSLV 458


>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
          Length = 415

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/391 (24%), Positives = 161/391 (41%), Gaps = 68/391 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           HY +     P  +    +D GSN+ W                       +E + S S + 
Sbjct: 56  HYRFELTHRPKDNISAVVDTGSNIFWT----------------------TEKECSRSKTR 93

Query: 170 KNVSCSHPLCKSRSSCKSLKD----------PCPYIADYS-TEDTSSSGYLVDDILHLAS 218
             + C  P C+ R+SC   +            C Y   Y    + S++G L +D L + +
Sbjct: 94  SMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVA 153

Query: 219 F-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
             SK  P S     V IGC    T  + D +   GV GLG    S+P  L  +      F
Sbjct: 154 VASKAVPGSQSFEEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQLNFS-----KF 207

Query: 278 SIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDAYFVGVESYCIG 324
           S C   + + D  S       P          A   +T+  P  +    YFV ++   IG
Sbjct: 208 SYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIG 267

Query: 325 NSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKYC 377
            + L    T+SG    VD+G SFT L   ++A++V + D+++  ++   +    N+ + C
Sbjct: 268 GTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQIC 327

Query: 378 Y---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD--GDYGI 432
           Y   + +++E  K+PDM L F+ + + V+    + +      +  CL +  ++  G   +
Sbjct: 328 YSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTT---SKLCLAIDKSNIKGGISV 384

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
           +G   M    ++ D  N KL++  + C +VI
Sbjct: 385 LGNFQMQNTHMLLDTGNEKLSFVRADCSKVI 415


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 99/402 (24%), Positives = 164/402 (40%), Gaps = 74/402 (18%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + +GTP     + LD GS+L+W  C  C+ C    A         +   DP++
Sbjct: 94  YLVH---LSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGA---------IPVLDPAA 141

Query: 166 SSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
           SS+   V C  P+C++       R      +  C Y+  Y  + + + G L  D      
Sbjct: 142 SSTHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYG-DKSITVGKLASDRFTFGP 200

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
                     +  +  GCG    G +   A   G+ G G G  S+PS L        SFS
Sbjct: 201 GDNADGGGVSERRLTFGCGHFNKGIFQ--ANETGIAGFGRGRWSLPSQLGV-----TSFS 253

Query: 279 ICFD---ENDSGSVFFGDQGPAT------QQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
            CF    E+ S  V  G   PA        QST  L    +   YF+ +++  +G + + 
Sbjct: 254 YCFTSMFESTSSLVTLG-VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIP 312

Query: 330 QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
               +       A++DSGAS T LP ++Y  V  +F   V     +++G++   C+   S
Sbjct: 313 IPERRQRLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPS 372

Query: 383 EEM-----------------LKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFTVFCL 421
                               ++VP  RL+F      +      N++F   E+ G  V CL
Sbjct: 373 AAAPKSAFGWRWRGRGRAMPVRVP--RLVFHLGGGADWELPRENYVF---EDYGARVMCL 427

Query: 422 TV--MSTDGDYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCE 460
            +   +  GD  ++  N+   +  +V+D EN  L+++ ++CE
Sbjct: 428 VLDAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARCE 469


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 154/366 (42%), Gaps = 44/366 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +++ + +G+P     + LD GS++ WV CQ C  C       Y   D     +DPS S+S
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSTS 216

Query: 169 SKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
             +V+C +P C     ++C++    C Y   Y  + + + G    + L L         S
Sbjct: 217 YASVACDNPRCHDLDAAACRNSTGACLYEVAYG-DGSYTVGDFATETLTLG-------DS 268

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
           +  SSV IGCG    G ++  A    + G  L   S PS ++       +FS C  + DS
Sbjct: 269 APVSSVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDS 320

Query: 287 GS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------- 334
            S   + FGD   A + +   +        Y+VG+    +G   L+   S F        
Sbjct: 321 PSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAG 379

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             +VDSG + T L +  YA +   F +   S   +   + +  CY+ S    ++VP + L
Sbjct: 380 GVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSL 439

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            F+      +    +  P  +G   +CL    T+    IIG     G R+ FD     + 
Sbjct: 440 RFAGGGELRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVG 498

Query: 454 WSHSKC 459
           ++ +KC
Sbjct: 499 FTTNKC 504


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 116/419 (27%), Positives = 184/419 (43%), Gaps = 68/419 (16%)

Query: 54  PKKNSVEYLELLLSNDWKRQKTRVKLQ-SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT 112
           P  NS E     + N  +R   R  LQ SN+++S N         Q+    N+  +L   
Sbjct: 39  PFYNSAETSSQRMRNAIRRSA-RSTLQFSNDDASPNS-------PQSFITSNRGEYLMN- 89

Query: 113 WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
            I IGTP V  L   D GS+L+W      QC P    Y     +    +DP  SS+ + V
Sbjct: 90  -ISIGTPPVPILAIADTGSDLIWT-----QCNPCEDCY----QQTSPLFDPKESSTYRKV 139

Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           SCS   C++   +SC + ++ C Y   Y  +++ + G +  D + + S S   P S    
Sbjct: 140 SCSSSQCRALEDASCSTDENTCSYTITYG-DNSYTKGDVAVDTVTMGS-SGRRPVS--LR 195

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEND 285
           ++IIGCG + TG++    A  G++GLG G  S+ S L K+  I   FS C      +   
Sbjct: 196 NMIIGCGHENTGTF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETGL 251

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCL--TQSGF-----QAL 336
           +  + FG  G  +        + +K  A  YF+ +E+  +G+  +  T + F       +
Sbjct: 252 TSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIV 311

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +DSG + T LP+  Y E+       + ++R+         CY  SS    KVPD+ + F 
Sbjct: 312 IDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSS--FKVPDITVHFK 369

Query: 397 KN-------QSFVVRNH---IFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHRIV 444
                     +FV  +     F+F  NE  T+F           G + Q NF++G+  V
Sbjct: 370 GGDVKLGNLNTFVAVSEDVSCFAFAANEQLTIF-----------GNLAQMNFLVGYDTV 417


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 160/382 (41%), Gaps = 64/382 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I IG+P V+ L+ +D  S+LLW+ C+ CI C            ++L  +DPS S + +N 
Sbjct: 89  ISIGSPPVTQLLHMDTASDLLWLQCRPCINCYA----------QSLPIFDPSRSYTHRNE 138

Query: 173 SC-----SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           SC     S P  +  +  +S    C Y   Y  + T S G L  ++L   +    +  ++
Sbjct: 139 SCRTSQYSMPSLRFNAKTRS----CEYSMRY-MDGTGSKGILAKEMLMFNTIYDESSSAA 193

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
           +   V+ GCG    G  L G    G++GLG G+    SL+ + G     FS CF   D  
Sbjct: 194 LH-DVVFGCGHDNYGEPLVGT---GILGLGYGEF---SLVHRFG---TKFSYCFGSLDDP 243

Query: 288 S-----VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT----------QSG 332
           S     +  GD G      T+ L I   +  Y+V +E+  +    L           Q+G
Sbjct: 244 SYPHNVLVLGDDGANILGDTTPLEIYNGF--YYVTIEAISVDGIILPIDPWVFNRNHQTG 301

Query: 333 FQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKY-CYNASSEEML- 386
               ++D+G S T L  E Y  +  K +     +  +    Q + +K  CYN + E  L 
Sbjct: 302 LGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLV 361

Query: 387 --KVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
               P +   FS     S  V++       N    VFCL V  T G+   IG      + 
Sbjct: 362 ESGFPIVTFHFSDGAELSLDVKSVFMKLSPN----VFCLAV--TPGNMNSIGATAQQSYN 415

Query: 443 IVFDRENLKLAWSHSKCEEVID 464
           I +D E  K+++    C  + D
Sbjct: 416 IGYDLEAKKISFERIDCGVLFD 437


>gi|299471769|emb|CBN76990.1| aspartic protease PM5 [Ectocarpus siliculosus]
          Length = 947

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/428 (23%), Positives = 174/428 (40%), Gaps = 52/428 (12%)

Query: 57  NSVEYLELLLSNDWKRQKTR--VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWI 114
           N  E  EL  + D K+   R     +S   S  +  LFP  G+            H+ ++
Sbjct: 83  NHTESAELTAAVDAKKLARRDWQGRRSLYMSFEDTPLFPGWGT------------HFAYV 130

Query: 115 DIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
             GTP     V +D GS+    PC +C  C   +  ++          D S S+SS  V+
Sbjct: 131 YAGTPPQRVSVIIDTGSHFTAFPCSECENCGSHTDPHW----------DQSKSTSSHIVT 180

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ------SS 227
           C    C     C+  K  C +   YS E +S   Y V+D+L +   +    +      S+
Sbjct: 181 CED--CHGSFRCQKDKR-CGFSQRYS-EGSSWRAYQVEDVLWVGELTLQQSEKINHDESA 236

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 286
                + GC   QTG +    A DG+MG+     ++   LAKAG I + +FS+CF +N  
Sbjct: 237 YSVEFMFGCIESQTGLFKTQLA-DGIMGMSADSHTLVWQLAKAGKIKERTFSLCFGKNGG 295

Query: 287 GSVFFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQALVD 338
             V  G      +      + P  +    + V V    +       +  + Q G   +VD
Sbjct: 296 TMVIGGYDTRLNKPGHEMMYTPSTKTNGWFTVQVTDITVNRVSIAQDPAIFQRGKGIIVD 355

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           SG + T+LP  +       +++   S   + + N   +C   +S E+  +P + +     
Sbjct: 356 SGTTDTYLPRSVAKGFSAAWERATGSPYANCKDN--HFCMILTSAELEALPTVTIHMDGG 413

Query: 399 QSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 456
               VR   ++ +  ++     +   +  T+   G++G N M+ H +VFD EN  + ++ 
Sbjct: 414 LEVNVRPSGYMDALGKD---NAYAPRIYLTESMGGVLGANVMLDHNVVFDYENHLVGFAE 470

Query: 457 SKCEEVID 464
             C+   D
Sbjct: 471 GVCDYRAD 478


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 96/375 (25%), Positives = 153/375 (40%), Gaps = 48/375 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I +GTP   F + +D GS+L W     +QC P     +  +D     + PS+S + 
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSW-----LQCQPCVIYCHVQVD---PIFTPSTSKTY 164

Query: 170 KNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSK 221
           K + CS   C S          C +    C Y A Y   DTS S GYL  D+L L     
Sbjct: 165 KALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYG--DTSFSIGYLSQDVLTL----- 217

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
             P  +  S  + GCG+   G +       G++GL    +S+   L+K     N+FS C 
Sbjct: 218 -TPSEAPSSGFVYGCGQDNQGLF---GRSSGIIGLANDKISMLGQLSKK--YGNAFSYCL 271

Query: 282 DEND--------SGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQ 330
             +         SG +  G     T     F P+ +       YF+ + +  +    L  
Sbjct: 272 PSSFSAPNSSSLSGFLSIGASS-LTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGV 330

Query: 331 SG----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEM 385
           S        ++DSG   T LP  +Y  +   F  ++S K     G S    C+  S +EM
Sbjct: 331 SASSYNVPTIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEM 390

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
             VP++++IF       ++ H       +G T  CL + ++     IIG       ++ +
Sbjct: 391 STVPEIQIIFRGGAGLELKAHNSLVEIEKGTT--CLAIAASSNPISIIGNYQQQTFKVAY 448

Query: 446 DRENLKLAWSHSKCE 460
           D  N K+ ++   C+
Sbjct: 449 DVANFKIGFAPGGCQ 463


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 154/366 (42%), Gaps = 44/366 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +++ + +G+P     + LD GS++ WV CQ C  C       Y   D     +DPS S+S
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSTS 212

Query: 169 SKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
             +V+C +P C     ++C++    C Y   Y  + + + G    + L L         S
Sbjct: 213 YASVACDNPRCHDLDAAACRNSTGACLYEVAYG-DGSYTVGDFATETLTLG-------DS 264

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
           +  SSV IGCG    G ++  A    + G  L   S PS ++       +FS C  + DS
Sbjct: 265 APVSSVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDS 316

Query: 287 GS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------- 334
            S   + FGD   A + +   +        Y+VG+    +G   L+   S F        
Sbjct: 317 PSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAG 375

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             +VDSG + T L +  YA +   F +   S   +   + +  CY+ S    ++VP + L
Sbjct: 376 GVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSL 435

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            F+      +    +  P + G   +CL    T+    IIG     G R+ FD     + 
Sbjct: 436 RFAGGGELRLPAKNYLIPVD-GAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVG 494

Query: 454 WSHSKC 459
           ++ +KC
Sbjct: 495 FTSNKC 500


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 161/377 (42%), Gaps = 48/377 (12%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y LH   + IGTP     + LD GS L+W  CQ C  C           +++L  YD S 
Sbjct: 35  YLLH---LAIGTPPQPVQLTLDTGSVLVWTQCQPCAVC----------FNQSLPYYDASR 81

Query: 166 SSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           SS+    SC    CK   S + C       C Y   YS  D S++   +D  +   SF  
Sbjct: 82  SSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAY--SYSYGDKSATIGFLD--VETVSFVA 137

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
            A   SV   V+ GCG   TG +       G+ G G G +S+PS L K G   + F+   
Sbjct: 138 GA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVS 190

Query: 282 DENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ 334
               S  +F         G  T Q+T  +        Y++ ++   +G++ L   +S F 
Sbjct: 191 GRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFA 250

Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS-EEML 386
                   ++DSG +FT LP  +Y  V  +F   V    +         C++A    +  
Sbjct: 251 LKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAP 310

Query: 387 KVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            VP + L F      + R N++F   ++ G    CL ++  +G+  IIG        +++
Sbjct: 311 HVPKLVLHFEGATMHLPRENYVFE-AKDGGNCSICLAII--EGEMTIIGNFQQQNMHVLY 367

Query: 446 DRENLKLAWSHSKCEEV 462
           D +N KL++  +KC+++
Sbjct: 368 DLKNSKLSFVRAKCDKL 384


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 156/367 (42%), Gaps = 45/367 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ I +GTP     V LD GS++ W     IQC P S  Y     ++   +DP+SSS+ 
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNW-----IQCLPCSECY----QQSDPIFDPTSSSTF 214

Query: 170 KNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           K+++CS P C S   S+C+S K  C Y   Y     +   Y  D +           +S 
Sbjct: 215 KSLTCSDPKCASLDVSACRSNK--CLYQVSYGDGSFTVGNYATDTVTF--------GESG 264

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
             + V +GCG    G +   A   G+           S+  +  +   SFS C  + DS 
Sbjct: 265 KVNDVALGCGHDNEGLFTGAAGLLGLG------GGALSMTNQ--IKAKSFSYCLVDRDSA 316

Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSGFQ------- 334
              S+ F         +T+ L    K D  Y+VG+  + +G     +  S F+       
Sbjct: 317 KSSSLDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAG 376

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYNASSEEMLKVPDMR 392
             ++D G + T L T+ Y  +   F KL +  K+ +   + +  CY+ SS   +KVP + 
Sbjct: 377 GVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVT 436

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             F+  +S  +    +  P ++  T FC     T     IIG     G RI +D  N  +
Sbjct: 437 FHFTGGKSLNLPAKNYLIPIDDAGT-FCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLI 495

Query: 453 AWSHSKC 459
             S +KC
Sbjct: 496 GLSANKC 502


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 109/414 (26%), Positives = 169/414 (40%), Gaps = 90/414 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++IGTP  +  V LD GS+L WVPC      CI+C  L  +      ++ S + P  SS+
Sbjct: 87  LNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDL----KSPSVFSPLHSST 142

Query: 169 SKNVSCSHPLCKSRSSCKSLKD-------------------PCPYIADYSTEDTSSSGYL 209
           S   SC+   C    S  +  D                   PCP  A    E    SG L
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
             DIL   + ++  P+ S       GC    T +Y +   P G+ G G G +S+PS L  
Sbjct: 203 TRDILK--ARTRDVPRFS------FGC---VTSTYRE---PIGIAGFGRGLLSLPSQL-- 246

Query: 270 AGLIQNSFSICF-------DENDSGSVFFGDQGPATQ-----QSTSFLPIGEKYDAYFVG 317
            G ++  FS CF       + N S  +  G    +       Q T  L      ++Y++G
Sbjct: 247 -GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIG 305

Query: 318 VESYCIGNSC------LTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
           +ES  IG +       LT   F +      LVDSG ++T LP   Y++++      ++  
Sbjct: 306 LESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITYP 365

Query: 366 RISLQ--GNSWKYCY-------NASSEE---MLKVPDMRLIFSKNQSFVVRN----HIFS 409
           R +       +  CY       N +S E   M+  P +   F  N + ++      +  S
Sbjct: 366 RATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMS 425

Query: 410 FPENEGFTVFCLTVMST-DGDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            P ++G  V CL   +  DGDY   G+ G       ++V+D E  ++ +    C
Sbjct: 426 AP-SDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478


>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 452

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 102/390 (26%), Positives = 157/390 (40%), Gaps = 57/390 (14%)

Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN +   HYT  ++IG P   + + +D+GS+L WV C   C  C       Y   + NL 
Sbjct: 56  GNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKP-NHNL- 113

Query: 160 EYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
                       V C   LC         +C S  D C Y  +Y+ +  SS G LV D +
Sbjct: 114 ------------VQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYA-DHGSSLGVLVRDYI 160

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLI 273
                 +    S V+  V  GCG  Q  S  +   A  GV+GLG G  S+ S L   GLI
Sbjct: 161 PF----QFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLI 216

Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQS--TSFLP-IGEKYDAYFVGVESYCIGNSCLTQ 330
            N    C      G +FFGD    +     TS LP   EK+  Y  G             
Sbjct: 217 HNVVGHCLSARGGGFLFFGDDFIPSSGIVWTSMLPSSSEKH--YSSGPAELVFNGKATVV 274

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEE 384
            G + + DSG+S+T+  ++ Y  VV    + +  K++    +       WK   +  S  
Sbjct: 275 KGLELIFDSGSSYTYFNSQAYQAVVDLVTQDLKGKQLKRATDDPSLPICWKGAKSFKSLS 334

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF------CLTVMSTDG------DYGI 432
            +K     L  S  ++ +++ H+      E + +       CL ++  DG      +  I
Sbjct: 335 DVKKYFKPLALSFTKTKILQMHL----PPEAYLIITKHGNVCLGIL--DGTEVGLENLNI 388

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IG   +    +++D E  ++ W  S C+ +
Sbjct: 389 IGDISLQDKMVIYDNEKQQIGWVSSNCDRL 418


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 89/383 (23%), Positives = 158/383 (41%), Gaps = 57/383 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP+   ++ +D GS+L+W     +QC+P    Y     +    +DP  SS+ 
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVW-----LQCSPCRRCY----AQRGQVFDPRRSSTY 136

Query: 170 KNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           + V CS P C++       S  +    C Y+  Y  + +SS+G L  D L  A+      
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYG-DGSSSTGELATDKLAFAN------ 189

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
             +  ++V +GCGR   G + D AA  G++G+  G +S+ + +A A    + F  C  + 
Sbjct: 190 -DTYVNNVTLGCGRDNEGLF-DSAA--GLLGVARGKISISTQVAPA--YGSVFEYCLGDR 243

Query: 285 DSGS------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ---- 334
            S S      VF     P +   T+ L    +   Y+V +  + +G   +T  GF     
Sbjct: 244 TSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVT--GFSNASL 301

Query: 335 ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKYCYNAS 381
                      +VDSG + +    + YA +   FD    +  +     + + +  CY+  
Sbjct: 302 ALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLR 361

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF-----CLTVMSTDGDYGIIGQN 436
                  P + L F+      +    +  P + G         CL   + D    +IG  
Sbjct: 362 GRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNV 421

Query: 437 FMMGHRIVFDRENLKLAWSHSKC 459
              G R+VFD E  ++ ++   C
Sbjct: 422 QQQGFRVVFDVEKERIGFAPKGC 444


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 102/355 (28%), Positives = 165/355 (46%), Gaps = 43/355 (12%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y + Y+   +G P       +D GS+++W     +QC P    Y    ++    +DPS S
Sbjct: 86  YLISYS---VGIPPFQLYGIIDTGSDMIW-----LQCKPCEKCY----NQTTRIFDPSKS 133

Query: 167 SSSKNVSCSHPLCKS--RSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           ++ K +  S   C+S   +SC S  +  C Y   Y  + + S G L  + L L S     
Sbjct: 134 NTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYG-DGSYSQGDLSVETLTLGS----T 188

Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQNSFSICF 281
             SSV+    +IGCGR  T S+ +G +  G++GLG G VS +  L  ++  I   FS C 
Sbjct: 189 NGSSVKFRRTVIGCGRNNTVSF-EGKS-SGIVGLGNGPVSLINQLRRRSSSIGRKFSYCL 246

Query: 282 DE--NDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCL--TQSGFQ- 334
               N S  + FGD    +   T   PI   +    Y++ +E++ +GN+ +  T S F+ 
Sbjct: 247 ASMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRF 306

Query: 335 -----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
                 ++DSG + T LP +IY+++      LV   R+         CY ++ +E L  P
Sbjct: 307 GEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTFDE-LNAP 365

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIG-QNFMMGH 441
            +   FS     V  N + +F E E   V CL  +S+     +G +  QNF++G+
Sbjct: 366 VIMAHFSGAD--VKLNAVNTFIEVEQ-GVTCLAFISSKIGPIFGNMAQQNFLVGY 417


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 92/376 (24%), Positives = 161/376 (42%), Gaps = 55/376 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  + IG+P     + +D+GS+++WV C+ C++C       Y   D     +DP+SS++
Sbjct: 125 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLEC-------YAQAD---PLFDPASSAT 174

Query: 169 SKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
              VSC   +C++ R+S       C Y   Y  + + + G L  + L L          +
Sbjct: 175 FSAVSCGSAICRTLRTSGCGDSGGCEYEVSYG-DGSYTKGTLALETLTLG--------GT 225

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---- 283
               V IGCG +  G ++  A   G++GLG G +S+   L  A     +FS C       
Sbjct: 226 AVEGVAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGGS 280

Query: 284 -----NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC-------- 327
                + +GS+  G +  A  +   ++P+     A   Y+VGV    +G+          
Sbjct: 281 GSGAADAAGSLVLG-RSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLF 339

Query: 328 -LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
            LT+ G   +V D+G + T LP E YA +   F   V +   +   +    CY+ S    
Sbjct: 340 QLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTS 399

Query: 386 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
           ++VP +   F    +  +  RN +    E +G  ++CL    +     I+G     G +I
Sbjct: 400 VRVPTVSFYFDGAATLTLPARNLLL---EVDG-GIYCLAFAPSSSGLSILGNIQQEGIQI 455

Query: 444 VFDRENLKLAWSHSKC 459
             D  N  + +  + C
Sbjct: 456 TVDSANGYIGFGPATC 471


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 98/372 (26%), Positives = 157/372 (42%), Gaps = 64/372 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I IG+P ++ L+ +D  S+LLW+ C  CI C            ++L  +DPS S + +N 
Sbjct: 89  ISIGSPPITQLLHMDTASDLLWIQCLPCINCYA----------QSLPIFDPSRSYTHRNE 138

Query: 173 SC-----SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           +C     S P  K  ++ +S    C Y   Y  +DT S G L  ++L   +    +  ++
Sbjct: 139 TCRTSQYSMPSLKFNANTRS----CEYSMRY-VDDTGSKGILAREMLLFNTIYDESSSAA 193

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
           +   V+ GCG    G  L G    G++GLG G+ S+     K       FS CF   D  
Sbjct: 194 LH-DVVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGK------KFSYCFGSLDDP 243

Query: 288 S-----VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT----------QSG 332
           S     +  GD G      T+ L I   +  Y+V +E+  +    L           Q+G
Sbjct: 244 SYPHNVLVLGDDGANILGDTTPLEIHNGF--YYVTIEAISVDGIILPIDPRVFNRNHQTG 301

Query: 333 FQA-LVDSGASFTFLPTEIYAEVVVK----FDKLVSSKRISLQGNSWKYCYNASSEEML- 386
               ++D+G S T L  E Y  +  +    F+   ++  +S        CYN + E  L 
Sbjct: 302 LGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLV 361

Query: 387 --KVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
               P +   FS+    S  V++       N    VFCL V  T G+   IG      + 
Sbjct: 362 ESGFPIVTFHFSEGAELSLDVKSLFMKLSPN----VFCLAV--TPGNLNSIGATAQQSYN 415

Query: 443 IVFDRENLKLAW 454
           I +D E +++++
Sbjct: 416 IGYDLEAMEVSF 427


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 110/424 (25%), Positives = 177/424 (41%), Gaps = 47/424 (11%)

Query: 77  VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
           VKL   N+S   Q+LF    +QT    + + +L    + IGTP V     +D GS+L+W+
Sbjct: 31  VKLIPRNSS---QVLFNRITAQTPVSVHHYDYL--MELSIGTPPVKTYAQVDTGSDLIWL 85

Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPY 194
             QCI C     + Y  L+     +DP SSS+  N++     C     +SC   ++ C Y
Sbjct: 86  --QCIPC----TNCYKQLN---PMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNY 136

Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
              Y  +D+ + G L  + L L S +    +      VI GCG    G + D     G++
Sbjct: 137 TYSYE-DDSITEGVLAQETLTLTSTTG---KPVALKGVIFGCGHNNNGVFNDKEM--GII 190

Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDS--GSVFFGDQGPATQQSTSFLPIGE 309
           GLG G +S+ S +  +      FS C   F  N S    + FG             P+  
Sbjct: 191 GLGRGPLSLVSQIGSS-FGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVS 249

Query: 310 K--YDAYF------VGVESYCI----GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVK 357
           K  + A++      + VE   +    G+S    +    ++DSG   T LP + Y  +V +
Sbjct: 250 KNTHQAFYFVTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEE 309

Query: 358 FDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF 416
               V+   I +     ++ CY   +   LK   +   F      +    IF  P  +G 
Sbjct: 310 VRNKVALDPIPIDPTLGYQLCYRTPTN--LKGTTLTAHFEGADVLLTPTQIF-IPVQDG- 365

Query: 417 TVFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 475
            +FC    ST   +YGI G +    + I FD E   +++  + C  + D   ++ V P  
Sbjct: 366 -IFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDCTNLQDAPSINGVLPNV 424

Query: 476 GQSP 479
             +P
Sbjct: 425 LSAP 428


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 127/478 (26%), Positives = 184/478 (38%), Gaps = 75/478 (15%)

Query: 16  LDGSDAVSFSSKLVHRFS--DEAKERWISKSGNVSVADSWPKKNSVEYLELLLSND---- 69
           L   D  + S +L+HR S   EAKE+                 +    LE L  ++    
Sbjct: 48  LSPRDGGTLSLELIHRNSLLREAKEKL--------------HTHEQLLLETLQRDEQRVR 93

Query: 70  WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
           W   K ++  +  + +S   L  P      +  G  F  L      +GTP  S  + +D 
Sbjct: 94  WIESKAQLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRL-----GVGTPARSLFMVVDT 148

Query: 130 GSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRS 183
           GS+L W+ CQ C  C       Y   D     +DP +SSS + + C  PLCK     S S
Sbjct: 149 GSDLPWLQCQPCKSC-------YKQAD---PIFDPRNSSSFQRIPCLSPLCKALEIHSCS 198

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
             +     C Y   Y  + + S G    D+  L + SK         SV  GCG    G 
Sbjct: 199 GSRGATSRCSYQVAYG-DGSFSVGDFSSDLFTLGTGSKAM-------SVAFGCGFDNEGL 250

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------SGSVFFGDQGPA 297
           +   A   G+    L   S     +      NSFS C  +        S S+ FG     
Sbjct: 251 FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGAAAIP 310

Query: 298 TQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL---------TQSGFQA-LVDSGASFTFL 346
           +  + S L    K D  Y+  +    +G + L         +QSG    ++DSG S T  
Sbjct: 311 STAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRF 370

Query: 347 PTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
           PT +YA +   F      L S+ R SL    +  CYN S +  + VP + L F       
Sbjct: 371 PTSVYATIRDAFRNATTNLPSAPRYSL----FDTCYNFSGKASVDVPALVLHFENGADLQ 426

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           +    +  P N   + FCL    T  + GIIG       RI FD +   LA++  +C+
Sbjct: 427 LPPTNYLIPINTAGS-FCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQCK 483


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 152/364 (41%), Gaps = 41/364 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +  G+P  ++ +++D GS++ W     IQC P S   Y   D     +DP+ S++   V 
Sbjct: 165 VGFGSPAQNYTLSIDTGSDVSW-----IQCLPCSGHCYKQHD---PVFDPTKSATYSAVP 216

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C HP C +     S    C Y   Y  + +S++G L  + L L+S ++  P         
Sbjct: 217 CGHPQCAAAGGKCSNSGTCLYKVTYG-DGSSTAGVLSHETLSLSS-TRDLP------GFA 268

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVFF 291
            GCG+   G +        ++GLG G +S+PS    A     +FS C    D+  G +  
Sbjct: 269 FGCGQTNLGEFGGVDG---LVGLGRGALSLPS--QAAATFGATFSYCLPSYDTTHGYLTM 323

Query: 292 GDQGPATQ------QSTSFLPIGEKYDAYFVGVESYCIGNSCL-------TQSGFQALVD 338
           G   PA        Q T+ +   +    YFV V S  IG   L       T+ G   L D
Sbjct: 324 GSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDG--TLFD 381

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           SG   T+LP E YA +  +F   ++  + +   + +  CY+ +    + +P +   FS  
Sbjct: 382 SGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSDG 441

Query: 399 QSFVVRN-HIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFDRENLKLAWS 455
             F +    I  +P++      CL  +       + IIG     G  +++D    K+ + 
Sbjct: 442 AVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFG 501

Query: 456 HSKC 459
              C
Sbjct: 502 QFTC 505


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 88/364 (24%), Positives = 144/364 (39%), Gaps = 46/364 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P     Y   ++    +DP+ SS+  N+S
Sbjct: 184 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQREK---LFDPARSSTYANIS 235

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C    +       C Y   Y  + + S G+   D L L+S+              
Sbjct: 236 CAAPACSDLDTRGCSGGNCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 287

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
            GCG +  G + + A   G++GLG G  S+P     K G +   F+ C     SG+ +  
Sbjct: 288 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSSGTGYLD 341

Query: 291 FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGA 341
           FG   PA   +    P+    G  +  Y+VG+    +G   L+  QS F     +VDSG 
Sbjct: 342 FGPGSPAAAGARLTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGT 399

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
             T LP   Y+ +   F   ++++       +SL       CY+ +    + +P + L+F
Sbjct: 400 VITRLPPAAYSSLRSAFASAMAARGYKKAPAVSL----LDTCYDFTGMSQVAIPTVSLLF 455

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
                  V      +  +              GD GI+G   +    + +D     + +S
Sbjct: 456 QGGARLDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFS 515

Query: 456 HSKC 459
              C
Sbjct: 516 PGAC 519


>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 401

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 115/263 (43%), Gaps = 34/263 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           I+IG P   + + LD GS+L W+ C   C++C              L    P    SS  
Sbjct: 61  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 106

Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV D+     FS +  Q 
Sbjct: 107 IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDV-----FSMNYTQG 159

Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
             +   + +GCG  Q          DGV+GLG G VS+ S L   G ++N    C     
Sbjct: 160 LRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLG 219

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSCLTQSGFQALVDSGAS 342
            G +FFGD    + +  S+ P+  +Y  ++   +G E    G           + DSG+S
Sbjct: 220 GGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGRTTGLKNLLTVFDSGSS 277

Query: 343 FTFLPTEIYAEVVVKFDKLVSSK 365
           +T+  ++ Y  V     + +S K
Sbjct: 278 YTYFNSKAYQAVTYLLKRELSGK 300


>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
          Length = 392

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 97/411 (23%), Positives = 165/411 (40%), Gaps = 80/411 (19%)

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
            + P     T + GNQ             P  +    +D GSN+ W              
Sbjct: 25  FMTPRTSCITFYLGNQ------------RPKDNISAVVDTGSNIFWT------------- 59

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD----------PCPYIADYS 199
                    +E + S S +   + C  P C+ R+SC   +            C Y   Y 
Sbjct: 60  ---------TEKECSRSKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYG 110

Query: 200 -TEDTSSSGYLVDDILHLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
              + S++G L +D L + +  SK  P S     V IGC    T  + D +   GV GLG
Sbjct: 111 GNANDSTAGVLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDPSI-KGVFGLG 169

Query: 258 LGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSF 304
               S+P  L  +      FS C   + + D  S       P          A   +T+ 
Sbjct: 170 RSATSLPRQLNFS-----KFSYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTAL 224

Query: 305 LPIGEKYDAYFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
            P  +    YFV ++   IG + L    T+SG    VD+G SFT L   ++A++V + D+
Sbjct: 225 QPNSDYKTRYFVDLQGISIGGTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDR 284

Query: 361 LVSSKRISLQ---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
           ++  ++   +    N+ + CY   + +++E  K+PDM L F+ + + V+    + +    
Sbjct: 285 IMKERKYVKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTT- 343

Query: 415 GFTVFCLTVMSTD--GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
             +  CL +  ++  G   ++G   M    ++ D  N KL++  + C +VI
Sbjct: 344 --SKLCLAIDKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKVI 392


>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Glycine max]
          Length = 454

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 100/388 (25%), Positives = 155/388 (39%), Gaps = 53/388 (13%)

Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN +   HYT  ++IG P   + + +D+GS+L WV C   C  C       Y   + NL 
Sbjct: 56  GNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKP-NHNL- 113

Query: 160 EYDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
                       V C   LC         +C S  DPC Y  +Y+ +  SS G LV D +
Sbjct: 114 ------------VQCVDQLCSEVHLSMAYNCPSPDDPCDYEVEYA-DHGSSLGVLVRDYI 160

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLI 273
                 +    S V+  V  GCG  Q  S  +   A  GV+GLG G  S+ S L   GLI
Sbjct: 161 PF----QFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLI 216

Query: 274 QNSFSICFDENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 332
           +N    C      G +FFGD   P++    + +        Y  G              G
Sbjct: 217 RNVVGHCLSAQGGGFLFFGDDFIPSSGIVWTSMLSSSSEKHYSSGPAELVFNGKATAVKG 276

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEEML 386
            + + DSG+S+T+  ++ Y  VV    K +  K++    +       WK   +  S   +
Sbjct: 277 LELIFDSGSSYTYFNSQAYQAVVDLVTKDLKGKQLKRATDDPSLPICWKGAKSFESLSDV 336

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF------CLTVMSTDG------DYGIIG 434
           K     L  S  +S  ++ H+      E + +       CL ++  DG      +  IIG
Sbjct: 337 KKYFKPLALSFKKSXNLQMHL----PPESYLIITKHGNVCLGIL--DGTEVGLENLNIIG 390

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKCEEV 462
              +    +++D E  ++ W  S C+ +
Sbjct: 391 DITLQDKMVIYDNEKQQIGWVSSNCDRL 418


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 93/382 (24%), Positives = 155/382 (40%), Gaps = 47/382 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSEYDPSS 165
           ++  + +GTP    L+  D GS+L+WV C     C +  P SA     L R+ + + P+ 
Sbjct: 89  YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSA----FLARHSTTFSPNH 144

Query: 166 SSSSKNVSCSH-PLCK-SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKH 222
              S   +C   PL K  R +   L  PC Y  +YS  D S +SG+   +   L + S  
Sbjct: 145 CYDS---ACQLVPLPKHHRCNHARLHSPCRY--EYSYGDGSKTSGFFSKETTTLNTSSG- 198

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
             + +    +  GC  + +G  + GA+     GVMGLG G +S+ S L       N FS 
Sbjct: 199 --REAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHR--FGNKFSY 254

Query: 280 CFDEND-----SGSVFFG----DQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC 327
           C  ++D     +  +  G    D  P  ++   F P+     +   Y++G+ES  +    
Sbjct: 255 CLMDHDISPSPTSYLLIGSTQNDVAPG-KRRMRFTPLHINPLSPTFYYIGIESVSVDGIK 313

Query: 328 L----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 377
           L                 +VDSG + TFLP   Y +++    + V     +     +  C
Sbjct: 314 LPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLC 373

Query: 378 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 437
            N S  E  ++P +      +  F      +    +E      L  + T   + +IG   
Sbjct: 374 VNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLM 433

Query: 438 MMGHRIVFDRENLKLAWSHSKC 459
             G  + FD++  +L +S   C
Sbjct: 434 QQGFLLEFDKDRTRLGFSRHGC 455


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 96/373 (25%), Positives = 154/373 (41%), Gaps = 39/373 (10%)

Query: 107 YWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y  HY   + IGTP        D GS+L W       C P +  Y     RN   +DP  
Sbjct: 21  YLGHYLMEVSIGTPPFKIYGIADTGSDLTWT-----SCVPCNKCYK---QRN-PIFDPQK 71

Query: 166 SSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           S+S +N+SC   LC K  +   S +  C Y   Y++    + G L  + + L+S      
Sbjct: 72  STSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYASAAI-TQGVLAQETITLSSTKG--- 127

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
           +S     ++ GCG   TG + D     G++GLG G VS  S +  +      FS C    
Sbjct: 128 ESVPLKGIVFGCGHNNTGGFNDREM--GIIGLGGGPVSFISQIGSS-FGGKRFSQCLVPF 184

Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSG----- 332
             D + S  +  G     + +     P+  K D   YFV +    +GN+ L  +G     
Sbjct: 185 HTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQS 244

Query: 333 ---FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKV 388
                  +DSG   T LPT++Y  +V +    V+ K ++   +   + CY   ++  L+ 
Sbjct: 245 VEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYR--TKNNLRG 302

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 448
           P +   F      ++    F  P++    VFCL   +T  D G+ G      + I FD +
Sbjct: 303 PVLTAHFEGGDVKLLPTQTFVSPKDG---VFCLGFTNTSSDGGVYGNFAQSNYLIGFDLD 359

Query: 449 NLKLAWSHSKCEE 461
              +++    C +
Sbjct: 360 RQVVSFKPMDCTK 372


>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
          Length = 642

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 106/423 (25%), Positives = 173/423 (40%), Gaps = 64/423 (15%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCA----PLSASYYTSLDRNLSEY 161
           Y  HY  I +G P     V +D GS+L  +PC  C  C     PL              +
Sbjct: 92  YGTHYAEIYLGIPAQRASVIVDTGSHLTALPCSTCQGCGQHTDPL--------------F 137

Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           D S S+++K ++C H       SC+S +    YI+    E +     +VD+++ +  FS 
Sbjct: 138 DVSKSTTAKYLAC-HDF----DSCRSCEQDRCYISQSYMEGSMWEAVMVDELVWVGGFSS 192

Query: 222 HAPQ-----SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QN 275
            A +      +      +GC  K+TG ++     +G+MGLG    +V S +  AG + QN
Sbjct: 193 PADEMEGVLKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRSTVMSYMLNAGRVTQN 251

Query: 276 SFSICFDENDSGSVFFGDQGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCL----- 328
            F++CF   D G + FG    +   S   + P+     AY+ V V+   +    L     
Sbjct: 252 LFTLCF-AGDGGELVFGGVDYSHHTSDVGYTPLLSDKSAYYPVHVKDILLNGVSLGIDTG 310

Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRISLQGNSWKYCYNASSE 383
              SG   +VDSG + TF   +     +  F K      S  R+ L           +SE
Sbjct: 311 TINSGRGVIVDSGTTDTFFDGKGKRAFMSAFSKAAGRDYSESRMKL-----------TSE 359

Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMSTDGDYGIIGQNF 437
           E+  +P + +I S  +     +     P ++  T       +      ++   G++G + 
Sbjct: 360 ELAALPVISIILSGMKGDGTDDVQLDVPASQYLTPADDGKSYYGNFHFSERSGGVLGASA 419

Query: 438 MMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPN-PLPTTEQQSTSNGQAA 496
           M+G  ++FD EN ++ ++ S C      S+     P A  S N P P T     SN    
Sbjct: 420 MVGFDVIFDVENKRVGFAESDCGR--SYSNATTAAPIASDSTNQPAPATPVSVDSNATEQ 477

Query: 497 APP 499
             P
Sbjct: 478 PAP 480


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 105/432 (24%), Positives = 179/432 (41%), Gaps = 63/432 (14%)

Query: 65  LLSNDWKRQKTR-VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LL     R K R  +L S   +S         GS T    +  Y +H   + IGTP    
Sbjct: 72  LLRRMAARSKARSARLLSGRAASARM----DPGSYTDGVPDTEYLVH---MAIGTPPQPV 124

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK--S 181
            + LD GS+L W      QCAP  + +  SL R    ++PS S +   + C   +C+  +
Sbjct: 125 QLILDTGSDLTWT-----QCAPCVSCFRQSLPR----FNPSRSMTFSVLPCDLRICRDLT 175

Query: 182 RSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 238
            SSC         C Y   Y+ + + ++G+L  D    AS + HA   +    +  GCG 
Sbjct: 176 WSSCGEQSWGNGICVYAYAYA-DHSITTGHLDSDTFSFAS-ADHAIGGASVPDLTFGCGL 233

Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFG--- 292
              G ++      G+ G   G +S+P     A L  ++FS CF     ++   VF G   
Sbjct: 234 FNNGIFVSNET--GIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPSPVFLGVPP 286

Query: 293 -------DQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNS---------CLTQSGFQA 335
                    G    QST+ +     +  AY++ ++   +G +          L + G   
Sbjct: 287 NLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGG 346

Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDM 391
            +VDSG   T LP  +Y  V    D  V+  ++++  ++    + C++        VP +
Sbjct: 347 TIVDSGTGMTMLPEAVYNLVC---DAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPAL 403

Query: 392 RLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
            L F      + R N++F   E  G  + CL + + + D  +IG        +++D  N 
Sbjct: 404 VLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMHVLYDLAND 462

Query: 451 KLAWSHSKCEEV 462
            L++  ++C ++
Sbjct: 463 MLSFVPARCNKI 474


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 97/354 (27%), Positives = 148/354 (41%), Gaps = 44/354 (12%)

Query: 127 LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
           LD GS+L W     +QC P +   +   D     YDPS S + K +SC+   C SR    
Sbjct: 3   LDTGSSLSW-----LQCQPCAVYCHAQAD---PLYDPSVSKTYKKLSCASVEC-SRLKAA 53

Query: 187 SLKDP--------CPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
           +L DP        C Y A Y   DTS S GYL  D+L L S S+  PQ         GCG
Sbjct: 54  TLNDPLCETDSNACLYTASYG--DTSFSIGYLSQDLLTLTS-SQTLPQ------FTYGCG 104

Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDENDSGSVFFGDQ-- 294
           +   G +   A   G++GL    +S+ + L+ K G   ++FS C    +SGS   G    
Sbjct: 105 QDNQGLFGRAA---GIIGLARDKLSMLAQLSTKYG---HAFSYCLPTANSGSSGGGFLSI 158

Query: 295 ---GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLP 347
               P + + T  L   +    YF+ + +  +    L  +        L+DSG   T LP
Sbjct: 159 GSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLP 218

Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH 406
             +YA +   F K++S+K       S    C+  S + +  VP++++IF       +R  
Sbjct: 219 MSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAP 278

Query: 407 IFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
                 ++G T       S      IIG      + I +D    ++ ++   C 
Sbjct: 279 SILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSCH 332


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 96/372 (25%), Positives = 155/372 (41%), Gaps = 38/372 (10%)

Query: 107 YWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y  HY   + IGTP        D GS+L W       C P +  Y     RN   +DP  
Sbjct: 68  YLGHYLMELSIGTPPFKIYGIADTGSDLTWT-----SCVPCNNCYK---QRN-PMFDPQK 118

Query: 166 SSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           S++ +N+SC   LC K  +   S +  C Y   Y++   +  G L  + + L+S      
Sbjct: 119 STTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAITR-GVLAQETITLSSTKG--- 174

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
           +S     ++ GCG   TG + D     G++GLG G VS+ S +  +      FS C    
Sbjct: 175 KSVPLKGIVFGCGHNNTGGFNDHEM--GIIGLGGGPVSLISQMGSS-FGGKRFSQCLVPF 231

Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSG----- 332
             D + S  + FG     + +     P+  K D   YFV +    + N+ L  +G     
Sbjct: 232 HTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNV 291

Query: 333 --FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVP 389
                 +DSG   T LPT++Y +VV +    V+ K ++   +   + CY   ++  L+ P
Sbjct: 292 EKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYR--TKNNLRGP 349

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
            +   F      +     F  P++    VFCL   +T  D G+ G      + I FD + 
Sbjct: 350 VLTAHFEGADVKLSPTQTFISPKDG---VFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDR 406

Query: 450 LKLAWSHSKCEE 461
             +++    C +
Sbjct: 407 QVVSFKPKDCTK 418


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 104/456 (22%), Positives = 183/456 (40%), Gaps = 68/456 (14%)

Query: 77  VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLW 135
           ++L +N++  R++ L  S     H   +     +YT  + IGTP   F + +D GS + +
Sbjct: 3   LELVANSHRRRDRELLGSARMDLH--DDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTY 60

Query: 136 VPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
           VPC  C  C           +     + P+ SSS K + C    C +     S K    Y
Sbjct: 61  VPCSSCTHCG----------NHQDPRFSPALSSSYKPLECGSE-CSTGFCDGSRK----Y 105

Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
              Y+ E ++SSG L  D++  ++ S    Q      ++ GC   +TG   D  A DG++
Sbjct: 106 QRQYA-EKSTSSGVLGKDVIGFSNSSDLGGQR-----LVFGCETAETGDLYDQTA-DGII 158

Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKY 311
           GLG G +S+   L +   +++ FS+C+   DE     +  G Q P     T+  P    Y
Sbjct: 159 GLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTASDPHRSPY 218

Query: 312 DAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
             Y + ++   +G S L          +  ++DSG ++ + P   +        + V S 
Sbjct: 219 --YNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSL 276

Query: 366 RISLQGNSWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTV 418
           +  + G   K+   CY  +   +  +    P +  +F   QS  +    + F   +    
Sbjct: 277 K-EVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISGA 335

Query: 419 FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQS 478
           +CL V        ++G   +    + ++R    + +  +KC ++  +             
Sbjct: 336 YCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKTKCNDLWSR------------- 382

Query: 479 PNPLPTTEQ--QSTSNGQAAAPPSTAKTAPSKSIAA 512
              LP T +   ST   Q   PP     APS S+ A
Sbjct: 383 ---LPETNEPGHSTQPAQFLLPP-----APSPSVGA 410


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 105/432 (24%), Positives = 179/432 (41%), Gaps = 63/432 (14%)

Query: 65  LLSNDWKRQKTR-VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LL     R K R  +L S   +S         GS T    +  Y +H   + IGTP    
Sbjct: 46  LLRRMAARSKARSARLLSGRAASARM----DPGSYTDGVPDTEYLVH---MAIGTPPQPV 98

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK--S 181
            + LD GS+L W      QCAP  + +  SL R    ++PS S +   + C   +C+  +
Sbjct: 99  QLILDTGSDLTWT-----QCAPCVSCFRQSLPR----FNPSRSMTFSVLPCDLRICRDLT 149

Query: 182 RSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 238
            SSC         C Y   Y+ + + ++G+L  D    AS + HA   +    +  GCG 
Sbjct: 150 WSSCGEQSWGNGICVYAYAYA-DHSITTGHLDSDTFSFAS-ADHAIGGASVPDLTFGCGL 207

Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFG--- 292
              G ++      G+ G   G +S+P     A L  ++FS CF     ++   VF G   
Sbjct: 208 FNNGIFVSNET--GIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPSPVFLGVPP 260

Query: 293 -------DQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNS---------CLTQSGFQA 335
                    G    QST+ +     +  AY++ ++   +G +          L + G   
Sbjct: 261 NLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGG 320

Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDM 391
            +VDSG   T LP  +Y  V    D  V+  ++++  ++    + C++        VP +
Sbjct: 321 TIVDSGTGMTMLPEAVYNLVC---DAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPAL 377

Query: 392 RLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
            L F      + R N++F   E  G  + CL + + + D  +IG        +++D  N 
Sbjct: 378 VLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMHVLYDLAND 436

Query: 451 KLAWSHSKCEEV 462
            L++  ++C ++
Sbjct: 437 MLSFVPARCNKI 448


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 144/396 (36%), Gaps = 73/396 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  I +GTP  S L+  D GS+L+WV C  C  C+    S         S + P  SSS
Sbjct: 88  YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPS---------SAFLPRHSSS 138

Query: 169 SKNVSCSHPLCK-----SRSSCK--SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
                C  P C+         C    L  PC ++  Y+ + + SSG+   +   L S S 
Sbjct: 139 FSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYA-DGSLSSGFFSKETTTLKSLSG 197

Query: 222 HAPQSSVQ-SSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
               S +    +  GCG + +G  + GA      GVMGLG G +S  S L +     N F
Sbjct: 198 ----SEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR--FGNKF 251

Query: 278 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----------------------Y 314
           S C  +              +   TSFL IG    +                       Y
Sbjct: 252 SYCLMDYT-----------LSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300

Query: 315 FVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 364
           ++ + S  I    L           Q     +VDSG + T+L    Y EV+    + V  
Sbjct: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360

Query: 365 KRISLQGNSWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 423
              +     +  C NAS E     +P +R        F      +     EG     +  
Sbjct: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRA 420

Query: 424 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           + +   + +IG     G  + FD+E  +L ++   C
Sbjct: 421 VESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 108/453 (23%), Positives = 185/453 (40%), Gaps = 91/453 (20%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           +S++ +     SVE +    S     Q T+ K Q   N++R  +         HF+    
Sbjct: 18  ISLSHALNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSI-----NRANHFYKTAL 72

Query: 107 --------------YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYT 152
                         Y + Y+   +GTP        D GS+++W     +QC P    Y  
Sbjct: 73  TNTPQSTVIPDHGEYLMTYS---VGTPPFKLYGIADTGSDIVW-----LQCEPCKECY-- 122

Query: 153 SLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 212
             ++   ++ PS SS+ KN+ CS  LCKS                         G L  D
Sbjct: 123 --NQTTPKFKPSKSSTYKNIPCSSDLCKS----------------------GQQGNLSVD 158

Query: 213 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 272
            L L S + H P S  ++  +IGCG   T S+ +GA+  G++GLG G  S+ + L  +  
Sbjct: 159 TLTLESSTGH-PISFPKT--VIGCGTDNTVSF-EGAS-SGIVGLGGGPASLITQLGSS-- 211

Query: 273 IQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGN 325
           I   FS C      + N +  + FGD    +       PI +K     Y++ +E++ +GN
Sbjct: 212 IDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGN 271

Query: 326 SCLTQSG-------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 378
             +   G          ++DSG + T +PT++Y  +     +LV  KR++     +  CY
Sbjct: 272 KRIEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCY 331

Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-----DYGII 433
           + +S+      D  +I +  +   V+ H  S   +    + CL   +T          I 
Sbjct: 332 SVTSDGY----DFPIITTHFKGADVKLHPISTFVDVADGIVCLAFATTSAFIPSDVVSIF 387

Query: 434 G----QNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           G    QN ++G    +D +   +++  + C +V
Sbjct: 388 GNLAQQNLLVG----YDLQQKIVSFKPTDCSKV 416


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 161/385 (41%), Gaps = 47/385 (12%)

Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
            G  F+ L Y   I IGTP  +F V  D GS+L WV     QC P + S Y    +    
Sbjct: 117 LGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWV-----QCKPCTDSCY---QQQEPL 168

Query: 161 YDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
           +DPS SS+  +V C  P CK       +C      C Y   Y  + + + G L  +   L
Sbjct: 169 FDPSKSSTYVDVPCGTPQCKIGGGQDLTCGGTT--CEYSVKYG-DQSVTRGNLAQEAFTL 225

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGL 272
                 +P +   + V+ GC  + + S + GA  +    G++GLG GD S+ S   + G 
Sbjct: 226 ------SPSAPPAAGVVFGCSHEYS-SGVKGAEEEMSVAGLLGLGRGDSSILS-QTRRGN 277

Query: 273 IQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNS 326
             + FS C     S  G +  G   P  Q + SF P+     +    Y V +    +  +
Sbjct: 278 SGDVFSYCLPPRGSSAGYLTIGAAAPP-QSNLSFTPLVTDNSQLSSVYVVNLVGISVSGA 336

Query: 327 CL--TQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNA 380
            L    S F    ++DSG   T +P   Y  +  +F + +    +  +G+  S   CY+ 
Sbjct: 337 ALPIDASAFYIGTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDV 396

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNH----IFSF-PENEGFTVFCLTVMSTD-GDYGIIG 434
           +  +++  P + L F       V       +F+     +  T+ CL  + T+   + IIG
Sbjct: 397 TGHDVVTAPPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIG 456

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKC 459
                 + +VFD E  ++ +  + C
Sbjct: 457 NMQQRAYNVVFDVEGRRIGFGANGC 481


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 155/367 (42%), Gaps = 46/367 (12%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IGTP       +D  ++ +W   QC  C P         +     +DPS SS+ K + CS
Sbjct: 95  IGTPPFQLYGVMDTANDNIWF--QCNPCKPC-------FNTTSPMFDPSKSSTYKTIPCS 145

Query: 176 HPLCKS--RSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
            P CK+   + C S  K  C Y   Y  E   S G L  D L L S +   P S    ++
Sbjct: 146 SPKCKNVENTHCSSDDKKVCEYSFTYGGE-AYSQGDLSIDTLTLNS-NNDTPISF--KNI 201

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSG 287
           +IGCG +  G  L+G    G +GLG G +S  S L  +  I   FS C      +E  SG
Sbjct: 202 VIGCGHRNKGP-LEGYV-SGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGISG 257

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDS 339
            + FGD+   +   T   PI      Y   + +  +G+  +          +    ++DS
Sbjct: 258 KLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDS 317

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
           G + T LP  +Y+ +      +V  +R       +K CY A+ +  L VP +   F+   
Sbjct: 318 GTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKN-LDVPIITAHFNGAD 376

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG-IIG----QNFMMGHRIVFDRENLKLAW 454
             +   + F   ++E   V C   +S     G IIG    QNF++G    FD +   +++
Sbjct: 377 VHLNSLNTFYPIDHE---VVCFAFVSVGNFPGTIIGNIAQQNFLVG----FDLQKNIISF 429

Query: 455 SHSKCEE 461
             + C +
Sbjct: 430 KPTDCTK 436


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 93/375 (24%), Positives = 158/375 (42%), Gaps = 57/375 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP V++   +D GS+L+W  C+ C++C           +++   +DPSSSS+   +
Sbjct: 106 MSIGTPAVAYAAIIDTGSDLVWTQCKPCVEC----------FNQSTPVFDPSSSSTYAAL 155

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
            CS  LC    S K     C Y   Y  + +S+ G L  +   LA         +    V
Sbjct: 156 PCSSTLCSDLPSSKCTSAKCGYTYTYG-DSSSTQGVLAAETFTLA--------KTKLPDV 206

Query: 233 IIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGS 288
             GCG    G  +  GA   G++GLG G +   SL+++ GL  N FS C    D+     
Sbjct: 207 AFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--NKFSYCLTSLDDTSKSP 258

Query: 289 VFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ---- 334
           +  G     ++        Q+T  +    +   Y+V ++   +G++ +T   S F     
Sbjct: 259 LLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDD 318

Query: 335 ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
                +VDSG S T+L  + Y  +   F   +        G     C+ A +  + +V  
Sbjct: 319 GTGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASGVDQVEV 378

Query: 391 MRLIF---SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
            +L+F     +      N++     + G    CLTVM + G   IIG       + V+D 
Sbjct: 379 PKLVFHLDGADLDLPAENYMV---LDSGSGALCLTVMGSRG-LSIIGNFQQQNIQFVYDV 434

Query: 448 ENLKLAWSHSKCEEV 462
               L+++  +C ++
Sbjct: 435 GENTLSFAPVQCAKL 449


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 114/428 (26%), Positives = 178/428 (41%), Gaps = 56/428 (13%)

Query: 61  YLELLLSNDWKRQKTRVKLQSNN-NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
           Y  +L   D K   T+ +L     + SR + L   + +       Q  +L    + IG P
Sbjct: 23  YRLVLTHVDSKGGYTKTELMRRAVHRSRLRALSGYDATSPRLHSVQVEYLM--ELAIGKP 80

Query: 120 NVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
            V F+   D GS+L W  CQ C  C P          ++   YDPS+SS+   + CS   
Sbjct: 81  PVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPLPCSSAT 130

Query: 179 CK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
           C    SR+   S    C Y   Y  +   S+G L  + L L   S  AP S     V  G
Sbjct: 131 CLPIWSRNCTPS--SLCRYRYAYG-DGAYSAGILGTETLTLGPSS--APVSV--GGVAFG 183

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDENDSGSVFF 291
           CG    G  L+     G +GLG G +   SLLA+ G+    FS C    F+         
Sbjct: 184 CGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNSALDSPFLL 235

Query: 292 GD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT----------QSGFQAL 336
           G       GP+T QST  L   +    YFV ++   +G+  L                 +
Sbjct: 236 GTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMI 295

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           VDSG +FT L    + EVV +  +++    ++        C+ A + E   +PD+ L F+
Sbjct: 296 VDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAP-CFPAPAGEPPYMPDLVLHFA 354

Query: 397 KNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH-RIVFDRENLKLAW 454
                 + R++  S+  NE  + FCL +  T  +   +  NF   + +++FD    +L++
Sbjct: 355 GGADMRLYRDNYMSY--NEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSF 412

Query: 455 SHSKCEEV 462
             + C ++
Sbjct: 413 LPTDCSKL 420


>gi|294461400|gb|ADE76261.1| unknown [Picea sitchensis]
          Length = 165

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 81/150 (54%), Gaps = 11/150 (7%)

Query: 2   VNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEY 61
           V+ +   +LF  +    S+  S+S ++ H+FS+E KE W++    +   D WP + S EY
Sbjct: 6   VSFIYSLILFTSLGFQNSNGQSYSLQMYHKFSNEVKE-WMTWRHGLD-TDGWPVEGSNEY 63

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
            + L  +D  R   ++       +    L F  EG++T     Q  +L Y+ + +GTPNV
Sbjct: 64  YKALYHHDSARHGRKL-------ADHPSLTF-LEGNETVEI-PQLGFLFYSMVQVGTPNV 114

Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYY 151
           +  VALD GS++ WVPC C  CAP SA+ Y
Sbjct: 115 TLFVALDTGSDVFWVPCDCQACAPTSAASY 144


>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
 gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
          Length = 358

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 70/257 (27%), Positives = 113/257 (43%), Gaps = 31/257 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +Y  ++IG P   + + +D GS+L W+ C   C  C  +    Y     +L         
Sbjct: 54  YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANSL--------- 104

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFSK 221
               V C++ LC +  S     + CP      Y   Y T+  SS G L++D     +FS 
Sbjct: 105 ----VPCANALCTALHSGHGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----NFSL 154

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               S+++  +  GCG  Q         AA DG++GLG G VS+ S L + G+ +N    
Sbjct: 155 PMRSSNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGH 214

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVD 338
           C   N  G +FFGD    T + T ++P+ +    Y+  G  +       L     + + D
Sbjct: 215 CLSTNGGGFLFFGDDIVPTSRVT-WVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEVVFD 273

Query: 339 SGASFTFLPTEIYAEVV 355
           SG+++T+   + Y  VV
Sbjct: 274 SGSTYTYFTAQPYQAVV 290


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 110/431 (25%), Positives = 171/431 (39%), Gaps = 61/431 (14%)

Query: 55  KKNSVEYLELLLSNDWK----RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
           K  S +++E+L  +  +      K   KL +N+ S       P++   T   GN     +
Sbjct: 79  KATSPDHVEILRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGN-----Y 133

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
              + +GTP     +  D GS+L W  CQ C++         T  D+    ++PS S+S 
Sbjct: 134 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR---------TCYDQKEPIFNPSKSTSY 184

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
            NVSCS   C S SS       C      Y   Y  + + S G+L  D   L S      
Sbjct: 185 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFLAKDKFTLTS------ 237

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
            S V   V  GCG    G +   A   G++GLG   +S PS  A A      FS C   +
Sbjct: 238 -SDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSS 291

Query: 285 DS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVGVESYCIGNSCLTQSG 332
            S  G + FG  G    +S  F PI    D          A  VG +   I ++  +  G
Sbjct: 292 ASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPG 349

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             AL+DSG   T LP + YA +   F   +S    +   +    C++ S  + + +P + 
Sbjct: 350 --ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVA 407

Query: 393 LIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRIVFDRE 448
             FS      +  +   ++F  ++     CL     S D +  I G        +V+D  
Sbjct: 408 FSFSGGAVVELGSKGIFYAFKISQ----VCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGA 463

Query: 449 NLKLAWSHSKC 459
             ++ ++ + C
Sbjct: 464 GGRVGFAPNGC 474


>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
          Length = 424

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 147/372 (39%), Gaps = 44/372 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           I+IG P   + + LD GS+L W+ C   C+ C              L    P    S+  
Sbjct: 61  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVHC--------------LEAPHPLYQPSNDL 106

Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 228
           + C+ PLCK+     + +   P   DY  E     SS G LV D+  L     +     +
Sbjct: 107 IPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDVFSL----NYTKGLRL 162

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
              + +GCG  Q          DGV+GLG G VS+ S L   G ++N    C      G 
Sbjct: 163 TPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSSLGGGI 222

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSCLTQSGFQALVDSGASFTF 345
           +FFG+    + +  S+ P+  +   ++   +G E    G           + DSG+S+T+
Sbjct: 223 LFFGNDLYDSSR-VSWTPMARENSKHYSPAMGGE-LLFGGRTTGLKNLLTVFDSGSSYTY 280

Query: 346 LPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS----SEEMLKVPDMRLIFSKNQ 399
             ++ Y  V     + +S K +  +   ++   C+       S E +K     L  S   
Sbjct: 281 FNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKT 340

Query: 400 SFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENL 450
            +  +  +F  P      +      CL +++       +  +IG   M    I++D E  
Sbjct: 341 GWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQ 399

Query: 451 KLAWSHSKCEEV 462
            + W  + C+E+
Sbjct: 400 SIGWIPADCDEI 411


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 92/368 (25%), Positives = 152/368 (41%), Gaps = 46/368 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++T + +G P   F + LD GS++ W+ CQ C  C       Y   D     +DP++SS+
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDC-------YQQTD---PIFDPTASST 210

Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
              V+C    C S   SSC+S +  C Y  +Y         Y   D    A+ S     S
Sbjct: 211 YAPVTCQSQQCSSLEMSSCRSGQ--CLYQVNY-----GDGSYTFGD---FATESVSFGNS 260

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
               +V +GCG    G +        V   GL  +    L     L   SFS C    DS
Sbjct: 261 GSVKNVALGCGHDNEGLF--------VGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDS 312

Query: 287 G---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGFQ------ 334
               ++ F          T+ L    K D  Y+VG+    +G   ++  +S F+      
Sbjct: 313 AGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGN 372

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              +VD G + T L T+ Y  +   F ++  + +++     +  CY+ S +  ++VP + 
Sbjct: 373 GGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVS 432

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             F+  +S+ +    +  P +   T +C     T     IIG     G R+ FD  N ++
Sbjct: 433 FHFADGKSWNLPAANYLIPVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRM 491

Query: 453 AWSHSKCE 460
            +S +KC+
Sbjct: 492 GFSPNKCQ 499


>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 873

 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 82/366 (22%), Positives = 161/366 (43%), Gaps = 39/366 (10%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           HY  + IG P     V LD GS L   PC +C+ C               +  DP   ++
Sbjct: 46  HYAELYIGIPPQRASVILDTGSGLTAFPCDKCVDCG--------------THTDPKFDAT 91

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            K+ S +   CK    C + +D    I    +E +     ++ D++ + +      +  +
Sbjct: 92  -KSTSINFVQCKYEEGCDTCRDNLCVIHQRYSEGSMWEAVVMQDLIWVGNVDSDRAEMIM 150

Query: 229 QSSVI---IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDEN 284
           +   I    GC  ++TG ++     +G+MGLG+G  ++ + + KA  ++ + F++CF + 
Sbjct: 151 RRYGIRFKFGCQTRETGLFI-TQVENGIMGLGIGRNNIATEMYKAKRVEEHKFALCFGQK 209

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT------QSGFQALV 337
               V  G          ++ P+ +   + Y + V+   IG   L       +SG  A+V
Sbjct: 210 GGSFVIGGVDYSHHTTKIAYTPLAKHGTSNYPIEVKDVRIGGISLQVDAEHFKSGRGAIV 269

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           DSG + T+ P+         F      KRI+ ++ N  K   N + E +  +P++ LI +
Sbjct: 270 DSGTTDTYFPSAAATPFQEAF------KRITGVEYNENKM--NLTPEMVETLPNVSLIIA 321

Query: 397 --KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
               + F +  +   +  N+    F  T+  ++    ++G + MMG+ ++FD E  ++ +
Sbjct: 322 GEDGEDFEISLNASDYILNDSNHHFFGTLHFSERRGAVLGASIMMGYDVIFDLEKKRVGF 381

Query: 455 SHSKCE 460
           + + C+
Sbjct: 382 AEATCD 387


>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
 gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
          Length = 426

 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 97/425 (22%), Positives = 166/425 (39%), Gaps = 74/425 (17%)

Query: 81  SNNNSSRNQLLFPSEGSQTHF------------FGNQF-YWLHYTWIDIGTPNVSFLVAL 127
           S++   R + +FP   + +              +GN +    +Y  + IG P   + +  
Sbjct: 25  SDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPKPYFLDP 84

Query: 128 DAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS-SSKNVSCSHPLCKSRSS 184
           D GS+L W+ C   C++C       Y   +  +   DP  +S       C HP       
Sbjct: 85  DTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNLVICKDPMCASLHPPGYKCEHP------- 137

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQT- 241
                + C Y  +Y+ +  SS G LV D+  L+  +  + AP+      + +GCG  Q  
Sbjct: 138 -----EQCDYEVEYA-DGGSSLGVLVKDVFPLNFTNGLRLAPR------LALGCGYDQIP 185

Query: 242 -GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
             SY      DGV+GLG G  S+ S L   G+I+N    C      G +FFGD    + +
Sbjct: 186 GQSY---HPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDDLYDSSR 242

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
                 + +++  Y  G     +G             DSG+S+T+L +  Y  +V    K
Sbjct: 243 VVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRK 302

Query: 361 LVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----- 413
            +S K  R +L   +   C+         V D++  F        +    SFP       
Sbjct: 303 ELSEKPVREALDDQTLPLCWRG-KRPFKSVRDVKKFF--------KPLALSFPGGGRTKT 353

Query: 414 ------EGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
                 E + +       CL +++       D+ +IG   M    +V+D E  ++ W+ +
Sbjct: 354 QYDIPLESYLIISLKGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPT 413

Query: 458 KCEEV 462
            C+ +
Sbjct: 414 NCDRL 418


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 108/414 (26%), Positives = 169/414 (40%), Gaps = 90/414 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++IGTP  +  V +D GS+L WVPC      CI C  L ++      ++ S + P  SSS
Sbjct: 15  LNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNL----KSSSIFSPLHSSS 70

Query: 169 SKNVSCSHPLCKSRSSCKSLKD-------------------PCPYIADYSTEDTSSSGYL 209
           S   SC+   C    S  +  D                   PCP  A    E    SG L
Sbjct: 71  SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGIL 130

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
             DIL   + ++  P+ S       GC    T +Y +   P G+ G G G +S+PS L  
Sbjct: 131 TRDILK--ARTRDVPRFS------FGC---VTSTYHE---PIGIAGFGRGLLSLPSQL-- 174

Query: 270 AGLIQNSFSICF-------DENDSGSVFFGDQGPATQ-----QSTSFLPIGEKYDAYFVG 317
            G ++  FS CF       + N S  +  G    +       Q T  L      ++Y++G
Sbjct: 175 -GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIG 233

Query: 318 VESYCIGNSC------LTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
           +ES  IG +       LT   F +      LVDSG ++T LP   Y++++      ++  
Sbjct: 234 LESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYP 293

Query: 366 RI--SLQGNSWKYCY-------NASSEE---MLKVPDMRLIFSKNQSFVVRN----HIFS 409
           R   +     +  CY       N +S E   M+  P +   F  N + ++      +  S
Sbjct: 294 RATETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMS 353

Query: 410 FPENEGFTVFCLTVMST-DGDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            P ++G  V CL   +  DG+Y   G+ G       ++V+D E  ++ +    C
Sbjct: 354 AP-SDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 406


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 166/367 (45%), Gaps = 45/367 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ I +GTP     + LD GS++ W     IQC P S  Y     ++   ++P+SSS+ 
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNW-----IQCEPCSDCY----QQSDPVFNPTSSSTY 212

Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           K+++CS P C     S+C+S K  C Y   Y  + + + G L  D +   +  K      
Sbjct: 213 KSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGNSGKI----- 264

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
             + V +GCG    G +       G++GLG G +S+ + +        SFS C  + DSG
Sbjct: 265 --NDVALGCGHDNEGLF---TGAAGLLGLGGGALSITNQMKA-----TSFSYCLVDRDSG 314

Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQ---------SGFQ 334
              S+ F      +  +T+ L   +K D  Y+VG+  + +G   +           SG  
Sbjct: 315 KSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSG 374

Query: 335 ALV-DSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYNASSEEMLKVPDMR 392
            ++ D G + T L T+ Y  +   F KL ++ K+ +   + +  CY+ SS   +KVP + 
Sbjct: 375 GVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVA 434

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             F+  +S  +    +  P ++  T FC     T     IIG     G RI +D  N  +
Sbjct: 435 FHFTGGKSLDLPAKNYLIPVDDNGT-FCFAFAPTSSSLSIIGNVQQQGTRITYDLANKII 493

Query: 453 AWSHSKC 459
             S +KC
Sbjct: 494 GLSGNKC 500


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 116/473 (24%), Positives = 186/473 (39%), Gaps = 80/473 (16%)

Query: 3   NLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
           N+V +  LF  + +  +    FS  L+HR  D     +             P K   E L
Sbjct: 11  NVVVVGFLFQLLEVALARGGGFSVDLIHR--DSPHSPFFD-----------PSKTQAERL 57

Query: 63  ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
               ++ ++R  +RV                S+G Q+    +   +L   +I  GTP V 
Sbjct: 58  ----TDAFRRSVSRV-------GRFRPTAMTSDGIQSRIVPSAGEYLMNLYI--GTPPVP 104

Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC--- 179
            +  +D GS+L W      QC P +  Y     + +  +DP +SS+ ++ SC    C   
Sbjct: 105 VIAIVDTGSDLTWT-----QCRPCTHCY----KQVVPLFDPKNSSTYRDSSCGTSFCLAL 155

Query: 180 -KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 238
            K RS  K  K  C +   Y+ + + + G L  + L + S    A +         GCG 
Sbjct: 156 GKDRSCSKEKK--CTFRYSYA-DGSFTGGNLASETLTVDS---TAGKPVSFPGFAFGCGH 209

Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGD 293
              G +    +  G++GLG G++S+ S L     I   FS C      D + S  + FG 
Sbjct: 210 SSGGIF--DKSSSGIVGLGGGELSLISQLKST--INGLFSYCLLPVSTDSSISSRINFGA 265

Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAE 353
            G  +   T   P+   Y  Y    E          + G   +VDSG ++TFLP E Y++
Sbjct: 266 SGRVSGYGTVSTPLRLPYKGYSKKTE---------VEEG-NIIVDSGTTYTFLPQEFYSK 315

Query: 354 VVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPE 412
           +       +  KR+      +  CYN ++E  +  P +   F   N      N      E
Sbjct: 316 LEKSVANSIKGKRVRDPNGIFSLCYNTTAE--INAPIITAHFKDANVELQPLNTFMRMQE 373

Query: 413 NEGFTVFCLTVMSTDGDYGIIGQ----NFMMGHRIVFDRENLKLAWSHSKCEE 461
           +    + C TV  T  D G++G     NF++G    FD    +     ++ EE
Sbjct: 374 D----LVCFTVAPTS-DIGVLGNLAQVNFLVG----FDLRKKRGFSKKAEVEE 417



 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 15/129 (11%)

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           +VDSG ++T+LP E Y ++       +  KR+         CYN + ++ +  P +   F
Sbjct: 421 IVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQ-IDAPIITAHF 479

Query: 396 S-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ----NFMMGHRIVFDRENL 450
              N      N      E+    + C TV+ T  D GI+G     NF++G    FD    
Sbjct: 480 KDANVELQPWNTFLRMQED----LVCFTVLPTS-DIGILGNLAQVNFLVG----FDLRKK 530

Query: 451 KLAWSHSKC 459
           ++++  + C
Sbjct: 531 RVSFKAADC 539


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 162/383 (42%), Gaps = 59/383 (15%)

Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
           FG+  Y++    + IG+P     + +D GS++ W     IQC+P  + Y     +N + +
Sbjct: 9   FGSGEYFVR---VGIGSPTKLQYLVMDTGSDVPW-----IQCSPCKSCY----KQNDAVF 56

Query: 162 DPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           DP +SSS + +SCS P CK     +C S  + C Y   Y  + + + G L  D   L S 
Sbjct: 57  DPRASSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYG-DGSFTVGDLASDSF-LVSR 114

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
            + +P       V+ GCG    G ++  A   G+    L   S PS L+        FS 
Sbjct: 115 GRTSP-------VVFGCGHDNEGLFVGAAGLLGLGAGKL---SFPSQLSS-----RKFSY 159

Query: 280 CFDENDSG-----SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT-- 329
           C    D+G     ++ FGD    T  S ++  +    K D  Y+ G+    IG + L+  
Sbjct: 160 CLVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIP 219

Query: 330 QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKY 376
            + F+          ++DSG S T LPT  Y  +   F     KL  +   SL    +  
Sbjct: 220 STAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL----FDT 275

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 436
           CY+ S+   + +P +   F    S  +    +  P +   T FC     T  D  IIG  
Sbjct: 276 CYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGT-FCFAFSKTSLDLSIIGNI 334

Query: 437 FMMGHRIVFDRENLKLAWSHSKC 459
                R+  D ++ ++ ++  +C
Sbjct: 335 QQQTMRVAIDLDSSRVGFAPRQC 357


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 105/432 (24%), Positives = 179/432 (41%), Gaps = 63/432 (14%)

Query: 65  LLSNDWKRQKTR-VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LL     R K R  +L S   +S         GS T    +  Y +H   + IGTP    
Sbjct: 72  LLHRMAARSKARSARLLSGRAASARV----DPGSYTDGVPDTEYLVH---MAIGTPPQPV 124

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK--S 181
            + LD GS+L W      QCAP  + +  SL R    ++PS S +   + C   +C+  +
Sbjct: 125 QLILDTGSDLTWT-----QCAPCVSCFRQSLPR----FNPSRSMTFSVLPCDLRICRDLT 175

Query: 182 RSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 238
            SSC         C Y   Y+ + + ++G+L  D    AS + HA   +    +  GCG 
Sbjct: 176 WSSCGEQSWGNGICVYAYAYA-DHSITTGHLDSDTFSFAS-ADHAIGGASVPDLTFGCGL 233

Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFG--- 292
              G ++      G+ G   G +S+P     A L  ++FS CF     ++   VF G   
Sbjct: 234 FNNGIFVSNET--GIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPSPVFLGVPP 286

Query: 293 -------DQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNS---------CLTQSGFQA 335
                    G    QST+ +     +  AY++ ++   +G +          L + G   
Sbjct: 287 NLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGG 346

Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDM 391
            +VDSG   T LP  +Y  V    D  V+  ++++  ++    + C++        VP +
Sbjct: 347 TIVDSGTGMTMLPEAVYNLVC---DAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPAL 403

Query: 392 RLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
            L F      + R N++F   E  G  + CL + + + D  +IG        +++D  N 
Sbjct: 404 VLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMHVLYDLAND 462

Query: 451 KLAWSHSKCEEV 462
            L++  ++C ++
Sbjct: 463 MLSFVPARCNKI 474


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 154/365 (42%), Gaps = 44/365 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ + +G P+  F + LD GS++ W     +QC P S  Y     ++   +DP++SSS 
Sbjct: 157 YFSRVGVGQPSKPFYMVLDTGSDVNW-----LQCKPCSDCY----QQSDPIFDPTASSSY 207

Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             ++C    C+    S+C++ K  C Y   Y     +   Y+ + +    SF   +    
Sbjct: 208 NPLTCDAQQCQDLEMSACRNGK--CLYQVSYGDGSFTVGEYVTETV----SFGAGS---- 257

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
             + V IGCG    G +        V   GL  +    L   + +   SFS C  + DSG
Sbjct: 258 -VNRVAIGCGHDNEGLF--------VGSAGLLGLGGGPLSLTSQIKATSFSYCLVDRDSG 308

Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT---------QSGFQA 335
              ++ F    P        L   +    Y+V +    +G   +T         QSG   
Sbjct: 309 KSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGG 368

Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
            +VDSG + T L T+ Y  V   F +  S+ R +     +  CY+ SS + ++VP +   
Sbjct: 369 VIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSFH 428

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
           FS ++++ +    +  P  +G   +C     T     IIG     G R+ FD  N  + +
Sbjct: 429 FSGDRAWALPAKNYLIPV-DGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGF 487

Query: 455 SHSKC 459
           S +KC
Sbjct: 488 SPNKC 492


>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
          Length = 538

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 160/380 (42%), Gaps = 48/380 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +YT + IG P   + + +D GS+L W+ C   C  CA      Y     N+    P   S
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVV---PPRDS 215

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             + +  +     +   C        Y   Y+ + +SS G L  D + L +    A    
Sbjct: 216 YCQELQGNQNYGDTSKQCD-------YEITYA-DRSSSMGILARDNMQLIT----ADGER 263

Query: 228 VQSSVIIGCGRKQTGSYLDGAA-PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
                + GCG  Q G+ L   A  DG++GL    +S+P+ LA  G+I N F  C   D +
Sbjct: 264 ENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLT---QSG--FQALVD 338
           + G +F GD     +   +++PI     + Y   V+    G+  L    ++G   Q + D
Sbjct: 324 NGGYMFLGDDY-VPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFD 382

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           SG+S+T+LP + Y  ++     L  S        +  +C   +   +  + D++ +F K 
Sbjct: 383 SGSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPFCMKPNF-PVRSMDDVKHLF-KP 440

Query: 399 QSFVVRNHIFSFPEN-----EGFTV------FCLTVMSTDG-DYG-----IIGQNFMMGH 441
            S V +  +F  P       E + +       CL V+  DG + G     +IG   + G 
Sbjct: 441 LSLVFKKRLFILPRTFVIPPEDYLIISDKNNICLGVL--DGTEIGHDSAIVIGDVSLRGK 498

Query: 442 RIVFDRENLKLAWSHSKCEE 461
            +V++ +  ++ W  S C +
Sbjct: 499 LVVYNNDEKQIGWVQSDCAK 518


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 88/365 (24%), Positives = 152/365 (41%), Gaps = 55/365 (15%)

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
            IGTP  + L+A+D  ++  W+PC  C+ C   S++ + ++           S++ K V 
Sbjct: 101 KIGTPAQTMLLAMDTSNDAAWIPCSGCVGC---SSTVFNNVK----------STTFKTVG 147

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C  P CK   + K     C +   Y +   +++  L  D++ LA+ S          S  
Sbjct: 148 CEAPQCKQVPNSKCGGSACAFNMTYGSSSIAAN--LSQDVVTLATDSI--------PSYT 197

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSV 289
            GC  + TGS +    P G++GLG G +S+  L     L Q++FS C       N SGS+
Sbjct: 198 FGCLTEATGSSIP---PQGLLGLGRGPMSL--LSQTQNLYQSTFSYCLPSFRSLNFSGSL 252

Query: 290 FFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVD 338
             G  G P   ++T  L    +   Y+V + +  +G   +            +G   + D
Sbjct: 253 RLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFD 312

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           SG  FT L    Y  V   F K V +  ++  G  +  CY +     +  P +  +FS  
Sbjct: 313 SGTVFTRLVAPAYTAVRDAFRKRVGNATVTSLGG-FDTCYTSP----IVAPTITFMFSGM 367

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDRENLKLAW 454
              +  +++     +   ++ CL + +   +      +I       HRI+FD  N +L  
Sbjct: 368 NVTLPPDNLLI--HSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGV 425

Query: 455 SHSKC 459
           +   C
Sbjct: 426 AREPC 430


>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
 gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 538

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 160/380 (42%), Gaps = 48/380 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +YT + IG P   + + +D GS+L W+ C   C  CA      Y     N+    P   S
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVV---PPRDS 215

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             + +  +     +   C        Y   Y+ + +SS G L  D + L +    A    
Sbjct: 216 YCQELQGNQNYGDTSKQCD-------YEITYA-DRSSSMGILARDNMQLIT----ADGER 263

Query: 228 VQSSVIIGCGRKQTGSYLDGAA-PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
                + GCG  Q G+ L   A  DG++GL    +S+P+ LA  G+I N F  C   D +
Sbjct: 264 ENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLT---QSG--FQALVD 338
           + G +F GD     +   +++PI     + Y   V+    G+  L    ++G   Q + D
Sbjct: 324 NGGYMFLGDDY-VPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFD 382

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           SG+S+T+LP + Y  ++     L  S        +  +C   +   +  + D++ +F K 
Sbjct: 383 SGSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPFCMKPNF-PVRSMDDVKHLF-KP 440

Query: 399 QSFVVRNHIFSFPEN-----EGFTV------FCLTVMSTDG-DYG-----IIGQNFMMGH 441
            S V +  +F  P       E + +       CL V+  DG + G     +IG   + G 
Sbjct: 441 LSLVFKKRLFILPRTFVIPPEDYLIISDKNNICLGVL--DGTEIGHDSAIVIGDVSLRGK 498

Query: 442 RIVFDRENLKLAWSHSKCEE 461
            +V++ +  ++ W  S C +
Sbjct: 499 LVVYNNDEKQIGWVQSDCAK 518


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 105/404 (25%), Positives = 170/404 (42%), Gaps = 53/404 (13%)

Query: 83  NNSSRNQLL----FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
           +N  R + L    FP +G+ +         L+YT I +G P     V +D GS++LWV C
Sbjct: 58  HNDRRGRFLQGISFPLKGNYSDL------GLYYTEIGLGNPVQKLKVIVDTGSDILWVKC 111

Query: 139 Q-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLKDP 191
             C  C  LS      +   LS Y+ S+SS+S   SCS PLC       SRS   S    
Sbjct: 112 SPCRSC--LSKQ---DIIPPLSIYNLSASSTSSVSSCSDPLCTGEEVVCSRSGNNS---A 163

Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
           C Y++ Y  +  S   Y+ DD+ ++     H   ++  S +  GC    TGS+      D
Sbjct: 164 CAYVSSYQDKSASVGAYVRDDMHYVL----HGGNATT-SRIFFGCATNITGSW----PVD 214

Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIGEK 310
           G+MG GL   +VP+ +A    +   FS C   E   G +    + P T +   F P+   
Sbjct: 215 GIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEAPNTTEMV-FTPLLNV 273

Query: 311 YDAYFVGVESYCIGNSCL------------TQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
              Y V + S  + +  L            + +    ++DSG +F  L T+    +  + 
Sbjct: 274 TTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDSGTTFVLLTTKANRMLFQEI 333

Query: 359 DKLVSSK-RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEG 415
             L ++K    L+G    Y  +  + E    P++ L FS   +  ++  N++      + 
Sbjct: 334 KSLTTAKLGPKLEGLECFYLKSGLTMET-SFPNVTLTFSGGSTMKLKPDNYLVMAEYKKK 392

Query: 416 FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              +C    S DG   I G+  +    + +D EN ++ W    C
Sbjct: 393 RNGYCYAWSSADG-LTIFGEIVLKDKLVFYDVENRRIGWKGQNC 435


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 91/368 (24%), Positives = 151/368 (41%), Gaps = 45/368 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +++ + IG+P     + LD GS++ WV CQ C  C       Y   D     +DPS S+S
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSAS 218

Query: 169 SKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
              VSC  P C+    ++C++    C Y   Y  + + + G    + L L         S
Sbjct: 219 YAAVSCDSPRCRDLDTAACRNATGACLYEVAYG-DGSYTVGDFATETLTLG-------DS 270

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
           +  ++V IGCG    G ++  A    + G  L   S PS ++      ++FS C  + DS
Sbjct: 271 TPVTNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDS 322

Query: 287 ---GSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL-----------TQS 331
               ++ FG  G      T+ L    +    Y+V +    +G   L           T  
Sbjct: 323 PAASTLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSG 382

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
               +VDSG + T L +  YA +   F +   S   +   + +  CY+ S    ++VP +
Sbjct: 383 SGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAV 442

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
            L F    +  +    +  P  +G   +CL    T+    IIG     G R+ FD     
Sbjct: 443 SLRFEGGGALRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGV 501

Query: 452 LAWSHSKC 459
           + ++ +KC
Sbjct: 502 VGFTPNKC 509


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 156/389 (40%), Gaps = 77/389 (19%)

Query: 127 LDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS- 181
           +D GS+L+WVPC     CI C   SAS    L        P  SSS   V+C+   CK+ 
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPEDSASNGVFL--------PRMSSSLHLVTCADSNCKTL 52

Query: 182 ------------RSSCKSLKDPC-PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
                         S K+  + C PY   Y     S++G L+ + L+L    ++   +  
Sbjct: 53  YGNNTELLCQSCAGSLKNCSETCPPYGIQYGR--GSTAGLLLTETLNLP--LENGEGARA 108

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC-----FDE 283
            +   +GC      S +    P G+ G G G +S+PS L +  + ++ F+ C     FDE
Sbjct: 109 ITHFAVGC------SIVSSQQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSHRFDE 161

Query: 284 NDSGSVF-FGDQGPATQQSTSFLPIGEKYDA---------YFVGVESYCIGNSCL----- 328
            +  S+   GD+        ++ P      A         Y++G+    IG   L     
Sbjct: 162 ENKKSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPS 221

Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNA 380
                 T+     ++DSG +FT    EI+  +   F   +  +R            CY+ 
Sbjct: 222 KLLRFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDV 281

Query: 381 SSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTDG----DYG--- 431
           +  E + +P+    F      V  V N+   F     F   CLT++S+ G    D G   
Sbjct: 282 TGLENIVLPEFAFHFKGGSDMVLPVANYFSYFSS---FDSICLTMISSRGLLEVDSGPAV 338

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           I+G +      +++DRE  +L ++   C+
Sbjct: 339 ILGNDQQQDFYLLYDREKNRLGFTQQTCK 367


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 160/389 (41%), Gaps = 62/389 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +  GTP    L+  D GS+L+W+ C      P          R    +  S S++   V 
Sbjct: 57  MAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATLSVVP 114

Query: 174 CSHPLC--------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           CS   C           +   +   PC Y  DY+ + +S++G+L  D    A+ S     
Sbjct: 115 CSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYA-DGSSTTGFLARDT---ATISNGTSG 170

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDEN 284
            +    V  GCG +  G    G    GV+GLG G +S P   A++G L   +FS C  + 
Sbjct: 171 GAAVRGVAFGCGTRNQGGSFSGTG--GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDL 225

Query: 285 DSGS-------VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQ 334
           + G        +F G   P  + + ++ P+     A   Y+VGV +  +GN  L   G +
Sbjct: 226 EGGRRGRSSSFLFLGR--PERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSE 283

Query: 335 ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-----LQGNSWKYCYN 379
                      ++DSG++ T+L    Y  +V  F   V   RI       QG   + CYN
Sbjct: 284 WAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG--LELCYN 341

Query: 380 ASSEEMLK-----VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYG- 431
            SS           P + + F++  S  +   N++    ++    V CL +  T   +  
Sbjct: 342 VSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADD----VKCLAIRPTLSPFAF 397

Query: 432 -IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            ++G     G+ + FDR + ++ ++ ++C
Sbjct: 398 NVLGNLMQQGYHVEFDRASARIGFARTEC 426


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 165/368 (44%), Gaps = 50/368 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ + IG P     + LD GS++ WV     QCAP +  Y    ++    ++P+SS+S 
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWV-----QCAPCAECY----EQTDPXFEPTSSASF 201

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
            ++SC    CKS    +     C Y   Y  + + + G  V + + L S S         
Sbjct: 202 TSLSCETEQCKSLDVSECRNGTCLYEVSYG-DGSYTVGDFVTETVTLGSTSL-------- 252

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
            ++ IGCG    G ++  A    ++GLG G +S PS L  +     SFS C  + DS S 
Sbjct: 253 GNIAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYCLVDRDSDST 304

Query: 290 FFGD-QGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCLT--QSGFQA--------L 336
              D   P T  + T+ L      D +F +G+    +G + L   ++ FQ         +
Sbjct: 305 STLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGII 364

Query: 337 VDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
           VDSG + T L T +Y  +   F K    L +++ ++L    +  CY+ SS+  ++VP + 
Sbjct: 365 VDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVAL----FDTCYDLSSKSRVEVPTVS 420

Query: 393 LIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
             F+      +    +  P ++EG   FC     TD    I+G     G R+ FD  N  
Sbjct: 421 FHFANGNELPLPAKNYLIPVDSEG--TFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSL 478

Query: 452 LAWSHSKC 459
           + +S +KC
Sbjct: 479 VGFSPNKC 486


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 109/420 (25%), Positives = 169/420 (40%), Gaps = 46/420 (10%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQ-LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LL +D  R  +  ++ +N  +   Q +  P+E   +   GN     +   + +GTP    
Sbjct: 44  LLEHDQARVDSIHRMIANETAVVGQDVSLPAERGISVGTGN-----YVVSVGLGTPARDL 98

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-KSR 182
            V  D GS+L WV  QC  C+  S   Y   D     + PSSSS+   V C  P C ++R
Sbjct: 99  TVVFDTGSDLSWV--QCGPCS--SGGCYHQQD---PLFAPSSSSTFSAVRCGEPECPRAR 151

Query: 183 SSCKSL--KDPCPYIADYSTEDTSSSGYLVDDILHLASF-SKHAPQ--SSVQSSVIIGCG 237
            SC S    D CPY   Y  + + + G+L +D L L +  S +A +  S+     + GCG
Sbjct: 152 QSCSSSPGDDRCPYEVVYG-DKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCG 210

Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQ 294
              TG  L G A DG+ GLG G VS+ S    AG     FS C      N  G +  G  
Sbjct: 211 ENNTG--LFGKA-DGLFGLGRGKVSLSS--QAAGKYGEGFSYCLPSSSSNAHGYLSLGTP 265

Query: 295 GPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQAL------VDSGASFTF 345
            PA   +  F P+  + +    Y+V +    +    +  S   AL      VDSG   T 
Sbjct: 266 APAPAHA-RFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDSGTVITR 324

Query: 346 LPTEIYAEVVVKFDKLVS------SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
           L    Y+ +   F   +       + R+S+      Y + A +   + +P + L+F+   
Sbjct: 325 LAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTC--YDFTAHANATVSIPAVALVFAGGA 382

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +  V      +                    GI+G        +V+D    K+ ++   C
Sbjct: 383 TISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/374 (24%), Positives = 155/374 (41%), Gaps = 47/374 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  + +G+P   + + +D GS+  W     +QC P +   +   D     ++PS+S + 
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSW-----LQCQPCTIYCHIQED---PVFNPSASKTY 154

Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           K V CS   C        +  +C    + C Y A Y  + + S GYL  D+L L      
Sbjct: 155 KTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYG-DSSFSLGYLSQDVLTLT----- 208

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
              S   SS + GCG+   G +      DG++GL   ++S+ S L  +G   N+FS C  
Sbjct: 209 --PSQTLSSFVYGCGQDNQGLF---GRTDGIIGLANNELSMLSQL--SGKYGNAFSYCLP 261

Query: 282 ------DENDSGSVFFGDQGPATQQSTSFLPIGEKYD---AYFVGVESYCIGNSCLTQSG 332
                 +    G +  G        S  F P+ +  +    YF+ +ES  +    L  + 
Sbjct: 262 TSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAA 321

Query: 333 ----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLK 387
                  ++DSG   T LPT +Y  +   +  ++S K     G S    C+  S   + +
Sbjct: 322 SSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISE 381

Query: 388 V-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           V PD+R+IF       ++ H        G T  CL  M+      IIG       ++ +D
Sbjct: 382 VAPDIRIIFKGGADLQLKGHNSLVELETGIT--CL-AMAGSSSIAIIGNYQQQTVKVAYD 438

Query: 447 RENLKLAWSHSKCE 460
             N ++ ++   C+
Sbjct: 439 VGNSRVGFAPGGCQ 452


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 101/383 (26%), Positives = 163/383 (42%), Gaps = 59/383 (15%)

Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
           FG+  Y++    + IG+P     + +D GS++ W     IQC+P  + Y     +N + +
Sbjct: 9   FGSGEYFVR---VGIGSPTKLQYLVMDTGSDVPW-----IQCSPCKSCY----KQNDAVF 56

Query: 162 DPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           DP +SSS + +SCS P CK     +C S  + C Y   Y  + + + G L  D     SF
Sbjct: 57  DPRASSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYG-DGSFTVGDLASD-----SF 110

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
           S    ++   S V+ GCG    G ++  A   G+    L   S PS L+        FS 
Sbjct: 111 SVSRGRT---SPVVFGCGHDNEGLFVGAAGLLGLGAGKL---SFPSQLSS-----RKFSY 159

Query: 280 CFDENDSG-----SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT-- 329
           C    D+G     ++ FGD    T  S ++  +    K D  Y+ G+    IG + L+  
Sbjct: 160 CLVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIP 219

Query: 330 QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKY 376
            + F+          ++DSG S T LPT  Y  +   F     KL  +   SL    +  
Sbjct: 220 STAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL----FDT 275

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 436
           CY+ S+   + +P +   F    S  +    +  P +   T FC     T  D  IIG  
Sbjct: 276 CYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGT-FCFAFSKTSLDLSIIGNI 334

Query: 437 FMMGHRIVFDRENLKLAWSHSKC 459
                R+  D ++ ++ ++  +C
Sbjct: 335 QQQTMRVAIDLDSSRVGFAPRQC 357


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 101/383 (26%), Positives = 162/383 (42%), Gaps = 56/383 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP V F+   D GS+L W  C+ C  C P          ++   YD ++S+S   V
Sbjct: 99  LAIGTPPVPFVALADTGSDLTWTQCKPCKLCFP----------QDTPIYDTAASASFSPV 148

Query: 173 SCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQS 226
            C+   C      SR+   +   PC Y   Y+ +D + S+G L  + L  A  S  AP  
Sbjct: 149 PCASATCLPIWRSSRNCTATTTSPCRY--RYAYDDGAYSAGVLGTETLTFAGSSPGAPGP 206

Query: 227 SVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----F 281
            V    V  GCG    G   +     G +GLG G +   SL+A+ G+    FS C    F
Sbjct: 207 GVSVGGVAFGCGVDNGGLSYNST---GTVGLGRGSL---SLVAQLGV--GKFSYCLTDFF 258

Query: 282 DENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 329
           + +    V FG           G A  QST  +        Y+V +E   +G++ L    
Sbjct: 259 NTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPN 318

Query: 330 -------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWKYCYNAS 381
                        +VDSG  FT L    +  VV     +++   ++    +S  +   A 
Sbjct: 319 GTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSLDSPCFPATAG 378

Query: 382 SEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
            +++  +PDM L F+      + R++  SF  N+  + FCL +      YG I  NF   
Sbjct: 379 EQQLPDMPDMLLHFAGGADMRLHRDNYMSF--NQESSSFCLNIAGAPSAYGSILGNFQQQ 436

Query: 441 H-RIVFDRENLKLAWSHSKCEEV 462
           + +++FD    +L++  + C ++
Sbjct: 437 NIQMLFDITVGQLSFVPTDCSKL 459


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 150/361 (41%), Gaps = 52/361 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +GTP V  L+ALD  S+L W+ CQ C +C P S             +DP  S+S + +
Sbjct: 142 IAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPV----------FDPRHSTSYREM 191

Query: 173 SCSHPLCKS--RSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           S +   C++  RS     K   C Y   Y  + +++ G  +++ L  A      P+ S  
Sbjct: 192 SFNAADCQALGRSGGGDAKRGTCVYTVGYG-DGSTTVGDFIEETLTFAG-GVRLPRIS-- 247

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG-- 287
               IGCG    G  L GA   G++GLG G +S P+ +   G    +FS C  +  SG  
Sbjct: 248 ----IGCGHDNKG--LFGAPAAGILGLGRGLMSFPNQIDHNG----TFSYCLVDFLSGPG 297

Query: 288 ----SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGN---SCLTQSGFQ--- 334
               ++ FG     T    SF P     +    Y+V +    +G      +T+   Q   
Sbjct: 298 SLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDP 357

Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNS--WKYCYNASSEEM 385
                  +VDSG + T L    Y      F  + V   ++S+ G S  +  CY      M
Sbjct: 358 YTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGM 417

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
            KVP + + F+ +    ++   +  P +   TV      + D    IIG     G RIV+
Sbjct: 418 KKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVY 477

Query: 446 D 446
           D
Sbjct: 478 D 478


>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
 gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
          Length = 478

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 87/377 (23%), Positives = 158/377 (41%), Gaps = 60/377 (15%)

Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
           +F + +D GS+  ++PC+ C  C    A  Y         YD  +S+    V CS   C 
Sbjct: 46  TFELIVDTGSSRTYLPCKGCASCGAHEAGRY---------YDYDASADFSRVECS--ACA 94

Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
                      C Y   Y  E + S GYLV D++ L         S   ++V+ GC  ++
Sbjct: 95  GIGGKCGTSGVCRYDVHY-LEGSGSEGYLVRDVVSLGG-------SVGNATVVFGCEERE 146

Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS------------GS 288
            GS +   + DG+ G G    ++ + LA A +I + FS+C +  +             G+
Sbjct: 147 LGS-IKQQSADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGN 205

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTFLP 347
             FG   PA      + P+      Y V   S+ +GNS +  S G   ++DSG S+T++P
Sbjct: 206 FDFGADAPAL----VYTPMVSSAMYYQVTTTSWTLGNSVVEGSRGVLTIIDSGTSYTYVP 261

Query: 348 TEIYAEVVVKFDKLV--SSKRISLQ-------------GNSWKYCYNASSEEMLKVPDMR 392
             ++A    +F +L   +++   L+             GNS    ++  SE     P ++
Sbjct: 262 GNMHA----RFLQLAEDAARESGLEKVAPPEDYPDLCFGNSGGLGWSTVSEYF---PALK 314

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
           + +  +    +    + +   +  + FC+ ++  D +  ++GQ  M      FD    ++
Sbjct: 315 IEYHGSARLTLSPETYLYWHQKNASAFCVGILEHDDNRILLGQITMRNTFTEFDVARSQV 374

Query: 453 AWSHSKCEEVIDKSHVH 469
             + + CE + +K   H
Sbjct: 375 GMASANCEMLREKYVEH 391


>gi|325183198|emb|CCA17656.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
          Length = 656

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 111/461 (24%), Positives = 194/461 (42%), Gaps = 62/461 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           HY WI +GTP     + +D GS +   PC  C QC       +T +      ++ + SSS
Sbjct: 95  HYAWIYVGTPPQRVSIIIDTGSGMTAFPCSGCDQCGN-----HTDI-----PFNTNLSSS 144

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---ASFSKHAPQ 225
            + +SC+H    S + C +  +PC        E +S S  +++DI++L   AS       
Sbjct: 145 IQPISCNHRTYFSCAYCTNPTEPCRTY----MEGSSWSAKVMEDIVYLGDVASAKDTNLH 200

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGDVSVPSLLAKAGLIQNSFSICFDEN 284
            S  +  + GC  K+TG ++   A DG+MG+   G+  V  L  +  +  N+F++CF   
Sbjct: 201 HSYSTRYMFGCQNKETGLFIPQVA-DGIMGIHNNGNDIVTKLFREKKIPSNTFTLCFSPR 259

Query: 285 DSGSVFFGDQGPATQQ-STSFLPI----GEKYDAYF---VGVESYCIGNSCLTQSGFQAL 336
             G    G    +      ++  I    GE Y A F   + V  + I       + ++ +
Sbjct: 260 -GGYFALGAMDTSRHAGEVTYARINDAYGENYYAVFMTDIRVGGHSIDIDMKATNSYRYI 318

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           VDSG + + +       ++  +  L   K   L  N    C   S  ++ ++P ++ +  
Sbjct: 319 VDSGTTNSIISGRAGQALMDLYRNLTHLKN-PLNDND---CILLSPSQIEQLPTLQFVME 374

Query: 397 -----KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
                +    ++ +      EN   T F + V  T    G+IG + MM H ++FDR   K
Sbjct: 375 GVNGDRAILEILASQYLQKGENNK-TCFNILV-DTRKIGGVIGASMMMNHDVIFDRSQNK 432

Query: 452 LAWSHSKCEEVID---KSHVHLVP--------PPAGQSPNP--LPTTEQQSTSNGQAAAP 498
           + +  + C    D    SH + +P        P + QS N       E++  SN     P
Sbjct: 433 VGFVPANCTFAGDTEPNSHKNAIPSDDANGALPVSKQSNNKSNENAEEKKGLSNDTHTDP 492

Query: 499 PSTAKTAPS-----KSIAASAQQLDS----VLRVACSLLVL 530
                ++PS     KS     Q+++     ++++  +LLVL
Sbjct: 493 VVEPVSSPSLEGETKSANVKLQEVEKERPIIVKLVGTLLVL 533


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/374 (24%), Positives = 155/374 (41%), Gaps = 47/374 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  + +G+P   + + +D GS+  W     +QC P +   +   D     ++PS+S + 
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSW-----LQCQPCTIYCHIQED---PVFNPSASKTY 154

Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           K V CS   C        +  +C    + C Y A Y  + + S GYL  D+L L      
Sbjct: 155 KTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYG-DSSFSLGYLSQDVLTLT----- 208

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
              S   SS + GCG+   G +      DG++GL   ++S+ S L  +G   N+FS C  
Sbjct: 209 --PSQTLSSFVYGCGQDNQGLF---GRTDGIIGLANNELSMLSQL--SGKYGNAFSYCLP 261

Query: 282 ------DENDSGSVFFGDQGPATQQSTSFLPIGEKYD---AYFVGVESYCIGNSCLTQSG 332
                 +    G +  G        S  F P+ +  +    YF+ +ES  +    L  + 
Sbjct: 262 TSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAA 321

Query: 333 ----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLK 387
                  ++DSG   T LPT +Y  +   +  ++S K     G S    C+  S   + +
Sbjct: 322 SSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISE 381

Query: 388 V-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           V PD+R+IF       ++ H        G T  CL  M+      IIG       ++ +D
Sbjct: 382 VAPDIRIIFKGGADLQLKGHNSLVELETGIT--CL-AMAGSSSIAIIGNYQQQTVKVAYD 438

Query: 447 RENLKLAWSHSKCE 460
             N ++ ++   C+
Sbjct: 439 VGNSRVGFAPGGCQ 452


>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
          Length = 424

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 95/415 (22%), Positives = 163/415 (39%), Gaps = 56/415 (13%)

Query: 81  SNNNSSRNQLLFPSEGSQTHF------------FGNQF-YWLHYTWIDIGTPNVSFLVAL 127
           S++   R + +FP   + +              +GN +    +Y  + IG P   + +  
Sbjct: 25  SDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPXPYFLDP 84

Query: 128 DAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS-SSKNVSCSHPLCKSRSS 184
             GS+L W+ C   C++C       Y   +  +   DP  +        C HP       
Sbjct: 85  XTGSDLSWLQCDAPCVRCTKAXHXLYRPNNNLVICKDPMCAXLHPPGYKCEHP------- 137

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQT- 241
                + C Y  +Y+ +  SS G LV D+  L+  +  + AP+      + +GCG  Q  
Sbjct: 138 -----EQCDYEVEYA-DGGSSLGVLVKDVFPLNFTNGLRLAPR------LALGCGYDQIP 185

Query: 242 -GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
             SY      DGV+GLG G  S+ S L   G+I+N    C   +  G +FFGD    + +
Sbjct: 186 GXSY---HPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGGGFLFFGDDLYDSSR 242

Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
                 + +++  Y  G     +G             DSG+S+T+L +  Y  +V    K
Sbjct: 243 VVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRK 302

Query: 361 LVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSK-NQSFVVRNHI---FSFPENE 414
            +S K  R +L   +   C+         V D+R  F     SF         +  P   
Sbjct: 303 ELSEKPVREALDDQTLPLCWRG-KRPFKSVRDVRKFFKPLALSFAGGGRTKTQYDIPLES 361

Query: 415 GFTV---FCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
              +    CL +++       D+ +IG   M    +V+D E  ++ W+ + C+ +
Sbjct: 362 YLIISGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDRL 416


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 149/361 (41%), Gaps = 56/361 (15%)

Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-- 181
           V +D GS+L WV CQ C +C       Y   D     ++PS S S + V C+   C+S  
Sbjct: 79  VIVDTGSDLSWVQCQPCNRC-------YNQQD---PVFNPSKSPSYRTVLCNSLTCRSLQ 128

Query: 182 -----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
                   C S    C Y+ +Y  + + +SG +  + L+L         ++  ++ I GC
Sbjct: 129 LATGNSGVCGSNPPTCNYVVNYG-DGSYTSGEVGMEHLNLG--------NTTVNNFIFGC 179

Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND---SGSVFFGD 293
           GRK  G +       G++GLG  D+S+ S ++   +    FS C    +   SGS+  G 
Sbjct: 180 GRKNQGLF---GGASGLVGLGRTDLSLISQISP--MFGGVFSYCLPTTEAEASGSLVMGG 234

Query: 294 QGPATQQST----------SFLPIGEKYDAYFVGVESYCIGNSCLTQSGF---QALVDSG 340
                + +T            LP       YF+ +    +G   +    F   + ++DSG
Sbjct: 235 NSSVYKNTTPISYTRMIHNPLLPF------YFLNLTGITVGGVEVQAPSFGKDRMIIDSG 288

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
              + LP  IY  +  +F K  S    +        C+N S  + +K+PD+++ F  +  
Sbjct: 289 TVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAE 348

Query: 401 FVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
             V      +      +  CL + S   + + GIIG       RI++D +   L ++   
Sbjct: 349 LNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEA 408

Query: 459 C 459
           C
Sbjct: 409 C 409


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 161/367 (43%), Gaps = 45/367 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ I +GTP     + LD GS++ W     IQC P +  Y     ++   ++P+SSS+ 
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNW-----IQCEPCADCY----QQSDPVFNPTSSSTY 212

Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           K+++CS P C     S+C+S K  C Y   Y  + + + G L  D +   +  K      
Sbjct: 213 KSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGNSGKI----- 264

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
             ++V +GCG    G +   A   G+         V S+  +  +   SFS C  + DSG
Sbjct: 265 --NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFSYCLVDRDSG 314

Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSGFQ------- 334
              S+ F         +T+ L   +K D  Y+VG+  + +G     L  + F        
Sbjct: 315 KSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG 374

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             ++D G + T L T+ Y  +   F KL V+ K+ S   + +  CY+ SS   +KVP + 
Sbjct: 375 GVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVA 434

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             F+  +S  +    +  P ++  T FC     T     IIG     G RI +D     +
Sbjct: 435 FHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVI 493

Query: 453 AWSHSKC 459
             S +KC
Sbjct: 494 GLSGNKC 500


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 161/367 (43%), Gaps = 45/367 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ I +GTP     + LD GS++ W     IQC P +  Y     ++   ++P+SSS+ 
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNW-----IQCEPCADCY----QQSDPVFNPTSSSTY 212

Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           K+++CS P C     S+C+S K  C Y   Y  + + + G L  D +   +  K      
Sbjct: 213 KSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGNSGKI----- 264

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
             ++V +GCG    G +   A   G+         V S+  +  +   SFS C  + DSG
Sbjct: 265 --NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFSYCLVDRDSG 314

Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSGFQ------- 334
              S+ F         +T+ L   +K D  Y+VG+  + +G     L  + F        
Sbjct: 315 KSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG 374

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             ++D G + T L T+ Y  +   F KL V+ K+ S   + +  CY+ SS   +KVP + 
Sbjct: 375 GVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVA 434

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             F+  +S  +    +  P ++  T FC     T     IIG     G RI +D     +
Sbjct: 435 FHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVI 493

Query: 453 AWSHSKC 459
             S +KC
Sbjct: 494 GLSGNKC 500


>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 440

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 155/378 (41%), Gaps = 63/378 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I+IG P   + + +D GS+L W+ C     AP S    T          P    S+  V 
Sbjct: 89  INIGYPPRPYFLDIDTGSDLTWLQCD----APCSRCSQTP--------HPLYRPSNDLVP 136

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           C HPLC S     + +    +  DY  E     SS G LV+D+ ++ +F+       ++ 
Sbjct: 137 CRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLVNDV-YVLNFTNGV---QLKV 192

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
            + +GCG  Q          DG++GLG G  S+ S L   GL++N    C      G +F
Sbjct: 193 RMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQGGGYIF 252

Query: 291 FGDQGPATQQSTSFLPIGEK-YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTE 349
           FGD   +++   ++ P+  + Y  Y  G     +G          A+ D+G+S+T+  + 
Sbjct: 253 FGDVYDSSR--LAWTPMSSRDYKHYSAGAAELVLGGKRTGFGNLLAVFDAGSSYTYFNSN 310

Query: 350 IYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 407
            Y     +  K ++ K I  + +  +   C+        K P  R ++   + F  +   
Sbjct: 311 AY-----QLTKELAGKPIKEAPEDQTLPLCWYG------KRP-FRSVYEVKKYF--KPIA 356

Query: 408 FSFPEN-----------EGFTVF------CLTVMSTDG------DYGIIGQNFMMGHRIV 444
            SFP +           E + +       CL ++  DG      D  +IG   M+   +V
Sbjct: 357 LSFPGSRRSKAQFEIPPEAYLIISNMGNVCLGIL--DGSEVGVEDLNLIGDISMLDKVMV 414

Query: 445 FDRENLKLAWSHSKCEEV 462
           FD E   + W+ + C  V
Sbjct: 415 FDNEKQLIGWTAADCNRV 432


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 89/364 (24%), Positives = 144/364 (39%), Gaps = 46/364 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P     Y   ++    +DP+ SS+  NVS
Sbjct: 183 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQQEK---LFDPARSSTYANVS 234

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C    +       C Y   Y  + + S G+   D L L+S+              
Sbjct: 235 CAAPACFDLDTRGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 286

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
            GCG +  G + + A   G++GLG G  S+P     K G +   F+ C     SG+ +  
Sbjct: 287 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSSGTGYLD 340

Query: 291 FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGA 341
           FG   PA   +    P+    G  +  Y+VG+    +G   L+  QS F     +VDSG 
Sbjct: 341 FGPGSPAAAGARLTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGT 398

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
             T LP   Y+ +   F   ++++       +SL       CY+ +    + +P + L+F
Sbjct: 399 VITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSL----LDTCYDFTGMSQVAIPTVSLLF 454

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
                  V      +  +              GD GI+G   +    + +D     + +S
Sbjct: 455 QGGAILDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFS 514

Query: 456 HSKC 459
              C
Sbjct: 515 PGAC 518


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 166/366 (45%), Gaps = 45/366 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + +GTP    +   D GSNL+W  C+ C  C       YT +D     +DP +SS+ K+V
Sbjct: 98  LSLGTPPSPIMAVADTGSNLIWTQCKPCDDC-------YTQVD---PLFDPKASSTYKDV 147

Query: 173 SCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           SCS   C   ++++SC +    C Y+  Y+ + + + G    D L L S      Q    
Sbjct: 148 SCSSSQCTALENQASCSTEDKTCSYLVSYA-DGSYTMGKFAVDTLTLGSTDNRPVQ---L 203

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICF-DENDSG 287
            ++IIGCG+    ++ + ++    +G         SL+ + G  I   FS C   END  
Sbjct: 204 KNIIIGCGQNNAVTFRNKSSGVVGLG-----GGAVSLIKQLGDSIDGKFSYCLVPENDQT 258

Query: 288 S-VFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSGFQA--LVD 338
           S + FG      GP T  +   L +  +   Y++ ++S  +G  N     S  +   ++D
Sbjct: 259 SKINFGTNAVVSGPGTVSTP--LVVKSRDTFYYLTLKSISVGSKNMQTPDSNIKGNMVID 316

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS-K 397
           SG + T LP + Y E+      L+++ +   +      CYNA+++  L +P + + F   
Sbjct: 317 SGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATAD--LNIPVITMHFEGA 374

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHRIVFDRENLKLAWSH 456
           +      N  F   E+     F ++    +G YG + Q NF++G    +D  +  +++  
Sbjct: 375 DVKLYPYNSFFKVTEDLVCLAFGMSFYR-NGIYGNVAQKNFLVG----YDTASKTMSFKP 429

Query: 457 SKCEEV 462
           + C ++
Sbjct: 430 TDCAKM 435


>gi|301119611|ref|XP_002907533.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
 gi|262106045|gb|EEY64097.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
          Length = 681

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 87/369 (23%), Positives = 159/369 (43%), Gaps = 39/369 (10%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           HYTW+  GTP     V  D GS L+  PC  C  C      ++T        +  ++SS+
Sbjct: 67  HYTWVYAGTPPQRASVIADTGSALMAFPCSGCDGCG-----HHTD-----QPFQAANSST 116

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---ASFSKHAPQ 225
             +++C+         C    D C     Y  E +S    +V+DI++L   +SF     +
Sbjct: 117 LVHITCAQKSLFQCKECHVQSDTCGISQSY-MEGSSWKASVVEDIVYLGGESSFDDKEMR 175

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEN 284
           +   +    GC   + G ++   A DG+MGL   +  + + L +   I  N FS+CF EN
Sbjct: 176 NRYGTHFQFGCQSSEKGLFVTQVA-DGIMGLSNTENHIIAKLHRENKIASNLFSLCFTEN 234

Query: 285 DSGSVFFGDQGPATQQ-STSFLPI------GEKYDAYF----VGVESYCIGNSCLTQSGF 333
             G++  G    A  +   S++ +      G  Y+ +     +G +S        T+  +
Sbjct: 235 -GGTMSVGQPHKAAHRGEISYVKVIADRSAGHFYNVHMKDIRIGGKSINAKEEAYTRGHY 293

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             +VDSG + ++LP  +  E +  F ++    R    GNS   C   +++++  +P ++L
Sbjct: 294 --IVDSGTTDSYLPRALKTEFLQMFKEIAG--RDYQVGNS---CKGFTNKDLASLPTIQL 346

Query: 394 IFSKNQSFVVRNHIFSFPEN---EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           +            +   PE    E    +C  +  ++   G+IG N MM   ++FD  + 
Sbjct: 347 VMEAYGDENAEVILDVPPEQYLLESNGAYCGGIYLSENSGGVIGANLMMNRDVIFDLGDQ 406

Query: 451 KLAWSHSKC 459
           ++ +  + C
Sbjct: 407 RVGFVDADC 415


>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
 gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 95/402 (23%), Positives = 164/402 (40%), Gaps = 80/402 (19%)

Query: 99  THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL 158
           T + GNQ             P  +    +D GS++ W                       
Sbjct: 113 TFYLGNQ------------RPEDNISAVVDTGSDIFWT---------------------- 138

Query: 159 SEYDPSSSSSSKNVSCSHPLCKSRSSC----------KSLKDPCPYIADYS-TEDTSSSG 207
           +E + S S +   + C  P C+ R+SC             +  C Y   Y    + S++G
Sbjct: 139 TEKECSRSKTRSMLPCCSPKCEQRASCGCGRSELKAEAEKETKCTYAIIYGGNANDSTAG 198

Query: 208 YLVDDILHLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 266
            + +D L + +  SK  P S     V IGC    T  + D +   GV GLG    S+P  
Sbjct: 199 VMYEDKLTIVAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQ 257

Query: 267 LAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDA 313
           L  +      FS C   + E D  S       P          A   +T+  P  +    
Sbjct: 258 LNFS-----KFSYCLSSYQEPDLPSYLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTL 312

Query: 314 YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 369
           YFV +++  IG +      T+SG    VD+GASFT L   ++A++V + D+++  ++   
Sbjct: 313 YFVHLQNISIGGTRFPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYVK 372

Query: 370 Q---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 423
           +    N+ + CY   + +++E  K+PDM L F+ + + V+    + +      +  CL +
Sbjct: 373 EQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTT---SKLCLAI 429

Query: 424 MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
             ++  G   ++G   M    ++ D  N KL++  + C +VI
Sbjct: 430 YKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKVI 471


>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
          Length = 393

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 98/378 (25%), Positives = 161/378 (42%), Gaps = 59/378 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           ++IG P+  + + +D GS+L W+ C   C+QC      YY   + NL             
Sbjct: 38  LNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRN-NL------------- 83

Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHL--ASFSKHAPQS 226
           V C  P+C+S  S    +   P   DY  E     SS G LV D  +L   S  +H+P  
Sbjct: 84  VPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVTDTFNLNFTSEKRHSPL- 142

Query: 227 SVQSSVIIGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
                + +GCG  Q   GS+      DGV+GLG G  S+ S L+  GL++N    C   +
Sbjct: 143 -----LALGCGYDQFPGGSH---HPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGH 194

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGA 341
             G +FFGD    + +  ++ P+      Y  G+            +GF+ L+   DSGA
Sbjct: 195 GGGFLFFGDDLYDSSR-VAWTPMSPDAKHYSPGLAELTFDGK---TTGFKNLLTTFDSGA 250

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
           S+T+L ++ Y  ++    K +S K  R +L   +   C+    +    + D++  F K  
Sbjct: 251 SYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG-RKPFKSIRDVKKYF-KTF 308

Query: 400 SFVVRNHI-----FSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRIV 444
           +    N         FP  E + +       CL +++       D  +IG   M    ++
Sbjct: 309 ALSFTNERKSKTELEFPP-EAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVI 367

Query: 445 FDRENLKLAWSHSKCEEV 462
           +D E  ++ W+   C  +
Sbjct: 368 YDNEKERIGWAPGNCNRL 385


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 165/368 (44%), Gaps = 50/368 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ + IG P     + LD GS++ WV     QCAP +  Y    ++    ++P+SS+S 
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWV-----QCAPCAECY----EQTDPIFEPTSSASF 201

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
            ++SC    CKS    +     C Y   Y  + + + G  V + + L S S         
Sbjct: 202 TSLSCETEQCKSLDVSECRNGTCLYEVSYG-DGSYTVGDFVTETVTLGSTSL-------- 252

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
            ++ IGCG    G ++  A    ++GLG G +S PS L  +     SFS C  + DS S 
Sbjct: 253 GNIAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYCLVDRDSDST 304

Query: 290 FFGD-QGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCLT--QSGFQA--------L 336
              D   P T  + T+ L      D +F +G+    +G + L   ++ FQ         +
Sbjct: 305 STLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGII 364

Query: 337 VDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
           VDSG + T L T +Y  +   F K    L +++ ++L    +  CY+ SS+  ++VP + 
Sbjct: 365 VDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVAL----FDTCYDLSSKSRVEVPTVS 420

Query: 393 LIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
             F+      +    +  P ++EG   FC     TD    I+G     G R+ FD  N  
Sbjct: 421 FHFANGNELPLPAKNYLIPVDSEG--TFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSL 478

Query: 452 LAWSHSKC 459
           + +S +KC
Sbjct: 479 VGFSPNKC 486


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 116/481 (24%), Positives = 200/481 (41%), Gaps = 77/481 (16%)

Query: 5   VAICML----FGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVE 60
           +AI +L    FGCI    +  V F+  L+HR S  +                 P  NS E
Sbjct: 12  LAIALLCVSGFGCIY---ARKVGFTVDLIHRDSPLS-----------------PFYNSEE 51

Query: 61  YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
                ++N  +R  +RV    ++           + +++    N+  +L    + +GTP 
Sbjct: 52  TDLQRINNALRRSISRV----HHFDPIAAASVSPKAAESDVTSNRGEYLM--SLSLGTPP 105

Query: 121 VSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
              +   D GS+L+W  C+ C +C       Y  +D     +DP SS + ++ SC    C
Sbjct: 106 FKIMGIADTGSDLIWTQCKPCERC-------YKQVD---PLFDPKSSKTYRDFSCDARQC 155

Query: 180 K--SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
               +S+C    + C Y   YS  D S + G +  D + L S +  +P S  ++  +IGC
Sbjct: 156 SLLDQSTCSG--NICQY--QYSYGDRSYTMGNVASDTITLDS-TTGSPVSFPKT--VIGC 208

Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFF 291
           G +  G++ D  +  G++GLG G +S+ S +  +  +   FS C         +S  + F
Sbjct: 209 GHENDGTFSDKGS--GIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNSSKLNF 264

Query: 292 GDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSG 340
           G      GP  Q ST  L        YF+ +E+  +GN       S L       ++DSG
Sbjct: 265 GSNAVVSGPGVQ-STPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSG 323

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
            + T +P + ++ +       V  +R          CY+A+S+  LKVP +   F+    
Sbjct: 324 TTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSD--LKVPAITAHFTGADV 381

Query: 401 FVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
            +   + F    ++   V CL   ST     I G    M   + ++ +   L++  + C 
Sbjct: 382 KLKPINTFVQVSDD---VVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDCT 438

Query: 461 E 461
           +
Sbjct: 439 K 439


>gi|388513215|gb|AFK44669.1| unknown [Lotus japonicus]
          Length = 101

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 45/93 (48%), Positives = 62/93 (66%), Gaps = 8/93 (8%)

Query: 16  LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKT 75
           ++G  AV+FSS+LVHRFS+EAK    S+ GN +   SWP K++ EY  LLL++D  RQ  
Sbjct: 17  MEGEAAVTFSSRLVHRFSEEAKVHLASR-GNGAALQSWPNKSTSEYFRLLLNSDLTRQ-- 73

Query: 76  RVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYW 108
           R+KL S   S     ++PS+G QT FFGN++ W
Sbjct: 74  RMKLGSQYES-----MYPSKGGQTFFFGNEWNW 101


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 99/380 (26%), Positives = 163/380 (42%), Gaps = 54/380 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y + Y+   +GTP       +D GS+++W     +QC P    Y    ++    ++PS S
Sbjct: 87  YLMTYS---VGTPPFKLYGIVDTGSDIVW-----LQCEPCQECY----NQTTPMFNPSKS 134

Query: 167 SSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           SS KN+ C   LC+S   +SC   K+ C Y + Y  +++ S G L  D L L S +    
Sbjct: 135 SSYKNIPCPSKLCQSMEDTSCND-KNYCEY-STYYGDNSHSGGDLSVDTLTLESTNGLTV 192

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
                 +++IGCG     SY +GA+  G++G G G  S  + L  +      FS C    
Sbjct: 193 SF---PNIVIGCGTNNILSY-EGAS-SGIVGFGSGPASFITQLGSS--TGGKFSYCLTPL 245

Query: 282 ------DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGF 333
                   N +  + FGD    +       PI +K     Y++ +E++ +GN  +   G 
Sbjct: 246 FSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGV 305

Query: 334 -------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
                    ++DSG + T L  + Y+ +      LV  +R+     +   CY+  +E   
Sbjct: 306 PNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYSVKAEGY- 364

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGHR 442
              D  +I    +   V  H  S   +    VFCL   S+  D+ I G    QN M+G  
Sbjct: 365 ---DFPIITMHFKGADVDLHPISTFVSVADGVFCLAFESSQ-DHAIFGNLAQQNLMVG-- 418

Query: 443 IVFDRENLKLAWSHSKCEEV 462
             +D +   +++  S C +V
Sbjct: 419 --YDLQQKIVSFKPSDCTKV 436


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 112/417 (26%), Positives = 166/417 (39%), Gaps = 55/417 (13%)

Query: 70  WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
           W   K ++  +  + +S   L  P   +    +G+  Y++    + +GTP  S  + +D 
Sbjct: 19  WIESKAKLAGKKKDEASSTDLNGPV--TSGLLYGSGEYFVR---LGLGTPARSLFMVVDT 73

Query: 130 GSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRS 183
           GS+L W+ CQ C  C       Y   D     +DP +SSS + + C  PLCK     S S
Sbjct: 74  GSDLPWLQCQPCKSC-------YKQAD---PIFDPRNSSSFQRIPCLSPLCKALEVHSCS 123

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
             +     C Y   Y  + + S G    D+  L + SK         SV  GCG    G 
Sbjct: 124 GSRGATSRCSYQVAYG-DGSFSVGDFSSDLFTLGTGSKAM-------SVAFGCGFDNEGL 175

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE------NDSGSVFFGDQGPA 297
           +   A   G+    L   S     +      NSFS C  +        S S+ FG     
Sbjct: 176 FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAAIP 235

Query: 298 TQQSTSFLPIGEKYDA-YFVGVESYCIGNS---------CLTQSGFQA-LVDSGASFTFL 346
           +  + S L    K D  Y+  +    +G +          L+QSG    ++DSG S T  
Sbjct: 236 STAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRF 295

Query: 347 PTEIYAEVVVKFD----KLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
           PT +YA +   F      L S+ R SL    +  CYN S +  + VP + L F       
Sbjct: 296 PTSVYATIRDAFRNATINLPSAPRYSL----FDTCYNFSGKASVDVPALVLHFENGADLQ 351

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +    +  P N   + FCL    T  + GIIG       RI FD +   LA++  +C
Sbjct: 352 LPPTNYLIPINTAGS-FCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 87/383 (22%), Positives = 149/383 (38%), Gaps = 59/383 (15%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++    +GTP   F + +D GS+L +V     QCAP    Y    +++   Y PS+SS+ 
Sbjct: 34  YFVDFSLGTPEQKFHLIVDTGSDLAFV-----QCAPCDLCY----EQDGPLYQPSNSSTF 84

Query: 170 KNVSCSHPLC-----KSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
             V C    C        + C S       +  C Y   Y  +++S+ G    +   +  
Sbjct: 85  TPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYG-DNSSTVGVFAYETATVGG 143

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
              +         V  GCG +  GS++      GV+GLG G +S  S    A   +N F+
Sbjct: 144 IRVN--------HVAFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYA--FENKFA 190

Query: 279 ICFDENDS-----GSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIGNSCL-- 328
            C     S      S+ FGD   +T     F P+         Y+V +   C G   L  
Sbjct: 191 YCLTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLI 250

Query: 329 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
                   +      + DSG + T+   + YA ++  F+K V   R          C N 
Sbjct: 251 PDSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNV 310

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNF 437
           S  +    P   + F +  ++     N+      N    + CL ++ +  D + +IG   
Sbjct: 311 SGIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPN----IDCLAMLESSSDGFNVIGNII 366

Query: 438 MMGHRIVFDRENLKLAWSHSKCE 460
              + + +DRE  ++ ++H+ C+
Sbjct: 367 QQNYLVQYDREEHRIGFAHANCD 389


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 96/377 (25%), Positives = 158/377 (41%), Gaps = 41/377 (10%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
           N +   +   + IGTP +     +D GS+L+WV     QC P    Y    ++    +DP
Sbjct: 58  NAYIGQYLMELYIGTPPIKISGTVDTGSDLIWV-----QCVPCLGCY----NQINPMFDP 108

Query: 164 SSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
             SS+  N+SC  PLC K      S +  C Y   Y+ + + + G L  + + L S +  
Sbjct: 109 LKSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYA-DSSLTKGVLAQETVTLTSNTGK 167

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSFSIC 280
               S+Q  ++ GCG   TG++ D     G++GLG G     SL+++ G +     FS C
Sbjct: 168 P--ISLQ-GILFGCGHNNTGNFNDHEM--GLIGLGGGPT---SLVSQIGPLFGGKKFSQC 219

Query: 281 F-----DENDSGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGV---ESYCIGNS 326
                 D   S  + FG       +     P+ ++      Y    +G+   ++Y   NS
Sbjct: 220 LVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNS 279

Query: 327 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEM 385
            + +     LVDSG     LP ++Y  V V+    V  + I+   +   + CY   +   
Sbjct: 280 TIEKGNM--LVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTN-- 335

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGHRIV 444
           LK P +   F      +     F  P  E   VFCL + +  + D GI G      + I 
Sbjct: 336 LKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIG 395

Query: 445 FDRENLKLAWSHSKCEE 461
           FD +   +++  + C +
Sbjct: 396 FDLDRQIVSFKPTDCTK 412


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 100/419 (23%), Positives = 165/419 (39%), Gaps = 57/419 (13%)

Query: 57  NSVEYLELLLSNDWKRQKTRVKLQSNNN-SSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
           N+   +E+LL +  +      KL  ++     +    P++   +   GN     +   I 
Sbjct: 85  NAPNLVEILLEDQSRVDSIHAKLSDHSGVKETDAAKLPTKSGMSLGTGN-----YIVSIG 139

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +G+P    ++  D GS+L W  C   +                  +DP+ S+S  NVSCS
Sbjct: 140 LGSPKKDLMLIFDTGSDLTWARCSAAE-----------------TFDPTKSTSYANVSCS 182

Query: 176 HPLCKSRSSC-----KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            PLC S  S      +     C Y   Y  + + S G+L  + L + S       + + +
Sbjct: 183 TPLCSSVISATGNPSRCAASTCVYGIQYG-DGSYSIGFLGKERLTIGS-------TDIFN 234

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
           +   GCG+   G +   A   G++GLG   +SV S  A        FS C     S S  
Sbjct: 235 NFYFGCGQDVDGLFGKAA---GLLGLGRDKLSVVSQTAPK--YNQLFSYCLPS--SSSTG 287

Query: 291 FGDQGPATQQSTSFLPIGEKYDAYF--------VGVESYCIGNSCLTQSGFQALVDSGAS 342
           F   G +  +S  F P+     +++        VG +   I  S  + +G   ++DSG  
Sbjct: 288 FLSFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAG--TIIDSGTV 345

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
            T LP   Y+ +   F K ++S  +    +    CY+ S  + +KVP + + FS      
Sbjct: 346 VTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFSGGVDVD 405

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           V +    F  N G    CL      G  D  I G        +V+D    K+ ++ + C
Sbjct: 406 V-DQAGIFVAN-GLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASC 462


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 95/350 (27%), Positives = 157/350 (44%), Gaps = 40/350 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +GTP    +   D GS+LLW  C+ C  C       YT +D     +DP +SS+ K+V
Sbjct: 98  ISLGTPPFPIMAIADTGSDLLWTQCKPCDDC-------YTQVD---PLFDPKASSTYKDV 147

Query: 173 SCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           SCS   C   ++++SC +  + C Y   Y  + + + G +  D L L S      Q    
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYG-DRSYTKGNIAVDTLTLGSTDTRPVQ---L 203

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DEND 285
            ++IIGCG    G++       G++GLG G VS+ + L  +  I   FS C      END
Sbjct: 204 KNIIIGCGHNNAGTF--NKKGSGIVGLGGGAVSLITQLGDS--IDGKFSYCLVPLTSEND 259

Query: 286 SGS-VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQA------- 335
             S + FG     +       P+  K     Y++ ++S  +G+  +   G  +       
Sbjct: 260 RTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNI 319

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           ++DSG + T LPTE Y+E+       + +++          CY+A+ +  LKVP + + F
Sbjct: 320 IIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGD--LKVPAITMHF 377

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHRIV 444
                 +  ++ F    +E    F      +   YG + Q NF++G+  V
Sbjct: 378 DGADVNLKPSNCF-VQISEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTV 426


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 152/376 (40%), Gaps = 50/376 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +GTP V  L+A+D GS++ W+ CQ C +C P S             +DP  S+S + +
Sbjct: 138 IAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPV----------FDPRHSTSYREM 187

Query: 173 SCSHPLCKS--RSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
               P C++  RS     K   C Y   Y  + +++ G  +++ L  A      P  S  
Sbjct: 188 GYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAG-GVQVPHMS-- 244

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE------ 283
               IGCG    G +   AA  G++GLG G +S PS +A  G    SFS C  +      
Sbjct: 245 ----IGCGHDNKGLFAAPAA--GILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSP 298

Query: 284 --NDSGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGVESYCIGNSCLTQSGFQ- 334
             + S ++  GD   A     SF P  +       Y    VGV    +    +T+   + 
Sbjct: 299 GRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKL 358

Query: 335 --------ALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNS--WKYCYNASSE 383
                    ++DSG + T L    Y A         V   ++S+ G S  +  CY     
Sbjct: 359 DPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGGR 418

Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
            M KVP + + F+      +    +  P +   TV      + D    IIG     G R+
Sbjct: 419 AM-KVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRV 477

Query: 444 VFDRENLKLAWSHSKC 459
           V++    ++ ++ + C
Sbjct: 478 VYNIGGGRVGFAPNSC 493


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 83/356 (23%), Positives = 140/356 (39%), Gaps = 34/356 (9%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+L WV     QC P +  Y    ++    +DPS SS+   V+
Sbjct: 153 VGLGTPAKQYAVIFDTGSDLSWV-----QCKPCADCY----EQQDPLFDPSLSSTYAAVA 203

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C  P C+    S C S    C Y   Y  + + + G LV D L L++       S     
Sbjct: 204 CGAPECQELDASGCSS-DSRCRYEVQYG-DQSQTDGNLVRDTLTLSA-------SDTLPG 254

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF- 290
            + GCG +  G +      DG+ GLG   VS+PS  A +      F+ C   + SG  + 
Sbjct: 255 FVFGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFTYCLPSSSSGRGYL 309

Query: 291 -FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQALVDSGASF 343
             G   PA  Q T+ L  G     Y++ +    +G   +        +    ++DSG   
Sbjct: 310 SLGGAPPANAQFTA-LADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVI 368

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           T LP   YA +   F + ++  + +   +    CY+ +     ++P + L F+   +  +
Sbjct: 369 TRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSL 428

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                 +              + D    I+G        + +D  N ++ +    C
Sbjct: 429 DFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGC 484


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 99/418 (23%), Positives = 171/418 (40%), Gaps = 68/418 (16%)

Query: 73  QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSN 132
           + T V + S+  +    L  P       F  +         + IGTP +++   +D GS+
Sbjct: 77  RATGVPMTSSKAAGGGDLQVPVHAGNGEFLMD---------VSIGTPALAYSAIVDTGSD 127

Query: 133 LLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLK 189
           L+W  C+ C+ C            ++   +DPSSSS+   V CS   C     S C S  
Sbjct: 128 LVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTS-A 176

Query: 190 DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA 249
             C Y   Y  + +S+ G L  +   LA         S    V+ GCG    G      A
Sbjct: 177 SKCGYTYTYG-DSSSTQGVLATETFTLA--------KSKLPGVVFGCGDTNEGDGFSQGA 227

Query: 250 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQ------- 299
             G++GLG G +   SL+++ GL  + FS C    D+ ++  +  G     ++       
Sbjct: 228 --GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASS 280

Query: 300 -QSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ--------ALVDSGASFTFLPT 348
            Q+T  +    +   Y+V +++  +G++   L  S F          +VDSG S T+L  
Sbjct: 281 VQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEV 340

Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF----SKNQSFVVR 404
           + Y  +   F   ++       G     C+ A ++ + +V   RL+F      +      
Sbjct: 341 QGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAE 400

Query: 405 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           N++     + G    CLTVM + G   IIG       + V+D  +  L+++  +C ++
Sbjct: 401 NYMV---LDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 454


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 106/445 (23%), Positives = 177/445 (39%), Gaps = 53/445 (11%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKT-RVKLQSNNNSSRNQLLFPSEGSQTHFFGNQ 105
           V++ D  P  +   YL  LL+ D  R  + +++++++  ++ +     +E   T   G +
Sbjct: 123 VAIPDDDPAAHD-RYLRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTS--GIR 179

Query: 106 FYWLHY-TWIDIG-----TPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLS 159
           F  L+Y T I +G     +P  +  V +D GS+L WV     QC P SA Y     +   
Sbjct: 180 FQTLNYVTTIALGGGSSGSPAANLTVIVDTGSDLTWV-----QCKPCSACY----AQRDP 230

Query: 160 EYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
            +DP+ S++   V C+   C +          SC    + C Y   Y  + + S G L  
Sbjct: 231 LFDPAGSATYAAVRCNASACAASLKAATGTPGSCGGGNERCYYALAYG-DGSFSRGVLAT 289

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS--LLAK 269
           D + L   S            + GCG    G +       G+MGLG  ++S+ S   L  
Sbjct: 290 DTVALGGAS--------LDGFVFGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTALRY 338

Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIG 324
            G+           + SGS+  G    + + +T         D      YF+ V    +G
Sbjct: 339 GGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVG 398

Query: 325 NSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNS-WKYCYN 379
            + L   G  A   L+DSG   T L   +Y  V  +F +   ++   +  G S    CY+
Sbjct: 399 GTALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYD 458

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNF 437
            +  + +KVP + L         V      F   +  +  CL +  +S +    IIG   
Sbjct: 459 LTGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQ 518

Query: 438 MMGHRIVFDRENLKLAWSHSKCEEV 462
               R+V+D    +L ++   C  V
Sbjct: 519 QKNKRVVYDTVGSRLGFADEDCNYV 543


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 83/356 (23%), Positives = 140/356 (39%), Gaps = 34/356 (9%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+L WV     QC P +  Y    ++    +DPS SS+   V+
Sbjct: 153 VGLGTPAKQYAVIFDTGSDLSWV-----QCKPCADCY----EQQDPLFDPSLSSTYAAVA 203

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C  P C+    S C S    C Y   Y  + + + G LV D L L++       S     
Sbjct: 204 CGAPECQELDASGCSS-DSRCRYEVQYG-DQSQTDGNLVRDTLTLSA-------SDTLPG 254

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF- 290
            + GCG +  G +      DG+ GLG   VS+PS  A +      F+ C   + SG  + 
Sbjct: 255 FVFGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFTYCLPSSSSGRGYL 309

Query: 291 -FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQALVDSGASF 343
             G   PA  Q T+ L  G     Y++ +    +G   +        +    ++DSG   
Sbjct: 310 SLGGAPPANAQFTA-LADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVI 368

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           T LP   YA +   F + ++  + +   +    CY+ +     ++P + L F+   +  +
Sbjct: 369 TRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSL 428

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                 +              + D    I+G        + +D  N ++ +    C
Sbjct: 429 DFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGC 484


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 101/390 (25%), Positives = 156/390 (40%), Gaps = 64/390 (16%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  + IG P  S L+  D GS+L+WV C  C  C+  S +         + + P  SS+
Sbjct: 83  YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPA---------TVFFPRHSST 133

Query: 169 SKNVSCSHPLCK--------SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASF 219
                C  P+C+         R +   +   CPY  +Y   D S +SG    +   L + 
Sbjct: 134 FSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPY--EYGYADGSLTSGLFARETTSLKTS 191

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGDVSVPSLLAKAGLIQNS 276
           S    + +   SV  GCG + +G  + G +    +GVMGLG G +S  S L +     N 
Sbjct: 192 SG---KEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNK 246

Query: 277 FSICFDE-----NDSGSVFFGDQGPATQQ--STSFL--PIGEKYDAYFVGVESYCIGNSC 327
           FS C  +       +  +  GD G A  +   T  L  P+   +  Y+V ++S  +  + 
Sbjct: 247 FSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTF--YYVKLKSVFVNGAK 304

Query: 328 LT---------QSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 377
           L           SG    V DSG +  FL    Y  V+    + +           +  C
Sbjct: 305 LRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLC 364

Query: 378 YNASS----EEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTDGDYG 431
            N S     E++L  P ++  FS    FV   RN+     E     + CL + S D   G
Sbjct: 365 VNVSGVTKPEKIL--PRLKFEFSGGAVFVPPPRNYFIETEEQ----IQCLAIQSVDPKVG 418

Query: 432 --IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             +IG     G    FDR+  +L +S   C
Sbjct: 419 FSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 99/418 (23%), Positives = 171/418 (40%), Gaps = 68/418 (16%)

Query: 73  QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSN 132
           + T V + S+  +    L  P       F  +         + IGTP +++   +D GS+
Sbjct: 67  RATGVPMTSSKAAGGGDLQVPVHAGNGEFLMD---------VSIGTPALAYSAIVDTGSD 117

Query: 133 LLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLK 189
           L+W  C+ C+ C            ++   +DPSSSS+   V CS   C     S C S  
Sbjct: 118 LVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTS-A 166

Query: 190 DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA 249
             C Y   Y  + +S+ G L  +   LA         S    V+ GCG    G      A
Sbjct: 167 SKCGYTYTYG-DSSSTQGVLATETFTLA--------KSKLPGVVFGCGDTNEGDGFSQGA 217

Query: 250 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQ------- 299
             G++GLG G +   SL+++ GL  + FS C    D+ ++  +  G     ++       
Sbjct: 218 --GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASS 270

Query: 300 -QSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ--------ALVDSGASFTFLPT 348
            Q+T  +    +   Y+V +++  +G++   L  S F          +VDSG S T+L  
Sbjct: 271 VQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEV 330

Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF----SKNQSFVVR 404
           + Y  +   F   ++       G     C+ A ++ + +V   RL+F      +      
Sbjct: 331 QGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAE 390

Query: 405 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           N++     + G    CLTVM + G   IIG       + V+D  +  L+++  +C ++
Sbjct: 391 NYMV---LDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 444


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 155/367 (42%), Gaps = 46/367 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++T + +G P   F + LD GS++ W+ CQ C  C       Y   D     +DP++SS+
Sbjct: 20  YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDC-------YQQTD---PIFDPTASST 69

Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
              V+C    C S   SSC+S +  C Y  +Y         Y   D    A+ S     S
Sbjct: 70  YAPVTCQSQQCSSLEMSSCRSGQ--CLYQVNY-----GDGSYTFGD---FATESVSFGNS 119

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
               +V +GCG    G ++  A   G+ G  L      SL  +  L   SFS C    DS
Sbjct: 120 GSVKNVALGCGHDNEGLFVGAAGLLGLGGGPL------SLTNQ--LKATSFSYCLVNRDS 171

Query: 287 G---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGFQ------ 334
               ++ F          T+ L    K D  Y+VG+    +G   ++  +S F+      
Sbjct: 172 AGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGN 231

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              +VD G + T L T+ Y  +   F ++  + +++     +  CY+ S +  ++VP + 
Sbjct: 232 GGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVS 291

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             F+  +S+ +    +  P +   T +C     T     IIG     G R+ FD  N ++
Sbjct: 292 FHFADGKSWNLPAANYLIPVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRM 350

Query: 453 AWSHSKC 459
            +S +KC
Sbjct: 351 GFSPNKC 357


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 158/373 (42%), Gaps = 53/373 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++T + +GTP     + LD GS+++W+ C  CI+C       Y+  D     +DP+ S S
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKC-------YSQTD---PVFDPTKSRS 194

Query: 169 SKNVSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-- 224
             N+ C  PLC+      C + K  C Y   Y            D    +  FS      
Sbjct: 195 FANIPCGSPLCRRLDYPGCSTKKQICLYQVSYG-----------DGSFTVGEFSTETLTF 243

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           + +    V++GCG    G ++  A    ++GLG G +S PS + +     + FS C  + 
Sbjct: 244 RGTRVGRVVLGCGHDNEGLFVGAAG---LLGLGRGRLSFPSQIGRR--FNSKFSYCLGDR 298

Query: 285 DS----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCLTQSGFQ 334
            +     S+ FGD   A  ++T F P+    K D  Y+V +    +G    S ++ S F+
Sbjct: 299 SASSRPSSIVFGDS--AISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFK 356

Query: 335 --------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
                    ++DSG S T L    Y  +   F    S+ + + + + +  C++ S +  +
Sbjct: 357 LDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEV 416

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           KVP + L F      +  ++     +N G   FC     T     IIG     G R+V+D
Sbjct: 417 KVPTVVLHFRGADVPLPASNYLIPVDNSG--SFCFAFAGTASGLSIIGNIQQQGFRVVYD 474

Query: 447 RENLKLAWSHSKC 459
               ++ ++   C
Sbjct: 475 LATSRVGFAPRGC 487


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 78/289 (26%), Positives = 127/289 (43%), Gaps = 27/289 (9%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +  GTP  +  V  D GSN+ W     IQC P   S Y   +     +DP+ SS+ +N+S
Sbjct: 20  VGFGTPKKNQTVIFDTGSNVNW-----IQCKPCVVSCYPQQE---PLFDPTLSSTYRNIS 71

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+   C   SS       C Y   Y  + +S+ G+L  +   LA+        +V ++ I
Sbjct: 72  CTSAACTGLSSRGCSGSTCVYGVTYG-DGSSTVGFLATETFTLAA-------GNVFNNFI 123

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GCG+   G +  GAA  G++GLG    S+ S LA +  + N FS C     S + +   
Sbjct: 124 FGCGQNNQGLF-TGAA--GLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGYLNI 178

Query: 294 QGP-ATQQSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQA---LVDSGASFTFLP 347
             P  T   T+ L        YF+ +    +G +   L+ + FQ+   ++DSG   T LP
Sbjct: 179 GNPLRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITRLP 238

Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
              Y  +   F   ++    +   +    CY+ S    +  P ++L ++
Sbjct: 239 PTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYT 287


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 98/384 (25%), Positives = 159/384 (41%), Gaps = 65/384 (16%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP   F + +D GS+L W+ C  C+ C           ++    +DP++SSS +NV+C
Sbjct: 155 VGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNVTC 204

Query: 175 SHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH--APQ 225
               C         R+  +  +D CPY   Y  +  ++        L L SF+ +  AP 
Sbjct: 205 GDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGD------LALESFTVNLTAPG 258

Query: 226 SSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           +S +   V+ GCG +  G +   A   G+    L   S   L A  G   ++FS C  E+
Sbjct: 259 ASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCLVEH 313

Query: 285 --DSGS-VFFGDQ----GPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS----- 331
             D+GS V FG+          + T+F P     D  Y+V ++   +G   L  S     
Sbjct: 314 GSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWD 373

Query: 332 -----GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWKYCYNASSEEM 385
                    ++DSG + ++     Y  +   F  L+S     +        CYN S  E 
Sbjct: 374 VGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNPCYNVSGVER 433

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMST-DGDYGIIGQNFM 438
            +VP++ L+F+          ++ FP    F       + CL V  T      IIG    
Sbjct: 434 PEVPELSLLFADGA-------VWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIGNFQQ 486

Query: 439 MGHRIVFDRENLKLAWSHSKCEEV 462
               +V+D +N +L ++  +C EV
Sbjct: 487 QNFHVVYDLQNNRLGFAPRRCAEV 510


>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
 gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
          Length = 603

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 97/402 (24%), Positives = 160/402 (39%), Gaps = 74/402 (18%)

Query: 119 PNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
           P   + +  D GS+L W+ C   C  CA  + ++Y     N+           K++ C  
Sbjct: 199 PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANAWYKPRRGNIV--------PPKDLLCME 250

Query: 177 PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
                ++      D C Y  +Y+ + +SS G L  D L L      A  S  + + I GC
Sbjct: 251 VQRNQKAGYCETCDQCDYEIEYA-DHSSSMGVLATDKLLLMV----ANGSLTKLNFIFGC 305

Query: 237 GRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGD 293
              Q G  L      DG++GL    VS+PS LA  G+I N    C   D    G +F GD
Sbjct: 306 AYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLGD 365

Query: 294 QGPATQQSTSFLPIGE--KYDAYFVGVESYCIGNSCLTQSGFQA-----LVDSGASFTFL 346
                +   +++P+ +    + Y   V     G+S L+  G ++     L DSG+S+T+ 
Sbjct: 366 DF-VPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTYF 424

Query: 347 PTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNAS-------SEEMLKVP--------- 389
           P E Y+E+V   +++  +  + S    +   C+ A+           L  P         
Sbjct: 425 PKEAYSELVASLNEVSGAGLVQSTSDTTLPLCWRANFPIRKFIYRTELTRPIRRRRRRRR 484

Query: 390 ---------------DMR-----LIFSKNQSFVVRNHIFSFPENEGFTVF------CLTV 423
                          D++     L F     ++V +  F  P  EG+ +       CL +
Sbjct: 485 RRRRRRRRRRQHIKGDVKKFFKTLTFQFGTKWLVISTKFRIPP-EGYLMMSDKGNVCLGI 543

Query: 424 MST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           +      DG   I+G   + G  +V+D  N K+ W+ S C +
Sbjct: 544 LEGSKVHDGSTIILGDISLRGQLVVYDNVNKKIGWTPSDCAK 585


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 152/369 (41%), Gaps = 36/369 (9%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           +Y  I +GTP   F + +D GS+L W+ CQ   I C       +T    ++S+   + S 
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTP---SVSKTYKALSC 163

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQS 226
           SS   S       +   C +    C Y A Y   DTS S GYL  D+L L       P +
Sbjct: 164 SSSQCSSLKSSTLNAPGCSNATGACVYKASYG--DTSFSIGYLSQDVLTL------TPSA 215

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----- 281
           +  S  + GCG+   G +   A   G++GL    +S+   L+      N+FS C      
Sbjct: 216 APSSGFVYGCGQDNQGLFGRSA---GIIGLANDKLSMLGQLSNK--YGNAFSYCLPSSFS 270

Query: 282 -DENDSGSVFFGDQGPATQQST-SFLPIGEKYDA---YFVGVESYCIGNSCLTQSG---- 332
              N S S F      +   S   F P+ +       YF+G+ +  +    L  S     
Sbjct: 271 AQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYN 330

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDM 391
              ++DSG   T LP  IY  +   F  ++S K     G S    C+  S +EM  VP++
Sbjct: 331 VPTIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEI 390

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
           R+IF       ++ H       +G T  CL + ++     IIG        + +D  N K
Sbjct: 391 RIIFRGGAGLELKVHNSLVEIEKGTT--CLAIAASSNPISIIGNYQQQTFTVAYDVANSK 448

Query: 452 LAWSHSKCE 460
           + ++   C+
Sbjct: 449 IGFAPGGCQ 457


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 153/360 (42%), Gaps = 37/360 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   F +  D GS++ W      QC P   + Y   +  L   +PS+S+S KN+S
Sbjct: 135 VGLGTPKKEFTLIFDTGSDITWT-----QCEPCVKTCYKQKEPRL---NPSTSTSYKNIS 186

Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           CS  LCK  +S K     C      Y   Y  + + S G+   + L L+S       S+V
Sbjct: 187 CSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTLSS-------SNV 238

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
             + + GCG++  G +   A   G+    L   ++PS  AK    +  FS C   + S  
Sbjct: 239 FKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKLFSYCLPASSSSK 293

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--LVDSGA 341
            +    G    +S  F P+   +D+   Y + +    +G   L+  +S F A  ++DSG 
Sbjct: 294 GYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGT 352

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
             T L    Y+E+   F  L++    +   + +  CY+ S  + +++P + + F      
Sbjct: 353 VITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEM 412

Query: 402 VVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            +      +P N G    CL       D D  I G      +++V+D    ++ ++   C
Sbjct: 413 DIDVSGILYPVN-GLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 471


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 104/400 (26%), Positives = 162/400 (40%), Gaps = 66/400 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           + IGTP     V +D GS+L W PC      CI+C     +Y    +R ++ + PS SSS
Sbjct: 84  LSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIEC----DNYRN--NRMMASFSPSHSSS 137

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIA-DYST-------------EDTSSSGYLVDDIL 214
           S   SC+ P C    S  +  DPC       ST               T  +G +V   L
Sbjct: 138 SHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTL 197

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
              +   H     V   +   C      SY +   P G+ G G G +S+PS L   G ++
Sbjct: 198 TRDTLRVHGRNLGVTQEIPRFCFGCVASSYRE---PIGIAGFGRGALSLPSQL---GFLR 251

Query: 275 NSFSICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVESYCIG 324
             FS CF       + N S  +  GD    ++    F P+ +     + Y+VG+E+  +G
Sbjct: 252 KGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITVG 311

Query: 325 NSCLTQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQG 371
           N   T+                LVDSG ++T LP   Y++V+     +++  R +     
Sbjct: 312 NVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDMEMR 371

Query: 372 NSWKYCYNASSEE--MLK---VPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVF-CLTV 423
             +  CY    +   +L    +P +   F  N S V+   +H ++       TV  CL  
Sbjct: 372 TGFDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCLLF 431

Query: 424 MST-DGDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            S  DGDY   G++G        +V+D E  ++ +    C
Sbjct: 432 QSMDDGDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDC 471


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 155/369 (42%), Gaps = 55/369 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP +S    +D GS+L+W  C  C  C+  S                SSSS+   V C
Sbjct: 48  IGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSSIYDP------------SSSSTYSKVLC 95

Query: 175 SHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
              LC+  S  SC +  D C Y+  Y  + +S+SG L D+   ++S S          ++
Sbjct: 96  QSSLCQPPSIFSCNNDGD-CEYVYPYG-DRSSTSGILSDETFSISSQSL--------PNI 145

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSGS 288
             GCG    G   D     G++G G G +S+ S L  +  + N FS C     D + +  
Sbjct: 146 TFGCGHDNQG--FDKVG--GLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSKTSP 199

Query: 289 VFFGDQG--PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
           +F G+     AT   ++ L      + Y++ +E   +G   L          +      +
Sbjct: 200 LFIGNTASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLI 259

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +DSG + TFL    Y  V    + +VSS  +         C+N         P M   F 
Sbjct: 260 IDSGTTLTFLQQTAYDAVK---EAMVSSINLPQADGQLDLCFNQQGSSNPGFPSMTFHF- 315

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLA 453
           K   + V    + FP++    + CL +M T+   G+  I G      ++I++D EN  L+
Sbjct: 316 KGADYDVPKENYLFPDSTS-DIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLS 374

Query: 454 WSHSKCEEV 462
           ++ + C+ +
Sbjct: 375 FAPTACDTL 383


>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Brachypodium distachyon]
          Length = 436

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 90/375 (24%), Positives = 153/375 (40%), Gaps = 52/375 (13%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+   + +G P+  + +A   GS+++WVPC  C  C P       SLD     YDP +SS
Sbjct: 75  LYCITVKLGNPSRHYYLAFHTGSDVMWVPCSSCTDC-PTPDDIGFSLDL----YDPKNSS 129

Query: 168 SSK-----NVSCSHPLCKSRSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           +S      +  C+  L    + C    S  D C Y   Y+    +++GY V D +H   F
Sbjct: 130 TSSEISCSDDRCADALKTGHAICHTSHSSGDQCGYNQIYADGVLATTGYYVSDDIHFDIF 189

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
             +   +S  +SVI GC + ++G        DGV+G G    S+ S L   G + ++FS 
Sbjct: 190 MGNESFASSSASVIFGCSKSRSGH----LQADGVIGFGKDAPSLISQLNSQG-VSHAFSR 244

Query: 280 CFDE-NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN-------SCLTQS 331
           C D+ +D G V   D+    +    F  +      Y + ++S  + N       S  T S
Sbjct: 245 CLDDSDDGGGVLILDE--VGEPGLEFTSLVASRPCYNLNMKSIAVNNQNVPIDSSLFTTS 302

Query: 332 GFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
             Q   +DSG S  + P  +Y  V+     +  S R                      P 
Sbjct: 303 STQGTFLDSGTSLAYFPDGVYDPVIRAILFIYFSTR-----------------SFSSFPT 345

Query: 391 MRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYG---IIGQNFMMGHRIVF 445
           +   F    +  V   N++      +  +  C+    ++GDY    I+G   +     V+
Sbjct: 346 VTXYFEGGAAMKVGPENYLLRRGSYDNDSYMCIAFQRSEGDYKQTTILGDLILHDKIFVY 405

Query: 446 DRENLKLAWSHSKCE 460
           + + +++ W +  C+
Sbjct: 406 NLKKMQIGWVNYNCK 420


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 153/360 (42%), Gaps = 37/360 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   F +  D GS++ W      QC P   + Y   +  L   +PS+S+S KN+S
Sbjct: 75  VGLGTPKKEFTLIFDTGSDITWT-----QCEPCVKTCYKQKEPRL---NPSTSTSYKNIS 126

Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           CS  LCK  +S K     C      Y   Y  + + S G+   + L L+S       S+V
Sbjct: 127 CSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTLSS-------SNV 178

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
             + + GCG++  G +   A   G+    L   ++PS  AK    +  FS C   + S  
Sbjct: 179 FKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKLFSYCLPASSSSK 233

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--LVDSGA 341
            +    G    +S  F P+   +D+   Y + +    +G   L+  +S F A  ++DSG 
Sbjct: 234 GYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGT 292

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
             T L    Y+E+   F  L++    +   + +  CY+ S  + +++P + + F      
Sbjct: 293 VITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEM 352

Query: 402 VVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            +      +P N G    CL       D D  I G      +++V+D    ++ ++   C
Sbjct: 353 DIDVSGILYPVN-GLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 91/364 (25%), Positives = 150/364 (41%), Gaps = 62/364 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           I IGTP +     LD GS+L+W  C   C +C P  A  Y           P+ S++  N
Sbjct: 96  IAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYA----------PARSATYAN 145

Query: 172 VSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           VSC  P+C++     S C      C Y   Y  + TS+ G L  +   L S        +
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTLGS-------DT 197

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
               V  GCG +  GS  + +   G++G+G G +   SL+++ G+ +   S        G
Sbjct: 198 AVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTRPRRSCRARAAARG 251

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
                   P T      + +G+      + ++      + +   G   ++DSG +FT L 
Sbjct: 252 GGA-----PTTTSPLEGITVGDT----LLPIDPAVFRLTPMGDGGV--IIDSGTTFTALE 300

Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFS------KN 398
              +   V     L S  R+ L   +      C+ A+S E ++VP + L F       + 
Sbjct: 301 ERAF---VALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRR 357

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
           +S+VV        E+    V CL ++S  G   ++G        I++D E   L++  +K
Sbjct: 358 ESYVV--------EDRSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGILSFEPAK 408

Query: 459 CEEV 462
           C E+
Sbjct: 409 CGEL 412


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 153/360 (42%), Gaps = 37/360 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   F +  D GS++ W      QC P   + Y   +  L   +PS+S+S KN+S
Sbjct: 123 VGLGTPKKEFTLIFDTGSDITWT-----QCEPCVKTCYKQKEPRL---NPSTSTSYKNIS 174

Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           CS  LCK  +S K     C      Y   Y  + + S G+   + L L+S       S+V
Sbjct: 175 CSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTLSS-------SNV 226

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
             + + GCG++  G +   A   G+    L   ++PS  AK    +  FS C   + S  
Sbjct: 227 FKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKLFSYCLPASSSSK 281

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--LVDSGA 341
            +    G    +S  F P+   +D+   Y + +    +G   L+  +S F A  ++DSG 
Sbjct: 282 GYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGT 340

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
             T L    Y+E+   F  L++    +   + +  CY+ S  + +++P + + F      
Sbjct: 341 VITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEM 400

Query: 402 VVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            +      +P N G    CL       D D  I G      +++V+D    ++ ++   C
Sbjct: 401 DIDVSGILYPVN-GLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 459


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 95/373 (25%), Positives = 155/373 (41%), Gaps = 52/373 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           + T + +GTP  ++++ +D+GS+L W     +QCAP + S +    +    YDP +SS+ 
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTW-----LQCAPCAVSCH---PQAGPLYDPRASSTY 159

Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
             V CS P C        + SSC S    C Y A Y  + + S GYL  D + L+S    
Sbjct: 160 AAVPCSAPQCAELQAATLNPSSC-SGSGVCQYQASYG-DGSFSFGYLSKDTVSLSS---- 213

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
              S        GCG+   G +   A   G++GL    +S+ S LA +  + NSF+ C  
Sbjct: 214 ---SGSFPGFYYGCGQDNVGLFGRAA---GLIGLARNKLSLLSQLAPS--VGNSFAYCLP 265

Query: 283 EN---DSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-----Q 330
            +    +G + FG    ++ P     TS +        YFV +    +  S L       
Sbjct: 266 TSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEY 325

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW---KYCYNASSEEMLK 387
                ++DSG   T LPT +Y        K V +   +    ++   + C+     + L 
Sbjct: 326 GSLPTIIDSGTVITRLPTPVY----TALSKAVGAALAAPSAPAYSILQTCFKGQVAK-LP 380

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           VP + + F+   +  +         NE  T  CL    TD    IIG        +V+D 
Sbjct: 381 VPAVNMAFAGGATLRLTPGNVLVDVNE--TTTCLAFAPTD-STAIIGNTQQQTFSVVYDV 437

Query: 448 ENLKLAWSHSKCE 460
           +  ++ ++   C 
Sbjct: 438 KGSRIGFAAGGCS 450


>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
 gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
          Length = 649

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 80/310 (25%), Positives = 137/310 (44%), Gaps = 46/310 (14%)

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPN- 120
           L  L  +D  R++   ++  +   S     FP  GS         +  +Y  I +G P+ 
Sbjct: 73  LAHLREHDAHRRR---RILESPAESPGASTFPLHGSVKE------HGYYYANIALGDPSP 123

Query: 121 VSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
            +F V +D GS L +VPC  C +C   +           + +DP+     K ++C    C
Sbjct: 124 RTFQVIVDTGSTLTYVPCATCAKCGTHTGG---------TRFDPTG----KWLTCQEKQC 170

Query: 180 KSRSS---CKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           K+      C   +    + C Y   Y+ E +  SG LV D +H       AP ++    V
Sbjct: 171 KAAGGPGICAGGRGAAANRCTYSRTYA-EGSGVSGDLVRDKMHFGG--DIAPATNGTLDV 227

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGD-VSVPSLLAKAGLIQNSFSICFDENDSGSVFF 291
           + GC   ++G+  D  A DG++GLG     S+P+ LA    +   FS+CF   + G    
Sbjct: 228 VFGCTNAESGTIHDQEA-DGLIGLGNNQFASIPNQLADTHGLPRVFSLCFGSFEGGGALS 286

Query: 292 GDQGPATQQSTSF----LPIGEKYDAYFV-GVESYCIGNSCLTQS-----GFQALVDSGA 341
             + PAT  +       + + E + AY+V    +  IG+  +        G+  ++DSG 
Sbjct: 287 FGRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVATPSDLAVGYGTVMDSGT 346

Query: 342 SFTFLPTEIY 351
           +FT++PT+++
Sbjct: 347 TFTYVPTKVF 356


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 105/431 (24%), Positives = 180/431 (41%), Gaps = 73/431 (16%)

Query: 71  KRQKTRVKL---QSNNNSSRNQLLFPSEGSQTHFF--------GNQFYWLHYTWIDIGTP 119
           + +  RV L    ++ N SR+QLL  +     H          GN  + +    + IGTP
Sbjct: 27  RLKGLRVHLTHVDAHGNYSRHQLLRRAARRSHHRMSRLVPVHAGNGEFLMD---VSIGTP 83

Query: 120 NVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
            +++   +D GS+L+W  C+ C+ C            ++   +DPSSSS+   V CS   
Sbjct: 84  ALAYSAIVDTGSDLVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATVPCSSAS 133

Query: 179 CKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
           C     S C S    C Y   Y  + +S+ G L  +   LA         S    V+ GC
Sbjct: 134 CSDLPTSKCTS-ASKCGYTYTYG-DSSSTQGVLATETFTLA--------KSKLPGVVFGC 183

Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGD 293
           G    G      A  G++GLG G +   SL+++ GL  + FS C    D+ ++  +  G 
Sbjct: 184 GDTNEGDGFSQGA--GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNSPLLLGS 236

Query: 294 QGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ--------A 335
               ++        Q+T  +    +   Y+V +++  +G++   L  S F          
Sbjct: 237 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGV 296

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           +VDSG S T+L  + Y  +   F   ++       G     C+ A ++ + +V   RL+F
Sbjct: 297 IVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVF 356

Query: 396 ----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
                 +      N++     + G    CLTVM + G   IIG       + V+D  +  
Sbjct: 357 HFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGHDT 412

Query: 452 LAWSHSKCEEV 462
           L+++  +C ++
Sbjct: 413 LSFAPVQCNKL 423


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 91/368 (24%), Positives = 158/368 (42%), Gaps = 51/368 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP  + L+ LD GS+++W P + +            L R + +   + ++ +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAPVRALP----------PLLRAVRQGSSTGAAPA 171

Query: 170 KNV--SCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
                +C  P+C+   S  C   ++ C Y   Y  + + ++G    + L  A  ++    
Sbjct: 172 PTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR---- 226

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DEN 284
             VQ  V IGCG    G ++   A  G++GLG G +S PS +A++     SFS C  D  
Sbjct: 227 --VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRT 278

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS---CLTQSGFQ------- 334
            S       +   T +  +F         Y+V +  + +G +    ++QS  +       
Sbjct: 279 SSRRARPSRRWGGTPRMATF---------YYVHLLGFSVGGARVKGVSQSDLRLNPTTGR 329

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDM 391
              ++DSG S T L   +Y  V   F       R+S  G S +  CYN S   ++KVP +
Sbjct: 330 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTV 389

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
            +  +   S  +    +  P +   T FC  +  TDG   IIG     G R+VFD +  +
Sbjct: 390 SMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQR 448

Query: 452 LAWSHSKC 459
           + +    C
Sbjct: 449 VGFVPKSC 456


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 91/384 (23%), Positives = 158/384 (41%), Gaps = 51/384 (13%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + +GTP     + LD GS+L+W  C  C+ C    A+            DP++
Sbjct: 90  YLMH---VSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAA---------PVLDPAA 137

Query: 166 SSSSKNVSCSHPLCKSR--SSC--KSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           SS+   + C  PLC++   +SC  +S  D  C Y+  Y  + + + G L  D        
Sbjct: 138 SSTHAALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYG-DRSLTVGQLATDSFTFGGDD 196

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
                ++ +  V  GCG    G +   A   G+ G G G  S+PS L        SFS C
Sbjct: 197 NAGGLAARR--VTFGCGHINKGIFQ--ANETGIAGFGRGRWSLPSQLNV-----TSFSYC 247

Query: 281 ----FDENDSGSVFFGDQGP-----------ATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
               FD   S  V  G                  ++T  +    +   YFV +    +G 
Sbjct: 248 FTSMFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGG 307

Query: 326 S--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
           +   + +S  ++  ++DSGAS T LP ++Y  V  +F   V     +    +   C+   
Sbjct: 308 ARVAVPESRLRSSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALP 367

Query: 382 SEEMLK---VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 438
              + +   VP + L       + +    + F E+    V C+ + +  G+  +IG    
Sbjct: 368 VAALWRRPAVPALTLHLDGGADWELPRGNYVF-EDYAARVLCVVLDAAAGEQVVIGNYQQ 426

Query: 439 MGHRIVFDRENLKLAWSHSKCEEV 462
               +V+D EN  L+++ ++C+++
Sbjct: 427 QNTHVVYDLENDVLSFAPARCDKL 450


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 87/358 (24%), Positives = 152/358 (42%), Gaps = 32/358 (8%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP+V  L   D GS+L W+ C  C  C P  A            +DP+ SS+  +V C
Sbjct: 94  LGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPL----------FDPTQSSTYVDVPC 143

Query: 175 SHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
               C    +++  C S K  C Y+  Y T D+ + G L  D +  +S       ++   
Sbjct: 144 ESQPCTLFPQNQRECGSSKQ-CIYLHQYGT-DSFTIGRLGYDTISFSSTGMGQGGATFPK 201

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSG 287
           SV  GC      ++      +G +GLG G +S+ S L     I + FS C   F    +G
Sbjct: 202 SV-FGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTG 258

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIG-NSCLT-QSGFQALVDSGASFT 344
            + FG   P  +  ++   I   Y +Y+V  +E   +G    LT Q G   ++DS    T
Sbjct: 259 KLKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILT 318

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
            L   IY + +    + ++ +        ++YC    +   L  P+    F+     +  
Sbjct: 319 HLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTN--LNFPEFVFHFTGADVVLGP 376

Query: 405 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            ++F   +N    + C+TV+ + G   I G    +  ++ +D    K++++ + C  +
Sbjct: 377 KNMFIALDNN---LVCMTVVPSKG-ISIFGNWAQVNFQVEYDLGEKKVSFAPTNCSTI 430


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 102/412 (24%), Positives = 171/412 (41%), Gaps = 89/412 (21%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYD 162
           Y  +   +  GTP  +  + +D GS+L+W PC     C  C+      +++ + + + + 
Sbjct: 87  YGAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCS------FSTSNPSSNIFI 140

Query: 163 PSSSSSSKNVSCSHPLC------KSRSSCKSLKDPCP--------YIADYSTEDTSSSGY 208
           P SSSSSK + C +P C      K +S C+  +   P        Y+  Y +  T   G 
Sbjct: 141 PKSSSSSKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITG--GI 198

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
           ++ + L L    K  P      + I+GC      S L  + P G+ G G G  S+PS L 
Sbjct: 199 MLSETLDLPG--KGVP------NFIVGC------SVLSTSQPAGISGFGRGPPSLPSQL- 243

Query: 269 KAGLIQNSFSICF------DENDSGSVFFGDQGPATQQST--SFLPIGEKYDA------- 313
             GL    FS C       D  +S S+    +  + +++   S+ P  +           
Sbjct: 244 --GL--KKFSYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFS 299

Query: 314 --YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
             Y++G+    +G   +                 ++DSG +FT++  EI+  V  +F+K 
Sbjct: 300 VYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQ 359

Query: 362 VSSKRIS-LQG-NSWKYCYNASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEGFT 417
           V SKR + ++G    + C+N S       P++ L F         + N++       G  
Sbjct: 360 VQSKRATEVEGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFL---GGDD 416

Query: 418 VFCLTVMSTDGDYG--------IIGQNFMMGHRIV-FDRENLKLAWSHSKCE 460
           V CLT++ TDG  G        II  NF   +  V +D  N +L +    C+
Sbjct: 417 VVCLTIV-TDGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSCK 467


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 104/438 (23%), Positives = 184/438 (42%), Gaps = 77/438 (17%)

Query: 46  NVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQ 105
           N+  +++     SVE +    S     + T  + Q   N+ R  +   +  +Q   + N 
Sbjct: 16  NICFSEALKSGFSVEIIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQISVYSNA 75

Query: 106 F-----------YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSL 154
                       Y + Y+   +GTP       +D  S+++WV CQ  +         T  
Sbjct: 76  VESPVTLLDDGDYLMSYS---LGTPPFPVYGIVDTASDIIWVQCQLCE---------TCY 123

Query: 155 DRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKS-LKDPCPYIADYSTEDTSSSGYLVD 211
           +     +DPS S + KN+ CS   CKS   +SC S  +  C +  +Y  + + S G L+ 
Sbjct: 124 NDTSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYK-DGSHSQGDLIV 182

Query: 212 DILHLASFSK---HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
           + + L S++    H P++      +IGC R    S+       G++GLG G VS+   L+
Sbjct: 183 ETVTLGSYNDPFVHFPRT------VIGCIRNTNVSF----DSIGIVGLGGGPVSLVPQLS 232

Query: 269 KAGLIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIG 324
            +  I   FS C     + S  + FGD    +   T    I  K     Y++ +E++ +G
Sbjct: 233 SS--ISKKFSYCLAPISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVG 290

Query: 325 NSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
           N+ +        +      ++DSG +FT LP ++Y+++      +V  +R       +  
Sbjct: 291 NNRIEFRSSSSRSSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSL 350

Query: 377 CYNASSEEMLKVPDMRLIFSKN-------QSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 429
           CY  S+ + + VP +   FS          +F+V +H           V CL  +S+   
Sbjct: 351 CYK-STYDKVDVPVITAHFSGADVKLNALNTFIVASH----------RVVCLAFLSSQSG 399

Query: 430 YGIIG----QNFMMGHRI 443
             I G    QNF++G+ +
Sbjct: 400 -AIFGNLAQQNFLVGYDL 416


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 95/381 (24%), Positives = 158/381 (41%), Gaps = 67/381 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IG+P   F   +D GS+L+W  C  C+ C      Y          ++P+ S+S  ++
Sbjct: 89  VGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPY----------FEPAKSTSYASL 138

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
            CS  +C +  S    ++ C Y A Y  +  SS+G L ++     +F  ++ + +V   V
Sbjct: 139 PCSSAMCNALYSPLCFQNACVYQAFYG-DSASSAGVLANETF---TFGTNSTRVAVP-RV 193

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSV 289
             GCG    G+  +G+   G++G G G +S+ S L         FS C   F    +  +
Sbjct: 194 SFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSRL 245

Query: 290 FFG-----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------- 328
           +FG             GP   QST F+        YF+ +    +    L          
Sbjct: 246 YFGAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 303

Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYN--ASSEE 384
            T      ++DSG + TFL    YA V   F   V   R  +   +++  C+        
Sbjct: 304 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRR 363

Query: 385 MLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMM 439
           M+ +P+M L F   +    + N++     + G    CL ++ +D D  IIG    QNF M
Sbjct: 364 MVTLPEMVLHFDGADMELPLENYMV---MDGGTGNLCLAMLPSD-DGSIIGSFQHQNFHM 419

Query: 440 GHRIVFDRENLKLAWSHSKCE 460
               ++D EN  L++  + C 
Sbjct: 420 ----LYDLENSLLSFVPAPCN 436


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 147/363 (40%), Gaps = 40/363 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + IG P     + LD GS++ W     +QC P +  Y+    +    ++PSSSSS 
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNW-----LQCTPCADCYH----QTEPIFEPSSSSSY 201

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           + +SC  P C +    +     C Y   Y  + + + G    + L + S        ++ 
Sbjct: 202 EPLSCDTPQCNALEVSECRNATCLYEVSYG-DGSYTVGDFATETLTIGS--------TLV 252

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS- 288
            +V +GCG    G +        V   GL  +    L   + L   SFS C  + DS S 
Sbjct: 253 QNVAVGCGHSNEGLF--------VGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSA 304

Query: 289 --VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--------L 336
             V FG   P        L   +    Y++G+    +G   L   QS F+         +
Sbjct: 305 STVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGII 364

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +DSG + T L T IY  +   F K  S    +     +  CYN S++  ++VP +   F 
Sbjct: 365 IDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFP 424

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 456
             +   +    +  P +   T FCL    T     IIG     G R+ FD  N  + +S 
Sbjct: 425 GGKMLALPAKNYMIPVDSVGT-FCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSS 483

Query: 457 SKC 459
           +KC
Sbjct: 484 NKC 486


>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 438

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 85/379 (22%), Positives = 148/379 (39%), Gaps = 59/379 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           ++IG P   + + +D GS+L W+ C   C +C+      Y                S+  
Sbjct: 81  LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLY--------------RPSNDF 126

Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 228
           V C H LC S     +     P+  DY  +     SS G L+ D+  L +F+       +
Sbjct: 127 VPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL-NFTNGV---QL 182

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
           +  + +GCG  Q          DG++GLG G  S+ S L   GL++N    C      G 
Sbjct: 183 KVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGY 242

Query: 289 VFFGDQGPATQQSTSFLPIGEK-YDAY-FVGVESYCIGNSCLTQSGFQALVDSGASFTFL 346
           +FFGD   +++   ++ P+  + Y  Y   G      G          A+ D+G+S+T+ 
Sbjct: 243 IFFGDVYDSSR--LTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSSYTYF 300

Query: 347 PTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
               Y  ++    K    K +  +    +   C+             R I+   + F  +
Sbjct: 301 NPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRGRRP-------FRSIYEVRKYF--K 351

Query: 405 NHIFSFPEN-----------EGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRI 443
             + SF  N           E + +       CL +++      GD  +IG   M+   +
Sbjct: 352 PIVLSFTSNGRSKAQFEMPPEAYLIISNMGNVCLGILNGSEVGMGDLNLIGDISMLNKVM 411

Query: 444 VFDRENLKLAWSHSKCEEV 462
           VFD +   + W+ + C++V
Sbjct: 412 VFDNDKQLIGWTPADCDQV 430


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 148/367 (40%), Gaps = 46/367 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +++ + +G P     + LD GS++ W+ CQ C  C       Y   D     YDPS S+S
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADC-------YAQSD---PVYDPSVSTS 212

Query: 169 SKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
              V C  P C+    ++C++    C Y   Y  + + + G    + L L         S
Sbjct: 213 YATVGCDSPRCRDLDAAACRNSTGSCLYEVAYG-DGSYTVGDFATETLTLG-------DS 264

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
           +  S+V IGCG    G ++  A    + G  L   S PS ++       +FS C  + DS
Sbjct: 265 APVSNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDS 316

Query: 287 GS---VFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------ 334
            S   + FGD + PA        P    +  Y+V +    +G   L+   S F       
Sbjct: 317 PSSSTLQFGDSEQPAVTAPLIRSPRTNTF--YYVALSGISVGGEALSIPSSAFAMDDAGS 374

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              +VDSG + T L +  Y  +   F +   S   +   + +  CY+ +    ++VP + 
Sbjct: 375 GGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVA 434

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
           L F       +    +  P +   T +CL    T G   IIG     G R+ FD     +
Sbjct: 435 LWFEGGGELKLPAKNYLIPVDAAGT-YCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTV 493

Query: 453 AWSHSKC 459
            ++  KC
Sbjct: 494 GFTADKC 500


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 106/426 (24%), Positives = 172/426 (40%), Gaps = 51/426 (11%)

Query: 56  KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
           K   +   L L  D  R KT   L +  N +R         S      +Q    ++T + 
Sbjct: 76  KTPSQLFHLRLERDAARVKTLTHLAAATNKTRPANPGSGFSSSVVSGLSQGSGEYFTRLG 135

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP     + LD GS+++W+ C+ C +C       Y+  D+    +DPS S S   + C
Sbjct: 136 VGTPPKYLYMVLDTGSDVVWLQCKPCTKC-------YSQTDQ---IFDPSKSKSFAGIPC 185

Query: 175 SHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
             PLC+   S  C    + C Y   Y     +   +  + +    +F + A        V
Sbjct: 186 YSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETL----TFRRAA-----VPRV 236

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----GS 288
            IGCG    G ++  A    ++GLG G +S P+         N FS C  +  +     S
Sbjct: 237 AIGCGHDNEGLFVGAAG---LLGLGRGGLSFPT--QTGTRFNNKFSYCLTDRTASAKPSS 291

Query: 289 VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ-------- 334
           + FGD   A  ++  F P+    K D  Y+V +    +G +    ++ S F+        
Sbjct: 292 IVFGDS--AVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGG 349

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
            ++DSG S T L    Y  +   F    S  + + + + +  CY+ S    +KVP + L 
Sbjct: 350 VIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLH 409

Query: 395 F-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
           F   + S    N++    +N G   FC     T     IIG     G R+VFD    ++ 
Sbjct: 410 FRGADVSLPAANYLVPV-DNSG--SFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVG 466

Query: 454 WSHSKC 459
           ++   C
Sbjct: 467 FAPRGC 472


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 158/378 (41%), Gaps = 48/378 (12%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IG+P   F + LD GS+L W+  QC+ C       +   ++N   YDP  S S +N++C+
Sbjct: 202 IGSPPKHFSLILDTGSDLNWI--QCVPC-------FDCFEQNGPYYDPKDSISFRNITCN 252

Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
            P C+  SS      CK     CPY   Y     ++  + ++   ++L S +    +   
Sbjct: 253 DPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRR 312

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
             +V+ GCG    G +   A    ++GLG G +S  S L    L  +SFS C  + DS +
Sbjct: 313 VENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRDSDT 367

Query: 289 ------VFFGDQGPATQQSTSFL--------PIGEKY----DAYFVGVESYCIGNSCLTQ 330
                 +F  D+   T    +F         P+   Y     + FVG E   I       
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427

Query: 331 SGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
           S   A   ++DSG + ++     Y  +   F + V   ++         CYN S  + L 
Sbjct: 428 SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELN 487

Query: 388 VPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIV 444
            P+  + F+     +F V N+   F   +   + CL ++ T      IIG        I+
Sbjct: 488 FPEFLIQFADGAVWNFPVENY---FIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHIL 544

Query: 445 FDRENLKLAWSHSKCEEV 462
           +D +N +L ++  +C E+
Sbjct: 545 YDTKNSRLGYAPMRCAEI 562


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 105/464 (22%), Positives = 176/464 (37%), Gaps = 94/464 (20%)

Query: 47  VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
           VS  + W   N +  L L  ++  K  KT+  L           LFP             
Sbjct: 47  VSSKNPWGALNHLASLSLSRAHHIKSPKTKFSLLKTP-------LFPRS----------- 88

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEY 161
           Y  +   ++ GTP  +    +D GS+L+W PC     C +C       + +++   +  +
Sbjct: 89  YGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCD------FPNIEVTGIPTF 142

Query: 162 DPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPC---------PYIADYSTEDTSSS 206
            P  SSSS  + C +  C      K +S C+   DP          PY+  Y     S++
Sbjct: 143 IPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQEC-DPTTQNCTQSCPPYVIQYGLG--STA 199

Query: 207 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 266
           G L+ + L         P        ++GC      S      P+G+ G G    S+PS 
Sbjct: 200 GLLLSETLDF-------PHKKTIPGFLVGC------SLFSIRQPEGIAGFGRSPESLPSQ 246

Query: 267 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQS-------TSFL--PIGEKYDAYFVG 317
           L          S  FD+  + S    D G  +  +       T F   P     D Y+V 
Sbjct: 247 LGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVL 306

Query: 318 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 367
           + +  IG++ +          +      +VDSG +FTF+   +Y  V  +F+K V+   +
Sbjct: 307 LRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTV 366

Query: 368 SLQ---GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTV 423
           + +       + C+N S E+ + VP+    F       +   + FSF ++    V CLT+
Sbjct: 367 ATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDSG---VICLTI 423

Query: 424 MSTD--------GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +S +        G   I+G        + FD +N +  +    C
Sbjct: 424 VSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 91/363 (25%), Positives = 147/363 (40%), Gaps = 40/363 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + IG P     + LD GS++ W     +QC P +  Y+    +    ++PSSSSS 
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNW-----LQCTPCADCYH----QTEPIFEPSSSSSY 198

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           + +SC  P C +    +     C Y   Y  + + + G    + L + S        ++ 
Sbjct: 199 EPLSCDTPQCNALEVSECRNATCLYEVSYG-DGSYTVGDFATETLTIGS--------TLV 249

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
            +V +GCG    G +        V   GL  +    L   + L   SFS C  + DS S 
Sbjct: 250 QNVAVGCGHSNEGLF--------VGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSA 301

Query: 290 FFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--------L 336
              D G +        P+   +     Y++G+    +G   L   QS F+         +
Sbjct: 302 STVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGII 361

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +DSG + T L TEIY  +   F K       +     +  CYN S++  ++VP +   F 
Sbjct: 362 IDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFP 421

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 456
             +   +    +  P +   T FCL    T     IIG     G R+ FD  N  + +S 
Sbjct: 422 GGKMLALPAKNYMIPVDSVGT-FCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSS 480

Query: 457 SKC 459
           +KC
Sbjct: 481 NKC 483


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 99/392 (25%), Positives = 162/392 (41%), Gaps = 77/392 (19%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP V+F V  D GS+L+W  C  C +CA           R    + P+SSS+   +
Sbjct: 94  LSIGTPPVTFSVLADTGSSLIWTQCAPCTECA----------ARPAPPFQPASSSTFSKL 143

Query: 173 SCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQS 226
            C+  LC+  +S    C +    C Y   Y    T  +GYL  + LH+  ASF   A   
Sbjct: 144 PCASSLCQFLTSPYLTCNATG--CVYYYPYGMGFT--AGYLATETLHVGGASFPGVAFGC 199

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----D 282
           S ++ V              G +  G++GLG   +   SL+++ G+    FS C     D
Sbjct: 200 STENGV--------------GNSSSGIVGLGRSPL---SLVSQVGV--GRFSYCLRSDAD 240

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL----TQSGF 333
             DS  + FG     T  +    P+ E  +      Y+V +    +G + L    T  GF
Sbjct: 241 AGDS-PILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGF 299

Query: 334 Q----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY----CYN 379
                       +VDSG + T+L  E YA V   F   +++  ++   N  ++    C++
Sbjct: 300 TRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFD 359

Query: 380 ASSE---EMLKVPDMRLIFSKNQSFVVRNH----IFSFPENEGFTVFCLTVM--STDGDY 430
           A++      + VP + L F+    + VR      + +        V CL V+  S     
Sbjct: 360 ATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSI 419

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            IIG    M   +++D +    +++ + C  V
Sbjct: 420 SIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 451


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 161/368 (43%), Gaps = 50/368 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ + IG P     + LD GS++ WV     QCAP +  Y     +    ++P+SS+S 
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWV-----QCAPCADCY----QQADPIFEPASSASF 199

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             +SC+   C+S    +   D C Y   Y  + + + G  V + + L S    AP  +V 
Sbjct: 200 STLSCNTRQCRSLDVSECRNDTCLYEVSYG-DGSYTVGDFVTETITLGS----APVDNVA 254

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS- 288
               IGCG    G ++  A    ++GLG G +S PS +        SFS C  + DS S 
Sbjct: 255 ----IGCGHNNEGLFVGAAG---LLGLGGGSLSFPSQINAT-----SFSYCLVDRDSESA 302

Query: 289 --VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------AL 336
             + F    P    S   L        Y+VG+    +G   ++  +S FQ         +
Sbjct: 303 STLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVI 362

Query: 337 VDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
           VDSG + T L T++Y  +   F K    L S+  I+L    +  CY+ SS+  ++VP + 
Sbjct: 363 VDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIAL----FDTCYDLSSKGNVEVPTVS 418

Query: 393 LIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
             F   +   +    +  P ++EG   FC     T     IIG     G R+V+D  N  
Sbjct: 419 FHFPDGKELPLPAKNYLVPLDSEG--TFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHL 476

Query: 452 LAWSHSKC 459
           + +  +KC
Sbjct: 477 VGFVPNKC 484


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/370 (24%), Positives = 155/370 (41%), Gaps = 53/370 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP    +V +D GS+L W     IQ  P  A +    ++    +DPS SS+   ++
Sbjct: 29  IYLGTPPQKAVVIIDTGSDLTW-----IQSEPCRACF----EQADPIFDPSKSSTYNKIA 79

Query: 174 CSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH--APQSSV 228
           CS   C       +C +  + C Y   Y  + + + GY          FSK       + 
Sbjct: 80  CSSSACADLLGTQTCSAAAN-CIYAYGYG-DGSVTRGY----------FSKETITATDTA 127

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----- 283
              V  G     TG++ D    +G++GLG G VS+PS L    ++ N FS C  +     
Sbjct: 128 GEEVKFGASVYNTGTFGDTGG-EGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAG 184

Query: 284 NDSGSVFFGDQG-PATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ----- 334
           +++ +++FGD   P+ + Q T  +P  +    Y++ V+   +G S L   QS ++     
Sbjct: 185 SETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGG 244

Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
               ++DSG + T+L  E++  +V  +   V     +        C+N         P M
Sbjct: 245 SGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTT-SATGLDLCFNTRGTGSPVFPAM 303

Query: 392 RL-IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDREN 449
            + +   +      N   S   N    + CL   S  D    I G        IV+D +N
Sbjct: 304 TIHLDGVHLELPTANTFISLETN----IICLAFASALDFPIAIFGNIQQQNFDIVYDLDN 359

Query: 450 LKLAWSHSKC 459
           +++ ++ + C
Sbjct: 360 MRIGFAPADC 369


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/381 (24%), Positives = 158/381 (41%), Gaps = 67/381 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IG+P   F   +D GS+L+W  C  C+ C      Y          ++P+ S+S  ++
Sbjct: 92  VGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPY----------FEPAKSTSYASL 141

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
            CS  +C +  S    ++ C Y A Y  +  SS+G L ++     +F  ++ + +V   V
Sbjct: 142 PCSSAMCNALYSPLCFQNACVYQAFYG-DSASSAGVLANETF---TFGTNSTRVAVP-RV 196

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSV 289
             GCG    G+  +G+   G++G G G +S+ S L         FS C   F    +  +
Sbjct: 197 SFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSRL 248

Query: 290 FFG-----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------- 328
           +FG             GP   QST F+        YF+ +    +    L          
Sbjct: 249 YFGAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 306

Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYN--ASSEE 384
            T      ++DSG + TFL    YA V   F   V   R  +   +++  C+        
Sbjct: 307 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRR 366

Query: 385 MLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMM 439
           M+ +P+M L F   +    + N++     + G    CL ++ +D D  IIG    QNF M
Sbjct: 367 MVTLPEMVLHFDGADMELPLENYMV---MDGGTGNLCLAMLPSD-DGSIIGSFQHQNFHM 422

Query: 440 GHRIVFDRENLKLAWSHSKCE 460
               ++D EN  L++  + C 
Sbjct: 423 ----LYDLENSLLSFVPAPCN 439


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 105/413 (25%), Positives = 169/413 (40%), Gaps = 71/413 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++IGTP     V +D GS+L WVPC      C+ C     S      + +S + PS SSS
Sbjct: 16  LNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNS------KLMSAFSPSHSSS 69

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDI 213
           S   SC+ P C    S  +  DPC  +A  S                  T  +G +V   
Sbjct: 70  SYRDSCASPYCTDIHSSDNSFDPCT-VAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGT 128

Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
           L   +   H   + V   +   C      +Y +   P G+ G   G +S PS L   GL+
Sbjct: 129 LTRDTLRVHEGPARVTKDIPKFCFGCVGSTYHE---PIGIAGFVRGTLSFPSQL---GLL 182

Query: 274 QNSFSICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVESYCI 323
           +  FS CF       + N S  +  GD   +++ +  F P+ +     + Y++G+E+  +
Sbjct: 183 KKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITV 242

Query: 324 GNSCLT-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR---ISL 369
           GN   T           Q     L+DSG ++T LP   Y++++  F  +++  R   + +
Sbjct: 243 GNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPRATEVEM 302

Query: 370 QGNSWKYCY------NASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVF-C 420
           +   +  CY      N  +++    P +   F  N SFV+   NH ++       TV  C
Sbjct: 303 RA-GFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTVVKC 361

Query: 421 LTVMS-TDGDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 469
           L   S  D DY   G+ G       +IV+D E  ++ +    C        +H
Sbjct: 362 LLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDCASAAVSQGLH 414


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 158/378 (41%), Gaps = 48/378 (12%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IG+P   F + LD GS+L W+  QC+ C       +   ++N   YDP  S S +N++C+
Sbjct: 202 IGSPPKHFSLILDTGSDLNWI--QCVPC-------FDCFEQNGPYYDPKDSISFRNITCN 252

Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
            P C+  SS      CK     CPY   Y     ++  + ++   ++L S +    +   
Sbjct: 253 DPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRR 312

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
             +V+ GCG    G +   A    ++GLG G +S  S L    L  +SFS C  + DS +
Sbjct: 313 VENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRDSDT 367

Query: 289 ------VFFGDQGPATQQSTSFL--------PIGEKY----DAYFVGVESYCIGNSCLTQ 330
                 +F  D+   T    +F         P+   Y     + FVG E   I       
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427

Query: 331 SGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
           S   A   ++DSG + ++     Y  +   F + V   ++         CYN S  + L 
Sbjct: 428 SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELN 487

Query: 388 VPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIV 444
            P+  + F+     +F V N+   F   +   + CL ++ T      IIG        I+
Sbjct: 488 FPEFLIQFADGAVWNFPVENY---FIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHIL 544

Query: 445 FDRENLKLAWSHSKCEEV 462
           +D +N +L ++  +C E+
Sbjct: 545 YDTKNSRLGYAPMRCAEI 562


>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 440

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 88/377 (23%), Positives = 149/377 (39%), Gaps = 55/377 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           ++IG P   + + +D GS+L W+ C     AP S    T          P    S+  V 
Sbjct: 83  LNIGQPPRPYFLDIDTGSDLTWLQCD----APCSRCSQTP--------HPLYRPSNDLVP 130

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           C H LC S     +     P+  DY  +     SS G L+ D+  L +F+       ++ 
Sbjct: 131 CRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL-NFTNGV---QLKV 186

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
            + +GCG  Q          DG++GLG G  S+ S L   GL++N    C      G +F
Sbjct: 187 RMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIF 246

Query: 291 FGDQGPATQQSTSFLPIGEK-YDAYFV-GVESYCIGNSCLTQSGFQALVDSGASFTFLPT 348
           FGD   + +   ++ P+  + Y  Y V G      G          A+ D+G+S+T+  +
Sbjct: 247 FGDVYDSFR--LTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSSYTYFNS 304

Query: 349 EIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH 406
             Y  ++    K    K +  +    +   C+             R I+   + F  +  
Sbjct: 305 YAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRGRRP-------FRSIYEVRKYF--KPI 355

Query: 407 IFSFPEN-----------EGFTV------FCLTVMSTD----GDYGIIGQNFMMGHRIVF 445
           + SF  N           E + +       CL +++      GD  +IG   M+   +VF
Sbjct: 356 VLSFTSNGRSKAQFEMLPEAYLIVSNMGNVCLGILNGSEVGMGDLNLIGDISMLNKVMVF 415

Query: 446 DRENLKLAWSHSKCEEV 462
           D +   + W+ + C++V
Sbjct: 416 DNDKQLIGWAPADCDQV 432


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 98/391 (25%), Positives = 164/391 (41%), Gaps = 66/391 (16%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP   F + +D GS+L W+ C  C+ C           ++    +DP++SSS +NV+C
Sbjct: 157 VGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNVTC 206

Query: 175 SHPLC------------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
               C              R+  +  +DPCPY   Y  +  ++        L L SF+ +
Sbjct: 207 GDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGD------LALESFTVN 260

Query: 223 --APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
             AP +S +   V+ GCG +  G +   A   G+    L   S   L A  G   ++FS 
Sbjct: 261 LTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSY 315

Query: 280 CFDEN--DSGS-VFFGDQGPATQ-------QSTSFLPIGEKYDA----YFVGVESYCIGN 325
           C  ++  D GS V FG+   A         + T+F P           Y+V ++   +G 
Sbjct: 316 CLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGG 375

Query: 326 SCLTQS----------GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSW 374
             L  S              ++DSG + ++     Y  +   F D++  S  +  +    
Sbjct: 376 ELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVL 435

Query: 375 KYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYG 431
             CYN S  E  +VP++ L+F+      F   N+     + +G ++ CL V+ T      
Sbjct: 436 SPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRL-DPDGGSIMCLAVLGTPRTGMS 494

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IIG        +V+D +N +L ++  +C EV
Sbjct: 495 IIGNFQQQNFHVVYDLQNNRLGFAPRRCAEV 525


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 104/397 (26%), Positives = 161/397 (40%), Gaps = 75/397 (18%)

Query: 106 FYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPS 164
           F  LH+T  + IGTP     + LD GS+L+W  C+            T   R    YDP+
Sbjct: 84  FGRLHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFD---------TRQHREKPLYDPA 134

Query: 165 SSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
            SSS     C   LC++ S    +C   ++ C Y  +Y +  T   G L  +     +F 
Sbjct: 135 KSSSFAAAPCDGRLCETGSFNTKNCS--RNKCIYTYNYGSATT--KGELASETF---TFG 187

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
           +H     V  S+  GCG+  +GS L GA+  G++G+    +S+ S L         FS C
Sbjct: 188 EH---RRVSVSLDFGCGKLTSGS-LPGAS--GILGISPDRLSLVSQLQIP-----RFSYC 236

Query: 281 ----FDENDSGSVFFG---------DQGPATQQSTSFLPIGEKYDAYF------------ 315
                D N +  +FFG           GP    S    P G  Y  Y             
Sbjct: 237 LTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRL 296

Query: 316 -VGVESYCIGNSCLTQSGFQALVDSGASFTFLPT---EIYAEVVVKFDKLVSSKRISLQG 371
            V V S+ IG      SG    VDSG +   LP+   E   E +V+  KL         G
Sbjct: 297 NVPVSSFAIGRD---GSG-GTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATD-HG 351

Query: 372 NSWKYCY------NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 425
             ++ C+        + E  ++VP +   F    + ++R   +    + G    CL V+S
Sbjct: 352 YEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSAG--RMCL-VIS 408

Query: 426 TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           +     IIG        ++FD EN + +++ ++C ++
Sbjct: 409 SGARGAIIGNYQQQNMHVLFDVENHEFSFAPTQCNQI 445


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 86/365 (23%), Positives = 149/365 (40%), Gaps = 38/365 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I IGTP        D GS+L+W      QC P  + Y     +    +DPS S+S K VS
Sbjct: 95  ISIGTPPFDVYGIYDTGSDLMWT-----QCLPCLSCY----KQKNPMFDPSKSTSFKEVS 145

Query: 174 CSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C    C+     SC   +  C +   Y  + + + G +  + L L S   ++ Q     +
Sbjct: 146 CESQQCRLLDTVSCSQPQKLCDFSYGYG-DGSLAQGVIATETLTLNS---NSGQPXSIXN 201

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDS 286
           ++ GCG   +G++ +     G+ G G   +S+ S +         FS C      D + +
Sbjct: 202 IVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSIT 259

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGN--------SCLTQSGFQAL 336
             + FG +   +       P+  K D   YFV ++   +G+        S +   G    
Sbjct: 260 SKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKG-NVF 318

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +D+G   T LP + Y  +V    + +  + +       + CY +++  ++  P +   F 
Sbjct: 319 IDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSAT--LIDGPILTAHFD 376

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 456
                +   + F  P+ EG  V+C  +   DGD GI G    M   I FD +  K+++  
Sbjct: 377 GADVQLKPLNTFISPK-EG--VYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKA 433

Query: 457 SKCEE 461
             C +
Sbjct: 434 VDCTK 438


>gi|298707682|emb|CBJ25999.1| aspartyl protease [Ectocarpus siliculosus]
          Length = 547

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 155/371 (41%), Gaps = 35/371 (9%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y  H+ +I  GTP     V ++ GS+    PC +C  C   +  Y+          DPS 
Sbjct: 105 YGTHFAYIYAGTPPQRASVIINTGSHFSAFPCSECRSCGNHTDPYW----------DPSQ 154

Query: 166 SSSSKNVSCSH-PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA----SFS 220
           SS++  V+C     C     C+S K  C  + ++ TE +S     VDD+L +     S S
Sbjct: 155 SSTAHIVTCDETERCHGAYKCQSDKK-C-VLREHYTEGSSWRAKQVDDLLWVGERTLSDS 212

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSI 279
           +    S+       GC    TG +    A DG+MGL     ++ + LA AG I +  FS+
Sbjct: 213 QKHDDSAFSVDFTFGCIESLTGLFKTQLA-DGIMGLNADSRTLITQLATAGKISERKFSL 271

Query: 280 CFDENDSGSVFFGDQGPATQQSTS---FLPIGEKYDAYFVGVESYCIGNSCLT------Q 330
           CF E   G++  G   P   +  S   + P   +  A  V V    +    +T      Q
Sbjct: 272 CFSET-GGTMVIGGYDPLLNKPGSEMQYTPSTGEISAPTVKVTDVTLNGVSITTDASVFQ 330

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
            G    + SG + T+LP  +       ++    S   + + N  ++C   ++ E+  +P 
Sbjct: 331 KGTGIKIVSGTTNTYLPRAVAEGFSAAWEAATGSPYATCKMN--EFCMTRTTVELEALPV 388

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVF-CLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
           + +         VR   +    ++   V+  L    + G  G++G N +  H +VFD +N
Sbjct: 389 LMIHMDGGVEVNVRPEAYMDASSDEENVYPSLPPPCSMG--GVLGANLLRDHNVVFDYDN 446

Query: 450 LKLAWSHSKCE 460
             + ++   C+
Sbjct: 447 HVVGFADGACD 457


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 100/389 (25%), Positives = 162/389 (41%), Gaps = 75/389 (19%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP+ S  + LD GS L W+ C   +         TS       +DPS SSS  ++ 
Sbjct: 84  LPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTS-------FDPSLSSSFSDLP 136

Query: 174 CSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           CSHPLCK R       +SC S +  C Y   Y+ + T + G LV +    ++     P  
Sbjct: 137 CSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVKEKFTFSNSQTTPP-- 192

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN-- 284
                +I+GC ++ T          G++G+ LG +   S +++A + + S+ I    N  
Sbjct: 193 -----LILGCAKESTDE-------KGILGMNLGRL---SFISQAKISKFSYCIPTRSNRP 237

Query: 285 ---DSGSVFFGDQG-------------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
               +GS + GD               P +Q+  +  P+     AY V ++   IG   L
Sbjct: 238 GLASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPL-----AYTVPLQGIRIGQKRL 292

Query: 329 TQSG----------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKY 376
              G           Q +VDSG+ FT L    Y +V  +  +LV S  K+  + G++   
Sbjct: 293 NIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADM 352

Query: 377 CY--NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD---GDYG 431
           C+  N S E    + D+   F +    +V     S   N G  + C+ +  +        
Sbjct: 353 CFDGNHSMEIGRLIGDLVFEFGRGVEILVEKQ--SLLVNVGGGIHCVGIGRSSMLGAASN 410

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           IIG        + FD  N ++ +S ++C 
Sbjct: 411 IIGNVHQQNLWVEFDVTNRRVGFSKAECR 439


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/384 (23%), Positives = 160/384 (41%), Gaps = 59/384 (15%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP   F + LD GS+L W+  QC+ C       Y    +N + YDP +S+S KN++C+
Sbjct: 168 VGTPPKHFSLILDTGSDLNWL--QCLPC-------YDCFHQNEAFYDPKTSASFKNITCN 218

Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
            P C   SS      CKS    CPY   Y     ++  + V+   ++L +    + +  V
Sbjct: 219 DPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKV 278

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DE 283
           + +++ GCG    G +   +   G+    L   S         L  +SFS C      D 
Sbjct: 279 E-NMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVDRNSDT 332

Query: 284 NDSGSVFFGDQGPATQQS----TSFLPIGEK--YDAYFVGVESYCIGNSCLT-------- 329
           N S  + FG+       +    TSF+   E      Y++ ++S  +G   L         
Sbjct: 333 NVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNI 392

Query: 330 --QSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS--SEE 384
                   ++DSG + ++     Y  +  KF +K+  +  +         C+N S   E 
Sbjct: 393 SPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEEN 452

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLTVMST-DGDYGIIGQNFM 438
            + +P++ + F+          +++FP    F      + CL ++ T    + IIG    
Sbjct: 453 NIHLPELGIAFADGA-------VWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQ 505

Query: 439 MGHRIVFDRENLKLAWSHSKCEEV 462
               I++D +  +L ++ +KC ++
Sbjct: 506 QNFHILYDTKMSRLGFTPTKCADI 529


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 109/434 (25%), Positives = 167/434 (38%), Gaps = 67/434 (15%)

Query: 55  KKNSVEYLELLLSNDWK----RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
           K  S +++E+L  +  +      K   KL +++ S       P++   T   GN     +
Sbjct: 50  KATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGN-----Y 104

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
              + +GTP     +  D GS+L W  CQ C++         T  D+    ++PS S+S 
Sbjct: 105 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR---------TCYDQKEPIFNPSKSTSY 155

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
            NVSCS   C S SS       C      Y   Y  + + S G+L  +   L        
Sbjct: 156 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFLAKEKFTLT------- 207

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
            S V   V  GCG    G +   A   G++GLG   +S PS  A A      FS C   +
Sbjct: 208 NSDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSS 262

Query: 285 DS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVGVESYCIGNSCLTQSG 332
            S  G + FG  G    +S  F PI    D          A  VG +   I ++  +  G
Sbjct: 263 ASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPG 320

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             AL+DSG   T LP + YA +   F   +S    +   +    C++ S  + + +P + 
Sbjct: 321 --ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVA 378

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVF-----CLTVM--STDGDYGIIGQNFMMGHRIVF 445
             FS          +        F VF     CL     S D +  I G        +V+
Sbjct: 379 FSFSGGA-------VVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVY 431

Query: 446 DRENLKLAWSHSKC 459
           D    ++ ++ + C
Sbjct: 432 DGAGGRVGFAPNGC 445


>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 62/226 (27%), Positives = 105/226 (46%), Gaps = 18/226 (7%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYD 162
           N    ++YT + IGTP   F V +D GS++LWV C  C+ C PL         +N++ +D
Sbjct: 76  NPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISCVGC-PL---------QNVTFFD 125

Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           P +SSS+  ++CS   C S    KS   P  Y  +YS + + +SGY + D++   +    
Sbjct: 126 PGASSSAVKLACSDKRCFSDLHKKSGCSPLEYKVEYS-DGSFTSGYYISDLISFETVMSS 184

Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
                  +  + GC     G   L   +  G++GLG G + V S L+   L    FS+C 
Sbjct: 185 NLTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCL 244

Query: 282 D--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
              +   G +  G+       +T + P+      Y V ++++ + +
Sbjct: 245 SGGQEGGGVIILGEN---RLPNTVYTPLVRSQTHYNVNLKTFAVND 287


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/365 (23%), Positives = 150/365 (41%), Gaps = 38/365 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I IGTP        D GS+L+W      QC P  + Y     +    +DPS S+S K VS
Sbjct: 95  ISIGTPPFDVYGIYDTGSDLMWT-----QCLPCLSCY----KQKNPMFDPSKSTSFKEVS 145

Query: 174 CSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C    C+     SC   +  C +   Y  + + + G +  + L L S   ++ Q +   +
Sbjct: 146 CESQQCRLLDTVSCSQPQKLCDFSYGYG-DGSLAQGVIATETLTLNS---NSGQPTSILN 201

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDS 286
           ++ GCG   +G++ +     G+ G G   +S+ S +         FS C      D + +
Sbjct: 202 IVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSIT 259

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGN--------SCLTQSGFQAL 336
             + FG +   +       P+  K D   YFV ++   +G+        S +   G    
Sbjct: 260 SKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKG-NVF 318

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +D+G   T LP + Y  +V    + +  + +       + CY +++  ++  P +   F 
Sbjct: 319 IDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSAT--LIDGPILTAHFD 376

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 456
                +   + F  P+ EG  V+C  +   DGD GI G    M   I FD +  K+++  
Sbjct: 377 GADVQLKPLNTFISPK-EG--VYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKA 433

Query: 457 SKCEE 461
             C +
Sbjct: 434 VDCTK 438


>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
          Length = 390

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 97/407 (23%), Positives = 168/407 (41%), Gaps = 54/407 (13%)

Query: 80  QSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ 139
           Q  +N   + ++FP +G   + +   FY +    + IG P   + + +D+GS+L W+ C 
Sbjct: 11  QPISNRMGHTVVFPLQG---NVYPQGFYSVS---LRIGNPPKPYTLDIDSGSDLTWLQCD 64

Query: 140 --CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPC 192
             C+ C                   P    +   ++C+ P+C      S+  CK+  + C
Sbjct: 65  APCVSCT--------------KAPHPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQC 110

Query: 193 PYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
            Y   Y+ +  SS G LV DI  L L + +  AP+      +  GCG  Q  SY    AP
Sbjct: 111 DYEVSYA-DHGSSLGVLVHDIFSLQLTNGTLAAPR------LAFGCGYDQ--SYPGPNAP 161

Query: 251 ---DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 307
              DGV+GLG G  S+ + L   GLI++    C      G +F GD   +T     + P+
Sbjct: 162 PFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGL-STTPGIIWTPM 220

Query: 308 GEK--YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
             K    AY +G              G + + DSG+S+T+   + Y   +    K ++ K
Sbjct: 221 SRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGK 280

Query: 366 RISLQGNSWKYCYNASS--EEMLKVPD----MRLIFSKNQSFVVRNHIFSFPENEGFTVF 419
                  S   C+  +   + + +V +      L F+K +S  ++    S+         
Sbjct: 281 LKETADESLPVCWRGAKPFKSIFEVKNYFKPFALSFTKAKSAQLQLPPESYLIISKHGNA 340

Query: 420 CLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           CL +++      GD  +IG        +++D E  ++ W    C ++
Sbjct: 341 CLGILNGSEVGLGDSNVIGDIAFQDKMVIYDNERQQIGWVPKDCNKL 387


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 157/378 (41%), Gaps = 44/378 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++    +GTP   F++  D GS+L WV C+  + +   AS   S       + P++S S 
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLAS----PRVFRPANSKSW 165

Query: 170 KNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFS 220
             + CS   CKS         S+  +   PC Y  DY  +D SS+ G +  D   +A   
Sbjct: 166 APIPCSSDTCKSYVPFSLANCSAGTTPPAPCGY--DYRYKDKSSARGVVGTDAATIALSG 223

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
             + + +    V++GC     G     +  DGV+ LG  ++S  S    A      FS C
Sbjct: 224 SGSDRKAKLQEVVLGCTTSYDGQSFQSS--DGVLSLGNSNISFASR--AAARFGGRFSYC 279

Query: 281 F-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCL------ 328
                   N +  + FG  G A   S + L +  +   ++ V V++  +    L      
Sbjct: 280 LVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEV 339

Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYN-ASSEE 384
              +    A++DSG S T L T  Y  VV    K L    R+++  + ++YCYN  ++  
Sbjct: 340 WDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTM--DPFEYCYNWTATRR 397

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GIIGQNFMMGH 441
              VP + + F+   S  +R    S+  +    V C+ +   +G +    +IG      H
Sbjct: 398 PPAVPRLEVRFAG--SARLRPPTKSYVIDAAPGVKCIGLQ--EGVWPGVSVIGNILQQEH 453

Query: 442 RIVFDRENLKLAWSHSKC 459
              FD  N  L +  S+C
Sbjct: 454 LWEFDLANRWLRFQESRC 471


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 100/402 (24%), Positives = 160/402 (39%), Gaps = 76/402 (18%)

Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
           P+ S  + +D GS+L+W PC   +C      +  +   N++         S  VSC  P 
Sbjct: 29  PSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITR--------SHRVSCQSPA 80

Query: 179 CKSRSSCKSLKDPCPY----IADYSTEDTSSSG-----YLVDDILHLASFSKHAPQSSVQ 229
           C +  S  S  D C      + +  T D SS+      Y   D     SF  H  + ++ 
Sbjct: 81  CSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGD----GSFIAHLHRDTLS 136

Query: 230 SSVI------IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-- 280
            S +       GC           A P GV G G G +S+P+ LA  +  + N FS C  
Sbjct: 137 MSQLFLKNFTFGCAHTAL------AEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLV 190

Query: 281 ---FDE---NDSGSVFFGDQGPATQQSTSFL---PIGEKYDAYF--VGVESYCIGNSCL- 328
              FD+        +  G     + +   F+    +     +YF  VG+    +G   + 
Sbjct: 191 SHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTIL 250

Query: 329 ---------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRIS--LQGNSWK 375
                     +     +VDSG +FT LP  +Y  VV +FD+ V    KR S   +     
Sbjct: 251 APEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGLG 310

Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPENEG---FTVFCLTVMS---- 425
            CY    E +++VP +   F  N S V+    N+ + F + E      V CL +M+    
Sbjct: 311 PCYFL--EGLVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNGGDD 368

Query: 426 ---TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 464
              + G   I+G     G  +V+D EN ++ ++  +C  + D
Sbjct: 369 TELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCASLWD 410


>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 453

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 98/414 (23%), Positives = 172/414 (41%), Gaps = 56/414 (13%)

Query: 80  QSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ 139
           Q  +N   + ++FP +G   + +   FY +    + IG P   + + +D+GS+L W+ C 
Sbjct: 44  QPISNRMGHTVVFPLQG---NVYPQGFYSVS---LRIGNPPKPYTLDIDSGSDLTWLQCD 97

Query: 140 --CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPC 192
             C+ C                   P    +   ++C+ P+C      S+  CK+  + C
Sbjct: 98  APCVSCT--------------KAPHPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQC 143

Query: 193 PYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
            Y   Y+ +  SS G LV DI  L L + +  AP+      +  GCG  Q  SY    AP
Sbjct: 144 DYEVSYA-DHGSSLGVLVHDIFSLQLTNGTLAAPR------LAFGCGYDQ--SYPGPNAP 194

Query: 251 ---DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 307
              DGV+GLG G  S+ + L   GLI++    C      G +F GD   +T     + P+
Sbjct: 195 PFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGL-STTPGIIWTPM 253

Query: 308 GEK--YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
             K    AY +G              G + + DSG+S+T+   + Y   +    K ++ K
Sbjct: 254 SRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGK 313

Query: 366 RISLQGNSWKYCYNASS--EEMLKVPD----MRLIFSKNQSFVVRNHIFSFPENEGFTVF 419
                  S   C+  +   + + +V +      L F+K +S  ++    S+         
Sbjct: 314 LKETADESLPVCWRGAKPFKSIFEVKNYFKPFALSFTKAKSAQLQLPPESYLIISKHGNA 373

Query: 420 CLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV--IDKSH 467
           CL +++      GD  +IG        +++D E  ++ W    C ++  +D+ +
Sbjct: 374 CLGILNGSEVGLGDSNVIGDIAFQDKMVIYDNERQQIGWVPKDCNKLPKVDRDY 427


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 93/375 (24%), Positives = 159/375 (42%), Gaps = 59/375 (15%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP +++   +D GS+L+W  C+ C+ C            ++   +DPSSSS+   V C
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATVPC 222

Query: 175 SHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           S   C     S C S    C Y   Y  + +S+ G L  +   LA         S    V
Sbjct: 223 SSASCSDLPTSKCTSASK-CGYTYTYG-DSSSTQGVLATETFTLA--------KSKLPGV 272

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSV 289
           + GCG    G      A  G++GLG G +   SL+++ GL  + FS C    D+ ++  +
Sbjct: 273 VFGCGDTNEGDGFSQGA--GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNSPL 325

Query: 290 FFGD--------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ----- 334
             G            ++ Q+T  +    +   Y+V +++  +G++   L  S F      
Sbjct: 326 LLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDG 385

Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
               +VDSG S T+L  + Y  +   F   ++       G     C+ A ++ + +V   
Sbjct: 386 TGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVP 445

Query: 392 RLIF----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           RL+F      +      N++     + G    CLTVM + G   IIG       + V+D 
Sbjct: 446 RLVFHFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVYDV 501

Query: 448 ENLKLAWSHSKCEEV 462
            +  L+++  +C ++
Sbjct: 502 GHDTLSFAPVQCNKL 516


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 106/445 (23%), Positives = 185/445 (41%), Gaps = 72/445 (16%)

Query: 22  VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRV-KLQ 80
           + F++ L+HR S ++                 P  N +E     L N   R   RV    
Sbjct: 29  LGFTADLIHRDSPKS-----------------PFYNPMETSSQRLRNAIHRSVNRVFHFT 71

Query: 81  SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
             +N+ + Q+   S   +        Y ++   + IGTP    +   D GS+LLW     
Sbjct: 72  EKDNTPQPQIDLTSNSGE--------YLMN---VSIGTPPFPIMAIADTGSDLLWT---- 116

Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIAD 197
            QCAP     YT +D     +DP +SS+ K+VSCS   C   ++++SC +  + C Y   
Sbjct: 117 -QCAPCD-DCYTQVD---PLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLS 171

Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
           Y  +++ + G +  D L L S      Q     ++IIGCG    G++      +      
Sbjct: 172 YG-DNSYTKGNIAVDTLTLGSSDTRPMQ---LKNIIIGCGHNNAGTF------NKKGSGI 221

Query: 258 LGDVSVP-SLLAKAG-LIQNSFSICF-----DENDSGSVFFGDQGPATQQ---STSFLPI 307
           +G    P SL+ + G  I   FS C       ++ +  + FG     +     ST  +  
Sbjct: 222 VGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAK 281

Query: 308 GEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
             +   Y++ ++S  +G+  +         S    ++DSG + T LPTE Y+E+      
Sbjct: 282 ASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVAS 341

Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 420
            + +++     +    CY+A+ +  LKVP + + F      +  ++ F    +E    F 
Sbjct: 342 SIDAEKKQDPQSGLSLCYSATGD--LKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFA 398

Query: 421 LTVMSTDGDYGIIGQ-NFMMGHRIV 444
                +   YG + Q NF++G+  V
Sbjct: 399 FRGSPSFSIYGNVAQMNFLVGYDTV 423


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 106/445 (23%), Positives = 185/445 (41%), Gaps = 72/445 (16%)

Query: 22  VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRV-KLQ 80
           + F++ L+HR S ++                 P  N +E     L N   R   RV    
Sbjct: 29  LGFTADLIHRDSPKS-----------------PFYNPMETSSQRLRNAIHRSVNRVFHFT 71

Query: 81  SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
             +N+ + Q+   S   +        Y ++   + IGTP    +   D GS+LLW     
Sbjct: 72  EKDNTPQPQIDLTSNSGE--------YLMN---VSIGTPPFPIMAIADTGSDLLWT---- 116

Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIAD 197
            QCAP     YT +D     +DP +SS+ K+VSCS   C   ++++SC +  + C Y   
Sbjct: 117 -QCAPCD-DCYTQVD---PLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLS 171

Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
           Y  +++ + G +  D L L S      Q     ++IIGCG    G++      +      
Sbjct: 172 YG-DNSYTKGNIAVDTLTLGSSDTRPMQ---LKNIIIGCGHNNAGTF------NKKGSGI 221

Query: 258 LGDVSVP-SLLAKAG-LIQNSFSICF-----DENDSGSVFFGDQGPATQQ---STSFLPI 307
           +G    P SL+ + G  I   FS C       ++ +  + FG     +     ST  +  
Sbjct: 222 VGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAK 281

Query: 308 GEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
             +   Y++ ++S  +G+  +         S    ++DSG + T LPTE Y+E+      
Sbjct: 282 ASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVAS 341

Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 420
            + +++     +    CY+A+ +  LKVP + + F      +  ++ F    +E    F 
Sbjct: 342 SIDAEKKQDPQSGLSLCYSATGD--LKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFA 398

Query: 421 LTVMSTDGDYGIIGQ-NFMMGHRIV 444
                +   YG + Q NF++G+  V
Sbjct: 399 FRGSPSFSIYGNVAQMNFLVGYDTV 423


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 149/369 (40%), Gaps = 52/369 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + IG P     V LD GS++ W     IQCAP S  Y  S       +DP SS+S 
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSW-----IQCAPCSECYQQSD----PIFDPISSNSY 199

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             + C  P CKS    +     C Y   Y  + + + G    + + L         S+  
Sbjct: 200 SPIRCDEPQCKSLDLSECRNGTCLYEVSYG-DGSYTVGEFATETVTLG--------SAAV 250

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
            +V IGCG    G ++  A   G+ G  L   S P     A +   SFS C    DS +V
Sbjct: 251 ENVAIGCGHNNEGLFVGAAGLLGLGGGKL---SFP-----AQVNATSFSYCLVNRDSDAV 302

Query: 290 F---FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA--------L 336
               F    P    +   +   E    Y++G++   +G   L   +S F+         +
Sbjct: 303 STLEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGII 362

Query: 337 VDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
           +DSG + T L +E+Y  +   F K    +  +  +SL    +  CY+ SS E +++P + 
Sbjct: 363 IDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSL----FDTCYDLSSRESVEIPTVS 418

Query: 393 LIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
             F + +   +  RN++      +    FC     T     IIG     G R+ FD  N 
Sbjct: 419 FRFPEGRELPLPARNYLIPV---DSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANS 475

Query: 451 KLAWSHSKC 459
            + +S   C
Sbjct: 476 LVGFSVDSC 484


>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 406

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 72/279 (25%), Positives = 124/279 (44%), Gaps = 31/279 (11%)

Query: 80  QSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
           +  ++  +N  LFP         GN F   L+YT I +G+P   + + +D GS+  WV C
Sbjct: 134 RGGDDWPQNSTLFPHS-----LAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQC 188

Query: 139 QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 198
               CA  +   +         Y P+ ++ +  +  S PLC+   +     + C Y   Y
Sbjct: 189 DAPPCASCAKGAHPL-------YRPARTADA--LPASDPLCE--GAQHENPNQCDYEISY 237

Query: 199 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLG 257
           +   +S   Y+ D +  +    +        + ++ GCG  Q G  L+     DGV+GL 
Sbjct: 238 ADGSSSMGVYVRDSMQFVGEDGERE-----NADIVFGCGYDQQGVLLNALETTDGVLGLT 292

Query: 258 LGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPI--GEKYD 312
              +S+P+ LA  G+I N+F  C   + SG+   +F GD     +   +++PI  G   D
Sbjct: 293 NKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDY-IPRWGMTWVPIRDGPADD 351

Query: 313 AYFVGVESYCIGNSCLTQSG--FQALVDSGASFTFLPTE 349
                V+    G+  L   G   Q + D+G+++T+ P E
Sbjct: 352 VRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDE 390


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 101/397 (25%), Positives = 163/397 (41%), Gaps = 59/397 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +++GTP     V +D GS+L WVPC  +    +  + Y + ++ +S Y PS SSSS    
Sbjct: 16  LNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRN-NKLMSTYSPSYSSSSLRDL 74

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLAS 218
           C  PLC    S  +  DPC  +A  S                  T  +G +V   L   +
Sbjct: 75  CVSPLCSDVHSSDNSYDPCA-VAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDT 133

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
            + H    S    V   C      +Y +   P G+ G G G +S+PS L   G +Q  FS
Sbjct: 134 LTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 187

Query: 279 ICF-------DENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
            CF       + N S  +  GD   ++    Q TS L      + Y++G+E+  +GN+  
Sbjct: 188 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 247

Query: 329 TQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GNSWK 375
            Q                ++DSG ++T LP   Y +++     +++  R   Q     + 
Sbjct: 248 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 307

Query: 376 YCY------NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVF-CLTVMST 426
            CY      N  ++    +P +   FS N S V+   NH ++       TV  CL + + 
Sbjct: 308 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNM 367

Query: 427 D----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           D    G  G+ G       ++V+D E  ++ +    C
Sbjct: 368 DDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 100/391 (25%), Positives = 161/391 (41%), Gaps = 66/391 (16%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + IGTP     + LD GS+L+W  C+ C  C            R L   DPS+
Sbjct: 415 YLVH---LAIGTPPQPVQLILDTGSDLVWTQCRPCPVC----------FSRALGPLDPSN 461

Query: 166 SSSSKNVSCSHPLCKSR--SSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           SS+   + CS P+C +   SSC         C Y+  Y      + G +    L   +F+
Sbjct: 462 SSTFDVLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAY------ADGSITTGHLDAETFT 515

Query: 221 KHAPQSSVQSSV---IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
             A   + Q++V     GCG    G +       G+ G G G +S+PS L       ++F
Sbjct: 516 FAAADGTGQATVPDLAFGCGLFNNGIFTSNET--GIAGFGRGALSLPSQLKV-----DNF 568

Query: 278 SICFDE---NDSGSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS-- 326
           S CF     ++  SV  G             QST  +       AY++ ++   +G++  
Sbjct: 569 SHCFTAITGSEPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRL 628

Query: 327 -------CLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----W 374
                   L Q G    ++DSG   T LP + Y  V    D   +  R+ +   +     
Sbjct: 629 PIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLV---HDAFTAQVRLPVDNATSSSLS 685

Query: 375 KYCYNASSEEMLK--VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYG 431
           + C++ S     K  VP + L F      + R N++F F E+ G +V CL + + D D  
Sbjct: 686 RLCFSFSVPRRAKPDVPKLVLHFEGATLDLPRENYMFEF-EDAGGSVTCLAINAGD-DLT 743

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IIG        +++D     L++  ++C  +
Sbjct: 744 IIGNYQQQNLHVLYDLVRNMLSFVPAQCNRL 774


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 92/379 (24%), Positives = 157/379 (41%), Gaps = 47/379 (12%)

Query: 103 GNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
           G +F  L+Y   + +G+ N+S +V  D GS+L WV     QC P  + Y    ++N   +
Sbjct: 114 GIKFQTLNYIVTMGLGSQNMSVIV--DTGSDLTWV-----QCEPCRSCY----NQNGPLF 162

Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDP-----CPYIADYSTEDTSSSGYLVDDILHL 216
            PS+S S + + C+   C+S        DP     C Y+ +Y  + + +SG L  + L  
Sbjct: 163 KPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYG-DGSYTSGELGIEKLGF 221

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
              S         S+ + GCGR   G +       G+MGLG  ++S+ S           
Sbjct: 222 GGISV--------SNFVFGCGRNNKGLF---GGASGLMGLGRSELSMIS--QTNATFGGV 268

Query: 277 FSICFDEND----SGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGNSC 327
           FS C    D    SGS+  G+Q    +       T  LP  +  + Y + +    +G   
Sbjct: 269 FSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVS 328

Query: 328 L--TQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
           L    S F     ++DSG   + L   +Y  +  KF +  S    +   +    C+N + 
Sbjct: 329 LHVQASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTG 388

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIGQNFMMG 440
            + + +P + + F  N    V      +   E  +  CL + S   +Y  GIIG      
Sbjct: 389 YDQVNIPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRN 448

Query: 441 HRIVFDRENLKLAWSHSKC 459
            R+++D +  ++ ++   C
Sbjct: 449 QRVLYDAKLSQVGFAKEPC 467


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 104/432 (24%), Positives = 170/432 (39%), Gaps = 67/432 (15%)

Query: 66  LSNDWKRQKTRVKL----QSNNNSSRNQL---------------LFPSEGSQTHFFGNQF 106
           LS D  R  +R ++    +S  NS R++L                 PS+   T   GN  
Sbjct: 80  LSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKGSKVTLPSKSGSTIGTGN-- 137

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
              +   + +GTP        D GS+L W      QC P +   Y    +    ++PS S
Sbjct: 138 ---YVVTVGLGTPKRDLTFIFDTGSDLTWT-----QCEPCARYCY---HQQEPIFNPSKS 186

Query: 167 SSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           +S  N+SCS P C        +  SC +    C Y   Y  + + S G+   D L L S 
Sbjct: 187 TSYTNISCSSPTCDELKSGTGNSPSCSA--STCVYGIQYG-DQSYSVGFFAQDKLALTS- 242

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFS 278
                 + V ++ + GCG+   G ++  A   G++GLG   +S+ S  A K G +   FS
Sbjct: 243 ------TDVFNNFLFGCGQNNRGLFVGVA---GLIGLGRNALSLVSQTAQKYGKL---FS 290

Query: 279 ICFDENDS--GSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQSG- 332
            C     S  G + FG  G  T ++  F P     +    YF+ + +  +G   L+ S  
Sbjct: 291 YCLPSTSSSTGYLTFGSGG-GTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSAS 349

Query: 333 ----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
                  ++DSG   + LP   Y+++   F + +S    +   +    CY+ S  + + V
Sbjct: 350 VFSTAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDV 409

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 448
           P + L FS      +      +  N           S   D  I+G        +V+D  
Sbjct: 410 PKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVA 469

Query: 449 NLKLAWSHSKCE 460
             ++ ++   CE
Sbjct: 470 GGRIGFAPGGCE 481


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 96/416 (23%), Positives = 170/416 (40%), Gaps = 62/416 (14%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           L+    KR + R++       S N +L  S G +T  +     +L    + IGTP  S  
Sbjct: 60  LIKRAIKRGERRMR-------SINAMLQSSSGIETPVYAGSGEYLMN--VAIGTPASSLS 110

Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
             +D GS+L+W  C+ C QC            +    ++P  SSS   + C    C+   
Sbjct: 111 AIMDTGSDLIWTQCEPCTQC----------FSQPTPIFNPQDSSSFSTLPCESQYCQDLP 160

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
           S +S  + C Y   Y  + +S+ GY+  +            ++S   ++  GCG    G 
Sbjct: 161 S-ESCYNDCQYTYGYG-DGSSTQGYMATETFTF--------ETSSVPNIAFGCGEDNQGF 210

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--------GSVFFGDQG 295
                A  G++G+G G +S+PS L         FS C   + S        GS   G   
Sbjct: 211 GQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYCMTSSGSSSPSTLALGSAASGV-- 261

Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSGFQ--------ALVDSGASFTF 345
           P    ST+ +        Y++ ++   +G  N  +  S FQ         ++DSG + T+
Sbjct: 262 PEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTY 321

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVR 404
           LP + Y  V   F   ++   +    +    C+   S+   ++VP++ + F      +  
Sbjct: 322 LPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNLGE 381

Query: 405 NHIFSFPENEGFTVFCLTV-MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            ++   P  EG  V CL +  S+     I G       ++++D +NL +++  ++C
Sbjct: 382 ENVLISPA-EG--VICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434


>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
          Length = 284

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 91/189 (48%), Gaps = 27/189 (14%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y+    WI  GTP   F + +D+GS + +VPC  C QC                ++ P  
Sbjct: 92  YYTTRLWI--GTPPQMFALIVDSGSTVTYVPCSDCEQCG----------KHQDPKFQPEM 139

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+ + V C+        +C   ++ C Y  +Y+ E +SS G L +D++   + S+  PQ
Sbjct: 140 SSTYQPVKCNM-----DCNCDDDREQCVYEREYA-EHSSSKGVLGEDLISFGNESQLTPQ 193

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V      GC   +TG      A DG++GLG GD+S+   L   GLI NSF +C+   D
Sbjct: 194 RAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD 247

Query: 286 --SGSVFFG 292
              GS+  G
Sbjct: 248 VGGGSMILG 256


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 105/405 (25%), Positives = 171/405 (42%), Gaps = 55/405 (13%)

Query: 83  NNSSRNQLL----FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
           +N  R + L    FP +G+ +         L+YT I +G P     V +D GS++LWV C
Sbjct: 58  HNDRRGRFLQGISFPLKGNYSDL------GLYYTEIGLGNPVQKLKVIVDTGSDILWVKC 111

Query: 139 Q-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLKDP 191
             C  C  LS      +   LS Y+ S+SS+S   SCS PLC       SRS   S    
Sbjct: 112 SPCRSC--LSKQ---DIIPPLSIYNLSASSTSSVSSCSDPLCTGEQAVCSRSGSNS---A 163

Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
           C Y   Y  + TS   Y+ DD+ ++         ++  S +  GC    TGS+      D
Sbjct: 164 CAYGISYQDKSTSIGAYVKDDMHYVLQ-----GGNATTSHIFFGCAINITGSW----PAD 214

Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGE 309
           G+MG G    +VP+ +A    +   FS C   +++  G + FG++   T+    F P+  
Sbjct: 215 GIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEEPNTTEM--VFTPLLN 272

Query: 310 KYDAYFVGVESYCIGNSCL------------TQSGFQALVDSGASFTFLPTEIYAEVVVK 357
               Y V + S  + +  L            + +    ++DSG SF  L T+    +  +
Sbjct: 273 VTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVIIDSGTSFALLATKANRILFSE 332

Query: 358 FDKLVSSK-RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENE 414
              L ++K    L+G    Y  +  + E    P++ L FS   +  ++  N++      +
Sbjct: 333 IKNLTTAKLGPKLEGLQCFYLKSGLTVET-SFPNVTLTFSGGSTMKLKPDNYLVMVELKK 391

Query: 415 GFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               +C    S DG   I G+  +    + +D EN ++ W    C
Sbjct: 392 KRNGYCYAWSSADG-LTIFGEIVLKDKLVFYDVENRRIGWKGQNC 435


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 98/388 (25%), Positives = 162/388 (41%), Gaps = 68/388 (17%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP   F + +D GS+L W+ C  C+ C           ++    +DP++SSS +N++C
Sbjct: 152 VGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNLTC 201

Query: 175 SHPLC--------KSRSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH--A 223
             P C         +  +C+   +DPCPY   Y  +  S+        L L SF+ +  A
Sbjct: 202 GDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGD------LALESFTVNLTA 255

Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
           P +S +   V+ GCG +  G +   A    ++GLG G +S  S L +A    ++FS C  
Sbjct: 256 PGASSRVDGVVFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL-RAVYGGHTFSYCLV 311

Query: 283 ENDS---GSVFFGDQGPATQQS------TSFLPIGEKYDA-YFVGVESYCIGNSCLTQS- 331
           ++ S     V FG+       +      T+F P     D  Y+V +    +G   L  S 
Sbjct: 312 DHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISS 371

Query: 332 ---------GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS 381
                        ++DSG + ++     Y  +   F D++  S            CYN S
Sbjct: 372 DTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVS 431

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMST-DGDYGIIG 434
             E  +VP++ L+F+          ++ FP    F       + CL V+ T      IIG
Sbjct: 432 GVERPEVPELSLLFADGA-------VWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIG 484

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKCEEV 462
                   + +D  N +L ++  +C EV
Sbjct: 485 NFQQQNFHVAYDLHNNRLGFAPRRCAEV 512


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 109/434 (25%), Positives = 167/434 (38%), Gaps = 67/434 (15%)

Query: 55  KKNSVEYLELLLSNDWK----RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
           K  S +++E+L  +  +      K   KL +++ S       P++   T   GN     +
Sbjct: 78  KATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGN-----Y 132

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
              + +GTP     +  D GS+L W  CQ C++         T  D+    ++PS S+S 
Sbjct: 133 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR---------TCYDQKEPIFNPSKSTSY 183

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
            NVSCS   C S SS       C      Y   Y  + + S G+L  +   L        
Sbjct: 184 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFLAKEKFTLT------- 235

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
            S V   V  GCG    G +   A   G++GLG   +S PS  A A      FS C   +
Sbjct: 236 NSDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSS 290

Query: 285 DS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVGVESYCIGNSCLTQSG 332
            S  G + FG  G    +S  F PI    D          A  VG +   I ++  +  G
Sbjct: 291 ASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPG 348

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             AL+DSG   T LP + YA +   F   +S    +   +    C++ S  + + +P + 
Sbjct: 349 --ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVA 406

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVF-----CLTVM--STDGDYGIIGQNFMMGHRIVF 445
             FS          +        F VF     CL     S D +  I G        +V+
Sbjct: 407 FSFSGGA-------VVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVY 459

Query: 446 DRENLKLAWSHSKC 459
           D    ++ ++ + C
Sbjct: 460 DGAGGRVGFAPNGC 473


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 100/385 (25%), Positives = 156/385 (40%), Gaps = 61/385 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +GTP V  L+ALD  S+L W+ CQ C +C P S             +DP  S+S   +
Sbjct: 145 IAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPV----------FDPRHSTSYGEM 194

Query: 173 SCSHPLCKS--RSSCKSLK-DPCPYIADYSTED-----TSSSGYLVDDILHLASFSKHAP 224
           +   P C++  RS     K   C Y   Y   D     ++S G LV++ L  A   +   
Sbjct: 195 NYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVR--- 251

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
               Q+ + IGCG    G  L GA   G++GL  G +S+P  +A  G    SFS C  + 
Sbjct: 252 ----QAYLSIGCGHDNKG--LFGAPAAGILGLSRGQISIPHQIAFLGY-NASFSYCLVDF 304

Query: 285 DSG------SVFFGDQGPATQQSTSFLP------IGEKYDAYFVGVESYCIGNSCLTQSG 332
            SG      ++ FG     T    SF P      +   Y    +GV    +    +T+  
Sbjct: 305 ISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERD 364

Query: 333 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY--CYNA 380
            Q          ++DSG + T L    Y      F    +   ++S  G S  +  CY  
Sbjct: 365 LQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTV 424

Query: 381 SSEEML----KVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 434
                L    KVP + + F+     S   +N++ +  ++ G   F     + D    +IG
Sbjct: 425 GGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITV-DSRGTVCFAF-AGTGDRSVSVIG 482

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKC 459
                G R+V+D    ++ ++ + C
Sbjct: 483 NILQQGFRVVYDIGGQRVGFAPNSC 507


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 95/400 (23%), Positives = 164/400 (41%), Gaps = 68/400 (17%)

Query: 107 YWLHYTWIDIGTP--NVSFLVALDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSE 160
           Y  H   +  GTP   +SFLV  D GS+++W PC     C  C     S+  +  + +  
Sbjct: 84  YGGHSIPLSFGTPPQKLSFLV--DTGSHVVWAPCTTHYTCTNC-----SFSDAEPKKVPI 136

Query: 161 YDPSSSSSSKNVSCSHPLCKSRSS---------CKSLKDPC-----PYIADYSTEDTSSS 206
           ++P  SSSSK + C +P C + SS         C      C     PY   Y T   SS 
Sbjct: 137 FNPKLSSSSKILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQYGT-GASSG 195

Query: 207 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 266
            +L++++        + P  ++    ++GC     G     A    + G G    S+P  
Sbjct: 196 DFLLENL--------NFPGKTIH-EFLVGCTTSAVGEVTSAA----LAGFGRSMFSLPMQ 242

Query: 267 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYC 322
           +          S  +D+  + S    D      +  S+ P  +        Y++GV+   
Sbjct: 243 MGVKKFAYCLNSHDYDDTRNSSKLILDYSDGETKGLSYAPFLKNPPDFPIYYYLGVKDIK 302

Query: 323 IGNSCL-TQSGFQA---------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 372
           IGN  L   S + A         ++DSG ++ ++   ++ +V  +  K +S  R SL+  
Sbjct: 303 IGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAE 362

Query: 373 S---WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMS-- 425
           +      CYN + ++ +K+PD+   F    + VV  +N+    PE      F LT  +  
Sbjct: 363 AEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEIS-LACFPLTTDAGT 421

Query: 426 -----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
                T G   I+G +  + + + FD +N +L +    C+
Sbjct: 422 NTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQTCQ 461


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 101/397 (25%), Positives = 163/397 (41%), Gaps = 59/397 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +++GTP     V +D GS+L WVPC  +    +  + Y + ++ +S Y PS SSSS    
Sbjct: 33  LNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRN-NKLMSTYSPSYSSSSLRDL 91

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLAS 218
           C  PLC    S  +  DPC  +A  S                  T  +G +V   L   +
Sbjct: 92  CVSPLCSDVHSSDNSYDPCA-VAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDT 150

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
            + H    S    V   C      +Y +   P G+ G G G +S+PS L   G +Q  FS
Sbjct: 151 LTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 204

Query: 279 ICF-------DENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
            CF       + N S  +  GD   ++    Q TS L      + Y++G+E+  +GN+  
Sbjct: 205 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 264

Query: 329 TQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GNSWK 375
            Q                ++DSG ++T LP   Y +++     +++  R   Q     + 
Sbjct: 265 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 324

Query: 376 YCY------NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVF-CLTVMST 426
            CY      N  ++    +P +   FS N S V+   NH ++       TV  CL + + 
Sbjct: 325 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNM 384

Query: 427 D----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           D    G  G+ G       ++V+D E  ++ +    C
Sbjct: 385 DDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 98/403 (24%), Positives = 161/403 (39%), Gaps = 46/403 (11%)

Query: 75  TRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
           TR+ L+  N S+        +G      G Q    +++ + IG+P     + LD GS++ 
Sbjct: 132 TRLDLRPANGSAVFAASAAIQGPVVSGVG-QGSGEYFSRVGIGSPARQLYMVLDTGSDVT 190

Query: 135 WVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDP 191
           WV CQ C  C       Y   D     +DPS S+S   VSC    C+    ++C++    
Sbjct: 191 WVQCQPCADC-------YQQSD---PVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 240

Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
           C Y   Y  + + + G    + L L         S+   +V IGCG    G ++  A   
Sbjct: 241 CLYEVAYG-DGSYTVGDFATETLTLG-------DSTPVGNVAIGCGHDNEGLFVGAAGLL 292

Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS---GSVFFGDQGPATQQSTSFLPIG 308
            + G  L   S PS ++      ++FS C  + DS    ++ FGD        T+ L   
Sbjct: 293 ALGGGPL---SFPSQISA-----STFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRS 344

Query: 309 EKYDA-YFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVV 356
            +    Y+V +    +G   L           T      +VDSG + T L +  YA +  
Sbjct: 345 PRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRD 404

Query: 357 KFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF 416
            F +   S   +   + +  CY+ S    ++VP + L F    +  +    +  P  +G 
Sbjct: 405 AFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPV-DGA 463

Query: 417 TVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             +CL    T+    IIG     G R+ FD     + ++ +KC
Sbjct: 464 GTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506


>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
 gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
          Length = 376

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 94/374 (25%), Positives = 159/374 (42%), Gaps = 50/374 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           ++IG P+  + + +D GS+L W+ C   C+QC      YY   + NL             
Sbjct: 24  LNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRN-NL------------- 69

Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 228
           V C  P+C+S  S    +   P   DY  E     SS G LV D  +L +F+     S +
Sbjct: 70  VPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNL-NFTSEKRHSPL 128

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
            +  + G  +   GS+      DGV+GLG G  S+ S L+  GL++N    C   +  G 
Sbjct: 129 LALGLCGYDQFPGGSH---HPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGF 185

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGASFTF 345
           +FFGD    + +  ++ P+      Y  G+            +GF+ L+   DSGAS+T+
Sbjct: 186 LFFGDDLYDSSR-VAWTPMSPDAKHYSPGLAELTFDGK---TTGFKNLLTTFDSGASYTY 241

Query: 346 LPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           L ++ Y  ++    K +S K  R +L   +   C+    +    + D++  F K  +   
Sbjct: 242 LNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG-RKPFKSIRDVKKYF-KTFALSF 299

Query: 404 RNHI-----FSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRIVFDRE 448
            N         FP  E + +       CL +++       D  +IG   M    +++D E
Sbjct: 300 TNERKSKTELEFPP-EAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVIYDNE 358

Query: 449 NLKLAWSHSKCEEV 462
             ++ W+   C  +
Sbjct: 359 KERIGWAPGNCNRL 372


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 91/366 (24%), Positives = 149/366 (40%), Gaps = 45/366 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++T + +G P  S+ + LD GS++ W+ CQ C  C   S   +T          P++SSS
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFT----------PAASSS 208

Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
              ++C    C S   SSC++ +  C Y  +Y  + + + G  V + +           S
Sbjct: 209 YSPLTCDSQQCNSLQMSSCRNGQ--CRYQVNYG-DGSFTFGDFVTETMSFGG-------S 258

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
              +S+ +GCG    G +        V   GL  +    L   + L   SFS C    DS
Sbjct: 259 GTVNSIALGCGHDNEGLF--------VGAAGLLGLGGGPLSLTSQLKATSFSYCLVNRDS 310

Query: 287 GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT--QSGFQ------- 334
            +    D   A    +   P+    K D  Y+VG+    +G   L   Q  F+       
Sbjct: 311 AASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDG 370

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             +VD G + T L +E Y  +   F  +    R +     +  CY+ S +  +KVP +  
Sbjct: 371 GVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSF 430

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            F   +S+ +    +  P +   T +C     T     IIG     G R+ FD  N ++ 
Sbjct: 431 HFDGGKSWDLPAANYLIPVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVG 489

Query: 454 WSHSKC 459
           +S +KC
Sbjct: 490 FSTNKC 495


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 154/368 (41%), Gaps = 50/368 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I IGTP V+ +   D GS+L W  C  C +C           +++   ++P  SSS + V
Sbjct: 94  IFIGTPPVNVIAIADTGSDLTWTQCLPCREC----------FNQSQPIFNPRRSSSYRKV 143

Query: 173 SCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQ 229
           SC+   C+S  S  C      C Y   YS  D S + G L  D + + SF    P++   
Sbjct: 144 SCASDTCRSLESYHCGPDLQSCSY--GYSYGDRSFTYGDLASDQITIGSFK--LPKT--- 196

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
              +IGCG +  G++  G     +   G     V  +   AG ++  FS C      + N
Sbjct: 197 ---VIGCGHQNGGTF-GGVTSGIIGLGGGSLSLVSQMRTIAG-VKPRFSYCLPTFFSNAN 251

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGN---------SCLTQSGF 333
            +G++ FG +   + +     P+  +     YF+ +E+  +G          S +T  G 
Sbjct: 252 ITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHG- 310

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG + T LP  +Y  V     +++ +KR+       + CY+A   + L +P +  
Sbjct: 311 NIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITA 370

Query: 394 IFS--KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
            F+   +   +  N      +N    V CLT  +      I G    +   + +D  N +
Sbjct: 371 HFAGGADVKLLPVNTFAPVADN----VTCLT-FAPATQVAIFGNLAQINFEVGYDLGNKR 425

Query: 452 LAWSHSKC 459
           L++    C
Sbjct: 426 LSFEPKLC 433


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 107/423 (25%), Positives = 170/423 (40%), Gaps = 52/423 (12%)

Query: 56  KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQL-----LFPSEGSQTHFFGNQFYWLH 110
           K    + E L  +  +    + K+ S  N+   +L       P+  S  +  G   Y + 
Sbjct: 75  KEKPSHEETLRRDQLRAAYIQAKVSSRYNNVAKELQQSAVTIPT--SSGYSLGTTEYVIT 132

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
            T   IGTP V+ ++++D GS++ WV     QCAP +A   +S    L  +DP+ S++  
Sbjct: 133 VT---IGTPAVTQVMSIDTGSDVSWV-----QCAPCAAQSCSSQKDKL--FDPAMSATYS 182

Query: 171 NVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
             SC    C       + C  LK  C YI  Y  + ++++G    D L L S       S
Sbjct: 183 AFSCGSAQCAQLGDEGNGC--LKSQCQYIVKYG-DGSNTAGTYGSDTLSLTS-------S 232

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICF---D 282
               S   GC  +  G   +    DG+MGLG GD    SL+++ A     +FS C     
Sbjct: 233 DAVKSFQFGCSHRAAGFVGE---LDGLMGLG-GDTE--SLVSQTAATYGKAFSYCLPPPS 286

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV--ESYCIGNSCLT--QSGFQ--AL 336
            +  G +  G  G A+    S  P+       F GV  +   +  + L    S F   ++
Sbjct: 287 SSGGGFLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGASV 346

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           VDSG   T LP   Y  +   F K + +   +    S   C++ S    + VP + L FS
Sbjct: 347 VDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFS 406

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 456
           +  +  +      +     F     T  + DGD GI+G        ++FD     + +  
Sbjct: 407 RGAAMDLDISGILYAGCLAF-----TATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRS 461

Query: 457 SKC 459
             C
Sbjct: 462 GAC 464


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 97/373 (26%), Positives = 154/373 (41%), Gaps = 53/373 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP     + LD GS+++W     IQCAP    Y  S       +DP  S S 
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVW-----IQCAPCKRCYAQS----DPVFDPRKSRSF 176

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP--Q 225
            +++C  PLC    S  C + K  C Y   Y            D       FS      +
Sbjct: 177 ASIACRSPLCHRLDSPGCNTQKQTCMYQVSYG-----------DGSFTFGDFSTETLTFR 225

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +  + V +GCG    G ++  A    ++GLG G +S PS   +     + FS C  +  
Sbjct: 226 RTRVARVALGCGHDNEGLFVGAAG---LLGLGRGRLSFPSQTGRR--FNHKFSYCLVDRS 280

Query: 286 S----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ- 334
           +     S+ FGD   A  ++  F P+    K D  Y+V +    +G +    +T S F+ 
Sbjct: 281 ASSKPSSMVFGDS--AVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKL 338

Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
                   ++DSG S T L    Y      F    S+ + + Q + +  C++ S +  +K
Sbjct: 339 DQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVK 398

Query: 388 VPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           VP + L F   + S    N++     +     FCL    T G   IIG     G R+V+D
Sbjct: 399 VPTVVLHFRGADVSLPASNYLIPVDTSGN---FCLAFAGTMGGLSIIGNIQQQGFRVVYD 455

Query: 447 RENLKLAWSHSKC 459
               ++ ++   C
Sbjct: 456 LAGSRVGFAPHGC 468


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 95/371 (25%), Positives = 154/371 (41%), Gaps = 54/371 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
           + T + +GTP  S+ + +D GS+L W     +QC+P       S  R +   YDP +SS+
Sbjct: 134 YVTELGLGTPATSYAMVVDTGSSLTW-----LQCSPC----VVSCHRQVGPLYDPRASST 184

Query: 169 SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
              V CS   C        + S+C S+++ C Y A Y  + + S GYL  D +   S S 
Sbjct: 185 YATVPCSASQCDELQAATLNPSAC-SVRNVCIYQASYG-DSSFSVGYLSRDTVSFGSGSY 242

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
                    +   GCG+   G +   A   G++GL    +S+   LA +  +  SFS C 
Sbjct: 243 --------PNFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSYCL 289

Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIG-EKYDA--YFVGVESYCIGNSCLT-----QSGF 333
                 S  +   GP T    S+ P+     DA  YFV +    +G S L       S  
Sbjct: 290 PT--PASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSL 347

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG----NSWKYCYNASSEEMLKVP 389
             ++DSG   T LPT +Y        K V++  + +Q     +    C+   + + L+VP
Sbjct: 348 PTIIDSGTVITRLPTAVY----TALSKAVAAAMVGVQSAPAFSILDTCFQGQASQ-LRVP 402

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
            + + F+   +  +         ++  T  CL    TD    IIG        +V+D   
Sbjct: 403 AVAMAFAGGATLKLATQNVLIDVDDSTT--CLAFAPTDSTT-IIGNTQQQTFSVVYDVAQ 459

Query: 450 LKLAWSHSKCE 460
            ++ ++   C 
Sbjct: 460 SRIGFAAGGCS 470


>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
 gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
          Length = 379

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 156/373 (41%), Gaps = 48/373 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQC--IQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           ++IG P+  + + +D GS+L W+ C     QC      YY                S+  
Sbjct: 24  LNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPHPYY--------------KPSNNL 69

Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 228
           V+C  P+C+S  +    +   P   DY  E     SS G LV D  +L +F+    QS +
Sbjct: 70  VACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNL-NFTSEKRQSPL 128

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
            +  + G  +   G+Y      DGV+GLG G  S+ S L+  GL++N    C      G 
Sbjct: 129 LALGLCGYDQLPGGTY---HPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCLSGRGGGF 185

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGASFTF 345
           +FFGD    + +  ++ P+      Y  G             +GF+ L+   DSGAS+T+
Sbjct: 186 LFFGDDLYDSSR-VAWTPMSPNAKHYSPGFAELTFDGK---TTGFKNLIVAFDSGASYTY 241

Query: 346 LPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           L +++Y  ++    + +S+K  R +L   +   C+    +    V D++  F K  +   
Sbjct: 242 LNSQVYQGLISLIKRELSTKPLREALDDQTLPICWKG-RKPFKSVRDVKKYF-KTFALSF 299

Query: 404 RNH-----IFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIVFDREN 449
            N         FP      V      CL V++       D  +IG   M    +++D E 
Sbjct: 300 ANDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDISMQDRVVIYDNEK 359

Query: 450 LKLAWSHSKCEEV 462
             + W+   C+ +
Sbjct: 360 QLIGWAPRNCDRI 372


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 117/453 (25%), Positives = 187/453 (41%), Gaps = 91/453 (20%)

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSS-RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP- 119
           + LLLS    R +     QS +N+S +N  LFP             Y  +   +  GTP 
Sbjct: 94  INLLLSASLNRAQHLKTPQSKSNTSIQNVSLFPRS-----------YGAYSVSLAFGTPP 142

Query: 120 -NVSFLVALDAGSNLLWVPCQC-IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHP 177
            N+SF+   D GS+L+W PC    +C+  S  Y       +S++ P  SSS K V C +P
Sbjct: 143 QNLSFI--FDTGSSLVWFPCTAGYRCSRCSFPYVDP--ATISKFVPKLSSSVKVVGCRNP 198

Query: 178 LC--------KSR-----SSCKSLKDPCP-YIADYSTEDTSSSGYLVDDILHLASFSKHA 223
            C        KSR     S  +   D CP Y   Y +  T  +G L+ + L L   +K  
Sbjct: 199 KCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGAT--AGILLSETLDLE--NKRV 254

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
           P        ++GC      S +    P G+ G G G  S+PS +          S  FD+
Sbjct: 255 PD------FLVGC------SVMSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSRGFDD 302

Query: 284 NDSGSVFFGDQGPATQQSTS----FLPIGEK--------YDAYFVGVESYCIGNSCL--- 328
           +   S    D G  + +S +    + P  E          + Y++ +    IG   +   
Sbjct: 303 SPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFP 362

Query: 329 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRISLQGNSWKY 376
                   T +G  A++DSG++FTFL   I+  +  + +K +     +K +  Q +  + 
Sbjct: 363 YKYLVPDSTGNG-GAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQ-SGLRP 420

Query: 377 CYN-ASSEEMLKVPDMRLIFS--KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--- 430
           C+N    EE  + PD+ L F      S    N++ +   +EG  V CLT+M+ +      
Sbjct: 421 CFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYL-AMVTDEG--VVCLTMMTDEAVVGGG 477

Query: 431 ---GIIGQNFMMGHRIV-FDRENLKLAWSHSKC 459
               II   F   + +V +D    ++ +   KC
Sbjct: 478 GGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 92/376 (24%), Positives = 157/376 (41%), Gaps = 56/376 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP +++   +D GS+L+W  C+ C++C           +++   +DPSSSS+   +
Sbjct: 122 MSIGTPALAYAAIVDTGSDLVWTQCKPCVEC----------FNQSTPVFDPSSSSTYSTL 171

Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            CS  LC     S+C S    C Y   Y  + +S+ G L  +   LA         +   
Sbjct: 172 PCSSSLCSDLPTSTCTSAAKDCGYTYTYG-DASSTQGVLAAETFTLA--------KTKLP 222

Query: 231 SVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDS 286
            V  GCG    G  +  GA   G++GLG G +   SL+++ GL    FS C    D+   
Sbjct: 223 GVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--GKFSYCLTSLDDTSK 274

Query: 287 GSVFFGD--------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ-- 334
             +  G            A  Q+T  +    +   Y+V +++  +G++   L  S F   
Sbjct: 275 SPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQ 334

Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN--ASSEEML 386
                  +VDSG S T+L  + Y  +   F   +              C+   AS  + +
Sbjct: 335 DDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDV 394

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           +VP + L F       +    +   ++      CLTVM + G   IIG       + V+D
Sbjct: 395 EVPKLVLHFDGGADLDLPAENYMVLDSAS-GALCLTVMGSRG-LSIIGNFQQQNIQFVYD 452

Query: 447 RENLKLAWSHSKCEEV 462
            +   L+++  +C ++
Sbjct: 453 VDKDTLSFAPVQCAKL 468


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 152/362 (41%), Gaps = 41/362 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +  GTP  +  + LD GS+L W     IQC P S   Y   D    ++DP+ SSS   V 
Sbjct: 141 VGFGTPAQTAAIILDTGSDLSW-----IQCKPCSGHCYRQHD---PDFDPAKSSSYAAVP 192

Query: 174 CSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           C  P+C +    C      C Y   Y  + +S++G L  D L   S SK        +  
Sbjct: 193 CGTPVCAAAGGMCNGTT--CLYGVQYG-DGSSTTGVLSRDTLTFNSSSKF-------TGF 242

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVF 290
             GCG K  G   D    DG++GLG G +S+PS  A +      FS C    ++  G + 
Sbjct: 243 TFGCGEKNIG---DFGEVDGLLGLGRGKLSLPSQAAPS--FGGVFSYCLPSYNTTPGYLN 297

Query: 291 FGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSG 340
            G   P +    Q T+ +   +    YF+ + S  IG        S  T++G   L+DSG
Sbjct: 298 IGATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKTG--TLLDSG 355

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
              T+LP   Y  +  +F   +   + +        CY+ + +  + +P +   FS    
Sbjct: 356 TILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAV 415

Query: 401 FVVRNH-IFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
           F +  + I  FP++    + CL  +S      + I+G        +++D  + K+ +   
Sbjct: 416 FDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPI 475

Query: 458 KC 459
            C
Sbjct: 476 SC 477


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 96/401 (23%), Positives = 159/401 (39%), Gaps = 46/401 (11%)

Query: 76  RVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLW 135
           RV    + N     LL P +G     +  +FY        IG+P V  L  +D GS+L+W
Sbjct: 67  RVSHFLDENKLPESLLIPDKGE----YLMRFY--------IGSPPVERLAMVDTGSSLIW 114

Query: 136 VPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKD 190
           + C  C  C P          +    ++P  SS+ K  +C    C     S+  C  L  
Sbjct: 115 LQCSPCHNCFP----------QETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQ 164

Query: 191 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
            C Y   Y  + + S G L  + L     S    Q+    + I GCG     +       
Sbjct: 165 -CIYGIMYG-DKSFSVGILGTETLSFG--STGGAQTVSFPNTIFGCGVDNNFTIYTSNKV 220

Query: 251 DGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQQSTSFLPI 307
            G+ GLG G +S+ S L     I + FS C   +D   +  + FG +   T       P+
Sbjct: 221 MGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPL 278

Query: 308 GEKYDA---YFVGVESYCIGNSCLT--QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
             K      YF+ +E+  IG   ++  Q+    ++DSG   T+L    Y   V    + +
Sbjct: 279 IIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETL 338

Query: 363 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT 422
             K +    +  K C+   +   L +PD+   F+   S  +R      P  +   + CL 
Sbjct: 339 GVKLLQDLPSPLKTCF--PNRANLAIPDIAFQFT-GASVALRPKNVLIPLTDS-NILCLA 394

Query: 423 VMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           V+ + G    + G       ++ +D E  K++++ + C +V
Sbjct: 395 VVPSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDCAKV 435


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 87/360 (24%), Positives = 144/360 (40%), Gaps = 39/360 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   F +  D GS + W      QC P   S Y   ++   ++DP+ S+S  NVS
Sbjct: 139 VGLGTPKEDFTLVFDTGSGITWT-----QCQPCLGSCYPQKEQ---KFDPTKSTSYNNVS 190

Query: 174 CSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           CS   C     S   C +    C Y   Y  + + S G+   + L ++S       S V 
Sbjct: 191 CSSASCNLLPTSERGCSASNSTCLYQIIYG-DQSYSQGFFATETLTISS-------SDVF 242

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
           ++ + GCG+   G +   A   G+        SV      A   Q  FS C     S + 
Sbjct: 243 TNFLFGCGQSNNGLFGQAAGLLGLS-----SSSVSLPSQTAEKYQKQFSYCLPSTPSSTG 297

Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYF----VGV----ESYCIGNSCLTQSGFQALVDSGA 341
           +  + G    Q+  F PI   + +++    VG+        I  S  T SG  A++DSG 
Sbjct: 298 YL-NFGGKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSG--AIIDSGT 354

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
             T LP   Y  +   FD+ +S+   +        CY+ S+   +  P + + F      
Sbjct: 355 VITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEV 414

Query: 402 VVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            +      +  N G  + CL   +   D ++GI G +    + +V+D     + ++   C
Sbjct: 415 DIDASGILYLVN-GVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 148/368 (40%), Gaps = 45/368 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   F V +D GS+L WV     QC+P    Y     +N S + P++S+S   ++
Sbjct: 7   VRLGTPERVFSVIVDTGSDLTWV-----QCSPCGTCY----SQNDSLFIPNTSTSFTKLA 57

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C   LC         +  C Y   Y  + + S+G  V D + +   +    Q     +  
Sbjct: 58  CGTELCNGLPYPMCNQTTCVYWYSYG-DGSLSTGDFVYDTITMDGINGQKQQV---PNFA 113

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGS 288
            GCG    GS+   A  DG++GLG G +S PS L    +    FS C  +       +  
Sbjct: 114 FGCGHDNEGSF---AGADGILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTSP 168

Query: 289 VFFGDQGPATQQSTSFL-----PIGEKYDAYFVGVESYCIGNSCL--TQSGFQ------- 334
           + FGD    T     ++     P    Y  Y+V +    +G   L  + + F        
Sbjct: 169 LLFGDAAVPTFPGVKYISLLTNPKVPTY--YYVKLNGISVGGKLLNISSTAFDIDSVGRA 226

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSEEML-KVPDM 391
             + DSG + T L  E++ EV+   +   +   R S   +    C    +E  L  VP M
Sbjct: 227 GTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSM 286

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
              F      +  ++ F F E+     +C +++S+  D  IIG       ++ +D    K
Sbjct: 287 TFHFEGGDMELPPSNYFIFLESS--QSYCFSMVSSP-DVTIIGSIQQQNFQVYYDTVGRK 343

Query: 452 LAWSHSKC 459
           + +    C
Sbjct: 344 IGFVPKSC 351


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 100/405 (24%), Positives = 163/405 (40%), Gaps = 50/405 (12%)

Query: 81  SNNNSSRNQLLFPSEGSQTHF-----FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLW 135
           S +NS   Q   PS+  Q         G+  Y++    + +GTP     + +D GS++LW
Sbjct: 6   STSNSHDRQTKVPSQDFQAPVISGLSLGSGEYFIR---VSVGTPPRGMYLVMDTGSDILW 62

Query: 136 VPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYI 195
                +QCAP  + Y+   +     +DP  SS+   + C+   C +      + + C Y 
Sbjct: 63  -----LQCAPCVSCYHQCDE----VFDPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQ 113

Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
            DY  + + S+G    D + L S S       V + + +GCG    G ++  A   G+  
Sbjct: 114 VDYG-DGSFSTGEFATDAVSLNSTSGGG--QVVLNKIPLGCGHDNEGYFVGAAGLLGLGK 170

Query: 256 LGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQG--PATQQSTSFLPIG 308
                +S P+ +         FS C      D  +  S+ FGD    PA      F P  
Sbjct: 171 G---PLSFPNQINSEN--GGRFSYCLTGRDTDSTERSSLIFGDAAVPPA---GVRFTPQA 222

Query: 309 EKYDA---YFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVV 355
                   Y++ +    +G S LT   S FQ         ++DSG S T L    YA + 
Sbjct: 223 SNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLR 282

Query: 356 VKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG 415
             F    S   ++ + + +  CYN S    + VP + L F       +    +  P +  
Sbjct: 283 EAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNS 342

Query: 416 FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
            T FCL    T G   IIG     G R+++D  + ++ +  S+C+
Sbjct: 343 ST-FCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQVGFVPSQCD 385


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 92/380 (24%), Positives = 157/380 (41%), Gaps = 44/380 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD-RNLSEYDPSSSSS 168
           ++    +GTP+  F++  D GS+L W+ C+   C   + S   +   R+   +  + SSS
Sbjct: 83  YFVAFKVGTPSQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRHKRVFHANLSSS 141

Query: 169 SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFS 220
            K + C   +CK       S ++C +   PC Y  DY   D S++ G+  ++ + +    
Sbjct: 142 FKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFFANETVTVE--L 197

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
           K   +  +  +V+IGC     G     A  DGVMGLG    S    +  A      FS C
Sbjct: 198 KEGRKMKLH-NVLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--IKAAEKFGGKFSYC 252

Query: 281 F-----DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 328
                  +N S  + FG     +      + + L +G     Y V +    IG + L   
Sbjct: 253 LVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIP 312

Query: 329 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISLQGNSWKYCYNASS 382
                 +     ++DSG+S TFL    Y  V+      L+  +++ +     +YC+N++ 
Sbjct: 313 SEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372

Query: 383 EEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD-GDYGIIGQNFMM 439
            E   VP +   F+    F   V++++ S  +     V CL  +S       ++G     
Sbjct: 373 FEESLVPRLVFHFADGAEFEPPVKSYVISAADG----VRCLGFVSVAWPGTSVVGNIMQQ 428

Query: 440 GHRIVFDRENLKLAWSHSKC 459
            H   FD    KL ++ S C
Sbjct: 429 NHLWEFDLGLKKLGFAPSSC 448


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 103/437 (23%), Positives = 169/437 (38%), Gaps = 51/437 (11%)

Query: 54  PKKNSVEYLELLLSNDWKR----QKTRVKLQSNNNSSRNQLLFPSEGSQTHFF-GNQFYW 108
           P  +  E  + LLS D  R    Q      +    SS  ++   +  +Q     G +   
Sbjct: 81  PANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVTASKAQVPVSSGARLRT 140

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           L+Y    +G       V +D  S L WV     QCAP  + +    D+    +DPSSS S
Sbjct: 141 LNYVAT-VGLGGGEATVIVDTASELTWV-----QCAPCESCH----DQQGPLFDPSSSPS 190

Query: 169 SKNVSCSHPLCKS-----RSSCKSLKDPC----PYIADYS---TEDTSSSGYLVDDILHL 216
              V C  P C +      +   +   PC    P    Y+    + + S G L  D L L
Sbjct: 191 YAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL 250

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK--AGLIQ 274
           A          V    + GCG    G    G +  G+MGLG   +S+ S       G+  
Sbjct: 251 A--------GEVIDGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTVDQFGGVFS 300

Query: 275 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--------YFVGVESYCIGNS 326
               +  + + SGS+  GD   A + ST  +      ++        Y V +    +G  
Sbjct: 301 YCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQ 360

Query: 327 CLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
            +  +GF  +A+VDSG   T L   +Y  V  +F   ++    +   +    C+N +  +
Sbjct: 361 EVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMTGLK 420

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS--TDGDYGIIGQNFMMGHR 442
            ++VP + L+F       V +    +  +   +  CL V S  ++ +  IIG       R
Sbjct: 421 EVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLR 480

Query: 443 IVFDRENLKLAWSHSKC 459
           +VFD    ++ ++   C
Sbjct: 481 VVFDTSASQVGFAQETC 497


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score = 79.0 bits (193), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 110/427 (25%), Positives = 173/427 (40%), Gaps = 62/427 (14%)

Query: 60  EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWL------HYTW 113
           E   L L  D  R K   KL S   +SRN       G  T F  +    L      ++T 
Sbjct: 79  ELFHLRLQRDAIRVK---KLSSLGATSRN---LSKPGGTTGFSSSVISGLAQGSGEYFTR 132

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP     + LD GS+++W     +QCAP    Y     +    ++P  S S   V 
Sbjct: 133 IGVGTPPKYVYMVLDTGSDIVW-----LQCAPCKNCY----SQTDPVFNPVKSGSFAKVL 183

Query: 174 CSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C  PLC+   S  C   +  C Y   Y  + + ++G  V + L          + +    
Sbjct: 184 CRTPLCRRLESPGCNQ-RQTCLYQVSYG-DGSYTTGEFVTETLTF--------RRTKVEQ 233

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----G 287
           V +GCG    G ++  A    ++GLG G +S PS   +       FS C  +  +     
Sbjct: 234 VALGCGHDNEGLFVGAAG---LLGLGRGGLSFPSQAGRT--FNQKFSYCLVDRSASSKPS 288

Query: 288 SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCLTQSGFQ------- 334
           SV FG+   A  ++  F P+    + D  Y+V +    +G    S +T S F+       
Sbjct: 289 SVVFGNS--AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNG 346

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++D G S T L    Y  +   F    SS + + + + +  CY+ S +  +KVP + L
Sbjct: 347 GVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVL 406

Query: 394 IF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
            F   + S    N++      +G   FC     T     IIG     G R+V+D  + ++
Sbjct: 407 HFRGADVSLPASNYLIPV---DGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRV 463

Query: 453 AWSHSKC 459
            +S   C
Sbjct: 464 GFSPRGC 470


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 83/354 (23%), Positives = 136/354 (38%), Gaps = 31/354 (8%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP    LV  D GS+L WV     QC P +  Y     ++   +DPS S++   V 
Sbjct: 192 VGLGTPRRDLLVVFDTGSDLSWV-----QCKPCNNCY----KQHDPLFDPSQSTTYSAVP 242

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C    C    +C S K  C Y   Y  + + + G L  D L L       P S      +
Sbjct: 243 CGAQECLDSGTCSSGK--CRYEVVYG-DMSQTDGNLARDTLTL------GPSSDQLQGFV 293

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN--DSGSVFF 291
            GCG   TG +      DG+ GLG   VS+ S    A      FS C   +    G +  
Sbjct: 294 FGCGDDDTGLF---GRADGLFGLGRDRVSLAS--QAAARYGAGFSYCLPSSWRAEGYLSL 348

Query: 292 GD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--LTQSGFQA---LVDSGASFTF 345
           G    P   Q T+ +   +    Y++ +    +      +  + F+A   ++DSG   T 
Sbjct: 349 GSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITR 408

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRN 405
           LP+  Y+ +   F   +   + +   +    CY+ +    +++P + L+F    +  +  
Sbjct: 409 LPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGF 468

Query: 406 HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               +  N             D   GI+G        +V+D  N K+ +    C
Sbjct: 469 GGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGC 522


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 97/381 (25%), Positives = 158/381 (41%), Gaps = 64/381 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + +GTP       LD GS+L+W  C  C  C P     ++          P +SSS + +
Sbjct: 108 LAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFS----------PGASSSYEPM 157

Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            C+  LC      SC+   D C Y   Y  + T++ G    +    +S S     + + +
Sbjct: 158 RCAGELCNDILHHSCQR-PDTCTYRYSYG-DGTTTRGVYATERFTFSSSSSGGETTKLSA 215

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--- 287
            +  GCG    GS  +G+   G++G G   +S+ S LA        FS C     SG   
Sbjct: 216 PLGFGCGTMNKGSLNNGS---GIVGFGRAPLSLVSQLAI-----RRFSYCLTPYASGRKS 267

Query: 288 SVFFG-------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ--SGFQ---- 334
           ++ FG       D   AT Q+T  L   +    Y+V      +G   L    S F     
Sbjct: 268 TLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPD 327

Query: 335 ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK-----YCYNASSEEM 385
               A+VDSG + T  P  + AEVV  F    S  R+    N         C+ A++  +
Sbjct: 328 GSGGAIVDSGTALTLFPAPVLAEVVRAFR---SQLRLPFAANGSSGPDDGVCFAAAASRV 384

Query: 386 LK---VPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM- 438
            +   VP  R++F     +     RN++    +++     CL +++  GD G    NF+ 
Sbjct: 385 PRPAVVP--RMVFHLQGADLDLPRRNYVL---DDQRKGNLCL-LLADSGDSGTTIGNFVQ 438

Query: 439 MGHRIVFDRENLKLAWSHSKC 459
              R+++D E   L+++ ++C
Sbjct: 439 QDMRVLYDLEADTLSFAPAQC 459


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 97/409 (23%), Positives = 161/409 (39%), Gaps = 73/409 (17%)

Query: 98  QTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC----QCIQCA-------PL 146
           QT  F +  Y  H   +  GTP       +D GS+++W PC     C  C+       P+
Sbjct: 76  QTSLFPHS-YGAHTIPLSFGTPPQKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPI 134

Query: 147 SASYYTSLDRNLSEYDPS-SSSSSKNVSCSHPLCKSRSSCKSLKDPCP-YIADYSTEDTS 204
                +S D+ L   DP  + +SS BV    P C   S  K     CP Y   Y T   +
Sbjct: 135 FNPELSSSDKILGCRDPKCADTSSPBVHLGXPRCNGNS--KKCSHACPQYTLQYGTG--A 190

Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
           +SG+ + + L     + H          ++GC    T S     + D + G G    S+P
Sbjct: 191 ASGFFLLENLDFPGKTIH--------KFLVGC----TTSADREPSSDALAGFGRTMFSLP 238

Query: 265 SLLAKAGLIQNSFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYD----AY 314
             +         F+ C + +D      SG +   D      Q  S+ P  +        Y
Sbjct: 239 MQMG-----VKKFAYCLNSHDYDDTRNSGKLIL-DYSDGETQGLSYAPFXKNPPDYPIYY 292

Query: 315 FVGVESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS 364
           ++GV+   IGN  L   G             ++DSG +++++   ++  V  +  K +S 
Sbjct: 293 YLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQMSK 352

Query: 365 KRISLQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVF 419
            R SL+  +      CYN +  + +K+PD+   F+   + VV   N+   F E    ++ 
Sbjct: 353 YRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEA---SLG 409

Query: 420 CLTVMS---------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           C  V +         T G   I+G    + H + FD +N +L +    C
Sbjct: 410 CFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score = 78.6 bits (192), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 102/416 (24%), Positives = 158/416 (37%), Gaps = 41/416 (9%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQ-LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           LL  D  R  + + + +N  S+    +  P+E   +   GN     +   + +GTP    
Sbjct: 113 LLDQDQARVDSILGMITNETSAVGPGVSLPAERGISVGTGN-----YVVSVGLGTPARDL 167

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
            V  D GS+L WV  QC  C+  S   Y   D     + PS SS+   V C    C++R 
Sbjct: 168 TVVFDTGSDLSWV--QCGPCS--SGGCYKQQD---PLFAPSDSSTFSAVRCGARECRARQ 220

Query: 184 SCKSL--KDPCPYIADYSTEDTSSSGYLVDDILHLASFS---KHAPQSSVQSSVIIGCGR 238
           SC      D CPY   Y  + + + G+L +D L L + +     A   +     + GCG 
Sbjct: 221 SCGGSPGDDRCPYEVVYG-DKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFVFGCGE 279

Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG--- 295
             TG  L G A DG+ GLG G VS+ S    AG     FS C   + S +  +   G   
Sbjct: 280 NNTG--LFGQA-DGLFGLGRGKVSLSS--QAAGKFGEGFSYCLPSSSSSAPGYLSLGTPV 334

Query: 296 --PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS----GFQALVDSGASFTFLPTE 349
             PA  Q T  L        Y+V +    +    +  S        +VDSG   T L   
Sbjct: 335 PAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTVITRLAPR 394

Query: 350 IYAEVVVKFDKLVS------SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
            Y  +   F   +       + R+S+      Y + A +   + +P + L+F+   +  V
Sbjct: 395 AYRALRAAFLSAMGKYGYKRAPRLSILDTC--YDFTAHANATVSIPAVALVFAGGATISV 452

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                 +                    GI+G        +V+D    K+ ++   C
Sbjct: 453 DFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFAAKGC 508


>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 413

 Score = 78.6 bits (192), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 99/391 (25%), Positives = 163/391 (41%), Gaps = 62/391 (15%)

Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPC--QCIQCAPLSASYYTSLDRNLS 159
           GN +   H+T  ++IG P+  F + +D GS+L WV C  +CI C         +L R++ 
Sbjct: 45  GNVYPLGHFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGC---------TLPRDML 95

Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSSC-----KSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
            Y P +++    VS   PLC + SS      K+  D C Y  +Y+ +  SS G LV D++
Sbjct: 96  -YRPHNNA----VSREDPLCAALSSLGKFIFKNPNDQCAYEVEYA-DHGSSVGVLVKDLV 149

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQ-TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
            +    +      +  ++  GCG  Q  G      +  GV+GL     ++ S L+  G +
Sbjct: 150 PM----RLTNGKRISPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHV 205

Query: 274 QNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS 331
            N    C          F GD  P++    S+ PI    +  Y  G          +   
Sbjct: 206 SNVVGHCLTGRGGGFLFFGGDVVPSS--GMSWTPILRNSEGKYSSGPAEVYFNGRAVGIG 263

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML----- 386
           G     DSG+S+T+  +++Y  +    +KL+ +    L+GN  K   +  + E+      
Sbjct: 264 GLTLTFDSGSSYTYFNSQVYRAI----EKLLKN---DLKGNPLKLASDDKTLELCWKGPK 316

Query: 387 ---KVPDMRLIF---------SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDY 430
               V D+R  F         SKN  F +    +       F   CL ++       G+ 
Sbjct: 317 PFESVVDVRNFFKPLAMSFKNSKNVQFQIPPEAYLIISE--FGNVCLGILDGSKEGMGNV 374

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
            IIG   M+   +V+D E  ++ W+ S C  
Sbjct: 375 NIIGDISMLNKIVVYDNERERIGWASSNCNR 405


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 146/367 (39%), Gaps = 34/367 (9%)

Query: 101 FFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
           F G+  Y +    +  GTP  +  V  D GS++ W     +QC P +   Y   +     
Sbjct: 10  FIGSGNYVIT---VGFGTPTRTQTVVFDTGSDVNW-----LQCKPCAVRCYAQQE---PL 58

Query: 161 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           +DPS SS+ +NVSC+ P C   S+       C Y   Y  + +S+ G+L  D   L    
Sbjct: 59  FDPSLSSTYRNVSCTEPACVGLSTRGCSSSTCLYGVFYG-DGSSTIGFLAMDTFMLTPAQ 117

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGDVSVPSLLAK-AGLIQNSFS 278
           K         + I GCG+  TG +       G  GL GLG  S  SL ++ A  + N FS
Sbjct: 118 KF-------KNFIFGCGQNNTGLF------QGTAGLVGLGRSSTYSLNSQVAPSLGNVFS 164

Query: 279 ICFDENDSGSVFFGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA 335
            C     S + +     P  T   T+ L        YF+ +    +G + L+ S   FQ+
Sbjct: 165 YCLPSTSSATGYLNIGNPQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQS 224

Query: 336 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG   T LP   Y+ +       ++   ++        CY+ S    +  P + 
Sbjct: 225 VGTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIV 284

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
           L F+     +    +F F  N           +     GIIG    +   + +D E  ++
Sbjct: 285 LHFAGLDVRIPATGVF-FVFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRI 343

Query: 453 AWSHSKC 459
            +S   C
Sbjct: 344 GFSAGAC 350


>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
 gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
 gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
          Length = 410

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 86/404 (21%), Positives = 162/404 (40%), Gaps = 58/404 (14%)

Query: 93  PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
           PS        GN +   H+   ++IG P  S+ + +D GS L W+ C   C  C  +   
Sbjct: 20  PSSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHV 79

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTED 202
            Y    + L             V+C+  LC    +       C S K  C Y+  Y   D
Sbjct: 80  LYKPTPKKL-------------VTCADSLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--D 123

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDV 261
           +SS G LV D      FS  A   +  +++  GCG  Q     +   P D ++GL  G V
Sbjct: 124 SSSMGVLVID-----RFSLSASNGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKV 178

Query: 262 SVPSLLAKAGLI-QNSFSICFDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVE 319
           ++ S L   G+I ++    C      G +FFGD Q P +  + + +    KY +   G  
Sbjct: 179 TLLSQLKSQGVITKHVLGHCISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTL 238

Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSW 374
            +   +  ++ +    + DSGA++T+   + Y   +      ++S+      ++ +  + 
Sbjct: 239 HFDSNSKAISAAPMAVIFDSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRAL 298

Query: 375 KYCYNASSEEMLKVPDMRLIF----------SKNQSFVVRNHIFSFPENEGFTVFCLTVM 424
             C+    ++++ + +++  F           K  +  +    +     EG    CL ++
Sbjct: 299 TVCWKG-KDKIVTIDEVKKCFRSLSLEFADGDKKATLEIPPEHYLIISQEGHV--CLGIL 355

Query: 425 STDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
               ++       +IG   M+   +++D E   L W + +C+ +
Sbjct: 356 DGSKEHLSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 96/397 (24%), Positives = 158/397 (39%), Gaps = 73/397 (18%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +  GTP  +    +D GS+++W PC         +   +S    +  + P  SSSSK + 
Sbjct: 71  LSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLG 130

Query: 174 CSHPLCK-------------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           C +P C              S  SC +   P PY+  Y +  T   G  + + LHL S S
Sbjct: 131 CKNPKCSWIHHSNINCDQDCSIKSCLNQTCP-PYMIFYGSGTTG--GVALSETLHLHSLS 187

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
           K         + ++GC      S      P G+ G G G  S+PS L          S  
Sbjct: 188 K--------PNFLVGC------SVFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHR 233

Query: 281 FDENDSGS---VFFGDQGPATQQSTS--FLPI--GEKYD-------AYFVGVESYCIGNS 326
           FD++   S   V   +Q  + +++ +  + P     K D        Y++G+    +G  
Sbjct: 234 FDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGH 293

Query: 327 CLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNS 373
            +                 ++DSG +FTF+  E +  +  +F + +   R   +      
Sbjct: 294 HVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIG 353

Query: 374 WKYCYNASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 431
            + C+N S  + +  P++RL F    + +  V N+ F+F   E   V CLTV+ TDG  G
Sbjct: 354 LRPCFNVSDAKTVSFPELRLYFKGGADVALPVENY-FAFVGGE---VACLTVV-TDGVAG 408

Query: 432 ---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                    I+G   M    + +D  N +L +   KC
Sbjct: 409 PERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 149/376 (39%), Gaps = 52/376 (13%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP  + LVA+D  ++  WVPC  C+ CAP ++S           +DP+ SS+ + V C
Sbjct: 106 LGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASS---------PSFDPTQSSTYRPVRC 156

Query: 175 SHPLC----KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             P C     +  SC +     C +   Y++    +   L  D L L+  +  A      
Sbjct: 157 GAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHA--VLGQDALSLSDSNGAA---VPD 211

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSICF----DEN 284
                GC R  TGS      P G++G G G +   S L++      S FS C       N
Sbjct: 212 DHYTFGCLRVVTGSG-GSVPPQGLVGFGRGPL---SFLSQTKATYGSIFSYCLPSYKSSN 267

Query: 285 DSGSVFFGDQG-PATQQSTSFLP------------IGEKYDAYFVGVESYCIGNSCLTQS 331
            SG++  G  G P   ++T  L             +G + +   V + +  +     T  
Sbjct: 268 FSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGR 327

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
           G   +VD+G  FT L    YA +   F + VS+      G  +  CY  +  +   VP +
Sbjct: 328 G-GTIVDAGTMFTRLSPPAYAALRNAFRRGVSAPAAPALGG-FDTCYYVNGTK--SVPAV 383

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM-----MGHRIVFD 446
             +F+      +           G  V CL + +   D    G N +       HR+VFD
Sbjct: 384 AFVFAGGARVTLPEENVVISSTSG-GVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFD 442

Query: 447 RENLKLAWSHSKCEEV 462
             N ++ +S   C  V
Sbjct: 443 VGNGRVGFSRELCTAV 458


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/420 (23%), Positives = 168/420 (40%), Gaps = 56/420 (13%)

Query: 59  VEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGT 118
           V+Y++  LS +  R+ +  +L S    +++  L  S               ++  + +GT
Sbjct: 98  VKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSAN-------------YFVVVGLGT 144

Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
           P     +  D GS+L W      QC P + S Y   D   + +DPS SSS  N++C+  L
Sbjct: 145 PKRDLSLVFDTGSDLTWT-----QCEPCAGSCYKQQD---AIFDPSKSSSYINITCTSSL 196

Query: 179 CKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           C        +S C S    C Y   Y  + T S G+L  + L + +       + +    
Sbjct: 197 CTQLTSAGIKSRCSSSTTACIYGIQYGDKST-SVGFLSQERLTITA-------TDIVDDF 248

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVF 290
           + GCG+   G +  G+A  G++GLG   +S   +   + +    FS C     S  G + 
Sbjct: 249 LFGCGQDNEGLF-SGSA--GLIGLGRHPISF--VQQTSSIYNKIFSYCLPSTSSSLGHLT 303

Query: 291 FGDQGPATQQSTSFLPIGE-KYDAYFVGVE--SYCIGNSCL------TQSGFQALVDSGA 341
           FG    AT  +  + P+     D  F G++     +G + L      T S   +++DSG 
Sbjct: 304 FG-ASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGT 362

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
             T L    YA +   F + +    ++ +   +  CY+ S  + + VP +   F+     
Sbjct: 363 VITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDFEFAGG--V 420

Query: 402 VVRNHIFSFPENEGFTVFCLTVMS--TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            V   +            CL   +   D D  I G        +V+D E  ++ +  + C
Sbjct: 421 TVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 480


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 154/378 (40%), Gaps = 60/378 (15%)

Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSE 160
           F N  Y +    + +GTP       +D GS + W  C  C+ C           ++N   
Sbjct: 60  FDNSVYLMK---LQVGTPPFEIQAIIDTGSEITWTQCLPCVHC----------YEQNAPI 106

Query: 161 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           +DPS SS+ K   C                 CPY  DY  + T + G L  + + L S S
Sbjct: 107 FDPSKSSTFKEKRCD-------------GHSCPYEVDYF-DHTYTMGTLATETITLHSTS 152

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
               +  V    IIGCG     S+   +   G++GL  G  S+  +    G      S C
Sbjct: 153 G---EPFVMPETIIGCGHNN--SWFKPSF-SGMVGLNWGPSSL--ITQMGGEYPGLMSYC 204

Query: 281 FDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ 334
           F    +  + FG      G     +T F+    K   Y++ +++  +GN+ +   G  F 
Sbjct: 205 FSGQGTSKINFGANAIVAGDGVVSTTMFMTTA-KPGFYYLNLDAVSVGNTRIETMGTTFH 263

Query: 335 AL-----VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
           AL     +DSG + T+ P      V    + +V++ R +    +   CYN+ + ++  V 
Sbjct: 264 ALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTIDIFPVI 323

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM----STDGDYGIIGQ-NFMMGHRIV 444
            M   FS     V+  +      N G  VFCL ++    + +  +G   Q NF++G    
Sbjct: 324 TMH--FSGGVDLVLDKYNMYMESNNG-GVFCLAIICNSPTQEAIFGNRAQNNFLVG---- 376

Query: 445 FDRENLKLAWSHSKCEEV 462
           +D  +L +++S + C  +
Sbjct: 377 YDSSSLLVSFSPTNCSAL 394


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/376 (25%), Positives = 144/376 (38%), Gaps = 47/376 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP   F V +D GS L WV C+          Y      N   +    S S 
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCR----------YRARGKDNRRVFRADESKSF 155

Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSK 221
           K V C    CK       S ++C +   PC Y  DY   D S++ G    + + +   + 
Sbjct: 156 KTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY--DYRYADGSAAQGVFAKETITVGLTNG 213

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              +       +IGC    TG    GA  DGV+GL   D S  S      L    FS C 
Sbjct: 214 RMARLPGH---LIGCSSSFTGQSFQGA--DGVLGLAFSDFSFTS--TATSLYGAKFSYCL 266

Query: 282 -----DENDSGSVFFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 328
                ++N S  + FG         + T+ L +      Y + V    +G   L      
Sbjct: 267 VDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQV 326

Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSE-E 384
               SG   ++DSG S T L    Y +VV    + LV  KR+  +G   +YC++ +S   
Sbjct: 327 WDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFN 386

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRI 443
           + K+P +         F    H  S+  +    V CL  +S       +IG      +  
Sbjct: 387 VSKLPQLTFHLKGGARF--EPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLW 444

Query: 444 VFDRENLKLAWSHSKC 459
            FD     L+++ S C
Sbjct: 445 EFDLMASTLSFAPSAC 460


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 98/380 (25%), Positives = 156/380 (41%), Gaps = 67/380 (17%)

Query: 114 IDIGTP---NVSF--LVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           I +GTP   + SF  L++ D GS++ W+ C  C +C       Y  L           SS
Sbjct: 129 ITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLK----------SS 178

Query: 168 SSKNVSCSHPLCKSRSS---CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           S+ +V C  P C++  S   C    + C Y  +Y    +S+  + V+ +          P
Sbjct: 179 SASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTF--------P 230

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
                  V IGCG    G +   AA  G++GLG G +S PS +  AG    SFS C    
Sbjct: 231 PGVRVPGVAIGCGSDNQGLFPAPAA--GILGLGRGSLSFPSQI--AGRYGRSFSYCLAGQ 286

Query: 285 DSG----SVFFGDQGPA------TQQSTSFLPIGEKYDAYFVGVESYCIGN---SCLTQS 331
            +G    ++ FG    A          T  L     Y  Y+VG+    +G      +T+S
Sbjct: 287 GTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTES 346

Query: 332 GFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKY---C 377
             +          +VDSG + T L    YA     F ++ + K +     G  + +   C
Sbjct: 347 DLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF-RVAAVKELGWPSPGGPFAFFDTC 405

Query: 378 YNA-SSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYG--I 432
           Y++     M KVP + + F+      +  +N++     N+G   F     +  GD G  I
Sbjct: 406 YSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAF---AGSGDRGVSI 462

Query: 433 IGQNFMMGHRIVFDRENLKL 452
           IG   + G R+V+D +  ++
Sbjct: 463 IGNIQLQGFRVVYDVDGQRV 482


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 94/417 (22%), Positives = 180/417 (43%), Gaps = 42/417 (10%)

Query: 61  YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW-IDIGTP 119
           + E+L  +  +    R K+ +++N  +  +   +       +G      +Y   + +GTP
Sbjct: 95  HTEILRRDQDRVDAIRRKVTASSNKPKGGVSLLAN------WGKSLSTTNYVASLRLGTP 148

Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
               +V LD GS+  WV     QC P +  Y    ++    +DP++SS+   V C    C
Sbjct: 149 ATELVVELDTGSDQSWV-----QCKPCADCY----EQRDPVFDPTASSTYSAVPCGAREC 199

Query: 180 KSRSSCKSLKDP-------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           +  +S  S ++        CPY   Y  +D+ + G L  D L L+     +P  +V    
Sbjct: 200 QELASSSSSRNCSSDNNKNCPYEVSYD-DDSHTVGDLARDTLTLSPSPSPSPADTVPG-F 257

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
           + GCG    G++ +    DG++GLGLG  S+PS +  A     +FS C   + S + +  
Sbjct: 258 VFGCGHSNAGTFGE---VDGLLGLGLGKASLPSQV--AARYGAAFSYCLPSSPSAAGYLS 312

Query: 293 DQGPATQQSTSF--LPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ----ALVDSGASFT 344
             G A + +  F  +  G+   +Y++ +    +    +    S F      ++DSG +F+
Sbjct: 313 FGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDSGTAFS 372

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
            LP   YA +   F   +   R     +S  +  CY+ +  E +++P + L+F+   +  
Sbjct: 373 RLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFADGATVH 432

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +      +  N+     CL  +  + D GI+G        +++D  + ++ +    C
Sbjct: 433 LHPSGVLYTWND-VAQTCLAFVP-NHDLGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 108/470 (22%), Positives = 194/470 (41%), Gaps = 89/470 (18%)

Query: 5   VAICML-FGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
           +A+C+  FGCI    +    F+++LVHR S ++                 P  NS +   
Sbjct: 14  IALCVASFGCIY---AHNAGFTTELVHRDSPKS-----------------PLYNSQQTHL 53

Query: 64  LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
              +   +R  +RV     ++  R       +  ++    N   +L    + +GTP    
Sbjct: 54  QRWNKAMRRSVSRV-----HHFQRTAATVSPKEVESEIIANGGEYL--MSLSLGTPPFEI 106

Query: 124 LVALDAGSNLLWVPCQ-CIQC----APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
           L   D GS+L+W  C  C +C    APL              +DP SS + +++SC    
Sbjct: 107 LAIADTGSDLIWTQCTPCDKCYKQIAPL--------------FDPKSSKTYRDLSCDTRQ 152

Query: 179 CKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK---HAPQSSVQSSV 232
           C++    SSC S +  C Y + Y  + + ++G L  D + L S +    + P++      
Sbjct: 153 CQNLGESSSCSS-EQLCQY-SYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKT------ 204

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDS 286
           +IGCGR+  G++       G++GLG G +S+ S +  +  +   FS C          +S
Sbjct: 205 VIGCGRRNNGTF--DKKDSGIIGLGGGPMSLISQMGSS--VGGKFSYCLVPFSSESAGNS 260

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCL-------TQSGFQALV 337
             + FG     +       P+  K     Y++ +E+  +G+  +         S    ++
Sbjct: 261 SKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGNIII 320

Query: 338 DSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           DSG S T  P   + E     +  +++ +R         +CY  + +  LKVP +   F+
Sbjct: 321 DSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPD--LKVPVITAHFN 378

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQ-NFMMGHRI 443
                +   + F    ++   V CL   ST     +G + Q NF++G+ I
Sbjct: 379 GADVVLQTLNTFILISDD---VLCLAFNSTQSGAIFGNVAQMNFLIGYDI 425


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/376 (25%), Positives = 144/376 (38%), Gaps = 47/376 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP   F V +D GS L WV C+          Y      N   +    S S 
Sbjct: 84  YFTEIRVGTPAKKFRVVVDTGSELTWVNCR----------YRARGKDNRRVFRADESKSF 133

Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSK 221
           K V C    CK       S ++C +   PC Y  DY   D S++ G    + + +   + 
Sbjct: 134 KTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY--DYRYADGSAAQGVFAKETITVGLTNG 191

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              +       +IGC    TG    GA  DGV+GL   D S  S      L    FS C 
Sbjct: 192 RMARLPGH---LIGCSSSFTGQSFQGA--DGVLGLAFSDFSFTS--TATSLYGAKFSYCL 244

Query: 282 -----DENDSGSVFFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 328
                ++N S  + FG         + T+ L +      Y + V    +G   L      
Sbjct: 245 VDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQV 304

Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSE-E 384
               SG   ++DSG S T L    Y +VV    + LV  KR+  +G   +YC++ +S   
Sbjct: 305 WDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFN 364

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRI 443
           + K+P +         F    H  S+  +    V CL  +S       +IG      +  
Sbjct: 365 VSKLPQLTFHLKGGARF--EPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLW 422

Query: 444 VFDRENLKLAWSHSKC 459
            FD     L+++ S C
Sbjct: 423 EFDLMASTLSFAPSAC 438


>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
          Length = 178

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 48/128 (37%), Positives = 63/128 (49%), Gaps = 8/128 (6%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVP-CQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+YT I IGTP V + V LD GS   WV    C QC      + + + R L+ YDP SS 
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 112

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH      +     
Sbjct: 113 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 170

Query: 228 VQSSVIIG 235
             +SV  G
Sbjct: 171 TSTSVTFG 178


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 99/391 (25%), Positives = 168/391 (42%), Gaps = 72/391 (18%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +GTP + F V +D GSNL+W  C  C +C P                 P+ SS+   +
Sbjct: 95  ISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTP--------APVLQPARSSTFSRL 146

Query: 173 SCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQ 225
            C+   C+     SR    +    C Y  +Y+     ++GYL  + L +   +F K    
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAY--NYTYGSGYTAGYLATETLTVGDGTFPK---- 200

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DE 283
                 V  GC    T + +D ++  G++GLG G +S+ S LA        FS C   D 
Sbjct: 201 ------VAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAVG-----RFSYCLRSDM 244

Query: 284 NDSGS--VFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESYCIGNSCL------- 328
            D G+  + FG     T+    QST  L  P  ++   Y+V +    + ++ L       
Sbjct: 245 ADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTF 304

Query: 329 --TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY----CYNA 380
             TQ+G     +VDSG + T+L  + YA V   F   +++   +   +   Y    CY  
Sbjct: 305 GFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKP 364

Query: 381 SS---EEMLKVPDMRLIFSKNQSF--VVRNHIFSF-PENEG-FTVFCLTVMSTDGDY--G 431
           S+    + ++VP + L F+    +   V+N+      +++G  TV CL V+    D    
Sbjct: 365 SAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPIS 424

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IIG    M   +++D +    +++ + C ++
Sbjct: 425 IIGNLMQMDMHLLYDIDGGMFSFAPADCAKL 455


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 90/402 (22%), Positives = 152/402 (37%), Gaps = 87/402 (21%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS---- 165
           ++  + IGTP  + L+  D GS+L+WV     +C+P          RN S   P S    
Sbjct: 86  YFVSLRIGTPPQTLLLVADTGSDLIWV-----KCSPC---------RNCSHRSPGSAFFA 131

Query: 166 --SSSSKNVSCSHPLCK-----SRSSCK--SLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
             S++   + C  P C+       + C    L  PC Y   Y+ + ++++G+   + L L
Sbjct: 132 RHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYA-DSSTTTGFFSKEALTL 190

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGDVSVPSLLAKAGLI 273
            + +    +    + +  GCG + +G  L GA+     GVMGLG   +S  S L +    
Sbjct: 191 NTSTGKVKK---LNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRR--F 245

Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-------------------- 313
            + FS C  +              +   TSFL IG   +                     
Sbjct: 246 GSKFSYCLMDYT-----------LSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSP 294

Query: 314 --YFVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
             Y++ ++   +    L                 ++DSG + TF+    Y E++  F K 
Sbjct: 295 TFYYIAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKR 354

Query: 362 VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF--VVRNHIFSFPENEGFTVF 419
           V     +     +  C N S      +P M    +    F    RN+        G  + 
Sbjct: 355 VKLPSPAEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFI----ETGDQIK 410

Query: 420 CLTV--MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           CL V  +S DG + ++G     G  + FDR+  +L ++   C
Sbjct: 411 CLAVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGC 452


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 99/391 (25%), Positives = 168/391 (42%), Gaps = 72/391 (18%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +GTP + F V +D GSNL+W  C  C +C P                 P+ SS+   +
Sbjct: 95  ISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTP--------APVLQPARSSTFSRL 146

Query: 173 SCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQ 225
            C+   C+     SR    +    C Y  +Y+     ++GYL  + L +   +F K    
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAY--NYTYGSGYTAGYLATETLTVGDGTFPK---- 200

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DE 283
                 V  GC    T + +D ++  G++GLG G +S+ S LA        FS C   D 
Sbjct: 201 ------VAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAVG-----RFSYCLRSDM 244

Query: 284 NDSGS--VFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESYCIGNSCL------- 328
            D G+  + FG     T+    QST  L  P  ++   Y+V +    + ++ L       
Sbjct: 245 ADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTF 304

Query: 329 --TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY----CYNA 380
             TQ+G     +VDSG + T+L  + YA V   F   +++   +   +   Y    CY  
Sbjct: 305 GFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKP 364

Query: 381 SS---EEMLKVPDMRLIFSKNQSF--VVRNHIFSF-PENEG-FTVFCLTVMSTDGDY--G 431
           S+    + ++VP + L F+    +   V+N+      +++G  TV CL V+    D    
Sbjct: 365 SAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPIS 424

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           IIG    M   +++D +    +++ + C ++
Sbjct: 425 IIGNLMQMDMHLLYDIDGGMFSFAPADCAKL 455


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 92/374 (24%), Positives = 155/374 (41%), Gaps = 44/374 (11%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD-RNLSEYDPSSSSSSKNVSC 174
           +GTP+  F++  D GS+L W+ C+   C   + S   +   R+   +  + SSS K + C
Sbjct: 89  VGTPSQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPC 147

Query: 175 SHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHAPQS 226
              +CK       S ++C +   PC Y  DY   D S++ G+  ++ + +    K   + 
Sbjct: 148 LTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFFANETVTVE--LKEGRKM 203

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----- 281
            +  +V+IGC     G     A  DGVMGLG    S    +  A      FS C      
Sbjct: 204 KLH-NVLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--IKAAEKFGGKFSYCLVDHLS 258

Query: 282 DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------T 329
            +N S  + FG     +      + + L +G     Y V +    IG + L         
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDV 318

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISLQGNSWKYCYNASSEEMLKV 388
           +     ++DSG+S TFL    Y  V+      L+  +++ +     +YC+N++  E   V
Sbjct: 319 KGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLV 378

Query: 389 PDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD-GDYGIIGQNFMMGHRIVF 445
           P +   F+    F   V++++ S  +     V CL  +S       ++G      H   F
Sbjct: 379 PRLVFHFADGAEFEPPVKSYVISAADG----VRCLGFVSVAWPGTSVVGNIMQQNHLWEF 434

Query: 446 DRENLKLAWSHSKC 459
           D    KL ++ S C
Sbjct: 435 DLGLKKLGFAPSSC 448


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 86/365 (23%), Positives = 135/365 (36%), Gaps = 49/365 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   F V  D GS+  WV     QC P  A  Y   +     +DP+ S++  N+S
Sbjct: 165 VRLGTPAERFTVVFDTGSDTTWV-----QCQPCVAYCYRQKE---PLFDPTKSATYANIS 216

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           CS   C            C Y   Y  + + + G+   D L LA             +  
Sbjct: 217 CSSSYCSDLYVSGCSGGHCLYGIQYG-DGSYTIGFYAQDTLTLA--------YDTIKNFR 267

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
            GCG K  G +   A   G++GLG G  S+P     K G +   F+ C     +G+ F  
Sbjct: 268 FGCGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDKYGGV---FAYCLPATSAGTGFLD 321

Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG-----FQALVDSGASFTF 345
            G   PA     + + +      Y+VG+    +G   L   G        LVDSG   T 
Sbjct: 322 LGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITR 381

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWK---------YCYNASSEE--MLKVPDMRLI 394
           LP   YA +   F K       ++QG  +           CY+ +  +   + +P + L+
Sbjct: 382 LPPSAYAPLRSAFSK-------AMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLV 434

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
           F       V      +  +           + D D  I+G      H +++D     + +
Sbjct: 435 FQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGF 494

Query: 455 SHSKC 459
           +   C
Sbjct: 495 APGAC 499


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 160/378 (42%), Gaps = 56/378 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP V F+   D GS+L W  CQ C  C P          ++   YD + SSS   V
Sbjct: 97  LAIGTPPVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPIYDTAVSSSFSPV 146

Query: 173 SCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
            C+   C    S  +C +   PC Y   Y  +   S+G L  + L        AP  SV 
Sbjct: 147 PCASATCLPIWSSRNCTASSSPCRYRYAYG-DGAYSAGVLGTETLTFPG----APGVSV- 200

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDEND 285
             +  GCG    G   +     G +GLG G +   SL+A+ G+    FS C    F+ + 
Sbjct: 201 GGIAFGCGVDNGGLSYNST---GTVGLGRGSL---SLVAQLGV--GKFSYCLTDFFNTSL 252

Query: 286 SGSVFFGD----QGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--------- 329
              V FG       P+T  +    P+ +       Y+V +E   +G++ L          
Sbjct: 253 GSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLR 312

Query: 330 -QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEML 386
                  +VDSG +FTFL  E    VVV     V  + +    +    C+ A++  +++ 
Sbjct: 313 DDGSGGMIVDSGTTFTFL-VESAFRVVVDHVAGVLRQPVVNASSLDSPCFPAATGEQQLP 371

Query: 387 KVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIV 444
            +PDM L F+      + R++  SF + E  + FCL +  S   D  I+G       +++
Sbjct: 372 AMPDMVLHFAGGADMRLHRDNYMSFNQEE--SSFCLNIAGSPSADVSILGNFQQQNIQML 429

Query: 445 FDRENLKLAWSHSKCEEV 462
           FD    +L++  + C ++
Sbjct: 430 FDITVGQLSFMPTDCGKL 447


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 108/424 (25%), Positives = 172/424 (40%), Gaps = 53/424 (12%)

Query: 56  KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQL-----LFPSEGSQTHFFGNQFYWLH 110
           K    + E L  +  +      KL S  NSS  +L       P+  S  +  G   Y + 
Sbjct: 76  KEKPSHEETLGRDQLRAANIHAKLSSPRNSSAKELQQSGVTIPT--SSGYSLGTPEYVI- 132

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
              + +GTP V+ ++++D GS++ WV     QCAP +A   +S    L  +DP+ S++  
Sbjct: 133 --TVSLGTPAVTQVMSIDTGSDVSWV-----QCAPCAAQSCSSQKDKL--FDPAKSATYS 183

Query: 171 NVSCSHPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
             SCS   C       +  L   C YI  Y  + ++++G    D L L +       S  
Sbjct: 184 AFSCSSAQCAQLGGEGNGCLNSHCQYIVKY-VDHSNTTGTYGSDTLGLTT-------SDA 235

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSG 287
             +   GC  +  G        DG+MGLG GD    SL+++ A     +FS C   + S 
Sbjct: 236 VKNFQFGCSHRANGFV---GQLDGLMGLG-GDTE--SLVSQTAATYGKAFSYCLPPSSSS 289

Query: 288 SVFFGDQGPATQQST----SFLPIGEKYDAYFVGV--ESYCIGNSCLT--QSGFQ--ALV 337
           +  F   G A   ++    S  P+       F GV  ++  +  + L    S F   ++V
Sbjct: 290 AGGFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVV 349

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG   T LP   Y  +   F K + +   +        C++ S  + ++VP + L FS 
Sbjct: 350 DSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFS- 408

Query: 398 NQSFVVRNHIFSFPENEGFTVFCL--TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
                 R  +     +  F   CL  T  + DGD GI+G        ++FD     L + 
Sbjct: 409 ------RGAVMDLDVSGIFYAGCLAFTATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFR 462

Query: 456 HSKC 459
              C
Sbjct: 463 PGAC 466


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/383 (25%), Positives = 150/383 (39%), Gaps = 56/383 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVP-CQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP    L+ +D  S L WV    C  C+P            +  ++P  SSS  +  C
Sbjct: 5   IGTPPREVLLLVDTASELTWVQGTSCTNCSP----------TKVPPFNPGLSSSFISEPC 54

Query: 175 SHPLCKSR------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           +  +C  R      S+C      C +   Y  + + + G +  +I  L S+   A   S 
Sbjct: 55  TSSVCLGRSKLGFQSACNRSTGSCSFQVAY-LDGSEAYGVIAREIFSLQSWDGAA---ST 110

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL---AKAGLIQNSFSICFDE-- 283
              VI GC  K     +D ++  G +GL  G  S P+ +   +K+GL  + FS CF    
Sbjct: 111 LGDVIFGCASKDLQRPVDFSS--GTLGLNRGSFSFPAQIGSRSKSGL-SDRFSYCFPNRA 167

Query: 284 ---NDSGSVFFGDQG-PATQQSTSFL----PIGEKYDAYFVGVESYCIGNSCL--TQSGF 333
              N SG + FGD G PA       L    PI    D Y+VG++   +G   L   +S F
Sbjct: 168 EHLNSSGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAF 227

Query: 334 Q--------ALVDSGASFTFLPTEIYAEVVVKFDKLV-SSKRISLQGNSWKYCYN--ASS 382
           +           DSG + +FL    +  +V  F + V    R S    + + CY+  A  
Sbjct: 228 KIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGD 287

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMS----TDGDYGIIGQN 436
             +   P + L F  N    +R      P          CL  ++      G   +IG  
Sbjct: 288 ARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNY 347

Query: 437 FMMGHRIVFDRENLKLAWSHSKC 459
               + I  D E  ++ ++ + C
Sbjct: 348 QQQDYLIEHDLERSRIGFAPANC 370


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 120/482 (24%), Positives = 191/482 (39%), Gaps = 72/482 (14%)

Query: 15  LLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVAD--SWPKKNSVEYLELLLSNDWKR 72
           L +  +AV+F+   +++   + +   +S S  + +    S  K +  +Y  L LS    R
Sbjct: 38  LQNAHNAVAFTPHHLNQHQRQQEALLLSSSFGIHLRSRASIQKPSHRDYKSLTLSR-LAR 96

Query: 73  QKTRVK-LQSNNN----SSRNQLLFPSEGSQTHFFGN-----------QFYWLHYTWIDI 116
              RVK LQ+  +       N  L P+E S   F  N           Q    ++  + I
Sbjct: 97  DSARVKSLQTRLDLVLKRVSNSDLHPAE-SNAEFEANALQGPVVSGTSQGSGEYFLRVGI 155

Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
           G P     V LD GS++ W     IQCAP S  Y  S       +DP SS+S   + C  
Sbjct: 156 GKPPSQAYVVLDTGSDVSW-----IQCAPCSECYQQSD----PIFDPVSSNSYSPIRCDA 206

Query: 177 PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
           P CKS    +     C Y   Y  + + + G    + + L         ++   +V IGC
Sbjct: 207 PQCKSLDLSECRNGTCLYEVSYG-DGSYTVGEFATETVTLG--------TAAVENVAIGC 257

Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF---FGD 293
           G    G ++  A   G+ G  L   S P     A +   SFS C    DS +V    F  
Sbjct: 258 GHNNEGLFVGAAGLLGLGGGKL---SFP-----AQVNATSFSYCLVNRDSDAVSTLEFNS 309

Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA--------LVDSGASF 343
             P    +       E    Y++G++   +G   L   +S F+         ++DSG + 
Sbjct: 310 PLPRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAV 369

Query: 344 TFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
           T L +E+Y  +   F K    +  +  +SL    +  CY+ SS E ++VP +   F + +
Sbjct: 370 TRLRSEVYDALRDAFVKGAKGIPKANGVSL----FDTCYDLSSRESVQVPTVSFHFPEGR 425

Query: 400 SFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
              +  RN++      +    FC     T     I+G     G R+ FD  N  + +S  
Sbjct: 426 ELPLPARNYLIPV---DSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSAD 482

Query: 458 KC 459
            C
Sbjct: 483 SC 484


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 90/355 (25%), Positives = 149/355 (41%), Gaps = 43/355 (12%)

Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
           VS  V +D  S++ WV  QC+ C P+   +   L ++   YDP+ SS+   + C  P CK
Sbjct: 167 VSQTVVVDTSSDIPWV--QCLPC-PIPQCH---LQKD-PLYDPAKSSTFAPIPCGSPACK 219

Query: 181 SRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
              S     C    D C YI +Y  +  +++G  V D L ++        + V      G
Sbjct: 220 ELGSSYGNGCSPTTDECKYIVNYG-DGKATTGTYVTDTLTMSP-------TIVVKDFRFG 271

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
           C     GS+ +  A  G++ LG G  S+  L   A    N+FS C  +  S   F    G
Sbjct: 272 CSHAVRGSFSNQNA--GILALGGGRGSL--LEQTADAYGNAFSYCIPKPSSAG-FLSLGG 326

Query: 296 PATQQ-STSFLPIGEKYDA---YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLP 347
           P       S+ P+ +   A   Y V +E+  +    L    T     A++DSGA  T LP
Sbjct: 327 PVEASLKFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLP 386

Query: 348 TEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH 406
            ++YA +   F   + +   ++    +   CY+ +    +KVP + L+F+   +  +   
Sbjct: 387 PQVYAALRAAFRSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPA 446

Query: 407 IFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                  +G    CL   +T G+   G IG      + +++D    K+ +    C
Sbjct: 447 SIIL---DG----CLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 91/384 (23%), Positives = 144/384 (37%), Gaps = 48/384 (12%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
           G +   L+Y    +G       V +D  S L WV     QC P  A +    D+    +D
Sbjct: 105 GARLRTLNYVAT-VGIGGGEATVIVDTASELTWV-----QCEPCDACH----DQQEPLFD 154

Query: 163 PSSSSSSKNVSCSHPLCK--------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
           PSSS S   V C+   C         S  +C      C Y   Y  + + S G L  D L
Sbjct: 155 PSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYR-DGSYSRGVLAHDRL 213

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
            LA          +Q   + GCG    G +       G+MGLG   +S+ S         
Sbjct: 214 SLAG-------EDIQG-FVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQ--FG 260

Query: 275 NSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNS 326
             FS C    +   SGS+  GD     + ST  +      D      Y   +    +G  
Sbjct: 261 GVFSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGE 320

Query: 327 CLTQSGF------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
            +   GF      +A+VDSG   T L   +YA V  +F   ++    +   +    C++ 
Sbjct: 321 DVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDL 380

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFM 438
           +    ++VP ++L+F       V +    +      +  CL + S   +Y   IIG    
Sbjct: 381 TGLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQ 440

Query: 439 MGHRIVFDRENLKLAWSHSKCEEV 462
              R++FD    ++ ++   C+ +
Sbjct: 441 KNLRVIFDTVGSQIGFAQETCDYI 464


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 93/404 (23%), Positives = 176/404 (43%), Gaps = 82/404 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQC-IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + +GTP+ +  + +D GS+L+W PC     CA  S ++  +    + ++ P  SSSSK +
Sbjct: 88  LSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCA--SCNFPNTDITKIPKFMPRLSSSSKLI 145

Query: 173 SCSHPLC------KSRSSC-------KSLKDPC-PYIADYSTEDTSSSGYLVDDILHLAS 218
            C +P C        +S C       ++    C PYI  Y     S++G L+ + ++   
Sbjct: 146 GCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLG--STAGLLLSETINF-- 201

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
                P  ++ S  + GC      S L    P+G+ G G    S+P  L   GL + S+ 
Sbjct: 202 -----PNKTI-SDFLAGC------SLLSTRQPEGIAGFGRSQESLPLQL---GLKKFSYC 246

Query: 279 IC---FDENDSGSVFFGDQGPATQQST----SFLPIGEKY---------DAYFVGVESYC 322
           +    FD++   S    D GP+T  S     S+ P  +           + Y+V +    
Sbjct: 247 LVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKII 306

Query: 323 IGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--- 369
           +G + +          +      +VDSG++FTF+   ++  +  +F+K +++  ++    
Sbjct: 307 VGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQ 366

Query: 370 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTD- 427
           +    + C++ S E+ + +PD+   F       +  ++ F+F +     V CLT++S + 
Sbjct: 367 KLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVD---MGVVCLTIVSDNA 423

Query: 428 ----GDYGII--GQNFMMGH------RIVFDRENLKLAWSHSKC 459
               GD G+   G   ++G+       I +D EN +  +    C
Sbjct: 424 AALGGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/383 (23%), Positives = 152/383 (39%), Gaps = 62/383 (16%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP   F + +D GS+L W+ C  C+ C           ++    +DP++S S +NV+C
Sbjct: 158 VGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPATSLSYRNVTC 207

Query: 175 SHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             P C         R+  +   DPCPY   Y  +  ++     D  L   + +  AP +S
Sbjct: 208 GDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTG----DLALEAFTVNLTAPGAS 263

Query: 228 VQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
            +   V+ GCG    G +   A   G+    L   S   L A  G   ++FS C  ++ S
Sbjct: 264 RRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFAS--QLRAVYG---HAFSYCLVDHGS 318

Query: 287 ---GSVFFGDQG-----PATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS------ 331
                + FGD       P    +          D  Y+V ++   +G   L  S      
Sbjct: 319 SVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDV 378

Query: 332 ----GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEML 386
                   ++DSG + ++     Y  +   F +++  +  +         CYN S  E +
Sbjct: 379 GKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERV 438

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMST-DGDYGIIGQNFMM 439
           +VP+  L+F+          ++ FP    F       + CL V+ T      IIG     
Sbjct: 439 EVPEFSLLFADGA-------VWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQ 491

Query: 440 GHRIVFDRENLKLAWSHSKCEEV 462
              +++D +N +L ++  +C EV
Sbjct: 492 NFHVLYDLQNNRLGFAPRRCAEV 514


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 92/371 (24%), Positives = 158/371 (42%), Gaps = 49/371 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + +GTP     + LD GS+++W     IQCAP    Y     +    ++P+ S S 
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVW-----IQCAPCKKCY----SQTDPVFNPTKSRSF 197

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
            N+ C  PLC+   S  C + K  C Y   Y  + + + G    + L          + +
Sbjct: 198 ANIPCGSPLCRRLDSPGCSTKKHICLYQVSYG-DGSFTYGEFSTETLTF--------RGT 248

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
               V +GCG    G ++  A    ++GLG G +S PS + +       FS C  +  + 
Sbjct: 249 RVGRVALGCGHDNEGLFIGAAG---LLGLGRGRLSFPSQIGRR--FSRKFSYCLVDRSAS 303

Query: 288 S----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ--- 334
           S    + FGD   A  ++  F P+    K D  Y+V +    +G +    +T S F+   
Sbjct: 304 SKPSYMVFGDS--AISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDS 361

Query: 335 -----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
                 ++DSG S T L    Y  +   F    S+ + + + + +  C++ S +  +KVP
Sbjct: 362 TGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVP 421

Query: 390 DMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 448
            + L F   + S    N++    +N G   FC     T     I+G     G R+V+D  
Sbjct: 422 TVVLHFRGADVSLPASNYLIPV-DNSG--SFCFAFAGTMSGLSIVGNIQQQGFRVVYDLA 478

Query: 449 NLKLAWSHSKC 459
             ++ ++   C
Sbjct: 479 ASRVGFAPRGC 489


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/383 (23%), Positives = 152/383 (39%), Gaps = 62/383 (16%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP   F + +D GS+L W+ C  C+ C           ++    +DP++S S +NV+C
Sbjct: 158 VGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASLSYRNVTC 207

Query: 175 SHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             P C         R+  +   DPCPY   Y  +  ++     D  L   + +  AP +S
Sbjct: 208 GDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTG----DLALEAFTVNLTAPGAS 263

Query: 228 VQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
            +   V+ GCG    G +   A   G+    L   S   L A  G   ++FS C  ++ S
Sbjct: 264 RRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFAS--QLRAVYG---HAFSYCLVDHGS 318

Query: 287 ---GSVFFGDQG-----PATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS------ 331
                + FGD       P    +          D  Y+V ++   +G   L  S      
Sbjct: 319 SVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDV 378

Query: 332 ----GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEML 386
                   ++DSG + ++     Y  +   F +++  +  +         CYN S  E +
Sbjct: 379 GKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERV 438

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMST-DGDYGIIGQNFMM 439
           +VP+  L+F+          ++ FP    F       + CL V+ T      IIG     
Sbjct: 439 EVPEFSLLFADGA-------VWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQ 491

Query: 440 GHRIVFDRENLKLAWSHSKCEEV 462
              +++D +N +L ++  +C EV
Sbjct: 492 NFHVLYDLQNNRLGFAPRRCAEV 514


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 152/362 (41%), Gaps = 40/362 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I IGTP+V  L   D GS+L WV     QC+P   +      +N   YDP +SS+   + 
Sbjct: 100 IYIGTPSVERLAIADTGSDLTWV-----QCSPCDNT--KCFAQNTPLYDPLNSSTFTLLP 152

Query: 174 CSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           C    C     S+  C    D C Y   Y  +++ S G L  D + L     H       
Sbjct: 153 CDSQPCTQLPYSQYVCSDYGD-CIYAYTYG-DNSYSYGGLSSDSIRLMLLQLH-----YN 205

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDS 286
           S +  GCG +   +        G++GLG G +S+ S L     I + FS C   F  N +
Sbjct: 206 SKICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSN 263

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLT--QSGFQALVDSGAS 342
             + FG+            P+  K D   Y++ +E   +G   +   Q+    ++DSG++
Sbjct: 264 SKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGST 323

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
            T+L    Y E V    + V+ +        + +C+    E M   PD+   F+     +
Sbjct: 324 LTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTY-KEGMSTPPDVVFHFTGGDVVL 382

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQ-NFMMGHRIVFDRENLKLAWSHS 457
              +     E+    + C TV+ +  D    +G +GQ +F +G    +D +  K++++ +
Sbjct: 383 KPMNTLVLIEDN---LICSTVVPSHFDGIAIFGNLGQIDFHVG----YDIQGGKVSFAPT 435

Query: 458 KC 459
            C
Sbjct: 436 DC 437


>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
          Length = 775

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 90/437 (20%), Positives = 173/437 (39%), Gaps = 66/437 (15%)

Query: 59  VEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGT 118
           V + E ++     R+    +  S+   +   +L    G+  HFF           ++IG 
Sbjct: 361 VPHSEAIIHETPNRKVGTARQPSSPAPTGAAILCRGVGAPRHFF---------ITMNIGD 411

Query: 119 PNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
           P  S+ + +D GS L W+ C   C  C  +    Y    + L             V+C+ 
Sbjct: 412 PAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKL-------------VTCAD 458

Query: 177 PLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
            LC    +       C S K  C Y+  Y   D+SS G LV D      FS  A   +  
Sbjct: 459 SLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--DSSSMGVLVID-----RFSLSASNGTNP 510

Query: 230 SSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSG 287
           +++  GCG  Q     +   P D ++GL  G V++ S L   G+I ++    C      G
Sbjct: 511 TTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGG 570

Query: 288 SVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFL 346
            +FFGD Q P +  + + +    KY +   G   +   +  ++ +    + DSGA++T+ 
Sbjct: 571 FLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYF 630

Query: 347 PTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYNASSEEMLKVPDMRLIF------ 395
             + Y   +      ++S+      ++ +  +   C+    ++++ + +++  F      
Sbjct: 631 AAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKG-KDKIVTIDEVKKCFRSLSLE 689

Query: 396 ----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIVF 445
                K  +  +    +     EG    CL ++    ++       +IG   M+   +++
Sbjct: 690 FADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKEHLSLAGTNLIGGITMLDQMVIY 747

Query: 446 DRENLKLAWSHSKCEEV 462
           D E   L W + +C+ +
Sbjct: 748 DSERSLLGWVNYQCDRI 764



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 70/318 (22%), Positives = 118/318 (37%), Gaps = 31/318 (9%)

Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ-TGSYLDGAAP 250
           C Y   Y+ +  S+ G L+ D   L       P+ + + ++  GCG  Q  G      +P
Sbjct: 29  CDYEIKYA-DGASTIGALIVDQFSL-------PRIATRPNLPFGCGYNQGIGENFQQTSP 80

Query: 251 -DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIG 308
            +G++GL  G VS  S L   G+I ++    C      G +F GD         + + + 
Sbjct: 81  VNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGGGGLLFVGDG------DGNLVLLH 134

Query: 309 EKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
             Y  Y  G  +       L  +    + DSG+++T+   + Y   V      +SS  + 
Sbjct: 135 ANY--YSPGSATLYFDRHSLGMNPMDVVFDSGSTYTYFTAQPYQATVYAIKGGLSSTSLE 192

Query: 369 -LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLT 422
            +   S   C+    +    V D++  F   Q     N +   P      V      CL 
Sbjct: 193 QVSDPSLPLCWKGQ-KAFESVFDVKKEFKSLQLNFGNNAVMEIPPENYLIVTEYGNVCLG 251

Query: 423 VM-STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNP 481
           ++     ++ IIG   M    +++D E  +L W    C    D S       P+ +    
Sbjct: 252 ILHGCRLNFNIIGDITMQDQMVIYDNEREQLGWIRGSC----DGSQEAPTQAPSAEEVVG 307

Query: 482 LPTTEQQSTSNGQAAAPP 499
                + S + G   APP
Sbjct: 308 AAARREASQATGSYLAPP 325


>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 407

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 99/394 (25%), Positives = 158/394 (40%), Gaps = 65/394 (16%)

Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN +   +Y+  + IG P  ++ + +D GS+L WV C   C  C         +L R+  
Sbjct: 40  GNVYPLGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGC---------TLPRD-R 89

Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
           +Y P  +     V C  PLC +  S     C +  + C Y  +Y+ +  SS G LV DI+
Sbjct: 90  QYKPHGNL----VKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYA-DQGSSLGVLVRDII 144

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
            L    K    +   S +  GCG  QT   +    +  GV+GLG G  S+ S L   GLI
Sbjct: 145 PL----KLTNGTLTHSMLAFGCGYDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGLI 200

Query: 274 QNSFSICFDENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
           +N    C      G +FFGDQ          P  Q S+S L        Y  G       
Sbjct: 201 RNVVGHCLSGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLL------KHYKTGPADMFFN 254

Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNASS 382
               +  G +   DSG+S+T+  +  +  +V      +  K +S   +  S   C+    
Sbjct: 255 GKATSVKGLELTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLSRATEDPSLPICWKGPK 314

Query: 383 ------EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD---- 427
                 +       + L F+K+     +N +F  P      V      CL ++       
Sbjct: 315 PFKSLHDVTSNFKPLVLSFTKS-----KNSLFQVPPEAYLIVTKHGNVCLGILDGTEIGL 369

Query: 428 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           G+  IIG   +    +++D E  ++ W+ + C+ 
Sbjct: 370 GNTNIIGDISLQDKLVIYDNEKQRIGWASANCDR 403


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 157/389 (40%), Gaps = 70/389 (17%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP   F + +D GS+L W+ C  C+ C           D+    +DP++SSS +NV+C
Sbjct: 157 VGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FDQVGPVFDPAASSSYRNVTC 206

Query: 175 SHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH--APQ 225
               C         R+  +  +D CPY   Y  +  ++        L L SF+ +  AP 
Sbjct: 207 GDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGD------LALESFTVNLTAPG 260

Query: 226 SSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           +S +   V+ GCG    G +   A   G+    L   S   L A  G   ++FS C  ++
Sbjct: 261 ASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCLVDH 315

Query: 285 DS---GSVFFGDQGPATQQS-------TSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG- 332
            S     V FG+       +       T+F P     D  Y+V ++   +G   L  S  
Sbjct: 316 GSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSD 375

Query: 333 -----------FQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNA 380
                         ++DSG + ++     Y  +   F D++  S  +         CYN 
Sbjct: 376 TWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNV 435

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMST-DGDYGII 433
           S  +  +VP++ L+F+          ++ FP    F       + CL V+ T      II
Sbjct: 436 SGVDRPEVPELSLLFADGA-------VWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSII 488

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           G        +V+D +N +L ++  +C EV
Sbjct: 489 GNFQQQNFHVVYDLKNNRLGFAPRRCAEV 517


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/358 (23%), Positives = 140/358 (39%), Gaps = 31/358 (8%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP    LV  D GS+L WV     QC P    Y     ++   +DPS S++   V 
Sbjct: 142 VGLGTPKRDLLVVFDTGSDLSWV-----QCKPCDGCY----QQHDPLFDPSQSTTYSAVP 192

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C    C+   S       C Y   Y  + + + G L  D L L   S  +    +Q   +
Sbjct: 193 CGAQECRRLDSGSCSSGKCRYEVVYG-DMSQTDGNLARDTLTLGPSSSSSSSDQLQ-EFV 250

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGLIQNSFSICFDENDS--GSVF 290
            GCG   TG +      DG+ GLG   VS+ S   AK G     FS C   + +  G + 
Sbjct: 251 FGCGDDDTGLF---GKADGLFGLGRDRVSLASQAAAKYGA---GFSYCLPSSSTAEGYLS 304

Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ---ALVDSGASFTF 345
            G   P   + T+ +   +    Y++ +    +    +  S   F+    ++DSG   T 
Sbjct: 305 LGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITR 364

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQG----NSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
           LP+  YA +   F  L+  +R S +     +    CY+ +    +++P + L+F    + 
Sbjct: 365 LPSRAYAALRSSFAGLM--RRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATL 422

Query: 402 VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            +      +  N+            D    I+G        +V+D  N K+ +    C
Sbjct: 423 NLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 92/376 (24%), Positives = 155/376 (41%), Gaps = 44/376 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD-RNLSEYDPSSSSSSKNV 172
             +GTP+  F++  D GS+L W+ C+   C   + S   +   R+   +  + SSS K +
Sbjct: 16  FKVGTPSQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRHKRVFHANLSSSFKTI 74

Query: 173 SCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHAP 224
            C   +CK       S ++C +   PC Y  DY   D S++ G+  ++ + +    K   
Sbjct: 75  PCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFFANETVTVE--LKEGR 130

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
           +  +  +V+IGC     G     A  DGVMGLG    S    +  A      FS C    
Sbjct: 131 KMKLH-NVLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--IKAAEKFGGKFSYCLVDH 185

Query: 282 --DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 328
              +N S  + FG     +      + + L +G     Y V +    IG + L       
Sbjct: 186 LSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVW 245

Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISLQGNSWKYCYNASSEEML 386
             +     ++DSG+S TFL    Y  V+      L+  +++ +     +YC+N++  E  
Sbjct: 246 DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEES 305

Query: 387 KVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD-GDYGIIGQNFMMGHRI 443
            VP +   F+    F   V++++ S  +     V CL  +S       ++G      H  
Sbjct: 306 LVPRLVFHFADGAEFEPPVKSYVISAADG----VRCLGFVSVAWPGTSVVGNIMQQNHLW 361

Query: 444 VFDRENLKLAWSHSKC 459
            FD    KL ++ S C
Sbjct: 362 EFDLGLKKLGFAPSSC 377


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 151/367 (41%), Gaps = 54/367 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I +G P   F +  D  ++  W+ CQ CI+C           D+  S +DPS SSS   +
Sbjct: 191 IGVGGPPQKFYMIFDLQTDFTWLQCQPCIKC----------YDQPDSIFDPSQSSSYTLL 240

Query: 173 SCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           SC    C     SSC S    C Y   Y  + T++ G L+++ +   S       S    
Sbjct: 241 SCETKHCNLLPNSSC-SDDGYCRYNITYK-DGTNTEGVLINETVSFES-------SGWVD 291

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--- 287
            V +GC  K  G ++     DG  GLG G +S PS +  +     S S C  E+  G   
Sbjct: 292 RVSLGCSNKNQGPFV---GSDGTFGLGRGSLSFPSRINAS-----SMSYCLVESKDGYSS 343

Query: 288 -SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG-------NSCLTQSGFQ---AL 336
            ++ F     +       L   +  + Y+VG++   +G       NS  T   +     +
Sbjct: 344 STLEFNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMI 403

Query: 337 VDSGASFTFLPTEIYAEV----VVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
           V S +  T L  + Y  V    V K   L   K   LQ ++   CYN SS   +++P + 
Sbjct: 404 VSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAF-LQFDT---CYNLSSNNTVELPILE 459

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
              +  +S+++    + +  ++  T FC     + G + I+G     G R+ FD  N   
Sbjct: 460 FEVNDGKSWLLPKESYLYAVDKNGT-FCFAFAPSKGSFSILGTLQQYGTRVTFDLVN-SF 517

Query: 453 AWSHSKC 459
            + H+ C
Sbjct: 518 VYLHTLC 524


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 156/364 (42%), Gaps = 60/364 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP       LD GS  +W  C  C+ C           ++    +DPS SS+ K +
Sbjct: 69  LQIGTPPFEIEAVLDTGSEHIWTQCLPCVHC----------YNQTAPIFDPSKSSTFKEI 118

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
                       C +    CPY   Y  + + + G LV + + + S S    Q  V    
Sbjct: 119 -----------RCDTHDHSCPYELVYGGK-SYTKGTLVTETVTIHSTSG---QPFVMPET 163

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
           IIGCGR  +G +  G A  GV+GL  G  S+  +    G      S CF    +  + FG
Sbjct: 164 IIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYCFAGKGTSKINFG 218

Query: 293 DQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQAL-----VDSGA 341
                 G     +T F+    K   Y++ +++  +GN+ +   G  F AL     +DSG+
Sbjct: 219 ANAIVAGDGVVSTTVFVKTA-KPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGS 277

Query: 342 SFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
           + T+ P E Y  +V K  +++V++ R      S   CY + + ++  V  M   FS    
Sbjct: 278 TLTYFP-ESYCNLVRKAVEQVVTAVRFP---RSDILCYYSKTIDIFPVITMH--FSGGAD 331

Query: 401 FVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIG----QNFMMGHRIVFDRENLKLAWS 455
            V+  +      N G  VFCL ++ ++  +  I G     NF++G    +D  +L +++ 
Sbjct: 332 LVLDKYNMYVASNTG-GVFCLAIICNSPIEEAIFGNRAQNNFLVG----YDSSSLLVSFK 386

Query: 456 HSKC 459
            + C
Sbjct: 387 PTNC 390


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 87/383 (22%), Positives = 155/383 (40%), Gaps = 57/383 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP   F + LD GS+L W+  QC+ C       Y    +N   YDP +S+S KN++C+
Sbjct: 166 VGTPPKHFSLILDTGSDLNWL--QCLPC-------YDCFHQNGMFYDPKTSASFKNITCN 216

Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
            P C   SS      C+S    CPY   Y     ++  + V+      + ++        
Sbjct: 217 DPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKV 276

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
            +++ GCG    G +   +   G+    L   S         L  +SFS C      + N
Sbjct: 277 GNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVDRNSNTN 331

Query: 285 DSGSVFFGDQGPATQQS----TSFLPIGEK--YDAYFVGVESYCIGNSCL---------- 328
            S  + FG+       +    TSF+   E      Y++ ++S  +G   L          
Sbjct: 332 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNIS 391

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS--SEEM 385
           +      ++DSG + ++     Y  +  KF +K+  +  I         C+N S   E  
Sbjct: 392 SDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENN 451

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLTVMST-DGDYGIIGQNFMM 439
           + +P++ + F       V   +++FP    F      + CL ++ T    + IIG     
Sbjct: 452 IHLPELGIAF-------VDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQ 504

Query: 440 GHRIVFDRENLKLAWSHSKCEEV 462
              I++D +  +L ++ +KC ++
Sbjct: 505 NFHILYDTKRSRLGFTPTKCADI 527


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 156/364 (42%), Gaps = 60/364 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP       LD GS  +W  C  C+ C           ++    +DPS SS+ K +
Sbjct: 63  LQIGTPPFEIEAVLDTGSEHIWTQCLPCVHC----------YNQTAPIFDPSKSSTFKEI 112

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
                       C +    CPY   Y  + + + G LV + + + S S    Q  V    
Sbjct: 113 -----------RCDTHDHSCPYELVYGGK-SYTKGTLVTETVTIHSTSG---QPFVMPET 157

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
           IIGCGR  +G +  G A  GV+GL  G  S+  +    G      S CF    +  + FG
Sbjct: 158 IIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYCFAGKGTSKINFG 212

Query: 293 DQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQAL-----VDSGA 341
                 G     +T F+    K   Y++ +++  +GN+ +   G  F AL     +DSG+
Sbjct: 213 ANAIVAGDGVVSTTVFVKTA-KPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGS 271

Query: 342 SFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
           + T+ P E Y  +V K  +++V++ R      S   CY + + ++  V  M   FS    
Sbjct: 272 TLTYFP-ESYCNLVRKAVEQVVTAVRFP---RSDILCYYSKTIDIFPVITMH--FSGGAD 325

Query: 401 FVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIG----QNFMMGHRIVFDRENLKLAWS 455
            V+  +      N G  VFCL ++ ++  +  I G     NF++G    +D  +L +++ 
Sbjct: 326 LVLDKYNMYVASNTG-GVFCLAIICNSPIEEAIFGNRAQNNFLVG----YDSSSLLVSFK 380

Query: 456 HSKC 459
            + C
Sbjct: 381 PTNC 384


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 88/403 (21%), Positives = 161/403 (39%), Gaps = 74/403 (18%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEY 161
           Y  + T +  GTP  +  +  D GS+L+W PC     C +C+      +  +D   +  +
Sbjct: 78  YGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS------FPKIDPTGIPRF 131

Query: 162 DPSSSSSSKNVSCSHPLCK------SRSSCKSLK-------DPCP-YIADYSTEDTSSSG 207
            P  SSSSK V C +P C        +S C+S           CP Y+  Y +  T  +G
Sbjct: 132 VPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGST--AG 189

Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
            L+ + L      K  P      + ++GC      S+L    P G+ G G G  S+PS +
Sbjct: 190 LLLSETLDFP--DKKIP------NFVVGC------SFLSIHQPSGIAGFGRGSESLPSQM 235

Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--------YDAYFVGVE 319
                     S  FD++        D         ++ P  +          + Y++ + 
Sbjct: 236 GLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIR 295

Query: 320 SYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRI 367
              +GN  +                +++DSG++FTF+   +   V  +F+K ++  ++  
Sbjct: 296 KIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRAT 355

Query: 368 SLQG-NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMS 425
            ++     + C++ S E+ +K P++   F     + +  N+ F+   + G  V CLTV++
Sbjct: 356 DVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSG--VACLTVVT 413

Query: 426 TDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              + G         I+G        + +D  N +L +    C
Sbjct: 414 HQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
          Length = 423

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 86/404 (21%), Positives = 160/404 (39%), Gaps = 45/404 (11%)

Query: 93  PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
           PS        GN +   H+   ++IG P   + + +D GS L W+ C   CI C    + 
Sbjct: 20  PSSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSL 79

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCK-SLKDPCPYIADYSTEDT 203
           +Y  L  +   +          V C+   C       R   K   K+ C Y   Y     
Sbjct: 80  FYPRLIGSFVPHGLYKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GG 137

Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVS 262
           SS G L+ D     SFS  A   +  +S+  GCG  Q  +  +   P +G++GLG G V+
Sbjct: 138 SSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVT 192

Query: 263 VPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY--FVGVE 319
           + S L   G+I ++    C      G +FFGD    T   T + P+  ++  Y    G  
Sbjct: 193 LLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTL 251

Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSW 374
            +   +  ++ +  + + DSGA++T+   + Y   +      +S +      +  +  + 
Sbjct: 252 QFNSNSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRAL 311

Query: 375 KYCYNASSEEMLKVPDMRLIFS----------KNQSFVVRNHIFSFPENEGFTVFCLTVM 424
             C+    +++  + +++  F           K  +  +    +     EG    CL ++
Sbjct: 312 TVCWKG-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHV--CLGIL 368

Query: 425 STDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
               ++       +IG   M+   +++D E   L W + +C+ +
Sbjct: 369 DGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 412


>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
 gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
          Length = 408

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/389 (22%), Positives = 155/389 (39%), Gaps = 68/389 (17%)

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y  ++IG P   + + +D GS+  W+ C      C  C  +    Y    + L       
Sbjct: 40  YVTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPLYRLTRKKL------- 92

Query: 166 SSSSKNVSCSHPLCK-------SRSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLA 217
                 V C+ PLC        +   C  + K+ C Y   Y    +S    L+D      
Sbjct: 93  ------VPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLD------ 140

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLI 273
              K +  +    ++  GCG  Q       A      DG++GLG G V + S L  +G +
Sbjct: 141 ---KFSLPTGGARNIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAV 197

Query: 274 -QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI-----GEKYDAYFVGVESYCIGNSC 327
            +N    C      G +F G++   +   T ++P+     GE  + Y  G  +  + ++ 
Sbjct: 198 SKNVIGHCLSSKGGGYLFIGEENVPSSHVT-WVPMAPTTPGEP-NHYSPGQATLHLDSNP 255

Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
           +     +A+ DSG+++T+LP  ++A+       LVS+ + SL  +S K   + +     K
Sbjct: 256 IGTKPLKAIFDSGSTYTYLPENLHAQ-------LVSALKASLSKSSLKQVSDPALPLCWK 308

Query: 388 VPD-MRLIFSKNQSFV--------VRNHIFSFPEN----EGFTVFCLTVMSTDG-DYGII 433
            P   + +    + F         +   +   PEN     G    C  ++   G D  II
Sbjct: 309 GPKPFKTVHDTPKEFKSLVTLKFDLGVTMIIPPENYLIITGHGNACFGILDMPGLDQYII 368

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           G   M    +++D E  +LAW  S C+++
Sbjct: 369 GDITMQEQLVIYDNEKGRLAWMPSPCDKI 397


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 108/445 (24%), Positives = 171/445 (38%), Gaps = 68/445 (15%)

Query: 27  KLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSS 86
           +L HR          ++   V  AD    +  VEY++  +S    R      LQ     S
Sbjct: 76  RLAHRCGPSTASASFAE---VQRAD----EQRVEYIQRRVSGGGARGAK-GALQQLATGS 127

Query: 87  RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPL 146
           R+  +  + G  T        + +   + +GTP VS  V +D GS++ WV     QC P 
Sbjct: 128 RSATVPTTMGVGT--------FQYVVTVSLGTPGVSQTVEVDTGSDVSWV-----QCKPC 174

Query: 147 SASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTED 202
           SA    S    L  +DP+ SS+   V C    C       + C   +  C Y+  Y  + 
Sbjct: 175 SAPACNSQRDQL--FDPAKSSTYSAVPCGADACSELRIYEAGCSGSQ--CGYVVSYG-DG 229

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           ++++G    D L L      AP ++V  + + GCG  Q G +   A  DG++ LG   +S
Sbjct: 230 SNTTGVYGSDTLAL------APGNTV-GTFLFGCGHAQAGMF---AGIDGLLALGRQSMS 279

Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAYFVGVE 319
           + S    AG     FS C     S + +    GP++     +T  L        Y V + 
Sbjct: 280 LKS--QAAGAYGGVFSYCLPSKQSAAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLT 337

Query: 320 SYCIGNS--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNS- 373
              +G     +  S F    +VD+G   T LP   YA +   F   ++     S   N  
Sbjct: 338 GISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGI 397

Query: 374 WKYCYNASSEEMLKVPDMRLIFSKNQSF------VVRNHIFSFPENEGFTVFCLTVMSTD 427
              CY+ S   ++ +P + L FS   +       ++ +   +F  N G           D
Sbjct: 398 LDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGILSSGCLAFAPNGG-----------D 446

Query: 428 GDYGIIGQNFMMGHRIVFDRENLKL 452
           GD  I+G        + FD   +  
Sbjct: 447 GDAAILGNVQQRSFAVRFDGSTVGF 471


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 88/403 (21%), Positives = 161/403 (39%), Gaps = 74/403 (18%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEY 161
           Y  + T +  GTP  +  +  D GS+L+W PC     C +C+      +  +D   +  +
Sbjct: 78  YGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS------FPKIDPTGIPRF 131

Query: 162 DPSSSSSSKNVSCSHPLCK------SRSSCKSLK-------DPCP-YIADYSTEDTSSSG 207
            P  SSSSK V C +P C        +S C+S           CP Y+  Y +  T  +G
Sbjct: 132 VPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGST--AG 189

Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
            L+ + L      K  P      + ++GC      S+L    P G+ G G G  S+PS +
Sbjct: 190 LLLSETLDFP--DKXIP------NFVVGC------SFLSIHQPSGIAGFGRGSESLPSQM 235

Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--------YDAYFVGVE 319
                     S  FD++        D         ++ P  +          + Y++ + 
Sbjct: 236 GLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIR 295

Query: 320 SYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRI 367
              +GN  +                +++DSG++FTF+   +   V  +F+K ++  ++  
Sbjct: 296 KIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRAT 355

Query: 368 SLQG-NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMS 425
            ++     + C++ S E+ +K P++   F     + +  N+ F+   + G  V CLTV++
Sbjct: 356 DVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSG--VACLTVVT 413

Query: 426 TDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              + G         I+G        + +D  N +L +    C
Sbjct: 414 HQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 97/401 (24%), Positives = 158/401 (39%), Gaps = 88/401 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +  GTP  +    +D GS+ +W PC     C  C         S    +S + P  SSSS
Sbjct: 81  LSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNC---------SFTSRISPFLPKHSSSS 131

Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPC-----PYIADYSTEDTSSSGYLVDDILHLA 217
           K + C +P C          + C +    C     PY+  Y +  T   G  + + LHL 
Sbjct: 132 KIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTT--GGVALSETLHLH 189

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
                     +  + ++GC      S      P G+ G G G  S+PS L   GL +  F
Sbjct: 190 GL--------IVPNFLVGC------SVFSSRQPAGIAGFGRGPSSLPSQL---GLTK--F 230

Query: 278 SICF------DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDA----YFVGVES 320
           S C       D  +S S+    Q  + +++ + +       P  +   A    Y+V +  
Sbjct: 231 SYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRR 290

Query: 321 YCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
             IG   +                 ++DSG +FT++ TE +  +  +F   V +   +L 
Sbjct: 291 ISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALM 350

Query: 371 GNS---WKYCYNASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEGFTVFCLTVMS 425
             +    K C+N S  + L++P +RL F    +    + N+       E   V C TV+ 
Sbjct: 351 VEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSRE---VACFTVV- 406

Query: 426 TDGDY-----GIIGQNFMMGHRIV-FDRENLKLAWSHSKCE 460
           TDG       G+I  NF M +  V +D +N +L +    C+
Sbjct: 407 TDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESCK 447


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 116/462 (25%), Positives = 173/462 (37%), Gaps = 90/462 (19%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRN--QLLFP-SEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
           LL +   R  +R + Q      RN  Q+  P S GS         Y L +T       +V
Sbjct: 45  LLKSTSSRSASRFQHQHQKRHLRNRHQVSLPLSPGSD--------YTLSFTLNSNPPQHV 96

Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS 181
           S    LD GS+L+W PC+  +C         + +   S   P  SS++++V C    C +
Sbjct: 97  SLY--LDTGSDLVWFPCKPFECILCEGK---AENTTASTPPPRLSSTARSVHCKSSACSA 151

Query: 182 RSSCKSLKDPCPYIADYSTEDTSSS----------------GYLVDDILHLASFSKHAPQ 225
             S     D C  IAD   E   +S                G LV  + H +     A  
Sbjct: 152 AHSNLPTSDLCA-IADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATP 210

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDEN 284
           S    +   GC           A P GV G G G +S+P+ LA  A  + N FS C   +
Sbjct: 211 SLSLHNFTFGCAHTAL------AEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSH 264

Query: 285 DSGS--------VFFGDQGPATQQS---------TSFLPIGEKYDAYFVGVESYCIGNSC 327
              S        +  G      ++          TS L   +    Y VG+E   IG   
Sbjct: 265 SFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKK 324

Query: 328 LTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVS-----SKRISLQGN 372
           +    F            +VDSG +FT LP  +Y  VV +FD  V      +K +     
Sbjct: 325 IPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVE-DKT 383

Query: 373 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPE-----NEGFTVFCLTVM 424
               CY    + ++ +P + L F  N+S VV   +N+ + F +          V CL +M
Sbjct: 384 GLGPCYYY--DTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLM 441

Query: 425 S-------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +       T G    +G     G  +V+D E  ++ ++  KC
Sbjct: 442 NGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 483


>gi|281200780|gb|EFA74998.1| putative aspartyl protease [Polysphondylium pallidum PN500]
          Length = 394

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 97/370 (26%), Positives = 165/370 (44%), Gaps = 60/370 (16%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           T I +G  N +F V +D GS+L+ +P   + C        T  DR    YDP+ S  SK 
Sbjct: 43  TKIIVG--NHTFTVQVDTGSSLMAIPM--VNCN-------TCHDR--PSYDPTHSQYSKV 89

Query: 172 VSC----------SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           VSC          + P CK+R+     +D C ++  Y  + +  SG +  D+++L+  S 
Sbjct: 90  VSCFSEHCLGSGSAPPQCKNRA-----EDDCDFVILYG-DGSRVSGKIYQDVVNLSGLSG 143

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG-DVSVPSL---LAKAGLIQNSF 277
            A           G  R +TG + +    DG++G G      VP++   L +A  ++N F
Sbjct: 144 IAN---------FGANRIETGDF-EYPRADGIVGFGRSCKTCVPTVFESLVQAHGLKNIF 193

Query: 278 SICFDENDSGSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS--GFQ 334
           ++  D    G++  G+  P+       + P+ E    Y +   ++ + ++ +     G Q
Sbjct: 194 AMSMDYEGRGTLSLGELNPSNHIGEIQYTPLFEDGPFYNIKPTNFKVDDTVILPRLLGRQ 253

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDK-------LVSSKRISLQGNSWKYCYNASSEEMLK 387
            +VDSG+S   L +  Y  +V  F K       +  S  I L G+    CYN++S   L 
Sbjct: 254 VIVDSGSSALSLASGAYDALVHHFRKNYCHVAGICDSPSI-LDGS---ICYNSASSLDL- 308

Query: 388 VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
           +P + L F       V  +N++   P   G + +C  +   D    I+G  FM G+  VF
Sbjct: 309 LPTIYLTFEGGVKVAVPPKNYLTKAPLTNGASGYCWMIDRADPSTTILGDVFMRGYYTVF 368

Query: 446 DRENLKLAWS 455
           D E  ++ ++
Sbjct: 369 DNEEKRIGFA 378


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 86/352 (24%), Positives = 149/352 (42%), Gaps = 46/352 (13%)

Query: 65  LLSNDWKRQK---TRVKLQSNNNSSRNQL---LFPSEGSQTHFFGNQFYWLHYTWIDIGT 118
           +L+ D +R K   +R+      +SS ++L     P++       GN     ++  + +GT
Sbjct: 99  ILNQDKERVKYINSRISKNLGQDSSVSELDSVTLPAKSGSLIGSGN-----YFVVVGLGT 153

Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
           P     +  D GS+L W      QC P + S Y   D   + +DPS S+S  N++C+  L
Sbjct: 154 PKRDLSLIFDTGSDLTWT-----QCEPCARSCYKQQD---AIFDPSKSTSYSNITCTSTL 205

Query: 179 CKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           C   S+       C +    C Y   Y  + + S GY   + L + +       + +  +
Sbjct: 206 CTQLSTATGNEPGCSASTKACIYGIQYG-DSSFSVGYFSRERLSVTA-------TDIVDN 257

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSV 289
            + GCG+   G +  G+A  G++GLG   +S   +   A + +  FS C     S  G +
Sbjct: 258 FLFGCGQNNQGLF-GGSA--GLIGLGRHPISF--VQQTAAVYRKIFSYCLPATSSSTGRL 312

Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----TQSGFQALVDSGASFT 344
            FG    +  + T F  I      Y + +    +G + L     T S   A++DSG   T
Sbjct: 313 SFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGAIIDSGTVIT 372

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
            LP   Y  +   F + +S    + + +    CY+ S  E+  +P +   F+
Sbjct: 373 RLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSFA 424


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 86/365 (23%), Positives = 135/365 (36%), Gaps = 49/365 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   F V  D GS+  WV     QC P  A  Y   +     +DP+ S++  N+S
Sbjct: 100 VRLGTPAERFTVVFDTGSDTTWV-----QCQPCVAYCYRQKE---PLFDPTKSATYANIS 151

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           CS   C            C Y   Y  + + + G+   D L LA             +  
Sbjct: 152 CSSSYCSDLYVSGCSGGHCLYGIQYG-DGSYTIGFYAQDTLTLA--------YDTIKNFR 202

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
            GCG K  G +   A   G++GLG G  S+P     K G +   F+ C     +G+ F  
Sbjct: 203 FGCGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDKYGGV---FAYCLPATSAGTGFLD 256

Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG-----FQALVDSGASFTF 345
            G   PA     + + +      Y+VG+    +G   L   G        LVDSG   T 
Sbjct: 257 LGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITR 316

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWK---------YCYNASSEE--MLKVPDMRLI 394
           LP   YA +   F K       ++QG  +           CY+ +  +   + +P + L+
Sbjct: 317 LPPSAYAPLRSAFSK-------AMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLV 369

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
           F       V      +  +           + D D  I+G      H +++D     + +
Sbjct: 370 FQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGF 429

Query: 455 SHSKC 459
           +   C
Sbjct: 430 APGAC 434


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 92/396 (23%), Positives = 158/396 (39%), Gaps = 68/396 (17%)

Query: 127 LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
           LD GS+L+W PC    C  L     T    N S       + S+ + C+ P C +  S  
Sbjct: 102 LDTGSDLVWFPCAPFTCM-LCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHSSA 160

Query: 187 SLKDPCPY----IADYSTEDTSSSG------YLVDDILHLASFSKHAPQSSVQSSVII-- 234
              D C      + D  T   ++S       Y   D   +A   +   +  + +SV +  
Sbjct: 161 PPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLVARLRRG--RVGIAASVAVEN 218

Query: 235 ---GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------ 285
               C     G       P GV G G G +S+P+ LA A L    FS C   +       
Sbjct: 219 FTFACAHTALGE------PVGVAGFGRGPLSLPAQLAPAAL-SGRFSYCLVAHSFRADRP 271

Query: 286 --SGSVFFG---DQGPATQQSTSFLPI--GEKYDAYF-VGVESYCIGNSCLT-------- 329
                +  G    + PA++    + P+    K+  ++ V +E+  +G + +         
Sbjct: 272 IRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRV 331

Query: 330 -QSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---------WKYCY 378
            ++G   +V DSG +FT LP E YA V  +F + +++ R      +         + Y +
Sbjct: 332 GRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDH 391

Query: 379 NASSEE---MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMS-----TDG 428
           +AS+ E      VP + + F    + V+  RN+   F   E   V CL +M+       G
Sbjct: 392 DASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGGG 451

Query: 429 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 464
             G +G     G  +V+D +  ++ ++  +C ++ D
Sbjct: 452 PAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDLWD 487


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 102/416 (24%), Positives = 161/416 (38%), Gaps = 61/416 (14%)

Query: 56  KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
           +  VEY++  +S    R      LQ     SR+  +  + G  T        + +   + 
Sbjct: 98  EQRVEYIQRRVSGGGARGAK-GALQQLATGSRSATVPTTMGVGT--------FQYVVTVS 148

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP VS  V +D GS++ WV     QC P SA    S    L  +DP+ SS+   V C 
Sbjct: 149 LGTPGVSQTVEVDTGSDVSWV-----QCKPCSAPACNSQRDQL--FDPAKSSTYSAVPCG 201

Query: 176 HPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
              C       + C   +  C Y+  Y  + ++++G    D L L      AP ++V  +
Sbjct: 202 ADACSELRIYEAGCSGSQ--CGYVVSYG-DGSNTTGVYGSDTLAL------APGNTV-GT 251

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFF 291
            + GCG  Q G +   A  DG++ LG   +S+ S    AG     FS C     S + + 
Sbjct: 252 FLFGCGHAQAGMF---AGIDGLLALGRQSMSLKS--QAAGAYGGVFSYCLPSKQSAAGYL 306

Query: 292 GDQGPATQQ---STSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQA--LVDSGASFT 344
              GP +     +T  L        Y V +    +G     +  S F    +VD+G   T
Sbjct: 307 TLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVIT 366

Query: 345 FLPTEIYAEVVVKFDKLVSSKRI-SLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSF- 401
            LP   YA +   F   ++     S   N     CY+ S   ++ +P + L FS   +  
Sbjct: 367 RLPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLA 426

Query: 402 -----VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
                ++ +   +F  N G           DGD  I+G        + FD   +  
Sbjct: 427 LEAPGILSSGCLAFAPNGG-----------DGDAAILGNVQQRSFAVRFDGSTVGF 471


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 90/323 (27%), Positives = 137/323 (42%), Gaps = 54/323 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           + +GTP     V LD GS+L WVPC    QC  C     S   S    ++ + P +SSSS
Sbjct: 95  VSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNC-----SSSPSAMSAMAVFHPKNSSSS 149

Query: 170 KNVSCSHPLC-----KSRSSCKSL-----KDPC-PYIADYSTEDTSSSGYLVDDILHLAS 218
           + V C +P C     KS S+C S       D C PY+  Y +  T  SG L+ D L L+ 
Sbjct: 150 RLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGST--SGLLISDTLRLSP 207

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
            S  +  +  + +  IGC             P G+ G G G  SVPS L          S
Sbjct: 208 SSSSSAPAPFR-NFAIGCSIVSVHQ-----PPSGLAGFGRGAPSVPSQLKVPKFSYCLLS 261

Query: 279 ICFDEND--SGSVFFGD-QGPATQQSTS--FLPI------GEKYDA-YFVGVESYCIGN- 325
             FD+N   SG +  GD   PA ++ T+  ++P+         Y   Y++ +    +G  
Sbjct: 262 RRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGK 321

Query: 326 -------SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK----RISLQGNSW 374
                  + +  SG  A++DSG +FT+L   ++  V    +  V  +    R        
Sbjct: 322 PVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGL 381

Query: 375 KYCY--NASSEEMLKVPDMRLIF 395
           + C+         +++PD+ L F
Sbjct: 382 RPCFALPPGPGGAMELPDLELKF 404


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 112/460 (24%), Positives = 185/460 (40%), Gaps = 75/460 (16%)

Query: 25  SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
           S  L+HR  D    R    + +  +  +      VEYL+  LS      +   ++ S   
Sbjct: 70  SLALLHR--DAVSGRTYPSTRHAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVSGI- 126

Query: 85  SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQC 143
                    SEGS  +F            + +G+P     + +D+GS+++W+ C+ C +C
Sbjct: 127 ---------SEGSGEYF----------VRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAEC 167

Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYS 199
                  Y   D     +DP++S+S   V C   +C++     S C      C Y   Y 
Sbjct: 168 -------YQQAD---PLFDPAASASFTAVPCDSGVCRTLPGGSSGCAD-SGACRYQVSYG 216

Query: 200 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 259
            + + + G L  + L   +F    P   VQ  V IGCG +  G ++  A   G++GLG G
Sbjct: 217 -DGSYTQGVLAMETL---TFGDSTP---VQ-GVAIGCGHRNRGLFVGAA---GLLGLGWG 265

Query: 260 DVSVPSLLAKAGLIQNSFSICF----DENDSGSVFFG--DQGPATQQSTSFLPIGEKYDA 313
            +S+   L  A     +FS C      +  +GS+ FG  D  P        L   ++   
Sbjct: 266 PMSLVGQLGGA--AGGAFSYCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSF 323

Query: 314 YFVGVESYCI---------GNSCLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVS 363
           Y+VG+    +         G   LT+ G   +V D+G + T LP + YA +   F   + 
Sbjct: 324 YYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIG 383

Query: 364 SKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFV---VRNHIFSFPENEGFTVF 419
                  G S    CY+ S    ++VP + L F ++ + +    RN +       G  V+
Sbjct: 384 GDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFGRDGAALTLPARNLLVEM----GGGVY 439

Query: 420 CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           CL   ++     I+G     G +I  D  N  + +  S C
Sbjct: 440 CLAFAASASGLSILGNIQQQGIQITVDSANGYVGFGPSTC 479


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 85/351 (24%), Positives = 140/351 (39%), Gaps = 46/351 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P     Y   ++    +DP+ SS+  NVS
Sbjct: 184 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQREK---LFDPARSSTYANVS 235

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C   +        C Y   Y  + + S G+   D L L+S+              
Sbjct: 236 CAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 287

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
            GCG +  G + + A   G++GLG G  S+P     K G +   F+ C     +G+ +  
Sbjct: 288 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLD 341

Query: 291 FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGA 341
           FG    A  ++    P+    G  +  Y+VG+    +G   L+  QS F     +VDSG 
Sbjct: 342 FGAGSLAAARARLTTPMLTENGPTF--YYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGT 399

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
             T LP   Y+ +   F   ++++       +SL       CY+ +    + +P + L+F
Sbjct: 400 VITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSL----LDTCYDFTGMSQVAIPTVSLLF 455

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
                  V      +  +              GD GI+G   +    + +D
Sbjct: 456 QGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYD 506


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 90/370 (24%), Positives = 144/370 (38%), Gaps = 50/370 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP     +  D GS+L W      QC P   S Y    +    +DPS+S +  N+S
Sbjct: 158 VGLGTPKKDLSLIFDTGSDLTWT-----QCQPCVKSCYA---QQQPIFDPSASKTYSNIS 209

Query: 174 CSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           C+   C    S       C S    C Y   Y  + + + G+   D L L        Q+
Sbjct: 210 CTSTACSGLKSATGNSPGCSS--SNCVYGIQYG-DSSFTVGFFAKDTLTLT-------QN 259

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
            V    + GCG+   G +   A   G++GLG   +S+    A+       FS C      
Sbjct: 260 DVFDGFMFGCGQNNRGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314

Query: 285 DSGSVFFGD-----QGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSG--FQ- 334
            +G + FG+        A +   +F P      A  YF+ V    +G   L+ S   FQ 
Sbjct: 315 SNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQN 374

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG   T LP+ +Y  +   F + +S    +   +    CY+ S+   + +P + 
Sbjct: 375 AGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKIS 434

Query: 393 LIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFDREN 449
             F+ N +  +  N I       G +  CL       D   GI G        +V+D   
Sbjct: 435 FNFNGNANVDLEPNGIL---ITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAG 491

Query: 450 LKLAWSHSKC 459
            +L + +  C
Sbjct: 492 GQLGFGYKGC 501


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 87/367 (23%), Positives = 146/367 (39%), Gaps = 43/367 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   F V +D GS+L WV     QC+P    Y     +N + + P++S+S   ++
Sbjct: 17  VRLGTPERVFSVIVDTGSDLTWV-----QCSPCGKCY----SQNDALFLPNTSTSFTKLA 67

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C   LC         +  C Y   Y  + + ++G  V D + +   +    Q     +  
Sbjct: 68  CGSALCNGLPFPMCNQTTCVYWYSYG-DGSLTTGDFVYDTITMDGINGQKQQV---PNFA 123

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGS 288
            GCG    GS+   A  DG++GLG G +S  S L    +    FS C  +       +  
Sbjct: 124 FGCGHDNEGSF---AGADGILGLGQGPLSFHSQLKS--VYNGKFSYCLVDWLAPPTQTSP 178

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQS----------GFQA 335
           + FGD          +LPI         Y+V +    +G++ L  S          G   
Sbjct: 179 LLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGT 238

Query: 336 LVDSGASFTFLPTEIYAEVVVKFD--KLVSSKRISLQGNSWKYCYNASSEEML-KVPDMR 392
           + DSG + T L    Y EV+   +   +  S++I    +    C +   ++ L  VP M 
Sbjct: 239 IFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKID-DISRLDLCLSGFPKDQLPTVPAMT 297

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             F      +  ++ F + E+     +C   M++  D  IIG       ++ +D    KL
Sbjct: 298 FHFEGGDMVLPPSNYFIYLESS--QSYCF-AMTSSPDVNIIGSVQQQNFQVYYDTAGRKL 354

Query: 453 AWSHSKC 459
            +    C
Sbjct: 355 GFVPKDC 361


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 94/377 (24%), Positives = 154/377 (40%), Gaps = 56/377 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP +S+   +D GS+L+W  C+ C+ C            ++   +DPSSSS+   V
Sbjct: 104 VAIGTPALSYAAIVDTGSDLVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATV 153

Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            CS  LC     S+C S    C Y   Y  + +S+ G L  +   L    K  P      
Sbjct: 154 PCSSALCSDLPTSTCTSASK-CGYTYTYG-DASSTQGVLASETFTLGKEKKKLP------ 205

Query: 231 SVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS- 288
            V  GCG    G  +  GA   G++GLG G +   SL+++ GL  + FS C    D G  
Sbjct: 206 GVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDGDG 257

Query: 289 ---VFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ- 334
              +  G    A          Q+T  +    +   Y+V +    +G++ +T   S F  
Sbjct: 258 KSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAI 317

Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN--ASSEEM 385
                   +VDSG S T+L  + Y  +   F   ++   +         C+   A   + 
Sbjct: 318 QDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGLDLCFQGPAKGVDE 377

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
           ++VP + L F       +    +   ++      CLTV  + G   IIG       + V+
Sbjct: 378 VQVPKLVLHFDGGADLDLPAENYMVLDSAS-GALCLTVAPSRG-LSIIGNFQQQNFQFVY 435

Query: 446 DRENLKLAWSHSKCEEV 462
           D     L+++  +C ++
Sbjct: 436 DVAGDTLSFAPVQCNKL 452


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 96/386 (24%), Positives = 158/386 (40%), Gaps = 57/386 (14%)

Query: 103 GNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
           G +   L+Y   + IG  N + +V  D GS+L WV     QC P    Y    ++    +
Sbjct: 137 GARLQTLNYIVTVGIGGQNSTLIV--DTGSDLTWV-----QCLPCRLCY----NQQEPLF 185

Query: 162 DPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 213
           +PS+SSS  ++ C+ P C        S   C +     C Y  DY  + + S G L    
Sbjct: 186 NPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYG-DGSYSRGEL---- 240

Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
                F K     +   + I GCGR   G +       G+MGL   ++S+ S    + L 
Sbjct: 241 ----GFEKLTLGKTEIDNFIFGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLF 291

Query: 274 QNSFSICFDE---NDSGSVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESYC 322
            + FS C        SGS+  G    +  ++ S  PI         +  + YF+ +    
Sbjct: 292 GSVFSYCLPTTGVGSSGSLTLGGADFSNFKNIS--PISYTRMIQNPQMSNFYFLNLTGIS 349

Query: 323 IGNSCL------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
           IG   L      +  G  +L+DSG   T L   IY     +F+K  S  R +   +    
Sbjct: 350 IGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNT 409

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMST--DGDYGII 433
           C+N +  E + +P ++ IF  N   +V    +F F +++   + CL   S   +    II
Sbjct: 410 CFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTMII 468

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKC 459
           G       R++++ +  K+ ++   C
Sbjct: 469 GNYQQKNQRVIYNSKESKVGFAGEPC 494


>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 362

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 91/202 (45%), Gaps = 35/202 (17%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNL------- 158
           Y+    WI  GTP   F + +D+GS + +VPC  C QC        +  D+ L       
Sbjct: 91  YYTTRLWI--GTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQVMLSSPKDQILCLVSCKV 148

Query: 159 -------------SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS 205
                         ++ P  SS+ + V C+        +C   K+ C Y  +Y+ E +SS
Sbjct: 149 QIFKISYGLFDEDPKFQPELSSTYQPVKCNM-----DCNCDDDKEQCVYEREYA-EHSSS 202

Query: 206 SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 265
            G L +D++   + S   PQ +V      GC   +TG      A DG++GLG GD+S+  
Sbjct: 203 KGVLGEDLISFGNESHLTPQRAV-----FGCKTVETGDLYSQRA-DGIIGLGQGDLSLVG 256

Query: 266 LLAKAGLIQNSFSICFDENDSG 287
            L   GLI NSF +C+   D G
Sbjct: 257 QLVDKGLISNSFGLCYGGLDVG 278


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 97/407 (23%), Positives = 165/407 (40%), Gaps = 93/407 (22%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++ GTP+ +F   LD GS L+W+PC     C +C   S         N  ++ P +SSSS
Sbjct: 90  LEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFS---------NTPKFIPKNSSSS 140

Query: 170 KNVSCSHPLC--------------KSRSSCKSLKDPCP-YIADYSTEDTSSSGYLVDDIL 214
           K V C++P C              + +++  +    CP Y   Y     S++G+L+ + L
Sbjct: 141 KFVGCTNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLG--STAGFLLSENL 198

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
           +          +   S  ++GC      S +    P G+ G G G+ S+PS   +  L +
Sbjct: 199 NFP--------TKKYSDFLLGC------SVVSVYQPAGIAGFGRGEESLPS---QMNLTR 241

Query: 275 NSFSICFDENDSGSVFFGDQGPATQQS----------TSFL--PIGEKYDA----YFVGV 318
            S+ +   + D  +    +    T  S          T FL  P  +K  A    Y++ +
Sbjct: 242 FSYCLLSHQFDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITL 301

Query: 319 ESYCIGNSCLT------------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 366
           +   +G   +               GF  +VDSG++FTF+   I+  V  +F K VS  R
Sbjct: 302 KRIVVGEKRVRVPRRLLEPNVDGDGGF--IVDSGSTFTFMERPIFDLVAQEFAKQVSYTR 359

Query: 367 ISLQGNSWKY--CYN-ASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEGFTVFCL 421
                  +    C+  A   E    P++R  F         V N+     + +   V CL
Sbjct: 360 AREAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGD---VACL 416

Query: 422 TVMSTD--GDYGIIGQNFMMGH------RIVFDRENLKLAWSHSKCE 460
           T++S D  G  G +G   ++G+       + +D EN +  +    C+
Sbjct: 417 TIVSDDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSCQ 463


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 91/357 (25%), Positives = 143/357 (40%), Gaps = 43/357 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP    L+ LD GS+++W     +QCAP    Y  S       +DP  S S 
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVW-----LQCAPCRQCYAQS----GRVFDPRRSRSY 192

Query: 170 KNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
             V C  P C          C   +  C Y   Y  + + ++G L  + L  A  ++  P
Sbjct: 193 AAVRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYG-DGSVTAGDLATETLWFARGAR-VP 250

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           +      V +GCG    G ++  A    ++GLG G +S+P+  A+       FS CF   
Sbjct: 251 R------VAVGCGHDNEGLFVAAAG---LLGLGRGRLSLPTQTAR--RYGRRFSYCF--- 296

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQALVDSGASF 343
                    QG      T    + +      V GV    +     T  G   ++DSG S 
Sbjct: 297 ---------QGSDLDHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRG-GVILDSGTSV 346

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
           T L   +Y  V   F       R++  G S +  CY+     ++KVP + +  +      
Sbjct: 347 TRLARPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVA 406

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +    +  P +   T FCL +  TDG   I+G     G R+VFD +  ++A     C
Sbjct: 407 LPPENYLIPVDTRGT-FCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 101/403 (25%), Positives = 158/403 (39%), Gaps = 78/403 (19%)

Query: 127 LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
           +D GS+L+W PC   +C      Y T+    LS   P + +SS +VSC  P C +  +  
Sbjct: 91  MDTGSDLVWFPCAPFECILCEGKYDTAATGGLS---PPNITSSASVSCKSPACSAAHTSL 147

Query: 187 SLKDPCPY----IADYSTEDTSS-----------SGYLVDDILHLASFSKHAPQSSVQSS 231
           S  D C      +    T D SS            G LV   L+  S S  A    V  +
Sbjct: 148 SSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVAR-LYRDSLSMPASSPLVLHN 206

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-----FD--- 282
              GC     G       P GV G G G +S+P+ LA  +  + N FS C     FD   
Sbjct: 207 FTFGCAHTALGE------PVGVAGFGRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDADR 260

Query: 283 --------------ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
                         +++       D+G      T+ L   +    Y VG+E   +GN  +
Sbjct: 261 VRRPSPLILGRYSLDDEKKKRVGHDRGEFVY--TAMLDNPKHPYFYCVGLEGITVGNRKI 318

Query: 329 ----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISL--QGNSW 374
                      +     +VDSG +FT LP  +Y  +V +F+  +    KR +   +    
Sbjct: 319 PVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGL 378

Query: 375 KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF-----PENEGFTVFCLTVMS-- 425
             CY  S +   KVP + L F  N + ++   N+ + F      + +   V CL +M+  
Sbjct: 379 GPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLMNGG 437

Query: 426 ----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 464
               + G    +G     G  +V+D E  ++ ++  KC  + D
Sbjct: 438 DEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKCALLWD 480


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 102/414 (24%), Positives = 168/414 (40%), Gaps = 73/414 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQC-----APLSASYYTSLDRNLSEYDP 163
           ++IGTP     V +D GS+L WVPC      C++C       L A++  S   +      
Sbjct: 86  LNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSYRASC 145

Query: 164 SS-------SSSSKNVSCSHPLCKSRSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILH 215
           +S       SS +   +C+   C   +  K+    PCP  A      T  +G +V  IL 
Sbjct: 146 ASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFA-----YTYGAGGVVTGILT 200

Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
             +   +     V   +   C      +Y +   P G+ G G G +   S++++ G +Q 
Sbjct: 201 RDTLRVNGSSPGVAKEIPKFCFGCVGSAYRE---PIGIAGFGRGTL---SMVSQLGFLQK 254

Query: 276 SFSICF-------DENDSGSVFFGDQGPATQQSTSFLPI--GEKY-DAYFVGVESYCIGN 325
            FS CF       + N S  +  GD    ++    F P+     Y + Y+VG+E+  +GN
Sbjct: 255 GFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGN 314

Query: 326 SCLTQ-----SGFQAL------VDSGASFTFLPTEIYAEVVVKFDKLVSSKR---ISLQG 371
              T+       F +L      +DSG ++T LP   Y++V+      ++  R   + +Q 
Sbjct: 315 VSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQ- 373

Query: 372 NSWKYCYNA--------SSEEMLKVPDMRLIFSKNQSFVVR--NHIF--SFPENEGFTVF 419
             +  CY          +S+++L  P +   F  N S V+   NH +  S P N    V 
Sbjct: 374 TGFDLCYKVPRPNNNTLTSDDLL--PSITFHFLNNVSLVLPQGNHFYPVSAPGNPA-VVK 430

Query: 420 CLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 469
           CL   ST    DG  G+ G        +V+D E  ++ +    C        +H
Sbjct: 431 CLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCASAASSQGLH 484


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 96/386 (24%), Positives = 158/386 (40%), Gaps = 57/386 (14%)

Query: 103 GNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
           G +   L+Y   + IG  N + +V  D GS+L WV     QC P    Y    ++    +
Sbjct: 58  GARLQTLNYIVTVGIGGQNSTLIV--DTGSDLTWV-----QCLPCRLCY----NQQEPLF 106

Query: 162 DPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 213
           +PS+SSS  ++ C+ P C        S   C +     C Y  DY  + + S G L    
Sbjct: 107 NPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYG-DGSYSRGEL---- 161

Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
                F K     +   + I GCGR   G +       G+MGL   ++S+ S    + L 
Sbjct: 162 ----GFEKLTLGKTEIDNFIFGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLF 212

Query: 274 QNSFSICFDE---NDSGSVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESYC 322
            + FS C        SGS+  G    +  ++ S  PI         +  + YF+ +    
Sbjct: 213 GSVFSYCLPTTGVGSSGSLTLGGADFSNFKNIS--PISYTRMIQNPQMSNFYFLNLTGIS 270

Query: 323 IGNSCL------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
           IG   L      +  G  +L+DSG   T L   IY     +F+K  S  R +   +    
Sbjct: 271 IGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNT 330

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMST--DGDYGII 433
           C+N +  E + +P ++ IF  N   +V    +F F +++   + CL   S   +    II
Sbjct: 331 CFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTMII 389

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKC 459
           G       R++++ +  K+ ++   C
Sbjct: 390 GNYQQKNQRVIYNSKESKVGFAGEPC 415


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 92/335 (27%), Positives = 142/335 (42%), Gaps = 46/335 (13%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP ++  + +D GS+L WV  QC  CA  + S Y   D     +DP+ SSS   V C 
Sbjct: 143 LGTPGMAQTLEVDTGSDLSWV--QCKPCA--APSCYRQKD---PLFDPAQSSSYAAVPCG 195

Query: 176 HPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
              C       S+C + +  C Y+  Y  + ++++G    D L LA+       ++VQ  
Sbjct: 196 RSACAGLGIYASACSAAQ--CGYVVSYG-DGSNTTGVYSSDTLTLAA------NATVQ-G 245

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGSVF 290
            + GCG  Q+G    G   DG++G G      PSL+ + AG     FS C     S + +
Sbjct: 246 FLFGCGHAQSGGLFTGI--DGLLGFGR---EQPSLVQQTAGAYGGVFSYCLPTKSSTTGY 300

Query: 291 FGDQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--LVDSGAS 342
               GP+       +T  LP       Y V +    +G   L+   S F A  +VD+G  
Sbjct: 301 LTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTV 360

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
            T LP   YA +   F   ++S   +        CY+ +    + +  + L FS   +  
Sbjct: 361 ITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMT 420

Query: 403 V-RNHIFSFPENEGFTVFCLTVMS--TDGDYGIIG 434
           +  + I SF         CL   S  +DG   I+G
Sbjct: 421 LGADGIMSF--------GCLAFASSGSDGSMAILG 447


>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
          Length = 566

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 79/277 (28%), Positives = 117/277 (42%), Gaps = 45/277 (16%)

Query: 92  FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYY 151
           FP +G+   F       L+YT + +GTP   F V +D GS++LWV C      P +    
Sbjct: 118 FPVDGASDPFLVG----LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKT---- 169

Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGY 208
           + L   LS +DP  SSS+  VSCS   C S    +S   P   C Y   Y  + + +SGY
Sbjct: 170 SELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGY 228

Query: 209 LVDDIL--HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 266
            + D +  +L S     P+ +V                      DG+ GLG G +SV S 
Sbjct: 229 YISDFMCSNLQSGDLQRPRRAV----------------------DGIFGLGQGSLSVISQ 266

Query: 267 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 326
           LA  GL    FS C   + SG       G   +  T + P+      Y V ++S  +   
Sbjct: 267 LAVQGLAPRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQ 325

Query: 327 CL--------TQSGFQALVDSGASFTFLPTEIYAEVV 355
            L          +G   ++D+G +  +LP E Y+  +
Sbjct: 326 ILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFI 362


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 92/369 (24%), Positives = 155/369 (42%), Gaps = 54/369 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP    L+A+D  ++  W+PC  C  C   SA            +DP++S+S ++V C
Sbjct: 116 LGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSA----------PPFDPAASTSYRSVPC 165

Query: 175 SHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
             PLC     ++C      C +   Y+  D+S    L  D L +A             + 
Sbjct: 166 GSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAVA--------GDAVKTY 215

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGS 288
             GC +K TG+    A P G++GLG G +S   L     + Q +FS C       N SG+
Sbjct: 216 TFGCLQKATGT---AAPPQGLLGLGRGPLSF--LSQTRDMYQGTFSYCLPSFKSLNFSGT 270

Query: 289 VFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALV 337
           +  G  G P   ++T  L    +   Y+V +    +G   +            +G   ++
Sbjct: 271 LRLGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVL 330

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG  FT L    Y  V  +  + V +   SL G  +  C+N ++   +  P + L+F  
Sbjct: 331 DSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA---VAWPPVTLLFDG 385

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMST-DG---DYGIIGQNFMMGHRIVFDRENLKLA 453
            Q  +   ++     +   T+ CL + +  DG      +I       HR++FD  N ++ 
Sbjct: 386 MQVTLPEENVVI--HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVG 443

Query: 454 WSHSKCEEV 462
           ++  +C  V
Sbjct: 444 FARERCTAV 452


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 155/362 (42%), Gaps = 50/362 (13%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP    L+A+D  ++  W+PC  C  C   SA          + +DP++S+S + V C
Sbjct: 118 LGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSA----------APFDPAASASYRTVPC 167

Query: 175 SHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
             PLC     ++C      C +   Y+  D+S    L  D L +A  +  A         
Sbjct: 168 GSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAVAGNAVKA--------Y 217

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGS 288
             GC ++ TG+    A P G++GLG G +S   L     + + +FS C       N SG+
Sbjct: 218 TFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGT 272

Query: 289 VFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQALVDSGA 341
           +  G  G P   ++T  L    +   Y+V +    +G   +        +G   ++DSG 
Sbjct: 273 LRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGAGTVLDSGT 332

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
            FT L    Y  V  +  + V +   SL G  +  C+N ++   +  P M L+F   Q  
Sbjct: 333 MFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA---VAWPPMTLLFDGMQVT 387

Query: 402 VVRNHIFSFPENEGFTVFCLTVMST-DG---DYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
           +   ++     +   T+ CL + +  DG      +I       HR++FD  N ++ ++  
Sbjct: 388 LPEENVVI--HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARE 445

Query: 458 KC 459
           +C
Sbjct: 446 RC 447


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 89/382 (23%), Positives = 147/382 (38%), Gaps = 51/382 (13%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLV-ALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEY 161
           N  Y +H   + IG P    +V  LD GS+++W  C+ C +C            + L  +
Sbjct: 89  NSEYLIH---LSIGAPRSQPVVLTLDTGSDVVWTQCEPCAEC----------FTQPLPRF 135

Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           D ++S++ ++V+CS PLC + S        C Y++ Y     S   +L D      +F  
Sbjct: 136 DTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSF----TFDD 191

Query: 222 HAPQSSVQSSVI-IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
                 V    I  GCG    G +L      G+ G G G +S+PS L         FS C
Sbjct: 192 GKGGGKVTVPDIGFGCGMYNAGRFLQ--TETGIAGFGRGPLSLPSQLK-----VRQFSYC 244

Query: 281 FD---ENDSGSVFFGDQGPATQQSTS---------FLPIGEKYDAYFVGVESYCIGNSCL 328
           F    E  S  VF G  G     +T           LP G     Y +  +   +G + L
Sbjct: 245 FTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRL 304

Query: 329 TQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
                +A       +DSG   T  P  ++ ++   F    +   ++   +    C++   
Sbjct: 305 PVPEIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQAALP-VNKTADEDDICFSWDG 363

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMG 440
           ++   +P +          + R +  +     G    C+ V ST G  D  +IG      
Sbjct: 364 KKTAAMPKLVFHLEGADWDLPRENYVTEDRESG--QVCVAV-STSGQMDRTLIGNFQQQN 420

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             IV+D    KL    ++C+++
Sbjct: 421 THIVYDLAAGKLLLVPAQCDKL 442


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 97/399 (24%), Positives = 160/399 (40%), Gaps = 76/399 (19%)

Query: 110 HYTWIDIGTP--NVSFLVALDAGSNLLWVPC----QCIQCA-------PLSASYYTSLDR 156
           H   +  GTP   +SFLV  D GS+++W PC     C  C+       P+     +S D+
Sbjct: 87  HTIPLSFGTPPQKLSFLV--DTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDK 144

Query: 157 NLSEYDPS-SSSSSKNVSCSHPLCKSRSSCKSLKDPCP-YIADYSTEDTSSSGYLVDDIL 214
            L   DP  +++SS +V    P C   S  K     CP Y   Y T   ++SG+ + + L
Sbjct: 145 ILGCRDPKCANTSSPDVHLGCPRCNGNS--KKCSHACPQYTLQYGTG--AASGFFLLENL 200

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
                + H          ++GC    T S     + D + G G    S+P  +       
Sbjct: 201 DFPGKTIH--------KFLVGC----TTSADREPSSDALAGFGRTMFSLPMQMG-----V 243

Query: 275 NSFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIG 324
             F+ C + +D      SG +   D      Q  S+ P  +        Y++GV+   IG
Sbjct: 244 KKFAYCLNSHDYDDTRNSGKLIL-DYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIG 302

Query: 325 NSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 373
           N  L   G             ++DSG ++ ++   ++  V  +  K +S  R SL+  + 
Sbjct: 303 NKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQ 362

Query: 374 --WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMS---- 425
                CYN +  + +K+PD+   F+   + VV   N+   F E    ++ C  V +    
Sbjct: 363 SGLTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEA---SLGCFPVTTDSPT 419

Query: 426 -----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                T G   I+G    + H + FD +N +L +    C
Sbjct: 420 NNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 101/429 (23%), Positives = 167/429 (38%), Gaps = 63/429 (14%)

Query: 62  LELL----LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHF-----FGNQFYWLHYT 112
           LE+L    +  ++ R+K     +   N ++ ++L     SQT F     FG         
Sbjct: 79  LEMLRWDQVRTEYVRRKASGGAEDVLNPAKPRVLM----SQTDFAVRSPFGVGSGSGSSA 134

Query: 113 WIDI-GTPNV--SFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           WID  G P V     +A+D   ++ W+   PC   QC P          +    +DP++S
Sbjct: 135 WIDADGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYP----------QRDPLFDPTTS 184

Query: 167 SSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           S++  V C  P C+S        S +S    C Y+ +YS +D +++G  + D L ++   
Sbjct: 185 STAAAVRCRSPACRSLGPYGNGCSNRSANAECRYLIEYS-DDRATAGTYMTDTLTISG-- 241

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
                ++   +   GC     G + D  A  G M LG G  S+ +  A++  + N+FS C
Sbjct: 242 -----TTAVRNFRFGCSHAVRGRFSDLTA--GTMSLGGGAQSLLAQTARS--LGNAFSYC 292

Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------YFVGVESYCIGNSCL----TQ 330
             +  S S F    GPAT  ST+         +      Y V ++   +    L      
Sbjct: 293 VPQA-SASGFLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVA 351

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
               A++DS A  T LP   Y  +   F   + +   S    +   CY+      ++VP 
Sbjct: 352 FSAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGATGTLDTCYDFLGLTNVRVPA 411

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           + L+F      V+       P          T  S+D   G IG      H +++D    
Sbjct: 412 VSLVFGGGAVVVLDP-----PAVMIGGCLAFTATSSDLALGFIGNVQQQTHEVLYDVAAG 466

Query: 451 KLAWSHSKC 459
            + +    C
Sbjct: 467 GVGFRRGAC 475


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 107/371 (28%), Positives = 157/371 (42%), Gaps = 51/371 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ I +G P    L+ LD GS++ W     IQC P S  Y  S       Y+P+ SSS 
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTW-----IQCEPCSDCYQQSD----PIYNPALSSSY 195

Query: 170 KNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           K V C   LC+    S C S    C Y   Y  + + + G    + L L      AP  +
Sbjct: 196 KLVGCQANLCQQLDVSGC-SRNGSCLYQVSYG-DGSYTQGNFATETLTLGG----APLQN 249

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDENDS 286
           V     IGCG    G ++  A    ++GLG G +S PS L  + G I   FS C  + DS
Sbjct: 250 VA----IGCGHDNEGLFVGAAG---LLGLGGGSLSFPSQLTDENGKI---FSYCLVDRDS 299

Query: 287 GS---VFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS----GFQA--- 335
            S   + FG          + +    + D  Y+V +    +G   L+ S    G  A   
Sbjct: 300 ESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGN 359

Query: 336 ---LVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
              +VDSG + T L T  Y  +   F      L S+  +SL    +  CY+ SS+E + V
Sbjct: 360 GGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSL----FDTCYDLSSKESVDV 415

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 448
           P +   FS   S  +    +  P +     FC     T     I+G     G R+ FDR 
Sbjct: 416 PTVVFHFSGGGSMSLPAKNYLVPVDS-MGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRA 474

Query: 449 NLKLAWSHSKC 459
           N ++ ++ +KC
Sbjct: 475 NNQVGFAVNKC 485


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score = 75.9 bits (185), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 72/301 (23%), Positives = 126/301 (41%), Gaps = 36/301 (11%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y +H   + +GTP     + LD GS+L+W      QCAP    +    D+ +   DP++S
Sbjct: 86  YLVH---LAVGTPPRPVALTLDTGSDLVWT-----QCAPCRDCF----DQGIPLLDPAAS 133

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S+   + C  P C++          C Y+  Y  + + + G +  D        +     
Sbjct: 134 STYAALPCGAPRCRALPFTSCGGRSCVYVYHYG-DKSVTVGKIATDRFTFGDNGRRNGDG 192

Query: 227 SVQSS--VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-- 282
           S+ ++  +  GCG    G +       G+ G G G  S+PS L        SFS CF   
Sbjct: 193 SLPATRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNA-----TSFSYCFTSM 245

Query: 283 -ENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QS 331
            ++ S  V  G    A          ++T       +   YF+ ++   +G + L   ++
Sbjct: 246 FDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPET 305

Query: 332 GFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
            F++ ++DSGAS T LP E+Y  V  +F   V      ++G++   C+      + + P 
Sbjct: 306 KFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSALWRRPA 365

Query: 391 M 391
           +
Sbjct: 366 V 366


>gi|340500865|gb|EGR27703.1| plasmepsin 5, putative [Ichthyophthirius multifiliis]
          Length = 602

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 93/403 (23%), Positives = 157/403 (38%), Gaps = 67/403 (16%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           YW++   I IG+P       +D GS LL  PCQ C  C           D     YD   
Sbjct: 46  YWIN---IYIGSPPQRQTAIIDTGSYLLAFPCQECKTCG----------DHISYPYDLEK 92

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA-------- 217
           S ++K   C       +  C +    C +   Y+ E +S SGY+  D + L         
Sbjct: 93  SLTAKKEKCKSTKLSCQGYCNNFSQECNWSVSYA-EGSSISGYMAGDYVVLGDEMQDYIE 151

Query: 218 -----SFSKHAPQSSV----QSSVII--GCGRKQTGSYLDGAAPDGVMGLGLGDVS---- 262
                  S+   Q  +      SV +  GC   +T  +L    PDG++GL   D S    
Sbjct: 152 KLTKNQISEKEEQEYLTYIKHESVFLNFGCTTNETNLFL-SQVPDGIIGLAPSDKSGRAN 210

Query: 263 ----VPSLLAKAGLIQNS----FSICFDENDSGSVFFGDQGPATQQS---TSFLPIGEKY 311
               V  +  K    QN+    FS+C +    G +  G       +    T  +P     
Sbjct: 211 TGNIVDEIFKKHK--QNNETHVFSLCLNAEKGGYMSVGGYNYELHEKNARTQIIPFDSDS 268

Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
             Y V ++   I N+ +  +    ++DSG +    P+ I   ++ K ++L  S++ S  G
Sbjct: 269 GYYSVSIKQILIQNNVIVTNIGYTIIDSGTTIVLGPSRIINPIIQKINELCESEQYSCGG 328

Query: 372 NSW-------KYCYNASSEE------MLKVPDMRLIFSKNQSFVVRNHIFSFPENE-GF- 416
           +         K+ YN S  E          P++   F   Q  V +   + + + + G+ 
Sbjct: 329 SKKNGDKQQSKFLYNPSKYENNVNNFFDSFPNIDFKFENGQVIVWKPSAYLYIDRKNGYK 388

Query: 417 TVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            ++     + +     +G  FM  + I+FDR+N ++ ++ SKC
Sbjct: 389 NLYQFGFEAYESGKLYLGGPFMKNYDILFDRDNQEIHFTASKC 431


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 99/352 (28%), Positives = 147/352 (41%), Gaps = 64/352 (18%)

Query: 70  WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW---IDIGTPNVSFLVA 126
           + R ++RV      NS  NQ  +  E  + H   N+ +     +   +  GTP   F + 
Sbjct: 124 FGRDESRVSFI---NSKFNQ--YAPENLKDHTPNNKLFDEDGNFLVDVAFGTPPQKFTLI 178

Query: 127 LDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSC 185
           LD GS++ W  C+ C++C          L  +   +DPS+S           L  S  SC
Sbjct: 179 LDTGSSITWTQCKPCVRC----------LKASRRHFDPSAS-----------LTYSLGSC 217

Query: 186 KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYL 245
                   Y   Y  + TS   Y  D +            S V      GCGR   G + 
Sbjct: 218 IPSTVGNTYNMTYGDKSTSVGNYGCDTMT--------LEHSDVFPKFQFGCGRNNEGDFG 269

Query: 246 DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFGDQGPATQQSTSF 304
            GA  DG++GLG G +S  S  A     +  FS C  E DS GS+ FG++  AT QS+S 
Sbjct: 270 SGA--DGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFGEK--ATSQSSSL 323

Query: 305 ----LPIG------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDSGASFTFLPTE 349
               L  G      E+   YFV +    +GN  L    S F +   ++DSG   T LP  
Sbjct: 324 KFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSGTVITRLPQR 383

Query: 350 IYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
            Y+ +   F K ++   +S     +G+    CYN S  + + +P++ L F +
Sbjct: 384 AYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGE 435


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 102/435 (23%), Positives = 174/435 (40%), Gaps = 68/435 (15%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           L +N   +   R  L +N N++  QL  P      +    ++       + IGTP     
Sbjct: 52  LSTNTALKMMLRNSLIANTNNNNTQLKSPPSSPYNYKLSFKYSMALIVDLPIGTPPQVQP 111

Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
           + LD GS L W+  QC + AP       S       +DPS SS+   + C+HP+CK R  
Sbjct: 112 MVLDTGSQLSWI--QCHKKAPAKPPPTAS-------FDPSLSSTFSTLPCTHPVCKPRIP 162

Query: 185 CKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
             +L   C      + + +  + T + G LV +     +FS+    S     +I+GC  +
Sbjct: 163 DFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKF---TFSR----SLFTPPLILGCATE 215

Query: 240 QTGSYLDGAAPDGVMGLGLGDVS-------------VPSLLAKAGLI-QNSFSICFDEND 285
            T        P G++G+  G +S             VP+ + + G     SF +  + N 
Sbjct: 216 STD-------PRGILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPTGSFYLGHNPNS 268

Query: 286 SGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLTQS----------GFQ 334
           +   +      A  Q    L P+     AY V ++   IG   L  S            Q
Sbjct: 269 NTFRYIEMLTFARSQRMPNLDPL-----AYTVALQGIRIGGRKLNISPAVFRADAGGSGQ 323

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLK-VPDM 391
            ++DSG+ FT+L  E Y +V  +  + V    K+  + G     C++ ++ E+ + + DM
Sbjct: 324 TMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGDM 383

Query: 392 RLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDR 447
              F K    VV +  + +  E     V C+ + ++D       IIG        + FD 
Sbjct: 384 VFEFEKGVQIVVPKERVLATVEGG---VHCIGIANSDKLGAASNIIGNFHQQNLWVEFDL 440

Query: 448 ENLKLAWSHSKCEEV 462
            N ++ +  + C  +
Sbjct: 441 VNRRMGFGTADCSRL 455


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 81/360 (22%), Positives = 141/360 (39%), Gaps = 41/360 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P     Y    +    +DP+ SS+  NVS
Sbjct: 167 VGLGTPASKYTVVFDTGSDTTWV-----QCRPCVVKCY---KQKEPLFDPAKSSTYANVS 218

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+   C    +       C Y   Y  + + + G+   D L +A                
Sbjct: 219 CTDSACADLDTNGCTGGHCLYAVQYG-DGSYTVGFFAQDTLTIA--------HDAIKGFR 269

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GCG K  G +   A   G+MGLG G  S+   +        +F+ C     +G+ +  D
Sbjct: 270 FGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYCLPALTTGTGYL-D 323

Query: 294 QGPATQQSTSFL-PI----GEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSGA 341
            GP +  + + L P+    G+ +  Y+VG+    +G        S  + +G   LVDSG 
Sbjct: 324 FGPGSAGNNARLTPMLTDKGQTF--YYVGMTGIRVGGQQVPVAESVFSTAG--TLVDSGT 379

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
             T LP   Y  +   FDK++ ++  + +   +    CY+ +    +++P + L+F    
Sbjct: 380 VITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGA 439

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              V      +  +E            D    I+G      + +++D     + ++   C
Sbjct: 440 CLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 149/366 (40%), Gaps = 55/366 (15%)

Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
           + LD GS+++WV     QCAP    Y    +++   +DP  SSS   V C   LC+   S
Sbjct: 1   MVLDTGSDVVWV-----QCAPCRRCY----EQSGPVFDPRRSSSYGAVGCGAALCRRLDS 51

Query: 185 --CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
             C   +  C Y   Y  + + ++G  V + L  A  ++ A        V +GCG    G
Sbjct: 52  GGCDLRRGACMYQVAYG-DGSVTAGDFVTETLTFAGGARVA-------RVALGCGHDNEG 103

Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSG-----------SVF 290
            ++  A   G+       +S P+ +++      SFS C  D   SG           +V 
Sbjct: 104 LFVAAAGLLGLGRG---GLSFPTQISR--RYGRSFSYCLVDRTSSGAGAAPGSHRSSTVS 158

Query: 291 FGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLTQSGFQ---------A 335
           FG  G     S SF P+         Y+V +    +G +    + +S  +          
Sbjct: 159 FG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGV 217

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSK-RISLQGNS-WKYCYNASSEEMLKVPDMRL 393
           +VDSG S T L    Y+ +   F    +   R+S  G S +  CY+     ++KVP + +
Sbjct: 218 IVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSM 277

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
            F+      +    +  P +   T FC     TDG   IIG     G R+VFD +  ++ 
Sbjct: 278 HFAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVG 336

Query: 454 WSHSKC 459
           ++   C
Sbjct: 337 FAPKGC 342


>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 354

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 90/201 (44%), Gaps = 30/201 (14%)

Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
           GN F   +Y+  + IGTP  +F   +D GS+L WV C     AP +      +     +Y
Sbjct: 46  GNVFPLGYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCD----APCTGCTLPPI----RQY 97

Query: 162 DPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
            P  ++    V C  P+C      ++  C + K+ C Y  +Y+ +  SS G LV D   L
Sbjct: 98  KPKGNT----VPCLDPICLALHFPNKPQCPNPKEQCDYEVNYA-DQGSSMGALVIDQFPL 152

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGL 272
              +     S++Q  +  GCG  Q    L  A P     GV+GLG G + V   L  AGL
Sbjct: 153 KLLNG----SAMQPRLAFGCGYDQI---LPKAHPPPATAGVLGLGRGKIGVLPQLVAAGL 205

Query: 273 IQNSFSICFDENDSGSVFFGD 293
            +N    C      G +FFGD
Sbjct: 206 TRNVVGHCLSSKGGGYLFFGD 226


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 107/447 (23%), Positives = 180/447 (40%), Gaps = 54/447 (12%)

Query: 34  DEAKERWISKSGNVSVADSWPKKNSVEY---LELLLSNDWKRQKTRVK-LQSNNNSSRNQ 89
           +E  E+W+ K   V   D     NS ++   L+  L  D KR  + ++ L S    S   
Sbjct: 66  EEGGEKWMMK---VVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRV 122

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSA 148
             F ++       G+  Y++    I +G+P  S  + +D+GS+++WV CQ C QC     
Sbjct: 123 DDFGTDVISGMEQGSGEYFVR---IGVGSPPRSQYMVIDSGSDIVWVQCQPCTQC----- 174

Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
             Y   D     +DP+ S+S   VSCS  +C    +       C Y   Y  + + + G 
Sbjct: 175 --YHQSD---PVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYG-DGSYTKGT 228

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
           L    L   +F +     ++  SV IGCG +  G ++  A   G+        S+  +  
Sbjct: 229 LA---LETLTFGR-----TMVRSVAIGCGHRNRGMFVGAAGLLGLG-----GGSMSFVGQ 275

Query: 269 KAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYC 322
             G    +FS C      + SGS+ FG +  A     +++P+     A   Y++G+    
Sbjct: 276 LGGQTGGAFSYCLVSRGTDSSGSLVFGRE--ALPAGAAWVPLVRNPRAPSFYYIGLAGLG 333

Query: 323 IGNS---------CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 372
           +G            LT+ G   +V D+G + T LPT  Y      F    ++   +    
Sbjct: 334 VGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA 393

Query: 373 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 432
            +  CY+      ++VP +   FS      +    F  P ++  T FC     +     I
Sbjct: 394 IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGT-FCFAFAPSTSGLSI 452

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +G     G +I FD  N  + +  + C
Sbjct: 453 LGNIQQEGIQISFDGANGYVGFGPNIC 479


>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 410

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 102/388 (26%), Positives = 162/388 (41%), Gaps = 57/388 (14%)

Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN +   H+T  + IG P   F + +D GS+L WV C   C  C           DR   
Sbjct: 47  GNVYPLGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGCT-------LPHDR--- 96

Query: 160 EYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI- 213
            Y P ++     V C  PLC      S+S CK+  D C Y  +Y+ +  SS G LV D  
Sbjct: 97  LYKPHNNV----VRCGEPLCSALFSASKSPCKNPNDQCDYEVEYA-DHGSSIGVLVKDPV 151

Query: 214 -LHLASFSKHAPQSSVQSSVIIGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 270
            L L + +  AP      ++  GCG  Q   GS L      GV+GLG    ++ + L+  
Sbjct: 152 PLRLTNGTILAP------NLGFGCGYDQHNGGSQLPPLT-AGVLGLGNSKATMATQLSAL 204

Query: 271 GLIQNSFSIC-FDENDSGSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGN 325
             ++N    C   +      F GD  P++    S++PI    G KY A   G      G 
Sbjct: 205 SHVRNVLGHCFSGQGGGFLFFGGDLVPSS--GMSWMPILRTPGGKYSA---GPAEVYFGG 259

Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSE 383
           + +   G     DSG+S+T+  +++Y  V+      +  +  R + +  +   C+   S+
Sbjct: 260 NPVGIRGLILTFDSGSSYTYFNSQVYGAVLNLLRNGLKGQPLRDAPEDKTLPICWKG-SK 318

Query: 384 EMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGII 433
               V D+R  F     SF      F  P      +      CL +++      G+  +I
Sbjct: 319 AFKSVADVRNFFKPLALSFGNSKVQFQIPPEAYLIISNLGNVCLGILNGSQVGLGNVNLI 378

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           G   M+   +V+D E  ++ W+ + C +
Sbjct: 379 GDISMLDKMMVYDNERQQIGWAPANCSK 406


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 85/313 (27%), Positives = 134/313 (42%), Gaps = 39/313 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP V+  + +D GS+L WV  QC  CA  + + Y+  D     +DP+ SSS   V 
Sbjct: 144 VSLGTPGVAQTLEVDTGSDLSWV--QCTPCA--APACYSQKD---PLFDPAQSSSYAAVP 196

Query: 174 CSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           C  P+C       SSC + +  C Y+  Y  + + ++G    D L L      +P  +V+
Sbjct: 197 CGGPVCGGLGIYASSCSAAQ--CGYVVSYG-DGSKTTGVYSSDTLTL------SPNDAVR 247

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
                GCG  Q+G   +    DG++GLG  + S+  +   AG     FS C     S + 
Sbjct: 248 -GFFFGCGHAQSGFTGN----DGLLGLGREEASL--VEQTAGTYGGVFSYCLPTRPSTTG 300

Query: 290 FFGDQGPATQ-----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--LVDSG 340
           +    GP+        +T  L        Y V +    +G   L+   S F    +VD+G
Sbjct: 301 YLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTG 360

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
              T LP   YA +   F   ++S     +        CYN S    + +P++ L FS  
Sbjct: 361 TVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVALTFSGG 420

Query: 399 QSFVV-RNHIFSF 410
            +  +  + I SF
Sbjct: 421 ATVTLGADGILSF 433


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 108/441 (24%), Positives = 170/441 (38%), Gaps = 66/441 (14%)

Query: 60  EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFF-GNQFYWLHY-TWIDIG 117
            YL  LL+ D  R  +    Q   N  R      S  ++     G +   L+Y T I +G
Sbjct: 95  RYLRRLLAADESRANS---FQPRRNKDRASASTQSASAEVPLTSGIRLQTLNYVTTISLG 151

Query: 118 ----TPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
               +P  +  V +D GS+L WV     QC P SA Y     +    +DP+ S++   V 
Sbjct: 152 GSSGSPAANLTVIVDTGSDLTWV-----QCKPCSACYA----QRDPLFDPAGSATYAAVR 202

Query: 174 CSHPLCK--------SRSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           C+   C         +  SC S     + C Y   Y  + + S G L  D + L   S  
Sbjct: 203 CNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYG-DGSFSRGVLATDTVALGGAS-- 259

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
                     + GCG    G +       G+MGLG  ++S+ S  A        FS C  
Sbjct: 260 ------LGGFVFGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTAS--RYGGVFSYCLP 308

Query: 283 ENDSG------SVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCL 328
              SG      S+  GD   ++ ++T+  P+          +   YF+ V    +G + L
Sbjct: 309 AATSGDASGSLSLGGGDDAASSYRNTT--PVAYTRMIADPAQPPFYFLNVTGAAVGGTAL 366

Query: 329 TQSGFQA---LVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNS-WKYCYNASSE 383
              G  A   L+DSG   T L   +Y  V  +F  +  ++   +  G S    CY+ +  
Sbjct: 367 AAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGH 426

Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGH 441
           + +KVP + L         V      F   +  +  CL +  +S + +  IIG       
Sbjct: 427 DEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNK 486

Query: 442 RIVFDRENLKLAWSHSKCEEV 462
           R+V+D    +L ++   C  V
Sbjct: 487 RVVYDTLGSRLGFADEDCNYV 507


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 98/396 (24%), Positives = 149/396 (37%), Gaps = 76/396 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  + IG P  S L+  D GS+L+WV C  C  C+  S +         + + P  SS+
Sbjct: 84  YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPA---------TVFFPRHSST 134

Query: 169 SKNVSCSHPLCKSRSSCKSLKDP-CPYIADYST---EDTSSSGYLVDDIL--HLASFSKH 222
                C  P+C  R   K  + P C +   +ST   E   + G L   +      S    
Sbjct: 135 FSPAHCYDPVC--RLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTS 192

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
           + + +   SV  GCG + +G  + G +    +GVMGLG G +S  S L +     N FS 
Sbjct: 193 SGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSY 250

Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------------------YFVGVESY 321
           C  +              +   TS+L IG   D                   Y+V ++S 
Sbjct: 251 CLMDYT-----------LSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSV 299

Query: 322 CIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
            +  + L                 +VDSG +  FL    Y  V+    + V         
Sbjct: 300 FVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALT 359

Query: 372 NSWKYCYNASS----EEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMS 425
             +  C N S     E++L  P ++  FS    FV   RN+     E     + CL + S
Sbjct: 360 PGFDLCVNVSGVTKPEKIL--PRLKFEFSGGAVFVPPPRNYFIETEEQ----IQCLAIQS 413

Query: 426 TDGDYG--IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            D   G  +IG     G    FDR+  +L +S   C
Sbjct: 414 VDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 81/360 (22%), Positives = 141/360 (39%), Gaps = 41/360 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P     Y    +    +DP+ SS+  NVS
Sbjct: 167 VGLGTPASKYTVVFDTGSDTTWV-----QCRPCVVKCY---KQKGPLFDPAKSSTYANVS 218

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+   C    +       C Y   Y  + + + G+   D L +A                
Sbjct: 219 CTDSACADLDTNGCTGGHCLYAVQYG-DGSYTVGFFAQDTLTIA--------HDAIKGFR 269

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GCG K  G +   A   G+MGLG G  S+   +        +F+ C     +G+ +  D
Sbjct: 270 FGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYCLPALTTGTGYL-D 323

Query: 294 QGPATQQSTSFL-PI----GEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSGA 341
            GP +  + + L P+    G+ +  Y+VG+    +G        S  + +G   LVDSG 
Sbjct: 324 FGPGSAGNNARLTPMLTDKGQTF--YYVGMTGIRVGGQQVPVAESVFSTAG--TLVDSGT 379

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
             T LP   Y  +   FDK++ ++  + +   +    CY+ +    +++P + L+F    
Sbjct: 380 VITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGA 439

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              V      +  +E            D    I+G      + +++D     + ++   C
Sbjct: 440 CLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 97/382 (25%), Positives = 162/382 (42%), Gaps = 55/382 (14%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CI-QCAPLSASYYTSLDRNLSEY 161
           NQF+      I +GTP V  LV +D GS + WV CQ CI  C       YT   R    +
Sbjct: 21  NQFFM----GISLGTPAVFNLVTIDTGSTISWVQCQYCIVHC-------YTQDQRAGPTF 69

Query: 162 DPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
           + SSSS+ + V CS  +C          S C   +D C Y   Y++ +  S+GYL  D L
Sbjct: 70  NTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEY-SAGYLSQDRL 128

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
            LA+        S+Q   I GCG   + +  +G +  G++G G    S  + +A+     
Sbjct: 129 TLAN------SYSIQ-KFIFGCG---SDNRYNGHSA-GIIGFGNKSYSFFNQIAQL-TNY 176

Query: 275 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 334
           ++FS CF  N     F    GP  + S   + + + +D Y   +  Y +    +  +G +
Sbjct: 177 SAFSYCFPSNQENEGFL-SIGPYVRDSNKLI-LTQLFD-YGAHLPVYALQQFDMMVNGMR 233

Query: 335 ------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY--NA 380
                        +VDSG   TF+ + ++  +     K + ++      +S + C+  N 
Sbjct: 234 LQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKEICFHSNG 293

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNF 437
            S +  K+P + + FS++   +   ++F +  ++G    C T    D       I+G   
Sbjct: 294 DSVDWSKLPVVEIKFSRSILKLPAENVFYYETSDG--SICSTFQPDDAGVPGVQILGNRA 351

Query: 438 MMGHRIVFDRENLKLAWSHSKC 459
               R+VFD +     +    C
Sbjct: 352 TRSFRVVFDIQQRNFGFEAGAC 373


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 92/389 (23%), Positives = 145/389 (37%), Gaps = 49/389 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++    +GTP   FL+  D GS+L WV C+    A  S S   S       + P  S + 
Sbjct: 97  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156

Query: 170 KNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
             +SC+   C      S ++C +   PC Y  DY  +D S++   V       + S    
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAY--DYRYKDGSAARGTVGTESATIALSGREE 214

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
           + +    +++GC    TG   +  A DGV+ LG   +S  S  A        FS C    
Sbjct: 215 RKAKLKGLVLGCSSSYTGPSFE--ASDGVLSLGYSGISFASHAAS--RFGGRFSYCLVDH 270

Query: 282 --DENDSGSVFFGDQGPATQ----------------QSTSFLPIGEKYDAYFVGVESYCI 323
               N +  + FG   PA                  + T  L        Y V +++  +
Sbjct: 271 LSPRNATSYLTFGPN-PAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISV 329

Query: 324 GNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSW 374
               L         ++G   ++DSG S T L    Y  VV    K L    R+++  + +
Sbjct: 330 AGEFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTM--DPF 387

Query: 375 KYCYNASS----EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
           +YCYN +S    +  + VP M + F+           +      G     L      G  
Sbjct: 388 EYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPG-I 446

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            +IG      H   FD +N +L +  S+C
Sbjct: 447 SVIGNILQQEHLWEFDIKNRRLKFQRSRC 475


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 152/378 (40%), Gaps = 73/378 (19%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + IGTP     + LD GS+L+W  CQ C  C           D+ L  +DPS+
Sbjct: 89  YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC----------FDQALPYFDPST 135

Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           SS+    SC   LC+                          G  V  +     F+     
Sbjct: 136 SSTLSLTSCDSTLCQ--------------------------GLPVASLPRSDKFTFVGAG 169

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-- 283
           +SV   V  GCG    G +       G+ G G G +S+PS L K G    +FS CF    
Sbjct: 170 ASV-PGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTIT 221

Query: 284 ---------NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSG 332
                    +    +F   QG    Q+T  +        Y++ ++   +G++ L   +S 
Sbjct: 222 GAIPSTVLLDLPADLFSNGQGAV--QTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESE 279

Query: 333 FQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
           F         ++DSG + T LPT +Y  V   F   V    +S       +C +A     
Sbjct: 280 FALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAK 339

Query: 386 LKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 444
             VP + L F      + R N++F   E+ G ++ CL ++   G+   IG        ++
Sbjct: 340 PYVPKLVLHFEGATMDLPRENYVFEV-EDAGSSILCLAIIE-GGEVTTIGNFQQQNMHVL 397

Query: 445 FDRENLKLAWSHSKCEEV 462
           +D +N KL++  ++C+++
Sbjct: 398 YDLQNSKLSFVPAQCDKL 415


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 90/381 (23%), Positives = 147/381 (38%), Gaps = 48/381 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++    +GTP   F++  D GS+L WV C+    A  + +   +       +  ++S S 
Sbjct: 101 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPA-----RVFRTAASKSW 155

Query: 170 KNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA------ 217
             ++CS   C S      ++C S   PC Y  DY   D S++ G +  D   +A      
Sbjct: 156 APIACSSDTCTSYVPFSLANCSSPASPCAY--DYRYRDGSAARGVVGTDSATIALSSGSG 213

Query: 218 --SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
                    + +    V++GC     G     +  DGV+ LG  ++S  S    A     
Sbjct: 214 RGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSS--DGVLSLGNSNISFASR--AAARFGG 269

Query: 276 SFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 328
            FS C        N +  + FG    A    T  L        Y V V++  +    L  
Sbjct: 270 RFSYCLVDHLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDI 329

Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNAS 381
                       A++DSG S T L T  Y  VV    K L    R+++  + ++YCYN +
Sbjct: 330 PADVWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTM--DPFEYCYNWT 387

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GIIGQNFM 438
               L++P M + F+ +         +      G  V C+ V   +G +    +IG    
Sbjct: 388 DAGALEIPKMEVHFAGSARLEPPAKSYVIDAAPG--VKCIGVQ--EGSWPGVSVIGNILQ 443

Query: 439 MGHRIVFDRENLKLAWSHSKC 459
             H   FD  +  L + H++C
Sbjct: 444 QEHLWEFDLRDRWLRFKHTRC 464


>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Cucumis sativus]
          Length = 418

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 89/388 (22%), Positives = 154/388 (39%), Gaps = 48/388 (12%)

Query: 98  QTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLD 155
           Q + + N FY +    + +G P   + +  D GS+L W+ C   C QC            
Sbjct: 48  QGNVYPNGFYNVT---LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCT----------- 93

Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVD 211
                  P    S+  V C  PLC S  S    +    D C Y  +Y+ +  SS G LV 
Sbjct: 94  ---ETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYA-DGGSSLGVLVR 149

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
           D+  L + +   P   ++  + +GCG  Q          DG++GLG G VS+ S L   G
Sbjct: 150 DVFPL-NLTNGDP---IRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQG 205

Query: 272 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQ 330
           +++N    CF+        F   G        + P+   Y  ++  G             
Sbjct: 206 IVRNVVGHCFNSKGG-GYXFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL 264

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKV 388
                + DSG+S+T+   + Y  +    ++ ++ K  R ++  ++   C+    + +  +
Sbjct: 265 RNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG-RKPIKSL 323

Query: 389 PDMRLIFSK----NQSFVVRNHIFSFPENEGFTVF------CLTVMS-TD---GDYGIIG 434
            D+R  F        S      +F  P  EG+ +       CL +++ TD    +  IIG
Sbjct: 324 RDVRKYFKPLALSFSSGGRSKAVFEIP-TEGYMIISSMGNVCLGILNGTDVGLENSNIIG 382

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKCEEV 462
              M    +V++ E   + W+ + C+ V
Sbjct: 383 DISMQDKMVVYNNEKQAIGWATANCDRV 410


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 92/388 (23%), Positives = 150/388 (38%), Gaps = 53/388 (13%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEY 161
           G     L+Y    +G       V +D  S L WV CQ C  C           D+    +
Sbjct: 112 GANLRTLNYVAT-VGLGAAEATVVVDTASELTWVQCQPCESCH----------DQQDPLF 160

Query: 162 DPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCP----------YIADYSTEDTSSSGYLV 210
           DPSSS S   V C+   C + R +  +   PC           Y   Y  + + S G L 
Sbjct: 161 DPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYR-DGSYSRGVLA 219

Query: 211 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLAK 269
            D L LA               + GCG    G+   G +  G+MGLG   VS V   + +
Sbjct: 220 RDKLRLAGQDIEG--------FVFGCGTSNQGAPFGGTS--GLMGLGRSHVSLVSQTMDQ 269

Query: 270 AGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-------YFVGVE 319
            G +   FS C    +   SGS+  GD   A + ST  +      D+       YF+ + 
Sbjct: 270 FGGV---FSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLT 326

Query: 320 SYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
              +G   +    F A   ++DSG   T L   +Y  V  +F   ++    +   +    
Sbjct: 327 GITVGGQEVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDT 386

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIG 434
           C+N +  + ++VP ++ +F  +    V +    +  +   +  CL + S   +Y   IIG
Sbjct: 387 CFNLTGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIG 446

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKCEEV 462
                  R++FD    ++ ++   C+ +
Sbjct: 447 NYQQKNLRVIFDTLGSQIGFAQETCDYI 474


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 103/429 (24%), Positives = 171/429 (39%), Gaps = 56/429 (13%)

Query: 55  KKNSVEYLELLLSNDWKRQKTRVKLQSN----NNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
           K NS  + ++L  ++ +    + +L  N    +N   ++   PS+ + T   GN     +
Sbjct: 93  KANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLPSKSASTLGSGN-----Y 147

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
              + +G+P        D GS+L W      QC P     Y   +     +DPS+S S  
Sbjct: 148 VVTVGLGSPKRDLTFIFDTGSDLTWT-----QCEPCVGYCYQQREH---IFDPSTSLSYS 199

Query: 171 NVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           NVSC  P C+   S       C S    C Y   Y  + + S G+   + L L S     
Sbjct: 200 NVSCDSPSCEKLESATGNSPGCSS--STCLYGIRYG-DGSYSIGFFAREKLSLTS----- 251

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICF- 281
             + V ++   GCG+   G +  G A  G++GL    +S+ S  A K G +   FS C  
Sbjct: 252 --TDVFNNFQFGCGQNNRGLF-GGTA--GLLGLARNPLSLVSQTAQKYGKV---FSYCLP 303

Query: 282 -DENDSGSVFFGDQGPATQQSTSFLP--IGEKYDAYF--------VGVESYCIGNSCLTQ 330
              + +G + FG  G    ++  F P  +   Y +++        VG     I  S  + 
Sbjct: 304 SSSSSTGYLSFG-SGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFST 362

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
           +G   ++DSG   + LP  +Y+ V   F +L+S        +    CY+ S  + +KVP 
Sbjct: 363 AG--TIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPK 420

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           + L FS      +      +              S D +  IIG        +V+D    
Sbjct: 421 IILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEG 480

Query: 451 KLAWSHSKC 459
           ++ ++ S C
Sbjct: 481 RVGFAPSGC 489


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 108/434 (24%), Positives = 196/434 (45%), Gaps = 87/434 (20%)

Query: 77  VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW---IDIGTPNVSFLVALDAGSNL 133
           +++Q+N++ S  +L F +  S+T   G   +  + T    + IGTP  +  + LD GS L
Sbjct: 34  LRIQNNHHISTRRL-FSNSSSKTT--GKLLFHHNVTLTASLTIGTPPQNITMVLDTGSEL 90

Query: 134 LWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK---D 190
            W+ C+           +TS+      ++P +S +   + CS   CK+R+S  +L    D
Sbjct: 91  SWLRCK-------KEPNFTSI------FNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCD 137

Query: 191 P---CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYL-D 246
           P   C +I  Y+ + +S  G+L  +     S ++ A         + GC    + S   +
Sbjct: 138 PAKLCHFIISYA-DASSVEGHLAFETFRFGSLTRPA--------TVFGCMDSGSSSNTEE 188

Query: 247 GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFGDQ----------G 295
            A   G+MG+  G +   S + + G     FS C    DS G +  G+            
Sbjct: 189 DAKTTGLMGMNRGSL---SFVNQMGF--RKFSYCISGLDSTGFLLLGEARYSWLKPLNYT 243

Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFT 344
           P  Q ST  LP  ++  AY V +E   + N  L           T +G Q +VDSG  FT
Sbjct: 244 PLVQISTP-LPYFDRV-AYSVQLEGIKVNNKVLPLPKSVFVPDHTGAG-QTMVDSGTQFT 300

Query: 345 FLPTEIYAEVVVKF-------DKLVSSKRISLQGNSWKYCY--NASSEEMLKVPDMRLIF 395
           FL   +Y+ +  +F        ++++  +   QG +   CY  +++S  +  +P ++L+F
Sbjct: 301 FLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQG-AMDLCYLIDSTSSTLPNLPVVKLMF 359

Query: 396 SKNQSFVV-RNHIFSFP-ENEGF-TVFCLTVMSTDGDYGIIGQNFMMGHR------IVFD 446
              +  V  +  ++  P E  G  +V+C T  ++D + GI   +F++GH       + +D
Sbjct: 360 RGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSD-ELGI--SSFLIGHHQQQNVWMEYD 416

Query: 447 RENLKLAWSHSKCE 460
            EN ++ ++  +C+
Sbjct: 417 LENSRIGFAELRCD 430


>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 547

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 94/213 (44%), Gaps = 22/213 (10%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +YT++ IGTP  +    LD GS L   PC  C +C P               + P  SS+
Sbjct: 81  YYTYLTIGTPGQTVSGILDTGSTLPAFPCSGCTRCGP----------SKTGMFKPELSST 130

Query: 169 SKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           S    CS   C    +SC    + C Y   Y  E +S+SG+L +D+L +      A    
Sbjct: 131 SSTFGCSDARCFCGANSCSCNNEQCGYSIRY-LEGSSTSGFLAEDMLAVGDGGPAA---- 185

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
              + + GC + ++G      A DGV G+G    S+   L + G+I ++FS+CF     G
Sbjct: 186 ---NFVFGCAQSESGLLYSQIA-DGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREG 241

Query: 288 SVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVE 319
            +  G+   PA   +    P+    + + + +E
Sbjct: 242 VLLLGNVALPADAPAPVVTPVVGNTNKFNIQIE 274


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/381 (23%), Positives = 158/381 (41%), Gaps = 55/381 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +G+P   F + LD GS+L W+  QC+ C       +    +N + YDP +S+S KN++C+
Sbjct: 161 VGSPPKHFSLILDTGSDLNWI--QCLPC-------HDCFQQNGAFYDPKASASYKNITCN 211

Query: 176 HPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
            P C   S       CKS    CPY   Y     ++  + V+      + S  + +    
Sbjct: 212 DPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNV 271

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
            +++ GCG    G +   A    ++GLG G +S  S L    L  +SFS C      D N
Sbjct: 272 ENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDTN 326

Query: 285 DSGSVFFGDQGPATQQS----TSFLPIGEKY--DAYFVGVESYCIGNSCL---------- 328
            S  + FG+            TSF+   E      Y+V ++S  +    L          
Sbjct: 327 VSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNIS 386

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYNASSEEMLK 387
           +      ++DSG + ++     Y  +  K  +    K  +         C+N S  + ++
Sbjct: 387 SDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQ 446

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLTVMST-DGDYGIIGQNFMMGH 441
           +P++ + F+          +++FP    F      + CL ++ T    + IIG       
Sbjct: 447 LPELGIAFADGA-------VWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNF 499

Query: 442 RIVFDRENLKLAWSHSKCEEV 462
            I++D +  +L ++ +KC ++
Sbjct: 500 HILYDTKRSRLGYAPTKCADI 520


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 91/369 (24%), Positives = 150/369 (40%), Gaps = 51/369 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +++ + +G P   F + LD GS++ W+ CQ C  C       Y   D     +DP SSSS
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDC-------YQQTD---PIFDPRSSSS 204

Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
             ++ C    C++   S C++ K  C Y   Y  + + + G  V + L   +       S
Sbjct: 205 FASLPCESQQCQALETSGCRASK--CLYQVSYG-DGSFTVGEFVTETLTFGN-------S 254

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN-- 284
            + + V +GCG    G +        V   GL  +    L   + +  +SFS C  +   
Sbjct: 255 GMINDVAVGCGHDNEGLF--------VGSAGLLGLGGGPLSLTSQMKASSFSYCLVDRDS 306

Query: 285 -DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT---------QSGFQ 334
             S  + F    P+   +   L  G+    Y+VG+    +G   L+          SG+ 
Sbjct: 307 SSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYG 366

Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKVPD 390
             +VDSG + T L T+ Y  +    D  VS      + N +     CY+ SS+  + +P 
Sbjct: 367 GIIVDSGTAITRLQTQAYNTLR---DAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPT 423

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           +   F+  +S  +    +  P +   T FC     T     IIG     G R+ +D  N 
Sbjct: 424 VSFEFAGGKSLQLPPKNYLIPVDSVGT-FCFAFAPTTSSLSIIGNVQQQGTRVHYDLANS 482

Query: 451 KLAWSHSKC 459
            + +S  KC
Sbjct: 483 VVGFSPHKC 491


>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
          Length = 507

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 162/378 (42%), Gaps = 75/378 (19%)

Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           T I +G  N +FLV +D GS L+ +P +    C++  P+              Y PSS+S
Sbjct: 124 TQIIVG--NTTFLVQVDTGSLLMAIPLEGCNTCVESRPV--------------YHPSSTS 167

Query: 168 SSKNVSCSHPLCKSRSSC------KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           +   V+CS   CK   S        S  + C +   Y  + +  SGY+ +D+++LA    
Sbjct: 168 T--KVACSSDQCKGSGSTPPSCSRTSSGESCDFQIRYG-DGSHVSGYIYEDVVNLAG--- 221

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VP----SLLAKAGLIQNS 276
                 +Q     G   ++TG + +    DG++G G    S VP    SL++  GL +N 
Sbjct: 222 ------LQGKANFGANDEETGDF-EYPRADGIIGFGRTCSSCVPTVWDSLVSDLGL-KNQ 273

Query: 277 FSICFDENDSGSVFFGD-----------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
           F +  +    GS+  G+             P  Q++T F  +     +  + +  Y I  
Sbjct: 274 FGMLLNYEGGGSLSLGEINTSYYTGDIRYTPLVQKNTPFYSV----KSTGIRINDYTIPG 329

Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-----NSWKYCYNA 380
           S L Q   + +VDSG++   L +  Y ++   F     +   S+QG     N ++     
Sbjct: 330 SKLGQ---EVIVDSGSTALSLASGAYDQLRNYFQ----THYCSIQGVCENPNIFQGSICY 382

Query: 381 SSEEML-KVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 437
           SS+++L K P +   F       +  +N++   P   G   +C  +   D    I+G  F
Sbjct: 383 SSDDVLSKFPTLYFTFDGGVQVAIPPKNYLVKAPLTNGKYGYCFMIERADSTMTILGDVF 442

Query: 438 MMGHRIVFDRENLKLAWS 455
           M G+  VFD  N ++ ++
Sbjct: 443 MRGYYTVFDNVNDRVGFA 460


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 155/362 (42%), Gaps = 50/362 (13%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP    L+A+D  ++  W+PC  C  C   SA          + +DP+SS+S + V C
Sbjct: 118 LGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSA----------APFDPASSASYRTVPC 167

Query: 175 SHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
             PLC     ++C      C +   Y+  D+S    L  D L +A  +  A         
Sbjct: 168 GSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAVAGNAVKA--------Y 217

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGS 288
             GC ++ TG+    A P G++GLG G +S   L     + + +FS C       N SG+
Sbjct: 218 TFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGT 272

Query: 289 VFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQALVDSGA 341
           +  G  G P   ++T  L    +   Y+V +    +G   +        +G   ++DSG 
Sbjct: 273 LRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPAFDPATGAGTVLDSGT 332

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
            FT L    Y  V  +  + V +   SL G  +  C+N ++   +  P + L+F   Q  
Sbjct: 333 MFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA---VAWPPVTLLFDGMQVT 387

Query: 402 VVRNHIFSFPENEGFTVFCLTVMST-DG---DYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
           +   ++     +   T+ CL + +  DG      +I       HR++FD  N ++ ++  
Sbjct: 388 LPEENVVI--HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARE 445

Query: 458 KC 459
           +C
Sbjct: 446 RC 447


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 89/365 (24%), Positives = 141/365 (38%), Gaps = 38/365 (10%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  + +GTP     +  D GS L W      QC P + S Y   D     +DPS SSS 
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWT-----QCEPCAGSCYKQQD---PIFDPSKSSSY 191

Query: 170 KNVSCSHPLCKSRSS--CKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
            N+ C+  LC    S  C S  D  C Y   Y  +++ S G+L  + L + +       +
Sbjct: 192 TNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYG-DNSISRGFLSQERLTITA-------T 243

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
            +    + GCG+   G +   A   G+MGL    +S   +   + +    FS C     S
Sbjct: 244 DIVHDFLFGCGQDNEGLFRGTA---GLMGLSRHPISF--VQQTSSIYNKIFSYCLPSTPS 298

Query: 287 --GSVFFGDQGP--ATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQAL 336
             G + FG      A  + T F  I  +   Y + +    +G + L      T S   ++
Sbjct: 299 SLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSI 358

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +DSG   T LP   YA +   F + +    ++        CY+ S  + + VP  R+ F 
Sbjct: 359 IDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVP--RIDFE 416

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMS--TDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
                 V   +      E     CL   +     D  I G        +V+D E  ++ +
Sbjct: 417 FAGGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGF 476

Query: 455 SHSKC 459
             + C
Sbjct: 477 GAAGC 481


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 96/401 (23%), Positives = 170/401 (42%), Gaps = 54/401 (13%)

Query: 70  WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALD 128
           ++R    V+   N  +   +    ++ +++    +Q  Y + Y+   +G+P    L  +D
Sbjct: 53  FQRVANAVRRSINRGNHFKKAFVSTDSAESTVVASQGEYLMRYS---VGSPPFQVLGIVD 109

Query: 129 AGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCKS 187
            GS++LW     +QC P    Y     +    +DPS S + K + CS   C+S R++  S
Sbjct: 110 TGSDILW-----LQCEPCEDCY----KQTTPIFDPSKSKTYKTLPCSSNTCESLRNTACS 160

Query: 188 LKDPCPYIADYSTEDTSSSGYLVDDILHLASF---SKHAPQSSVQSSVIIGCGRKQTGSY 244
             + C Y  DY  + + S G L  + L L S    S H P++      +IGCG    G++
Sbjct: 161 SDNVCEYSIDYG-DGSHSDGDLSVETLTLGSTDGSSVHFPKT------VIGCGHNNGGTF 213

Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQ 299
            +    +G   +GLG   V  +   +  I   FS C      + N S  + FGD    + 
Sbjct: 214 QE----EGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSG 269

Query: 300 QSTSFLPI----GEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTF 345
           + T   P+    G+ +  YF+ +E++ +G++ +                 ++DSG + T 
Sbjct: 270 RGTVSTPLDPLNGQVF--YFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTL 327

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRN 405
           LP E Y  +      ++  +R          CY  +S+E L +P +   F      V  N
Sbjct: 328 LPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTTSDE-LDLPVITAHFKGAD--VELN 384

Query: 406 HIFSF-PENEGFTVFCLTVMSTDGDYGIIG-QNFMMGHRIV 444
            I +F P  +G   F          +G +  QN ++G+ +V
Sbjct: 385 PISTFVPVEKGVVCFAFISSKIGAIFGNLAQQNLLVGYDLV 425


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 92/371 (24%), Positives = 152/371 (40%), Gaps = 50/371 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP     + LD GS+++W     +QCAP    Y     +    ++P  S S 
Sbjct: 42  YFTRIGVGTPPKYVYMVLDTGSDIVW-----LQCAPCKNCY----SQTDPVFNPVKSGSF 92

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             V C  PLC+   S  C   +  C Y   Y  + + ++G  V + L          + +
Sbjct: 93  AKVLCRTPLCRRLESPGCNQ-RQTCLYQVSYG-DGSYTTGEFVTETLTF--------RRT 142

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS- 286
               V +GCG    G ++  A   G+     G +S PS   +       FS C  +  + 
Sbjct: 143 KVEQVALGCGHDNEGLFVGAAGLLGLG---RGGLSFPSQAGRT--FNQKFSYCLVDRSAS 197

Query: 287 ---GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCLTQSGFQ--- 334
               SV FG+   A  ++  F P+    + D  Y+V +    +G    S +T S F+   
Sbjct: 198 SKPSSVVFGNS--AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDR 255

Query: 335 -----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
                 ++D G S T L    Y  +   F    SS + + + + +  CY+ S +  +KVP
Sbjct: 256 TGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVP 315

Query: 390 DMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 448
            + L F   + S    N++      +G   FC     T     IIG     G R+V+D  
Sbjct: 316 TVVLHFRGADVSLPASNYLIPV---DGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA 372

Query: 449 NLKLAWSHSKC 459
           + ++ +S   C
Sbjct: 373 SSRVGFSPRGC 383


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 154/368 (41%), Gaps = 63/368 (17%)

Query: 6   AICMLFGCILLDG-----SDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVE 60
           A+ ++F  + + G     ++ +SF+++L+HR S  +                 P  N+ E
Sbjct: 14  ALSIIFLTVSMSGFSLVQAEKLSFTTELIHRDSPNS-----------------PLFNASE 56

Query: 61  YLELLLSNDWKRQKTRV-KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
             ++ L+N  +R   RV +     ++S     FPS      F            I IG P
Sbjct: 57  TTDIRLANAVERSADRVNRFNDLISNSITAAEFPSILDNGDFL---------MKISIGIP 107

Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
               LV +  GS+L+W+PC  +   P +       + +L  +DP  SS+ KNV C    C
Sbjct: 108 PTELLVNVATGSDLVWIPC--LSFKPCTH------NCDLRFFDPMESSTYKNVPCDSYRC 159

Query: 180 KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
           +  ++       C Y  D   +D+   G L  D L L S +    +S +  +    CG +
Sbjct: 160 QITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNSTTG---KSFMLPNTGFICGNR 216

Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGP 296
             G Y       G++GLG G +S+ + ++   LI   FS C   +  N +  + FGD+  
Sbjct: 217 IGGDY----PGVGILGLGHGSLSLLNRISH--LIDGKFSHCIVPYSSNQTSKLSFGDKAV 270

Query: 297 ATQQ---STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA-------LVDSGASFTFL 346
            +     ST     G  Y +Y +      +GN  ++  G  +        +DSG  FT+ 
Sbjct: 271 VSGSAMFSTRLDMTGGPY-SYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGTMFTYF 329

Query: 347 PTEIYAEV 354
           P   Y+++
Sbjct: 330 PEYFYSQL 337


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 92/369 (24%), Positives = 152/369 (41%), Gaps = 51/369 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +++ + +G P   F + LD GS++ W+ CQ C  C       Y   D     +DP SSSS
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDC-------YQQTD---PIFDPRSSSS 204

Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
             ++ C    C++   S C++ K  C Y   Y  + + + G  V + L   +       S
Sbjct: 205 FASLPCESQQCQALETSGCRASK--CLYQVSYG-DGSFTVGEFVIETLTFGN-------S 254

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN-- 284
            + ++V +GCG    G +        V   GL  +   SL   + +  +SFS C  +   
Sbjct: 255 GMINNVAVGCGHDNEGLF--------VGSAGLLGLGGGSLSLTSQMKASSFSYCLVDRDS 306

Query: 285 -DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT---------QSGFQ 334
             S  + F    P+   +   L  G+    Y+VG+    +G   L+          SG+ 
Sbjct: 307 SSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYG 366

Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKVPD 390
             +VDSG + T L T+ Y  +    D  VS      + N +     CY+ SS+  + +P 
Sbjct: 367 GIIVDSGTAITRLQTQAYNTLR---DAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPT 423

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           +   F+  +S  +    +  P +   T FC     T     IIG     G R+ +D  N 
Sbjct: 424 VSFEFAGGKSLQLPPKNYLIPVDSVGT-FCFAFAPTTSSLSIIGNVQQQGTRVHYDLANS 482

Query: 451 KLAWSHSKC 459
            + +S  KC
Sbjct: 483 VVGFSPHKC 491


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 100/423 (23%), Positives = 183/423 (43%), Gaps = 66/423 (15%)

Query: 66  LSNDWKRQKTRVKLQSNN----NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
           L N + R  +RV +        NS +N L+ P+ G             ++  + IGTP V
Sbjct: 59  LRNAFSRSISRVNVFKTKAVDINSFQNDLV-PNGGE------------YFMKMSIGTPLV 105

Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK- 180
             +V  D GS+L WV  QC+ C P          +    +DPS SSS +++ C    C  
Sbjct: 106 EVIVIADTGSDLTWV--QCLPCDP-------CYRQKSPLFDPSRSSSYRHMLCGSRFCNA 156

Query: 181 ---SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
              S  +C    + C Y   YS  D S ++G L  +   + S S         S ++ GC
Sbjct: 157 LDVSEQACTMDTNICEY--HYSYGDKSYTNGNLATEKFTIGSTSSRPVH---LSPIVFGC 211

Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICF-----DENDSGSVF 290
           G    G++      +   G+        SL+++ + +I+  FS C        N +  + 
Sbjct: 212 GTGNGGTF-----DELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVTSKIK 266

Query: 291 FG-DQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCL----------TQSGFQALVD 338
           FG D   +  Q  S   + ++ D Y+ V +E+  +GN  L           + G   ++D
Sbjct: 267 FGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKG-NVIID 325

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           SG + TFL +E + E+    ++ V ++R+S     +  C+ ++ +  + +P + + F  N
Sbjct: 326 SGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVCFRSAGD--IDLPVIAVHF--N 381

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
            + V    + +F + +   + C T++S++   GI G    M   + +D E   +++  + 
Sbjct: 382 DADVKLQPLNTFVKADE-DLLCFTMISSN-QIGIFGNLAQMDFLVGYDLEKRTVSFKPTD 439

Query: 459 CEE 461
           C +
Sbjct: 440 CTK 442


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 110/463 (23%), Positives = 195/463 (42%), Gaps = 74/463 (15%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVK-LQSN 82
            ++KL+HR  +        ++  V       + +S+E  + L        ++++K L+S 
Sbjct: 38  LATKLIHR--NSYLHPLYDQNETVEDRSKREQTSSIERFDFL--------ESKIKELKSV 87

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCI 141
            N +R+ L+  + GS   F  N         + IG+P V+ LV +D GS+LLWV C  CI
Sbjct: 88  GNEARSSLIPFNRGSG--FLVN---------LSIGSPPVTQLVVVDTGSSLLWVQCLPCI 136

Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK-DPCPYIADYST 200
            C   S S+          +DP  S S K + C  P     +  K  + +   Y   Y  
Sbjct: 137 NCFQQSTSW----------FDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLG 186

Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
            D SS G L  + L   +  +   +   +S++  GCG     +  D A  +GV GLG   
Sbjct: 187 GD-SSQGILAKESLLFETLDEGKIK---KSNITFGCGHMNIKTNNDDAY-NGVFGLG--- 238

Query: 261 VSVPSLLAKAGLIQNSFSICFDENDS-----GSVFFGDQGPATQQSTSFLPIGEKYDAYF 315
            + P  +  A  + N FS C  + ++       +  G QG   +  ++  P+   +  Y+
Sbjct: 239 -AYPH-ITMATQLGNKFSYCIGDINNPLYTHNHLVLG-QGSYIEGDST--PLQIHFGHYY 293

Query: 316 VGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTE----IYAEVVVKFDKL 361
           V ++S  +G+  L    + F+         L+DSG ++T L       +Y E+V     L
Sbjct: 294 VTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGL 353

Query: 362 VSSKRISLQGNSWKYCYNA-SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 420
           +  +RI  Q      C+    S +++  P +   F+     V+ +   S     G   FC
Sbjct: 354 L--ERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESG--SLFRQHGGDRFC 409

Query: 421 LTVMSTDGD---YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           L ++ ++ +     +IG      + + FD E +K+ +    C+
Sbjct: 410 LAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQ 452


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 139/311 (44%), Gaps = 43/311 (13%)

Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS 159
            G     L Y   + IG+P V+  +++D GS++ WV C+ C QC       ++ +D   S
Sbjct: 122 LGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQC-------HSEVD---S 171

Query: 160 EYDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 213
            +DPS+SS+    SCS   C      +  + C S +  C YI  Y  + +S++G    D 
Sbjct: 172 LFDPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQ--CQYIVSY-VDGSSTTGTYSSDT 228

Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGL 272
           L L S +    Q         GC + ++G + D    DG+MGLG GD    SL+++ AG 
Sbjct: 229 LTLGSNAIKGFQ--------FGCSQSESGGFSD--QTDGLMGLG-GDAQ--SLVSQTAGT 275

Query: 273 IQNSFSICFDENDSGSVFFGDQGPATQQS---TSFLPIGEKYDAYFVGVESYCIGNSCLT 329
              +FS C      GS  F   G A++     T  L   +    Y V +E+  +G   L 
Sbjct: 276 FGKAFSYCLPPTP-GSSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLN 334

Query: 330 --QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
              S F A  ++DSG   T LP   Y+ +   F   +     +        C++ S +  
Sbjct: 335 IPTSVFSAGSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSS 394

Query: 386 LKVPDMRLIFS 396
           + +P + L+FS
Sbjct: 395 VSIPSVALVFS 405


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 91/375 (24%), Positives = 157/375 (41%), Gaps = 60/375 (16%)

Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSE 160
           F N  Y +    + +GTP       +D GS + W  C  C+ C            +N   
Sbjct: 375 FDNSVYLMK---LQVGTPPFEIEAVIDTGSEITWTQCLPCVHC----------YKQNAPI 421

Query: 161 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           +DPS SS+ K   C                 CPY  DY  + T + G L  D + + S S
Sbjct: 422 FDPSKSSTFKEKRCH-------------DHSCPYEVDYF-DKTYTKGTLATDTVTIHSTS 467

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
               +  V +  IIGCGR    S+   +  +G +GL  G +S+  +    G      S C
Sbjct: 468 G---EPFVMAETIIGCGRNN--SWFRPSF-EGFVGLNWGPLSL--ITQMGGEYPGLMSYC 519

Query: 281 FDENDSGSVFFGDQ---GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA 335
           F  N +  + FG     G     ST+      +   Y++ +++  +G++ +   G  F A
Sbjct: 520 FAGNGTSKINFGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHA 579

Query: 336 L-----VDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
           L     +DSG + T+ P E Y  +V +  + +V +   +    +   CY +++ E+  V 
Sbjct: 580 LEGNIVIDSGTTLTYFP-ESYCNLVRQAVEHVVPAVPAADPTGNDLLCYYSNTTEIFPVI 638

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM----STDGDYGIIGQ-NFMMGHRIV 444
            M   FS     V+  +   F E+    +FCL ++    + +  +G   Q NF++G    
Sbjct: 639 TMH--FSGGADLVLDKYNM-FMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVG---- 691

Query: 445 FDRENLKLAWSHSKC 459
           +D  +L +++  + C
Sbjct: 692 YDSSSLLVSFKPTNC 706



 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 84/343 (24%), Positives = 137/343 (39%), Gaps = 69/343 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP       LD GS L+W  C  C+ C           D+    +DPS SS+ K  
Sbjct: 69  LQIGTPPFEVEAVLDTGSELIWTQCLPCLHC----------YDQKAPIFDPSKSSTFKET 118

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
            C+ P              CPY   Y  + + + G L  + + + S S       V    
Sbjct: 119 RCNTP-----------DHSCPYKLVYD-DKSYTQGTLATETVTIHSTSG---VPFVMPET 163

Query: 233 IIGCGRKQTGSYLDGAAP--DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
           IIGC R  +GS   G  P   G++GL  G +S+ S +                   G  +
Sbjct: 164 IIGCSRNNSGS---GFRPSSSGIVGLSRGSLSLISQM-------------------GGAY 201

Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQAL-----VDSGASF 343
            GD       ST+      K   Y++ +++  +G++ +   G  F AL     +DSG   
Sbjct: 202 PGDG----VVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPL 257

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           T+ P      V    +++V++ R+     +   CY +++ E+   P + + FS     V+
Sbjct: 258 TYFPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNTIEIF--PVITVHFSGGADLVL 315

Query: 404 RNHIFSFPENEGFTVFCLTVMSTD-GDYGIIG----QNFMMGH 441
             +      N G  VFCL ++  +     I G     NF++G+
Sbjct: 316 DKYNMYMELNRG-GVFCLAIICNNPTQVAIFGNRAQNNFLVGY 357


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 93/415 (22%), Positives = 176/415 (42%), Gaps = 81/415 (19%)

Query: 93  PSEGSQTHFFGNQFYWLHYTWID--IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASY 150
           PS  S  + F ++F +     I   IGTP  +  + LD GS L W+ C   +  P     
Sbjct: 53  PSPSSPPYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP----- 107

Query: 151 YTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDT 203
                +  + +DPS SSS   + CSHPLCK R       +SC S +  C Y   Y+ + T
Sbjct: 108 -----KPKTSFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGT 160

Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
            + G LV + +  ++       + +   +I+GC  + +          G++G+  G +  
Sbjct: 161 FAEGNLVKEKITFSN-------TEITPPLILGCATESSDD-------RGILGMNRGRL-- 204

Query: 264 PSLLAKAGLIQNSFSICFDEND-----SGSVFFGDQG-------------PATQQSTSFL 305
            S +++A + + S+ I    N      +GS + GD               P +Q+  +  
Sbjct: 205 -SFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLD 263

Query: 306 PIGEKYDAYFVGVESYCIGNSCLTQSGF--------QALVDSGASFTFLPTEIY----AE 353
           P+   Y    +G+  + +    ++ S F        Q +VDSG+ FT L    Y    AE
Sbjct: 264 PLA--YTVPMIGIR-FGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAE 320

Query: 354 VVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKN-QSFVVRNHIFSFP 411
           ++ +  + +  K+  + G +   C++ +   + + + D+  +F++  + FV +  +    
Sbjct: 321 IMTRVGRRL--KKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGVEIFVPKERVLV-- 376

Query: 412 ENEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
            N G  + C+ +  +        IIG        + FD  N ++ ++ + C  V+
Sbjct: 377 -NVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADCSRVV 430


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 96/392 (24%), Positives = 164/392 (41%), Gaps = 77/392 (19%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP+ S  + LD GS L W+ C   +         TS       +DPS SSS  ++ 
Sbjct: 85  LPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTS-------FDPSLSSSFSDLP 137

Query: 174 CSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           CSHPLCK R       +SC S +  C Y   Y+ + T + G LV +    ++     P  
Sbjct: 138 CSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVKEKFTFSNSQTTPP-- 193

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN-- 284
                +I+GC ++ T          G++G+ LG +   S +++A + + S+ I    N  
Sbjct: 194 -----LILGCAKESTDV-------KGILGMNLGRL---SFISQAKISKFSYCIPTRSNRP 238

Query: 285 ---DSGSVFFGDQG-------------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
               +GS + G+               P +Q+  +  P+     AY V +    IG   L
Sbjct: 239 GLASTGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPL-----AYTVPLLGIRIGQKRL 293

Query: 329 T-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWK 375
                        SG Q +VDSG+ FT L    Y +V  +  +LV S  K+  + G++  
Sbjct: 294 NIPSSVFRPDAGGSG-QTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTAD 352

Query: 376 YCYNASSEEMLK--VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD---GDY 430
            C++ + + ++   + D+   F +    +V         N G  + C+ +  +       
Sbjct: 353 MCFDGNHQMVIGRLIGDLVFEFGRGVEILVEKQ--RLLVNVGGGIHCVGIGRSSMLGAAS 410

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            IIG        + FD  N ++ +S ++C  +
Sbjct: 411 NIIGNVHQQNLWVEFDVANRRVGFSKAECSRL 442


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 96/402 (23%), Positives = 163/402 (40%), Gaps = 69/402 (17%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVA-LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
           G+  Y +H   + IGTP    +V  LD GS+L+W  C C  C           D+ +  +
Sbjct: 90  GSSEYLIH---LGIGTPRPQRVVLHLDTGSDLVWTQCACTVC----------FDQPVPVF 136

Query: 162 DPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
             S S +   V CS PLC        S C +    C Y   Y  + + ++G + +D    
Sbjct: 137 RASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGY-MDHSITTGKMAEDTFTF 195

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
            +    A  ++   ++  GCG    G +    +  G+ G G G +S+PS L         
Sbjct: 196 KA-PDRADTAAAVPNIRFGCGMMNYGLFTPNQS--GIAGFGTGPLSLPSQLKV-----RR 247

Query: 277 FSICF---DENDSGSVFFGDQ---------GPATQQSTSF------LPIGEKYDAYFVGV 318
           FS CF   +E+    V  G +         GP   QST F       P+G +   YF+ +
Sbjct: 248 FSYCFTAMEESRVSPVILGGEPENIEAHATGPI--QSTPFAPGPAGAPVGSQ-PFYFLSL 304

Query: 319 ESYCIGNSCL--TQSGFQ--------ALVDSGASFTFLPTEIY---AEVVVKFDKLVSSK 365
               +G + L    S F           +DSG + TF P  ++    E  V    L  +K
Sbjct: 305 RGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAK 364

Query: 366 RISLQGNSWKYCYNASSEEML-KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV---FCL 421
             +   N    C++  +++    VP + L        + R +     +++G       C+
Sbjct: 365 GYTDPDN--LLCFSVPAKKKAPAVPKLILHLEGADWELPRENYVLDNDDDGSGAGRKLCV 422

Query: 422 TVMSTDGDYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCEEV 462
            ++S     G I  NF   +  IV+D E+ K+ ++ ++C+++
Sbjct: 423 VILSAGNSNGTIIGNFQQQNMHIVYDLESNKMVFAPARCDKL 464


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 101/417 (24%), Positives = 172/417 (41%), Gaps = 66/417 (15%)

Query: 61  YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTP 119
           Y++   S D K+       Q      ++ +  P+        G     L Y   + +G+P
Sbjct: 92  YIKRKFSGDVKKDG-----QGAGGVEQSHVTVPTT------LGTSLNTLEYLITVRLGSP 140

Query: 120 NVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
             +  V +D+GS++ WV C+ C+QC       ++ +D     +DPS SS+    SCS   
Sbjct: 141 AKTQTVLIDSGSDVSWVQCKPCLQC-------HSQVD---PLFDPSLSSTYSPFSCSSAA 190

Query: 179 C----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
           C    +  + C S    C YI  Y+ + +S++G    D L L S        +  S+   
Sbjct: 191 CAQLGQDGNGCSS-SSQCQYIVRYA-DGSSTTGTYSSDTLALGS--------NTISNFQF 240

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGSVFFG- 292
           GC   ++G + D    DG+MGLG G    PSL ++ AG    +FS C     S S F   
Sbjct: 241 GCSHVESG-FND--LTDGLMGLGGG---APSLASQTAGTFGTAFSYCLPPTPSSSGFLTL 294

Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--LVDSGASFTFLPT 348
             G +    T  L        Y V +E+  +G + L+   S F A  ++DSG   T LP 
Sbjct: 295 GAGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITRLPR 354

Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK------NQSFV 402
             Y+ +   F   +   R +   +    C++ S +  +++P + L+FS       + + +
Sbjct: 355 TAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLDANGI 414

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +  +  +F  N           S D   GI+G        +++D     + +    C
Sbjct: 415 ILGNCLAFAAN-----------SDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 92/370 (24%), Positives = 155/370 (41%), Gaps = 51/370 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
           + T + +GTP+ S+ + +D GS+L W     +QC+P       S  R +   +DP +SS+
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTW-----LQCSPC----VVSCHRQVGPLFDPRASST 184

Query: 169 SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
             +V CS   C        + S+C S  + C Y A Y  + + S GYL  D +   S S 
Sbjct: 185 YTSVRCSASQCDELQAATLNPSAC-SASNVCIYQASYG-DSSFSVGYLSTDTVSFGSTSY 242

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
                    S   GCG+   G +   A   G++GL    +S+   LA +  +  SFS C 
Sbjct: 243 --------PSFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSYCL 289

Query: 282 DENDSGSVFFGDQGP-ATQQSTSFLPIG-EKYDA--YFVGVESYCIGNSCLT-----QSG 332
               + S  +   GP  T    S+ P+     DA  YF+ +    +G S L       S 
Sbjct: 290 PT--AASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSS 347

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG   T LPT ++  +     + ++  + +   +    C+   + + L+VP + 
Sbjct: 348 LPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVV 406

Query: 393 LIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           + F+   S     RN +    ++      CL    TD    IIG        +++D    
Sbjct: 407 MAFAGGASMKLTTRNVLIDVDDS----TTCLAFAPTDST-AIIGNTQQQTFSVIYDVAQS 461

Query: 451 KLAWSHSKCE 460
           ++ +S   C 
Sbjct: 462 RIGFSAGGCS 471


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 101/406 (24%), Positives = 157/406 (38%), Gaps = 66/406 (16%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IGTP      ALD  S+L+W  C     AP               ++P  S++  +V C+
Sbjct: 106 IGTPPQQVSGALDISSDLVWTACGA--TAP---------------FNPVRSTTVADVPCT 148

Query: 176 HPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
              C+  +  +C +    C Y   Y     +++G L  +              +    V+
Sbjct: 149 DDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGD--------TRIDGVV 200

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----GSV 289
            GCG K  G +   +   GV+GLG G++S+ S L       + FS  F  +DS      +
Sbjct: 201 FGCGLKNVGDF---SGVSGVIGLGRGNLSLVSQLQV-----DRFSYHFAPDDSVDTQSFI 252

Query: 290 FFGDQG-PATQQ--STSFLPIGEKYDAYFVGVESYCI-GNSCLTQSG-FQALVDSGASFT 344
            FGD   P T    ST  L        Y+V +    + G      SG F      G+   
Sbjct: 253 LFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGV 312

Query: 345 FLP----TEIYAEVVVKFDKLVSSKRISL---QGNSW--KYCYNASSEEMLKVPDMRLIF 395
           FL       +  E   K  +   + +I L    G++     CY   S    KVP M L+F
Sbjct: 313 FLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVF 372

Query: 396 SKNQSFVVR-NHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRENLKLA 453
           +      +   + F      G    CLT++ S+ GD  ++G    +G  +++D    KL 
Sbjct: 373 AGGAVMELELGNYFYMDSTTGLA--CLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLV 430

Query: 454 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 499
           +     E +   +     PPP+G S      T QQ+     A+APP
Sbjct: 431 F-----ESLAQAA----APPPSGSSQQTSSKTNQQAGGRRSASAPP 467


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 96/408 (23%), Positives = 160/408 (39%), Gaps = 70/408 (17%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y L    + IG+   +    +D GS  + V C                 R+   +DP++S
Sbjct: 97  YALFSMQLGIGSLQKNLSAIIDTGSEAVLVQCG---------------SRSRPVFDPAAS 141

Query: 167 SSSKNVSCSHPLC---------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
            S + V C   LC          S   C +    C Y   Y  +  +S+G    D++ L 
Sbjct: 142 QSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSATCTYSLSYG-DSRNSTGDFSQDVIFLN 200

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
           S +  + Q+     V  GC     G  +D     G++G   G++S+PS L K  L  + F
Sbjct: 201 S-TNSSGQAVQFRDVAFGCAHSPQGFLVD-LGSLGIVGFNRGNLSLPSQL-KDRLGGSKF 257

Query: 278 SICF-----DENDSGSVFFGDQGPATQQSTSFLPIGE------KYDAYFVGVESYCIGNS 326
           S CF         +G +F GD G  ++    + P+ +      +   Y+VG+ S  +   
Sbjct: 258 SYCFPSQPWQPRATGVIFLGDSG-LSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGK 316

Query: 327 CLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ----- 370
            L   +S F+          ++DSG +FT +  + Y      F    +S R  L+     
Sbjct: 317 TLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAF---AASNRSGLRKKVGA 373

Query: 371 GNSWKYCYNASSEEML-KVPDMRLIFSKNQSFVVR-NHIF---SFPENEGFTVFCLTVMS 425
              +  CYN S+   L  VP++RL    N    +R  H+F   S   NE     CL ++S
Sbjct: 374 AAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNE--VTVCLAILS 431

Query: 426 TD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 469
           +     G   ++G      + + +D E  ++ +  + C        VH
Sbjct: 432 SQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADCSGAAGSFLVH 479


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 102/420 (24%), Positives = 171/420 (40%), Gaps = 67/420 (15%)

Query: 66  LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
           L    KR K R++  S   +S     F S        GN  + +    + IGTP  ++  
Sbjct: 61  LQRAMKRGKLRLQRLSAKTAS-----FESSVEAPVHAGNGEFLMK---LAIGTPAETYSA 112

Query: 126 ALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-- 182
            +D GS+L+W  C+ C  C           D+    +DP  SSS   + CS  LC +   
Sbjct: 113 IMDTGSDLIWTQCKPCKDC----------FDQPTPIFDPKKSSSFSKLPCSSDLCAALPI 162

Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
           SSC    D C Y+  Y  + +S+ G L  +       S         S +  GCG    G
Sbjct: 163 SSC---SDGCEYLYSYG-DYSSTQGVLATETFAFGDASV--------SKIGFGCGEDNDG 210

Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSGSVFFGDQGPAT 298
           S     A  G++GLG G +S+ S L      +  FS C     D     S+  G +  AT
Sbjct: 211 SGFSQGA--GLVGLGRGPLSLISQLG-----EPKFSYCLTSMDDSKGISSLLVGSE--AT 261

Query: 299 QQSTSFLPIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA--------LVDSGASFTF 345
            ++    P+ +   +   Y++ +E   +G++ L   +S F          ++DSG + T+
Sbjct: 262 MKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITY 321

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIF-SKNQSFVV 403
           L    +A +  +F   +              C+    +   + VP +   F   +     
Sbjct: 322 LEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVPQLVFHFEGADLKLPA 381

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF-DRENLKLAWSHSKCEEV 462
            N+I +   + G  V CLT+ S+ G   I G NF   + +V  D E   ++++ ++C ++
Sbjct: 382 ENYIIA---DSGLGVICLTMGSSSG-MSIFG-NFQQQNIVVLHDLEKETISFAPAQCNQL 436


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 87/374 (23%), Positives = 149/374 (39%), Gaps = 54/374 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP   +   LD GS+L+W  C  C+ C          +D+    +DP++SS+ +++
Sbjct: 96  MGIGTPARFYSAILDTGSDLIWTQCAPCLLC----------VDQPTPYFDPANSSTYRSL 145

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
            CS P C +       +  C Y   Y  +  S++G L ++     +         +    
Sbjct: 146 GCSAPACNALYYPLCYQKTCVYQYFYG-DSASTAGVLANETFTFGTNDTRVTLPRIS--- 201

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSV 289
             GCG    GS  +G+   G++G G G +S+ S L         FS C   F       +
Sbjct: 202 -FGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVRSRL 252

Query: 290 FFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSG 332
           +FG          +T QST F+        YF+ +    +G + L           T   
Sbjct: 253 YFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGT 312

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKYCYN--ASSEEMLK 387
              ++DSG + T+L    Y  V   F   ++S    L   + +    C+       + + 
Sbjct: 313 GGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVT 372

Query: 388 VPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           +P + L F   +    ++N++   P   G    CL  M+T  D  IIG        +++D
Sbjct: 373 LPQLVLHFDGADWELPLQNYMLVDPSTGG---LCL-AMATSSDGSIIGSYQHQNFNVLYD 428

Query: 447 RENLKLAWSHSKCE 460
            EN  L++  + C 
Sbjct: 429 LENSLLSFVPAPCN 442


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 91/372 (24%), Positives = 152/372 (40%), Gaps = 65/372 (17%)

Query: 54  PKKNSVE--YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY 111
           P  ++VE    ELL  +  + +  + KL  N+ S  + +   +  +     G+    L Y
Sbjct: 66  PAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAITLPTTLGSALDTLAY 125

Query: 112 T-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
              + IGTP ++  V +D GS++ WV C     A   +S +         +DP  SS+  
Sbjct: 126 VITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGA--GSSLF---------FDPGKSSTYT 174

Query: 171 NVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             SCS   C   + R +  SL   C Y   Y  + ++++G    D L L S  K      
Sbjct: 175 PFSCSSAACTRLEGRDNGCSLNSTCQYTVRYG-DGSNTTGTYGSDTLALNSTEK------ 227

Query: 228 VQSSVIIGCGRK-QTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDEND 285
              +   GC      G  LD    DG+MGLG G    PSL+++ A    ++FS C     
Sbjct: 228 -VENFQFGCSETSDPGEGLDEDQTDGLMGLGGG---APSLVSQTAATYGSAFSYCL---- 279

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA-----------------YFVGVESYCIGNS-- 326
                     PAT +S+ FL +G                      YFV ++   +G    
Sbjct: 280 ----------PATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPV 329

Query: 327 CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
            ++ + F A  ++DSG   T LP   Y+ +   F   +     +   +    C++ + ++
Sbjct: 330 AISPTVFAAGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQD 389

Query: 385 MLKVPDMRLIFS 396
            + +P + L+FS
Sbjct: 390 NVSIPAVELVFS 401


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 100/415 (24%), Positives = 179/415 (43%), Gaps = 59/415 (14%)

Query: 71  KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
           +R  +RV   S    ++N  +F ++ +Q+    NQ  +L      +GTP    L   D G
Sbjct: 59  RRSMSRVHHFS---PTKNSDIF-TDTAQSEMISNQGEYLM--KFSLGTPAFDILAIADTG 112

Query: 131 SNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCK 186
           S+L+W  C+ C QC           +++   +DP SSS+ +++SCS   C   K  +SC 
Sbjct: 113 SDLIWTQCKPCDQC----------YEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCS 162

Query: 187 SLKDP-CPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
              +  C Y   YS  D S +SG +  D + L S S    +  +    IIGCG    GS+
Sbjct: 163 GEGNKTCHY--SYSYGDRSFTSGNVAADTITLGSTSG---RPVLLPKAIIGCGHNNGGSF 217

Query: 245 LDGAAPDGVMGLGLGDVSVP-SLLAKAG-LIQNSFSICF-----DENDSGSVFFGDQGPA 297
            +  +    +         P SL+++ G  I   FS C      +  +S  + FG  G  
Sbjct: 218 TEKGSGIVGL------GGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGIV 271

Query: 298 TQQSTSFLP-IGEKYDA-YFVGVESYCIGN-------SCLTQSGFQALVDSGASFTFLPT 348
           +       P I +  D  YF+ +E+  +G+       S    S    ++DSG + T  P 
Sbjct: 272 SGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIIDSGTTLTLFPE 331

Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
           + ++E+       V+   +         CY+  ++  LK P +   F  + + V  N + 
Sbjct: 332 DFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDAD--LKFPSITAHF--DGADVKLNPLN 387

Query: 409 SFPE-NEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHRIVFDRENLKLAWSHSKCEE 461
           +F + ++    F    +++   +G + Q NF++G    +D E   +++  + C +
Sbjct: 388 TFVQVSDTVLCFAFNPINSGAIFGNLAQMNFLVG----YDLEGKTVSFKPTDCTQ 438


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 96/372 (25%), Positives = 151/372 (40%), Gaps = 51/372 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + +GTP     + LD GS+++W+  QC+ CA      Y   D     ++P++SS+ 
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWI--QCLPCAKC----YGQTD---PLFNPAASSTY 203

Query: 170 KNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP--Q 225
           + V C+ PLCK    S C++ K  C Y   Y            D    +  FS      +
Sbjct: 204 RKVPCATPLCKKLDISGCRN-KRYCEYQVSYG-----------DGSFTVGDFSTETLTFR 251

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DEN 284
             V   V +GCG    G ++  A   G+     G +S PS           FS C  D +
Sbjct: 252 GQVIRRVALGCGHDNEGLFIGAAGLLGLG---RGSLSFPS--QTGAQFSKRFSYCLVDRS 306

Query: 285 DSG---SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLTQ---SGFQ- 334
            SG   S+ FG    A  +S  F P+    K D  Y+V +    +G   LT    S F+ 
Sbjct: 307 ASGTASSLIFGK--AAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRM 364

Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
                   ++DSG S T L    Y+ +   F     + + +   + +  CY+ S  + +K
Sbjct: 365 DATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFDTCYDLSGLKTVK 424

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           VP +   F       +    +  P +   T FC       G   IIG     G+R+VFD 
Sbjct: 425 VPTLVFHFQGGAHISLPATNYLIPVDSSAT-FCFAFAGNTGGLSIIGNIQQQGYRVVFDS 483

Query: 448 ENLKLAWSHSKC 459
              ++ +    C
Sbjct: 484 LANRVGFKAGSC 495


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 157/372 (42%), Gaps = 51/372 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + +GTP     + LD GS+++W     +QC+P    Y     ++   ++P  S S 
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVW-----LQCSPCRKCY----SQSDPIFNPYKSKSF 160

Query: 170 KNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             + CS PLC+    S C + +  C Y   Y  + + ++G    + L          + +
Sbjct: 161 AGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYG-DGSFTTGDFATETLTF--------RGN 211

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL-IQNSFSICFDENDS 286
             + V +GCG    G ++  A    ++GLG G +S PS   + G+   + FS C  +  +
Sbjct: 212 KIAKVALGCGHHNEGLFVGAAG---LLGLGRGRLSFPS---QTGIRFNHKFSYCLVDRSA 265

Query: 287 ----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCLTQSGFQ-- 334
                S+ FGD   A  +   F P+    K D  Y+VG+    +G      ++ S F+  
Sbjct: 266 SSKPSSMVFGDA--AISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLD 323

Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
                  ++DSG S T L    Y  +   F       +   + + +  CY+ S +  +KV
Sbjct: 324 SAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKV 383

Query: 389 PDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           P + L F   + +    N++    EN     FC     T     IIG     G R+V+D 
Sbjct: 384 PTVVLHFRGADMALPATNYLIPVDENGS---FCFAFAGTISGLSIIGNIQQQGFRVVYDL 440

Query: 448 ENLKLAWSHSKC 459
              ++ ++   C
Sbjct: 441 AGSRIGFAPRGC 452


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 106/427 (24%), Positives = 167/427 (39%), Gaps = 54/427 (12%)

Query: 56  KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
           K   +  +L L  D KR +  V L + N S   +       S       Q    ++T I 
Sbjct: 76  KTPEQLFQLRLQRDAKRVEGVVALAALNQSHARRSGSSFSSSIISGLA-QGSGEYFTRIG 134

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP     + LD GS+++W     +QCAP     YT  D     +DP+ S +   + C 
Sbjct: 135 VGTPARYVYMVLDTGSDVVW-----LQCAPCRKC-YTQAD---PVFDPTKSRTYAGIPCG 185

Query: 176 HPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP--QSSVQSS 231
            PLC+   S  C +    C Y   Y            D       FS      + +  + 
Sbjct: 186 APLCRRLDSPGCNNKNKVCQYQVSYG-----------DGSFTFGDFSTETLTFRRTRVTR 234

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----G 287
           V +GCG    G ++  A    ++GLG G +S P    +       FS C  +  +     
Sbjct: 235 VALGCGHDNEGLFIGAAG---LLGLGRGRLSFPVQTGRR--FNQKFSYCLVDRSASAKPS 289

Query: 288 SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ------- 334
           SV FGD   A  ++  F P+    K D  Y++ +    +G S    L+ S F+       
Sbjct: 290 SVVFGDS--AVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNG 347

Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG S T L    Y  +   F    S  + + + + +  C++ S    +KVP + L
Sbjct: 348 GVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVL 407

Query: 394 IF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
            F   + S    N++    +N G   FC     T     IIG     G R+ FD    ++
Sbjct: 408 HFRGADVSLPATNYLIPV-DNSG--SFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRV 464

Query: 453 AWSHSKC 459
            ++   C
Sbjct: 465 GFAPRGC 471


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 92/372 (24%), Positives = 145/372 (38%), Gaps = 54/372 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP     +  D GS+L W      QC P   S Y    +    +DPS+S +  N+S
Sbjct: 158 VGLGTPKKDLSLIFDTGSDLTWT-----QCQPCVKSCYA---QQQPIFDPSTSKTYSNIS 209

Query: 174 CSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           C+   C S  S       C S    C Y   Y  + + + G+   D L L        Q+
Sbjct: 210 CTSAACSSLKSATGNSPGCSS--SNCVYGIQYG-DSSFTIGFFAKDKLTLT-------QN 259

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
            V    + GCG+   G +   A   G++GLG   +S+    A+       FS C      
Sbjct: 260 DVFDGFMFGCGQNNKGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314

Query: 285 DSGSVFFGD-----QGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLTQSG--F 333
            +G + FG+        A +   +F P     G  Y  YF+ V    +G   L+ S   F
Sbjct: 315 SNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAY--YFIDVLGISVGGKALSISPMLF 372

Query: 334 Q---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
           Q    ++DSG   T LP+  Y  +   F + +S    +   +    CY+ S+   + +P 
Sbjct: 373 QNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPK 432

Query: 391 MRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDR 447
           +   F+ N +  +  N I       G +  CL       D   GI G        +V+D 
Sbjct: 433 ISFNFNGNANVELDPNGIL---ITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDV 489

Query: 448 ENLKLAWSHSKC 459
              +L + +  C
Sbjct: 490 AGGQLGFGYKGC 501


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 109/428 (25%), Positives = 167/428 (39%), Gaps = 60/428 (14%)

Query: 57  NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL--FPSEGSQTHFFGNQFYWLHYTWI 114
           NS  +++L+ S  ++R   R+    + NS     +   P +   T   GN     +    
Sbjct: 88  NSSSWIDLV-SQSFERDNARLNTIRSKNSGPYTTMSNLPLQSGTTVGTGN-----YIVTA 141

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
             GTP  + L+ +D GS+L W     IQC P  A  Y+ +D   + ++P  SSS K + C
Sbjct: 142 GFGTPAKNSLLIIDTGSDLTW-----IQCKPC-ADCYSQVD---AIFEPKQSSSYKTLPC 192

Query: 175 SHPLCKSRSSCKSLKDP-----CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
               C    + +S   P     C Y  +Y  + +SS G    + L L S S         
Sbjct: 193 LSATCTELITSESNPTPCLLGGCVYEINYG-DGSSSQGDFSQETLTLGSDSFQ------- 244

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL-LAKAGLIQNSFSICF-DENDSG 287
            +   GCG   TG +       G++GLG   +S PS   +K G     F+ C  D   S 
Sbjct: 245 -NFAFGCGHTNTGLF---KGSSGLLGLGQNSLSFPSQSKSKYG---GQFAYCLPDFGSST 297

Query: 288 SVFFGDQGPAT-QQSTSFLPIGEKY---DAYFVGVESYCIGNSCLT-----QSGFQALVD 338
           S      G  +   S  F P+   +     YFVG+    +G   L+           +VD
Sbjct: 298 STGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVD 357

Query: 339 SGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
           SG   T L  + Y  +   F      L S+K  S+       CY+ S    +++P +   
Sbjct: 358 SGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSI----LDTCYDLSRHSQVRIPTITFH 413

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMST---DGDYGIIGQNFMMGHRIVFDRENLK 451
           F  N    V +     P   G +  CL   S    DG + IIG       R+ FD    +
Sbjct: 414 FQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDG-FNIIGNFQQQRMRVAFDTGAGR 472

Query: 452 LAWSHSKC 459
           + ++   C
Sbjct: 473 IGFASGSC 480


>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 260

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 85/175 (48%), Gaps = 20/175 (11%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y  + T + IGTP   F + +D GSN+ +VPC    C   S  Y    +     +   SS
Sbjct: 47  YGYYATKLYIGTPPQEFTLVVDTGSNMTFVPC----CG--SEEYCGKHED--PAFQTESS 98

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S+ + V+C HP C     C  L+  C Y   Y  + + S G L +DI+   + S+ APQ 
Sbjct: 99  STYQPVNC-HPSCD----CDYLRSQCSYKMHYG-DGSYSRGVLAEDIISFGNESEFAPQR 152

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
                ++ GC     GS L     DG++GLG G  ++   L   G+I +SFS+C+
Sbjct: 153 -----LVFGCELDAIGS-LYSLRADGIIGLGRGRSTIVDQLVDKGVISDSFSLCY 201


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 92/386 (23%), Positives = 155/386 (40%), Gaps = 57/386 (14%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
           G +   L+Y    +G       V +D  S L WV     QCAP ++ +    D+    +D
Sbjct: 118 GARLRTLNYV-ATVGLGGGEATVIVDTASELTWV-----QCAPCASCH----DQQGPLFD 167

Query: 163 PSSSSSSKNVSCSHPLCKS--------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 213
           P+SS S   + C+   C +          +C   + P C Y   Y  + + S G L  D 
Sbjct: 168 PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYR-DGSYSQGVLAHDK 226

Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGL 272
           L LA          V    + GCG    G +       G+MGLG   +S+ S  + + G 
Sbjct: 227 LSLAG--------EVIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGG 275

Query: 273 IQNSFSICF---DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDAYFVGVESYC 322
           +   FS C    +   SGS+  GD     + ST  +       P+   +  YFV +    
Sbjct: 276 V---FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTGIT 330

Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWKYCY 378
           IG   +  S  + +VDSG   T L   +Y    AE + +F +   +   S+       C+
Sbjct: 331 IGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI----LDTCF 386

Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIGQN 436
           N +    +++P ++ +F  N    V +    +  +   +  CL + S   +Y   IIG  
Sbjct: 387 NLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNY 446

Query: 437 FMMGHRIVFDRENLKLAWSHSKCEEV 462
                R++FD    ++ ++   C+ +
Sbjct: 447 QQKNLRVIFDTLGSQIGFAQETCDYI 472


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 92/373 (24%), Positives = 163/373 (43%), Gaps = 47/373 (12%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y + Y+   +GTP++     LD GS+++W+ CQ C +C           ++    +D S 
Sbjct: 89  YLISYS---VGTPSLQVFGILDTGSDIIWLQCQPCKKC----------YEQTTPIFDSSK 135

Query: 166 SSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           S + K + C    C+S     C S K  C Y   Y  + + S G L  + L L S +   
Sbjct: 136 SQTYKTLPCPSNTCQSVQGTFCSSRKH-CLYSIHY-VDGSQSLGDLSVETLTLGSTNG-- 191

Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
             S VQ    +IGCGR       +  +  G++GLG G +S+ + L+ +      FS C  
Sbjct: 192 --SPVQFPGTVIGCGRYNAIGIEEKNS--GIVGLGRGPMSLITQLSPS--TGGKFSYCLV 245

Query: 283 ---ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLT----QSGF 333
                 S  + FG+    + + T   P+  K     YF+ +E++ +G + +      SG 
Sbjct: 246 PGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGSGG 305

Query: 334 QA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM-LKVPD 390
           +   ++DSG + T LP  +Y+++     K V  +R+         CY  + +++   VP 
Sbjct: 306 KGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDASVPV 365

Query: 391 MRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIG-QNFMMGHRIVFDRE 448
           +   FS     V  N I +F +  +    F      T   +G +  QN ++G    +D +
Sbjct: 366 ITAHFSGAD--VTLNAINTFVQVADDVVCFAFQPTETGAVFGNLAQQNLLVG----YDLQ 419

Query: 449 NLKLAWSHSKCEE 461
              +++ H+ C +
Sbjct: 420 MNTVSFKHTDCTK 432


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 94/377 (24%), Positives = 150/377 (39%), Gaps = 59/377 (15%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ I IG+P     + LD GS++ W     +QCAP +  Y  S       +DP+ SSS 
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTW-----LQCAPCADCYAQSDPL----FDPALSSSY 246

Query: 170 KNVSCSHPLCKS--RSSCKS----LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
             V C  P C++   S+C +        C Y   Y         Y V D     + +   
Sbjct: 247 ATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYG-----DGSYTVGDFA-TETLTLGG 300

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
             S+    V IGCG    G ++  A    + G  L   S PS ++        FS C  +
Sbjct: 301 DGSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TEFSYCLVD 352

Query: 284 NDSGS---VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT-------- 329
            DS S   + FG    A+  ST   P+     +   Y+V +    +G   L+        
Sbjct: 353 RDSPSASTLQFG----ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFA 408

Query: 330 ---QSGFQALVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASS 382
              Q     +VDSG + T L +  Y+ +   F +    L  +  +SL    +  CY+ + 
Sbjct: 409 MDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSL----FDTCYDLAG 464

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
              ++VP + L F       +    +  P  +G   +CL   +T G   I+G     G R
Sbjct: 465 RSSVQVPAVSLRFEGGGELKLPAKNYLIPV-DGAGTYCLAFAATGGAVSIVGNVQQQGIR 523

Query: 443 IVFDRENLKLAWSHSKC 459
           + FD     + +S +KC
Sbjct: 524 VSFDTAKNTVGFSPNKC 540


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 92/386 (23%), Positives = 155/386 (40%), Gaps = 57/386 (14%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
           G +   L+Y    +G       V +D  S L WV     QCAP ++ +    D+    +D
Sbjct: 119 GARLRTLNYV-ATVGLGGGEATVIVDTASELTWV-----QCAPCASCH----DQQGPLFD 168

Query: 163 PSSSSSSKNVSCSHPLCKS--------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 213
           P+SS S   + C+   C +          +C   + P C Y   Y  + + S G L  D 
Sbjct: 169 PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYR-DGSYSQGVLAHDK 227

Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGL 272
           L LA          V    + GCG    G +       G+MGLG   +S+ S  + + G 
Sbjct: 228 LSLAG--------EVIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGG 276

Query: 273 IQNSFSICF---DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDAYFVGVESYC 322
           +   FS C    +   SGS+  GD     + ST  +       P+   +  YFV +    
Sbjct: 277 V---FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTGIT 331

Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWKYCY 378
           IG   +  S  + +VDSG   T L   +Y    AE + +F +   +   S+       C+
Sbjct: 332 IGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI----LDTCF 387

Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIGQN 436
           N +    +++P ++ +F  N    V +    +  +   +  CL + S   +Y   IIG  
Sbjct: 388 NLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNY 447

Query: 437 FMMGHRIVFDRENLKLAWSHSKCEEV 462
                R++FD    ++ ++   C+ +
Sbjct: 448 QQKNLRVIFDTLGSQIGFAQETCDYI 473


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 85/386 (22%), Positives = 157/386 (40%), Gaps = 51/386 (13%)

Query: 93  PSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYY 151
           PS    + + G+  Y ++   + IGTP   F   +D GS+L+W  CQ C QC        
Sbjct: 81  PSGVETSVYAGDGEYLMN---LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC-------- 129

Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
              +++   ++P  SSS   + CS  LC++ SS     + C Y   Y  + + + G +  
Sbjct: 130 --FNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYG-DGSETQGSMGT 186

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
           + L   S S          ++  GCG    G      A  G++G+G G +S+PS L    
Sbjct: 187 ETLTFGSVSI--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT- 235

Query: 272 LIQNSFSICFDENDSGS---VFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
                FS C     S +   +  G   +   A   +T+ +   +    Y++ +    +G+
Sbjct: 236 ----KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 291

Query: 326 SCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
           + L    S F           ++DSG + T+     Y  V  +F   ++   ++   + +
Sbjct: 292 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGF 351

Query: 375 KYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 433
             C+   S+   L++P   + F      +   + F  P N    + CL + S+     I 
Sbjct: 352 DLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFISPSNG---LICLAMGSSSQGMSIF 408

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKC 459
           G        +V+D  N  ++++ ++C
Sbjct: 409 GNIQQQNMLVVYDTGNSVVSFASAQC 434


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 90/428 (21%), Positives = 170/428 (39%), Gaps = 61/428 (14%)

Query: 51  DSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
           DS       E LE  +    +R + R++   N  S     ++  +G          Y ++
Sbjct: 49  DSGKNLTKFELLERAVERGSRRLQ-RLEAMLNGPSGVETPVYAGDGE---------YLMN 98

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
              + IGTP   F   +D GS+L+W  CQ C QC           +++   ++P  SSS 
Sbjct: 99  ---LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC----------FNQSTPIFNPQGSSSF 145

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             + CS  LC++  S     + C Y   Y  + + + G +  + L   S S         
Sbjct: 146 STLPCSSQLCQALQSPTCSNNSCQYTYGYG-DGSETQGSMGTETLTFGSVSI-------- 196

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 286
            ++  GCG    G      A  G++G+G G +S+PS L         FS C      ++S
Sbjct: 197 PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSNS 249

Query: 287 GSVFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGF 333
            ++  G   +   A   +T+ +   +    Y++ +    +G++ L          + +G 
Sbjct: 250 STLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT 309

Query: 334 QA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM-LKVPDM 391
              ++DSG + T+     Y  V   F   ++   ++   + +  C+   S++  L++P  
Sbjct: 310 GGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTF 369

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
            + F      +   + F  P N    + CL + S+     I G        +V+D  N  
Sbjct: 370 VMHFDGGDLVLPSENYFISPSNG---LICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSV 426

Query: 452 LAWSHSKC 459
           +++  ++C
Sbjct: 427 VSFLSAQC 434


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 94/379 (24%), Positives = 160/379 (42%), Gaps = 48/379 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +G P   FL+ +D GS+L W     +QC P  A +    D++   +DPS S+S 
Sbjct: 87  YFMDVFVGNPPRHFLLIIDTGSDLTW-----LQCKPCKACF----DQSGPVFDPSQSTSF 137

Query: 170 KNVSCS--------HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           K + C+        H  C+  SS  S K  C Y   Y  + + +SG L  + L + S S 
Sbjct: 138 KIIPCNAAACDLVVHDECRDNSSKTSPKT-CKYFYWYG-DSSRTSGDLALESLSV-SLSD 194

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
           H P S     ++IGCG    G +        ++GLG G +S PS L ++  I  SFS C 
Sbjct: 195 H-PSSLEIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSPIGQSFSYCL 249

Query: 282 DEND-----SGSVFFGDQGPATQQ--STSFLPIGEKYDA----YFVGVESYCIGNSCL-- 328
            +       S ++ FG     ++      F P     ++    Y++G++   I    L  
Sbjct: 250 VDRTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPI 309

Query: 329 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
                   T      ++DSG + T+L  + Y  V   F   +S  R     +    CYNA
Sbjct: 310 PAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-PFDILGICYNA 368

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
           +    +  P + ++F       +    +    +      CL ++ TDG   IIG      
Sbjct: 369 TGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MSIIGNFQQQN 427

Query: 441 HRIVFDRENLKLAWSHSKC 459
              ++D ++ +L ++++ C
Sbjct: 428 IHFLYDVQHARLGFANTDC 446


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 84/364 (23%), Positives = 146/364 (40%), Gaps = 43/364 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP     +  D GS+L W      QC P +   Y   D     + PS S++  N+S
Sbjct: 135 VGLGTPKKYLSLIFDTGSDLTWT-----QCQPCARYCYNQKD---PVFVPSQSTTYSNIS 186

Query: 174 CSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           CS P C    S       C + +  C Y   Y  + + S GY   + L L S       +
Sbjct: 187 CSSPDCSQLESGTGNQPGCSAAR-ACIYGIQYG-DQSFSVGYFAKETLTLTS-------T 237

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDEND 285
            V  + + GCG+   G +   A   G++GLG   +S+    A K G +   FS C  +  
Sbjct: 238 DVIENFLFGCGQNNRGLFGSAA---GLIGLGQDKISIVKQTAQKYGQV---FSYCLPKTS 291

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYD-AYFVGVE---------SYCIGNSCLTQSGFQA 335
           S + +    G     +  + PI + +  A F GV+            I +S  + SG  A
Sbjct: 292 SSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSG--A 349

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           ++DSG   T LP + Y+ +   F+K ++    + + +    CY+ S    +++P +  +F
Sbjct: 350 IIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVF 409

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
              +   +      +  +                  IIG       ++V+D    K+ + 
Sbjct: 410 KGGEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFG 469

Query: 456 HSKC 459
           ++ C
Sbjct: 470 YNGC 473


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 100/417 (23%), Positives = 185/417 (44%), Gaps = 65/417 (15%)

Query: 70  WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
            +R   R +L+    S++     PS  +  H  GN  + ++   + IGTP  ++   +D 
Sbjct: 61  LQRAVKRGRLRLQRLSAKTASFEPSVEAPVHA-GNGEFLMN---LAIGTPAETYSAIMDT 116

Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKS 187
           GS+L+W      QC P    +    D+    +DP  SSS   + CS  LC +   SSC  
Sbjct: 117 GSDLIWT-----QCKPCKVCF----DQPTPIFDPEKSSSFSKLPCSSDLCVALPISSC-- 165

Query: 188 LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLD 246
             D C Y   Y  + +S+ G L  +       S         S +  GCG    G +Y  
Sbjct: 166 -SDGCEYRYSYG-DHSSTQGVLATETFTFGDASV--------SKIGFGCGEDNRGRAYSQ 215

Query: 247 GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVFFGDQGPATQQSTSF 304
           GA   G++GLG G +   SL+++ G+ + S+ +   ++  G  ++  G +  AT +S   
Sbjct: 216 GA---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTLLVGSE--ATVKSAIP 267

Query: 305 LPIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA--------LVDSGASFTFLPTEIY 351
            P+ +   +   Y++ +E   +G++ L   +S F          ++DSG + T+L    +
Sbjct: 268 TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAF 327

Query: 352 AEVVVKFDKLVSSKRISLQGN---SWKYCYNASSE-EMLKVPDMRLIFSK-NQSFVVRNH 406
           A +  +F   +S  ++ +  +     + C+    +   ++VP +   F   +      N+
Sbjct: 328 AALKKEF---ISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLKLPKENY 384

Query: 407 IFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF-DRENLKLAWSHSKCEEV 462
           I    E+    V CLT+ S+ G   I G NF   + +V  D E   ++++ ++C ++
Sbjct: 385 II---EDSALRVICLTMGSSSG-MSIFG-NFQQQNIVVLHDLEKETISFAPAQCNQL 436


>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
 gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
          Length = 433

 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 95/405 (23%), Positives = 160/405 (39%), Gaps = 56/405 (13%)

Query: 84  NSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--C 140
           N + + L+FP         GN +   +Y   + IG P   + + +D GS+L W+ C   C
Sbjct: 51  NRAGSSLVFP-------LHGNVYPAGYYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPC 103

Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS--SCKSLKDP--CPYIA 196
            QC              +    P    S+  V C  PLC S       + +DP  C Y  
Sbjct: 104 RQC--------------IEAPHPLYRPSNNLVICEDPLCASLQPPGVHNCQDPDQCDYEV 149

Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
           +Y+ +  SS G LV D+  L +F+       +   + +GCG  Q     +    DG++GL
Sbjct: 150 EYA-DGGSSLGVLVKDVFVL-NFTN---GKRLNPLLALGCGYDQLPGRSNHPL-DGILGL 203

Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY-DAYF 315
           G G  S+PS L+  GL+ N    C      G +FFG+         ++ P+   +   Y 
Sbjct: 204 GRGISSIPSQLSSQGLVSNVIGHCLSGRGGGFLFFGED-IYDSSGVTWTPMSRDHLKHYS 262

Query: 316 VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNS 373
            G                  + DSG+S+T+L  + Y  +V    + +S K IS  L   +
Sbjct: 263 PGFAELIFDGKSTGIRNLLVVFDSGSSYTYLNAQAYQHLVFSLKRELSRKPISEALDDQT 322

Query: 374 WKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENEGFTVF------CL 421
              C+         + D++  F       K  S       F F   E + +       CL
Sbjct: 323 LPLCWKG-KRPFKSIRDVKKYFKPFALVFKTSSGRSSKTQFEF-SPEAYLIISSKGNACL 380

Query: 422 TVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            +++       D  +IG   M+   ++++ E   + W+ + C+ +
Sbjct: 381 GILNGTEVGLRDLNVIGDVSMLDRLVIYNNEKQMIGWAAASCDRL 425


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 87/360 (24%), Positives = 147/360 (40%), Gaps = 42/360 (11%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y + Y+   +GTP    L  +D GS + W+ CQ C  C           ++    +DPS 
Sbjct: 97  YLMSYS---VGTPPFEILGVVDTGSGITWMQCQRCEDC----------YEQTTPIFDPSK 143

Query: 166 SSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           S + K + CS  +C+S     SC S K  C Y   Y  + + S G L  + L L S +  
Sbjct: 144 SKTYKTLPCSSNMCQSVISTPSCSSDKIGCKYTIKYG-DGSHSQGDLSVETLTLGSTNG- 201

Query: 223 APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              SSVQ  + +IGCG    G++    +    +G G   +      +  G      +  F
Sbjct: 202 ---SSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMF 258

Query: 282 DENDSGSVF-FGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL--------- 328
            +++S S   FGD    +       P+  K  +   Y++ +E++ +G+  +         
Sbjct: 259 SQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSS 318

Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
             +      ++DSG + T LP E Y+ +       + + R+S   N    CY  +    L
Sbjct: 319 GSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQL 378

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHRIV 444
            VP +   F      V  N I +F +  EG   F          +G + Q N ++G+ ++
Sbjct: 379 DVPVITAHFKGAD--VELNPISTFVQVAEGVVCFAFHSSEVVSIFGNLAQLNLLVGYDLM 436


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 156/396 (39%), Gaps = 72/396 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +Y  + +GTP V  ++ +D GS++ W+ C  C  C P               ++P  SSS
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALR----------PPFNPRHSSS 188

Query: 169 SKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL--HLASFSK 221
              + C+   C +     +  C      C +   Y  + + SSG L  + +  +  +F  
Sbjct: 189 FFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYG-DGSLSSGLLAMETIAGNTPNFGD 247

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
             P     S++ +GC          GA+  G++G+    +S PS L+        FS CF
Sbjct: 248 GEPVK--LSNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YARKFSHCF 301

Query: 282 DE-----NDSGSVFFGD--------------QGPATQQSTSFLPIGEKYDAYFVGVESYC 322
            +     N SG VFFG+              Q PA   ++         D Y+VG+    
Sbjct: 302 PDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSAS--------LDYYYVGLVGIS 353

Query: 323 IGNSCL------------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
           +  S L            T SG   ++DSG +FT+L    +  +  +F    S       
Sbjct: 354 VDESRLPLSHKNFDIDKVTGSG-GTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDD 412

Query: 371 GNSWKYCYNASSE----EMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLT-V 423
            + +  CYN +S     E   +P + L F      V+  +    P   +E  T  CL  +
Sbjct: 413 NSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFL 472

Query: 424 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           MS D  + IIG        + +D E L+L  + ++C
Sbjct: 473 MSGDIPFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 508


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 90/428 (21%), Positives = 169/428 (39%), Gaps = 61/428 (14%)

Query: 51  DSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
           DS       E LE  +    +R + R++   N  S     ++  +G          Y ++
Sbjct: 49  DSGKNLTKFELLERAVERGSRRLQ-RLEAMLNGPSGVETPVYAGDGE---------YLMN 98

Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
              + IGTP   F   +D GS+L+W  CQ C QC           +++   ++P  SSS 
Sbjct: 99  ---LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC----------FNQSTPIFNPQGSSSF 145

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             + CS  LC++  S     + C Y   Y  + + + G +  + L   S S         
Sbjct: 146 STLPCSSQLCQALQSPTCSNNSCQYTYGYG-DGSETQGSMGTETLTFGSVSI-------- 196

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 286
            ++  GCG    G      A  G++G+G G +S+PS L         FS C      + S
Sbjct: 197 PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSTS 249

Query: 287 GSVFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGF 333
            ++  G   +   A   +T+ +   +    Y++ +    +G++ L          + +G 
Sbjct: 250 STLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT 309

Query: 334 QA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM-LKVPDM 391
              ++DSG + T+     Y  V   F   ++   ++   + +  C+   S++  L++P  
Sbjct: 310 GGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTF 369

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
            + F      +   + F  P N    + CL + S+     I G        +V+D  N  
Sbjct: 370 VMHFDGGDLVLPSENYFISPSNG---LICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSV 426

Query: 452 LAWSHSKC 459
           +++  ++C
Sbjct: 427 VSFLFAQC 434


>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 89/379 (23%), Positives = 148/379 (39%), Gaps = 50/379 (13%)

Query: 102 FGNQFYWLHYTWI---DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRN 157
           F N   W +Y+++    +GTP  +  V +D  S+L WV C+ CI    +           
Sbjct: 115 FANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPCINACLIPT--------- 165

Query: 158 LSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 210
              ++P++SS+ K V C   LC        +R SC +  + C Y   Y  + + S G + 
Sbjct: 166 ---FNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYH-DYSLSVGVVS 221

Query: 211 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 270
            D L     S+           I GC     G    G    G++G+ +   S+ S +   
Sbjct: 222 SDTLTYGLGSQ---------KFIFGCCNLFRGV---GGRYSGILGMSVNKFSLFSQMT-V 268

Query: 271 GLIQNSFSICFDE-NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 328
           G    + S CF    + G + FG +    +    F P+    + YFV V +  +    L 
Sbjct: 269 GHRYRAMSYCFPHPRNQGFLQFG-RYDEHKSLLRFTPLYIDGNNYFVHVSNVMVETMSLD 327

Query: 329 -TQSGFQAL---VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS-- 382
              SG Q +    D+G  +T LP  ++  +      LV      +  ++ + C+ A    
Sbjct: 328 VQSSGNQTMRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEGY-YRVGASTGQTCFQADGNW 386

Query: 383 -EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
            E  L +P +++ F       + +    F E     VFCL     DG   ++G   +MG 
Sbjct: 387 IEGDLYMPTVKIEFQNGARITLNSEDLMFMEEP--NVFCLAFKMNDGGDIVLGSRHLMGV 444

Query: 442 RIVFDRENLKLAWSHSKCE 460
             V D E + +      C 
Sbjct: 445 HTVVDLEMMTMGLRGQGCN 463


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 88/365 (24%), Positives = 149/365 (40%), Gaps = 40/365 (10%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           + T + +GTP   +++ +D GS+L W     +QC+P   S +    ++   +DP +SSS 
Sbjct: 117 YVTRMGLGTPAKPYIMVVDTGSSLTW-----LQCSPCRVSCH---RQSGPVFDPKTSSSY 168

Query: 170 KNVSCSHPLCKSRSSCK------SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
             VSCS P C   S+        S  + C Y A Y  + + S GYL  D +   + S   
Sbjct: 169 AAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYG-DSSFSVGYLSKDTVSFGANSV-- 225

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-D 282
                  +   GCG+   G +   A   G+MGL    +S+  L   A  +  SFS C   
Sbjct: 226 ------PNFYYGCGQDNEGLFGRSA---GLMGLARNKLSL--LYQLAPTLGYSFSYCLPS 274

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-----GFQALV 337
            + SG +  G   P     T  +        YF+ +    +    L  S         ++
Sbjct: 275 TSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTII 334

Query: 338 DSGASFTFLPTEIYAEV--VVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           DSG   T LPT +Y  +   V      S+KR +   +    C+   + ++  VP + + F
Sbjct: 335 DSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAY-SILDTCFEGQASKLRAVPAVSMAF 393

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
           S   +  +        + +G T  CL   +      IIG        +V+D ++ ++ ++
Sbjct: 394 SGGATLKLSAGNL-LVDVDGATT-CL-AFAPARSAAIIGNTQQQTFSVVYDVKSNRIGFA 450

Query: 456 HSKCE 460
            + C 
Sbjct: 451 AAGCS 455


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 100/395 (25%), Positives = 151/395 (38%), Gaps = 72/395 (18%)

Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
           V +D GS+++W PC   +C      +            P + S S  +SC    C +  +
Sbjct: 107 VYMDTGSDIVWFPCSPFECILCEGKF------EPGTLTPLNVSKSSLISCKSRACSTAHN 160

Query: 185 CKSLKDPCPY----IADYSTEDTS-----SSGYLVDDILHLASFSKH---APQSSVQ--- 229
             S  D C      + +  T D S     S  Y   D   +A   KH    P +S +   
Sbjct: 161 SPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNLIMPSTSNKPFS 220

Query: 230 -SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL-IQNSFSIC-----FD 282
                 GC     G       P GV G G G +S+P+ LA     + N FS C     FD
Sbjct: 221 LKDFTFGCAHSALGE------PIGVAGFGFGSLSLPAQLANLSPDLGNQFSYCLVSHSFD 274

Query: 283 ENDS--------GSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLT---- 329
                       G V   D    TQ   + +    K+  ++ V +E+  +G+S +     
Sbjct: 275 STKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSSRVRAPNA 334

Query: 330 ------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV------SSKRISLQGNSWKYC 377
                       +VDSG ++T LPT  Y  V  + D+ V      +S+  S  G S  Y 
Sbjct: 335 LIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYY 394

Query: 378 YNASSEEMLK--VPDMRLIFSKNQSFVV--RNHIFSF----PENEGFTVFCLTVM----- 424
              +  E L   VP +   F  N S V+  RN+ + F     E +G  V CL +M     
Sbjct: 395 LEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLMLMDGGDE 454

Query: 425 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           S  G    +G     G ++V+D E  ++ ++  KC
Sbjct: 455 SEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 80/329 (24%), Positives = 142/329 (43%), Gaps = 34/329 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +++G+P  S L   D GS+L+WV C+       SA+  T      +++DPS SS+   VS
Sbjct: 105 VNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPT------TQFDPSRSSTYGRVS 158

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL-ASFSKHAPQSSVQS 230
           C    C++  R++C    + C Y+  Y  + ++++G L  +        S  +P+     
Sbjct: 159 CQTDACEALGRATCDDGSN-CAYLYAYG-DGSNTTGVLSTETFTFDDGGSGRSPRQVRVG 216

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSG 287
            V  GC     GS+          G     VS+ + L  A  +   FS C      N S 
Sbjct: 217 GVKFGCSTATAGSFPADGLVGLGGGA----VSLVTQLGGATSLGRRFSYCLVPHSVNASS 272

Query: 288 SVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG-FQALVDSGASFT 344
           ++ FG     T+   +  P+  G+    Y V ++S  +GN  +  +   + +VDSG + T
Sbjct: 273 ALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSRIIVDSGTTLT 332

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK---VPDMRLIFSKNQSF 401
           FL   +   +V +  + ++   +       + CYN +  E+     +PD+ L F    + 
Sbjct: 333 FLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAV 392

Query: 402 VVRNHIFSFPENEGFTV----FCLTVMST 426
            ++      PEN    V     CL +++T
Sbjct: 393 ALK------PENAFVAVQEGTLCLAIVAT 415


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 95/380 (25%), Positives = 153/380 (40%), Gaps = 47/380 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP   F++  D GS+L WV C        S+S   +       + P+ S S 
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPS----SSSSSPAASPPQRVFRPAGSKSW 159

Query: 170 KNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHA 223
             + C    CKS      ++C S  DPC Y  DY  +D SS+ G +  D   ++      
Sbjct: 160 SPLPCDSDTCKSYVPFSLANCSSPPDPCSY--DYRYKDNSSARGVVGLDSATVSLSGNDG 217

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-- 281
            + +    V++GC     G     +  DGV+ LG  ++S  S    A      FS C   
Sbjct: 218 TRKAKLQEVVLGCTTSYDGQSFKSS--DGVLSLGNSNISFAS--RAASRFGGRFSYCLVD 273

Query: 282 ---DENDSGSVFFGDQGPATQQSTSF--LPIGEKYDA-----YFVGVESYCIGNSCLT-- 329
                N +  + FG+   +    +S    P+    DA     YFV V++  +    L   
Sbjct: 274 HLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEIL 333

Query: 330 ------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASS 382
                 +    A++DSG S T L T  Y  VV    K      R+++  + ++YCYN + 
Sbjct: 334 PDVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNM--DPFEYCYNWTG 391

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GIIGQNFMM 439
               ++P M L F+   +       +      G  V C+ V+  +G +    +IG     
Sbjct: 392 VSA-EIPRMELRFAGAATLAPPGKSYVIDTAPG--VKCIGVV--EGAWPGVSVIGNILQQ 446

Query: 440 GHRIVFDRENLKLAWSHSKC 459
            H   FD  N  L +  S+C
Sbjct: 447 EHLWEFDLANRWLRFKQSRC 466


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 163/368 (44%), Gaps = 46/368 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +   I +G P   F +  D GS++ W+ CQ   CA    + Y   D     +DP SSSS 
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ--PCAS-ENTCYKQFD---PIFDPKSSSSY 201

Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             +SC+   CK   +++C S  D C Y   Y  + + ++G L  + L   + S   P   
Sbjct: 202 SPLSCNSQQCKLLDKANCNS--DTCIYQVHYG-DGSFTTGELATETLSFGN-SNSIP--- 254

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
              ++ IGCG    G +  GA   G+           SL ++  L  +SFS C    D +
Sbjct: 255 ---NLPIGCGHDNEGLFAGGAGLIGLG------GGAISLSSQ--LKASSFSYCLVNLDSD 303

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL---------TQSGFQ 334
            S ++ F    P+    TS L   +++ +Y +V V    +G   L          +SG  
Sbjct: 304 SSSTLEFNSNMPS-DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLG 362

Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             +VDSG   + LP+++Y  +   F KL SS   +   + +  CYN S +  ++VP +  
Sbjct: 363 GIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAF 422

Query: 394 IFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
           + S+  S  +  RN++    +  G   +CL  + T     IIG     G R+ +D  N  
Sbjct: 423 VLSEGTSLRLPARNYLIML-DTAG--TYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSL 479

Query: 452 LAWSHSKC 459
           + +S +KC
Sbjct: 480 VGFSTNKC 487


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 104/403 (25%), Positives = 165/403 (40%), Gaps = 53/403 (13%)

Query: 27  KLVHRFSDEAKERWISKSGNVSVADSW-PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNS 85
           +L HR    A  R  S +   SVAD+    +   EY+        +R   R     ++ +
Sbjct: 69  RLTHRHGPCAPSRASSLAAP-SVADTLRADQRRAEYI-------LRRVSGRAPQLWDSKA 120

Query: 86  SRNQLLFPSEGSQTHFFGNQFYWLHYTWI-DIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
           +      P+       +G     L+Y     +GTP V+  + +D GS+L WV C+    A
Sbjct: 121 AAAAATVPAS------WGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAA 174

Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS---SCKSLKDPCPYIADYSTE 201
           P   S Y+  D     +DP+ SSS   V C  P+C       +       C Y+  Y  +
Sbjct: 175 P---SCYSQKD---PLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYG-D 227

Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
            ++++G    D L L++       SS       GCG  Q+G + +G   DG++GLG    
Sbjct: 228 GSNTTGVYSSDTLTLSA-------SSAVQGFFFGCGHAQSGLF-NGV--DGLLGLGR--- 274

Query: 262 SVPSLLAK-AGLIQNSFSICFDENDS--GSVFFGDQGPATQ----QSTSFLPIGEKYDAY 314
             PSL+ + AG     FS C     S  G +  G  GP+       +T  LP       Y
Sbjct: 275 EQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYY 334

Query: 315 FVGVESYCIGNSCLT--QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
            V +    +G   L+   S F    +VD+G   T LP   YA +   F   ++S      
Sbjct: 335 VVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTA 394

Query: 371 GNS--WKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSF 410
            ++     CYN +    + +P++ L F    + ++  + I SF
Sbjct: 395 PSNGILDTCYNFAGYGTVTLPNVALTFGSGATVMLGADGILSF 437


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 100/416 (24%), Positives = 184/416 (44%), Gaps = 65/416 (15%)

Query: 71  KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
           +R   R +L+    S++     PS  +  H  GN  + ++   + IGTP  ++   +D G
Sbjct: 62  QRAVKRGRLRLQRLSAKTASFEPSVEAPVHA-GNGEFLMN---LAIGTPAETYSAIMDTG 117

Query: 131 SNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSL 188
           S+L+W      QC P    +    D+    +DP  SSS   + CS  LC +   SSC   
Sbjct: 118 SDLIWT-----QCKPCKVCF----DQPTPIFDPEKSSSFSKLPCSSDLCVALPISSC--- 165

Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDG 247
            D C Y   Y  + +S+ G L  +       S         S +  GCG    G +Y  G
Sbjct: 166 SDGCEYRYSYG-DHSSTQGVLATETFTFGDAS--------VSKIGFGCGEDNRGRAYSQG 216

Query: 248 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVFFGDQGPATQQSTSFL 305
           A   G++GLG G +   SL+++ G+ + S+ +   ++  G  ++  G +  AT +S    
Sbjct: 217 A---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTLLVGSE--ATVKSAIPT 268

Query: 306 PIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA--------LVDSGASFTFLPTEIYA 352
           P+ +   +   Y++ +E   +G++ L   +S F          ++DSG + T+L    +A
Sbjct: 269 PLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFA 328

Query: 353 EVVVKFDKLVSSKRISLQGN---SWKYCYNASSE-EMLKVPDMRLIFSK-NQSFVVRNHI 407
            +  +F   +S  ++ +  +     + C+    +   + VP +   F   +      N+I
Sbjct: 329 ALKKEF---ISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKENYI 385

Query: 408 FSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF-DRENLKLAWSHSKCEEV 462
               E+    V CLT+ S+ G   I G NF   + +V  D E   ++++ ++C ++
Sbjct: 386 I---EDSALRVICLTMGSSSG-MSIFG-NFQQQNIVVLHDLEKETISFAPAQCNQL 436


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 98/378 (25%), Positives = 151/378 (39%), Gaps = 67/378 (17%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP   F + +D+GS+LLWV     QC+P    Y  + D  L  Y PS+SS+   V C 
Sbjct: 70  LGTPPQKFSLIVDSGSDLLWV-----QCSPCRQCY--AQDSPL--YVPSNSSTFSPVPCL 120

Query: 176 HPLCKSRSSCKSLKDPCPY------IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
              C    + +    PC +        +Y   DTSSS         + ++          
Sbjct: 121 SSDCLLIPATEGF--PCDFRYPGACAYEYLYADTSSSK-------GVFAYESATVDGVRI 171

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
             V  GCG    GS+   AA  GV+GLG G +S  S +  A    N F+ C        +
Sbjct: 172 DKVAFGCGSDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLDPTS 226

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL--TQSGFQ----- 334
            S S+ FGD+  +T     + PI     +   Y+V +E   +G   L  + S ++     
Sbjct: 227 VSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLG 286

Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASSEEMLKVPD 390
              ++ DSG + T+     Y+ ++  FD  V   R  S+QG     C   +  +    P 
Sbjct: 287 NGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQG--LDLCVELTGVDQPSFPS 344

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGF------TVFCLT---VMSTDGDYGIIGQNFMMGH 441
             + F     F         PE E +       V CL    + S  G +  IG       
Sbjct: 345 FTIEFDDGAVFQ--------PEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNF 396

Query: 442 RIVFDRENLKLAWSHSKC 459
            + +DRE   + ++ +KC
Sbjct: 397 FVQYDREENLIGFAPAKC 414


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 150/364 (41%), Gaps = 54/364 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP  + L+A+D  ++  W+PC  C  CA              + + P  S++ KNVSC
Sbjct: 84  IGTPPQTLLLAMDTSNDAAWIPCTACDGCAS-------------TLFAPEKSTTFKNVSC 130

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
           + P CK   +       C +   Y +  +S +  LV D + LA  +   P      S   
Sbjct: 131 AAPECKQVPNPGCGVSSCNFNLTYGS--SSIAANLVQDTITLA--TDPVP------SYTF 180

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVF 290
           GC  K TG+    A P G++GLG G +S+ S      L Q++FS C       N SGS+ 
Sbjct: 181 GCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFSGSLR 235

Query: 291 FGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDS 339
            G    P   + T  L    +   Y+V +E+  +G   +            +G   + DS
Sbjct: 236 LGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDS 295

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
           G  FT L   +Y  V  +F + V  K        +  CYN      + VP +  IF+   
Sbjct: 296 GTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVP----IVVPTITFIFTGMN 351

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVM----STDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
             + +++I     +   +  CL +     + +    +I       HR+++D  N ++  +
Sbjct: 352 VTLPQDNILI--HSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVA 409

Query: 456 HSKC 459
              C
Sbjct: 410 RELC 413


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 91/374 (24%), Positives = 139/374 (37%), Gaps = 49/374 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP   F +  D GS L WV C      P               + P +S S 
Sbjct: 91  YFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGLV------------FRPEASKSW 138

Query: 170 KNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
             V CS   CK     S ++C S   PC Y   Y      + G +  D   +A       
Sbjct: 139 APVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVA 198

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
           Q      V++GC     G        DGV+ LG   +S  S    A     SFS C    
Sbjct: 199 Q---LQDVVLGCSSTHDGQSFKSV--DGVLSLGNAKISFASR--AAARFGGSFSYCLVDH 251

Query: 282 --DENDSGSVFFG----DQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLTQS 331
               N +G + FG     + PATQ      P     G K DA  V  ++  I        
Sbjct: 252 LAPRNATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPK 311

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYN--ASSEEMLKV 388
               ++DSG + T L T  Y  VV    KL++   ++      +++CYN  A      ++
Sbjct: 312 SGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFP--PFEHCYNWTAPRPGAPEI 369

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GIIGQNFMMGHRIVF 445
           P + + F+           +      G  V C+ +   +G++    +IG      H   F
Sbjct: 370 PKLAVQFTGCARLEPPAKSYVIDVKPG--VKCIGLQ--EGEWPGVSVIGNIMQQEHLWEF 425

Query: 446 DRENLKLAWSHSKC 459
           D +N+++ +  S C
Sbjct: 426 DLKNMEVRFMPSTC 439


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 112/473 (23%), Positives = 198/473 (41%), Gaps = 81/473 (17%)

Query: 24  FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVK-LQSN 82
            ++KL+HR  +        ++  V       + +S+E  + L        ++++K L+S 
Sbjct: 38  LATKLIHR--NSYLHPLYDQNETVEDRSKREQTSSIERFDFL--------ESKIKELKSV 87

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCI 141
            N +R+ L+  + GS   F  N         + IG+P V+ LV +D GS+LLWV C  CI
Sbjct: 88  GNEARSSLIPFNRGSG--FLVN---------LSIGSPPVTQLVVVDTGSSLLWVQCLPCI 136

Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK-DPCPYIADYST 200
            C   S S+          +DP  S S K + C  P     +  K  + +   Y   Y  
Sbjct: 137 NCFQQSTSW----------FDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLG 186

Query: 201 EDTSSSGYLVDDILHLAS------FSKHAPQSSV----QSSVIIGCGRKQTGSYLDGAAP 250
            D SS G L  + L   +      F  +A  + +    +S++  GCG     +  D A  
Sbjct: 187 GD-SSQGILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAY- 244

Query: 251 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-----GSVFFGDQGPATQQSTSFL 305
           +GV GLG    + P  +  A  + N FS C  + ++       +  G QG   +  ++  
Sbjct: 245 NGVFGLG----AYPH-ITMATQLGNKFSYCIGDINNPLYTHNHLVLG-QGSYIEGDST-- 296

Query: 306 PIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTE----IY 351
           P+   +  Y+V ++S  +G+  L    + F+         L+DSG ++T L       +Y
Sbjct: 297 PLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLY 356

Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNA-SSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
            E+V     L+  +RI  Q      C+    S +++  P +   F+     V+ +   S 
Sbjct: 357 DEIVDLMKGLL--ERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESG--SL 412

Query: 411 PENEGFTVFCLTVMSTDGD---YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
               G   FCL ++ ++ +     +IG      + + FD E +K+ +    C+
Sbjct: 413 FRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQ 465


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 88/363 (24%), Positives = 151/363 (41%), Gaps = 40/363 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ + IG P+    + LD GS++ W     IQCAP +  Y+    +    ++P+SS+S 
Sbjct: 144 YFSRVGIGKPSSPVYMVLDTGSDVNW-----IQCAPCADCYH----QADPIFEPASSTSY 194

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             +SC    C+S    +   + C Y   Y  + + + G  V + + L S S         
Sbjct: 195 SPLSCDTKQCQSLDVSECRNNTCLYEVSYG-DGSYTVGDFVTETITLGSASV-------- 245

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSG 287
            +V IGCG    G ++  A   G+ G  L   S PS +  +     SFS C    ++DS 
Sbjct: 246 DNVAIGCGHNNEGLFIGAAGLLGLGGGKL---SFPSQINAS-----SFSYCLVDRDSDSA 297

Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGFQA--------L 336
           S    +        T+ L    + D  Y+VG+    +G   L+  +S F+         +
Sbjct: 298 STLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGII 357

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +DSG + T L T  Y  +   F K      ++ +   +  CY+ S +  ++VP +    +
Sbjct: 358 IDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLA 417

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 456
             +   +    +  P +   T FC     T     IIG     G R+ FD  N  + +  
Sbjct: 418 GGKVLPLPATNYLIPVDSDGT-FCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEP 476

Query: 457 SKC 459
            +C
Sbjct: 477 RQC 479


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 94/408 (23%), Positives = 164/408 (40%), Gaps = 82/408 (20%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEY 161
           Y  +   +  GTP+ +     D GS+L+W PC     C  C       ++ LD   +  +
Sbjct: 87  YGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCN------FSGLDPTQIPRF 140

Query: 162 DPSSSSSSKNVSCSHPLCK----SRSSCKSLKD-------PC-PYIADYSTEDTSSSGYL 209
            P +SSSS+ + C +P C+    +   C+           PC PYI  Y     S++G L
Sbjct: 141 IPKNSSSSRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGL--GSTAGIL 198

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           + + L         P  +V    ++GC      S +    P G+ G G G  S+PS +  
Sbjct: 199 ISEKLDF-------PDLTVP-DFVVGC------SVISTRTPAGIAGFGRGPESLPSQMKL 244

Query: 270 AGLIQNSFSICFDEN--------DSGSVFFGDQGPATQQSTSFLPIGEK--------YDA 313
                   S  FD+         D+GS   G +  +     S+ P  +          + 
Sbjct: 245 KSFSHCLVSRRFDDTNVTTDLGLDTGS---GHKSGSKTPGLSYTPFRKNPNVSNTAFLEY 301

Query: 314 YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
           Y++ +    +G+  +          T     ++VDSG++FTF+   ++  V  +F   +S
Sbjct: 302 YYLNLRRIYVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMS 361

Query: 364 --SKRISLQGNSW-KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVF 419
             ++   L+  S    C+N S +  + VP++   F       +  ++ FSF  N      
Sbjct: 362 NYTREKDLEKVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNA--DTV 419

Query: 420 CLTVMSTD--------GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           CLTV+S +        G   I+G      + + +D EN +  ++  KC
Sbjct: 420 CLTVVSDNTVNPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 93/401 (23%), Positives = 169/401 (42%), Gaps = 74/401 (18%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
           G ++  +    + IGTP  +  + LD GS L W+  QC +  P             S +D
Sbjct: 75  GFKYSMILLVSLPIGTPPQTQQMILDTGSQLSWI--QCHKKVPRKPP-------PSSVFD 125

Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLA 217
           PS SSS   + C+HPLCK R    +L   C      + + +  + T + G LV + +   
Sbjct: 126 PSLSSSFSVLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKI--- 182

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
           +FS+    S     +I+GC  + + +        G++G+ LG +S  S   +A L + S+
Sbjct: 183 TFSR----SQSTPPLILGCAEESSDA-------KGILGMNLGRLSFAS---QAKLTKFSY 228

Query: 278 SICFDE-----NDSGSVFFGDQGPA-------------TQQSTSFLPIGEKYDAYFVGVE 319
            +   +       +GS + G+   +             +Q+  +  P+     AY V ++
Sbjct: 229 CVPTRQVRPGFTPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPL-----AYTVAMQ 283

Query: 320 SYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRI 367
              IGN  L               Q ++DSG+ FT+L  E Y +V  +  +LV +  K+ 
Sbjct: 284 GIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKG 343

Query: 368 SLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMS 425
            + G     C+N ++ E+ + + +M   F K    VV +  + +   + G  V C+ +  
Sbjct: 344 YVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKGVEIVVEKERVLA---DVGGGVHCVGIGR 400

Query: 426 TD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
           ++       IIG        + FD  N ++ +  + C   +
Sbjct: 401 SEMLGAASNIIGNFHQQNIWVEFDLANRRVGFGKADCSRSV 441


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 90/382 (23%), Positives = 161/382 (42%), Gaps = 57/382 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +G+P   F + LD GS+L W+  QC+ C       Y    +N + YDP +S+S KN++C+
Sbjct: 176 VGSPPKHFSLILDTGSDLNWI--QCLPC-------YDCFQQNGAFYDPKASASYKNITCN 226

Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
              C   SS      CKS    CPY   Y     ++  + V+   ++L +    +   +V
Sbjct: 227 DQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNV 286

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DE 283
           + +++ GCG    G +   A    ++GLG G +S  S L    L  +SFS C      D 
Sbjct: 287 E-NMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDT 340

Query: 284 NDSGSVFFGDQGPATQQS----TSFLPIGEKY--DAYFVGVESYCIGNSCL--------- 328
           N S  + FG+            TSF+   E      Y+V ++S  +    L         
Sbjct: 341 NVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNI 400

Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYNASSEEML 386
            +      ++DSG + ++     Y  +  K  +    K  +         C+N S    +
Sbjct: 401 SSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNV 460

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLTVMST-DGDYGIIGQNFMMG 440
           ++P++ + F+          +++FP    F      + CL ++ T    + IIG      
Sbjct: 461 QLPELGIAFADGA-------VWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQN 513

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             I++D +  +L ++ +KC ++
Sbjct: 514 FHILYDTKRSRLGYAPTKCADI 535


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 89/365 (24%), Positives = 159/365 (43%), Gaps = 60/365 (16%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  I IGTP +  LV  D GS+L+WV CQ C +C    +            ++P  SS+
Sbjct: 94  YFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPI----------FNPKQSST 143

Query: 169 SKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSK 221
            + V C    C + +      S       C Y   YS  D S + GYL  +   + S   
Sbjct: 144 YRRVLCETRYCNALNSDMRACSAHGFFKACGY--SYSYGDHSFTMGYLATERFIIGS--- 198

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL-IQNSFSIC 280
               +S+Q  +  GCG    G++      +   G+        SL+++ G  I N FS C
Sbjct: 199 --TNNSIQ-ELAFGCGNSNGGNF-----DEVGSGIVGLGGGSLSLISQLGTKIDNKFSYC 250

Query: 281 F-----DENDS-GSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 330
                   N S G + FGD     G  T  ST  +   E    Y++ +E+  +GN  L  
Sbjct: 251 LVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVS-KEPETFYYLTLEAISVGNERLAY 309

Query: 331 SGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
              +          ++DSG + TFL +++Y ++ +  +K V  +R+S     +  C+   
Sbjct: 310 ENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFR-- 367

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQ-NFM 438
            +  +++P + + F+     +   + F+  E +   + C T++ ++G   +G + Q NF+
Sbjct: 368 DKIGIELPIITVHFTDADVELKPINTFAKAEED---LLCFTMIPSNGIAIFGNLAQMNFL 424

Query: 439 MGHRI 443
           +G+ +
Sbjct: 425 VGYDL 429


>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
 gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
          Length = 490

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 106/457 (23%), Positives = 187/457 (40%), Gaps = 72/457 (15%)

Query: 77  VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLW 135
           ++L +N++  R++ L  S     H   +     +YT  + IGTP   F + +D  S   +
Sbjct: 3   LELVANSHRRRDRELLGSARMDLH--DDLLTKGYYTSRVKIGTPPHEFSLIVDRSS---F 57

Query: 136 VPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYI 195
           V  + + C     S++   D   S   P+ SSS K + C +  C +     S K    Y 
Sbjct: 58  VSPKTMFC-----SFFFLQDPRFS---PALSSSYKPLECGNE-CSTGFCDGSRK----YQ 104

Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
             Y+ E ++SSG L  D++  ++ S    Q      ++ GC   +TG   D  A DG++G
Sbjct: 105 RQYA-EKSTSSGVLGKDVISFSNSSDLGGQR-----LVFGCETAETGDLYDQTA-DGIIG 157

Query: 256 LGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYD 312
           LG G +S+   L +   +++ FS+C+   DE     +  G Q P     TS  P    Y 
Sbjct: 158 LGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTSSDPHRSPY- 216

Query: 313 AYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 366
            Y + ++   +G S L          +  ++DSG ++ + P   +        + V S +
Sbjct: 217 -YNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLK 275

Query: 367 ISLQGNSWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVV--RNHIFSFPENEGFT 417
             + G   K+   CY  +   +  +    P +  +F   QS  +   N++F   +  G  
Sbjct: 276 -EVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISG-- 332

Query: 418 VFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ 477
            +CL V        ++G   +    + ++R    + +  +KC ++  +            
Sbjct: 333 AYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKTKCNDLWSR------------ 380

Query: 478 SPNPLPTTEQ--QSTSNGQAAAPPSTAKTAPSKSIAA 512
               LP T +   ST   Q   PP     APS S+ A
Sbjct: 381 ----LPETNEPGHSTQPAQFLLPP-----APSPSVGA 408


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 92/415 (22%), Positives = 175/415 (42%), Gaps = 81/415 (19%)

Query: 93  PSEGSQTHFFGNQFYWLHYTWID--IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASY 150
           PS  S  + F ++F +     I   IGTP  +  + LD GS L W+ C   +  P     
Sbjct: 53  PSPSSPPYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP----- 107

Query: 151 YTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDT 203
                +  + +DPS SSS   + CSHPLCK R       +SC S +  C Y   Y+ + T
Sbjct: 108 -----KPKTSFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGT 160

Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
            + G LV + +  ++       + +   +I+GC  + +          G++G+  G +  
Sbjct: 161 FAEGNLVKEKITFSN-------TEITPPLILGCATESSDD-------RGILGMNRGRL-- 204

Query: 264 PSLLAKAGLIQNSFSICFDEND-----SGSVFFGDQG-------------PATQQSTSFL 305
            S +++A + + S+ I    N      +GS + GD               P +Q+  +  
Sbjct: 205 -SFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLD 263

Query: 306 PIGEKYDAYFVGVESYCIGNSCLTQSGF--------QALVDSGASFTFLPTEIY----AE 353
           P+   Y    +G+  + +    ++ S F        Q +VDSG+ FT L    Y    AE
Sbjct: 264 PLA--YTVPMIGIR-FGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAE 320

Query: 354 VVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFP 411
           ++ +  + +  K+  + G +   C++ +   + + + D+  +F++    +V +  +    
Sbjct: 321 IMTRVGRRL--KKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGVEILVPKERVLV-- 376

Query: 412 ENEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 463
            N G  + C+ +  +        IIG        + FD  N ++ ++ + C  V+
Sbjct: 377 -NVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADCSRVV 430


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 89/380 (23%), Positives = 152/380 (40%), Gaps = 40/380 (10%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP   F + +D GS+L W     IQC P + +  +S       YD SSSSS 
Sbjct: 27  YFVELRVGTPAKKFPLIIDTGSDLTW-----IQCNPPNTTANSS-SPPAPWYDKSSSSSY 80

Query: 170 KNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSK-- 221
           + + C+   C        SSC S+K P P    Y   D S ++G L  + + + S  +  
Sbjct: 81  REIPCTDDECLFLPAPIGSSC-SIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSG 139

Query: 222 -----HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
                H  ++    +V +GC R+  G+   GA+  GV+GLG G +S+ +      L    
Sbjct: 140 KRAGNHKTRTIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGI 196

Query: 277 FSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS----- 326
           FS C  +   GS    F   G    +  +  PI     A   Y+V V    +        
Sbjct: 197 FSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 256

Query: 327 -----CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
                 +   G +  + DSG + ++L    Y++V+   +  +   R       ++ CYN 
Sbjct: 257 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNV 316

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
           +  E   +P + + F       +  + +     E      L  ++T     I+G      
Sbjct: 317 TRMEK-GMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQD 375

Query: 441 HRIVFDRENLKLAWSHSKCE 460
           H I +D    ++ +  S C 
Sbjct: 376 HHIEYDLAKARIGFKWSPCH 395


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 95/381 (24%), Positives = 157/381 (41%), Gaps = 67/381 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP ++F V  D GS+L+W      QCAP +  +     +    + P+SSS+   + 
Sbjct: 90  ISVGTPLLTFPVVADTGSDLIWT-----QCAPCTKCF----QQPAPPFQPASSSTFSKLP 140

Query: 174 CSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQSS 227
           C+   C+    S  +C +    C Y   Y +  T  +GYL  + L +  ASF        
Sbjct: 141 CTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASF-------- 188

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
              SV  GC  +       G +  G+ GLG G +   SL+ + G+    FS C     + 
Sbjct: 189 --PSVAFGCSTENG----VGNSTSGIAGLGRGAL---SLIPQLGV--GRFSYCLRSGSAA 237

Query: 288 S---VFFGDQGPATQ---QSTSFL---PIGEKYDAYFVGVESYCIGNSCL---------T 329
               + FG     T    QST F+    +   Y  Y+V +    +G + L         T
Sbjct: 238 GASPILFGSLANLTDGNVQSTPFVNNPAVHPSY--YYVNLTGITVGETDLPVTTSTFGFT 295

Query: 330 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS-SEEML 386
           Q+G     +VDSG + T+L  + Y  V   F    ++            C+ ++     +
Sbjct: 296 QNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGI 355

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENE---GFTVFCLTVMSTDGD--YGIIGQNFMMGH 441
            VP + L F     + V  + F+  E +     TV CL ++   GD    +IG    M  
Sbjct: 356 AVPSLVLRFDGGAEYAVPTY-FAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDM 414

Query: 442 RIVFDRENLKLAWSHSKCEEV 462
            +++D +    ++S + C +V
Sbjct: 415 HLLYDLDGGIFSFSPADCAKV 435


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 91/374 (24%), Positives = 153/374 (40%), Gaps = 62/374 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS-EYDPSSSSSSKN 171
           I IG P +  LV +D GS++LWV C  C  C           D +L   +DPS SS+   
Sbjct: 105 ISIGQPPIPQLVVMDTGSDILWVMCTPCTNC-----------DNDLGLLFDPSKSSTFS- 152

Query: 172 VSCSHPLCKSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
                PLCK+    +  + DP P+   Y+   T+S  +  D ++    F      +S  S
Sbjct: 153 -----PLCKTPCDFEGCRCDPIPFTVTYADNSTASGTFGRDTVV----FETTDEGTSRIS 203

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----ND 285
            V+ GCG    G   D    +G++GL  G     SL+ K G     FS C         +
Sbjct: 204 DVLFGCGH-NIGHDTD-PGHNGILGLNNGP---DSLVTKLG---QKFSYCIGNLADPYYN 255

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL----------TQSGFQ 334
              +  G+       ST F    E Y+  Y+V +E   +G   L                
Sbjct: 256 YHQLILGEGADLEGYSTPF----EVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGG 311

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYC-YNASSEEMLKVPDM 391
            ++D+G++ TFL   ++  +  +   L+  S ++ +++ + W  C Y + S +++  P +
Sbjct: 312 VIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVV 371

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV-----MSTDGDYGIIGQNFMMGHRIVFD 446
              FS      + +  F    N+   VFC+TV     ++      +IG      + + +D
Sbjct: 372 TFHFSDGADLALDSGSFFNQLND--NVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYD 429

Query: 447 RENLKLAWSHSKCE 460
             N  + +    CE
Sbjct: 430 LVNQFVYFQRIDCE 443


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 99/444 (22%), Positives = 167/444 (37%), Gaps = 75/444 (16%)

Query: 57  NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFG------------- 103
           N+    E  L    +R+  RV+        + +L     GS  +  G             
Sbjct: 88  NATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYENVAGVTAEFGSEVVSGM 147

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYD 162
            Q    ++T I IGTP     + LD GS+++W+ C+ C +C       Y+  D     ++
Sbjct: 148 EQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCREC-------YSQAD---PIFN 197

Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           PSSS S   V C   +C    +       C Y   Y     +   Y  + +    +F   
Sbjct: 198 PSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETL----TFGT- 252

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
              +S+Q +V IGCG    G ++  A   G+    L   S P+ L        +FS C  
Sbjct: 253 ---TSIQ-NVAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGTQ--TGRAFSYCLV 303

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSCLTQSG 332
           + DS S    + GP +      +PIG  +            Y++ + +  +G   L    
Sbjct: 304 DRDSESSGTLEFGPES------VPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVP 357

Query: 333 FQA------------LVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKY 376
            +A            ++DSG + T L T  Y  +   F      L  +  IS+    +  
Sbjct: 358 SEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISI----FDT 413

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 436
           CY+ S+ + + +P +   FS    F++       P +     FC      D +  I+G  
Sbjct: 414 CYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDS-MGTFCFAFAPADSNLSIMGNI 472

Query: 437 FMMGHRIVFDRENLKLAWSHSKCE 460
              G R+ FD  N  + ++  +C+
Sbjct: 473 QQQGIRVSFDSANSLVGFAIDQCQ 496


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 163/368 (44%), Gaps = 46/368 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +   I +G P   F +  D GS++ W+ CQ   CA    + Y   D     +DP SSSS 
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ--PCAS-ENTCYKQFD---PIFDPKSSSSY 201

Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             +SC+   CK   +++C S  D C Y   Y  + + ++G L  + L   + S   P   
Sbjct: 202 SPLSCNSQQCKLLDKANCNS--DTCIYQVHYG-DGSFTTGELATETLSFGN-SNSIP--- 254

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
              ++ IGCG    G +  GA   G+           SL ++  L  +SFS C    D +
Sbjct: 255 ---NLPIGCGHDNEGLFAGGAGLIGLG------GGAISLSSQ--LKASSFSYCLVNLDSD 303

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL---------TQSGFQ 334
            S ++ F    P+    TS L   +++ +Y +V V    +G   L          +SG  
Sbjct: 304 SSSTLEFNSYMPS-DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLG 362

Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             +VDSG   + LP+++Y  +   F KL SS   +   + +  CYN S +  ++VP +  
Sbjct: 363 GIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAF 422

Query: 394 IFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
           + S+  S  +  RN++    +  G   +CL  + T     IIG     G R+ +D  N  
Sbjct: 423 VLSEGTSLRLPARNYLIML-DTAG--TYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSI 479

Query: 452 LAWSHSKC 459
           + +S +KC
Sbjct: 480 VGFSTNKC 487


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 155/396 (39%), Gaps = 72/396 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           +Y  + +GTP V  ++ +D GS++ W+ C  C  C P               ++P  SSS
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALR----------PPFNPRHSSS 187

Query: 169 SKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL--HLASFSK 221
              + C+   C +     +  C      C +   Y  + + SSG L  + +  +  +F  
Sbjct: 188 FFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYG-DGSLSSGLLAMETIAGNTPNFGD 246

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
             P     S++ +GC          GA+  G++G+    +S PS L+        FS CF
Sbjct: 247 GEPVK--LSNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YARKFSHCF 300

Query: 282 DE-----NDSGSVFFGD--------------QGPATQQSTSFLPIGEKYDAYFVGVESYC 322
            +     N SG VFFG+              Q PA   ++         D Y+VG+    
Sbjct: 301 PDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSAS--------LDYYYVGLVGIS 352

Query: 323 IGNSCL------------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
           +  S L            T SG   ++DSG +FT+L    +  +  +F    S       
Sbjct: 353 VDESRLPLSHKNFDIDKVTGSG-GTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDD 411

Query: 371 GNSWKYCYNASSE----EMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTV- 423
            + +  CYN +S     E   +P + L F      V+  +    P   +E  T  CL   
Sbjct: 412 NSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ 471

Query: 424 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           MS D  + IIG        + +D E L+L  + ++C
Sbjct: 472 MSGDIPFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 507


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 86/372 (23%), Positives = 149/372 (40%), Gaps = 51/372 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + +GTP     + LD GS+++W     +QCAP    Y  S       +DP  S + 
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVW-----LQCAPCRRCYSQS----DPIFDPRKSKTY 192

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             + CS P C+   S  C + +  C Y   Y     +   +  + +    +F ++  +  
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETL----TFRRNRVK-- 246

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDENDS 286
               V +GCG    G ++  A   G+           S   + G      FS C  +  +
Sbjct: 247 ---GVALGCGHDNEGLFVGAAGLLGLG------KGKLSFPGQTGHRFNQKFSYCLVDRSA 297

Query: 287 ----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ-- 334
                SV FG+   A  +   F P+    K D  Y+VG+    +G +    +T S F+  
Sbjct: 298 SSKPSSVVFGNA--AVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLD 355

Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
                  ++DSG S T L    Y  +   F     + + +   + +  C++ S+   +KV
Sbjct: 356 QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKV 415

Query: 389 PDMRLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           P + L F + + S    N++     N     FC     T G   IIG     G R+V+D 
Sbjct: 416 PTVVLHFRRADVSLPATNYLIPVDTNGK---FCFAFAGTMGGLSIIGNIQQQGFRVVYDL 472

Query: 448 ENLKLAWSHSKC 459
            + ++ ++   C
Sbjct: 473 ASSRVGFAPGGC 484


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 93/391 (23%), Positives = 157/391 (40%), Gaps = 70/391 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IG+   +    +D GS  + V C                 R+   +DP++S S + V 
Sbjct: 3   LGIGSLQKNLSAIIDTGSEAVLVQCG---------------SRSRPVFDPAASQSYRQVP 47

Query: 174 CSHPLC---------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
           C   LC          S   C +    C Y   Y  +  +S+G    D++ L S +  + 
Sbjct: 48  CISQLCLAVQQQTSNGSSQPCVNSSAACTYSLSYG-DSRNSTGDFSQDVIFLNS-TNSSS 105

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
           Q+     V  GC     G  +D     G++G   G++S+PS L K  L  + FS CF   
Sbjct: 106 QAVQFRDVAFGCAHSPQGFLVD-LGSLGIVGFNRGNLSLPSQL-KDRLGGSKFSYCFPSQ 163

Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGE------KYDAYFVGVESYCIGNSCLT--QS 331
                 +G +F GD G  ++   S+ P+ +      +   Y+VG+ S  +    L   +S
Sbjct: 164 PWQPRATGVIFLGDSG-LSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPES 222

Query: 332 GFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ-----GNSWKYC 377
            F+          ++DSG +FT +  + Y      F    +S R  L+        +  C
Sbjct: 223 AFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAF---AASNRSGLRKKVGAAAGFDDC 279

Query: 378 YNASSEEML-KVPDMRLIFSKNQSFVVR-NHIF---SFPENEGFTVFCLTVMSTD----G 428
           YN S+   L  VP++RL    N    +R  H+F   S   NE     CL ++S+     G
Sbjct: 280 YNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNE--VTVCLAILSSQKSGFG 337

Query: 429 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              ++G      + + +D E  ++ +  + C
Sbjct: 338 KINVLGNYQQSNYLVEYDNERSRVGFERADC 368


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 83/366 (22%), Positives = 148/366 (40%), Gaps = 34/366 (9%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
           L+Y  + IG  N +  V +D GS+L WV C  C+ C       +   + +       +SS
Sbjct: 131 LNYI-VTIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSS 189

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           + +N+  +     +  +C+S  +P  C +   Y  + + + G L  + L     S     
Sbjct: 190 TCQNLQFT---TGNTEACES-NNPSSCNHTVSYG-DGSFTDGELGVEHLSFGGISV---- 240

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---D 282
               S+ + GCGR   G +       G+MGLG  ++S+ S           FS C    D
Sbjct: 241 ----SNFVFGCGRNNKGLF---GGVSGIMGLGRSNLSMISQTNTT--FGGVFSYCLPTTD 291

Query: 283 ENDSGSVFFGDQGPATQQ-----STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ--- 334
              SGS+  G++    +       TS +   +  + Y + +    +G   +  + F    
Sbjct: 292 SGASGSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGG 351

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
            L+DSG   T L   +Y  +  +F K  S   I+   +    C+N +  E + +P + + 
Sbjct: 352 ILIDSGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMH 411

Query: 395 FSKNQSFVVRN-HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
           F  N    V    I   P++       L  +S + D  IIG       R+++D +  K+ 
Sbjct: 412 FENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIG 471

Query: 454 WSHSKC 459
           ++   C
Sbjct: 472 FAREDC 477


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 89/382 (23%), Positives = 154/382 (40%), Gaps = 44/382 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP   F + +D GS+L W     IQC P + +  +S       YD SSSSS 
Sbjct: 59  YFVELRVGTPAKKFPLIVDTGSDLTW-----IQCNPPNTTANSS-SPPAPWYDKSSSSSY 112

Query: 170 KNVSCSHPLCK-----SRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK-- 221
           + + C+   C+       SSC  +   PC Y   YS + + ++G L  + + + S  +  
Sbjct: 113 REIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYS-DQSRTTGILAYETISMKSRKRSG 171

Query: 222 -----HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
                H  +     +V +GC R+  G+   GA+  GV+GLG G +S+ +      L    
Sbjct: 172 KRAGNHKTRRIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGI 228

Query: 277 FSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS----- 326
           FS C  +   GS    F   G    +  +  PI     A   Y+V V    +        
Sbjct: 229 FSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 288

Query: 327 -----CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
                 +   G +  + DSG + ++L    Y++V+   +  +   R       ++ CYN 
Sbjct: 289 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNV 348

Query: 381 SSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 438
           +  E   +P + + F       +   N++    EN       L  ++T     I+G    
Sbjct: 349 TRMEK-GMPKLGVEFQGGAVMELPWNNYMVLVAEN--VQCVALQKVTTTNGSNILGNLLQ 405

Query: 439 MGHRIVFDRENLKLAWSHSKCE 460
             H I +D    ++ +  S C 
Sbjct: 406 QDHHIEYDLAKARIGFKWSPCH 427


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 112/464 (24%), Positives = 182/464 (39%), Gaps = 70/464 (15%)

Query: 35  EAKERWISKSGNVSVADSWPKK---NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL 91
           E K R    S  V   DS   K   N+    E  L    +R   RV+        R +L 
Sbjct: 106 ETKPRQTPWSVQVVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLN 165

Query: 92  FPSEGSQTHF------FGNQFY-------WLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
               GS  +       FG +           ++T I +GTP     + LD GS+++W+ C
Sbjct: 166 KDPAGSHENVAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQC 225

Query: 139 Q-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIAD 197
           + C +C       Y+ +D     ++PS S+S   + C+  +C    +       C Y   
Sbjct: 226 EPCSKC-------YSQVD---PIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVS 275

Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
           Y  + + + G    ++L   + S          +V IGCG    G ++  A    ++GLG
Sbjct: 276 YG-DGSYTIGSFATEMLTFGTTSVR--------NVAIGCGHDNAGLFVGAAG---LLGLG 323

Query: 258 LGDVSVPSLLAKAGLIQNSFSICFDEN---DSGSVFFGDQG-PATQQSTSFLPIGEKYDA 313
            G +S PS L        +FS C  +     SG++ FG +  P     T  L        
Sbjct: 324 AGLLSFPSQLGTQ--TGRAFSYCLVDRFSESSGTLEFGPESVPLGSILTPLLTNPSLPTF 381

Query: 314 YFVGVESYCIGNSCLT--------------QSGFQALVDSGASFTFLPTEIYAEV----V 355
           Y+V + S  +G + L               + GF  +VDSG + T L T +Y  V    V
Sbjct: 382 YYVPLISISVGGALLDSVPPDVFRIDETSGRGGF--IVDSGTAVTRLQTPVYDAVRDAFV 439

Query: 356 VKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG 415
               +L  ++ +S+    +  CY+ S   ++ VP +   FS   S ++    +  P +  
Sbjct: 440 AGTRQLPKAEGVSI----FDTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFM 495

Query: 416 FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            T FC        D  I+G     G R+ FD  N  + ++  +C
Sbjct: 496 GT-FCFAFAPATSDLSIMGNIQQQGIRVSFDTANSLVGFALRQC 538


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 91/385 (23%), Positives = 161/385 (41%), Gaps = 68/385 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP  +  + LD GS L W+  QC +  P +AS+           DPS SS+   + 
Sbjct: 79  LPIGTPPQTQPMVLDTGSQLSWI--QCHKKQPPTASF-----------DPSLSSTFSILP 125

Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           C+HPLCK R    +L   C      + + +  + T + G LV +     +FS+    S  
Sbjct: 126 CTHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKF---TFSR----SVS 178

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-------------VPSLLAKAGLI-Q 274
              +I+GC  + T        P G++G+ LG +S             VP    + G    
Sbjct: 179 TPPLILGCATESTD-------PRGILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFTPT 231

Query: 275 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE--------SYCIGNS 326
            SF +  + +  G  + G    + Q+  +F P+   Y    VG+         S  +  +
Sbjct: 232 GSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLA--YTIPMVGIRIAGKKLNISPAVFRA 289

Query: 327 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNA--SS 382
               SG Q ++DSG+ FT+L +E Y +V  +  + V    K+  + G     C+++  + 
Sbjct: 290 DAGGSG-QTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMCFDSVKAV 348

Query: 383 EEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQNFM 438
           E    + +M   F +    V+ +  + +   + G  V C+ + S+D       IIG    
Sbjct: 349 EIGRLIGEMVFEFERGVEVVIPKERVLA---DVGGGVHCVGIGSSDKLGAASNIIGNFHQ 405

Query: 439 MGHRIVFDRENLKLAWSHSKCEEVI 463
               + FD    ++ +  + C  ++
Sbjct: 406 QNLWVEFDLVRRRVGFGKADCSRLV 430


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 97/362 (26%), Positives = 150/362 (41%), Gaps = 39/362 (10%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP     +  D GS+L WV     QC P  +S +    ++   +DPS SS+   V 
Sbjct: 148 VGLGTPAQPSALIFDTGSDLSWV-----QCQPCGSSGHCHPQQD-PLFDPSKSSTYAAVH 201

Query: 174 CSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           C  P C +    C      C Y+  Y  + +S++G L  D L L S       S   +  
Sbjct: 202 CGEPQCAAAGDLCSEDNTTCLYLVRYG-DGSSTTGVLSRDTLALTS-------SRALTGF 253

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
             GCG +  G +      DG++GLG G++S+PS  A +      FS C   ++S + +  
Sbjct: 254 PFGCGTRNLGDF---GRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLPSSNSTTGYLT 308

Query: 293 -DQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSG 340
               PAT     Q T+ L   +    YFV + S  IG   L       T+ G   L+DSG
Sbjct: 309 IGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRGG--TLLDSG 366

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
              T+LP + YA +  +F   +     +   +    CY+ + E  + VP +   F     
Sbjct: 367 TVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAV 426

Query: 401 FVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
           F +     +    EN G   F    M T G    IIG        +++D    K+ +  +
Sbjct: 427 FELDFFGVMIFLDENVGCLAF--AAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPA 484

Query: 458 KC 459
            C
Sbjct: 485 SC 486


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 93/379 (24%), Positives = 160/379 (42%), Gaps = 48/379 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +G P   FL+ +D GS+L W     +QC P  A +    D++   +DPS S+S 
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTW-----LQCKPCKACF----DQSGPVFDPSQSTSF 221

Query: 170 KNVSCS--------HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           K + C+        H  C+  SS  S K  C Y   Y  + + +SG L  + L + S S 
Sbjct: 222 KIIPCNAAACDLVVHDECRDNSSKTSPKT-CKYFYWYG-DSSRTSGDLALESLSV-SLSD 278

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
           H P S     ++IGCG    G +        ++GLG G +S PS L ++  I  SFS C 
Sbjct: 279 H-PSSLEIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSPIGQSFSYCL 333

Query: 282 DEND-----SGSVFFGDQGPATQQ--STSFLPIGEKYDA----YFVGVESYCIGNSCLTQ 330
            +       S ++ FG     ++      F P     ++    Y++G++   I    L  
Sbjct: 334 VDRTNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPI 393

Query: 331 SGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
              +           ++DSG + T+L  + Y  V   F   +S  R     +    CYNA
Sbjct: 394 PAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-PFDILGICYNA 452

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
           +    +  P + ++F       +    +    +      CL ++ TDG   IIG      
Sbjct: 453 TGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MSIIGNFQQQN 511

Query: 441 HRIVFDRENLKLAWSHSKC 459
              ++D ++ +L ++++ C
Sbjct: 512 IHFLYDVQHARLGFANTDC 530


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 109/466 (23%), Positives = 175/466 (37%), Gaps = 92/466 (19%)

Query: 17  DGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR 76
           DG+ +V+ S    HR+            G  S AD    +      ELL  +  +    R
Sbjct: 30  DGTSSVTLS----HRY------------GPCSPADPNSGEKRPTDEELLRRDQLRADYIR 73

Query: 77  VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLW 135
            K   +N ++  +    S+ S     G+    L Y   + +G+P V+  V +D GS++ W
Sbjct: 74  RKFSGSNGTAAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSW 133

Query: 136 VPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSL 188
           V C+ C   +P  A          + +DP++SS+    +CS   C         + C + 
Sbjct: 134 VQCEPCPAPSPCHA-------HAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDA- 185

Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
           K  C YI  Y  + ++++G    D+L L+        S V      GC   + G+ +D  
Sbjct: 186 KSRCQYIVKYG-DGSNTTGTYSSDVLTLSG-------SDVVRGFQFGCSHAELGAGMDDK 237

Query: 249 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIG 308
             DG++GLG GD   P +   A     SF  C               PAT  S+ FL +G
Sbjct: 238 T-DGLIGLG-GDAQSP-VSQTAARYGKSFFYCL--------------PATPASSGFLTLG 280

Query: 309 EKYDA----------------------YFVGVESYCIGNS--CLTQSGFQA--LVDSGAS 342
                                      YF  +E   +G     L+ S F A  LVDSG  
Sbjct: 281 APASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSLVDSGTV 340

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
            T LP   YA +   F   ++    +        C+N +  + + +P + L+F+      
Sbjct: 341 ITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVVD 400

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFD 446
           +  H          +  CL    T  D  +G IG        +++D
Sbjct: 401 LDAHGI-------VSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439


>gi|297739018|emb|CBI28370.3| unnamed protein product [Vitis vinifera]
          Length = 150

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/121 (34%), Positives = 65/121 (53%), Gaps = 13/121 (10%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           +F   + HRFSD  K         +   D  P+K S++Y + +   DW     R+   S 
Sbjct: 29  TFGFDMHHRFSDPVK--------GILDVDDLPEKLSLQYYKAMAHRDWVIHGRRL---ST 77

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
           ++  +  L F S+G++T+   +  Y LHY  + +GTP++ FLVALD GS+L W+PC C  
Sbjct: 78  SDEVKPPLTF-SDGNETYRLSSLGY-LHYANVSLGTPSLWFLVALDTGSDLFWLPCDCTS 135

Query: 143 C 143
           C
Sbjct: 136 C 136


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 80/293 (27%), Positives = 121/293 (41%), Gaps = 35/293 (11%)

Query: 119 PNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           P V   V LD+ S++ WV   PC    C P   S+Y          DPS S SS   SCS
Sbjct: 155 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFY----------DPSRSPSSAPFSCS 204

Query: 176 HPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
            P C +     +    + C Y+  Y  + +S+SG  + D+L L +        +  S   
Sbjct: 205 SPTCTALGPYANGCANNQCQYLVRYP-DGSSTSGAYIADLLTLDA-------GNAVSGFK 256

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GC   + GS+   AA  G+M LG G  S+  L   A    N+FS C     S S FF  
Sbjct: 257 FGCSHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNAFSYCIPATASDSGFFTL 312

Query: 294 QGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA--LVDSGASFTF 345
             P    S    T  +   +    Y V + +  +G   L    + F A  ++DS  + T 
Sbjct: 313 GVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITR 372

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           LP   Y  +   F   ++  R +        CY+ +    +++P + L+F +N
Sbjct: 373 LPPTAYQALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRN 425


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 103/395 (26%), Positives = 153/395 (38%), Gaps = 90/395 (22%)

Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRN 157
            G     L Y   +  GTP V  +V +D GS++ W+   PC   QC P          + 
Sbjct: 70  LGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFP----------QK 119

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
              YDPS SS+   V C+  +CK        S C S K  C +   Y+ + TS+ G    
Sbjct: 120 DPLYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAISYA-DGTSTVGAYSQ 177

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG---------LGDV- 261
           D L L      AP + VQ +   GCG    G +      DGV+GLG          G V 
Sbjct: 178 DKLTL------APGAIVQ-NFYFGCGH---GKHAVRGLFDGVLGLGRLRESLGARYGGVF 227

Query: 262 --SVPSLLAKAGLIQNSFSICFDENDSGSVF--FGD-QGPATQQSTSFLPI---GEKYDA 313
              +PS+ +K G +    ++   +N SG VF   G   G  T  + +   I   G+K D 
Sbjct: 228 SYCLPSVSSKPGFL----ALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLD- 282

Query: 314 YFVGVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
                         L  S F    +VDSG   T L +  Y  +   F K + + R+   G
Sbjct: 283 --------------LRPSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNG 328

Query: 372 NSWKYCYNASSEEMLKVPDMRLIFSKNQSF-------VVRNHIFSFPENEGFTVFCLTVM 424
           +    CYN +  + + VP + L F+   +        ++ N   +F E+           
Sbjct: 329 D-LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNGCLAFAES----------- 376

Query: 425 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             DG  G++G        ++FD    K  +    C
Sbjct: 377 GPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 411


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 103/395 (26%), Positives = 153/395 (38%), Gaps = 90/395 (22%)

Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRN 157
            G     L Y   +  GTP V  +V +D GS++ W+   PC   QC P          + 
Sbjct: 104 LGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFP----------QK 153

Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
              YDPS SS+   V C+  +CK        S C S K  C +   Y+ + TS+ G    
Sbjct: 154 DPLYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAISYA-DGTSTVGAYSQ 211

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG---------LGDV- 261
           D L L      AP + VQ +   GCG    G +      DGV+GLG          G V 
Sbjct: 212 DKLTL------APGAIVQ-NFYFGCGH---GKHAVRGLFDGVLGLGRLRESLGARYGGVF 261

Query: 262 --SVPSLLAKAGLIQNSFSICFDENDSGSVF--FGD-QGPATQQSTSFLPI---GEKYDA 313
              +PS+ +K G +    ++   +N SG VF   G   G  T  + +   I   G+K D 
Sbjct: 262 SYCLPSVSSKPGFL----ALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLD- 316

Query: 314 YFVGVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
                         L  S F    +VDSG   T L +  Y  +   F K + + R+   G
Sbjct: 317 --------------LRPSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNG 362

Query: 372 NSWKYCYNASSEEMLKVPDMRLIFSKNQSF-------VVRNHIFSFPENEGFTVFCLTVM 424
           +    CYN +  + + VP + L F+   +        ++ N   +F E+           
Sbjct: 363 D-LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNGCLAFAES----------- 410

Query: 425 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             DG  G++G        ++FD    K  +    C
Sbjct: 411 GPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 107/464 (23%), Positives = 182/464 (39%), Gaps = 75/464 (16%)

Query: 31  RFSDEA------KERWISKSGN-VSV------ADSWPKKNSVEYLELLLSNDWKRQKTRV 77
           R+S+ A      + RW+ +  N VSV          P   S +  E  LS   +R + R 
Sbjct: 36  RYSEPAATCSTSRVRWLDEGSNTVSVPLVHRHGPCAPSTRSSD--EPSLSERLRRSRARS 93

Query: 78  KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
           K   +  S  N  +       TH  G+     +   + +GTP VS ++ +D GS+L WV 
Sbjct: 94  KYIMSRASKSNVSI------PTHLGGSVDSLEYVVTVGLGTPAVSQVLLIDTGSDLSWV- 146

Query: 138 CQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKD- 190
               QCAP +++  T   +    +DPS SS+   + C+   C+        S C S    
Sbjct: 147 ----QCAPCNST--TCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRDGYGSDCTSGSGG 200

Query: 191 --PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
              C Y   Y  + + ++G   ++ L +      AP  +V+     GCG  Q G      
Sbjct: 201 GAQCGYAITYG-DGSQTTGVYSNETLTM------APGVTVK-DFHFGCGHDQDGP---ND 249

Query: 249 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQST-SFLP- 306
             DG++GLG    S+  ++  + +   +FS C    +  + F     P    S   F P 
Sbjct: 250 KYDGLLGLGGAPESL--VVQTSSVYGGAFSYCLPAANDQAGFLALGAPVNDASGFVFTPM 307

Query: 307 IGEKYDAYFVGVESYCIGNSCL--TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLV 362
           + E+   Y V +    +G   +    S F    ++DSG   T L    YA +   F K +
Sbjct: 308 VREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTELQHTAYAALQAAFRKAM 367

Query: 363 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-------VVRNHIFSFPENEG 415
           ++  + L       CYN +    + VP + L FS   +        ++ ++  +F E   
Sbjct: 368 AAYPL-LPNGELDTCYNFTGHSNVTVPRVALTFSGGATVDLDVPDGILLDNCLAFQE--- 423

Query: 416 FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
                      D   GI+G        +++D  + ++ +    C
Sbjct: 424 --------AGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459


>gi|359496801|ref|XP_003635339.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 151

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/121 (34%), Positives = 65/121 (53%), Gaps = 13/121 (10%)

Query: 23  SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
           +F   + HRFSD  K         +   D  P+K S++Y + +   DW     R+   S 
Sbjct: 29  TFGFDMHHRFSDPVK--------GILDVDDLPEKLSLQYYKAMAHRDWVIHGRRL---ST 77

Query: 83  NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
           ++  +  L F S+G++T+   +  Y LHY  + +GTP++ FLVALD GS+L W+PC C  
Sbjct: 78  SDEVKPPLTF-SDGNETYRLSSLGY-LHYANVSLGTPSLWFLVALDTGSDLFWLPCDCTS 135

Query: 143 C 143
           C
Sbjct: 136 C 136


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 91/382 (23%), Positives = 147/382 (38%), Gaps = 62/382 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP + +   +D GS+L+W  C  C+ CA          D+    + P+ S++ + V
Sbjct: 96  LAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCA----------DQPTPYFRPARSATYRLV 145

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS--VQS 230
            C  PLC +       +        Y  ++ S++G L  +     +F+  A  SS  + S
Sbjct: 146 PCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASE-----TFTFGAANSSKVMVS 200

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSG 287
            V  GCG   +G   + +   G++GLG G +S+ S L       + FS C   F   +  
Sbjct: 201 DVAFGCGNINSGQLANSS---GMVGLGRGPLSLVSQLGP-----SRFSYCLTSFLSPEPS 252

Query: 288 SVFFG----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF---- 333
            + FG              +  QST  +        YF+ ++   +G   L         
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAI 312

Query: 334 ------QALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
                    +DSG S T+L  + Y  V   +V   + +     +  G    + +      
Sbjct: 313 NDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSV 372

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTDGDYGIIGQNFMMG 440
            + VPDM L F    +  V       PEN    +G T F    M   GD  IIG      
Sbjct: 373 AVTVPDMELHFDGGANMTVP------PENYMLIDGATGFLCLAMIRSGDATIIGNYQQQN 426

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             I++D  N  L++  + C  V
Sbjct: 427 MHILYDIANSLLSFVPAPCNIV 448


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 94/382 (24%), Positives = 156/382 (40%), Gaps = 68/382 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP ++F V  D GS+L+W      QCAP +  +     +    + P+SSS+   + 
Sbjct: 90  ISVGTPLLTFSVVADTGSDLIWT-----QCAPCTKCF----QQPAPPFQPASSSTFSKLP 140

Query: 174 CSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQSS 227
           C+   C+    S  +C +    C Y   Y +  T  +GYL  + L +  ASF        
Sbjct: 141 CTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASF-------- 188

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
              SV  GC  +       G +  G+ GLG G +   SL+ + G+    FS C     + 
Sbjct: 189 --PSVAFGCSTENG----VGNSTSGIAGLGRGAL---SLIPQLGV--GRFSYCLRSGSAA 237

Query: 288 S---VFFGDQGPATQ---QSTSFL---PIGEKYDAYFVGVESYCIGNSCL---------T 329
               + FG     T    QST F+    +   Y  Y+V +    +G + L         T
Sbjct: 238 GASPILFGSLANLTDGNVQSTPFVNNPAVHPSY--YYVNLTGITVGETDLPVTTSTFGFT 295

Query: 330 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS--SEEM 385
           Q+G     +VDSG + T+L  + Y  V   F    +             C+ ++      
Sbjct: 296 QNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGG 355

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENE---GFTVFCLTVMSTDGD--YGIIGQNFMMG 440
           + VP + L F     + V  + F+  E +     TV CL ++   GD    +IG    M 
Sbjct: 356 IAVPSLVLRFDGGAEYAVPTY-FAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMD 414

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             +++D +    +++ + C +V
Sbjct: 415 MHLLYDLDGGIFSFAPADCAKV 436


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 149/367 (40%), Gaps = 57/367 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           +  GTP     + LD GS++ W  C+ C+ C   S  Y          +D S+SS+    
Sbjct: 132 VAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRY----------FDSSASSTYSFG 181

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           SC     ++            Y   Y  + TS   Y  D +            S V    
Sbjct: 182 SCIPSTVENN-----------YNMTYGDDSTSVGNYGCDTMT--------LEPSDVFQKF 222

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFF 291
             GCGR   G +  G+  DG++GLG G +S  S  A        FS C  E DS GS+ F
Sbjct: 223 QFGCGRNNKGDF--GSGVDGMLGLGQGQLSTVSQTASK--FNKVFSYCLPEEDSIGSLLF 278

Query: 292 GDQGPATQQSTSF----LPIG----EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVD 338
           G++  AT QS+S     L  G    ++   YFV +    +GN  L    S F +   ++D
Sbjct: 279 GEK--ATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIID 336

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLI 394
           S    T LP   Y+ +   F K ++   +S     +G+    CYN S  + + +P++ L 
Sbjct: 337 SRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLH 396

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
           F       VR +  +       +  CL    T  +  IIG    +   +++D +  ++ +
Sbjct: 397 FGGGAD--VRLNGTNIVWGSDASRLCLAFAGTS-ELTIIGNRQQLSLTVLYDIQGRRIGF 453

Query: 455 SHSKCEE 461
             + C +
Sbjct: 454 GGNGCSK 460


>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
          Length = 410

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/398 (20%), Positives = 158/398 (39%), Gaps = 46/398 (11%)

Query: 93  PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
           PS        GN +   H+   ++IG P   + + +D GS L W+ C   CI C  +   
Sbjct: 20  PSSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHG 79

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
            Y        E   +   + +  +  +   +    C   K+ C Y   Y     SS G L
Sbjct: 80  LYK------PELKYAVKCTEQRCADLYADLRKPMKCGP-KNQCHYGIQYV--GGSSIGVL 130

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
           + D     SFS  A   +  +S+  GCG  Q  +  +   P +G++GLG G V++ S L 
Sbjct: 131 IVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLK 185

Query: 269 KAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY--FVGVESYCIGN 325
             G+I ++    C      G +FFGD    T   T + P+  ++  Y    G   +   +
Sbjct: 186 SQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTLQFNSNS 244

Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYNA 380
             ++ +  + + DSGA++T+   + Y   +      +S +      +  +  +   C+  
Sbjct: 245 KPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKG 304

Query: 381 SSEEMLKVPDMRLIFS----------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
             +++  + +++  F           K  +  +    +     EG    CL ++    ++
Sbjct: 305 -KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKEH 361

Query: 431 ------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
                  +IG   M+   +++D E   L W + +C+ +
Sbjct: 362 PSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 92/396 (23%), Positives = 157/396 (39%), Gaps = 54/396 (13%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ-----CAPLSASYYTSLDRNL 158
            QF +L    I++GTP V  L   D GS+L+WV C+         AP S  +        
Sbjct: 106 RQFEYL--MAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFV------- 156

Query: 159 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK--DPCPYIADYSTEDTSSSGYLVDDILHL 216
               PS+SS+   V C    C++ SS  S      C Y+  Y  + + +SG L  +    
Sbjct: 157 ----PSASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYG-DGSRASGQLSTETFTF 211

Query: 217 ASFSKHAPQSSVQ--------------SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           ++ +  +  +S                + +  GC    TG++      DG++GLG G VS
Sbjct: 212 STIADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTF----RADGLVGLGGGPVS 267

Query: 263 VPSLLAKAGLIQNSFSICF----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFV 316
           + S L     +   FS C     + N S ++ FG +   ++   +  P+  GE    Y +
Sbjct: 268 LASQLGATTSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTI 327

Query: 317 GVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
            ++S  +  +    +  QA  +VDSG + T+L + +   +V    + +   R        
Sbjct: 328 ALDSINVAGTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKIL 387

Query: 375 KYCYNAS---SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 431
             CY+ S    E+ L +PD+ L+        ++         EG     L   S      
Sbjct: 388 DLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVS 447

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 467
           I+G        + +D E   + ++ + C     KSH
Sbjct: 448 ILGNIAQQNLHVGYDLEKGTVTFAAADCA----KSH 479


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 92/362 (25%), Positives = 145/362 (40%), Gaps = 48/362 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQC---APLSASYYTSLDRNLSEYDPSSSSSS 169
           +D GTP  S    +D GS++ W+PC QC  C   AP+              +DP+ SSS 
Sbjct: 119 VDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTAPI--------------FDPAKSSSY 164

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           K  +C    C+  S        C +   Y  + T   G L  D + L   S++ P  S  
Sbjct: 165 KPFACDSQPCQEISGNCGGNSKCQFEVLYG-DGTQVDGTLASDAITLG--SQYLPNFS-- 219

Query: 230 SSVIIGCGRKQT-GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDS 286
                GC    +  +Y          G        P+    A L   +FS C       S
Sbjct: 220 ----FGCAESLSEDTYSSPGLMGLGGGSLSLLTQAPT----AELFGGTFSYCLPSSSTSS 271

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYD---AYFVGVESYCIGNSCLT------QSGFQALV 337
           GS+  G +   +  S  F  + +       YFV +++  +GN+ ++       SG   ++
Sbjct: 272 GSLVLGKEAAVSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTII 331

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG + T+L    Y ++   F + +SS + +        CY+ SS   + VP + L   +
Sbjct: 332 DSGTTITYLVPSAYKDLRDAFRQQLSSLQPT-PVEDMDTCYDLSSSS-VDVPTITLHLDR 389

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
           N   V+        +  G +  CL   STD    IIG       RIVFD  N ++ ++  
Sbjct: 390 NVDLVLPKENILITQESGLS--CLAFSSTD-SRSIIGNVQQQNWRIVFDVPNSQVGFAQE 446

Query: 458 KC 459
           +C
Sbjct: 447 QC 448


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/382 (23%), Positives = 147/382 (38%), Gaps = 62/382 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP + +   +D GS+L+W  C  C+ CA          D+    + P+ S++ + V
Sbjct: 96  LAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCA----------DQPTPYFRPARSATYRLV 145

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS--VQS 230
            C  PLC +       +        Y  ++ S++G L  +     +F+  A  SS  + S
Sbjct: 146 PCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASE-----TFTFGAANSSKVMVS 200

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSG 287
            V  GCG   +G   + +   G++GLG G +S+ S L       + FS C   F   +  
Sbjct: 201 DVAFGCGNINSGQLANSS---GMVGLGRGPLSLVSQLGP-----SRFSYCLTSFLSPEPS 252

Query: 288 SVFFG----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF---- 333
            + FG              +  QST  +        YF+ ++   +G   L         
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAI 312

Query: 334 ------QALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
                    +DSG S T+L  + Y  V   +V   + +     +  G    + +      
Sbjct: 313 NDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSV 372

Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTDGDYGIIGQNFMMG 440
            + VPDM L F    +  V       PEN    +G T F    M   GD  IIG      
Sbjct: 373 AVTVPDMELHFDGGANMTVP------PENYMLIDGATGFLCLAMIRSGDATIIGNYQQQN 426

Query: 441 HRIVFDRENLKLAWSHSKCEEV 462
             I++D  N  L++  + C  V
Sbjct: 427 MHILYDIANSLLSFVPAPCNIV 448


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 96/417 (23%), Positives = 170/417 (40%), Gaps = 77/417 (18%)

Query: 81  SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ- 139
           ++N    N +  P+ G    F            + IG P V +   +D GS+L+W  C+ 
Sbjct: 88  ASNPDDTNNIKAPTHGGSGEFL---------MELSIGNPAVKYAAIVDTGSDLIWTQCKP 138

Query: 140 CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIAD 197
           C +C           D+    +DP  SSS   V CS  LC +  RS+C   KD C Y+  
Sbjct: 139 CTEC----------FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYT 188

Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGL 256
           Y  + +S+ G L  +            ++S+ S +  GCG +  G   DG +   G++GL
Sbjct: 189 YG-DYSSTRGLLATETFTFED------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGL 237

Query: 257 GLGDVSVPSLLAKAGLIQNSFSICF----DENDSGSVFFGD-------------QGPATQ 299
           G G +S+ S L      +  FS C     D   S S+F G               G  T 
Sbjct: 238 GRGPLSLISQLK-----ETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVT- 291

Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTE 349
           ++ S L   ++   Y++ ++   +G   L+  +S F+         ++DSG + T+L   
Sbjct: 292 KTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEET 351

Query: 350 IYAEVVVKFDKLVSSKRISLQGNSWKYCYN-ASSEEMLKVPDMRLIF---SKNQSFVVRN 405
            +  +  +F   +S             C+   ++ + + VP  +LIF     +      N
Sbjct: 352 AFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVP--KLIFHFKGADLELPGEN 409

Query: 406 HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           ++ +   +    V CL + S++G   I G        ++ D E   + +  ++C ++
Sbjct: 410 YMVA---DSSTGVLCLAMGSSNG-MSIFGNVQQQNFNVLHDLEKETVTFVPTECGKL 462


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 95/374 (25%), Positives = 154/374 (41%), Gaps = 63/374 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS-EYDPSSSSSSKN 171
           + IG P++  LV +D GS++LW+ C  C  C           D +L   +DPS SS+   
Sbjct: 105 LSIGQPSIPQLVVMDTGSDILWIMCNPCTNC-----------DNHLGLLFDPSMSSTFS- 152

Query: 172 VSCSHPLCKSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
                PLCK+    K  K DP P+   Y  +++S+SG    DIL    F      +S  S
Sbjct: 153 -----PLCKTPCGFKGCKCDPIPFTISY-VDNSSASGTFGRDIL---VFETTDEGTSQIS 203

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----ND 285
            VIIGCG      +      +G++GL  G    P+ LA    I   FS C         +
Sbjct: 204 DVIIGCGHNI--GFNSDPGYNGILGLNNG----PNSLATQ--IGRKFSYCIGNLADPYYN 255

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL---------TQSGFQA 335
              +  G+       ST F    E Y   Y+V +E   +G   L          ++G   
Sbjct: 256 YNQLRLGEGADLEGYSTPF----EVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGG 311

Query: 336 LV-DSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYC-YNASSEEMLKVPDM 391
           ++ DSG + T+L    +  +  +   L+  S +++  +   WK C Y   S +++  P +
Sbjct: 312 VILDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVV 371

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV-----MSTDGDYGIIGQNFMMGHRIVFD 446
              F       +    F F + +   +FC+TV     ++T     +IG      + + +D
Sbjct: 372 TFHFVDGADLALDTGSF-FSQRD--DIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYD 428

Query: 447 RENLKLAWSHSKCE 460
             N  + +    CE
Sbjct: 429 LVNQFVYFQRIDCE 442


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 94/408 (23%), Positives = 165/408 (40%), Gaps = 73/408 (17%)

Query: 88  NQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPL 146
           N +  P+ G    F            + IG P V +   +D GS+L+W  C+ C +C   
Sbjct: 94  NNIKAPTHGGSGEFL---------MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTEC--- 141

Query: 147 SASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTS 204
                   D+    +DP  SSS   V CS  LC +  RS+C   KD C Y+  Y  + +S
Sbjct: 142 -------FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYG-DYSS 193

Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSV 263
           + G L  +            ++S+ S +  GCG +  G   DG +   G++GLG G +S+
Sbjct: 194 TRGLLATETFTFED------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGLGRGPLSL 243

Query: 264 PSLLAKAGLIQNSFSICF----DENDSGSVFFGD-------------QGPATQQSTSFLP 306
            S L      +  FS C     D   S S+F G               G  T ++ S L 
Sbjct: 244 ISQLK-----ETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVT-KTMSLLR 297

Query: 307 IGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVV 356
             ++   Y++ ++   +G   L+  +S F+         ++DSG + T+L    +  +  
Sbjct: 298 NPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKE 357

Query: 357 KFDKLVSSKRISLQGNSWKYCYN-ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENE 414
           +F   +S             C+    + + + VP M   F   +      N++ +   + 
Sbjct: 358 EFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVA---DS 414

Query: 415 GFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
              V CL + S++G   I G        ++ D E   +++  ++C ++
Sbjct: 415 STGVLCLAMGSSNG-MSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/349 (23%), Positives = 137/349 (39%), Gaps = 42/349 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P     Y   ++    +DP+ SS+  NVS
Sbjct: 184 VGLGTPVSRYTVVFDTGSDTTWV-----QCQPCVVVCYEQREK---LFDPARSSTYANVS 235

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C   +        C Y   Y  + + S G+   D L L+S+              
Sbjct: 236 CAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 287

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFFG 292
            GCG +  G + + A   G++GLG G  S+P     K G +   F+ C     +G+ +  
Sbjct: 288 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLD 341

Query: 293 DQGPATQQSTSFLPIGEKYDA----YFVGVESYCIGNSCLT--QSGFQ---ALVDSGASF 343
               +   +++ L      D     Y+VG+    +G   L+  QS F     +VDSG   
Sbjct: 342 FGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVI 401

Query: 344 TFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           T LP   Y+ +   F   ++++       +SL       CY+ +    + +P + L+F  
Sbjct: 402 TRLPPAAYSSLRYAFAAAMAARGYKKAPAVSL----LDTCYDFTGMSQVAIPTVSLLFQG 457

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
                V      +  +              GD GI+G   +    + +D
Sbjct: 458 GARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYD 506


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 166/434 (38%), Gaps = 78/434 (17%)

Query: 63  ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW-IDIGTPNV 121
           ELL  +  +    R K   +N ++  +    S+ S     G+    L Y   + +G+P +
Sbjct: 87  ELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAM 146

Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
           +  V +D GS++ WV C+ C   +P  A          + +DP++SS+    +CS   C 
Sbjct: 147 TQRVVIDTGSDVSWVQCEPCPAPSPCHA-------HAGALFDPAASSTYAAFNCSAAACA 199

Query: 181 ------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
                   + C + K  C YI  Y  + ++++G    D+L L+        S V      
Sbjct: 200 QLGDSGEANGCDA-KSRCQYIVKYG-DGSNTTGTYSSDVLTLSG-------SDVVRGFQF 250

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGSVFFGD 293
           GC   + G+ +D    DG++GLG GD    SL+++ A     SFS C             
Sbjct: 251 GCSHAELGAGMDDKT-DGLIGLG-GDAQ--SLVSQTAARYGKSFSYCL------------ 294

Query: 294 QGPATQQSTSFLPIGEKYDA----------------------YFVGVESYCIGNS--CLT 329
             PAT  S+ FL +G                           YF  +E   +G     L+
Sbjct: 295 --PATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS 352

Query: 330 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
            S F A  LVDSG   T LP   YA +   F   ++    +        C+N +  + + 
Sbjct: 353 PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVS 412

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVF 445
           +P + L+F+      +  H          +  CL    T  D  +G IG        +++
Sbjct: 413 IPTVALVFAGGAVVDLDAHGI-------VSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLY 465

Query: 446 DRENLKLAWSHSKC 459
           D       +    C
Sbjct: 466 DVGGGVFGFRAGAC 479


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/381 (23%), Positives = 146/381 (38%), Gaps = 70/381 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + +GTP     + LD GS+++W     IQCAP    Y     +    +DP  S S 
Sbjct: 147 YFTRLGVGTPPKYVYMVLDTGSDVVW-----IQCAPCRKCY----SQTDPVFDPKKSGSF 197

Query: 170 KNVSCSHPL--------CKSRSSC---KSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
            ++SC  PL        C SR SC    +  D      ++STE  +  G  V        
Sbjct: 198 SSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTRV-------- 249

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL-IQNSF 277
                        V +GCG    G ++  A   G+           S   + GL     F
Sbjct: 250 -----------PKVALGCGHDNEGLFVGAAGLLGLG------RGRLSFPTQTGLRFGRKF 292

Query: 278 SICFDENDS----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SC 327
           S C  +  +     SV FG    A  ++  F P+    K D  Y++ +    +G    + 
Sbjct: 293 SYCLVDRSASSKPSSVVFGQS--AVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAG 350

Query: 328 LTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
           +T S F+         ++DSG S T L    Y  +   F    +  + +   + +  C++
Sbjct: 351 ITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFD 410

Query: 380 ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 438
            S +  +KVP + + F   + S    N++     N    VFC     T     IIG    
Sbjct: 411 LSGKTEVKVPTVVMHFRGADVSLPATNYLIPVDTNG---VFCFAFAGTMSGLSIIGNIQQ 467

Query: 439 MGHRIVFDRENLKLAWSHSKC 459
            G R+VFD    ++ ++   C
Sbjct: 468 QGFRVVFDVAASRIGFAARGC 488


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 86/370 (23%), Positives = 142/370 (38%), Gaps = 50/370 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I +G+P  +  + +D+GS+++WV     QC P S  Y  S       +DP+ SSS 
Sbjct: 143 YFVRIGVGSPPRNQYMVIDSGSDIVWV-----QCKPCSRCYQQS----DPVFDPADSSSF 193

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             VSC   +C    +       C Y   Y  + + + G L  + L +           + 
Sbjct: 194 AGVSCGSDVCDRLENTGCNAGRCRYEVSYG-DGSYTKGTLALETLTVGQV--------MI 244

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
             V IGCG    G ++  A   G+        S+  +    G    +FS C     +GS 
Sbjct: 245 RDVAIGCGHTNQGMFIGAAGLLGLG-----GGSMSFIGQLGGQTGGAFSYCLVSRGTGST 299

Query: 290 FFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSC---------LTQ 330
                  A +     LP+G  + +          Y++G+    +G            LT+
Sbjct: 300 ------GALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTE 353

Query: 331 SGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
            G   +V D+G + T  PT  Y      F    S+   +   + +  CY+ +  E ++VP
Sbjct: 354 YGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP 413

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
            +   FS      +    F  P + G T FCL    +     IIG     G +I FD  N
Sbjct: 414 TVSFYFSDGPVLTLPARNFLIPVDGGGT-FCLAFAPSPSGLSIIGNIQQEGIQISFDGAN 472

Query: 450 LKLAWSHSKC 459
             + +  + C
Sbjct: 473 GFVGFGPNIC 482


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 107/414 (25%), Positives = 167/414 (40%), Gaps = 97/414 (23%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +  GTP  +F   LD GS+L+W+PC     C +C   S       + N  ++ P  S SS
Sbjct: 220 LKFGTPPQTFPFVLDTGSSLVWLPCYSHYLCSKCNSFS-------NNNTPKFIPKDSFSS 272

Query: 170 KNVSCSHPLCK-------SRSSCKSLK----------DPCP-YIADYSTEDTSSSGYLVD 211
           K V C +P C        +   CK  K            CP Y   Y     S++G+L+ 
Sbjct: 273 KFVGCRNPKCAWVFGSDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGL--GSTAGFLLS 330

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
           + L+        P  +V S  ++GC      S +    P G+ G G G+ S+P   A+  
Sbjct: 331 ENLNF-------PAKNV-SDFLVGC------SVVSVYQPGGIAGFGRGEESLP---AQMN 373

Query: 272 LIQNSFSIC-----FDENDSGSVFFGD-----QGPATQ--QSTSFL--PIGEK--YDAYF 315
           L +  FS C     FDE+   S    +     +G  T     T+FL  P  +K  + AY+
Sbjct: 374 LTR--FSYCLLSHQFDESPENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYY 431

Query: 316 --------VGVESYCIGNSCLT-----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
                   VG +   +    L        GF  +VDSG++ TF+   I+  V  +F K V
Sbjct: 432 YITLRKIVVGEKRVRVPRRMLEPDVNGDGGF--IVDSGSTLTFMERPIFDLVAEEFVKQV 489

Query: 363 SSKRISLQGNSWKY--CYN-ASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEGFT 417
           +  R       +    C+  A   E    P+MR  F         V N+     + +   
Sbjct: 490 NYTRARELEKQFGLSPCFVLAGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGD--- 546

Query: 418 VFCLTVMSTD--GDYGIIGQNFMMGH------RIVFDRENLKLAWSHSKCEEVI 463
           V CLT++S D  G  G +G   ++G+       +  D EN +  +    C++ +
Sbjct: 547 VACLTIVSDDVAGQGGAVGPAVILGNYQQQNFYVECDLENERFGFRSQSCQKRV 600


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 106/430 (24%), Positives = 168/430 (39%), Gaps = 61/430 (14%)

Query: 65  LLSNDWKRQKTRVK-LQSNNNSSR----NQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
           L+    +R K R   L +  N +R    N+   P+        G+  Y +    + IGTP
Sbjct: 49  LIRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVD---LAIGTP 105

Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
                  LD GS+L+W   QC  CA       + L +    + P  S+S + + C+  LC
Sbjct: 106 PQPVSALLDTGSDLIWT--QCAPCA-------SCLSQPDPLFAPGQSASYEPMRCAGTLC 156

Query: 180 KS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
                 SC+   D C Y  +Y  + T + G    +    AS S     ++    +  GCG
Sbjct: 157 SDILHHSCER-PDTCTYRYNYG-DGTMTVGVYATERFTFAS-SGGGGLTTTTVPLGFGCG 213

Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--------GSV 289
               GS  +G+   G++G G   +S+ S L+        FS C     S        GS+
Sbjct: 214 SVNVGSLNNGS---GIVGFGRNPLSLVSQLSI-----RRFSYCLTSYASRRQSTLLFGSL 265

Query: 290 FFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALV 337
             G  G AT   Q+T  L   +    Y+V      +G   L   +S F          +V
Sbjct: 266 SDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIV 325

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY-------NASSEEMLKVPD 390
           DSG + T LP  + AEVV  F + +     +        C+        +SS   + VP 
Sbjct: 326 DSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPR 385

Query: 391 MRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
           M L F   +     RN++    ++      CL +  +  D   IG       R+++D E 
Sbjct: 386 MVLHFQGADLDLPRRNYVL---DDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEA 442

Query: 450 LKLAWSHSKC 459
             L+ + ++C
Sbjct: 443 ETLSIAPARC 452


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 88/386 (22%), Positives = 153/386 (39%), Gaps = 65/386 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP  +  + LD GS L W+ C                    + +DPS SSS   + 
Sbjct: 84  LPIGTPPQTQQMVLDTGSQLSWIQCH--------KKSVPKKPPPTTSFDPSLSSSFSVLP 135

Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           C+HPLCK R    +L   C      + + +  + T + G LV + +  +S     P    
Sbjct: 136 CNHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPP---- 191

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-------------VPSLLAKAGLIQN 275
              +I+GC    T          G++G+ LG  S             VP+  A+AGL   
Sbjct: 192 ---LILGCAEASTDE-------KGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSST 241

Query: 276 SFSICFDENDSGSVFFGDQGPAT--QQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 333
                 +  +SG   + +    T  Q+S +  P+     AY + ++   +GN+ L  S  
Sbjct: 242 GSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPL-----AYTIPMQGIRMGNARLNISAT 296

Query: 334 ----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNAS 381
                     Q ++DSG+ FT+L  E Y +V  +  +LV    K+  + G     C++ +
Sbjct: 297 LFRPDPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDMCFDGN 356

Query: 382 SEEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQNF 437
             E+ + + +M   F K    V+    +    + G  V C+ +  ++       IIG   
Sbjct: 357 PMEIGRLIGNMVFEFEKGVEIVIDK--WRVLADVGGGVHCIGIGRSEMLGAASNIIGNFH 414

Query: 438 MMGHRIVFDRENLKLAWSHSKCEEVI 463
                + +D  N ++    + C   +
Sbjct: 415 QQNLWVEYDLANRRIGLGKADCSRSV 440


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 98/404 (24%), Positives = 165/404 (40%), Gaps = 88/404 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP  +  + +D GS L W+ C     A +   +          ++P+ SSS   +S
Sbjct: 70  ITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPYPF----------FNPNISSSYTPIS 119

Query: 174 CSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           CS P C +R+       SC S  + C     Y+ + +SS G L  D             S
Sbjct: 120 CSSPTCTTRTRDFPIPASCDS-NNLCHATLSYA-DASSSEGNLASDTFGFG--------S 169

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
           S    ++ GC      SY   +  D    G+MG+ LG +S+ S L         FS C  
Sbjct: 170 SFNPGIVFGC---MNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIP-----KFSYCIS 221

Query: 283 END-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 331
            +D SG +  G+            P  Q ST  LP  ++  AY V +E   I +  L  S
Sbjct: 222 GSDFSGILLLGESNFSWGGSLNYTPLVQISTP-LPYFDR-SAYTVRLEGIKISDKLLNIS 279

Query: 332 G----------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY----- 376
           G           Q + D G  F++L   +Y  +  +F    +    +L   ++ +     
Sbjct: 280 GNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMD 339

Query: 377 -CYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF-----TVFCLTVMSTDG 428
            CY    +  E+ ++P + L+F   +  V  + +       GF     +V+C T  ++D 
Sbjct: 340 LCYRVPVNQSELPELPSVSLVFEGAEMRVFGDQLLY--RVPGFVWGNDSVYCFTFGNSD- 396

Query: 429 DYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKCEEVIDK 465
              ++G + F++GH       + FD    ++  +H++C+ V  K
Sbjct: 397 ---LLGVEAFIIGHHHQQSMWMEFDLVEHRVGLAHARCDLVGQK 437


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 89/367 (24%), Positives = 147/367 (40%), Gaps = 57/367 (15%)

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
            +GTP  + L+ALD   +  W+PC+ C+ C   S++ + ++           S++ K + 
Sbjct: 40  KVGTPPQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVK----------STTFKTLG 86

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C  P CK   +       C +   Y +    S+  L  D + L+      P  +      
Sbjct: 87  CGAPQCKQVPNPICGGSTCTWNTTYGSSTILSN--LTRDTIALS--MDPVPYYA------ 136

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSV 289
            GC +K TGS +    P G++G G G +S   L     L +++FS C       N SGS+
Sbjct: 137 FGCIQKATGSSVP---PQGLLGFGRGPLSF--LSQTQNLYKSTFSYCLPSFRTLNFSGSL 191

Query: 290 FFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVD 338
             G  G P   ++T  L    +   Y+V +    +G   +            +G   + D
Sbjct: 192 RLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFD 251

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS-K 397
           SG  FT L    Y  V  +F K V +  +S  G  +  CY+      +  P +  +FS  
Sbjct: 252 SGTVFTRLVAPAYIAVRNEFRKRVGNATVSSLGG-FDTCYSVP----IVPPTITFMFSGM 306

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDRENLKLA 453
           N +    N +       G T  CL + +   +      +I       HRI+FD  N +L 
Sbjct: 307 NVTMPPENLLIH--STAGVTS-CLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLG 363

Query: 454 WSHSKCE 460
            +  +C 
Sbjct: 364 VAREQCS 370


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 84/370 (22%), Positives = 149/370 (40%), Gaps = 59/370 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           +  GTP+V  ++ +D GS++ WV   PC   +C P          +    +DPS SS+  
Sbjct: 135 LGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYP----------QKDPLFDPSKSSTYA 184

Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
            ++C+   C+       + C S    C Y  +Y+ + + S G   ++ L L      AP 
Sbjct: 185 PIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYA-DGSHSRGVYSNETLTL------APG 237

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +V+     GCGR Q G        DG++GLG   VS+  ++  + +   +FS C    +
Sbjct: 238 ITVE-DFHFGCGRDQRGP---SDKYDGLLGLGGAPVSL--VVQTSSVYGGAFSYCLPALN 291

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKY-----DAYFVGVESYCIGNSCL--TQSGFQA--L 336
           S + F     P +   ++F+    ++       Y V +    +G   L   QS F+   +
Sbjct: 292 SEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMI 351

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +DSG   T LP   Y  +     K + +  + +  + +  CYN +    + VP +   FS
Sbjct: 352 IDSGTVDTELPETAYNALEAALRKALKAYPL-VPSDDFDTCYNFTGYSNITVPRVAFTFS 410

Query: 397 KNQSF-------VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
              +        ++ N   +F E+             D   GIIG        +++D   
Sbjct: 411 GGATIDLDVPNGILVNDCLAFQES-----------GPDDGLGIIGNVNQRTLEVLYDAGR 459

Query: 450 LKLAWSHSKC 459
             + +    C
Sbjct: 460 GNVGFRAGAC 469


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 84/364 (23%), Positives = 148/364 (40%), Gaps = 42/364 (11%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP V  L   D  S+L+WV C  C  C P          ++   ++P  SS+  N+SC
Sbjct: 96  IGTPPVERLAIADTASDLIWVQCSPCETCFP----------QDTPLFEPHKSSTFANLSC 145

Query: 175 SHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
               C S +   C  + + C Y   Y  + +S+ G L  + +H  S +   P++      
Sbjct: 146 DSQPCTSSNIYYCPLVGNLCLYTNTYG-DGSSTKGVLCTESIHFGSQTVTFPKT------ 198

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSV 289
           I GCG      +       G++GLG G +S+ S L     I + FS C   F    +  +
Sbjct: 199 IFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKL 256

Query: 290 FFGDQGPATQQSTSFLP--IGEKYDA-YFVGVESYCIGNSCLT-----QSGFQALVDSGA 341
            FG+    T       P  I   Y + YF+ +    IG   L       +    ++D G 
Sbjct: 257 KFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGT 316

Query: 342 SFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
             T+L    Y   V    + L  S+        + +C+   ++  +  P +   F+  + 
Sbjct: 317 VLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCF--PNQANITFPKIVFQFTGAKV 374

Query: 401 FVV-RNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
           F+  +N  F F   +   + CL V+       + + G    +  ++ +DR+  K++++ +
Sbjct: 375 FLSPKNLFFRF---DDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPA 431

Query: 458 KCEE 461
            C +
Sbjct: 432 DCSK 435


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/377 (24%), Positives = 146/377 (38%), Gaps = 55/377 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP   F +  D GS+L WV  +C   +P               + P +S S 
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTWV--KCAGASPPG-----------RVFRPKTSRSW 162

Query: 170 KNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
             + CS   CK     + ++C S   PC Y  DY  ++ S+       I+   S +   P
Sbjct: 163 APIPCSSDTCKLDVPFTLANCSSPASPCTY--DYRYKEGSAG---ARGIVGTESATIALP 217

Query: 225 QSSVQ--SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
              V     V++GC     G     A  DGV+ LG   +S  +    A     SFS C  
Sbjct: 218 GGKVAQLKDVVLGCSSSHDGQSFRSA--DGVLSLGNAKISFAT--QAAARFGGSFSYCLV 273

Query: 282 ----DENDSGSVFFG----DQGPATQQSTSFLP----IGEKYDAYFVGVESYCIGNSCLT 329
                 N +G + FG     + PATQ      P     G K DA  V  ++  I      
Sbjct: 274 DHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWD 333

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSEEMLK- 387
                 ++DSG + T L    Y  VV    K L    ++S     +++CYN ++      
Sbjct: 334 AKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSFP--PFEHCYNWTARRPGAP 391

Query: 388 --VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GIIGQNFMMGHR 442
             +P + + F+ +         +      G  V C+ V   +G++    +IG      H 
Sbjct: 392 EIIPKLAVQFAGSARLEPPAKSYVIDVKPG--VKCIGVQ--EGEWPGLSVIGNIMQQEHL 447

Query: 443 IVFDRENLKLAWSHSKC 459
             FD +N+++ +  S C
Sbjct: 448 WEFDLKNMQVRFKQSNC 464


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 96/406 (23%), Positives = 161/406 (39%), Gaps = 83/406 (20%)

Query: 107 YWLHYTWIDIGTP--NVSFLVALDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSE 160
           Y  H   +  GTP   +SFLV  D GS+++W PC     C  C     S+  +  + +  
Sbjct: 75  YGGHSISLSFGTPPQKLSFLV--DTGSDVVWAPCTTDYTCTNC-----SFSAADPKKVPI 127

Query: 161 YDPSSSSSSKNVSCSHPLCKSR---------SSCKSLKDPCPYIADYSTE--DTSSSGYL 209
           +DP  SSSSK + C +P C S            C      C Y   YST+    +SSGY 
Sbjct: 128 FDPKLSSSSKILDCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYF 187

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
           + + L         P+ +++ + ++GC    T S     + D + G G    S+P  +  
Sbjct: 188 LLENLKF-------PRKTIR-NFLLGC----TTSAARELSSDALAGFGRSMFSLPIQMG- 234

Query: 270 AGLIQNSFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYDA----YFVGVE 319
                  F+ C + +D      SG +   D      +  S+ P  +   A    Y +GV+
Sbjct: 235 ----VKKFAYCLNSHDYDDTRNSGKLIL-DYRDGKTKGLSYTPFLKSPPASAFYYHLGVK 289

Query: 320 SYCIGNSCLT------------QSGFQALVDSG-ASFTFLPTEIYAEVVVKFDKLVSSKR 366
              IGN  L             +SG   ++DSG     ++   ++  V  +  K +S  R
Sbjct: 290 DIKIGNKLLRIPSKYLAPGSDGRSG--VIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYR 347

Query: 367 ISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLT 422
            SL+  +      CYN +  + +K+P +   F    + VV   + F     E    F   
Sbjct: 348 RSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACF--- 404

Query: 423 VMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +M T+G            I+G +  + + + +D +N +  +    C
Sbjct: 405 LMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 450


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 102/425 (24%), Positives = 180/425 (42%), Gaps = 75/425 (17%)

Query: 71  KRQKTRVK-------LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
           KR K+R++         S+   S +QL  P         GN  Y +    + IGTP VS+
Sbjct: 71  KRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHA------GNGEYLIE---LAIGTPPVSY 121

Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
              LD GS+L+W      QC P +  Y     +    +DP  SSS   VSC   LC +  
Sbjct: 122 PAVLDTGSDLIWT-----QCKPCTRCY----KQPTPIFDPKKSSSFSKVSCGSSLCSALP 172

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
           S  +  D C Y+  Y  + + + G L  +     +F K   + SV  ++  GCG    G 
Sbjct: 173 S-STCSDGCEYVYSYG-DYSMTQGVLATETF---TFGKSKNKVSVH-NIGFGCGEDNEGD 226

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQ 300
             + A+  G++GLG G +S+ S L      +  FS C    D+     +  G  G     
Sbjct: 227 GFEQAS--GLVGLGRGPLSLVSQLK-----EQRFSYCLTPIDDTKESVLLLGSLGKVKDA 279

Query: 301 ----STSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFL 346
               +T  L    +   Y++ +E+  +G++ L+  +S F+         ++DSG + T++
Sbjct: 280 KEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYV 339

Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYN-ASSEEMLKVPDMRLIFSKNQ-SF 401
             + Y  +  +F   +S  +++L   S      C++  S    +++P +   F       
Sbjct: 340 QQKAYEALKKEF---ISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGDLEL 396

Query: 402 VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGHRIVFDRENLKLAWSHS 457
              N++     +    V CL + ++ G   I G    QN ++ H    D E   +++  +
Sbjct: 397 PAENYMIG---DSNLGVACLAMGASSG-MSIFGNVQQQNILVNH----DLEKETISFVPT 448

Query: 458 KCEEV 462
            C+++
Sbjct: 449 SCDQL 453


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 90/370 (24%), Positives = 154/370 (41%), Gaps = 51/370 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
           + T + +GTP+ S+ + +D GS+L W     +QC+P       S  R +   +DP +SS+
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTW-----LQCSPC----VVSCHRQVGPLFDPRASST 184

Query: 169 SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
             +V CS   C        + S+C S  + C Y A Y  + + S G L  D +       
Sbjct: 185 YASVRCSASQCDELQAATLNPSAC-SASNVCIYQASYG-DSSFSVGSLSTDTVSFG---- 238

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
               S+   S   GCG+   G +   A   G++GL    +S+   LA +  +  SFS C 
Sbjct: 239 ----STRYPSFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSYCL 289

Query: 282 DENDSGSVFFGDQGP-ATQQSTSFLPIG-EKYDA--YFVGVESYCIGNSCLT-----QSG 332
               + S  +   GP  T    S+ P+     DA  YF+ +    +G S L       S 
Sbjct: 290 PT--AASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSS 347

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG   T LPT ++  +     + ++  + +   +    C+   + + L+VP + 
Sbjct: 348 LPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVA 406

Query: 393 LIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           + F+   S     RN +    ++      CL    TD    IIG        +++D    
Sbjct: 407 MAFAGGASMKLTTRNVLIDVDDS----TTCLAFAPTD-STAIIGNTQQQTFSVIYDVAQS 461

Query: 451 KLAWSHSKCE 460
           ++ +S   C 
Sbjct: 462 RIGFSAGGCS 471


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 147/369 (39%), Gaps = 34/369 (9%)

Query: 102 FGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS 159
            G     L Y   + IG+P V+  +++D GS++ WV C+ C QC       ++ +D   S
Sbjct: 113 LGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQC-------HSEVD---S 162

Query: 160 EYDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 215
            +DPSSSS+    SCS   C    +S+     +   C YI +Y    +++      D L 
Sbjct: 163 LFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTG-TYSSDTLT 221

Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
           L         SS  +    GC + ++G + D    DG+MGLG G  S+ S    AG    
Sbjct: 222 LG--------SSAMTDFQFGCSQSESGGFND--QTDGLMGLGGGAQSLAS--QTAGTFGT 269

Query: 276 SFSICFDENDSGSVFFG-DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSG 332
           +FS C       S F     G +    T  L   +    Y V +ES  +G+  L    S 
Sbjct: 270 AFSYCLPPTSGSSGFLTLGTGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSV 329

Query: 333 FQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
           F A  L+DSG   T LP   Y+ +   F   +     +        C++ S +  + +P 
Sbjct: 330 FSAGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPT 389

Query: 391 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           + L+FS   +  +         +        T    D   GIIG        +++D    
Sbjct: 390 VTLVFSGGAAVDLAFDGIMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGG 449

Query: 451 KLAWSHSKC 459
            + +    C
Sbjct: 450 AVGFKAGAC 458


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 83/375 (22%), Positives = 155/375 (41%), Gaps = 54/375 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IG+P  SF   +D GS+L+W  C+ C QC           D++   +DP  SSS   +
Sbjct: 115 LAIGSPPRSFSAIMDTGSDLIWTQCKPCQQC----------FDQSTPIFDPKQSSSFYKI 164

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           SCS  LC +  +     D C Y+  Y  + +S+ G L  +     +F            +
Sbjct: 165 SCSSELCGALPTSTCSSDGCEYLYTYG-DSSSTQGVLAFETF---TFGDSTEDQISIPGL 220

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSV 289
             GCG    G      A  G++GLG G +S+ S L      +  F+ C    D++   S+
Sbjct: 221 GFGCGNDNNGDGFSQGA--GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDDSKPSSL 273

Query: 290 FFGDQGPAT-------QQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------ 334
             G     T        ++T  +    +   Y++ ++   +G + L+  +S F+      
Sbjct: 274 LLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGS 333

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN-ASSEEMLKVPDM 391
              ++DSG + T++    +  +  +F   ++             C+N  +    ++VP +
Sbjct: 334 GGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKL 393

Query: 392 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGHRIVFDR 447
              F K     +    +   +++   + CL + S+ G   I G    QNFM    +V D 
Sbjct: 394 TFHF-KGADLELPGENYMIGDSKA-GLLCLAIGSSRG-MSIFGNLQQQNFM----VVHDL 446

Query: 448 ENLKLAWSHSKCEEV 462
           +   L++  ++C+ +
Sbjct: 447 QEETLSFLPTQCDSI 461


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 91/367 (24%), Positives = 157/367 (42%), Gaps = 47/367 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +++ + IG+P     + +D GS++ WV     QCAP +  Y     +    ++PS SSS 
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNWV-----QCAPCADCY----QQADPIFEPSFSSSY 205

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             ++C    CKS    +   D C Y   Y         Y V D    A+ +     S+  
Sbjct: 206 APLTCETHQCKSLDVSECRNDSCLYEVSY-----GDGSYTVGD---FATETITLDGSASL 257

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 286
           ++V IGCG    G ++  A    ++GLG G +S PS +  +     SFS C    D + +
Sbjct: 258 NNVAIGCGHDNEGLFVGAAG---LLGLGGGSLSFPSQINAS-----SFSYCLVNRDTDSA 309

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--------L 336
            ++ F    P+   +   L   +    Y++G+    +G   L+  +S F+         +
Sbjct: 310 STLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGII 369

Query: 337 VDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
           VDSG + T L +++Y  +   F +    L S+  ++L    +  CY+ SS   ++VP + 
Sbjct: 370 VDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVAL----FDTCYDLSSRSSVEVPTVS 425

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             F   +   +    +  P +   T FC     T     IIG     G R+ +D  N  +
Sbjct: 426 FHFPDGKYLALPAKNYLIPVDSAGT-FCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLV 484

Query: 453 AWSHSKC 459
            +S + C
Sbjct: 485 GFSPNGC 491


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 96/395 (24%), Positives = 158/395 (40%), Gaps = 58/395 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE--YDPSSSS 167
           ++    +GTP   FL+  D GS+L WV C+  + A  S +  +S   +     + P  S 
Sbjct: 95  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154

Query: 168 SSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD------ILHL 216
           +   + C+   C      S S+C +   PC Y  DY  +D S++   V        +   
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAY--DYRYKDGSAARGTVGTESATIALSSS 212

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
           +S SK+  + +    +++GC    TG   +  A DGV+ LG  +VS  S    A      
Sbjct: 213 SSSSKNKVKKAKLQGLVLGCTGSYTGPSFE--ASDGVLSLGYSNVSFAS--HAASRFGGR 268

Query: 277 FSICF-----DENDSGSVFFGDQ-----------GPATQQSTSFLPIGEKYDAYFVGVES 320
           FS C        N +  + FG             GP  +Q T  +        Y V +++
Sbjct: 269 FSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQ-TPLVLDSRMRPFYDVSIKA 327

Query: 321 YCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQG 371
             +    L           G   +VDSG S T L    Y  VV     KL    R+++  
Sbjct: 328 ISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAM-- 385

Query: 372 NSWKYCYNASS----EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 427
           + ++YCYN +S    +E   +P + + F+ +      +  +      G  V C+ V   +
Sbjct: 386 DPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPG--VKCIGVQ--E 441

Query: 428 GDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           G +    +IG      H   FD +N +L +  S+C
Sbjct: 442 GPWPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 86/372 (23%), Positives = 148/372 (39%), Gaps = 51/372 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + +GTP     + LD GS+++W     +QCAP    Y  S       +DP  S + 
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVW-----LQCAPCRRCYSQS----DPIFDPRKSKTY 192

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             + CS P C+   S  C + +  C Y   Y     +   +  + +    +F ++  +  
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETL----TFRRNRVK-- 246

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDENDS 286
               V +GCG    G ++  A   G+           S   + G      FS C  +  +
Sbjct: 247 ---GVALGCGHDNEGLFVGAAGLLGLG------KGKLSFPGQTGHRFNQKFSYCLVDRSA 297

Query: 287 ----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ-- 334
                SV FG+   A  +   F P+    K D  Y+VG+    +G +    +T S F+  
Sbjct: 298 SSKPSSVVFGNA--AVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLD 355

Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
                  ++DSG S T L    Y  +   F     + + +   + +  C++ S+   +KV
Sbjct: 356 QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKV 415

Query: 389 PDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           P + L F   + S    N++     N     FC     T G   IIG     G R+V+D 
Sbjct: 416 PTVVLHFRGADVSLPATNYLIPVDTNGK---FCFAFAGTMGGLSIIGNIQQQGFRVVYDL 472

Query: 448 ENLKLAWSHSKC 459
            + ++ ++   C
Sbjct: 473 ASSRVGFAPGGC 484


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 85/356 (23%), Positives = 143/356 (40%), Gaps = 49/356 (13%)

Query: 56  KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
           K  V+Y+   LS +  +  +  +L S    +++  L  S        GN     ++  + 
Sbjct: 105 KERVKYINSRLSKNLGQDSSVEELDSATLPAKSGSLIGS--------GN-----YFVVVG 151

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP     +  D GS+L W      QC P + S Y   D     +DPS S+S  N++C+
Sbjct: 152 LGTPKRDLSLIFDTGSDLTWT-----QCEPCARSCYKQQD---VIFDPSKSTSYSNITCT 203

Query: 176 HPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
             LC   S+       C +    C Y   Y  + + S GY   + L + +       + V
Sbjct: 204 SALCTQLSTATGNDPGCSASTKACIYGIQYG-DSSFSVGYFSRERLTVTA-------TDV 255

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-- 286
             + + GCG+   G +   A   G++GLG   +S   +   A   +  FS C     S  
Sbjct: 256 VDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAKYRKIFSYCLPSTSSST 310

Query: 287 GSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCL-----TQSGFQALVDSG 340
           G + FG        + T F  I      Y + + +  +G   L     T S   A++DSG
Sbjct: 311 GHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAIIDSG 370

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
              T LP   Y  +   F + +S    + + +    CY+ S  ++  +P +   F+
Sbjct: 371 TVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTIEFSFA 426


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 167/389 (42%), Gaps = 64/389 (16%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           H   + IGTP     + +D GS+L+W  C+      ++A + +        YDP  SS+ 
Sbjct: 91  HSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSP-----PVYDPGESSTF 145

Query: 170 KNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
             + CS  LC+    S  +C S K+ C Y      ED   S   V  +L   +F+  A +
Sbjct: 146 AFLPCSDRLCQEGQFSFKNCTS-KNRCVY------EDVYGSAAAV-GVLASETFTFGA-R 196

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FD 282
            +V   +  GCG    GS +      G++GL    +S+ + L     IQ  FS C   F 
Sbjct: 197 RAVSLRLGFGCGALSAGSLIGAT---GILGLSPESLSLITQLK----IQR-FSYCLTPFA 248

Query: 283 ENDSGSVFFGDQGPATQ-------QSTSFLPIGEKYDAYFVGVESYCIGNSCLT------ 329
           +  +  + FG     ++       Q+T+ +    K   Y+V +    +G+  L       
Sbjct: 249 DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASL 308

Query: 330 ----QSGFQALVDSGASFTFLPT---EIYAEVVVKFDKLVSSKRISLQGNSWKYCY---- 378
                 G   +VDSG++  +L     E   E V+   +L  + R       ++ C+    
Sbjct: 309 AMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTV---EDYELCFVLPR 365

Query: 379 --NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTV-MSTDGD-YGII 433
              A++ E ++VP + L F    + V+ R++ F  P      + CL V  +TDG    II
Sbjct: 366 RTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAG---LMCLAVGKTTDGSGVSII 422

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           G        ++FD ++ K +++ ++C+++
Sbjct: 423 GNVQQQNMHVLFDVQHHKFSFAPTQCDQI 451


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 96/422 (22%), Positives = 168/422 (39%), Gaps = 56/422 (13%)

Query: 69  DWKRQKTR------VKLQSNNNSSRNQLLFPS-EGSQTHF---FGNQFYWLHYTWIDIGT 118
           DW R+  +      ++++S  N  R  +   + E SQT      G     L+Y  + +G 
Sbjct: 13  DWNRRLQKQLISDDLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYI-VTMGL 71

Query: 119 PNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHP 177
            + +  V +D GS+L WV C+ C+ C           ++    + PS+SSS ++VSC+  
Sbjct: 72  GSTNMTVIIDTGSDLTWVQCEPCMSC----------YNQQGPIFKPSTSSSYQSVSCNSS 121

Query: 178 LCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            C+S         +C S    C Y+ +Y  + + ++G L  + L     S         S
Sbjct: 122 TCQSLQFATGNTGACGSNPSTCNYVVNYG-DGSYTNGELGVEQLSFGGVSV--------S 172

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSG 287
             + GCGR   G +       G+MGLG   +S+ S           FS C    +   SG
Sbjct: 173 DFVFGCGRNNKGLF---GGVSGLMGLGRSYLSLVS--QTNATFGGVFSYCLPTTESGASG 227

Query: 288 SVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF---QALVDS 339
           S+  G++    +       T  LP  +  + Y + +    +    L    F     L+DS
Sbjct: 228 SLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLIDS 287

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
           G   T LP+ +Y  +   F K  +    +   +    C+N +  + + +P + + F  N 
Sbjct: 288 GTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEGNA 347

Query: 400 SFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
              V      +   E+       L  +S   D  IIG       R+++D +  K+ ++  
Sbjct: 348 ELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEE 407

Query: 458 KC 459
            C
Sbjct: 408 SC 409


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 83/373 (22%), Positives = 154/373 (41%), Gaps = 54/373 (14%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IG+P  SF   +D GS+L+W  C+ C QC           D++   +DP  SSS   +SC
Sbjct: 372 IGSPPRSFSAIMDTGSDLIWTQCKPCQQC----------FDQSTPIFDPKQSSSFYKISC 421

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
           S  LC +  +     D C Y+  Y  + +S+ G L  +     +F            +  
Sbjct: 422 SSELCGALPTSTCSSDGCEYLYTYG-DSSSTQGVLAFETF---TFGDSTEDQISIPGLGF 477

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFF 291
           GCG    G      A  G++GLG G +S+ S L      +  F+ C    D++   S+  
Sbjct: 478 GCGNDNNGDGFSQGA--GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDDSKPSSLLL 530

Query: 292 GDQGPAT-------QQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ-------- 334
           G     T        ++T  +    +   Y++ ++   +G + L+  +S F+        
Sbjct: 531 GSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGG 590

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN-ASSEEMLKVPDMRL 393
            ++DSG + T++    +  +  +F   ++             C+N  +    ++VP +  
Sbjct: 591 VIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTF 650

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGHRIVFDREN 449
            F K     +    +   +++   + CL + S+ G   I G    QNFM    +V D + 
Sbjct: 651 HF-KGADLELPGENYMIGDSKA-GLLCLAIGSSRG-MSIFGNLQQQNFM----VVHDLQE 703

Query: 450 LKLAWSHSKCEEV 462
             L++  ++C+ +
Sbjct: 704 ETLSFLPTQCDSI 716


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/385 (24%), Positives = 147/385 (38%), Gaps = 70/385 (18%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP    L+A+D  ++  WVPC      P +A            ++P+SS++ + V C 
Sbjct: 100 LGTPPQRLLLAVDTSNDAAWVPCAGCHGCPTTA----------PSFNPASSATFRPVPCG 149

Query: 176 HPLCKS--RSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            P C      SC SL   K+ C +   Y   D+S    L  D L + +         V  
Sbjct: 150 APPCSQAPNPSCTSLAKSKNSCGFSLSYG--DSSLDATLSQDNLAVTA------NGGVIK 201

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA-GLIQNSFSICFDE------ 283
               GC  K  GS    AAP    GL          +A+  G+ + +FS C         
Sbjct: 202 GYTFGCLTKSNGS----AAP--AQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAA 255

Query: 284 NDSGSVFFGDQG---PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQ 330
           N SGS+  G +G   P   ++T  L    +   Y+V +    IG   +            
Sbjct: 256 NFSGSLTLGRKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAA 315

Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLV-----------SSKRISLQGNSWKYCYN 379
           +G   ++DSG  F  L    YA V  +  + V           +S  +S  G  +  CYN
Sbjct: 316 TGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGG-FDTCYN 374

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIG 434
            S+   +  P + L+F       +           G T  CL + ++  D       +IG
Sbjct: 375 VST---VAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTS-CLAMAASPADGVNAALNVIG 430

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKC 459
                 HR++FD  N ++ ++  +C
Sbjct: 431 SLQQQNHRVLFDVPNARVGFARERC 455


>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
 gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
          Length = 500

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 83/334 (24%), Positives = 137/334 (41%), Gaps = 69/334 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP-----SSSSS 168
           I+  TP V   + +D G   LWV C+         ++YTS     S Y P     +  S 
Sbjct: 53  INQRTPLVPLNLVVDLGGKFLWVDCE---------NHYTS-----STYRPVRCPSAQCSL 98

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK-HAPQSS 227
           +K+ SC       +  C    + C  I D +   +++ G L +D+L + S S  +  Q+ 
Sbjct: 99  AKSDSCGDCFSSPKPGCN---NTCGLIPDNTITHSATRGDLAEDVLSIQSTSGFNTGQNV 155

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
           V S  +  C        L G A  G+ GLG   +++PS LA A + +  F+ CF  +D G
Sbjct: 156 VVSRFLFSCAPTSLLRGLAGGA-SGMAGLGRTKIALPSQLASAFIFKRKFAFCFSSSD-G 213

Query: 288 SVFFGDQGPAT--------------QQSTSFLPI-------------GEKYDAYFVGVES 320
            + FGD GP +               +S ++ P+             GE    YF+GV++
Sbjct: 214 VIIFGD-GPYSFLADNPSLPNVVFDSKSLTYTPLLINHVSTASAFLQGESSVEYFIGVKT 272

Query: 321 YCI-GNSCLTQSGFQALVDSGAS---------FTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
             I G      S   ++ + G           +T L   IY  V   F K   ++ I+ +
Sbjct: 273 IKIDGKVVSLNSSLLSIDNKGVGGTKISTVDPYTVLEASIYKAVTDAFVKASVARNITTE 332

Query: 371 GNS--WKYCYN----ASSEEMLKVPDMRLIFSKN 398
            +S  +++CY+      +     VP + L+   N
Sbjct: 333 DSSPPFEFCYSFDNLPGTPLGASVPTIELLLQNN 366


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 86/397 (21%), Positives = 157/397 (39%), Gaps = 83/397 (20%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPS--------SSS 167
           +GTP     + LD GS+L+W PC       +  + YT  +   S  DP+         SS
Sbjct: 80  LGTPPQKVSLVLDTGSSLVWTPCT------IPTATYTCQNCTFSGVDPTKIPIYARNKSS 133

Query: 168 SSKNVSCSHPLCK----SRSSCKSLKDPCPYIA-DYSTEDTSSSGYLVDDILHLASFSKH 222
           + +++ C  P C     S  +C + K  CPY   +Y     S++G LV D+L L+  ++ 
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKR-CPYYGLEYGLG--STTGQLVSDVLGLSKLNRI 190

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC-- 280
                     + GC      S +    P+G+ G G G  S+P   A+ GL + S+ +   
Sbjct: 191 P-------DFLFGC------SLVSNRQPEGIAGFGRGLASIP---AQLGLTKFSYCLVSH 234

Query: 281 -FDENDSGSVFFGDQG----PATQQSTSFLPIGEK------YDAYFVGVESYCIGNSCL- 328
            FD+          +G     A     ++ P  +        + Y++ +    +G   + 
Sbjct: 235 RFDDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVP 294

Query: 329 ---------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY 376
                     +     +VDSG++FTF+   I+  V  + +K ++  + + +    +    
Sbjct: 295 IPPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGP 354

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLTVM------- 424
           CYN + +  + VP +   F    +          P  + F+     V C+TV+       
Sbjct: 355 CYNITGQSEVDVPKLTFSFKGGAN-------MDLPLTDYFSLVTDGVVCMTVLTDPDEPG 407

Query: 425 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           ST G   I+G        I +D +  +  +   +C+ 
Sbjct: 408 STTGPAIILGNYQQQNFYIEYDLKKQRFGFKPQQCDR 444


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 152/373 (40%), Gaps = 46/373 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I +GTP     + +D GS++LW     +QCAP    Y+ S     + +DP  SS+ 
Sbjct: 58  YFIRISVGTPPRRMYLVMDTGSDILW-----LQCAPCVNCYHQS----DAIFDPYKSSTY 108

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             + CS   C +        + C Y  DY     ++  +  DD+   +  S       V 
Sbjct: 109 STLGCSTRQCLNLDIGTCQANKCLYQVDYGDGSFTTGEFGTDDV---SLNSTSGVGQVVL 165

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS--FSICF-----D 282
           + + +GCG    G ++  A    ++GLG G +S P+ +      QN   FS C      D
Sbjct: 166 NKIPLGCGHDNEGYFVGAAG---LLGLGKGPLSFPNQVDP----QNGGRFSYCLTDRETD 218

Query: 283 ENDSGSVFFGDQG--PATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQ- 334
             +  S+ FG+    PA  +   F P          Y++ +    +G + LT   S FQ 
Sbjct: 219 STEGSSLVFGEAAVPPAGAR---FTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQL 275

Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
                   ++DSG S T L    YA +   F    S    +   + +  CY+ S    + 
Sbjct: 276 DSLGNGGVIIDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVD 335

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           VP + L F       +    +  P +   T FCL    T G   IIG     G R+++D 
Sbjct: 336 VPTVTLHFQGGTDLKLPASNYLIPVDNSNT-FCLAFAGTTGP-SIIGNIQQQGFRVIYDN 393

Query: 448 ENLKLAWSHSKCE 460
            + ++ +  S+C 
Sbjct: 394 LHNQVGFVPSQCN 406


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/362 (25%), Positives = 143/362 (39%), Gaps = 48/362 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQC---APLSASYYTSLDRNLSEYDPSSSSSS 169
           +D GTP  S    +D GS++ W+PC QC  C   AP+              +DP+ SSS 
Sbjct: 119 VDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTAPI--------------FDPAKSSSY 164

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           K  +C    C+  S        C +   Y  + T   G L  D + L   S++ P  S  
Sbjct: 165 KPFACDSQPCQEISGNCGGNSKCQFEVSYG-DGTQVDGTLASDAITLG--SQYLPNFS-- 219

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICF--DENDS 286
                GC      S  +  +P   +    G        A  A L   +FS C       S
Sbjct: 220 ----FGCAE----SLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSS 271

Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT------QSGFQALV 337
           GS+  G +   +  S  F  + +       YFV +++  +GN+ ++       SG   ++
Sbjct: 272 GSLVLGKEAAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTII 331

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           DSG + T L    Y  +   F + +SS + +        CY+ SS   + VP + L   +
Sbjct: 332 DSGTTITHLVPSAYTALRDAFRQQLSSLQPT-PVEDMDTCYDLSSSS-VDVPTITLHLDR 389

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 457
           N   V+        +  G    CL   STD    IIG       RIVFD  N ++ ++  
Sbjct: 390 NVDLVLPKENILITQESGLA--CLAFSSTD-SRSIIGNVQQQNWRIVFDVPNSQVGFAQE 446

Query: 458 KC 459
           +C
Sbjct: 447 QC 448


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 94/418 (22%), Positives = 161/418 (38%), Gaps = 61/418 (14%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           +L++      TR K +  N ++    + P  G Q     N     +     +GTP  + L
Sbjct: 64  MLTSGAGPLTTRAKPKPKNRANPPVPIAP--GRQILSIPN-----YIARAGLGTPAQTLL 116

Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---K 180
           VA+D  ++  WVPC  C  CA  S S           + P+ SS+ + V C  P C    
Sbjct: 117 VAIDPSNDAAWVPCSACAGCAASSPS-----------FSPTQSSTYRTVPCGSPQCAQVP 165

Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
           S S    +   C +   Y+   ++    L  D L L        +++V  S   GC R  
Sbjct: 166 SPSCPAGVGSSCGFNLTYAA--STFQAVLGQDSLAL--------ENNVVVSYTFGCLRVV 215

Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSGSVFFGDQG- 295
           +G   +   P G++G G G +S   L        + FS C       N SG++  G  G 
Sbjct: 216 SG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQ 270

Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTF 345
           P   ++T  L    +   Y+V +    +G+  +            +G   ++D+G  FT 
Sbjct: 271 PKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 330

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-R 404
           L   +YA V   F   V +      G  +  CYN +    + VP +  +F+   +  +  
Sbjct: 331 LAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVT----VSVPTVTFMFAGAVAVTLPE 385

Query: 405 NHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            ++     + G     +    +DG      ++        R++FD  N ++ +S   C
Sbjct: 386 ENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 443


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 104/420 (24%), Positives = 156/420 (37%), Gaps = 53/420 (12%)

Query: 66  LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT-WIDIGTPNVSFL 124
            +   +  + R        S R  +      S   + G     L Y   + IGTP V   
Sbjct: 80  FAERLRSDRARADHILRKASGRRMMSEGGGASIPTYLGGFVDSLEYVVTLGIGTPAVQQT 139

Query: 125 VALDAGSNLLWVPCQCIQCAPLSAS-YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-- 181
           V +D GS+L WV     QC P +AS  Y   D     +DPS SS+   + C+   CK   
Sbjct: 140 VLIDTGSDLSWV-----QCKPCNASDCYPQKD---PLFDPSKSSTFATIPCASDACKQLP 191

Query: 182 --------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
                    ++   +   C Y  +Y      + G    + L L S       S+V  S  
Sbjct: 192 VDGYDNGCTNNTSGMPPQCGYAIEYG-NGAITEGVYSTETLALGS-------SAVVKSFR 243

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GCG  Q G Y      DG++GLG    S+ S  A   +   +FS C    +SG+ F   
Sbjct: 244 FGCGSDQHGPY---DKFDGLLGLGGAPESLVSQTAS--VYGGAFSYCLPPLNSGAGFLTL 298

Query: 294 QGP-ATQQSTS---FLPI----GEKYDAYFVGVESYCIGNSCL--TQSGFQA--LVDSGA 341
             P +T  S S   F P+     +    Y V +    +G   L    + F    +VDSG 
Sbjct: 299 GAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAKGNIVDSGT 358

Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQS 400
             T +PT  Y  +   F   ++   +    +S    CYN +    + VP + L F    +
Sbjct: 359 VITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVTVPKVALTFVGGAT 418

Query: 401 FVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             +        E+      CL    + DG +GIIG        +++D     L +    C
Sbjct: 419 VDLDVPSGVLVED------CLAFADAGDGSFGIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 94/418 (22%), Positives = 162/418 (38%), Gaps = 61/418 (14%)

Query: 65  LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           +L++      TR K +  N ++    + P  G Q     N     +     +GTP  + L
Sbjct: 45  MLTSGAGPLTTRAKPKPKNRANPPVPIAP--GRQILSIPN-----YIARAGLGTPAQTLL 97

Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---K 180
           VA+D  ++  WVPC  C  CA  S S           + P+ SS+ + V C  P C    
Sbjct: 98  VAIDPSNDAAWVPCSACAGCAASSPS-----------FSPTQSSTYRTVPCGSPQCAQVP 146

Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
           S S    +   C +   Y+   ++    L  D L L        +++V  S   GC R  
Sbjct: 147 SPSCPAGVGSSCGFNLTYAA--STFQAVLGQDSLAL--------ENNVVVSYTFGCLRVV 196

Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSGSVFFGDQG- 295
           +G+ +    P G++G G G +S   L        + FS C       N SG++  G  G 
Sbjct: 197 SGNSVP---PQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQ 251

Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTF 345
           P   ++T  L    +   Y+V +    +G+  +            +G   ++D+G  FT 
Sbjct: 252 PKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 311

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-R 404
           L   +YA V   F   V +      G  +  CYN +    + VP +  +F+   +  +  
Sbjct: 312 LAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVT----VSVPTVTFMFAGAVAVTLPE 366

Query: 405 NHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            ++     + G     +    +DG      ++        R++FD  N ++ +S   C
Sbjct: 367 ENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 94/371 (25%), Positives = 154/371 (41%), Gaps = 57/371 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +G+P V     +D GS+L+W      QC P    Y     +    ++P  S +   + 
Sbjct: 86  LTLGSPPVDIYGLVDTGSDLVWA-----QCTPCGGCY----RQKSPMFEPLRSKTYSPIP 136

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSV 232
           C    C       S +  C Y   YS  D+S + G L  + +   +FS       V   +
Sbjct: 137 CESEQCSFFGYSCSPQKMCAY--SYSYADSSVTKGVLAREAI---TFSSTDGDPVVVGDI 191

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNS--FSICF-----DEN 284
           I GCG   +G++ +       M         P SL+++ G +  S  FS C      D +
Sbjct: 192 IFGCGHSNSGTFNENDMGIIGM------GGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAH 245

Query: 285 DSGSVFFGDQGPATQQSTSFLPIG--EKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
            SG++ FG++   + +     P+   E   +Y V +E   +G      NS  T S    +
Sbjct: 246 TSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIM 305

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN---SWKYCYNASSEEMLKVPDMRL 393
           +DSG   T++P E Y  +V +    V S  + ++ +     + CY   SE  L+ P +  
Sbjct: 306 IDSGTPATYIPQEFYERLVEELK--VQSSLLPIEDDPDLGTQLCYR--SETNLEGPILTA 361

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQ----NFMMGHRIVFDRE 448
            F      ++    F  P  +G  VFC  +  STDGDY I G     N +MG    FD +
Sbjct: 362 HFEGADVQLLPIQTF-IPPKDG--VFCFAMAGSTDGDY-IFGNFAQSNILMG----FDLD 413

Query: 449 NLKLAWSHSKC 459
              +++  + C
Sbjct: 414 RKTISFKPTDC 424


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 90/379 (23%), Positives = 157/379 (41%), Gaps = 64/379 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IG P V +   +D GS+L+W  C+ C +C           D+    +DP  SSS   V
Sbjct: 3   LSIGNPAVKYSAIVDTGSDLIWTQCKPCTEC----------FDQPTPIFDPEKSSSYSKV 52

Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            CS  LC +  RS+C   KD C Y+  Y  + +S+ G L  +            ++S+ S
Sbjct: 53  GCSSGLCNALPRSNCNEDKDACEYLYTYG-DYSSTRGLLATETFTFED------ENSI-S 104

Query: 231 SVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DEND 285
            +  GCG +  G   DG +   G++GLG G +S+ S L      +  FS C     D   
Sbjct: 105 GIGFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKFSYCLTSIEDSEA 156

Query: 286 SGSVFFGD-------------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--Q 330
           S S+F G               G  T ++ S L   ++   Y++ ++   +G   L+  +
Sbjct: 157 SSSLFIGSLASGIVNKTGASLDGEVT-KTMSLLRNPDQPSFYYLELQGITVGAKRLSVEK 215

Query: 331 SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN-AS 381
           S F+         ++DSG + T+L    +  +  +F   +S             C+    
Sbjct: 216 STFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPD 275

Query: 382 SEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
           + + + VP M   F   +      N++ +   +    V CL + S++G   I G      
Sbjct: 276 AAKNIAVPKMIFHFKGADLELPGENYMVA---DSSTGVLCLAMGSSNG-MSIFGNVQQQN 331

Query: 441 HRIVFDRENLKLAWSHSKC 459
             ++ D E   +++  ++C
Sbjct: 332 FNVLHDLEKETVSFVPTEC 350


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 98/394 (24%), Positives = 166/394 (42%), Gaps = 58/394 (14%)

Query: 87  RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPL 146
           +NQLL  S  + T F     Y ++   + +GTP    +  +D GS+L+W   QC+ C P 
Sbjct: 42  KNQLLGASPYADTVFD----YSIYLMRLQLGTPPFEIVAEIDTGSDLIWT--QCMPC-PN 94

Query: 147 SASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSS 206
             + +  +      +DPS SS+ K   C               + CPY   Y+ E + S+
Sbjct: 95  CYTQFAPI------FDPSKSSTFKEKRCH-------------GNSCPYEIIYADE-SYST 134

Query: 207 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVP 264
           G L  + + + S S    +  V +   IGCG   +     G  A+  G++GL +G  S+ 
Sbjct: 135 GILATETVTIQSTSG---EPFVMAETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLI 191

Query: 265 SL--LAKAGLIQNSFSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGV 318
           S   L   GLI    S CF    +  + FG      G  T  +  F+   + +  Y++ +
Sbjct: 192 SQMDLPIPGLI----SYCFSSQGTSKINFGTNAVVAGDGTVAADMFIKKDQPF--YYLNL 245

Query: 319 ESYCIGNSCLTQSG--FQA-----LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
           ++  +G+  +   G  F A      +DSG ++T+LPT  Y  +V +           +  
Sbjct: 246 DAVSVGDKRIETLGTPFHAQDGNIFIDSGTTYTYLPTS-YCNLVREAVAASVVAANQVPD 304

Query: 372 NSWK--YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 429
            S +   CYN  + E+   P + L F+     V+  +        G T FCL +   D  
Sbjct: 305 PSSENLLCYNWDTMEIF--PVITLHFAGGADLVLDKYNMYVETITGGT-FCLAIGCVDPS 361

Query: 430 YGIIGQNFMMGHRIV-FDRENLKLAWSHSKCEEV 462
              I  N    + +V +D   L +++S + C  +
Sbjct: 362 MPAIFGNRAHNNLLVGYDSSTLVISFSPTNCSAL 395


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 83/314 (26%), Positives = 128/314 (40%), Gaps = 41/314 (13%)

Query: 119 PNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           P V   V LD+ S++ WV   PC    C P   S+Y          DPS S +S   SCS
Sbjct: 25  PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFY----------DPSRSPTSAAFSCS 74

Query: 176 HPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
            P C +     +    + C Y+  Y  + +S+SG  + D+L L +        +  S   
Sbjct: 75  SPTCTALGPYANGCANNQCQYLVRYP-DGSSTSGAYIADLLTLDA-------GNAVSGFK 126

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GC   + GS+   AA  G+M LG G  S+  L   A    N+FS C     S S FF  
Sbjct: 127 FGCSHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNAFSYCIPATASDSGFFTL 182

Query: 294 QGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA--LVDSGASFTF 345
             P    S    T  +   +    Y V + +  +G   L    + F A  ++DS  + T 
Sbjct: 183 GVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITR 242

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ------ 399
           LP   Y  +   F   ++  R +        CY+ +    +++P + L+F +N       
Sbjct: 243 LPPTAYQALRAAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDP 302

Query: 400 SFVVRNHIFSFPEN 413
           S ++ N   +F  N
Sbjct: 303 SGILFNDCLAFTSN 316


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 151/373 (40%), Gaps = 42/373 (11%)

Query: 102 FGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
            G+    L Y   + +GTP V+  V +D GS++ WV C      P  A       +  + 
Sbjct: 118 LGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHA-------QTGAL 170

Query: 161 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
           +DP+ SS+ + VSC+   C    +  + C +    C Y   Y  + ++++G    D L L
Sbjct: 171 FDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYG-DGSTTNGTYSRDTLTL 229

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
           +  S              GC   ++G + D    DG+MGLG G  S+ S  A A    NS
Sbjct: 230 SGASDAV------KGFQFGCSHLESG-FSD--QTDGLMGLGGGAQSLVSQTAAA--YGNS 278

Query: 277 FSICFDENDSGSVFFGDQGPATQQ----STSFLPIGEKYDAYFVGVESYCIGNS--CLTQ 330
           FS C     SGS  F   G         +T  L   +    Y   ++   +G     L+ 
Sbjct: 279 FSYCLPPT-SGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSP 337

Query: 331 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
           S F A  +VDSG   T LP   Y+ +   F   +   R +   +    C++ + +  + +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFD 446
           P + L+FS   +  +  +   +         CL   +T  DG  GIIG        +++D
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYGN-------CLAFAATGDDGTTGIIGNVQQRTFEVLYD 450

Query: 447 RENLKLAWSHSKC 459
             +  L +    C
Sbjct: 451 VGSSTLGFRSGAC 463


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 156/369 (42%), Gaps = 48/369 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I +GTP  S  +  D GS++ W     +QC+P    Y     +    ++PS SSS 
Sbjct: 81  YFARIGVGTPARSVYMVADTGSDVSW-----LQCSPCRKCY----RQQDPIFNPSLSSSF 131

Query: 170 KNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           K ++C+  +C K +    S K+ C Y   Y     +   +  + +    SF +HA +   
Sbjct: 132 KPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETL----SFGEHAVR--- 184

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-- 286
             SV +GCGR   G +   A    ++GLG G +S PS    +    + FS C    +S  
Sbjct: 185 --SVAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAI 237

Query: 287 -GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQ 334
             S+ FG    P   + T  LP       Y+VG+    +  S +          ++    
Sbjct: 238 AASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGG 297

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDM 391
            +VDSG + + L T  Y  +   F  LV   S+  ISL    +  CY+ SS +   +P +
Sbjct: 298 VIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSSMKTATLPAV 353

Query: 392 RLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
            L F    S  +  + I    ++EG   +CL     +  + IIG       RI  D +  
Sbjct: 354 VLDFDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKE 411

Query: 451 KLAWSHSKC 459
           ++  +  +C
Sbjct: 412 QMGIAPDQC 420


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 81/358 (22%), Positives = 141/358 (39%), Gaps = 34/358 (9%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV--- 172
           +GTP +S  +ALD GS++ W      QC P   S Y       +++DP  SSS KNV   
Sbjct: 51  LGTPKLSLSLALDTGSDITWT-----QCEPCVGSCYRQAQ---TKFDPRKSSSYKNVSCS 102

Query: 173 -SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
            S    +  S  +   +   C Y   Y  + + S G+   + L ++        S V S+
Sbjct: 103 SSSCRIITDSGGARGCVSSTCIYKVQYG-DGSYSVGFFATEKLTISP-------SDVISN 154

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGS 288
            + GCG++  G +   A   G+         +   L  +    N F+ C   F  + +G 
Sbjct: 155 FLFGCGQQNAGRFGRIAGLLGLG-----RGKLSLALQTSEKYNNLFTYCLPSFSSSSTGH 209

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----TQSGFQALVDSGASF 343
           +  G Q P + + T   P  +    Y + ++   +G   L       S   A++DSG   
Sbjct: 210 LTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVI 269

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
           T L   +Y+ +  KF +L+     +   +    CY+ S  E + VP +   F       +
Sbjct: 270 TRLQPTVYSALSSKFQQLMKDYPKTDGFSILDTCYDFSGNESISVPRISFFFKGGVEVDI 329

Query: 404 RNH-IFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           +   I +                 DGD+ + G +    + +V D    ++ ++ S C 
Sbjct: 330 KFFGILTVINAWDKVCLAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGCN 387


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 101/435 (23%), Positives = 168/435 (38%), Gaps = 68/435 (15%)

Query: 65  LLSNDWKRQKTRVK----LQSNNNSSR------NQLLFPSEGSQTHFFGNQFYWLHYTWI 114
           L+    +R K R      +++   S+R      +Q   P  G      G+  Y +    +
Sbjct: 50  LIRRAMQRSKARAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVD---L 106

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
            IGTP       LD GS+L+W   QC  CA       + L +    + P  S+S + + C
Sbjct: 107 AIGTPPQPVSALLDTGSDLIWT--QCAPCA-------SCLAQPDPLFAPGESASYEPMRC 157

Query: 175 SHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           +  LC       C+ + D C Y  +Y     +   Y  +      +F+       +   +
Sbjct: 158 AGQLCSDILHHGCE-MPDTCTYRYNYGDGTMTMGVYATERF----TFTSSGGDRLMTVPL 212

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG---SV 289
             GCG    GS  +G+   G++G G   +S+ S L+        FS C     SG   ++
Sbjct: 213 GFGCGSMNVGSLNNGS---GIVGFGRNPLSLVSQLSI-----RRFSYCLTSYGSGRKSTL 264

Query: 290 FFGD-----QGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------ 334
            FG       G AT   Q+T  L   +    Y+V +    +G   L   +S F       
Sbjct: 265 LFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGS 324

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY-------NASSEEM 385
              +VDSG + T LP  + AEVV  F + +     +        C+        +SS   
Sbjct: 325 GGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQ 384

Query: 386 LKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 444
           + VP M   F   +     RN++    ++      CL +  +  D   IG       R++
Sbjct: 385 VPVPRMVFHFQDADLDLPRRNYVL---DDHRKGRLCLLLADSGDDGSTIGNLVQQDMRVL 441

Query: 445 FDRENLKLAWSHSKC 459
           +D E   L+++ ++C
Sbjct: 442 YDLEAETLSFAPAQC 456


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 97/385 (25%), Positives = 157/385 (40%), Gaps = 68/385 (17%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           IGTP ++    LD GS+L+W  C   C +C P  A  Y           P+ S +  NVS
Sbjct: 106 IGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYA----------PARSVTYANVS 155

Query: 174 CSHPLCKSRSSCKSL-------------KDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           C   LC +  S +               +  C Y   YS  D SS+    D +L   +F+
Sbjct: 156 CGSRLCDALPSLRPSSRCSASASAPAPERGGCTYY--YSYGDGSST----DGVLATETFT 209

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
             A   +    +  GCG    G   + +   G++G+G G +   SL+++ G+ +  FS C
Sbjct: 210 FGA--GTTVHDLAFGCGTDNLGGTDNSS---GLVGMGRGPL---SLVSQLGVTK--FSYC 259

Query: 281 F----DENDSGSVFFGDQG---PATQQSTSFL--PIGEKYDA-YFVGVESYCIGNSC--- 327
           F    D   S  +F G      PA  +ST F+  P G +  + Y++ +E   +G++    
Sbjct: 260 FTPFNDTTTSSPLFLGSSASLSPAA-KSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPI 318

Query: 328 ------LTQSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
                 LT SG   L +DSG +FT L    +  +       V+    S        C+ A
Sbjct: 319 DPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAA 378

Query: 381 ---SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 437
                 E + VP + L F      + R+   +  E+    V CL ++S  G   ++G   
Sbjct: 379 PQGRGPEAVDVPRLVLHFDGADMELPRSS--AVVEDRVAGVACLGIVSARG-MSVLGSMQ 435

Query: 438 MMGHRIVFDRENLKLAWSHSKCEEV 462
                + +D     L++  + C E+
Sbjct: 436 QQNMHVRYDVGRDVLSFEPANCGEL 460


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 148/367 (40%), Gaps = 56/367 (15%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
            GTP  + L+ +D GS++ W     IQC P S   Y+ +D     ++P  SSS K++SC 
Sbjct: 144 FGTPAKNSLLIIDTGSDVTW-----IQCKPCS-DCYSQVD---PIFEPQQSSSYKHLSCL 194

Query: 176 HPLCKSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
              C   ++    +   C Y  +Y  + + S G    + L L S S          S   
Sbjct: 195 SSACTELTTMNHCRLGGCVYEINYG-DGSRSQGDFSQETLTLGSDSF--------PSFAF 245

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL-AKAGLIQNSFSIC---FDENDSGSVF 290
           GCG   TG +   A   G++GLG   +S PS   +K G     FS C   F  + S   F
Sbjct: 246 GCGHTNTGLFKGSA---GLLGLGRTALSFPSQTKSKYG---GQFSYCLPDFVSSTSTGSF 299

Query: 291 FGDQG--PATQQSTSFLPI--GEKYDA-YFVGVESYCIGN-------SCLTQSGFQALVD 338
              QG  PAT    +F+P+     Y + YFVG+    +G        + L + G   +VD
Sbjct: 300 SVGQGSIPAT---ATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGG--TIVD 354

Query: 339 SGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
           SG   T L  + Y  +   F      L S+K  S+       CY+ SS   +++P +   
Sbjct: 355 SGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSI----LDTCYDLSSYSQVRIPTITFH 410

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIGQNFMMGHRIVFDRENLKL 452
           F  N    V      F      +  CL   S        IIG       R+ FD    ++
Sbjct: 411 FQNNADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRI 470

Query: 453 AWSHSKC 459
            ++   C
Sbjct: 471 GFAPGSC 477


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 81/349 (23%), Positives = 136/349 (38%), Gaps = 42/349 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP   + V  D GS+  WV     QC P     Y   ++    +DP  SS+  NVS
Sbjct: 182 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQQEK---LFDPVRSSTYANVS 233

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C   +        C Y   Y  + + S G+   D L L+S+              
Sbjct: 234 CAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 285

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFFG 292
            GCG +  G + + A   G++GLG G  S+P     K G +   F+ C     +G+ +  
Sbjct: 286 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLD 339

Query: 293 DQGPATQQSTSFLPIGEKYDA----YFVGVESYCIGNSCLT--QSGFQ---ALVDSGASF 343
               +   +++ L      D     Y++G+    +G   L+  QS F     +VDSG   
Sbjct: 340 FGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVI 399

Query: 344 TFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           T LP   Y+ +   F   ++++       +SL       CY+ +    + +P + L+F  
Sbjct: 400 TRLPPPAYSSLRYAFAAAMAARGYKKAPAVSL----LDTCYDFTGMSQVAIPTVSLLFQG 455

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
                V      +  +              GD GI+G   +    + +D
Sbjct: 456 GARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYD 504


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 94/409 (22%), Positives = 162/409 (39%), Gaps = 84/409 (20%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNL-SEY 161
           Y  +   +  GTP+ +     D GS+L+W+PC     C  C       ++ LD  L   +
Sbjct: 87  YGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCD------FSGLDPTLIPRF 140

Query: 162 DPSSSSSSKNVSCSHPLCK----SRSSCKSLKDP----C-----PYIADYSTEDTSSSGY 208
            P +SSSSK + C  P C+        C+   DP    C     PYI  Y     S++G 
Sbjct: 141 IPKNSSSSKIIGCQSPKCQFLYGPNVQCRGC-DPNTRNCTVGCPPYILQYGLG--STAGV 197

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
           L+ + L     +            ++GC      S +    P G+ G G G VS+PS + 
Sbjct: 198 LITEKLDFPDLT--------VPDFVVGC------SIISTRQPAGIAGFGRGPVSLPSQMN 243

Query: 269 KAGLIQNSFSICFDEN--------DSGSVFFGDQGPATQQSTSFLPIGEK--------YD 312
                    S  FD+         D+GS   G    +     ++ P  +          +
Sbjct: 244 LKRFSHCLVSRRFDDTNVTTDLDLDTGS---GHNSGSKTPGLTYTPFRKNPNVSNKAFLE 300

Query: 313 AYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
            Y++ +    +G   +          T     ++VDSG++FTF+   ++  V  +F   +
Sbjct: 301 YYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM 360

Query: 363 S--SKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTV 418
           S  ++   L+  +    C+N S +  + VP++   F       +  ++ F+F  N     
Sbjct: 361 SNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNT--DT 418

Query: 419 FCLTVMS--TDGDYGIIGQNFMMG------HRIVFDRENLKLAWSHSKC 459
            CLTV+S  T    G  G   ++G      + + +D EN +  ++  KC
Sbjct: 419 VCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|289740593|gb|ADD19044.1| aspartyl protease [Glossina morsitans morsitans]
          Length = 394

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 157/369 (42%), Gaps = 69/369 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL-SEYDPSSSSS 168
           +Y  I IGTP+  F V  D GS+ LWVP +  QC      Y+T++   + ++YD + SSS
Sbjct: 75  YYGPISIGTPSQDFKVVFDTGSSNLWVPSK--QC------YFTNIACLMHNKYDANKSSS 126

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            K                  K+   +   Y +   S SGYL  D +++A       Q+  
Sbjct: 127 YK------------------KNGTEFAIHYGS--GSLSGYLSTDTVNIAGLGIEG-QTFA 165

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICF 281
           ++         + G    GA  DG++GLG   ++V  +      + + GLI Q  FS   
Sbjct: 166 EA-------LSEPGLVFIGAKFDGILGLGYSSIAVDGVKPPFYQMYEQGLISQPVFSFYL 218

Query: 282 DEN----DSGSVFFGDQGPATQQST-SFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQA 335
           + +    + G + FG   P   +   ++LP+  K  AY+ + ++S  +GN  L Q G Q 
Sbjct: 219 NRDPKAPEGGEIIFGGSDPNHYKGEFTYLPVTRK--AYWQIKMDSASMGNLNLCQGGCQV 276

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           + D+G S   LP           +K +    I + G      Y  + E + K+P +R + 
Sbjct: 277 IADTGTSLIALP----PSEATSINKAIGGTPI-MGGQ-----YMVACENIPKLPVIRFVL 326

Query: 396 -SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNFMMGHRIVFDREN 449
             K      +++I    +  G T+     M  D     G   I+G  F+  +   FD  N
Sbjct: 327 GGKTFELEGKDYILRIAQ-MGKTICLSGFMGIDIPPPNGPIWILGDVFIGKYYTEFDMGN 385

Query: 450 LKLAWSHSK 458
            ++ ++ +K
Sbjct: 386 DRVGFAEAK 394


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 87/353 (24%), Positives = 141/353 (39%), Gaps = 45/353 (12%)

Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-- 181
           + LD GS++ WV CQ C  C       Y   D     +DPS S+S   VSC    C+   
Sbjct: 1   MVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSASYAAVSCDSQRCRDLD 50

Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
            ++C++    C Y   Y  + + + G    + L L         S+   +V IGCG    
Sbjct: 51  TAACRNATGACLYEVAYG-DGSYTVGDFATETLTLG-------DSTPVGNVAIGCGHDNE 102

Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS---GSVFFGDQGPAT 298
           G ++  A    + G  L   S PS ++      ++FS C  + DS    ++ FGD     
Sbjct: 103 GLFVGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDSPAASTLQFGDGAAEA 154

Query: 299 QQSTSFLPIGEKYDA-YFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTFL 346
              T+ L    +    Y+V +    +G   L           T      +VDSG + T L
Sbjct: 155 GTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRL 214

Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH 406
            +  YA +   F +   S   +   + +  CY+ S    ++VP + L F    +  +   
Sbjct: 215 QSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAK 274

Query: 407 IFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            +  P  +G   +CL    T+    IIG     G R+ FD     + ++ +KC
Sbjct: 275 NYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 98/382 (25%), Positives = 154/382 (40%), Gaps = 71/382 (18%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP V F+   D GS+L W  C+ C  C            ++   YD ++SSS   +
Sbjct: 87  LAIGTPPVPFIALADTGSDLTWTQCKPCKLC----------FGQDTPIYDTTTSSSFSPL 136

Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
            CS   C     S C +    C Y             Y  DD     ++S      SV  
Sbjct: 137 PCSSATCLPIWSSRCSTPSATCRYR------------YAYDD----GAYSPECAGISV-G 179

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDENDS 286
            +  GCG    G   +     G +GLG G +   SL+A+ G+    FS C    F+ + S
Sbjct: 180 GIAFGCGVDNGGLSYNST---GTVGLGRGSL---SLVAQLGV--GKFSYCLTDFFNTSLS 231

Query: 287 GSVFFGDQGPATQ----------QSTSFLPIGEKYDAYFVGVESYCIGNSCLT------- 329
             VFFG                 QST  +        Y+V +E   +G++ L        
Sbjct: 232 SPVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFD 291

Query: 330 ----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--- 382
                     +VDSG  FT L  E    VVV     V  + +    +  + C+ A +   
Sbjct: 292 LNDDDGSGGMIVDSGTIFTIL-VETGFRVVVDHVAGVLGQPVVNASSLDRPCFPAPAAGV 350

Query: 383 EEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
           +E+  +PDM L F+      + R++  SF E E  + FCL ++ T+   G +  NF   +
Sbjct: 351 QELPDMPDMVLHFAGGADMRLHRDNYMSFNEEE--SSFCLNIVGTESASGSVLGNFQQQN 408

Query: 442 -RIVFDRENLKLAWSHSKCEEV 462
            +++FD    +L++  + C ++
Sbjct: 409 IQMLFDITVGQLSFMPTDCSKL 430


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 99/409 (24%), Positives = 157/409 (38%), Gaps = 93/409 (22%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEYDPSSSSS 168
           +++GTP  +    LD GS+L+W PC     C  C       + ++D   +  + P +SS+
Sbjct: 92  LNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCN------FPNIDPTKIPTFIPKNSST 145

Query: 169 SKNVSCSHPLC------KSRSSCKSLKDP--------CP-YIADYSTEDTSSSGYLVDDI 213
           +K + C +P C         S C   K P        CP YI  Y    T  +G+L+ D 
Sbjct: 146 AKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGAT--AGFLLLDN 203

Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
           L+     K  PQ       ++GC      S L    P G+ G G G  S+PS +      
Sbjct: 204 LNFP--GKTVPQ------FLVGC------SILSIRQPSGIAGFGRGQESLPSQMN----- 244

Query: 274 QNSFSIC-----FDENDSGS---VFFGDQGPATQQSTSFLPIGEK-------YDAYFVGV 318
              FS C     FD+    S   +     G       S+ P            + Y+V +
Sbjct: 245 LKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTL 304

Query: 319 ESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
               +G   +          +      +VDSG++FTF+   +Y  V  +F + +  K+ S
Sbjct: 305 RKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQL-GKKYS 363

Query: 369 LQGN-----SWKYCYNASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEGFTVFCL 421
            + N         C+N S  + +  P+    F      S  + N+ FSF  +    V C 
Sbjct: 364 REENVEAQSGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLNY-FSFVGDA--EVLCF 420

Query: 422 TVMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           TV+S DG  G         I+G        + +D EN +  +    C+ 
Sbjct: 421 TVVS-DGGAGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNCKR 468


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 93/376 (24%), Positives = 147/376 (39%), Gaps = 48/376 (12%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
           GN  Y +    I  G+P     V +D GS+L+W   QC+ C   +A+           +D
Sbjct: 76  GNGEYLID---ISFGSPPQKASVIVDTGSDLIWT--QCLPCETCNAAASVI-------FD 123

Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED-TSSSGYLVDDILHLASFSK 221
           P  SS+   VSC+   C S    +S    C Y  DY   D +S+SG L        S   
Sbjct: 124 PVKSSTYDTVSCASNFCSSL-PFQSCTTSCKY--DYMYGDGSSTSGAL--------STET 172

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN-SFSIC 280
               +    +V  GCG    GS+   A   G++GLG G +   SL+++A  I +  FS C
Sbjct: 173 VTVGTGTIPNVAFGCGHTNLGSF---AGAAGIVGLGQGPL---SLISQASSITSKKFSYC 226

Query: 281 ---FDENDSGSVFFGDQGPATQQSTSFLPIGEK----YDAYFVGVE------SYCIGNSC 327
                   +  +  GD   A   + + L         Y A   G+       +Y +G   
Sbjct: 227 LVPLGSTKTSPMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFS 286

Query: 328 LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
           +  SG    + DSG + T+L T  +  +V      V             YC++ +     
Sbjct: 287 IDASGQGGFILDSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVANP 346

Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
             P M   F      +   ++F   +  G    CL + ++ G + I+G      H IV D
Sbjct: 347 TYPTMTFHFKGADYELPPENVFVALDTGG--SICLAMAASTG-FSIMGNIQQQNHLIVHD 403

Query: 447 RENLKLAWSHSKCEEV 462
             N ++ +  + CE +
Sbjct: 404 LVNQRVGFKEANCETI 419


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 108/448 (24%), Positives = 172/448 (38%), Gaps = 83/448 (18%)

Query: 34  DEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFP 93
           DEA+ RWI                       + S+D + ++ R  LQ+   SS   L   
Sbjct: 4   DEARLRWIHHR--------------------IQSSDHRHRRGRSLLQTAQVSSGLSL--- 40

Query: 94  SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTS 153
             GS  +F            + IG+P  S+ + LD GS++ W     IQCAP S S Y+ 
Sbjct: 41  --GSGEYF----------ARMGIGSPQRSYYLELDTGSDVTW-----IQCAPCS-SCYSQ 82

Query: 154 LDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
           +D     YDPS+SSS + V C   LC++   S+C+ +   C Y   Y     SS      
Sbjct: 83  VD---PIYDPSNSSSYRRVYCGSALCQALDYSACQGMG--CSYRVVYGDSSASSGD---- 133

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-A 270
             L + SF      S+   ++  GCG   +G +   A   G+           S  ++ A
Sbjct: 134 --LGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMG------GGTLSFFSQIA 185

Query: 271 GLIQNSFSICFD------ENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCI 323
             I  +FS C        ++ S  + FG    P   + T  L        Y+  +    +
Sbjct: 186 ASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDTFYYAILTGISV 245

Query: 324 GNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS 373
           G + L     Q          A++DSG S T +    YA  V++     +S+ +      
Sbjct: 246 GGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYA--VLRDAYRAASRNLPPAPGV 303

Query: 374 W--KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 431
           +    C+N      +++P + L F  +   V+       P +   T FCL    +     
Sbjct: 304 YLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGT-FCLAFAPSSMPIS 362

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +IG       RI FD +   +A +  +C
Sbjct: 363 VIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 89/377 (23%), Positives = 147/377 (38%), Gaps = 63/377 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           +  GTP  SF   LD GSN+ W+PC  C  C+                ++PS SS+   +
Sbjct: 128 LGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCS-----------SKQQPFEPSKSSTYNYL 176

Query: 173 SCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           +C+   C+    C    +   C     Y  +        VD+IL   + S  + Q     
Sbjct: 177 TCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSE------VDEILSSETLSVGSQQV---E 227

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDENDS 286
           + + GC     G  L    P  ++G G   +S  S    A L  ++FS C    F    +
Sbjct: 228 NFVFGCSNAARG--LIQRTPS-LVGFGRNPLSFVS--QTATLYDSTFSYCLPSLFSSAFT 282

Query: 287 GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT----------QSGF 333
           GS+  G +   + Q   F P+    +Y + Y+VG+    +G   ++           +G 
Sbjct: 283 GSLLLGKEA-LSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGR 341

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
             ++DSG   T L    Y  +   F   +S+  ++   + +  CYN  S ++ + P + L
Sbjct: 342 GTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSGDV-EFPLITL 400

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLT----------VMSTDGDYGIIGQNFMMGHRI 443
            F  N    +      +P N+  +V CL           V+ST G+Y           RI
Sbjct: 401 HFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQ------QQKLRI 454

Query: 444 VFDRENLKLAWSHSKCE 460
           V D    +L  +   C+
Sbjct: 455 VHDVAESRLGIASENCD 471


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 86/374 (22%), Positives = 155/374 (41%), Gaps = 52/374 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           I IG P V  L   D GS+L+WV CQ C  C            +N   +DP  SSS +NV
Sbjct: 97  ISIGNPQVEILAIADTGSDLIWVQCQPCEMC----------YKQNSPIFDPRRSSSYRNV 146

Query: 173 SCSHPLC-----KSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
            C +  C     ++RS   +     C Y   Y  + + S G+L  +   + S + +   +
Sbjct: 147 LCGNEFCNKLDGEARSCDARGFVKTCGYTYSYG-DQSFSDGHLAIERFGIGSTNSNTSAA 205

Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICF--- 281
            +    V  GCG K  G++      +   G+        SL+++ G  +   FS C    
Sbjct: 206 IAYFQEVAFGCGTKNGGTF-----DELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPT 260

Query: 282 --DENDSGSVFFGDQGPATQQ-----STSFLPIG-EKYDAYFVGVESYCIGNSCLTQSGF 333
               N +  + FG+    +       ST  LP   E Y  Y++ +E+  + N  L  +  
Sbjct: 261 SEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETY--YYLTLEAISVENKRLPYTNL 318

Query: 334 --------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
                     ++DSG + TFL +E +  +    ++ V  +R+S     +  C+    E+ 
Sbjct: 319 WNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNICF--KDEKA 376

Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 445
           +++P +   F+     +   + F+  E +   + C T++ ++ D  I G    M   + +
Sbjct: 377 IELPIITAHFTGADVELQPVNTFAKVEED---LLCFTMIPSN-DIAIFGNLAQMNFLVGY 432

Query: 446 DRENLKLAWSHSKC 459
           D E   +++  + C
Sbjct: 433 DLEKKAVSFLPTDC 446


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 100/397 (25%), Positives = 164/397 (41%), Gaps = 87/397 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL-SEYDPSSSSSSKNV 172
           + +G+P  +  + LD GS L W+ C+    AP           NL S +DP  SSS   +
Sbjct: 67  LTVGSPPQTVTMVLDTGSELSWLHCKK---AP-----------NLHSVFDPLRSSSYSPI 112

Query: 173 SCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
            C+ P C++R+       SC   K  C  I  Y+ + +S  G L  D  H+         
Sbjct: 113 PCTSPTCRTRTRDFSIPVSCDK-KKLCHAIISYA-DASSIEGNLASDTFHIG-------- 162

Query: 226 SSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           +S   + I GC      S  D  +   G++G+  G +   S + + GL    FS C    
Sbjct: 163 NSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSL---SFVTQMGL--QKFSYCISGQ 217

Query: 285 D-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
           D SG + FG+            P  Q ST  LP  ++  AY V +E   + NS L     
Sbjct: 218 DSSGILLFGESSFSWLKALKYTPLVQISTP-LPYFDRV-AYTVQLEGIKVANSMLQLPKS 275

Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-------KLVSSKRISLQGNSWK 375
                 T +G Q +VDSG  FTFL   +Y  +  +F        K++       QG +  
Sbjct: 276 VYAPDHTGAG-QTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQG-AMD 333

Query: 376 YCYNA--SSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPE--NEGFTVFCLTVMSTDGDY 430
            CY    +   +  +P + L+F     S      ++  P       +V+C T     G+ 
Sbjct: 334 LCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTF----GNS 389

Query: 431 GIIG-QNFMMGHR------IVFDRENLKLAWSHSKCE 460
            ++G +++++GH       + FD    ++ ++  +C+
Sbjct: 390 ELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRCD 426


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 77/301 (25%), Positives = 119/301 (39%), Gaps = 52/301 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           +  GTP   F + LD GS++ W  C+ C+ C   S  ++ SL  +   +     S+  N 
Sbjct: 131 VAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSCIPSTVGNT 190

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
                                Y   Y  + TS   Y  D +            S V    
Sbjct: 191 ---------------------YNMTYGDKSTSVGNYGCDTMT--------LEPSDVFQKF 221

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSVFF 291
             GCGR   G +  G+  DG++GLG G +S  S  A     +  FS C  +EN  GS+ F
Sbjct: 222 QFGCGRNNEGDF--GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEENSIGSLLF 277

Query: 292 GDQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVD 338
           G++  +   S  F  +         E+   YFV +    +GN  L    S F +   ++D
Sbjct: 278 GEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIID 337

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLI 394
           SG   T LP   Y+ +   F K ++   +S     + +    CYN S  + + +P+  L 
Sbjct: 338 SGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLH 397

Query: 395 F 395
           F
Sbjct: 398 F 398


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 152/373 (40%), Gaps = 42/373 (11%)

Query: 102 FGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
            G+    L Y   + +GTP V+  V +D GS++ WV C      P  A       +  + 
Sbjct: 118 LGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYA-------QTGAL 170

Query: 161 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
           +DP+ SS+ + VSC+   C    +  + C +    C Y   Y  + ++++G    D L L
Sbjct: 171 FDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYG-DGSTTNGTYSRDTLTL 229

Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
           +  S              GC   ++G + D    DG+MGLG G  S+ S  A A    NS
Sbjct: 230 SGASDAV------KGFQFGCSHVESG-FSD--QTDGLMGLGGGAQSLVSQTAAA--YGNS 278

Query: 277 FSICFDENDSGS----VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQ 330
           FS C     SGS       G  G +   +T  L   +    Y   ++   +G     L+ 
Sbjct: 279 FSYCLPPT-SGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSP 337

Query: 331 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
           S F A  +VDSG   T LP   Y+ +   F   +   R +   +    C++ + +  + +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFD 446
           P + L+FS   +  +  +   +         CL   +T  DG  GIIG        +++D
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYGN-------CLAFAATGDDGTTGIIGNVQQRTFEVLYD 450

Query: 447 RENLKLAWSHSKC 459
             +  L +    C
Sbjct: 451 VGSSTLGFRSGAC 463


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 156/369 (42%), Gaps = 48/369 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  I +GTP  S  +  D GS++ W     +QC+P    Y     +    ++PS SSS 
Sbjct: 14  YFARIGVGTPARSVYMVADTGSDVSW-----LQCSPCRKCY----RQQDPIFNPSLSSSF 64

Query: 170 KNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           K ++C+  +C K +    S K+ C Y   Y     +   +  + +    SF +HA +   
Sbjct: 65  KPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETL----SFGEHAVR--- 117

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-- 286
             SV +GCGR   G +   A    ++GLG G +S PS    +    + FS C    +S  
Sbjct: 118 --SVAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAI 170

Query: 287 -GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQ 334
             S+ FG    P   + T  LP       Y+VG+    +  S +          ++    
Sbjct: 171 AASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGG 230

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDM 391
            +VDSG + + L T  Y  +   F  LV   S+  ISL    +  CY+ SS +   +P +
Sbjct: 231 VIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSSMKTATLPAV 286

Query: 392 RLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
            L F    S  +  + I    ++EG   +CL     +  + IIG       RI  D +  
Sbjct: 287 VLDFDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKE 344

Query: 451 KLAWSHSKC 459
           ++  +  +C
Sbjct: 345 QMGIAPDQC 353


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 156/393 (39%), Gaps = 85/393 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP     + LD GS L W+  QC    P +AS+           DPS SSS   + 
Sbjct: 92  LPIGTPPQPQQMVLDTGSQLSWI--QCHNKTPPTASF-----------DPSLSSSFYVLP 138

Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           C+HPLCK R    +L   C      + + +  + T + G LV + L  +        S  
Sbjct: 139 CTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSP-------SQT 191

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--- 285
              +I+GC  +   +        G++G+ LG +S P   AK       FS C        
Sbjct: 192 TPPLILGCSSESRDA-------RGILGMNLGRLSFP-FQAKV----TKFSYCVPTRQPAN 239

Query: 286 -----SGSVFFGDQG-------------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
                +GS + G+               P +Q+  +  P+     AY V ++   IG   
Sbjct: 240 NNNFPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPL-----AYTVPMQGIRIGGRK 294

Query: 328 LT-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSW 374
           L             SG Q +VDSG+ FTFL    Y  V  +  +++    K+  + G   
Sbjct: 295 LNIPPSVFRPNAGGSG-QTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVA 353

Query: 375 KYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTD---GD 429
             C++ ++ E+ + + D+   F K    VV +  + +   + G  V C+ +  ++     
Sbjct: 354 DMCFDGNAMEIGRLLGDVAFEFEKGVEIVVPKERVLA---DVGGGVHCVGIGRSERLGAA 410

Query: 430 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
             IIG        + FD  N ++ +  + C  +
Sbjct: 411 SNIIGNFHQQNLWVEFDLANRRIGFGVADCSRL 443


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 92/388 (23%), Positives = 154/388 (39%), Gaps = 66/388 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP  +  + LD GS L W     + CAP  A    S       + P +SS+   V 
Sbjct: 89  LAVGTPPQNVTMVLDTGSELSW-----LLCAPAGARNKFSA----MSFRPRASSTFAAVP 139

Query: 174 CSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           C+   C+SR      +C      C     Y+ + +SS G L  D+  + S          
Sbjct: 140 CASAQCRSRDLPSPPACDGASSRCSVSLSYA-DGSSSDGALATDVFAVGS--------GP 190

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSG 287
                 GC      S  DG A  G++G+  G +S  S  +        FS C  D +D+G
Sbjct: 191 PLRAAFGCMSSAFDSSPDGVASAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAG 245

Query: 288 SVFFGDQGPATQQSTSFLPIGEK------YD--AYFVGVESYCIGNSCL----------- 328
            +  G     T    ++ P+ +       +D  AY V +    +G   L           
Sbjct: 246 VLLLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDH 305

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------CYN--- 379
           T +G Q +VDSG  FTFL  + Y+ +  +F +       +L   S+ +      C+    
Sbjct: 306 TGAG-QTMVDSGTQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQ 364

Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFP--ENEGFTVFCLTVMSTD----GDYGI 432
             S    ++P + L+F+  +  V  +  ++  P     G  V+CLT  + D      Y +
Sbjct: 365 GRSPPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAY-V 423

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           IG +  M   + +D E  ++  +  +C+
Sbjct: 424 IGHHHQMNVWVEYDLERGRVGLAPVRCD 451


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 106/439 (24%), Positives = 163/439 (37%), Gaps = 64/439 (14%)

Query: 53  WPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT 112
           WP  N +  +E ++  D KR     + +      +  L     GS   +   Q++    T
Sbjct: 42  WP--NPLSRIEDIIGADQKRHSLISRKRKFKGGVKMDL-----GSGIDYGTAQYF----T 90

Query: 113 WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
            + +GTP   F V +D GS L WV C+                +N   +    S S K V
Sbjct: 91  EVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKV-------KNRRVFRAEESKSFKTV 143

Query: 173 SCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHAP 224
            C    CK       S S+C +   PC Y  DY   D S++ G    + + +   +    
Sbjct: 144 GCFTQTCKVDLMNLFSLSTCPTPSTPCSY--DYRYADGSAAQGVFAKETITVGLTNGRKA 201

Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
           +      +++GC    +G    GA  DGV+GL   D S  S      L     S C    
Sbjct: 202 R---LRGLLVGCSSSFSGQSFQGA--DGVLGLAFSDFSFTS--TATSLFGAKLSYCLVDH 254

Query: 282 --DENDSGSVFFG--------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 328
             ++N S  + FG           P          I   Y    +G+    IG+  L   
Sbjct: 255 LSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGIS---IGDDMLDIP 311

Query: 329 TQ-----SGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASS 382
           TQ     +G   ++DSG S T L    Y  VV    + LV  KR+  +G   +YC++++S
Sbjct: 312 TQVWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTS 371

Query: 383 E-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMG 440
                K+P +         F    H  S+  +    V CL  MS       ++G      
Sbjct: 372 GFNESKLPQLTFHLKGGARF--EPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQN 429

Query: 441 HRIVFDRENLKLAWSHSKC 459
           +   FD     L+++ S C
Sbjct: 430 YLWEFDLMASTLSFAPSTC 448


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 96/335 (28%), Positives = 134/335 (40%), Gaps = 47/335 (14%)

Query: 119 PNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           P V  L+ LD  S++ WV   PC   QC       Y   D     YDPS S SS++ +CS
Sbjct: 178 PGVRQLMLLDTASDVAWVQCFPCPASQC-------YAQTD---VLYDPSKSRSSESFACS 227

Query: 176 HPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            P C+         SS  +    C Y   Y  + +++SG LV D L L      +P S V
Sbjct: 228 SPTCRQLGPYANGCSSSSNSAGQCQYRVRYP-DGSTTSGTLVADQLSL------SPTSQV 280

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDENDSG 287
                 GC     GS+   +   G+M LG G  S+ S  + K G +   FS CF    S 
Sbjct: 281 -PKFEFGCSHAARGSF-SRSKTAGIMALGRGVQSLVSQTSTKYGQV---FSYCFPPTASH 335

Query: 288 SVFFGDQGPATQQST-SFLPIGEKYDAYFVGVESYCIGNSCL----TQSGFQALVDSGAS 342
             FF    P    S  +  P+ +    Y V +E+  +    L    T     A +DS   
Sbjct: 336 KGFFVLGVPRRSSSRYAVTPMLKTPMLYQVRLEAIAVAGQRLDVPPTVFAAGAALDSRTV 395

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
            T LP   Y  +   F   +S  R +        CY+ +    + +P + L+F +  + V
Sbjct: 396 ITRLPPTAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGV 455

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDGD---YGIIG 434
             +     P    F   CL   ST GD    GIIG
Sbjct: 456 QLD-----PSGVLFGS-CLAFASTAGDDRATGIIG 484


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 101/425 (23%), Positives = 179/425 (42%), Gaps = 76/425 (17%)

Query: 71  KRQKTRVK------LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
           KR K+R++      L ++   S +QL  P         GN  Y +    + IGTP VS+ 
Sbjct: 72  KRGKSRLQRLNAMVLAASTLDSEDQLEAPIHA------GNGEYLME---LAIGTPPVSYP 122

Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
             LD GS+L+W  C+ C QC            +    +DP  SSS   VSC   LC +  
Sbjct: 123 AVLDTGSDLIWTQCKPCTQC----------YKQPTPIFDPKKSSSFSKVSCGSSLCSAVP 172

Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
           S  +  D C Y+  Y  + + + G L  +     +F K   + SV  ++  GCG    G 
Sbjct: 173 S-STCSDGCEYVYSYG-DYSMTQGVLATETF---TFGKSKNKVSVH-NIGFGCGEDNEGD 226

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQ 300
             + A+  G++GLG G +S+ S L      +  FS C    D+     +  G  G     
Sbjct: 227 GFEQAS--GLVGLGRGPLSLVSQLK-----EPRFSYCLTPMDDTKESILLLGSLGKVKDA 279

Query: 301 ----STSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFL 346
               +T  L    +   Y++ +E   +G++ L+  +S F+         ++DSG + T++
Sbjct: 280 KEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYI 339

Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYN-ASSEEMLKVPDMRLIFSKNQ-SF 401
             + +  +  +F   +S  ++ L   S      C++  S    +++P +   F       
Sbjct: 340 EQKAFEALKKEF---ISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGDLEL 396

Query: 402 VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGHRIVFDRENLKLAWSHS 457
              N++     +    V CL + ++ G   I G    QN ++ H    D E   +++  +
Sbjct: 397 PAENYMIG---DSNLGVACLAMGASSG-MSIFGNVQQQNILVNH----DLEKETISFVPT 448

Query: 458 KCEEV 462
            C+++
Sbjct: 449 SCDQL 453


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 86/381 (22%), Positives = 156/381 (40%), Gaps = 42/381 (11%)

Query: 109 LHY--TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           LH+    +D+   N  F V  DAG++++ +         +  ++       L  +D S+S
Sbjct: 123 LHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHALPYFDRSTS 182

Query: 167 SSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
           S+    SC   LC+    +SC + K      C Y   Y+ +  ++       +L +  F+
Sbjct: 183 STLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTG------LLEVDKFT 236

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
             A  S     V  GCG    G +       G+ G G G +S+PS L K G    +FS C
Sbjct: 237 FGAGASV--PGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHC 287

Query: 281 FDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS----- 326
           F   +     +V           G    QST  +        Y++ ++   +G++     
Sbjct: 288 FTAVNGLKQSTVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVP 347

Query: 327 ----CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
                LT      ++DSG S T LP ++Y  V  +F   +    +         C++A S
Sbjct: 348 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPS 407

Query: 383 EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
           +    VP + L F      + R N++F  P++ G ++ CL +     +   IG       
Sbjct: 408 QAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIGNFQQQNM 467

Query: 442 RIVFDRENLKLAWSHSKCEEV 462
            +++D +N  L++  ++C+++
Sbjct: 468 HVLYDLQNNMLSFVAAQCDKL 488



 Score = 47.0 bits (110), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 54/124 (43%), Gaps = 6/124 (4%)

Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
           LT      ++DSG S T LP ++Y  V  +F   +    +         C++A S+    
Sbjct: 58  LTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPD 117

Query: 388 VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM---MGHRI 443
           VP + L F      + R N++F  P++ G ++ CL +    GD   I  NF    M    
Sbjct: 118 VPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAI--NKGDETTIIGNFQQQNMHALP 175

Query: 444 VFDR 447
            FDR
Sbjct: 176 YFDR 179


>gi|150866171|ref|XP_001385673.2| aspartic proteinase precursor [Scheffersomyces stipitis CBS 6054]
 gi|149387427|gb|ABN67644.2| aspartic proteinase precursor [Scheffersomyces stipitis CBS 6054]
          Length = 417

 Score = 68.6 bits (166), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 96/367 (26%), Positives = 157/367 (42%), Gaps = 70/367 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP   F V LD GS+ LWVP Q  +C+ L+   +T       +YD  SSS+ 
Sbjct: 103 YFTEISLGTPAQQFKVILDTGSSNLWVPSQ--ECSSLACFLHT-------KYDHDSSST- 152

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
                     K+  S  S++     +  Y ++DT + G LV      A       +++ +
Sbjct: 153 ---------YKANGSEFSIQYGSGAMEGYVSQDTLAIGDLVIPKQDFA-------EATSE 196

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL-------LAKAGLIQNSFSICF- 281
             +    G+            DG++GL    +SV  +       LA+  L +  F+    
Sbjct: 197 PGLAFAFGKF-----------DGILGLAYNTISVNKIVPPVYNALAQGLLDEPQFAFYLG 245

Query: 282 ----DENDSGSVFFG--DQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQ 334
               DEND G   FG  D+   T + T +LP+  K  AY+ V  E   +G+         
Sbjct: 246 DTKKDENDGGLATFGGYDESAFTGKIT-WLPVRRK--AYWEVSFEGIGLGDEYAELDNTG 302

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
           A +D+G S   LP+ + AE++    K+ ++K       SW   Y    E+   +PD+ L 
Sbjct: 303 AAIDTGTSLITLPSSL-AEIINA--KIGATK-------SWSGQYQIDCEKQDTLPDLTLN 352

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLK 451
           F+   +F +  H +   E  G  +   T M      GD  IIG  F+  +  ++D +   
Sbjct: 353 FA-GYNFTLTAHDYIL-EVGGSCISVFTPMDFPKPIGDLAIIGDAFLRRYYSIYDLKKDA 410

Query: 452 LAWSHSK 458
           +  + SK
Sbjct: 411 VGLATSK 417


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score = 68.6 bits (166), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 163/396 (41%), Gaps = 87/396 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL-SEYDPSSSSSSKNV 172
           + +G+P  +  + LD GS L W+ C+    AP           NL S +DP  SSS   +
Sbjct: 60  LTVGSPPQTVTMVLDTGSELSWLHCKK---AP-----------NLHSVFDPLRSSSYSPI 105

Query: 173 SCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
            C+ P C++R+       SC   K  C  I  Y+ + +S  G L  D  H+         
Sbjct: 106 PCTSPTCRTRTRDFSIPVSCDK-KKLCHAIISYA-DASSIEGNLASDTFHIG-------- 155

Query: 226 SSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           +S   + I GC      S  D  +   G++G+  G +   S + + GL    FS C    
Sbjct: 156 NSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSL---SFVTQMGL--QKFSYCISGQ 210

Query: 285 D-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
           D SG + FG+            P  Q ST  LP  ++  AY V +E   + NS L     
Sbjct: 211 DSSGILLFGESSFSWLKALKYTPLVQISTP-LPYFDRV-AYTVQLEGIKVANSMLQLPKS 268

Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-------KLVSSKRISLQGNSWK 375
                 T +G Q +VDSG  FTFL   +Y  +  +F        K++       QG +  
Sbjct: 269 VYAPDHTGAG-QTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQG-AMD 326

Query: 376 YCYNA--SSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPE--NEGFTVFCLTVMSTDGDY 430
            CY    +   +  +P + L+F     S      ++  P       +V+C T     G+ 
Sbjct: 327 LCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTF----GNS 382

Query: 431 GIIG-QNFMMGHR------IVFDRENLKLAWSHSKC 459
            ++G +++++GH       + FD    ++ ++  +C
Sbjct: 383 ELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 418


>gi|389747274|gb|EIM88453.1| Asp-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 416

 Score = 68.6 bits (166), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 148/362 (40%), Gaps = 69/362 (19%)

Query: 99  THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL 158
           T+F   Q+Y    T IDIGTP  +F V LD GS+ LWVP    QC  ++   +T      
Sbjct: 98  TNFMNAQYY----TEIDIGTPPQTFKVILDTGSSNLWVPSS--QCTSIACFLHT------ 145

Query: 159 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
            +YD S+SSS K       +     S +                    G++ +D +    
Sbjct: 146 -KYDSSASSSYKANGTEFSIQYGSGSME--------------------GFVSNDDIVFGD 184

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGL 272
            S         SSV      K+ G        DG++GL    ++V  +      L   G+
Sbjct: 185 MS--------LSSVDFAEATKEPGLAFAFGKFDGILGLAYDTIAVNHITPVFYELVNQGI 236

Query: 273 IQN---SFSICFDENDSGSVFFGDQGP-ATQQSTSFLPIGEKYDAYF-VGVESYCIGNSC 327
           I     SF +   E+D G   FG   P A      + P+  K  AY+ V +E    G+  
Sbjct: 237 ISEPVFSFRLGSSEDDGGEAIFGGIDPSAYSGKIDYAPVRRK--AYWEVELEKVSFGDDD 294

Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
           L      A +D+G S   LPT++ AE++   +  + +K+      SW   Y     ++  
Sbjct: 295 LELENTGAAIDTGTSLIALPTDV-AEML---NTQIGAKK------SWNGQYTVDCAKVPD 344

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIV 444
           +PD+   F++ + + ++   +   E +G  +   T +  +   G   IIG  F+  +  V
Sbjct: 345 LPDLTFYFNE-KPYPLKGTDYVL-EVQGTCISAFTGLDINLPGGSLWIIGDVFLRRYFTV 402

Query: 445 FD 446
           +D
Sbjct: 403 YD 404


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score = 68.6 bits (166), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 87/378 (23%), Positives = 147/378 (38%), Gaps = 62/378 (16%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++T I IGTP     + LD GS+++W+ C+ C +C       Y+  D     ++PSSS S
Sbjct: 8   YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCREC-------YSQAD---PIFNPSSSVS 57

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
              V C   +C    +       C Y   Y     +   Y  + +    +F   + Q   
Sbjct: 58  FSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETL----TFGTTSIQ--- 110

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
             +V IGCG    G ++  A   G+    L   S P+ L        +FS C  + DS S
Sbjct: 111 --NVAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGTQ--TGRAFSYCLVDRDSES 163

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSCLTQSGFQA--- 335
               + GP +      +PIG  +            Y++ + +  +G   L     +A   
Sbjct: 164 SGTLEFGPES------VPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRI 217

Query: 336 ---------LVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYNASS 382
                    ++DSG + T L T  Y  +   F      L  +  IS+    +  CY+ S+
Sbjct: 218 DETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISI----FDTCYDLSA 273

Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
            + + +P +   FS    F++       P +     FC      D +  I+G     G R
Sbjct: 274 LQSVSIPAVGFHFSNGAGFILPAKNCLIPMDS-MGTFCFAFAPADSNLSIMGNIQQQGIR 332

Query: 443 IVFDRENLKLAWSHSKCE 460
           + FD  N  + ++  +C+
Sbjct: 333 VSFDSANSLVGFAIDQCQ 350


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 92/374 (24%), Positives = 162/374 (43%), Gaps = 67/374 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP    L   D GS+L W+  + C QC P     +          DPS+S++   +
Sbjct: 84  LSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIF----------DPSNSTTFHKL 133

Query: 173 SCSHPLCKS-RSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
            C+   C +   S +S  DP  C Y   Y  + + ++GYL  D + + +       +SVQ
Sbjct: 134 PCTTAPCNALDESARSCTDPTTCGYTYSYG-DHSYTTGYLASDTVTVGN-------ASVQ 185

Query: 230 -SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------- 281
             +V  GCG +  G++ +  +  G++GLG G++S  S L     I   FS C        
Sbjct: 186 IRNVAFGCGTRNGGNFDEQGS--GIVGLGGGNLSFVSQLGDT--IGKKFSYCLLPLENEI 241

Query: 282 -----DENDSGSVFFGDQGPATQQSTSFL-----PIGEKYDA--YFVGVESYCIGNSCLT 329
                D   +  + FGD    +  ST+ +     P+  K  +  Y++ +E+  +G   L 
Sbjct: 242 SSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLL 301

Query: 330 -----------QSGFQA-------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
                       SG ++       ++DSG + TFL  E Y  +     + +  +R++   
Sbjct: 302 YSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVK 361

Query: 372 NS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
           NS +  C+ +  EE +++P M++ F       ++         EG   F +   +  G Y
Sbjct: 362 NSMFSLCFKSGKEE-VELPLMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLPTNDVGIY 420

Query: 431 GIIGQ-NFMMGHRI 443
           G + Q NF++G+ +
Sbjct: 421 GNLAQMNFVVGYDL 434


>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
          Length = 410

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 81/398 (20%), Positives = 157/398 (39%), Gaps = 46/398 (11%)

Query: 93  PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
           PS        GN +   H+   ++I  P   + + +D GS L W+ C   CI C  +   
Sbjct: 20  PSSAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHG 79

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
            Y        E   +   + +  +  +   +    C   K+ C Y   Y     SS G L
Sbjct: 80  LYK------PELKYAVKCTEQRCADLYADLRKPMKCGP-KNQCHYGIQYV--GGSSIGVL 130

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
           + D     SFS  A   +  +S+  GCG  Q  +  +   P +G++GLG G V++ S L 
Sbjct: 131 IVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLK 185

Query: 269 KAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY--FVGVESYCIGN 325
             G+I ++    C      G +FFGD    T   T + P+  ++  Y    G   +   +
Sbjct: 186 SQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTLHFNSNS 244

Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYNA 380
             ++ +  + + DSGA++T+   + Y   +      +S +      +  +  +   C+  
Sbjct: 245 KPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKG 304

Query: 381 SSEEMLKVPDMRLIFS----------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
             +++  + +++  F           K  +  +    +     EG    CL ++    ++
Sbjct: 305 -KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKEH 361

Query: 431 ------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
                  +IG   M+   +++D E   L W + +C+ +
Sbjct: 362 PSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 99/421 (23%), Positives = 160/421 (38%), Gaps = 83/421 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE--------- 160
           ++    +GTP   FL+  D GS+L WV C+       + +     +              
Sbjct: 55  YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114

Query: 161 ----------YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSS 205
                     + P  S +   + CS   C      S ++C +   PC Y  +Y  +D S+
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAY--EYRYKDGSA 172

Query: 206 S-GYLVDDILHLASFSKHAPQSSVQS---SVIIGCGRKQTG-SYLDGAAPDGVMGLGLGD 260
           + G +  D   +A   + A +   ++    V++GC    TG S+L   A DGV+ LG  +
Sbjct: 173 ARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL---ASDGVLSLGYSN 229

Query: 261 VSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQ------------------GPA 297
           VS  S    A      FS C        N +  + FG                     P 
Sbjct: 230 VSFASR--AAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPG 287

Query: 298 TQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTE 349
            +Q T  L        Y V V    +    L         Q G  A++DSG S T L + 
Sbjct: 288 ARQ-TPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAILDSGTSLTVLVSP 346

Query: 350 IYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASS----EEM-LKVPDMRLIFSKNQSFVV 403
            Y  VV     KLV   R+++  + + YCYN +S    E++ + VP + + F+ +     
Sbjct: 347 AYRAVVAALGKKLVGLPRVAM--DPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQP 404

Query: 404 RNHIFSFPENEGFTVFCLTVMSTDGDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
               +      G  V C+ +   +GD+    +IG      H   FD +N +L +  S+C 
Sbjct: 405 PPKSYVIDAAPG--VKCIGLQ--EGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRCM 460

Query: 461 E 461
           +
Sbjct: 461 Q 461


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 97/389 (24%), Positives = 162/389 (41%), Gaps = 59/389 (15%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           H   + IGTP     + +D GS+L+W  C  +     +A+  +     L  Y+P  SSS 
Sbjct: 84  HSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPL--YEPRRSSSF 141

Query: 170 KNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
             + CS  LC+    S  +C +  + C Y   Y + +  + G L  +       +K    
Sbjct: 142 AYLPCSDRLCQEGQFSYKNC-ARNNRCMYDELYGSAE--AGGVLASETFTFGVNAK---- 194

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FD 282
             V   +  GCG    G  L GA+  G+MGL  G +S+ S L+        FS C   F 
Sbjct: 195 --VSLPLGFGCGALSAGD-LVGAS--GLMGLSPGIMSLVSQLSVP-----RFSYCLTPFA 244

Query: 283 ENDSGSVFFGDQG-------PATQQSTSFL--PIGEKYDAYFVGVESYCIGNSCL----T 329
           E  +  + FG            T Q+TS L  P  E    Y+V +    +G   L    T
Sbjct: 245 ERKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMET-AYYYVPLVGLSLGTKRLDVPAT 303

Query: 330 QSGF-------QALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKYCY- 378
             G          +VDSG++ ++L    +  V   VV+  +L  +       + ++ C+ 
Sbjct: 304 SLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDEDYDDYELCFA 363

Query: 379 --NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDGDYG--II 433
                + E +K P + L F    +  + R++ F  P      + CL V ++   +G  II
Sbjct: 364 LPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRA---GLMCLAVGTSPDGFGVSII 420

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           G        ++FD  N K +++ +KC+++
Sbjct: 421 GNVQQQNMHVLFDVRNQKFSFAPTKCDDI 449


>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 429

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/406 (22%), Positives = 165/406 (40%), Gaps = 42/406 (10%)

Query: 79  LQSNNNSSRNQLLFPSEGSQT-HFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
           L S   SSR++LL P+  S     +GN +    +   ++IG P   + + +D GS+L W+
Sbjct: 36  LPSEATSSRSRLLNPAGSSIVLPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWL 95

Query: 137 PCQ--CIQCAPLSASYYTSLDRNLSEYDP--SSSSSSKNVSCSHPLCKSRSSCKSLKDPC 192
            C   C  C+      Y   +  +   DP  +S   +++ +C HP            D C
Sbjct: 96  QCDAPCTHCSETPHPLYRPSNDFVPCRDPLCASLQPTEDYNCEHP------------DQC 143

Query: 193 PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDG 252
            Y  +Y+ +  S+ G L++D+ +L +F+       ++  + +GCG  Q  S       DG
Sbjct: 144 DYEINYA-DQYSTFGVLLNDV-YLLNFTNGV---QLKVRMALGCGYDQVFSPSSYHPLDG 198

Query: 253 VMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE-KY 311
           ++GLG G  S+ S L   GL++N    C      G +FFG+   + +   ++ PI     
Sbjct: 199 LLGLGRGKASLISQLNSQGLVRNVIGHCLSAQGGGYIFFGNAYDSAR--VTWTPISSVDS 256

Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISL 369
             Y  G      G          A+ D+G+S+T+  +  Y  ++    K +S K  +++ 
Sbjct: 257 KHYSAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQALLSWLKKELSGKPLKVAP 316

Query: 370 QGNSWKYCYNASSEEMLKVPDMRLIFS-----------KNQSFVVRNHIFSFPENEGFTV 418
              +   C++        + ++R  F                F +    +    N G   
Sbjct: 317 DDQTLPLCWHG-KRPFTSLREVRKYFKPVALGFTNGGRTKAQFEILPEAYLIISNLGNVC 375

Query: 419 FCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
             +   S  G  +  +IG   M    +VF+ E   + W  + C  +
Sbjct: 376 LGILNGSEVGLEELNLIGDISMQDKVMVFENEKQLIGWGPADCSRI 421


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 101/429 (23%), Positives = 171/429 (39%), Gaps = 68/429 (15%)

Query: 69  DWKRQKTR------VKLQSNNNSSRNQL-LFPSEGSQTHF---FGNQFYWLHYTWIDIGT 118
           DW R+  +      ++++S  N  R        E SQT      G     L+Y  + +G 
Sbjct: 13  DWNRRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINLQTLNYI-VTMGL 71

Query: 119 PNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHP 177
            + +  V +D GS+L WV C+ C+ C           ++    + PS+SSS ++VSC+  
Sbjct: 72  GSKNMTVIIDTGSDLTWVQCEPCMSC----------YNQQGPIFKPSTSSSYQSVSCNSS 121

Query: 178 LCKS-------RSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
            C+S         +C S  +P  C Y+ +Y  + + ++G L  + L     S        
Sbjct: 122 TCQSLQFATGNTGACGS-SNPSTCNYVVNYG-DGSYTNGELGVEALSFGGVSV------- 172

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DEND 285
            S  + GCGR   G +       G+MGLG   +S+ S           FS C    +   
Sbjct: 173 -SDFVFGCGRNNKGLF---GGVSGLMGLGRSYLSLVS--QTNATFGGVFSYCLPTTEAGS 226

Query: 286 SGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGNSCLTQ----SGFQAL 336
           SGS+  G++    + +     T  L   +  + Y + +    +G   L           L
Sbjct: 227 SGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGIL 286

Query: 337 VDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
           +DSG   T LP+ +Y    AE + KF    S+   S+       C+N +  + + +P + 
Sbjct: 287 IDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSI----LDTCFNLTGYDEVSIPTIS 342

Query: 393 LIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 450
           L F  N    V      +   E+       L  +S   D  IIG       R+++D +  
Sbjct: 343 LRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQS 402

Query: 451 KLAWSHSKC 459
           K+ ++   C
Sbjct: 403 KVGFAEEPC 411


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 161/374 (43%), Gaps = 84/374 (22%)

Query: 114  IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
            + +G+P     + LD GS L W+ C+        +   TS+      ++P SSSS   + 
Sbjct: 1004 LTVGSPPQQVTMVLDTGSELSWLHCK-------KSPNLTSV------FNPLSSSSYSPIP 1050

Query: 174  CSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
            CS P+C++R+  + L +P        C  I  Y+ + +S  G L  D   +         
Sbjct: 1051 CSSPICRTRT--RDLPNPVTCDPKKLCHAIVSYA-DASSLEGNLASDNFRIG-------- 1099

Query: 226  SSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
            SS     + GC      S   + A   G+MG+  G +   S + + GL +  FS C    
Sbjct: 1100 SSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSL---SFVTQLGLPK--FSYCISGR 1154

Query: 285  D-SGSVFFGD----------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
            D SG + FGD            P  Q ST  LP  ++  AY V ++   +GN  L     
Sbjct: 1155 DSSGVLLFGDLHLSWLGNLTYTPLVQISTP-LPYFDRV-AYTVQLDGIRVGNKILPLPKS 1212

Query: 329  ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------ 376
                  T +G Q +VDSG  FTFL   +Y  +  +F +        L   ++ +      
Sbjct: 1213 IFAPDHTGAG-QTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDL 1271

Query: 377  CYN-ASSEEMLKVPDMRLIFSKNQSFVVRNHI--FSFPE----NEGFTVFCLTVMSTDGD 429
            CY+ A+  ++  +P + L+F +    VV   +  +  PE    NE   V+CLT  ++D  
Sbjct: 1272 CYSVAAGGKLPTLPSVSLMF-RGAEMVVGGEVLLYRVPEMMKGNE--WVYCLTFGNSD-- 1326

Query: 430  YGIIG-QNFMMGHR 442
              ++G + F++GH 
Sbjct: 1327 --LLGIEAFVIGHH 1338


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/390 (23%), Positives = 152/390 (38%), Gaps = 61/390 (15%)

Query: 94  SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTS 153
           + G Q    GN     +   + +GTP     + LD   +  WVPC    CA  S+     
Sbjct: 88  ASGQQVLNIGN-----YVVRVKLGTPGQLMFMVLDTSRDAAWVPCA--DCAGCSS----- 135

Query: 154 LDRNLSEYDPSSSSSSKNVSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVD 211
                  + P++SS+  ++ CS P C      SC +      +       D+S S  L  
Sbjct: 136 -----PTFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQ 190

Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
           D L LA             S   GC    +GS L    P G++GLG G +   SLL+++G
Sbjct: 191 DSLGLA--------VDTLPSYSFGCVNAVSGSTLP---PQGLLGLGRGPM---SLLSQSG 236

Query: 272 -LIQNSFSICFDEND----SGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
            L    FS CF        SGS+  G  G P   ++T  L    +   Y+V +    +G 
Sbjct: 237 SLYSGVFSYCFPSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGR 296

Query: 326 SCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 375
             +            +G   ++DSG   T     +YA +  +F K V     ++   ++ 
Sbjct: 297 VLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPFATI--GAFD 354

Query: 376 YCYNASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----Y 430
            C+ A++E++   P +   F+  +    + N +     +   ++ CL + +   +     
Sbjct: 355 TCFAATNEDI--APPVTFHFTGMDLKLPLENTLI---HSSAGSLACLAMAAAPNNVNSVL 409

Query: 431 GIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
            +I        RI+FD  N +L  +   C 
Sbjct: 410 NVIANLQQQNLRIMFDVTNSRLGIARELCN 439


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 85/372 (22%), Positives = 139/372 (37%), Gaps = 63/372 (16%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP  + LVA+D  ++  WVP               +       +DP+ SS+ + V C 
Sbjct: 113 LGTPAQALLVAIDPSNDAAWVP-----------CAACAGCARAPSFDPTRSSTYRPVRCG 161

Query: 176 HPLC--KSRSSCK-SLKDPCPYIADYSTEDTSS-----SGYLVDDILHLASFSKHAPQSS 227
            P C      SC   L   C +   Y+     +     +  L DD+  +A+++       
Sbjct: 162 APQCSQAPAPSCPGGLGSSCAFNLSYAASTFQALLGQDALALHDDVDAVAAYT------- 214

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DE 283
                  GC    TG  +    P G++G G G +S PS      +  + FS C       
Sbjct: 215 ------FGCLHVVTGGSVP---PQGLVGFGRGPLSFPSQTKD--VYGSVFSYCLPSYKSS 263

Query: 284 NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSG 332
           N SG++  G  G P   ++T  L    +   Y+V +    +G   +            SG
Sbjct: 264 NFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSG 323

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              +VD+G  FT L   +YA V   F   V +      G  +  CYN +    + VP + 
Sbjct: 324 RGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVAGPLGG-FDTCYNVT----ISVPTVT 378

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-----TDGDYGIIGQNFMMGHRIVFDR 447
             F    S  +         + G  + CL + +      D    ++       HR++FD 
Sbjct: 379 FSFDGRVSVTLPEENVVIRSSSG-GIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDV 437

Query: 448 ENLKLAWSHSKC 459
            N ++ +S   C
Sbjct: 438 ANGRVGFSRELC 449


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 83/364 (22%), Positives = 130/364 (35%), Gaps = 47/364 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP   F V  D GS+  WV     QC P  A  Y    +    + P+ S++  N+S
Sbjct: 169 IRLGTPAARFTVVFDTGSDTTWV-----QCQPCVAYCY---QQKEPLFTPTKSATYANIS 220

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+   C    +       C Y   Y  + + + G+   D L L                 
Sbjct: 221 CTSSYCSDLDTRGCSGGHCLYAVQYG-DGSYTVGFYAQDTLTLG--------YDTVKDFR 271

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
            GCG K  G +   A   G+MGLG G  SVP  +         F+ C     SG+ F   
Sbjct: 272 FGCGEKNRGLFGKAA---GLMGLGRGKTSVP--VQAYDKYSGVFAYCIPATSSGTGFLDF 326

Query: 294 QGPATQQSTSFLP---IGEKYDAYFVGVESYCIGNSCLTQ-----SGFQALVDSGASFTF 345
              A   + + L    +      Y+VG+    +G   L+      S   ALVDSG   T 
Sbjct: 327 GPGAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITR 386

Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWK---------YCYNASS-EEMLKVPDMRLIF 395
           LP   Y  +   F K        ++G  +K          CY+ +  +  + +P + L+F
Sbjct: 387 LPPSAYEPLRSAFAK-------GMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVF 439

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
                  V      +  +             D D  I+G      + +++D     + ++
Sbjct: 440 QGGACLDVDASGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFA 499

Query: 456 HSKC 459
              C
Sbjct: 500 PGAC 503


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 149/364 (40%), Gaps = 43/364 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP     +  D GS+L WV     QC P  +S +    ++   +DPS SS+   V 
Sbjct: 153 VGLGTPAQPSALIFDTGSDLSWV-----QCQPCGSSGHCHPQQD-PLFDPSKSSTYAAVH 206

Query: 174 CSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           C  P C +    C      C Y+  Y  + +S++G L  D L L S       S   +  
Sbjct: 207 CGEPQCAAAGGLCSEDNTTCLYLVHYG-DGSSTTGVLSRDTLALTS-------SRALAGF 258

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
             GCG +  G +      DG++GLG G++S+PS  A +      FS C   ++S + +  
Sbjct: 259 PFGCGTRNLGDF---GRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLPSSNSTTGYLT 313

Query: 293 -DQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSG 340
               PAT     Q T+ L   +    YFV + S  IG   L       T+ G   L+DSG
Sbjct: 314 IGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRGG--TLLDSG 371

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
              T+LP + Y  +  +F   +     +   +    CY+ + E  + VP +   F     
Sbjct: 372 TVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAV 431

Query: 401 FVVR--NHIFSFPENEGFTVFCLTVMSTDGD---YGIIGQNFMMGHRIVFDRENLKLAWS 455
           F +     +    EN G    CL   + D       IIG        +++D    K+ + 
Sbjct: 432 FELDFFGVMIFLDENVG----CLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFV 487

Query: 456 HSKC 459
            + C
Sbjct: 488 PASC 491


>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
 gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
          Length = 475

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/259 (23%), Positives = 103/259 (39%), Gaps = 27/259 (10%)

Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
           C Y   Y+ E +SS G++V+D           P       ++ GC   +TG      A D
Sbjct: 7   CYYSRTYA-ERSSSEGWMVEDAFGF-------PDDQPPVRMVFGCENGETGEIYRQLA-D 57

Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY 311
           G+MG+G    +  S L   G+I++ FS+CF     G +  GD       +T + P+    
Sbjct: 58  GIMGMGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLLNNL 117

Query: 312 DAYFVGVESYCIG--------NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
             ++  V    I         N+ +   G+  ++DSG +FT+LPTE +  +         
Sbjct: 118 HLHYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGSYAL 177

Query: 364 SKRI-SLQGNSWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEG 415
           S  + S  G   +Y   C+  + +    +    P    +F  N    +    + F    G
Sbjct: 178 SHGLQSTPGADPQYNDICWKGAPDNFQGLENHFPSAEFVFGDNARLSLPPLRYLFVSRPG 237

Query: 416 FTVFCLTVMSTDGDYGIIG 434
              +CL V    G   +IG
Sbjct: 238 --EYCLGVFDNGGSGTLIG 254


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 90/359 (25%), Positives = 154/359 (42%), Gaps = 40/359 (11%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +G P       LD GS++ W+  QC+ CA  +  Y    ++    +DP  SSS   VSC 
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWL--QCLPCAGKNGCY----EQITPIFDPELSSSYNPVSCD 56

Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
              C+         + C Y  +Y  + + + G L  + L           S+   ++ IG
Sbjct: 57  SEQCQLLDEAGCNVNSCIYKVEYG-DGSFTIGELATETLTFV-------HSNSIPNISIG 108

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD-- 293
           CG    G ++       ++GLG G +S+ S L  +     SFS C  + DS S    D  
Sbjct: 109 CGHDNEGLFVGADG---LIGLGGGAISISSQLKAS-----SFSYCLVDIDSPSFSTLDFN 160

Query: 294 QGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCLT---------QSGFQAL-VDSGAS 342
             P +    S L   +++ ++ +V V    +G   L          +SG   + VDSG +
Sbjct: 161 TDPPSDSLISPLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTT 220

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
            T LP+++Y  +   F  L ++   + + + +  CY+ SS+  ++VP +  I     S  
Sbjct: 221 ITQLPSDVYEVLREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQ 280

Query: 403 V--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +  +N +    + +    FCL  +S      IIG     G R+ +D  N  + +S +KC
Sbjct: 281 LPAKNCLI---QVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 96/377 (25%), Positives = 160/377 (42%), Gaps = 66/377 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP  ++   +D GS+L+W  C+ C QC           D+    +DP  SSS   +
Sbjct: 104 LAIGTPPETYSAIMDTGSDLIWTQCKPCTQC----------FDQPSPIFDPKKSSSFSKL 153

Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           SCS  LCK+  +SSC    D C Y+  Y  + +S+ G +  +       S          
Sbjct: 154 SCSSQLCKALPQSSC---SDSCEYLYTYG-DYSSTQGTMATETFTFGKVSI--------P 201

Query: 231 SVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 286
           +V  GCG    G   DG     G++GLG G +S+ S L +A      FS C    D+  +
Sbjct: 202 NVGFGCGEDNEG---DGFTQGSGLVGLGRGPLSLVSQLKEA-----KFSYCLTSIDDTKT 253

Query: 287 GSVFFG-----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ----- 334
            ++  G     +   A  ++T  +    +   Y++ +E   +G + L   +S FQ     
Sbjct: 254 STLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDG 313

Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPD 390
               ++DSG + T+L    +  V  +F   +     +      + CYN  S+   L+VP 
Sbjct: 314 TGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPSDTSELEVPK 373

Query: 391 MRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGHRIVF 445
           + L F+  +      N++ +   +    V CL  M + G   I G    QN  + H    
Sbjct: 374 LVLHFTGADLELPGENYMIA---DSSMGVICL-AMGSSGGMSIFGNVQQQNMFVSH---- 425

Query: 446 DRENLKLAWSHSKCEEV 462
           D E   L++  + C ++
Sbjct: 426 DLEKETLSFLPTNCGQL 442


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 99/410 (24%), Positives = 155/410 (37%), Gaps = 70/410 (17%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           IGTP      ALD  S+L+W  C     AP               ++P  S++  +V C+
Sbjct: 106 IGTPPQQVSGALDISSDLVWTACGA--TAP---------------FNPVRSTTVADVPCT 148

Query: 176 HPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
              C+        +   +    C Y   Y     +++G L  +              +  
Sbjct: 149 DDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGD--------TRI 200

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--- 286
             V+ GCG +  G +   +   GV+GLG G++S+ S L       + FS  F  +DS   
Sbjct: 201 DGVVFGCGLQNVGDF---SGVSGVIGLGRGNLSLVSQLQV-----DRFSYHFAPDDSVDT 252

Query: 287 -GSVFFGDQG-PATQQ--STSFLPIGEKYDAYFVGVESYCI-GNSCLTQSG-FQALVDSG 340
              + FGD   P T    ST  L        Y+V +    + G      SG F      G
Sbjct: 253 QSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDG 312

Query: 341 ASFTFLP----TEIYAEVVVKFDKLVSSKRISL---QGNSW--KYCYNASSEEMLKVPDM 391
           +   FL       +  E   K  +   + +I L    G++     CY   S    KVP M
Sbjct: 313 SGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSM 372

Query: 392 RLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDREN 449
            L+F+      +   + F      G    CLT++ S+ GD  ++G    +G  +++D   
Sbjct: 373 ALVFAGGAVMELELGNYFYMDSTTGLA--CLTILPSSAGDGSVLGSLIQVGTHMMYDING 430

Query: 450 LKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 499
            KL +     E +   +     PPP+G S      T QQ+     A+APP
Sbjct: 431 SKLVF-----ESLAQAA----APPPSGSSQQTSSKTNQQAGGRRSASAPP 471


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 92/406 (22%), Positives = 156/406 (38%), Gaps = 77/406 (18%)

Query: 107 YWLHYTWIDIGTPNVSFL-VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + IGTP    + + LD GS+L+W  C C  C            +    +D  +
Sbjct: 100 YLIH---LSIGTPRPQRVALTLDTGSDLVWTQCACHVC----------FAQPFPTFDALA 146

Query: 166 SSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
           S ++  V CS P+C S     S C    + C Y+ DY+ + + +SG +V+D     +F+ 
Sbjct: 147 SQTTLAVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYA-DKSITSGRIVED-----TFTF 200

Query: 222 HAPQSSVQS---------SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 272
            +PQ +  S         +V  GCG+   G +    +  G+ G   G +S+PS L K   
Sbjct: 201 RSPQGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNES--GIAGFSRGPMSLPSQL-KVAR 257

Query: 273 IQNSFSICFDENDSGSVFFGDQGP--------ATQQSTSFLPIGEKYDAYFVGVESYCIG 324
             + F+   D   S     G  GP           QST F         Y++ ++   +G
Sbjct: 258 FSHCFTAIADARTSPVFLGGAPGPDNLGAHATGPVQSTPF--ANSNGSLYYLTLKGITVG 315

Query: 325 NSCLTQSGFQ------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 372
            + L  +                ++DSG     LP  +Y  +   F   V+  ++ +   
Sbjct: 316 KTRLPLNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAF---VARVKLPVANE 372

Query: 373 SW-----KYCYNASSEEMLKVPDMRLIFSKNQSFVV--------RNHIFSFPENE--GFT 417
           S        C+ A+    L          K    V          +++    E+E    +
Sbjct: 373 SAADAESTLCFEAARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGS 432

Query: 418 VFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
             CL + S  D D  IIG        + +D E  KL +  ++C+++
Sbjct: 433 GLCLVMNSAGDSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARCDKM 478


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 150/373 (40%), Gaps = 53/373 (14%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I +GTP     + LD GS+++W     +QCAP     YT  D     +DP+ S + 
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVW-----LQCAPCRKC-YTQTDH---VFDPTKSRTY 168

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP--Q 225
             + C  PLC+   S  C +    C Y   Y            D       FS      +
Sbjct: 169 AGIPCGAPLCRRLDSPGCSNKNKVCQYQVSYG-----------DGSFTFGDFSTETLTFR 217

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
            +  + V +GCG    G +   A    ++GLG G +S P    +     + FS C  +  
Sbjct: 218 RNRVTRVALGCGHDNEGLFTGAAG---LLGLGRGRLSFPVQTGRR--FNHKFSYCLVDRS 272

Query: 286 S----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ- 334
           +     SV FGD   A  ++  F P+    K D  Y++ +    +G +    L+ S F+ 
Sbjct: 273 ASAKPSSVIFGDS--AVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRL 330

Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
                   ++DSG S T L    Y  +   F    S  + + + + +  C++ S    +K
Sbjct: 331 DAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSLFDTCFDLSGLTEVK 390

Query: 388 VPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           VP + L F   + S    N++    +N G   FC     T     IIG     G RI +D
Sbjct: 391 VPTVVLHFRGADVSLPATNYLIPV-DNSG--SFCFAFAGTMSGLSIIGNIQQQGFRISYD 447

Query: 447 RENLKLAWSHSKC 459
               ++ ++   C
Sbjct: 448 LTGSRVGFAPRGC 460


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 88/370 (23%), Positives = 145/370 (39%), Gaps = 58/370 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP    L+A+D  S++ W+PC  C+ C   +A            + P+ S+S KNV
Sbjct: 103 VLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNV 150

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           SCS P CK   +       C +   Y +   +++  L  D + LA+    A         
Sbjct: 151 SCSAPQCKQVPNPACGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA--------F 200

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGS 288
             GC  K  G    G  P     LGLG   +  +     + +++FS C         SGS
Sbjct: 201 TFGCVNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGS 257

Query: 289 VFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALV 337
           +  G    P   + T  L    +   Y+V + +  +G   +            +G   + 
Sbjct: 258 LRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIF 317

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
           DSG  +T L   +Y  V  +F K V   ++   SL G  +  CY+      +KVP +  +
Sbjct: 318 DSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGG--FDTCYSG----QVKVPTITFM 371

Query: 395 FSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDREN 449
           F   N +    N +     +   +  CL + S   +      +I       HR++ D  N
Sbjct: 372 FKGVNMTMPADNLML---HSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPN 428

Query: 450 LKLAWSHSKC 459
            +L  +  +C
Sbjct: 429 GRLGLARERC 438


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 85/314 (27%), Positives = 125/314 (39%), Gaps = 52/314 (16%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
           Y +H   + IGTP     + LD GS+L+W  CQ C  C           D+ L  +DPS+
Sbjct: 82  YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC----------FDQALPYFDPST 128

Query: 166 SSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASF 219
           SS+    SC   LC+    +SC S K      C Y   Y  + + ++G+L  D       
Sbjct: 129 SSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDKFTFVGA 187

Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
               P       V  GCG    G +       G+ G G G +S+PS L K G    +FS 
Sbjct: 188 GASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSH 234

Query: 280 CFDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 328
           CF   +     +V           G    QST  +        Y++ ++   +G++ L  
Sbjct: 235 CFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPV 294

Query: 329 TQSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
            +S F         ++DSG + T LPT +Y  V   F   V    +S       +C +A 
Sbjct: 295 PESEFALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAP 354

Query: 382 SEEMLKVPDMRLIF 395
                 VP + L F
Sbjct: 355 LRAKPYVPKLVLHF 368


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 88/371 (23%), Positives = 150/371 (40%), Gaps = 59/371 (15%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
           +  GTP+V  ++ +D GS++ WV   PC   +C P          +    +DPS SS+  
Sbjct: 129 LGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYP----------QKDPLFDPSKSSTYA 178

Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
            ++C    C       R+ C S    C Y  +Y  + +S+ G   ++ +        AP 
Sbjct: 179 PIACGADACNKLGDHYRNGCTSGGTQCGYRVEYG-DGSSTRGVYSNETITF------APG 231

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--E 283
            +V+     GCG  Q G        DG++GLG    S+  ++  A +   +FS C     
Sbjct: 232 ITVK-DFHFGCGHDQRGP---SDKFDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALN 285

Query: 284 NDSGSVFFGDQGPATQQSTSF-------LPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ 334
           +++G +  G +  A   +++F       LP+     +Y V +    +G   L   +S F+
Sbjct: 286 SEAGFLALGVRPSAATNTSAFVFTPMWHLPMDAT--SYMVNMTGISVGGKPLDIPRSAFR 343

Query: 335 A--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              L+DSG   T LP   Y  +     K  ++  + +    +  CYN +    + VP + 
Sbjct: 344 GGMLIDSGTIVTELPETAYNALNAALRKAFAAYPM-VASEDFDTCYNFTGYSNVTVPRVA 402

Query: 393 LIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFDRE 448
           L FS   +    V N I            CL    +  D   GIIG        +++D  
Sbjct: 403 LTFSGGATIDLDVPNGI--------LVKDCLAFRESGPDVGLGIIGNVNQRTLEVLYDAG 454

Query: 449 NLKLAWSHSKC 459
           + K+ +    C
Sbjct: 455 HGKVGFRAGAC 465


>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 97/390 (24%), Positives = 152/390 (38%), Gaps = 59/390 (15%)

Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
           GN +   +YT  + IG P   + + +D GS+L WV C   C  C         +L RN  
Sbjct: 56  GNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGC---------TLPRN-R 105

Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
            Y P        V C  PLC +  S     C    + C Y  +Y+ +  SS G L+ D +
Sbjct: 106 LYKPHGDL----VKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYA-DQGSSLGVLLRDNI 160

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKA 270
            L    K    S  +  +  GCG  QT     G  P     GV+GLG G  S+ S L   
Sbjct: 161 PL----KFTNGSLARPMLAFGCGYDQTHH---GQNPPPSTAGVLGLGNGRTSILSQLHSL 213

Query: 271 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCL 328
           GLI+N    C      G +FFGDQ         + P+ +   A  Y  G           
Sbjct: 214 GLIRNVVGHCLSGRGGGFLFFGDQL-IPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTT 272

Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WK--YCYNA 380
           +  G + + DSG+S+T+  ++ +  +V      +  K +S           WK    + +
Sbjct: 273 SVKGLELIFDSGSSYTYFNSQAHKALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKS 332

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYG 431
             +       + L F+K+     +N     P      V      CL ++       G+  
Sbjct: 333 LHDVTSNFKPLLLSFTKS-----KNSPLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTN 387

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           IIG   +    +++D E  ++ W+ + C+ 
Sbjct: 388 IIGDISLQDKLVIYDNEKQQIGWASANCDR 417


>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
 gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 103/412 (25%), Positives = 153/412 (37%), Gaps = 95/412 (23%)

Query: 124 LVALDAGSNLLWVPCQ---CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
            + LD GS+L+W PCQ   CI C   + +  TSL    S   P  S ++  VSC    C 
Sbjct: 94  FLYLDTGSDLVWFPCQPFECILCEGKAEN--TSLA---STPPPKLSKTATPVSCKSSACS 148

Query: 181 S----------------------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
           +                       S C+  K  CP    Y+  D S    L  D +   S
Sbjct: 149 AAHSNLPSSDLCAISNCPLESIETSDCQ--KHSCPQFY-YAYGDGSLIARLYRDSI---S 202

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSF 277
                P + + ++   GC           A P GV G G G +S+P+ LA  +  + N F
Sbjct: 203 LPLSNPTNLIVNNFTFGCAHTAL------AEPIGVAGFGRGVLSLPAQLATLSPQLGNQF 256

Query: 278 SIC---------------------FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
           S C                     +D ++      G   P     TS L   E    Y V
Sbjct: 257 SYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCV 315

Query: 317 GVESYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLV---- 362
           G+E   IG   +   GF            +VDSG +FT LP  +Y  VV +F+  V    
Sbjct: 316 GLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVN 375

Query: 363 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSF-----PENE 414
              R+  +      CY   +  +     + L F  N S VV   RN+ + F      + +
Sbjct: 376 ERARVIEEDTGLSPCYYFDNNVVNVP-SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGK 434

Query: 415 GFTVFCLTVMS-------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              V CL +M+       + G    +G     G  +V+D EN ++ ++  +C
Sbjct: 435 KRKVGCLMLMNGGEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQC 486


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 105/445 (23%), Positives = 173/445 (38%), Gaps = 74/445 (16%)

Query: 62  LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
           LEL   +  +   T  +++     +  +L    E S    +    Y   Y    IG P  
Sbjct: 26  LELTHVDAKQNCSTEERMRRATERTHRRLASMGEASAPVHWAESQYIAEYL---IGDPPQ 82

Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK- 180
                +D GSNL+W   QC  C P          +NLS YDPS S +++ V+C+   C  
Sbjct: 83  QAEAIIDTGSNLIWT--QCSTCQPAGC-----FSQNLSFYDPSRSRTARPVACNDTACAL 135

Query: 181 -SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC--G 237
            S + C      C  +  Y          ++  +L   +F+   PQS    S+  GC   
Sbjct: 136 GSETRCARDNKACAVLTAYGAG-------VIGGVLGTEAFTFQ-PQSE-NVSLAFGCIAA 186

Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFG 292
            + T   LDGA+  G++GLG G++S+ S L       N FS C         ++  +F G
Sbjct: 187 TRLTPGSLDGAS--GIIGLGRGNLSLVSQLGD-----NKFSYCLTPYFSQSTNTSRLFVG 239

Query: 293 -----DQGPATQQSTSFL--PIGEKYDA-YFVGVESYCIGNSCLT--QSGFQ-------- 334
                  G A   S  FL  P  + +   Y++ +    +G++ L   ++ F         
Sbjct: 240 ASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGL 299

Query: 335 ---ALVDSGASFTFLPTEIYA----EVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
               L+DSG+ FT L    Y     E+V +    +       +G     C   +  ++ K
Sbjct: 300 WAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEG--LDLCAAVAHGDVGK 357

Query: 388 -VPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--------DYGIIGQNF 437
            VP + L F S      V    +  P ++  +  C+ V S+ G        +  IIG   
Sbjct: 358 LVPPLVLHFGSGGGDVAVPPENYWGPVDD--STACMVVFSSGGPNSTLPMNETTIIGNYM 415

Query: 438 MMGHRIVFDRENLKLAWSHSKCEEV 462
                +++D E   L++  + C  +
Sbjct: 416 QQDMHLLYDLEKGMLSFQPADCSSM 440


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 87/368 (23%), Positives = 144/368 (39%), Gaps = 58/368 (15%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP    L+A+D  S++ W+PC  C+ C   +A            + P+ S+S KNVSC
Sbjct: 121 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSC 168

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
           S P CK   +       C +   Y +   +++  L  D + LA+    A           
Sbjct: 169 SAPQCKQVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA--------FTF 218

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVF 290
           GC  K  G    G  P     LGLG   +  +     + +++FS C         SGS+ 
Sbjct: 219 GCVNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLR 275

Query: 291 FGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDS 339
            G    P   + T  L    +   Y+V + +  +G   +            +G   + DS
Sbjct: 276 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 335

Query: 340 GASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           G  +T L   +Y  V  +F K V   ++   SL G  +  CY+      +KVP +  +F 
Sbjct: 336 GTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGG--FDTCYSG----QVKVPTITFMFK 389

Query: 397 K-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDRENLK 451
             N +    N +     +   +  CL + +   +      +I       HR++ D  N +
Sbjct: 390 GVNMTMPADNLML---HSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGR 446

Query: 452 LAWSHSKC 459
           L  +  +C
Sbjct: 447 LGLARERC 454


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 100/431 (23%), Positives = 157/431 (36%), Gaps = 66/431 (15%)

Query: 66  LSNDWKRQKTR---VKLQSNNNSSRNQLLFPSEGSQTH---FFGNQFYWLHYT-WIDIGT 118
           L+   +R + R   +  ++    +    L  + G  T    F G+    L Y   + IGT
Sbjct: 120 LAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGT 179

Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
           P V   V +D GS+L WV     QC P  A    +    L  +DPSSSSS  +V C    
Sbjct: 180 PAVQQTVLIDTGSDLSWV-----QCKPCGAGECYAQKDPL--FDPSSSSSYASVPCDSDA 232

Query: 179 CKSRSS------CKSLKDP----CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           C+  ++      C  +       C Y  +Y    T++  Y  + +             ++
Sbjct: 233 CRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL-------------TL 279

Query: 229 QSSVII-----GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
           +  V++     GCG  Q G Y      DG++GLG    S+ S  +        FS C   
Sbjct: 280 KPGVVVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPP 334

Query: 284 NDSGSVFFGDQGPATQQST------SFLPIGEKYDA---YFVGVESYCIGNSCLT--QSG 332
              G+ F     P    S+      SF P+         Y V +    +G + L    S 
Sbjct: 335 TSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSA 394

Query: 333 FQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKV 388
           F +  ++DSG   T LP   YA +   F   +S  R+     G     CY+ +    + V
Sbjct: 395 FSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTV 454

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 448
           P + L FS   +  +        +             TD   GIIG        +++D  
Sbjct: 455 PTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGTDNAIGIIGNVNQRTFEVLYDSG 510

Query: 449 NLKLAWSHSKC 459
              + +    C
Sbjct: 511 KGTVGFRAGAC 521


>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
 gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 103/412 (25%), Positives = 153/412 (37%), Gaps = 95/412 (23%)

Query: 124 LVALDAGSNLLWVPCQ---CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
            + LD GS+L+W PCQ   CI C   + +  TSL    S   P  S ++  VSC    C 
Sbjct: 94  FLYLDTGSDLVWFPCQPFECILCEGKAEN--TSLA---STPPPKLSKTATPVSCKSSACS 148

Query: 181 S----------------------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
           +                       S C+  K  CP    Y+  D S    L  D +   S
Sbjct: 149 AAHSNLPSSDLCAISNCPLESIETSDCQ--KHSCPQFY-YAYGDGSLIARLYRDSI---S 202

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSF 277
                P + + ++   GC           A P GV G G G +S+P+ LA  +  + N F
Sbjct: 203 LPLSNPTNLIVNNFTFGCAHTAL------AEPIGVAGFGRGVLSLPAQLATLSPQLGNQF 256

Query: 278 SIC---------------------FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
           S C                     +D ++      G   P     TS L   E    Y V
Sbjct: 257 SYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCV 315

Query: 317 GVESYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLV---- 362
           G+E   IG   +   GF            +VDSG +FT LP  +Y  VV +F+  V    
Sbjct: 316 GLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVN 375

Query: 363 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSF-----PENE 414
              R+  +      CY   +  +     + L F  N S VV   RN+ + F      + +
Sbjct: 376 ERARVIEEDTGLSPCYYFDNNVVNVP-SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGK 434

Query: 415 GFTVFCLTVMS-------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
              V CL +M+       + G    +G     G  +V+D EN ++ ++  +C
Sbjct: 435 KRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQC 486


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 89/358 (24%), Positives = 146/358 (40%), Gaps = 62/358 (17%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP  + L+A+D  ++  W+PC  C  CA              + + P  S++ KNVSC
Sbjct: 99  IGTPPQTLLLAMDTSNDAAWIPCTACDGCAS-------------TLFAPEKSTTFKNVSC 145

Query: 175 SHPLCKSRSSCKSLKDPCPYIA----DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           + P       CK + +P   ++    + +   +S +  LV D + LA        +    
Sbjct: 146 AAP------ECKQVPNPGCGVSSRNFNLTYGSSSIAANLVQDTITLA--------TDPVP 191

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDS 286
           S   GC  K TG+    A P G++GLG G +S+ S      L Q++FS C       N S
Sbjct: 192 SYTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFS 246

Query: 287 GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQA 335
           GS+  G    P   + T  L    +   Y+V +E+  +G   +            +G   
Sbjct: 247 GSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGT 306

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           + DSG  FT L   +Y  V  +F + V  K        +  CYN      + VP +  IF
Sbjct: 307 IFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVP----IVVPTITFIF 362

Query: 396 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDREN 449
           +     + +++I     +   +  CL +     +      +I       HR+++D  N
Sbjct: 363 TGMNVTLPQDNILI--HSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPN 418


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 90/379 (23%), Positives = 152/379 (40%), Gaps = 66/379 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++T I +GTP     + LD GS++ W+ C+ C +C       Y+  D     ++PS S+S
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCREC-------YSQAD---PIFNPSYSAS 206

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
              V C   +C    +       C Y A Y  + + S+G    + L   + S        
Sbjct: 207 FSTVGCDSAVCSQLDAYDCHSGGCLYEASYG-DGSYSTGSFATETLTFGTTSV------- 258

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
            ++V IGCG K  G ++  A    ++GLG G +S P+ +       ++FS C  + +S S
Sbjct: 259 -ANVAIGCGHKNVGLFIGAAG---LLGLGAGALSFPNQIGTQ--TGHTFSYCLVDRESDS 312

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSCLT--------- 329
                 GP  Q     +P+G  +            Y++ V +  +G + L          
Sbjct: 313 -----SGP-LQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRI 366

Query: 330 -----QSGFQALVDSGASFTFLPTEIYAEV----VVKFDKLVSSKRISLQGNSWKYCYNA 380
                  GF  ++DSG   T L T  Y  V    V    +L  +  +S+    +  CY+ 
Sbjct: 367 DETSGHGGF--IIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSI----FDTCYDL 420

Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 440
           S  + + VP +   FS   S ++    +  P +   T FC           I+G      
Sbjct: 421 SGLQFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGT-FCFAFAPAASSVSIMGNTQQQH 479

Query: 441 HRIVFDRENLKLAWSHSKC 459
            R+ FD  N  + ++  +C
Sbjct: 480 IRVSFDSANSLVGFAFDQC 498


>gi|395328846|gb|EJF61236.1| endopeptidase [Dichomitus squalens LYAD-421 SS1]
          Length = 412

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 91/387 (23%), Positives = 154/387 (39%), Gaps = 65/387 (16%)

Query: 75  TRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
           +R  +Q        Q  F  EG       N     ++  I +GTP  +F V LD GS+ L
Sbjct: 66  SRPAVQDGEELFWTQEEFSVEGGHNVPLSNFMNAQYFAEISLGTPPQTFKVILDTGSSNL 125

Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
           WVP   ++C  ++   +T       +YD SSSS+ K       +     S +        
Sbjct: 126 WVP--SVKCTSIACFLHT-------KYDSSSSSTYKANGTEFSIQYGSGSMEG------- 169

Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
              + ++DT   G L  D L  A  +K       +  +    G+            DG++
Sbjct: 170 ---FVSQDTFRIGDLTVDGLDFAEATK-------EPGLAFAFGKF-----------DGIL 208

Query: 255 GLGLGDVSVPSL------LAKAGLIQN---SFSICFDENDSGSVFFGD-QGPATQQSTSF 304
           GL    ++V  +      L   GL+     SF +   E+D G   FG     A      +
Sbjct: 209 GLAYDTIAVNHITPPFYHLINKGLVDEPVFSFRLGSSEDDGGEAIFGGVDDSAYTGKIQY 268

Query: 305 LPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
           +P+  K  AY+ V +E   +G+  L      A +D+G S   LPT+I AE++   +  + 
Sbjct: 269 VPVRRK--AYWEVELEKVSLGDDVLELESTGAAIDTGTSLIALPTDI-AEMI---NTQIG 322

Query: 364 SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 423
           + +      SW   Y     ++  +PD+   F  N  +V++   +   E +G  +   T 
Sbjct: 323 ATK------SWNGQYTVDCAKVPSLPDLTFTFGGN-PYVLKGTDYIL-EVQGTCISSFTG 374

Query: 424 MSTD---GDYGIIGQNFMMGHRIVFDR 447
           +  +   G   I+G  F+  +  V+D 
Sbjct: 375 LDINVPGGSLWIVGDVFLRKYYTVYDH 401


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 98/397 (24%), Positives = 168/397 (42%), Gaps = 88/397 (22%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +G+P     + LD GS L W+ C+        +   TS+      ++P SSSS   + 
Sbjct: 44  LTVGSPPQQVTMVLDTGSELSWLHCK-------KSPNLTSV------FNPLSSSSYSPIP 90

Query: 174 CSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
           CS P+C++R+  + L +P        C  I  Y+ + +S  G L  D   +         
Sbjct: 91  CSSPVCRTRT--RDLPNPVTCDPKKLCHAIVSYA-DASSLEGNLASDNFRIG-------- 139

Query: 226 SSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
           SS     + GC      S   + A   G+MG+  G +   S + + GL +  FS C    
Sbjct: 140 SSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSL---SFVTQLGLPK--FSYCISGR 194

Query: 285 D-SGSVFFGDQ----------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
           D SG + FGD            P  Q ST  LP  ++  AY V ++   +GN  L     
Sbjct: 195 DSSGVLLFGDSHLSWLGNLTYTPLVQISTP-LPYFDRV-AYTVQLDGIRVGNKILPLPKS 252

Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------ 376
                 T +G Q +VDSG  FTFL   +Y  +  +F +        L   ++ +      
Sbjct: 253 IFAPDHTGAG-QTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDL 311

Query: 377 CYNA-SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLTVMSTDGDY 430
           CY   +  ++ ++P + L+F +    VV   +  + +  G       V+CLT  ++D   
Sbjct: 312 CYRVPAGGKLPELPAVSLMF-RGAEMVVGGEVLLY-KVPGMMKGKEWVYCLTFGNSD--- 366

Query: 431 GIIG-QNFMMGHR------IVFDRENLKLAWSHSKCE 460
            ++G + F++GH       + FD    ++ +  ++C+
Sbjct: 367 -LLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCD 402


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 151/393 (38%), Gaps = 64/393 (16%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
           G +   L+Y    +G       V +D  S L WV     QCAP  + +    D+    +D
Sbjct: 145 GAKLRTLNYVAT-VGLGGGEATVIVDTASELTWV-----QCAPCESCH----DQQDPLFD 194

Query: 163 PSSSSSSKNVSCSHPLCKS-----------RSSCKSLKD---PCPYIADYSTEDTSSSGY 208
           PSSS S   V C+   C +            ++C+        C Y   Y  + + S G 
Sbjct: 195 PSSSPSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYR-DGSYSRGV 253

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLL 267
           L  D L LA          V    + GCG    G    G +  G+MGLG   +S V   +
Sbjct: 254 LAHDRLSLAG--------EVIDGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTM 303

Query: 268 AKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVE 319
            + G +   FS C    + + SGS+  GD     + ST  +      D      YFV + 
Sbjct: 304 DQFGGV---FSYCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLT 360

Query: 320 SYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRIS 368
              +G   +          G +A++DSG   T L   IY    AE + +F +   +   S
Sbjct: 361 GITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFS 420

Query: 369 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 428
           +       C+N +    ++VP ++L+F       V +    +  +   +  CL +     
Sbjct: 421 I----LDTCFNMTGLREVQVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKS 476

Query: 429 DY--GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +Y   IIG       R++FD    ++ ++   C
Sbjct: 477 EYETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 141/334 (42%), Gaps = 60/334 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS-EYDPSSSSSSKN 171
           I IG P +  LV +D GS++LWV C  C  C           D +L   +DPS SS+   
Sbjct: 105 ISIGQPPIPQLVVMDTGSDILWVMCTPCTNC-----------DNHLGLLFDPSMSSTFS- 152

Query: 172 VSCSHPLCKSRSSCK--SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
                PLCK+    K  S  DP P+   Y+   T+S  +  D ++    F      +S  
Sbjct: 153 -----PLCKTPCDFKGCSRCDPIPFTVTYADNSTASGMFGRDTVV----FETTDEGTSRI 203

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----N 284
             V+ GCG    G   D    +G++GL  G    P  LA    I   FS C  +      
Sbjct: 204 PDVLFGCGH-NIGQDTD-PGHNGILGLNNG----PDSLATK--IGQKFSYCIGDLADPYY 255

Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL-----------TQSG 332
           +   +  G+       ST F    E ++  Y+V +E   +G   L            ++G
Sbjct: 256 NYHQLILGEGADLEGYSTPF----EVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTG 311

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYC-YNASSEEMLKVP 389
              ++D+G++ TFL   ++  +  +   L+  S ++ +++ + W  C Y + S +++  P
Sbjct: 312 -GVIIDTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFP 370

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 423
            +   F+      + +  F    N+   VFC+TV
Sbjct: 371 VVTFHFADGADLALDSGSFFNQLND--NVFCMTV 402


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 88/378 (23%), Positives = 156/378 (41%), Gaps = 54/378 (14%)

Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYD 162
           NQ    ++  I +G+P     V +D+GS+++WV CQ C QC       Y   D     +D
Sbjct: 136 NQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQC-------YHQTD---PVFD 185

Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           P+ S+S   V CS  +C+   +       C Y   Y  + + + G L  + L   +F + 
Sbjct: 186 PADSASFMGVPCSSSVCERIENAGCHAGGCRYEVMYG-DGSYTKGTLALETL---TFGR- 240

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICF 281
               +V  +V IGCG +  G ++  A   G+ G  +      SL+ +  G    +FS C 
Sbjct: 241 ----TVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSM------SLVGQLGGQTGGAFSYCL 290

Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC--LTQSGF 333
                + +GS+ FG    A     +++P+     A   Y++ +    +G     +++  F
Sbjct: 291 VSRGTDSAGSLEFGRG--AMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVF 348

Query: 334 Q--------ALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYNAS 381
           Q         ++D+G + T +PT  Y      F      L  +  +S+    +  CYN +
Sbjct: 349 QLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSI----FDTCYNLN 404

Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 441
               ++VP +   F+      +    F  P ++    FC    ++     IIG     G 
Sbjct: 405 GFVSVRVPTVSFYFAGGPILTLPARNFLIPVDD-VGTFCFAFAASPSGLSIIGNIQQEGI 463

Query: 442 RIVFDRENLKLAWSHSKC 459
           +I FD  N  + +  + C
Sbjct: 464 QISFDGANGFVGFGPNVC 481


>gi|452821304|gb|EME28336.1| aspartyl protease isoform 1 [Galdieria sulphuraria]
          Length = 456

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 91/407 (22%), Positives = 170/407 (41%), Gaps = 81/407 (19%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEY 161
           G +    +Y  I +G   +   V +D GS+ L VP  QC  C           DR    Y
Sbjct: 76  GAEIVGGYYFQIQVGGQPI--YVQIDTGSSTLVVPLSQCNTC--------NVPDR----Y 121

Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSL--------------KDPCPYIADYSTEDTSSSG 207
           + ++S++   +SC+ P C + ++C                    C +  +Y  + T+++G
Sbjct: 122 NLANSTTGTVISCNSPTCGA-NTCNQQICSSCSSSQACCSENGICGFFIEYG-DGTTATG 179

Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS----- 262
            L  DI+ +  +S  A  +   +         +T ++L G A  GV+GL    +S     
Sbjct: 180 ALYQDIVTVGEYSVQATFAGADT---------ETANFLVGKAA-GVLGLAYSSLSCNPTC 229

Query: 263 ---VPSLLAKAGLIQNSFSICFDENDSGSVFFGD------QGPATQQSTSFLPIGEKYDA 313
              V   L ++  + N FS+  ++ D G+   G       +GP    S +     + YD 
Sbjct: 230 ISPVFHQLVESFSLPNIFSVLINQ-DIGAFVVGGVNSSLYEGPIEYSSLANEQNPQFYD- 287

Query: 314 YFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-----KRIS 368
             V +ES  + ++ L+   F A+VD+G +       I+  +   F     +        S
Sbjct: 288 --VTIESVQVNSNSLSIPSFNAIVDTGTTLIVASPYIFDALKEYFQTNFCNVPGLCPSSS 345

Query: 369 LQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTV----F 419
             G +W    YC N + EE+ ++PD+    +   +  +   +++F    N  F+     +
Sbjct: 346 NPGVTWFGTDYCVNLTPEELSQLPDIEFSLAGGVTLSLGPEHYMFHVSSNNIFSAASGSY 405

Query: 420 CLTVM--------STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
           CL +         ++DG+  I+G    + + +VFDREN ++ ++  K
Sbjct: 406 CLGIQPSSQNLGPTSDGNEMILGNTLQLKYYLVFDRENKRIGFAKGK 452


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 95/379 (25%), Positives = 153/379 (40%), Gaps = 69/379 (18%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IG P +  L  +D GS+L WV C  C  C+           +++  +DPS SS+  N+SC
Sbjct: 99  IGEPPIPQLAVMDTGSSLTWVMCHPCSSCS----------QQSVPIFDPSKSSTYSNLSC 148

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
           S   C   + C  +   CPY  +Y     SS G    + L L +  +   +     S+I 
Sbjct: 149 SE--C---NKCDVVNGECPYSVEY-VGSGSSQGIYAREQLTLETIDESIIKV---PSLIF 199

Query: 235 GCGRK----QTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQNSFSICFDENDSGSV 289
           GCGRK      G    G   +GV GLG G  S +PS   K       FS C     + + 
Sbjct: 200 GCGRKFSISSNGYPYQGI--NGVFGLGSGRFSLLPSFGKK-------FSYCIGNLRNTNY 250

Query: 290 FF-----GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGF 333
            F     GD+      ST+   I      Y+V +E+  IG   L           T +  
Sbjct: 251 KFNRLVLGDKANMQGDSTTLNVIN---GLYYVNLEAISIGGRKLDIDPTLFERSITDNNS 307

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKYCYNA-SSEEMLKVP 389
             ++DSGA  T+L    +  +  + + L+    +  Q    N +  CY+   S+++   P
Sbjct: 308 GVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFP 367

Query: 390 DMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTD--GD----YGIIGQNFMMGH 441
            +   F++       V +      ENE    FC+ ++  +  GD    +  IG      +
Sbjct: 368 LVTFHFAEGAVLDLDVTSMFIQTTENE----FCMAMLPGNYFGDDYESFSSIGMLAQQNY 423

Query: 442 RIVFDRENLKLAWSHSKCE 460
            + +D   +++ +    CE
Sbjct: 424 NVGYDLNRMRVYFQRIDCE 442


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 92/369 (24%), Positives = 152/369 (41%), Gaps = 50/369 (13%)

Query: 122 SFLVALDAGSNLLWVPCQCIQ-----CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS- 175
           ++   +D G+ L W+ C+  Q     C P     YTS          S S S K VSC+ 
Sbjct: 100 TYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTS----------SQSKSYKPVSCNQ 149

Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
           H  C+     + L   C Y   Y    + +SG L ++      +S H   ++++ S+  G
Sbjct: 150 HSFCEPNQCKEGL---CAYNVTYG-PGSYTSGNLANETFTF--YSNHGKHTALK-SISFG 202

Query: 236 C---GRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVF 290
           C    R    ++L    P  GV+G+G G     S LA+ G I    FS C   N++ + +
Sbjct: 203 CSTDSRNMIYAFLLDKNPVSGVLGMGWGPR---SFLAQLGSISHGKFSYCITANNTHNTY 259

Query: 291 --FGDQGPATQ--QSTSFLPI--GEKYDAYFVGVESYCIG------NSCLTQSGFQA-LV 337
             FG     ++  Q+T  + +     Y    +G+    +       +  + + G +  ++
Sbjct: 260 LRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGCII 319

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSS----KRISLQGNSWKYCYNASSEEMLKVPDMRL 393
           D+G   T L   I+  +       +SS    KR  +       CY   S+   K   +  
Sbjct: 320 DAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNLPVVT 379

Query: 394 IFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
              +N    V+   IF F E EG  VFCL+++S D    IIG    M  + V+D +   L
Sbjct: 380 FHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDDSK-TIIGAYQQMKQKFVYDTKARVL 438

Query: 453 AWSHSKCEE 461
           ++    CE+
Sbjct: 439 SFGPEDCEK 447


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 90/411 (21%), Positives = 159/411 (38%), Gaps = 72/411 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +   +++G+P  S L   D GS+L+WV C+       SA+  T      +++DPS SS+ 
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPT------TQFDPSRSSTY 154

Query: 170 KNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL-ASFSKHAPQS 226
             VSC    C++  R++C    + C Y+  Y  + ++++G L  +        +  +P+ 
Sbjct: 155 GRVSCQTDACEALGRATCDDGSN-CAYLYAYG-DGSNTTGVLSTETFTFDDGGAGRSPRQ 212

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DE 283
                V  GC     GS+          G     VS+ + L  A  +   FS C      
Sbjct: 213 VRIGGVKFGCSTATAGSFPADGLVGLGGGA----VSLVTQLGGATSLGRRFSYCLVPHSV 268

Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG-FQALVDSGAS 342
           N S ++ FG     T+   +  P+               +GN  +  +   + +VDSG +
Sbjct: 269 NASSALNFGALADVTEPGAASTPL---------------VGNKTVASAASSRIIVDSGTT 313

Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK---VPDMRLIFSKNQ 399
            TFL   +   +V +  + ++   +       + CYN +  E+     +PD+ L F    
Sbjct: 314 LTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGA 373

Query: 400 SFVVRNHIFSFPENEGFTV----FCLTVMST---------------------DGDYGIIG 434
           +  ++      PEN    V     CL +++T                     D D G +G
Sbjct: 374 AVALK------PENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAGTVG 427

Query: 435 QNFM---MGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPL 482
              +      RI+ D          S    ++D+    +  PP  QSP+ L
Sbjct: 428 NKTVASAASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPV-QSPDGL 477


>gi|452821303|gb|EME28335.1| aspartyl protease isoform 2 [Galdieria sulphuraria]
          Length = 532

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 93/409 (22%), Positives = 171/409 (41%), Gaps = 85/409 (20%)

Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEY 161
           G +    +Y  I +G   +   V +D GS+ L VP  QC  C           DR    Y
Sbjct: 152 GAEIVGGYYFQIQVGGQPI--YVQIDTGSSTLVVPLSQCNTC--------NVPDR----Y 197

Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSL--------------KDPCPYIADYSTEDTSSSG 207
           + ++S++   +SC+ P C + ++C                    C +  +Y  + T+++G
Sbjct: 198 NLANSTTGTVISCNSPTCGA-NTCNQQICSSCSSSQACCSENGICGFFIEYG-DGTTATG 255

Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS----- 262
            L  DI+ +  +S  A  +   +         +T ++L G A  GV+GL    +S     
Sbjct: 256 ALYQDIVTVGEYSVQATFAGADT---------ETANFLVGKAA-GVLGLAYSSLSCNPTC 305

Query: 263 ---VPSLLAKAGLIQNSFSICFDENDSGSVFFGD------QGPATQQSTSFLPIGEKYDA 313
              V   L ++  + N FS+  ++ D G+   G       +GP    S +     + YD 
Sbjct: 306 ISPVFHQLVESFSLPNIFSVLINQ-DIGAFVVGGVNSSLYEGPIEYSSLANEQNPQFYD- 363

Query: 314 YFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-------LVSSKR 366
             V +ES  + ++ L+   F A+VD+G +       I+  +   F         L  S  
Sbjct: 364 --VTIESVQVNSNSLSIPSFNAIVDTGTTLIVASPYIFDALKEYFQTNFCNVPGLCPSS- 420

Query: 367 ISLQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTV--- 418
            S  G +W    YC N + EE+ ++PD+    +   +  +   +++F    N  F+    
Sbjct: 421 -SNPGVTWFGTDYCVNLTPEELSQLPDIEFSLAGGVTLSLGPEHYMFHVSSNNIFSAASG 479

Query: 419 -FCLTVM--------STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
            +CL +         ++DG+  I+G    + + +VFDREN ++ ++  K
Sbjct: 480 SYCLGIQPSSQNLGPTSDGNEMILGNTLQLKYYLVFDRENKRIGFAKGK 528


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 85/375 (22%), Positives = 147/375 (39%), Gaps = 52/375 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP + +   +D GS+L+W  C  C+ CA          D+    +D   S++ + +
Sbjct: 93  LAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCA----------DQPTPYFDVKKSATYRAL 142

Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
            C    C S SS    K  C Y   Y  +  S++G L ++     +F          +++
Sbjct: 143 PCRSSRCASLSSPSCFKKMCVY-QYYYGDTASTAGVLANETF---TFGAANSTKVRATNI 198

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---V 289
             GCG    G   D A   G++G G G +S+ S L  +      FS C     S +   +
Sbjct: 199 AFGCGSLNAG---DLANSSGMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLSATPSRL 250

Query: 290 FFG---------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF------- 333
           +FG             +  QST F+      + YF+ +++  +G   L            
Sbjct: 251 YFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDD 310

Query: 334 ---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN--ASSEEMLKV 388
                ++DSG S T+L  + Y  V       +    ++        C+         + V
Sbjct: 311 GTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTV 370

Query: 389 PDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           PD+   F S N + +  N++       G+   CL VM+  G   IIG        +++D 
Sbjct: 371 PDLVFHFDSANMTLLPENYML-IASTTGY--LCL-VMAPTGVGTIIGNYQQQNLHLLYDI 426

Query: 448 ENLKLAWSHSKCEEV 462
            N  L++  + C+ +
Sbjct: 427 GNSFLSFVPAPCDII 441


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 87/368 (23%), Positives = 144/368 (39%), Gaps = 58/368 (15%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP    L+A+D  S++ W+PC  C+ C   +A            + P+ S+S KNVSC
Sbjct: 105 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSC 152

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
           S P CK   +       C +   Y +   +++  L  D + LA+    A           
Sbjct: 153 SAPQCKQVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA--------FTF 202

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVF 290
           GC  K  G    G  P     LGLG   +  +     + +++FS C         SGS+ 
Sbjct: 203 GCVNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLR 259

Query: 291 FGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDS 339
            G    P   + T  L    +   Y+V + +  +G   +            +G   + DS
Sbjct: 260 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 319

Query: 340 GASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           G  +T L   +Y  V  +F K V   ++   SL G  +  CY+      +KVP +  +F 
Sbjct: 320 GTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGG--FDTCYSG----QVKVPTITFMFK 373

Query: 397 K-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDRENLK 451
             N +    N +     +   +  CL + +   +      +I       HR++ D  N +
Sbjct: 374 GVNMTMPADNLML---HSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGR 430

Query: 452 LAWSHSKC 459
           L  +  +C
Sbjct: 431 LGLARERC 438


>gi|342675479|gb|AEL31665.1| cathepsin D [Cynoglossus semilaevis]
          Length = 396

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 101/403 (25%), Positives = 163/403 (40%), Gaps = 73/403 (18%)

Query: 79  LQSNNNSSRNQLLFP-SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
           L +  NS +  L FP S+G       N     +Y  I +GTP  +F V  D GS+ LWVP
Sbjct: 44  LLAEKNSLKYNLGFPFSKGPTPETLKNYLDAQYYGDITLGTPPQTFSVVFDTGSSNLWVP 103

Query: 138 CQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYIA 196
              I C+ L                        +++C  H    S  S   +K+   +  
Sbjct: 104 --SIHCSLL------------------------DIACLLHKKYNSAKSSTYVKNGTAFAI 137

Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
            Y +   S SGYL  D   +   +          + + G   KQ G     A  DG++G+
Sbjct: 138 QYGS--GSLSGYLSQDTCSIGGLTVE--------NQLFGEAIKQPGIAFIAAKFDGILGM 187

Query: 257 GLGDVSVPSLL-------AKAGLIQNSFSICFDEN----DSGSVFFGDQGPATQQSTSFL 305
               +SV  +L        +  +  N FS   + N      G +  G   P T  +  F 
Sbjct: 188 AYPRISVDGVLPVFDNIMQQKKVESNVFSFYLNRNPDTAPGGELLLGGTDP-TYYTGEFN 246

Query: 306 PIGEKYDAYF-VGVESYCIGNS-CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
            +     AY+ V ++   +G+   L + G QA+VD+G S    P+   AEV     K + 
Sbjct: 247 YVNVTRQAYWQVSMDELAVGSQLTLCKGGCQAIVDTGTSLLTGPS---AEVKA-LQKAIG 302

Query: 364 SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK--NQSFVVRNHIFSFPENEGFTVFCL 421
           +  + +QG   +Y  N       K+P + +I  K   QS+ +    +   E++     CL
Sbjct: 303 AIPL-IQG---EYMVNCD-----KIPSLPVITFKMGGQSYSLTGEQYILKESQAGKTICL 353

Query: 422 T-VMSTD-----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
           +  M+ D     G   I+G  F+  +  VFDR+N ++ ++ SK
Sbjct: 354 SGFMALDIPAPAGPLWILGDVFIGQYYTVFDRDNNRVGFAKSK 396


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 88/364 (24%), Positives = 145/364 (39%), Gaps = 45/364 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           +  G+P  +     D GS+L W     IQC P S   Y   D     +DP+ SSS   V 
Sbjct: 116 VGFGSPAQTSATMFDTGSDLSW-----IQCQPCSGHCYKQHD---PVFDPAKSSSYAVVP 167

Query: 174 CSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           C    C +    C      C Y  +Y  + +S++G L  + L  +S       SS  +  
Sbjct: 168 CGTTECAAAGGECNGTT--CVYGVEYG-DGSSTTGVLARETLTFSS-------SSEFTGF 217

Query: 233 IIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GS 288
           I GCG    G +  +DG    G   L L   + P+     G+    FS C    ++  G 
Sbjct: 218 IFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAF---GGI----FSYCLPSYNTTPGY 270

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-------TQSGFQALVD 338
           +  G      Q    +  +  K D    YF+ + S  IG   L       T++G   L+D
Sbjct: 271 LSIGATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKTG--TLLD 328

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           SG   T+LP   Y  +  +F   +   + +   +    CY+ + +  + +P +   FS  
Sbjct: 329 SGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDG 388

Query: 399 QSFVVRNH-IFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFDRENLKLAWS 455
             F +    I +FP++    V CL  +S   D  + ++G        +++D    K+ + 
Sbjct: 389 AVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFI 448

Query: 456 HSKC 459
            + C
Sbjct: 449 PASC 452


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 102/433 (23%), Positives = 159/433 (36%), Gaps = 70/433 (16%)

Query: 66  LSNDWKRQKTR---VKLQSNNNSSRNQLLFPSEGSQTH---FFGNQFYWLHYT-WIDIGT 118
           L+   +R + R   +  ++    +    L  + G  T    F G+    L Y   + IGT
Sbjct: 40  LAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGT 99

Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
           P V   V +D GS+L WV     QC P  A    +    L  +DPSSSSS  +V C    
Sbjct: 100 PAVQQTVLIDTGSDLSWV-----QCKPCGAGECYAQKDPL--FDPSSSSSYASVPCDSDA 152

Query: 179 CKSRSS------CKSLKDP----CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           C+  ++      C  +       C Y  +Y    T++  Y  + +             ++
Sbjct: 153 CRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL-------------TL 199

Query: 229 QSSVII-----GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
           +  V++     GCG  Q G Y      DG++GLG    S+ S  +        FS C   
Sbjct: 200 KPGVVVADFGFGCGDHQHGPY---EKFDGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPP 254

Query: 284 NDSGSVFFGDQGPATQQST------SFLPIGEKYDA---YFVGVESYCIGNSCLT--QSG 332
              G+ F     P    S+      SF P+         Y V +    +G + L    S 
Sbjct: 255 TSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSA 314

Query: 333 FQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKV 388
           F +  ++DSG   T LP   YA +   F   +S  R+     G     CY+ +    + V
Sbjct: 315 FSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTV 374

Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFD 446
           P + L FS   +  +        +       CL      TD   GIIG        +++D
Sbjct: 375 PTISLTFSGGATIDLAAPAGVLVDG------CLAFAGAGTDNAIGIIGNVNQRTFEVLYD 428

Query: 447 RENLKLAWSHSKC 459
                + +    C
Sbjct: 429 SGKGTVGFRAGAC 441


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 90/395 (22%), Positives = 161/395 (40%), Gaps = 53/395 (13%)

Query: 101 FFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ----------CAPLSASY 150
           F+G+ F +L    +++GTP V FL   D GS+L+W+ C   Q              ++S 
Sbjct: 76  FYGD-FEYL--AAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSP 132

Query: 151 YTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSG 207
                  +  ++P  SSS   V C  P C    + +SC      C +   Y  +  S++G
Sbjct: 133 PPPPPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYR-DGASATG 191

Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
            L  D         +   S+  +S+  GC     G        DG++GLG G +S+ S L
Sbjct: 192 LLAADTFTFGGNINNDTTST--ASIDFGCATGTAGREFQA---DGMVGLGAGPLSLASQL 246

Query: 268 AKAGLIQNSFSIC---FDENDSGSVF-FGDQGPATQQSTSFLP-IGEKYDA---YFVGVE 319
            +       FS C   +D +D+ S+  FG +   +    +  P I    +A   Y + ++
Sbjct: 247 GR------KFSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISID 300

Query: 320 SYCIGNSCL--TQSGFQALVDSGASFTFLP-TEIYAEVVVKFDKLVSSK---RISLQGNS 373
           S  +    +  T S  + +VD+G   TFL    + A +     +++      R      +
Sbjct: 301 SLKVAGQPVPGTTSVSKVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPDET 360

Query: 374 WKYCYNASSEEMLK--VPDMRLIFSKNQSFVVR---NHIFSFPENEGFTVFCLTVMSTDG 428
            + CY+ S  + +   +PD+ L+        VR      F   + EG  V CL V++T  
Sbjct: 361 LELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVK-EG--VLCLAVVTTSP 417

Query: 429 D---YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           +     ++G   +    +  D +     ++ + C+
Sbjct: 418 ELQPLSVLGNVALQDLHVGIDLDARTATFATANCD 452


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 89/386 (23%), Positives = 147/386 (38%), Gaps = 49/386 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  I +G+P  + L+  D GS+L WV C  C     +     T L R+ + + P+   S
Sbjct: 83  YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCFS 142

Query: 169 SKNVSCSHPLCKSRSSCK--SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S       P   + + C    L   C Y   YS + + +SG+   +   L + S    + 
Sbjct: 143 SLCQLVPQP---NPNPCNHTRLHSTCRYEYVYS-DGSKTSGFFSKETTTLNTSSGREMK- 197

Query: 227 SVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC--- 280
               S+  GCG   +G  L G++     GVMGLG G +S  S L +      SFS C   
Sbjct: 198 --LKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRR--FGRSFSYCLLD 253

Query: 281 --FDENDSGSVFFGDQGPATQQS---TSFLPI---GEKYDAYFVGVESYCIGNSCL---- 328
                  +  +  GD     + +    SF P+    E    Y++ ++   +    L    
Sbjct: 254 YTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDP 313

Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK----YCY 378
                        ++DSG + TFL    Y E++  F + V     +  G S +     C 
Sbjct: 314 SVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCV 373

Query: 379 NASSEEMLKVPDMRLIFSKNQSF--VVRNHIFSFPENEGFTVFCLTVMSTD---GDYGII 433
           N +     + P + L       +    RN+     E     + CL +   +   G + +I
Sbjct: 374 NVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEG----IKCLAIQPVEAESGRFSVI 429

Query: 434 GQNFMMGHRIVFDRENLKLAWSHSKC 459
           G     G  + FDR   +L +S   C
Sbjct: 430 GNLMQQGFLLEFDRGKSRLGFSRRGC 455


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 83/363 (22%), Positives = 136/363 (37%), Gaps = 53/363 (14%)

Query: 117 GTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           GTP  + L+ALD  S+  W+PC  C+ C+                + P  S+S +NVSC 
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGCS------------TSKPFAPIKSTSFRNVSCG 151

Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
            P CK   +       C +   Y +   ++S  +V D L LA        +        G
Sbjct: 152 SPHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLA--------TDPIPGYTFG 201

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVFF 291
           C  K TGS    +AP   +             ++  L +++FS C       N SGS+  
Sbjct: 202 CVNKTTGS----SAPQQGLLGLGRGPLSLLSQSQ-NLYKSTFSYCLPSFKSINFSGSLRL 256

Query: 292 GD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSG 340
           G    P   + T  L    +   Y+V + +  +G   +            +G   + DSG
Sbjct: 257 GPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSG 316

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
             FT L   +Y  V  +F + V  K        +  CYN      + VP +  +FS    
Sbjct: 317 TVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVP----IVVPTITFLFSGMNV 372

Query: 401 FVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDRENLKLAWSH 456
            +  ++I     +   +  CL +     +      +I       HR++FD  N ++  + 
Sbjct: 373 TLPPDNIVI--HSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIAR 430

Query: 457 SKC 459
             C
Sbjct: 431 ELC 433


>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
 gi|223942623|gb|ACN25395.1| unknown [Zea mays]
          Length = 378

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 92/392 (23%), Positives = 151/392 (38%), Gaps = 71/392 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++    +GTP   F++  D GS+L WV C+     P S       D    E+  S S S 
Sbjct: 14  YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPAS-------DPPAREFRASESRSW 66

Query: 170 KNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA------ 217
             ++CS   C S      ++C S   PC Y  DY  +D S++ G +  D   +A      
Sbjct: 67  APLACSSDTCTSYVPFSLANCSSPASPCAY--DYRYKDGSAARGVVGTDAATIALSGSGS 124

Query: 218 -SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
              S    + +    V++GC     G     +  DGV+ LG  ++S  S    A      
Sbjct: 125 EDGSGGGGRRAKLQGVVLGCTATYDGQSFQSS--DGVLSLGNSNISFASR--AAARFGGR 180

Query: 277 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPI--------------------GEKY 311
           FS C        N S  + FG          +  P+                    GE  
Sbjct: 181 FSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEAL 240

Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQ 370
           D   +  + + +G       G  A++DSG S T L T  Y  VV     +L +  R+++ 
Sbjct: 241 D---IPADVWDVGR------GGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAM- 290

Query: 371 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
            + ++YCYN ++    ++P + + F+ +         +      G  V C+ V   +G +
Sbjct: 291 -DPFEYCYNWTAGAP-EIPKLEVSFAGSARLEPPAKSYVIDAAPG--VKCIGVQ--EGAW 344

Query: 431 ---GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               +IG      H   FD  +  L + H++C
Sbjct: 345 PGVSVIGNILQQEHLWEFDLRDRWLRFKHTRC 376


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 83/363 (22%), Positives = 136/363 (37%), Gaps = 53/363 (14%)

Query: 117 GTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           GTP  + L+ALD  S+  W+PC  C+ C+                + P  S+S +NVSC 
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGCS------------TSKPFAPIKSTSFRNVSCG 151

Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
            P CK   +       C +   Y +   ++S  +V D L LA        +        G
Sbjct: 152 SPHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLA--------ADPIPGYTFG 201

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVFF 291
           C  K TGS    +AP   +             ++  L +++FS C       N SGS+  
Sbjct: 202 CVNKTTGS----SAPQQGLLGLGRGPLSLLSQSQ-NLYKSTFSYCLPSFKSINFSGSLRL 256

Query: 292 GD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSG 340
           G    P   + T  L    +   Y+V + +  +G   +            +G   + DSG
Sbjct: 257 GPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSG 316

Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
             FT L   +Y  V  +F + V  K        +  CYN      + VP +  +FS    
Sbjct: 317 TVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVP----IVVPTITFLFSGMNV 372

Query: 401 FVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDRENLKLAWSH 456
            +  ++I     +   +  CL +     +      +I       HR++FD  N ++  + 
Sbjct: 373 ALPPDNIVI--HSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIAR 430

Query: 457 SKC 459
             C
Sbjct: 431 ELC 433


>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
          Length = 469

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 92/392 (23%), Positives = 151/392 (38%), Gaps = 71/392 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++    +GTP   F++  D GS+L WV C+     P S       D    E+  S S S 
Sbjct: 105 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPAS-------DPPAREFRASESRSW 157

Query: 170 KNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA------ 217
             ++CS   C S      ++C S   PC Y  DY  +D S++ G +  D   +A      
Sbjct: 158 APLACSSDTCTSYVPFSLANCSSPASPCAY--DYRYKDGSAARGVVGTDAATIALSGSGS 215

Query: 218 -SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
              S    + +    V++GC     G     +  DGV+ LG  ++S  S    A      
Sbjct: 216 EDGSGGGGRRAKLQGVVLGCTATYDGQSFQSS--DGVLSLGNSNISFASR--AAARFGGR 271

Query: 277 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPI--------------------GEKY 311
           FS C        N S  + FG          +  P+                    GE  
Sbjct: 272 FSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEAL 331

Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQ 370
           D   +  + + +G       G  A++DSG S T L T  Y  VV     +L +  R+++ 
Sbjct: 332 D---IPADVWDVGR------GGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAM- 381

Query: 371 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
            + ++YCYN ++    ++P + + F+ +         +      G  V C+ V   +G +
Sbjct: 382 -DPFEYCYNWTAGAP-EIPKLEVSFAGSARLEPPAKSYVIDAAPG--VKCIGVQ--EGAW 435

Query: 431 ---GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               +IG      H   FD  +  L + H++C
Sbjct: 436 PGVSVIGNILQQEHLWEFDLRDRWLRFKHTRC 467


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 82/363 (22%), Positives = 137/363 (37%), Gaps = 73/363 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I IGTP        D GS+L+W      QC P  + Y     +    +DPS S+S K VS
Sbjct: 28  ISIGTPPFDVYGIYDTGSDLMWT-----QCLPCLSCY----KQKNPMFDPSKSTSFKEVS 78

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C    C+                     DT +S      IL+                ++
Sbjct: 79  CESQQCR-------------------LLDTPTS------ILN----------------IV 97

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGS 288
            GCG   +G++ +     G+ G G   +S+ S +         FS C      D + +  
Sbjct: 98  FGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSK 155

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGN--------SCLTQSGFQALVD 338
           + FG +   +       P+  K D   YFV ++   +G+        S +   G    +D
Sbjct: 156 IIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKG-NVFID 214

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           +G   T LP + Y  +V    + +  + +       + CY +++  ++  P +   F   
Sbjct: 215 AGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSAT--LIDGPILTAHFDGA 272

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
              +   + F  P+ EG  V+C  +   DGD GI G    M   I FD +  K+++    
Sbjct: 273 DVQLKPLNTFISPK-EG--VYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVD 329

Query: 459 CEE 461
           C +
Sbjct: 330 CTK 332


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 143/356 (40%), Gaps = 23/356 (6%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           IGTP V  L   D GS+L+WV C  C  C P S   +  L    S + P++  S     C
Sbjct: 96  IGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKS--STFMPTTCRSQP---C 150

Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
           +  L + +   KS +  C Y   Y  + + S G L  + L     S+   Q+    +   
Sbjct: 151 TLLLPEQKGCGKSGE--CIYTYKYGDQYSFSEGLLSTETLRFD--SQGGVQTVAFPNSFF 206

Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFF 291
           GCG     +        G+MGLG G +S+ S +     I + FS C        +  + F
Sbjct: 207 GCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSKLKF 264

Query: 292 GDQGPATQQSTSFLPIGEK---YDAYFVGVESYCIGNSCLTQ--SGFQALVDSGASFTFL 346
           G++   T +     P+  K      YF+ +E+  +    +    +    ++DSG   T+L
Sbjct: 265 GNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTLLTYL 324

Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH 406
               Y        + ++ + +    +   +C+     +    P++   F+  +  +   +
Sbjct: 325 GESFYYNFAASLQESLAVELVQDVLSPLPFCF--PYRDNFVFPEIAFQFTGARVSLKPAN 382

Query: 407 IFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           +F   E+   TV  +   S+     I G    +  ++ +D E  K+++  + C +V
Sbjct: 383 LFVMTEDRN-TVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQPTDCSKV 437


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 97/427 (22%), Positives = 168/427 (39%), Gaps = 89/427 (20%)

Query: 114 IDIGTPNVSFLVAL--DAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           + +G P+ +  V+L  D GS+L+W PC    C  L     T    + S   P   S  + 
Sbjct: 92  LSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCM-LCEGKATPGGNHSSPLPPPIDS--RR 148

Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA-PQ---SS 227
           +SC+ PLC +  S     D C            ++     D +   S + HA P    + 
Sbjct: 149 ISCASPLCSAAHSSAPTSDLC------------AAARCPLDAIETDSCASHACPPLYYAY 196

Query: 228 VQSSVIIGCGRKQTG--------------SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
              S++    R + G              ++   A P GV G G G +S+P+ LA +  +
Sbjct: 197 GDGSLVANLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPS--L 254

Query: 274 QNSFSICFDEND--------SGSVFFG---DQGPATQQSTSFL--PI--GEKYDAYF-VG 317
              FS C   +         S  +  G   D        T F+  P+    K+  ++ V 
Sbjct: 255 SGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVA 314

Query: 318 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 367
           +E+  +G   +                 +VDSG +FT LP++ +A V  +F + +++ R 
Sbjct: 315 LEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARF 374

Query: 368 SLQGNS-----WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFC 420
           +    +        CY+ S  +   VP + L F  N +  +  RN+   F   EG +V C
Sbjct: 375 TRAEGAEAQTGLAPCYHYSPSDR-AVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGC 433

Query: 421 LTVMSTDGD----------YGIIGQNFMMGHRIVFDRENLKLAWSHSKC--------EEV 462
           L +M+  G+           G +G     G  +V+D +  ++ ++  +C          +
Sbjct: 434 LMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDLWDTLSRRI 493

Query: 463 IDKSHVH 469
           ID+  VH
Sbjct: 494 IDQPLVH 500


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 93/420 (22%), Positives = 160/420 (38%), Gaps = 56/420 (13%)

Query: 66  LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
           L   + R  +R      N  S N +  P   +   +  N         I +GTP VS   
Sbjct: 60  LQKAFHRSISRANHFRANGVSTNSIQSPVISNNGEYLMN---------ISLGTPPVSMHG 110

Query: 126 ALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSC 185
             D GS+LLW      QC P  + Y    ++    +DP+ S + + +SC    C +    
Sbjct: 111 IADTGSDLLWR-----QCKPCDSCY----EQIEPIFDPAKSKTYQILSCEGKSCSNLGGQ 161

Query: 186 KSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
               D    I  YS  D S +SG L  D L + S +   P S  +  V+ GCG    G++
Sbjct: 162 GGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGS-TTGRPVSVPK--VVFGCGHNNGGTF 218

Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICF-----DENDSGSVFFGDQGPAT 298
               +    +          S++++   LI   FS C      D + S  + FG +G  +
Sbjct: 219 ELHGSGLVGL-----GGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFGSRGIVS 273

Query: 299 QQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGF-------------QALVDSGASF 343
                  P+  +     Y++ +ES  +G+  L   GF               ++DSG + 
Sbjct: 274 GAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNIIIDSGTTL 333

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF-SKNQSFV 402
           T LP + Y  +       +  K +    N +  CY  S+   L++P +   F   +    
Sbjct: 334 TLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCY--SNLSGLRIPTITAHFVGADLELK 391

Query: 403 VRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
             N      E+    +FC  ++    D  I G    M   + +D ++  +++  + C ++
Sbjct: 392 PLNTFVQVQED----LFCFAMIPVS-DLAIFGNLAQMNFLVGYDLKSRTVSFKPTDCTKI 446


>gi|281210961|gb|EFA85127.1| hypothetical protein PPL_02125 [Polysphondylium pallidum PN500]
          Length = 601

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 84/366 (22%), Positives = 162/366 (44%), Gaps = 43/366 (11%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS---SSSSK 170
           I +G P+ SF V LD GS  L +P   + C       Y++++R      P+    SS ++
Sbjct: 83  ILVGNPSQSFRVMLDTGSATLNIPS--VDCF-----LYSAVNR------PTKCRCSSKAE 129

Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           +         S+  C++  D C Y   +  +    S  LV+D + +  +S  +   +V  
Sbjct: 130 STYKREYSPNSKVHCRT--DNCIYSEQFVDKSFLMSQ-LVEDTVRIGGYSIDSIFGNVNK 186

Query: 231 SVIIGCGRKQTGSYLDGAAP---DGVMGLG---LGDVSVPSLLAKAGL---IQNSFSICF 281
            +++    K+  +  D   P   DG+ GL    + D +   +L +  L   + NSFS+CF
Sbjct: 187 ILLLAFQYKECPA-PDVYTPRSFDGIFGLSTKVIDDTAGEDILTQISLKYNLSNSFSLCF 245

Query: 282 DENDSGSVF-FGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDS 339
            E+  G  F  G   P    +   ++P+ + Y  Y + +    IG   L  + + A +DS
Sbjct: 246 GESGYGGQFKIGGYDPELIVEPMRYIPVAKPY-TYNLTISQVHIGQYKLEHTTYNAWIDS 304

Query: 340 GASFTFLPTEIYAEVV-VKFDK--LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           G++   +PT +Y  ++   ++K  L   +  +    S+  C     +++   P   + F 
Sbjct: 305 GSASIVIPTPLYNNMINTMYEKFPLAGFQDGAFWNTSFP-CAFIDEKDIPNYPKFNISFV 363

Query: 397 KNQSFVVRNHIFSFPEN-----EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
                +   H+   P+N     E    + L + + D +Y IIG   ++G+ I FD++N +
Sbjct: 364 DTDGEIF--HLSVLPQNYLVYNEEEKCYELLLRTVDNNYFIIGDLGLIGYNIHFDKQNQR 421

Query: 452 LAWSHS 457
           + ++ +
Sbjct: 422 IGFAKA 427


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 91/407 (22%), Positives = 154/407 (37%), Gaps = 89/407 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEYDPSSSSS 168
           +++GTP  +    LD GS+L+W PC     C  C       + ++D   +  + P +SS+
Sbjct: 96  LNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCN------FPNIDTTKIPTFIPKNSST 149

Query: 169 SKNVSCSHPLC-------------KSRSSCKSLKDPCP-YIADYSTEDTSSSGYLVDDIL 214
           +K + C +P C             + +   ++    CP YI  Y     S++G+L+ D L
Sbjct: 150 AKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLG--STAGFLLLDNL 207

Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
           +     K  PQ       ++GC      S L    P G+ G G G  S+PS +       
Sbjct: 208 NFP--GKTVPQ------FLVGC------SILSIRQPSGIAGFGRGQESLPSQMN-----L 248

Query: 275 NSFSIC-----FDENDSGS---VFFGDQGPATQQSTSFLPIGEK--------YDAYFVGV 318
             FS C     FD+    S   +     G       S+ P             + Y++ +
Sbjct: 249 KRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTL 308

Query: 319 ESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSS 364
               +G   +          +      +VDSG++FTF+   +Y     E V + +K  S 
Sbjct: 309 RKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSR 368

Query: 365 KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLT 422
              +   +    C+N S  + +  P++   F         ++N+     + E   V CLT
Sbjct: 369 AEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAE---VVCLT 425

Query: 423 VMS--------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 461
           V+S        T G   I+G        I +D EN +  +    C  
Sbjct: 426 VVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSCRR 472


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 89/366 (24%), Positives = 150/366 (40%), Gaps = 56/366 (15%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
           +GTP    L+A+D  ++  W+PC  C  C               + ++P++S S + V C
Sbjct: 114 LGTPPQQLLLAVDTSNDAAWIPCSGCAGCP------------TTTPFNPAASKSYRAVPC 161

Query: 175 SHPLCKSRS---SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
             P C SR+   SC      C +   Y+  D+S    L  D L +A        + V  S
Sbjct: 162 GSPAC-SRAPNPSCSLNTKSCGFSLTYA--DSSLEAALSQDSLAVA--------NDVVKS 210

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSG 287
              GC +K TG+      P G++GLG G +S   L     + + +FS C       N SG
Sbjct: 211 YTFGCLQKATGT---ATPPQGLLGLGRGPLSF--LSQTKDMYEGTFSYCLPSFKSLNFSG 265

Query: 288 SVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
           ++  G +G P   ++T  L    +   Y+V +    +G   +            +G   +
Sbjct: 266 TLRLGRKGQPLRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTV 325

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
           +DSG  FT L    Y  V  +  + +    +S  G  +  CYN +    +K P +  +F+
Sbjct: 326 LDSGTMFTRLVAPAYVAVRDEVRRRIRGAPLSSLGG-FDTCYNTT----VKWPPVTFMFT 380

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNFMMGHRIVFDRENLKLA 453
             Q  +  +++       G T       + DG      +I       HRI+FD  N ++ 
Sbjct: 381 GMQVTLPADNLV-IHSTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVG 439

Query: 454 WSHSKC 459
           ++  +C
Sbjct: 440 FAREQC 445


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 84/379 (22%), Positives = 146/379 (38%), Gaps = 72/379 (18%)

Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           +GTP     + L+ G+ L+W                       +  +PS     +     
Sbjct: 1   MGTPPNPVKLKLENGNELIW-----------------------NHSNPSPECFEQAFPYF 37

Query: 176 HPLCKSR----SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
            PL  SR    +SC S K      C Y   Y  + + ++G+L  D           P   
Sbjct: 38  EPLTFSRGLPFASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDKFTFVGAGASVP--- 93

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---- 283
               V  GCG    G +       G+ G G G +S+PS L K G    +FS CF      
Sbjct: 94  ---GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTITGA 143

Query: 284 -------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIGNS------- 326
                  +    +F   QG    Q+T  +   +       Y++ ++   +G++       
Sbjct: 144 IPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPES 201

Query: 327 --CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
              LT      ++DSG S T LP ++Y  V  +F   +    +         C++A S+ 
Sbjct: 202 AFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQA 261

Query: 385 MLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 443
              VP + L F      + R N++F  P++ G ++ CL +   D +  IIG        +
Sbjct: 262 KPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIGNFQQQNMHV 320

Query: 444 VFDRENLKLAWSHSKCEEV 462
           ++D +N  L++  ++C+++
Sbjct: 321 LYDLQNNMLSFVAAQCDKL 339


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 82/365 (22%), Positives = 148/365 (40%), Gaps = 44/365 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I IG P V  L+ +D GS+L W+ C   +C P +  +          + PS SS+ +N S
Sbjct: 92  ISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYPQTIPF----------FHPSRSSTYRNAS 141

Query: 174 C-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           C S P    +         C Y   Y  + +++ G L  + L   +F         + ++
Sbjct: 142 CESAPHAMPQIFRDEKTGNCRYHLRYR-DFSNTRGILAKEKL---TFQTSDEGLISKPNI 197

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSGS 288
           + GCG+  +G         GV+GLG G  S+  +    G   + FS CF    D     +
Sbjct: 198 VFGCGQDNSGF----TQYSGVLGLGPGTFSI--VTRNFG---SKFSYCFGSLIDPTYPHN 248

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT---------QSGFQALVDS 339
                 G   +   + L I +  D Y++ +++  +G   L          +S    ++D+
Sbjct: 249 FLILGNGARIEGDPTPLQIFQ--DRYYLDLQAISLGEKLLDIEPGIFQRYRSKGGTVIDT 306

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSE-EMLKVPDMRLIFS 396
           G S T L  E Y  +  + D L+    +R+        +CY  + + ++   P +   F+
Sbjct: 307 GCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFA 366

Query: 397 KNQSFVVRNHIFSFPENEGFTVFCLTV-MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 455
                 +      F  +E    FCL + M+T  D  +IG      + + ++   +K+ + 
Sbjct: 367 GGAELALDVESL-FVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQ 425

Query: 456 HSKCE 460
            + CE
Sbjct: 426 RTDCE 430


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 88/372 (23%), Positives = 152/372 (40%), Gaps = 56/372 (15%)

Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
           Y ++   + +GTP    +  +D GS+++W   QC+ C P   S +  +      +DPS S
Sbjct: 418 YSIYLMKLQVGTPPFEIVAEIDTGSDIIWT--QCMPC-PNCYSQFAPI------FDPSKS 468

Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           S+ +   C+              + C Y   Y+ + T S G L  + + + S S    + 
Sbjct: 469 STFREQRCN-------------GNSCHYEIIYA-DKTYSKGILATETVTIPSTSG---EP 511

Query: 227 SVQSSVIIGCGRKQTGSYLDGAA--PDGVMGLGLGDVSVPSL--LAKAGLIQNSFSICFD 282
            V +   IGCG   T     G A    G++GL +G +S+ S   L   GLI    S CF 
Sbjct: 512 FVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLI----SYCFS 567

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSG--FQA--- 335
              +  + FG         T    +  K D   Y++ +++  + ++ +   G  F A   
Sbjct: 568 GQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAEDG 627

Query: 336 --LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
              +DSG + T+ P      V    +++V++ ++   G+    CY + + ++  V  M  
Sbjct: 628 NIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTIDIFPVITMH- 686

Query: 394 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQ-NFMMGHR-----I 443
            FS     V+  +        G  +FCL +   D      +G   Q NF++G+      I
Sbjct: 687 -FSGGADLVLDKYNMYLETITG-GIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVI 744

Query: 444 VFDRENLKLAWS 455
            F   N    WS
Sbjct: 745 SFSPTNCSALWS 756



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 104/425 (24%), Positives = 179/425 (42%), Gaps = 65/425 (15%)

Query: 77  VKLQSNNNS---SRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNL 133
           ++ +SN++S   S+NQL   S  + T F     Y ++   + +GTP       +D GS+L
Sbjct: 50  IQRRSNSSSFRLSKNQLQGASPYADTLFD----YNIYLMKLQVGTPPFEIAAEIDTGSDL 105

Query: 134 LWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPC 192
           +W  C  C  C       Y+  D     +DPS SS+           + R   KS    C
Sbjct: 106 IWTQCMPCPDC-------YSQFD---PIFDPSKSSTFN---------EQRCHGKS----C 142

Query: 193 PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA--P 250
            Y   Y  ++T S G L  + + + S S    +  V +   IGCG   T     G A   
Sbjct: 143 HYEIIYE-DNTYSKGILATETVTIHSTSG---EPFVMAETTIGCGLHNTDLDNSGFASSS 198

Query: 251 DGVMGLGLGDVSVPSL--LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIG 308
            G++GL +G  S+ S   L   GLI    S CF    +  + FG         T    + 
Sbjct: 199 SGIVGLNMGPRSLISQMDLPYPGLI----SYCFSGQGTSKINFGTNAIVAGDGTVAADMF 254

Query: 309 EKYDA--YFVGVESYCIGNSCLTQSG--FQA-----LVDSGASFTFLPTEIYAEVVVKFD 359
            K D   Y++ +++  + ++ +   G  F A     ++DSG++ T+ P      V    +
Sbjct: 255 IKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNIVIDSGSTVTYFPVSYCNLVRKAVE 314

Query: 360 KLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF 419
           ++V++ R+     +   CY + + ++  V  M   FS     V+  +      N G  +F
Sbjct: 315 QVVTAVRVPDPSGNDMLCYFSETIDIFPVITMH--FSGGADLVLDKYNMYMESNSG-GLF 371

Query: 420 CLTVM----STDGDYGIIGQ-NFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 474
           CL ++    + +  +G   Q NF++G    +D  +L L  +    + + D S ++L+   
Sbjct: 372 CLAIICNSPTQEAIFGNRAQNNFLVG----YDSSSLLLQGASPYADTLYDYS-IYLMKLQ 426

Query: 475 AGQSP 479
            G  P
Sbjct: 427 VGTPP 431


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 99/392 (25%), Positives = 161/392 (41%), Gaps = 67/392 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++    +GTP   F++  D GS+L WV C        S +   + D     +  ++S S 
Sbjct: 112 YFVRFRVGTPAQPFVLVADTGSDLTWVKC--------SGAGDGTGDAPRRVFRAAASRSW 163

Query: 170 KNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA-----S 218
             ++CS   C S      ++C S   PC Y  DY   D S++ G +  D   +A     S
Sbjct: 164 APIACSSDTCTSYVPFSLANCSSPASPCAY--DYRYNDGSAARGVVGTDSATIALSGSES 221

Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDG---AAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
                 ++ +Q  V++GC    T SY DG    + DGV+ LG  ++S  S    A     
Sbjct: 222 RDGGGRRAKLQ-GVVLGC----TASY-DGQSFQSSDGVLSLGNSNISFASR--AAARFGG 273

Query: 276 SFSICF-----DENDSGSVFFGDQGP-----------ATQQSTSFLPIGEKYDAYFVGVE 319
            FS C        N +  + FG  GP           +    T  L        Y V V+
Sbjct: 274 RFSYCLVDHLAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVD 333

Query: 320 SYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQ 370
           +  +    L           G  A++DSG S T L T  Y  VV    ++L    R+S+ 
Sbjct: 334 AVHVAGEALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVSM- 392

Query: 371 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 430
            + ++YCYN ++   L++P + + F+   S  ++    S+  +    V C+ V   +G +
Sbjct: 393 -DPFEYCYNWTA-AALEIPGLEVRFAG--SARLQPPAKSYVVDAAPGVKCIGVQ--EGAW 446

Query: 431 ---GIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               +IG      H   FD  +  L + H++C
Sbjct: 447 PGVSVIGNILQQDHLWEFDLRDRWLRFKHTRC 478


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 93/409 (22%), Positives = 149/409 (36%), Gaps = 72/409 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ---------CIQCAPLSASYYTSLDRNLSE 160
           ++    +GTP   FL+  D GS+L WV C              + L A    S  R    
Sbjct: 87  YFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT--- 143

Query: 161 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 215
           + P  S +   + CS   C+     S ++C +  +PC Y  DY  +D S++   V     
Sbjct: 144 FRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAY--DYRYKDGSAARGTVGVDSA 201

Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
             + S  A + +    V++GC     G S+L   A DGV+ LG  ++S  S  A      
Sbjct: 202 TIALSGRAARKAKLRGVVLGCTTSYNGQSFL---ASDGVLSLGYSNISFASRAAS--RFG 256

Query: 275 NSFSICF-----DENDSGSVFFG----------DQGPAT----------------QQSTS 303
             FS C        N +  + FG           +G A+                 + T 
Sbjct: 257 GRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTP 316

Query: 304 FLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVV 355
            +        Y V V+   +    L         + G  A++DSG S T L    Y  VV
Sbjct: 317 LVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVV 376

Query: 356 VKFDK-LVSSKRISLQGNSWKYCYN----ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
               K L    R+++  + + YCYN    + S+    +P + + F+ +         +  
Sbjct: 377 AALSKRLAGLPRVTM--DPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVI 434

Query: 411 PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               G     L      G   +IG      H   +D +N +L +  S+C
Sbjct: 435 DAAPGVKCIGLQEGPWPG-LSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 102/408 (25%), Positives = 159/408 (38%), Gaps = 70/408 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC---QCIQC--APLSASYYTSLDRNLS---EYDPSS 165
            ++G+ +    + +D GS+L+W PC   +CI C   P   S    +  N S        S
Sbjct: 80  FNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSCSAAACS 139

Query: 166 SSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
           ++   ++S SH    SR        S C S   P  Y   Y+  D S    L  D L L 
Sbjct: 140 AAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFY---YAYGDGSLVARLYRDSLSLP 196

Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNS 276
           + +   P +    +   GC     G       P GV G G G +S+PS LA  +  + N 
Sbjct: 197 TPAPSPPINV--RNFTFGCAHTTLGE------PVGVAGFGRGVLSMPSQLATFSPQLGNR 248

Query: 277 FSICFDENDSGS--------VFFGD--QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 326
           FS C   +   +        +  G    G      TS L   +    Y VG+    +GN 
Sbjct: 249 FSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGISVGNI 308

Query: 327 CLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFD----KLVSSKRISLQGN 372
            +    F            +VDSG +FT LP  +Y  VV +F+    K+ +  R   +  
Sbjct: 309 RIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENT 368

Query: 373 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPE-NEGFT-----VFCLTV 423
               CY    E  + VP + L F   +S VV   +N+ + F +  +G       V CL +
Sbjct: 369 GLSPCY--YYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLML 426

Query: 424 MS-------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 464
           M+         G    +G     G  +V+D E  ++ ++  +C  + D
Sbjct: 427 MNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD 474


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 78/350 (22%), Positives = 139/350 (39%), Gaps = 39/350 (11%)

Query: 124 LVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
            + +D GS++ W+ C  C QC       Y   D   S + P+ S++ K + C+  +C+  
Sbjct: 2   FLLIDTGSDITWIQCDPCPQC-------YKQQD---SLFQPAGSATYKPLPCNSTMCQQL 51

Query: 183 SSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV---IIGCG 237
            S     L   C Y+  Y  + T+   + + + L L S        ++  SV     GCG
Sbjct: 52  QSFSHSCLNSSCNYMVSYGDKSTTRGDFAL-ETLTLRS------DDTILVSVPNFAFGCG 104

Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND----SGSVFFGD 293
               G + +GAA  G+MGLG   +  P+  + A      FS C         SG + FG+
Sbjct: 105 HANKGLF-NGAA--GLMGLGKSSIGFPAQTSVA--FGKVFSYCLPSVSSTIPSGILHFGE 159

Query: 294 QGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEI 350
                     F P+ +       YFV +    +G+  L  S    +VDSG   +      
Sbjct: 160 AA-MLDYDVRFTPLVDSSSGPSQYFVSMTGINVGDELLPISA-TVMVDSGTVISRFEQSA 217

Query: 351 YAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
           Y  +   F +++   + ++    +  C+  S+ + + +P + L F  +    +      +
Sbjct: 218 YERLRDAFTQILPGLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILY 277

Query: 411 PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
           P ++G  V C     +     ++G       R V+D    +L  S  +C 
Sbjct: 278 PVDDG--VMCFAFAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFECN 325


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 89/364 (24%), Positives = 143/364 (39%), Gaps = 52/364 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSE-YDPSSSSSSKN 171
           + IGTP       +D GS+L+W+ C  C  C          LD +    +   +SSS K 
Sbjct: 9   LSIGTPPQLIPAMIDTGSDLVWLKCDNCDHC---------DLDHHGETIFFSDASSSYKK 59

Query: 172 VSCSHPLCKSRSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           + C+   C   SS       ++ C Y  +Y  + + +SG +  D +   S        S 
Sbjct: 60  LPCNSTHCSGMSSAGIGPRCEETCKYKYEYG-DGSRTSGDVGSDRISFRSHGAGEDHRSF 118

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-- 286
               + GCGRK  G   D     G++GLG    S+   L     +   FS C    DS  
Sbjct: 119 FDGFLFGCGRKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCLVSYDSPP 173

Query: 287 ---GSVFFGDQGPATQQSTSFLPI--GEKYDA--YFVGVESYCIGNSCLT----QSGF-- 333
                +F G             PI  G+  D   Y+V ++S  +G   +     +SG   
Sbjct: 174 SAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNT 233

Query: 334 --------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASSE 383
                   + ++DSG ++T L   +Y  +    ++ V    +   GNS     C+N+S +
Sbjct: 234 SVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTL---GNSAGLDLCFNSSGD 290

Query: 384 EMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
                P +   F+     V+   +IF     +   V CL++ S+ GD  IIG        
Sbjct: 291 TSYGFPSVTFYFANQVQLVLPFENIFQVTSRD---VVCLSMDSSGGDLSIIGNMQQQNFH 347

Query: 443 IVFD 446
           I++D
Sbjct: 348 ILYD 351


>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
          Length = 411

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 81/399 (20%), Positives = 157/399 (39%), Gaps = 47/399 (11%)

Query: 93  PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
           PS        GN +   H+   ++I  P   + + +D GS L W+ C   CI C  +   
Sbjct: 20  PSSAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHG 79

Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
            Y        E   +   + +  +  +   +    C   K+ C Y   Y     SS G L
Sbjct: 80  LYK------PELKYAVKCTEQRCADLYADLRKPMKCGP-KNQCHYGIQYV--GGSSIGVL 130

Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
           + D     SFS  A   +  +S+  GCG  Q  +  +   P +G++GLG G V++ S L 
Sbjct: 131 IVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLK 185

Query: 269 KAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN-- 325
             G+I ++    C      G +FFGD    T   T + P+  ++  Y     +    +  
Sbjct: 186 SQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTLHFNSNK 244

Query: 326 -SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYN 379
            S ++ +  + + DSGA++T+   + Y   +      +S +      +  +  +   C+ 
Sbjct: 245 QSPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWK 304

Query: 380 ASSEEMLKVPDMRLIFS----------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 429
              +++  + +++  F           K  +  +    +     EG    CL ++    +
Sbjct: 305 G-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKE 361

Query: 430 Y------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           +       +IG   M+   +++D E   L W + +C+ +
Sbjct: 362 HPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 400


>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
           partial [Brachypodium distachyon]
          Length = 354

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 69/300 (23%), Positives = 126/300 (42%), Gaps = 34/300 (11%)

Query: 180 KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
           + +  CK   + C Y   Y+  + SS G L+ D   L       P    + ++  GCG  
Sbjct: 66  RFKHDCKENPNQCDYDVRYAGGE-SSLGVLIADKFSL-------PGRDARPTLTFGCGYD 117

Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPAT 298
           Q G   +    DGV+G+G G   + S L + G I +N    C      G +FFG +    
Sbjct: 118 QEGGKAEMPV-DGVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQGGGYLFFGHE-KVP 175

Query: 299 QQSTSFLPIGEKYDAYFVGVESYC----IGNSCLTQSGFQALVDSGASFTFLPTEIYAEV 354
               +++P+      Y  G+ +      +GN  ++ +  + ++DSG+++T++PTE Y  +
Sbjct: 176 SSVVTWVPMVPNNHYYSPGLAALHFNGNLGNP-ISVAPMEVVIDSGSTYTYMPTETYRRL 234

Query: 355 V-VKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF--- 410
           V V    L  S    ++  +   C+ A  E    + D++  F   +   ++    +    
Sbjct: 235 VFVVIASLSKSSLTLVRDPALPVCW-AGKEPFKXIGDVKDKFKPLELAFIQGTSQAIMEI 293

Query: 411 -PEN----EGFTVFCLTVMSTDG------DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
            PEN     G    C+ ++  DG         +IG   M    +++D E  ++ W  + C
Sbjct: 294 PPENYLIISGEGNVCMGIL--DGTQAGLRKLNVIGDISMQNQLVIYDNERARIGWVRAPC 351


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 96/414 (23%), Positives = 164/414 (39%), Gaps = 64/414 (15%)

Query: 72  RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGS 131
           +   R++  S+  + +      + G Q    GN     +   + +GTP  +  + LD  +
Sbjct: 62  KDPARIRYLSSLTAQKTVAAPIASGQQVLNVGN-----YVVRVQLGTPGQTMYMVLDTSN 116

Query: 132 NLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-KSRS-SCKSL 188
           +  W PC  CI C+            + + +   +SS+   + CS P C ++R  SC + 
Sbjct: 117 DAAWAPCSGCIGCS------------STTTFSAQNSSTFATLDCSKPECTQARGLSCPTT 164

Query: 189 KD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG 247
            +  C +   Y   D++ S  LV D LHL          +V  +   GC    +GS +  
Sbjct: 165 GNVDCLFNQTYG-GDSTFSATLVQDSLHLG--------PNVIPNFSFGCISSASGSSI-- 213

Query: 248 AAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDEND----SGSVFFGDQG-PATQQS 301
             P G+MGLG G +   SL++++G L    FS C         SGS+  G  G P   ++
Sbjct: 214 -PPQGLMGLGRGPL---SLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRT 269

Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIY 351
           T  L    +   Y+V +    +G   +            +G   ++DSG   T     IY
Sbjct: 270 TPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIY 329

Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSF 410
             V  +F K V      L   ++  C+  ++E  +  P + L  S  +    + N +   
Sbjct: 330 TAVRDEFRKQVGGSFSPL--GAFDTCFATNNE--VSAPAITLHLSGLDLKLPMENSLI-- 383

Query: 411 PENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 460
             +   ++ CL + +          +I       HRI+FD  N KL  +   C 
Sbjct: 384 -HSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARELCN 436


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 101/385 (26%), Positives = 153/385 (39%), Gaps = 52/385 (13%)

Query: 103 GNQFYWLHY-TWIDIGTPNVSFL-VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
           G ++  L+Y T I +G      L V +D GS+L WV  QC  C    +S Y   D     
Sbjct: 172 GIRYQTLNYVTTIALGGGGAKNLTVIVDTGSDLTWV--QCEPCP--GSSCYAQRD---PL 224

Query: 161 YDPSSSSSSKNVSCSHPLCK-------------SRSSCKSLKDPCPYIADYSTEDTSSSG 207
           +DP++S +   V C  P C              +RS+  S +  C Y   Y  + + S G
Sbjct: 225 FDPAASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNS-EQRCYYALSYG-DGSFSRG 282

Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
            L  D L L + +K           + GCG    G +   A   G+MGLG  D+S+ S  
Sbjct: 283 VLAQDTLGLGTTTK-------LDGFVFGCGLSNRGLFGGTA---GLMGLGRTDLSLVS-- 330

Query: 268 AKAGLIQNSFSICF--DENDSGSVFFGDQGPAT----QQSTSFLPIGEKYDAYFVGVE-S 320
             A      FS C       +GS+  G  GP++       T  +    +   YF+ +  +
Sbjct: 331 QTAARFGGVFSYCLPATTTSTGSLSLG-PGPSSSFPNMAYTRMIADPTQPPFYFINITGA 389

Query: 321 YCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKY 376
              G + LT  GF A   LVDSG   T L   +Y  V  +F +    +  +  G S    
Sbjct: 390 AVGGGAALTAPGFGAGNVLVDSGTVITRLAPSVYKAVRAEFARRF--EYPAAPGFSILDA 447

Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIG 434
           CY+ +  + + VP + L         V      F   +  +  CL + S   +    IIG
Sbjct: 448 CYDLTGRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIG 507

Query: 435 QNFMMGHRIVFDRENLKLAWSHSKC 459
                  R+V+D    +L ++   C
Sbjct: 508 NYQQRNKRVVYDTVGSRLGFADEDC 532


>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
          Length = 245

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 96/237 (40%), Gaps = 21/237 (8%)

Query: 251 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 310
           DG++GLG G  S+ S L   GL++N    C      G +FFGD   +++   ++ P+  +
Sbjct: 13  DGMLGLGRGKSSLVSQLNSQGLVRNVVGHCLSAQGGGYIFFGDVYDSSR--LTWTPMSSR 70

Query: 311 -YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 369
               Y  G      G       G   + D+G+S+T+  +  Y  V+    K ++ K +  
Sbjct: 71  DLKHYVAGAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKKELAGKPLKE 130

Query: 370 QGNS------W--KYCYNASSEEMLKVPDMRLIFS----KNQSFVVRNHIFSFPENEGFT 417
             +       W  K  + +  E       M L F+     N  F +    +    N G  
Sbjct: 131 APDDQTLPLCWHGKRPFRSVYEVRKYFKSMALSFTSSGRTNTQFEIPPEAYLIVSNMGNV 190

Query: 418 VFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHL 470
             CL ++       GD  +IG   M+   +VFD E   + W+ + C  V +  HV +
Sbjct: 191 --CLGILDGSEVGMGDLNLIGDISMLDKVMVFDNEKRLIGWAPADCNRVPNSRHVSI 245


>gi|328768800|gb|EGF78845.1| hypothetical protein BATDEDRAFT_12639 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 355

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 148/359 (41%), Gaps = 64/359 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I +GTP  SFLV  D GS  LW+P    +C+  +   ++  +R LS          
Sbjct: 51  YYGRIALGTPPQSFLVQFDTGSANLWLPST--RCSDPACVKHSQFNRLLS---------- 98

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
                        S+  SL     +   Y T   S SG +  D  ++A  +      S  
Sbjct: 99  -------------STWTSLTQT--FSIQYGTG--SLSGVMSSDTFYMAGLT--VTNQSFA 139

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICFD 282
            SV       Q G+       DGV+GLG+ ++S+ ++      +   GLI    F +   
Sbjct: 140 ESV------SQPGTTFINTKYDGVLGLGMREISINNVATPMENMHAQGLIPAGVFGLYLT 193

Query: 283 ENDS-GSVF-FGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDS 339
           +N + GSV   G   P+    S ++LP+  K   + VG+ S     + L Q+  QA+ DS
Sbjct: 194 KNSAPGSVLTIGGYDPSHVDGSITWLPL-SKRQFWQVGLTSVTFNGTTLIQNA-QAVFDS 251

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
           G S   +PT             VS+  I  Q  +  Y           +P +  +   N 
Sbjct: 252 GTSLIAIPT-------------VSATLIHQQLGAIPYQNGLQLIPCTGLPSVTFML-NNV 297

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
           SF +RN  +  P   G+ V     +   G + I+G +FM  +  +FD +N ++  ++S+
Sbjct: 298 SFTLRNEDYVIPFGFGYCVSAFVGLDMHG-FWILGDSFMKLYYTIFDSDNNRIGIANSR 355


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 99/394 (25%), Positives = 163/394 (41%), Gaps = 81/394 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP  +  + +D GS L W+ C        + SY T+       +DP+ S+S + + 
Sbjct: 35  LTVGTPPQNVSMVIDTGSELSWLHCN------KTLSYPTT-------FDPTRSTSYQTIP 81

Query: 174 CSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           CS P C +R+       SC S  + C     Y+ + +SS G L  D+ H+         S
Sbjct: 82  CSSPTCTNRTQDFPIPASCDS-NNLCHATLSYA-DASSSDGNLASDVFHIG--------S 131

Query: 227 SVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
           S  S ++ GC      S  D  +   G+MG+  G +S  S L         FS C    D
Sbjct: 132 SDISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFP-----KFSYCISGTD 186

Query: 286 -SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSG 332
            SG +  G+            P  Q ST  LP  ++  AY V +E   + +  L   +S 
Sbjct: 187 FSGLLLLGESNLTWSVPLNYTPLIQISTP-LPYFDRV-AYTVQLEGIKVLDKLLPIPKST 244

Query: 333 F--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------CY 378
           F        Q +VDSG  FTFL   +Y  +   F    SS    L+   + +      CY
Sbjct: 245 FEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCY 304

Query: 379 --NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE---GFTVFCLTVMSTDGDYGII 433
               S   +  +P + L+F   +  V  + +      E     +V CL+  ++D    ++
Sbjct: 305 LVPLSQRVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSD----LL 360

Query: 434 G-QNFMMGHR------IVFDRENLKLAWSHSKCE 460
           G + +++GH       + FD E  ++  +  +C+
Sbjct: 361 GVEAYVIGHHHQQNVWMEFDLEKSRIGLAQVRCD 394


>gi|217073142|gb|ACJ84930.1| unknown [Medicago truncatula]
          Length = 191

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 40/127 (31%), Positives = 68/127 (53%), Gaps = 14/127 (11%)

Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           L++T + +G+P   + V +D GS++LWV   C++C+         +D  L+ YDP  S +
Sbjct: 69  LYFTKLGLGSPKKDYYVQVDTGSDILWV--NCVECSRCPTKSQIGMD--LTLYDPKGSHT 124

Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH- 222
           S+ +SC H  C S        C++ + PCPY   Y  + ++++GY V D L     + + 
Sbjct: 125 SELISCDHEFCSSTYDGPIPGCRA-ETPCPYSITYG-DGSATTGYYVRDYLTFDRINGNL 182

Query: 223 --APQSS 227
             APQ+S
Sbjct: 183 HTAPQNS 189


>gi|328768784|gb|EGF78829.1| hypothetical protein BATDEDRAFT_12559 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 355

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 148/359 (41%), Gaps = 64/359 (17%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I +GTP  SFLV  D GS  LW+P    +C+  +   ++  +R LS          
Sbjct: 51  YYGRIALGTPPQSFLVQFDTGSANLWLPST--RCSDPACVKHSQFNRLLS---------- 98

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
                        S+  SL     +   Y T   S SG +  D  ++A  +      S  
Sbjct: 99  -------------STWTSLTQT--FSIQYGTG--SLSGVMSSDTFYMAGLT--VTNQSFA 139

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICFD 282
            SV       Q G+       DGV+GLG+ ++S+ ++      +   GLI    F +   
Sbjct: 140 ESV------SQPGTTFINTKYDGVLGLGMREISINNVATPMENMHAQGLIPAGVFGLYLT 193

Query: 283 ENDS-GSVF-FGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDS 339
           +N + GSV   G   P+    S ++LP+  K   + VG+ S     + L Q+  QA+ DS
Sbjct: 194 KNSAPGSVLTIGGYDPSHVDGSITWLPL-SKRQFWQVGLTSVTFNGTTLIQNA-QAVFDS 251

Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
           G S   +PT             VS+  I  Q  +  Y           +P +  +   N 
Sbjct: 252 GTSLIAIPT-------------VSATLIHQQLGAIPYQNGLQLIPCTGLPSVTFML-NNV 297

Query: 400 SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
           SF +RN  +  P   G+ V     +   G + I+G +FM  +  +FD +N ++  ++S+
Sbjct: 298 SFTLRNEDYVIPFGFGYCVSAFVGLDMHG-FWILGDSFMKLYYTIFDSDNNRIGIANSR 355


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 112/447 (25%), Positives = 176/447 (39%), Gaps = 72/447 (16%)

Query: 54  PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQT--HFFGNQFYWLHY 111
           P K+ ++    LL +D  R   R  + S  + +R +    S  +Q   H   +     ++
Sbjct: 64  PPKSRLDGTRQLLQSDNAR---RQMISSLRHGTRRKAFEVSHTAQIPIHSGADSGQSQYF 120

Query: 112 TWIDIGTPN-VSFLVALDAGSNLLWVPCQ-----CIQCAPLSASYYTSLDRNLSEYDPSS 165
             I IGTP    F++  D GS+L W+ C+     C +  P     + + D          
Sbjct: 121 VSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRAND---------- 170

Query: 166 SSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
           SSS + + CS   CK       S + C +   PC +   Y      + G   ++ + +  
Sbjct: 171 SSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRY-LNGPRAIGVFANETVTVG- 228

Query: 219 FSKHAPQSSVQSSVIIGCGR--KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
            + H  +      V+IGC     +T  +     PDGVMGLG    S+   LA+  +  N 
Sbjct: 229 LNDH--KKIRLFDVLIGCTESFNETNGF-----PDGVMGLGYRKHSLALRLAE--IFGNK 279

Query: 277 FSICFDENDSGS-----VFFGD----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
           FS C  ++ S S     + FGD    + P  Q +   L +G     Y V V    +G S 
Sbjct: 280 FSYCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTE--LLLGYINAFYPVNVSGISVGGSM 337

Query: 328 LTQSG--------FQALVDSGASFTFLPTEIYAEVVVK----FDKLVSSKRISLQGNSWK 375
           L+ S            +VDSG S T L  E Y +VV      FDK      I L   +  
Sbjct: 338 LSISSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELN-N 396

Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD-GDYGI 432
           +C+     +   VP + + F+    F   V+++I    E     + CL ++  D     I
Sbjct: 397 FCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEG----IKCLGIIKADFPGSSI 452

Query: 433 IGQNFMMGHRIVFDRENLKLAWSHSKC 459
           +G      H   +D    KL +  S C
Sbjct: 453 LGNVMQQNHLWEYDLGRGKLGFGPSSC 479


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 94/414 (22%), Positives = 165/414 (39%), Gaps = 81/414 (19%)

Query: 114 IDIGTPNVSFLVAL--DAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
           + +G P+ +  V+L  D GS+L+W PC    C  L     T    + S   P   S  + 
Sbjct: 92  LSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCM-LCEGKATPGGNHSSPLPPPIDS--RR 148

Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA-PQ---SS 227
           +SC+ PLC +  S     D C            ++     D +   S + HA P    + 
Sbjct: 149 ISCASPLCSAAHSSAPTSDLC------------AAARCPLDAIETDSCASHACPPLYYAY 196

Query: 228 VQSSVIIGCGRKQTG--------------SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
              S++    R + G              ++   A P GV G G G +S+P+ LA +  +
Sbjct: 197 GDGSLVANLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPS--L 254

Query: 274 QNSFSICFDEND--------SGSVFFG---DQGPATQQSTSFL--PI--GEKYDAYF-VG 317
              FS C   +         S  +  G   D        T F+  P+    K+  ++ V 
Sbjct: 255 SGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVA 314

Query: 318 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 367
           +E+  +G   +                 +VDSG +FT LP++ +A V  +F + +++ R 
Sbjct: 315 LEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARF 374

Query: 368 SLQGNS-----WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFC 420
           +    +        CY+ S  +   VP + L F  N +  +  RN+   F   EG +V C
Sbjct: 375 TRAEGAEAQTGLAPCYHYSPSDR-AVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGC 433

Query: 421 LTVMSTDGD----------YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 464
           L +M+  G+           G +G     G  +V+D +  ++ ++  +C ++ D
Sbjct: 434 LMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDLWD 487


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 91/411 (22%), Positives = 169/411 (41%), Gaps = 46/411 (11%)

Query: 66  LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
           +  D KR  + +   S+ ++++ ++     GS      NQ    ++  I +G+P  S  +
Sbjct: 1   MHRDVKRVASLIHRLSSGSAAKYEV--EDFGSDVVSGMNQGSGEYFVRIGLGSPPRSQYM 58

Query: 126 ALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
            +D+GS+++WV C+ C QC       Y   D     +DP+ S+S   VSCS  +C    +
Sbjct: 59  VIDSGSDIVWVQCKPCTQC-------YHQTD---PLFDPADSASFMGVSCSSAVCDRVEN 108

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
                  C Y   Y  + + + G L    L   +F +     +V  +V IGCG    G +
Sbjct: 109 AGCNSGRCRYEVSYG-DGSYTKGTLA---LETLTFGR-----TVVRNVAIGCGHSNRGMF 159

Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQS 301
           +  A   G+        S+  +   +G   N+FS C      N +G + FG +  A    
Sbjct: 160 VGAAGLLGLG-----GGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSE--AMPVG 212

Query: 302 TSFLPIGEKYDA---YFVGVESYCIGNS--CLTQSGFQ--------ALVDSGASFTFLPT 348
            +++P+     A   Y++ +    +G++   +++  FQ         ++D+G + T  PT
Sbjct: 213 AAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPT 272

Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
             Y      F +   +   +   + +  CYN      ++VP +   FS      +  + F
Sbjct: 273 VAYEAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNF 332

Query: 409 SFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
             P ++  T FC     +     I+G     G +I  D  N  + +  + C
Sbjct: 333 LIPVDDAGT-FCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 93/387 (24%), Positives = 161/387 (41%), Gaps = 68/387 (17%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP  S  + LD GS L W+  QC +  P      T        +DPS SSS   + 
Sbjct: 81  LPIGTPPQSQQMILDTGSQLSWI--QCHKKVPRKPPPSTV-------FDPSLSSSFSVLP 131

Query: 174 CSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
           C+HPLCK R       +SC  L   C Y   Y+ + T + G LV + +  ++     P  
Sbjct: 132 CNHPLCKPRIPDFTLPTSC-DLNRLCHYSYFYA-DGTLAEGNLVREKITFSTSQSTPP-- 187

Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE--- 283
                +I+GC         D +   G++G+ LG +S  S   +A + + S+ +   +   
Sbjct: 188 -----LILGCAE-------DASDDKGILGMNLGRLSFAS---QAKITKFSYCVPTRQVRP 232

Query: 284 --NDSGSVFFGDQ-GPATQQSTSFLPIGEKYD-------AYFVGVESYCIGNSCLT--QS 331
               +GS + G+    A  Q  S L   +          A+ V ++   IGN  L    S
Sbjct: 233 GFTPTGSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVS 292

Query: 332 GF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNAS 381
            F        Q+++DSG+ FT+L    Y +V  +  +L     K+  +       C++ +
Sbjct: 293 AFRADPSGAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSDMCFDGN 352

Query: 382 SEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQN 436
           + E+ + + +M   F K    V+ +  + +   + G  V C+ +  ++       IIG  
Sbjct: 353 AMEIGRLIGNMVFEFDKGVEIVIEKGRVLA---DVGGGVHCVGIGRSEMLGAASNIIGNF 409

Query: 437 FMMGHRIVFDRENLKLAWSHSKCEEVI 463
                 + FD  N ++ +  + C   +
Sbjct: 410 HQQNLWVEFDIANRRVGFGKADCSRSV 436


>gi|197631813|gb|ACH70630.1| cathepsin D [Salmo salar]
 gi|223648160|gb|ACN10838.1| Cathepsin D precursor [Salmo salar]
          Length = 398

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 96/402 (23%), Positives = 158/402 (39%), Gaps = 70/402 (17%)

Query: 79  LQSNNNSSRNQLLFPS--EGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
           L    ++  N L FPS   G       N     +Y  I +GTP  +F V  D GS+ LWV
Sbjct: 45  LAGKEHTKYNNLGFPSSSNGPTPETLKNFMDAQYYGEIGLGTPAQTFTVVFDTGSSNLWV 104

Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYI 195
           P   + C                        S  +++C  H       S   +K+   + 
Sbjct: 105 P--SVHC------------------------SFTDIACLLHHKYNGAKSSTYVKNGTAFA 138

Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
             Y +   S SGYL  D   +   S            + G   KQ G     A  DG++G
Sbjct: 139 IQYGS--GSLSGYLSQDTCTIGGLSIE--------EQVFGEAIKQPGVAFIAAKFDGILG 188

Query: 256 LGLGDVSV-------PSLLAKAGLIQNSFSICFDEN----DSGSVFFGDQGPATQQSTSF 304
           +    +SV        +++++  + QN FS   + N      G +  G   P    S  F
Sbjct: 189 MAYPRISVDGVAPPFDNIMSQKKVEQNVFSFYLNRNPESEPGGELLLGGTDPK-YYSGDF 247

Query: 305 LPIGEKYDAYF-VGVESYCIGNSC-LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
             +     AY+ V ++   +G+   L + G +A+VD+G S    PT   AEV     K +
Sbjct: 248 QYLNVSRQAYWQVHMDGMGVGSQLSLCKGGCEAIVDTGTSLITGPT---AEVKA-LQKAI 303

Query: 363 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT 422
            +  + +QG      Y  + +++  +PD+       QS+ +    +   E++     CL+
Sbjct: 304 GATPL-IQGE-----YMVNCDKIPTMPDITFNLG-GQSYSLTAEQYVLKESQAGKTICLS 356

Query: 423 -VMSTD-----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
             M  D     G   I+G  F+  +  VFDR+N ++ ++ SK
Sbjct: 357 GFMGLDIPAPAGPLWILGDVFIGQYYTVFDRDNNRVGFAKSK 398


>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 242

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 67/248 (27%), Positives = 111/248 (44%), Gaps = 35/248 (14%)

Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
           +SSSG L +DI+     S+   Q +V      GC   +TG      A DG+MGLG G +S
Sbjct: 2   SSSSGVLGEDIVSFGRESELKAQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLS 55

Query: 263 VPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 319
           +   L + G+I +SFS+C+   D G    V  G   P+    +   P+   Y  Y + ++
Sbjct: 56  IMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRSPY--YNIELK 113

Query: 320 SYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--- 370
              +    L        S    ++DSG ++ +LP + +    + F   V+SK  SL+   
Sbjct: 114 EIHVAGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAF----MAFKDAVTSKVHSLKKIR 169

Query: 371 --GNSWK-YCYNASSEEMLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCL 421
               S+K  C+  +   + K+    PD+ ++F   Q  S    N++F   + +G   +CL
Sbjct: 170 GPDPSYKDICFAGARRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDG--AYCL 227

Query: 422 TVMSTDGD 429
            V     D
Sbjct: 228 GVFQNGKD 235


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 150/371 (40%), Gaps = 48/371 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP  +  +  D GS++LW+  QC+ C     S Y   D     ++PS SS+ 
Sbjct: 81  YFVSLGVGTPPRTVNMVADTGSDVLWL--QCLPC----QSCYGQTD---PLFNPSFSSTF 131

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ--SS 227
           ++++C   LC+        ++ C Y   Y            D    +  FS       S+
Sbjct: 132 QSITCGSSLCQQLLIRGCRRNQCLYQVSYG-----------DGSFTVGEFSTETLSFGSN 180

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS- 286
             +SV IGCG    G +   A   G+     G +S PS + +  L  + FS C    +S 
Sbjct: 181 AVNSVAIGCGHNNQGLFTGAAGLLGLG---KGLLSFPSQVGQ--LYGSVFSYCLPTREST 235

Query: 287 GSV--FFGDQGPATQQSTSFLPIGEKYDAYF--------VGVESYCIGNSCL-----TQS 331
           GSV   FG+Q  A+    + L    K D ++        VG  S  I    L     T +
Sbjct: 236 GSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGN 295

Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPD 390
           G   ++DSG + T L T  Y  +   F   + S      G S +  CY+ S    + +P 
Sbjct: 296 G-GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPA 354

Query: 391 MRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
           +  +F+   +  +       P +N G   +CL       ++ IIG       R+ FD   
Sbjct: 355 VSFVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIGNIQQQSFRMSFDSTG 412

Query: 450 LKLAWSHSKCE 460
            ++    ++C 
Sbjct: 413 NRVGIGANQCN 423


>gi|122938522|gb|ABM69085.1| aspartic proteinase AspMD02 [Musca domestica]
          Length = 379

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 85/364 (23%), Positives = 149/364 (40%), Gaps = 67/364 (18%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I IGTP   FLV  D GS+ LWVP      AP SA    +   N + YDPS+SS+ 
Sbjct: 68  YYGKITIGTPGQEFLVLFDTGSSNLWVP-----VAPCSAD--NAACENHNTYDPSASSTH 120

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
                S  +     S                     SGYLV+D + +    K   Q    
Sbjct: 121 VKKGESFSIQYGSGSL--------------------SGYLVEDTVDVEGL-KIKKQ---- 155

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-------SLLAKAGLIQNSFSICFD 282
              +      + G     A  DG+MG+G   ++V        +++++  + +  FS    
Sbjct: 156 ---VFAAATNEPGETFVYAPFDGIMGMGFKSIAVDDVTPPWYNMISQHLISEKVFSFYLA 212

Query: 283 E---NDSGSVFF--GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 337
               +D G V    G+     +    ++P+ E+    F   E++  G     +   QA+ 
Sbjct: 213 RRGTSDEGGVMVVGGNDDRYYEGDFHYVPVSEQGYWQFEMAEAHVNGVRICDRC--QAIA 270

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           D+G S   +PT+ Y E+  +     S        ++++Y  + S  + L V   RL    
Sbjct: 271 DTGTSLIAVPTDKYEEIQKEIGATFSY-------DTYEYMLDCSKIDDLPVVTFRL---G 320

Query: 398 NQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAW 454
           + +F +  R+++    +N+     C +     G D+ I+G  F+  +   FD E+ ++ +
Sbjct: 321 DGTFTLEGRDYVIKSDDNQ-----CSSAFEDGGTDFWILGDVFIGKYYTTFDAEHNRVGF 375

Query: 455 SHSK 458
           + +K
Sbjct: 376 ALAK 379


>gi|145351657|ref|XP_001420185.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580418|gb|ABO98478.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 498

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 80/374 (21%), Positives = 155/374 (41%), Gaps = 50/374 (13%)

Query: 123 FLVALDAGSNLLWVPCQCIQ---CAPLSASYYT-SLDRNLSEYDPSSSSSSKNVSCSHP- 177
           F + +D GS L + PC+      C      YY   + +   + + ++S+       + P 
Sbjct: 79  FDLEVDTGSPLTYFPCKGCPLEVCGIHEHPYYDYDMSKTFRKLNCTTSTEDAAYCNAQPN 138

Query: 178 --LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
             LC +  S     + C +   Y  + +   GY+ +D   L    + AP     + +  G
Sbjct: 139 VLLCDTNIS---YTNTCLFGIGY-VDGSVGRGYMAEDTFTLGD--ELAP-----AKITFG 187

Query: 236 CGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS------ 286
           CG      Y DG+    DG+ G   G+ +  + LAKAG+I  + F  C +  ++      
Sbjct: 188 CGGMY---YPDGSNLRQDGMAGFSRGNTAFHTQLAKAGVIDAHVFGFCSEGMETSTAMLT 244

Query: 287 -GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-TQSGFQALVDSGASFT 344
            G   FG + P    +     +GE  D   V   S+ +G+  + + S    ++DSG + T
Sbjct: 245 LGRYNFGRRVPELAWTRM---LGE--DDLAVRTMSWKLGDKTIASSSNVYTVLDSGTTLT 299

Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-------VPDMRLIFSK 397
            LP+ ++ + +   ++   S  +S+        Y    +  L         P + + +  
Sbjct: 300 VLPSAMHHDFMTHLNETARSAGLSVVVRGTHCFYENQRQSSLTQYTLTRWFPSLTITYDP 359

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMST------DGDYGIIGQNFMMGHRIVFDRENLK 451
           + + V+R   + F +      FC  +MS       +G+  I+GQ  +    + +D EN +
Sbjct: 360 DVTLVLRPENYLFADTVNLHAFCAGIMSASDAALANGEQIILGQQTLRNTFVEYDLENSR 419

Query: 452 LAWSHSKCEEVIDK 465
           +  +  +CE++ +K
Sbjct: 420 VGMATVQCEKLREK 433


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 88/370 (23%), Positives = 150/370 (40%), Gaps = 50/370 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  I +G+P  +  V +D+GS+++WV C+ C QC       Y   D     ++P+ SSS
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQC-------YHQSD---PVFNPADSSS 183

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
              VSC+  +C    +    +  C Y   Y  + + + G L  + L   +F +     ++
Sbjct: 184 YAGVSCASTVCSHVDNAGCHEGRCRYEVSYG-DGSYTKGTLALETL---TFGR-----TL 234

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQNSFSICFDEND-- 285
             +V IGCG    G ++  A   G++GLG G +S V  L  +AG    +FS C       
Sbjct: 235 IRNVAIGCGHHNQGMFVGAA---GLLGLGSGPMSFVGQLGGQAG---GTFSYCLVSRGIQ 288

Query: 286 -SGSVFFGDQGPATQQSTSFLPIGEKYDA---YF------------VGVESYCIGNSCLT 329
            SG + FG +  A     +++P+     A   Y+            V +       S L 
Sbjct: 289 SSGLLQFGRE--AVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELG 346

Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
             G   ++D+G + T LPT  Y      F    ++   +   + +  CY+      ++VP
Sbjct: 347 DGG--VVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVP 404

Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 449
            +   FS      +    F  P ++    FC     +     IIG     G  I  D  N
Sbjct: 405 TVSFYFSGGPILTLPARNFLIPVDD-VGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGAN 463

Query: 450 LKLAWSHSKC 459
             + +  + C
Sbjct: 464 GFVGFGPNVC 473


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 77/286 (26%), Positives = 116/286 (40%), Gaps = 53/286 (18%)

Query: 132 NLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 190
           ++ W  C+ C++C          L  +   +DPS+S           L  S  SC     
Sbjct: 97  SITWTQCKPCVRC----------LKDSHRHFDPSAS-----------LTYSLGSCIPSTV 135

Query: 191 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
              Y   Y  + TS   Y  D +            S V      GCGR   G +  GA  
Sbjct: 136 GNTYNMTYGDKSTSVGNYGCDTMT--------LEPSDVFPKFQFGCGRNNEGDFGSGA-- 185

Query: 251 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFGDQGPATQQSTSFLPIG- 308
           DG++GLG G +S  S  A     +  FS C  E DS GS+ FG++   +Q S  F  +  
Sbjct: 186 DGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFGEKA-TSQSSLKFTSLVN 242

Query: 309 -------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDSGASFTFLPTEIYAEVVV 356
                  E+   YFV +    +GN  L    S F +   ++DSG   T LP   Y+ +  
Sbjct: 243 GPGTSGLEESGYYFVKLLDISVGNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYSALTA 302

Query: 357 KFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
            F K ++   +S     +G+    CYN S  + + +P++ L F + 
Sbjct: 303 AFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEG 348


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 88/369 (23%), Positives = 153/369 (41%), Gaps = 44/369 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + +GTP  +  +  D GS++LW+  QC+ C     S Y   D     ++PS SS+ 
Sbjct: 81  YFVSLGVGTPPRTVNMVADTGSDVLWL--QCLPC----QSCYGQTD---PLFNPSFSSTF 131

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           ++++C   LC+        ++ C Y   Y  + + + G    + L           S+  
Sbjct: 132 QSITCGSSLCQQLLIRGCRRNQCLYQVSYG-DGSFTVGEFSTETLSFG--------SNAV 182

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GS 288
           +SV IGCG    G +   A   G+     G +S PS + +  L  + FS C    +S GS
Sbjct: 183 NSVAIGCGHNNQGLFTGAAGLLGLG---KGLLSFPSQVGQ--LYGSVFSYCLPTRESTGS 237

Query: 289 V--FFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL------------TQSGF 333
           V   FG+Q  A+    + L    K D  Y+V +    +G + +            T +G 
Sbjct: 238 VPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNG- 296

Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMR 392
             ++DSG + T L T  Y  +   F   + S      G S +  CY+ S    + +P + 
Sbjct: 297 GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVS 356

Query: 393 LIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 451
            +F+   +  +       P +N G   +CL       ++ IIG       R+ FD    +
Sbjct: 357 FVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNR 414

Query: 452 LAWSHSKCE 460
           +    ++C 
Sbjct: 415 VGIGANQCN 423


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 89/374 (23%), Positives = 161/374 (43%), Gaps = 60/374 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + IGTP  ++   +D GS+L+W  C+ C QC           D+    +DP  SSS   +
Sbjct: 101 LAIGTPPETYSAIMDTGSDLIWTQCKPCTQC----------FDQPTPIFDPKKSSSFSKL 150

Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
           SCS  LC++  +S+C    D C Y+  Y  + +S+ G L  + L     S   P+     
Sbjct: 151 SCSSKLCEALPQSTC---SDGCEYLYGYG-DYSSTQGMLASETLTFGKVS--VPE----- 199

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSG 287
            V  GCG    GS     +  G++GLG G +S+ S L +       FS C    D+  + 
Sbjct: 200 -VAFGCGEDNEGSGFSQGS--GLVGLGRGPLSLVSQLKEP-----KFSYCLTSVDDTKAS 251

Query: 288 SVFFGDQGPATQ-----QSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ------ 334
           ++  G            ++T  +    +   Y++ +E   +G++ L   +S F       
Sbjct: 252 TLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGS 311

Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM-LKVPDM 391
              ++DSG + T+L    +  V  +F   ++    +      + C+   S    ++VP +
Sbjct: 312 GGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSGSTDIEVPKL 371

Query: 392 RLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRE 448
              F   +      N++ +   +    V CL + S+ G   +G I Q  M+   ++ D E
Sbjct: 372 VFHFDGADLELPAENYMIA---DASMGVACLAMGSSSGMSIFGNIQQQNML---VLHDLE 425

Query: 449 NLKLAWSHSKCEEV 462
              L++  ++C+E+
Sbjct: 426 KETLSFLPTQCDEL 439


>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
          Length = 454

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 91/397 (22%), Positives = 157/397 (39%), Gaps = 86/397 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +  GTP  +  + +D GS+L+W PC     C  C+      +++ + + + + P SSSSS
Sbjct: 94  LSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCS------FSTSNPSSNIFIPKSSSSS 147

Query: 170 KNVSCSHPLC------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           K + C +P C      K +S C+  +   P               +    L+   F  H 
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQ-----------ICPPYLNFLRFWDHR 196

Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-- 281
            +S     ++           L  +    + G G G  S+PS L   GL    FS C   
Sbjct: 197 -RSQFHRRMLCP---------LHQSTRREISGFGRGPPSLPSQL---GL--KKFSYCLLS 241

Query: 282 ----DENDSGSVFFGDQGPATQQST--SFLPIGEKYDA---------YFVGVESYCIGNS 326
               D  +S S+    +  + +++   S+ P  +             Y++G+    +G  
Sbjct: 242 RRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGK 301

Query: 327 CL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQG-NSW 374
            +                 ++DSG +FT++  EI+  V  +F+K V SKR + ++G    
Sbjct: 302 HVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGL 361

Query: 375 KYCYNASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG- 431
           + C+N S       P++ L F         + N++       G  V CLT++ TDG  G 
Sbjct: 362 RPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFL---GGDDVVCLTIV-TDGAAGK 417

Query: 432 -------IIGQNFMMGHRIV-FDRENLKLAWSHSKCE 460
                  II  NF   +  V +D  N +L +    C+
Sbjct: 418 EFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSCK 454


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 93/428 (21%), Positives = 163/428 (38%), Gaps = 71/428 (16%)

Query: 74  KTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF---LVALDAG 130
           K  ++L +   +   +LL P  G      G   Y +    + IGTP        V  D G
Sbjct: 93  KEEIQLATAIAAGDKKLLVPLYGRPQ---GGSTYLVQ---LRIGTPTDRISPRYVLFDTG 146

Query: 131 SNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK 189
           S+L W  C+ C  C+  +             +DPS S + + +SC  P+C+    C ++ 
Sbjct: 147 SDLSWTQCEPCTNCSSFTP---------YPPHDPSKSRTFRRLSCFDPMCEL---CTAVV 194

Query: 190 DP------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
           D       C +   Y  +  + SG LV D+ H  + +       ++  V  GC   +   
Sbjct: 195 DGGGGSAGCLFRRRYG-DGGAVSGELVSDVFHFGA-AGDGGGYQLERDVAFGCAHVEDSK 252

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-------------DENDSGSVF 290
            + G +  G++ LG+G    PS + + G+  + FS C              +E  +  + 
Sbjct: 253 AVRGYS-TGILALGIGK---PSFVTQLGV--DRFSYCIPASEITDDDDDDDEERSASFLR 306

Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGV--------------ESYCIGNSCLTQSGFQAL 336
           FG     T +   F   G  Y      V                Y  G      +    L
Sbjct: 307 FGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPML 364

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVS-SKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           VDSG +  +LP  ++  +  + ++ +S ++R  L   S  YCY  +  ++  V  + L F
Sbjct: 365 VDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSL-YCYLGNMTDVEAV-SVTLGF 422

Query: 396 SKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
                  +    +F   EN      CL V +  G+  I+G        + +D   +++A+
Sbjct: 423 GGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINVGYDLSTMEIAF 480

Query: 455 SHSKCEEV 462
              +C+ V
Sbjct: 481 DRDQCDRV 488


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 150/367 (40%), Gaps = 44/367 (11%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
           ++  I +G+P  S  + +D+GS+++WV C+ C QC       Y   D     +DP+ S+S
Sbjct: 43  YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQC-------YHQTD---PLFDPADSAS 92

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
              VSCS  +C    +       C Y   Y  + +S+ G L  + L L          +V
Sbjct: 93  FMGVSCSSAVCDQVDNAGCNSGRCRYEVSYG-DGSSTKGTLALETLTLG--------RTV 143

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---ND 285
             +V IGCG    G ++  A   G+ G  +  V    L  + G   N+FS C      N 
Sbjct: 144 VQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVG--QLSRERG---NAFSYCLVSRVTNS 198

Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC---------LTQSGF 333
           +G + FG +  A     +++P+     +   Y++G+    +G+           LT+ G 
Sbjct: 199 NGFLEFGSE--AMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGN 256

Query: 334 QALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
             +V D+G + T  PT  Y      F     +   +   + +  CYN      ++VP + 
Sbjct: 257 GGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVS 316

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
             FS      +  + F  P ++  T FC     +     I+G     G +I  D  N  +
Sbjct: 317 FYFSGGPILTLPANNFLIPVDDAGT-FCFAFAPSPSGLSILGNIQQEGIQISVDGANEFV 375

Query: 453 AWSHSKC 459
            +  + C
Sbjct: 376 GFGPNVC 382


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 152/371 (40%), Gaps = 49/371 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I IG P V  L+ +D GS+L W+ C   +C P +  +          + PS SS+ +N S
Sbjct: 82  ISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQTIPF----------FHPSRSSTYRNAS 131

Query: 174 C-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
           C S P    +         C Y   Y  + +++ G L ++ L   +F         + ++
Sbjct: 132 CVSAPHAMPQIFRDEKTGNCQYHLRYR-DFSNTRGILAEEKL---TFETSDDGLISKQNI 187

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSGS 288
           + GCG+  +G         GV+GLG G  S+  +    G   + FS CF    +     +
Sbjct: 188 VFGCGQDNSGF----TKYSGVLGLGPGTFSI--VTRNFG---SKFSYCFGSLTNPTYPHN 238

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGFQALV 337
           +     G   +   + L I +  D Y++ +++   G   L           +Q G   ++
Sbjct: 239 ILILGNGAKIEGDPTPLQIFQ--DRYYLDLQAISFGEKLLDIEPGTFQRYRSQGG--TVI 294

Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSE-EMLKVPDMRLI 394
           D+G S T L  E Y  +  + D L+    +R+         CY  + + ++   P +   
Sbjct: 295 DTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFH 354

Query: 395 FSKNQSFVVRNHIFSFPENEGFTVFCLTV-MSTDGDYGIIGQNFMMGHRIVFDRENLKLA 453
           F+      +      F  +E    FCL + M+T  D  +IG      + + ++   +K+ 
Sbjct: 355 FAGGAELALDVESL-FVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVY 413

Query: 454 WSHSKCEEVID 464
           +  + C E+ID
Sbjct: 414 FQRTDC-EIID 423


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 95/430 (22%), Positives = 172/430 (40%), Gaps = 81/430 (18%)

Query: 78  KLQSNNNSSRNQ--LLFPSEGSQTHFFGNQFYWLHYTWIDI----GTPNVSFLVALDAGS 131
           ++Q+  +SS+ Q  LL P + +QT     +  + H   + I    G+P  +  + LD GS
Sbjct: 22  QIQTCVSSSQTQKPLLLPLK-TQTQTPPRKLAFQHNVTLTISLTIGSPPQNVTMVLDTGS 80

Query: 132 NLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS-------S 184
            L W+ C+              L    S ++P  SSS     C+  +C +R+       S
Sbjct: 81  ELSWLHCK-------------KLPNLNSTFNPLLSSSYTPTPCNSSVCMTRTRDLTIPAS 127

Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC--GRKQTG 242
           C      C  I  Y+ + +S+ G L  +   LA         + Q   + GC      T 
Sbjct: 128 CDPNNKLCHVIVSYA-DASSAEGTLAAETFSLA--------GAAQPGTLFGCMDSAGYTS 178

Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQST 302
              + A   G+MG+  G +S+ +      ++   FS C    D+  V     GP+     
Sbjct: 179 DINEDAKTTGLMGMNRGSLSLVT-----QMVLPKFSYCISGEDAFGVLLLGDGPSAPSPL 233

Query: 303 SFLPI------GEKYD--AYFVGVESYCIGNSCL-----------TQSGFQALVDSGASF 343
            + P+         +D  AY V +E   +    L           T +G Q +VDSG  F
Sbjct: 234 QYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAG-QTMVDSGTQF 292

Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------CYNASSEEMLKVPDMRLIFSK 397
           TFL   +Y  +  +F +        ++  ++ +      CY+A +  +  VP + L+FS 
Sbjct: 293 TFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPA-SLAAVPAVTLVFSG 351

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENL 450
            +  V    +          V+C T  ++D    ++G + +++GH       + FD    
Sbjct: 352 AEMRVSGERLLYRVSKGRDWVYCFTFGNSD----LLGIEAYVIGHHHQQNVWMEFDLVKS 407

Query: 451 KLAWSHSKCE 460
           ++ ++ + C+
Sbjct: 408 RVGFTETTCD 417


>gi|342305186|dbj|BAK55647.1| cathepsin D [Oplegnathus fasciatus]
          Length = 396

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 100/408 (24%), Positives = 160/408 (39%), Gaps = 73/408 (17%)

Query: 74  KTRVKLQSNNNSSRNQLLFPS-EGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSN 132
           +T  +L ++ NS +  L FPS  G       N     +Y  I +GTP   F V  D GS+
Sbjct: 39  RTAEELLADKNSLKYNLGFPSSNGPTPETLKNYLDAQYYGEIGLGTPPQPFTVVFDTGSS 98

Query: 133 LLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC-SHPLCKSRSSCKSLKDP 191
            LWVP   + C+ L                        +++C  H    S  S   +K+ 
Sbjct: 99  NLWVP--SVHCSIL------------------------DIACLLHHKYNSAKSSTYVKNG 132

Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
             +   Y T   S SGYL  D   +   S            + G   KQ G     A  D
Sbjct: 133 TAFAIQYGT--GSLSGYLSQDTCTIGDISV--------DKQLFGEAIKQPGVAFIAAKFD 182

Query: 252 GVMGLGLGDVSV-------PSLLAKAGLIQNSFSICFDEN----DSGSVFFGDQGPATQQ 300
           G++G+    +SV        +++++  + +N FS   + N      G +  G   P    
Sbjct: 183 GILGMAYPRISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPDTEPGGELLLGGTDPK-YY 241

Query: 301 STSFLPIGEKYDAYF-VGVESYCIGNSC-LTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
           S  F  +     AY+ + ++   +G    L  SG +A+VD+G S    P+   AE V   
Sbjct: 242 SGDFHYVNITRQAYWQIHMDGMAVGGQLNLCTSGCEAIVDTGTSLITGPS---AE-VRSL 297

Query: 359 DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEGF 416
            K + +    +QG     C         K+P + +I      QS+V+    +    ++  
Sbjct: 298 QKAIGAIPF-IQGEYMVSCD--------KIPSLPVITFNVGGQSYVLTGEQYVLKVSQAG 348

Query: 417 TVFCLT-VMSTD-----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
              CL+  M  D     G   I+G  F+  +  VFDREN ++ ++ SK
Sbjct: 349 KTICLSGFMGLDIPAPAGPLWILGDVFIGQYYTVFDRENNQVGFAKSK 396


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 92/372 (24%), Positives = 149/372 (40%), Gaps = 48/372 (12%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++  + IG P  S+ + LD GS++ W     IQCAP S S Y+ +D     YDPS+SSS 
Sbjct: 12  YFARMGIGNPQRSYYLELDTGSDVTW-----IQCAPCS-SCYSQVD---PIYDPSNSSSY 62

Query: 170 KNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
           + V C   LC++   S+C+ +   C Y   Y     SS        L + SF      S+
Sbjct: 63  RRVYCGSALCQALDYSACQGMG--CSYRVVYGDSSASSGD------LGIESFYLGPNSST 114

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFD---- 282
              ++  GCG   +G +   A   G+           S  ++ A  I  +FS C      
Sbjct: 115 AMRNIAFGCGHSNSGLFRGEAGLLGMG------GGTLSFFSQIAASIGPAFSYCLVDRYS 168

Query: 283 --ENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNS---------CLTQ 330
             ++ S  + FG    P   + T  L        Y+  +    +G +          LT 
Sbjct: 169 QLQSRSSPLIFGRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTG 228

Query: 331 SGF-QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW--KYCYNASSEEMLK 387
           +G   A++DSG S T +    YA  V++     +S+ +      +    C+N      ++
Sbjct: 229 NGTGGAILDSGTSVTRVVPPAYA--VLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQ 286

Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           +P + L F      V+       P +   T FCL    +     +IG       RI FD 
Sbjct: 287 IPSLVLHFDNGVDMVLPGGNILIPVDRSGT-FCLAFAPSSMPISVIGNVQQQTFRIGFDL 345

Query: 448 ENLKLAWSHSKC 459
           +   +A +  +C
Sbjct: 346 QRSLIAIAPREC 357


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 102/431 (23%), Positives = 163/431 (37%), Gaps = 89/431 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLDR-NLSEYDPSSSS 167
           + IGTP     V +D GS+L WVPC      C  C      Y  ++    L+ + P+ SS
Sbjct: 25  LSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCE----EYQNNISGPRLAAFLPTHSS 80

Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPC-------------------PYIADYSTEDTSSSGY 208
           +S   +C    C    S  +  DPC                   P  A         +G 
Sbjct: 81  TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 140

Query: 209 LVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
           L  D+L       +   ++ Q      GC      +Y +   P G+ G G G +S+P  L
Sbjct: 141 LTRDVLFTHGNYNNNNNNNKQIPRFCFGC---VGATYRE---PIGIAGFGRGLLSLPFQL 194

Query: 268 AKAGLIQNSFSICF-------DENDSGSVFFGDQGPATQ----QSTSFLPIGEKYDAYFV 316
              G     FS CF       + N S  +  G+   +++    Q T  L      + Y++
Sbjct: 195 ---GFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYI 251

Query: 317 GVESYCIGNS--------------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
           G+ES  IGN                 T+     L+DSG ++T LP  +Y++++   + ++
Sbjct: 252 GLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVI 311

Query: 363 S---SKRISLQGNSWKYCY-------NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF 410
               +K++ L    +  CY       N+S  +  ++P +   F  N S V+   N+ ++ 
Sbjct: 312 GYPRAKQVELN-TGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAM 370

Query: 411 PENEGFTVF-CLTVMSTDGDY-----------GIIGQNFMMGHRIVFDRENLKLAWSHSK 458
                 TV  CL   S DG             GI G        +V+D E  +L +    
Sbjct: 371 AAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMD 430

Query: 459 CEEVIDKSHVH 469
           C  V  K  +H
Sbjct: 431 CVSVAAKQGLH 441


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 86/370 (23%), Positives = 145/370 (39%), Gaps = 45/370 (12%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP V  L   D GS+L+W      QC P    Y    ++    +DP  S + K + 
Sbjct: 98  ISLGTPPVPMLGIADTGSDLIWR-----QCLPCPNCY----EQVEPLFDPKESETYKTLD 148

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSV 232
           C +  C+      S  D       YS  D S + G L  D L + S ++  P S     +
Sbjct: 149 CDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGS-TEGDPASF--PGI 205

Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSG 287
             GCG    G++ +         +GLG   +  ++  +  +   FS C      D   S 
Sbjct: 206 AFGCGHDNGGTFNE----KDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSS 261

Query: 288 SVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSGF------------ 333
            + FG  G  +   T   P+  G     Y++ +E   +G+  +   GF            
Sbjct: 262 KINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEE 321

Query: 334 -QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
              ++DSG + T LP + Y +V       +  +  +     +  CY  SS   L++P + 
Sbjct: 322 GNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCY--SSVNNLEIPTIT 379

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHRIVFDRENLK 451
             F+     +   + F     E    F +   S    +G + Q NF++G    +D +N K
Sbjct: 380 AHFTGADVQLPPLNTF-VQVQEDLVCFSMIPSSNLAIFGNLAQINFLVG----YDLKNNK 434

Query: 452 LAWSHSKCEE 461
           +++  + C E
Sbjct: 435 VSFKQTDCTE 444


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/256 (25%), Positives = 105/256 (41%), Gaps = 35/256 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  +  G+P   + + +D GS+L W     +QC P     +   D     +DPS+S + 
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSW-----LQCKPCVVYCHVQAD---PLFDPSASKTY 169

Query: 170 KNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           K++SC+   C S          C++  + C Y A Y  + + S GYL  D+L LA     
Sbjct: 170 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYG-DSSYSMGYLSQDLLTLA----- 223

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
              S      + GCG+   G +   A   G++GLG   +S+  L   +     +FS C  
Sbjct: 224 --PSQTLPGFVYGCGQDSDGLFGRAA---GILGLGRNKLSM--LGQVSSKFGYAFSYCLP 276

Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCLTQSGFQ----A 335
               G      +      +  F P+         YF+ + +  +G   L  +  Q     
Sbjct: 277 TRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT 336

Query: 336 LVDSGASFTFLPTEIY 351
           ++DSG   T LP  +Y
Sbjct: 337 IIDSGTVITRLPMSVY 352


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 96/382 (25%), Positives = 149/382 (39%), Gaps = 63/382 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP       LD GS+L+W   QC  CA       + L +    + P++SSS   + 
Sbjct: 107 LAIGTPPQPVSALLDTGSDLIWT--QCAPCA-------SCLAQPDPLFAPAASSSYVPMR 157

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
           CS  LC      SC+   D C Y  +Y  + T++ G    +    AS S       +   
Sbjct: 158 CSGQLCNDILHHSCQR-PDTCTYRYNYG-DGTTTLGVYATERFTFASSSGE----KLSVP 211

Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----- 286
           +  GCG    GS  +G+   G++G G   +S+ S L+        FS C     S     
Sbjct: 212 LGFGCGTMNVGSLNNGS---GIVGFGRDPLSLVSQLSI-----RRFSYCLTPYTSTRKST 263

Query: 287 ---GS----VFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ-- 334
              GS    VF GD     Q Q+T  L   +    Y+V      +G   L    S F   
Sbjct: 264 LMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALR 323

Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY---------N 379
                  +VDSG + T  P  +  EV+  F   +     S        C+          
Sbjct: 324 PDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRR 383

Query: 380 ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 438
           AS+  ++ VP M   F   +     RN++   P      +    +++  GD G    NF+
Sbjct: 384 ASAATVVSVPRMAFHFQGADLELPRRNYVLDDPRRGSLCI----LLADSGDSGATIGNFV 439

Query: 439 -MGHRIVFDRENLKLAWSHSKC 459
               R+++D E   L+++ ++C
Sbjct: 440 QQDMRVLYDLEAETLSFAPAQC 461


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 96/384 (25%), Positives = 143/384 (37%), Gaps = 80/384 (20%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP V   V +D GS+L WV  QC  C   S+S Y   D     YDP++SS+   V 
Sbjct: 131 LGIGTPAVQQTVLIDTGSDLSWV--QCKPCN--SSSCYPQKD---PLYDPTASSTYAPVP 183

Query: 174 CSHPLCK-----------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
           C    CK           + SS  SL   C Y  +Y   DT+   Y  + +         
Sbjct: 184 CDSKACKDLVPDAYDHGCTNSSGTSL---CQYGIEYGNRDTTVGVYSTETL-------TL 233

Query: 223 APQSSVQSSVIIGCGRKQTGS-------YLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQ 274
           +PQ SV+     GCG  Q G+          G AP+             SL+++ A    
Sbjct: 234 SPQVSVK-DFGFGCGLVQQGTFDLFDGLLGLGGAPE-------------SLVSQTAETYG 279

Query: 275 NSFSICFDENDSGSVFFGDQGPATQQSTS---FLP---IGEKYDAYFVGVESYCIGNSCL 328
            +FS C    +S + F     P     T+   F P   + E+   Y V +    +G   L
Sbjct: 280 GAFSYCLPPGNSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPL 339

Query: 329 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASS 382
               T      ++DSG   T LP   Y+ +   F   +S+  +    N      CYN + 
Sbjct: 340 DIPPTVLSGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTG 399

Query: 383 EEMLKVPDMRLIFSKNQSF-------VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 435
              + VP + L F    +        V+     +F              ++DGD GIIG 
Sbjct: 400 IANVTVPTVALTFDGGATIDLDVPSGVLIQDCLAFAGG-----------ASDGDVGIIGN 448

Query: 436 NFMMGHRIVFDRENLKLAWSHSKC 459
                  +++D     + +    C
Sbjct: 449 VNQRTFEVLYDSGRGHVGFRPGAC 472


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 93/428 (21%), Positives = 163/428 (38%), Gaps = 71/428 (16%)

Query: 74  KTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF---LVALDAG 130
           K  ++L +   +   +LL P  G      G   Y +    + IGTP        V  D G
Sbjct: 72  KEEIQLATAIAAGDKKLLVPLYGRPQ---GGSTYLVQ---LRIGTPTDRISPRYVLFDTG 125

Query: 131 SNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK 189
           S+L W  C+ C  C+  +             +DPS S + + +SC  P+C+    C ++ 
Sbjct: 126 SDLSWTQCEPCTNCSSFTP---------YPPHDPSKSRTFRRLSCFDPMCEL---CTAVV 173

Query: 190 DP------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
           D       C +   Y  +  + SG LV D+ H  + +       ++  V  GC   +   
Sbjct: 174 DGGGGSAGCLFRRRYG-DGGAVSGELVSDVFHFGA-AGDGGGYQLERDVAFGCAHVEDSK 231

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-------------DENDSGSVF 290
            + G +  G++ LG+G    PS + + G+  + FS C              +E  +  + 
Sbjct: 232 AVRGYS-TGILALGIGK---PSFVTQLGV--DRFSYCIPASEITDDDDDDDEERSASFLR 285

Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGV--------------ESYCIGNSCLTQSGFQAL 336
           FG     T +   F   G  Y      V                Y  G      +    L
Sbjct: 286 FGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPML 343

Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVS-SKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           VDSG +  +LP  ++  +  + ++ +S ++R  L   S  YCY  +  ++  V  + L F
Sbjct: 344 VDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSL-YCYLGNMTDVEAV-SVTLGF 401

Query: 396 SKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 454
                  +    +F   EN      CL V +  G+  I+G        + +D   +++A+
Sbjct: 402 GGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINVGYDLSTMEIAF 459

Query: 455 SHSKCEEV 462
              +C+ V
Sbjct: 460 DRDQCDRV 467


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 156/369 (42%), Gaps = 52/369 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + +GTP V     +D GS+L+W      QC P    Y     +    ++P  S++   + 
Sbjct: 54  LTLGTPPVDVYGLVDTGSDLVWA-----QCTPCQGCYR----QKSPMFEPLRSNTYTPIP 104

Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQS 230
           C    C S    SC   K  C Y   Y+  D+S + G L  + +   +FS    +  V  
Sbjct: 105 CDSEECNSLFGHSCSPQK-LCAY--SYAYADSSVTKGVLARETV---TFSSTDGEPVVVG 158

Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS--FSICF-----DE 283
            ++ GCG   +G++ +       MG+        SL+++ G +  S  FS C      D 
Sbjct: 159 DIVFGCGHSNSGTFNEND-----MGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADP 213

Query: 284 NDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIG------NSCLTQSGFQA 335
           +  G++ FGD    + +  +  P+   E    Y V +E   +G      NS    S    
Sbjct: 214 HTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNI 273

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN---SWKYCYNASSEEMLKVPDMR 392
           ++DSG   T+LP E Y  +V +    V S  + +  +     + CY   SE  L+ P + 
Sbjct: 274 MIDSGTPATYLPQEFYDRLVKELK--VQSNMLPIDDDPDLGTQLCYR--SETNLEGPILI 329

Query: 393 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIV-FDRENL 450
             F      ++    F  P  +G  VFC  +  +TDG+Y I G NF   + ++ FD +  
Sbjct: 330 AHFEGADVQLMPIQTF-IPPKDG--VFCFAMAGTTDGEY-IFG-NFAQSNVLIGFDLDRK 384

Query: 451 KLAWSHSKC 459
            +++  + C
Sbjct: 385 TVSFKATDC 393


>gi|255552245|ref|XP_002517167.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543802|gb|EEF45330.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 435

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 73/281 (25%), Positives = 119/281 (42%), Gaps = 45/281 (16%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYY-TSLDRNLSEYDPSSSSS 168
           + T I+  TP V+  + +D G   +WV C       +S+SY     D  L +   S S +
Sbjct: 49  YVTQINQRTPLVAVKLTVDLGGTFMWVDCDNY----VSSSYTPVRCDSALCKLADSHSCT 104

Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           ++  S   P C + +        C +I        S+SG +  D++ L S     P  +V
Sbjct: 105 TECYSSPKPGCYNNT--------CSHIPYNPVVHVSTSGDIGLDVVSLQSMDGKYPGRNV 156

Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-- 283
              +V   CG   TG  L+  A    GV GLG G++S+P+  + A  +Q+ F+IC     
Sbjct: 157 SVPNVPFVCG---TGFMLENLADGVLGVAGLGRGNISLPAYFSSALGLQSKFAICLSSLT 213

Query: 284 NDSGSVFFGDQ-GPATQQSTSFLPI-------------GEKYDAYFVGVESYCIGNSCLT 329
           N SG ++FGD  GP +     + P+             G+    YF+ V++  +G   + 
Sbjct: 214 NSSGVIYFGDSIGPLSSDFLIYTPLVRNPVSTAGAYFEGQSSTDYFIAVKTLRVGGKEIK 273

Query: 330 QSGFQALVDSGAS----------FTFLPTEIYAEVVVKFDK 360
            +     +D+             +T L T IY  V+  F K
Sbjct: 274 FNKTLLSIDNEGKGGTRISTVHPYTLLHTSIYKAVIKAFAK 314


>gi|45382405|ref|NP_990209.1| pepsin A precursor [Gallus gallus]
 gi|4589838|dbj|BAA76891.1| pepsinogen A [Gallus gallus]
 gi|4757361|dbj|BAA77268.1| pepsinogen A [Gallus gallus]
          Length = 382

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 158/369 (42%), Gaps = 86/369 (23%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           +Y  I IGTP   F V  D GS+ LWVP   I C   + S       N   +DPS SS+ 
Sbjct: 74  YYGTISIGTPQQDFTVIFDTGSSNLWVP--SIYCKSSACS-------NHKRFDPSKSSTY 124

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
             VS +  +               YIA Y T   S SG L  D + ++S         VQ
Sbjct: 125 --VSTNETV---------------YIA-YGT--GSMSGILGYDTVAVSSI-------DVQ 157

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-------VPSLLAKAGLIQNSFSICFD 282
           +  I G    + GS+      DG++GL    +S         +++++  + Q+ FS+   
Sbjct: 158 NQ-IFGLSETEPGSFFYYCNFDGILGLAFPSISSSGATPVFDNMMSQHLVAQDLFSVYLS 216

Query: 283 EN-DSGS-VFFGDQGP-ATQQSTSFLPI-GEKYDAYFVGVESYCIGN---SCLTQSGFQA 335
           ++ ++GS V FG   P  T +   ++P+  E Y  + + ++   +GN   +C      QA
Sbjct: 217 KDGETGSFVLFGGIDPNYTTKGIYWVPLSAETY--WQITMDRVTVGNKYVACFFTC--QA 272

Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
           +VD+G S   +P   Y       ++++    +S  G         S +++ K+PD+    
Sbjct: 273 IVDTGTSLLVMPQGAY-------NRIIKDLGVSSDG-------EISCDDISKLPDV---- 314

Query: 396 SKNQSFVVRNHIFSFP------ENEGFTVFCLTVMSTDGDYG---IIGQNFMMGHRIVFD 446
               +F +  H F+ P        +G  +     M T  + G   I+G  F+  + ++FD
Sbjct: 315 ----TFHINGHAFTLPASAYVLNEDGSCMLGFENMGTPTELGEQWILGDVFIREYYVIFD 370

Query: 447 RENLKLAWS 455
           R N K+  S
Sbjct: 371 RANNKVGLS 379


>gi|225451013|ref|XP_002284868.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
          Length = 441

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 60/196 (30%), Positives = 84/196 (42%), Gaps = 27/196 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           + T I   TP V   V +D G   LWV C          S Y S     S Y P+   SS
Sbjct: 46  YVTIISQRTPLVPLNVIVDLGGQFLWVGC---------GSNYVS-----SSYRPAQCHSS 91

Query: 170 KNV------SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           +        SC H L + R  C +    C   ++       S+G L +D+L L S     
Sbjct: 92  QCFLAHGPKSCDHCLSRGRPKCNN--GTCILFSENVFTSKVSAGDLSEDVLSLQSTDGLN 149

Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
           P+S+V     +  C  +     L G A +G+ GLG G + +P+LL+ A      F++C  
Sbjct: 150 PRSAVAIPHFLFSCAPEVLLQGLAGGA-EGIAGLGHGRIGLPTLLSSALNFTRKFAVCLP 208

Query: 282 -DENDSGSVFFGDQGP 296
                SG +FFGD GP
Sbjct: 209 PTTTSSGVIFFGD-GP 223


>gi|170091822|ref|XP_001877133.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
 gi|164648626|gb|EDR12869.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
          Length = 408

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 147/352 (41%), Gaps = 67/352 (19%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T I IG P  SF V LD GS+ LWVP   ++C  ++   +T       +YD +SSS+ 
Sbjct: 97  YFTEISIGNPPQSFKVILDTGSSNLWVPS--VKCTSIACFLHT-------KYDSASSSTF 147

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
           K       +     S +                    G++ +D+L +   +    Q   +
Sbjct: 148 KANGSEFSIHYGSGSME--------------------GFVSNDLLSIGDITIKG-QDFAE 186

Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLIQN---SFSIC 280
           +        K+ G        DG++GLG   +SV  +      +   GLI +   SF + 
Sbjct: 187 AV-------KEPGLAFAFGKFDGILGLGYDTISVNHIIPPFYSMINQGLIDSPVFSFRLG 239

Query: 281 FDENDSG-SVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVD 338
             E D G +VF G    A +   +++P+  K  AY+ V +E    GN  L      A +D
Sbjct: 240 SSEEDGGEAVFGGIDESAYKGKITYVPVRRK--AYWEVELEKVSFGNDDLELESTGAAID 297

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF-SK 397
           +G S   LPT+I AE++   +  + +K+      SW   Y     ++  +P++   F  K
Sbjct: 298 TGTSLIVLPTDI-AEML---NTQIGAKK------SWNGQYQVDCAKVPSLPELSFYFGGK 347

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFD 446
                  ++I    E +G  +   T M  +   G   IIG  F+  +  V+D
Sbjct: 348 PYPLKGTDYIL---EVQGTCISAFTGMDLNLPGGSLWIIGDAFLRRYFTVYD 396


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 83/372 (22%), Positives = 147/372 (39%), Gaps = 51/372 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++T + +GTP     + LD GS+++W     +QCAP    Y     ++   +DP  S + 
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVW-----LQCAPCRRCY----SQSDPIFDPRKSKTY 192

Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
             + CS P C+   S  C + +  C Y   Y     +   +  + +    +F ++  +  
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETL----TFRRNRVK-- 246

Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDENDS 286
               V +GCG    G ++  A   G+           S   + G      FS C  +  +
Sbjct: 247 ---GVALGCGHDNEGLFVGAAGLLGLG------KGKLSFPGQTGHRFNQKFSYCLVDRSA 297

Query: 287 ----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ-- 334
                SV FG+   A  +   F P+    K D  Y+V +    +G +    +  S F+  
Sbjct: 298 SSKPSSVVFGNA--AVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLD 355

Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
                  ++DSG S T L    Y  +   F     + + +   + +  C++ S+   +KV
Sbjct: 356 QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKV 415

Query: 389 PDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 447
           P + L F   + S    N++     N     FC     T G   IIG     G R+V+D 
Sbjct: 416 PTVVLHFRGADVSLPATNYLIPVDTNGK---FCFAFAGTMGGLSIIGNIQQQGFRVVYDL 472

Query: 448 ENLKLAWSHSKC 459
            + ++ ++   C
Sbjct: 473 ASSRVGFAPGGC 484


>gi|147821119|emb|CAN68736.1| hypothetical protein VITISV_030193 [Vitis vinifera]
          Length = 441

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 60/196 (30%), Positives = 84/196 (42%), Gaps = 27/196 (13%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           + T I   TP V   V +D G   LWV C          S Y S     S Y P+   SS
Sbjct: 46  YVTIISQRTPLVPLNVIVDLGGQFLWVGC---------GSNYVS-----SSYRPARCHSS 91

Query: 170 KNV------SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
           +        SC H L + R  C +    C   ++       S+G L +D+L L S     
Sbjct: 92  QCFLAHGPKSCDHCLSRGRPKCNN--GTCILFSENVFTSKVSAGDLSEDVLSLQSTDGLN 149

Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
           P+S+V     +  C  +     L G A +G+ GLG G + +P+LL+ A      F++C  
Sbjct: 150 PRSAVAIPHFLFSCAPEVLLQGLAGGA-EGIAGLGHGRIGLPTLLSSALNFTRKFAVCLP 208

Query: 282 -DENDSGSVFFGDQGP 296
                SG +FFGD GP
Sbjct: 209 PTTTSSGVIFFGD-GP 223


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 99/451 (21%), Positives = 174/451 (38%), Gaps = 71/451 (15%)

Query: 54  PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
           P+  + E++   L  D  R     + Q   +S+    L     +Q        Y +    
Sbjct: 34  PEVTASEFVRGALRRDMHRHARFAREQLAPSSAAAAGLTVGAPTQKDLRNGGEYIMT--- 90

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE----YDPSSSSSS 169
           + IGTP +S+    D GS+L+W      QCAP   +   + ++   +    Y+PSSS++ 
Sbjct: 91  LSIGTPPLSYRAIADTGSDLIWT-----QCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTF 145

Query: 170 KNVSCSHPLCKSRSSCKSLKDPCP-------YIADYSTEDTSSSGYLVDDILHLASFSKH 222
             + C+ PL    S C ++  P P       Y   Y T  T+     V  +      S  
Sbjct: 146 GVLPCNSPL----SMCAAMAGPSPPPGCACMYNQTYGTGWTAG----VQSVETFTFGSSS 197

Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
            P +    ++  GC    +  + +G+A  G++GLG G +S+ S L        +FS C  
Sbjct: 198 TPPAVRVPNIAFGCSNASSNDW-NGSA--GLVGLGRGSMSLVSQLGA-----GAFSYCLT 249

Query: 282 ---DENDSGSVFFGD------QGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT 329
              D N + ++  G       +G    +ST F+    K      Y++ +    +G + L 
Sbjct: 250 PFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALA 309

Query: 330 ---------QSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG----NSWK 375
                      G   L +DSG + T L    Y +V      L+ ++     G        
Sbjct: 310 IPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLD 369

Query: 376 YCYN-ASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMS-TDGDYG 431
            C+   +S     +P M L F      V  V N++       G  V+CL + + T G   
Sbjct: 370 LCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMI-----LGSGVWCLAMRNQTVGAMS 424

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
           ++G        +++D     L+++ + C  +
Sbjct: 425 MVGNYQQQNIHVLYDVRKETLSFAPAVCSSL 455


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 89/396 (22%), Positives = 146/396 (36%), Gaps = 61/396 (15%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
           ++    +GTP   FL+  D GS+L WV     +C   +A+   S   +   + P  S + 
Sbjct: 94  YFVRFRVGTPAQPFLLVADTGSDLTWV-----KCRRPAANSSESGSGSGRAFRPEDSRTW 148

Query: 170 KNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD---ILHLASFSK 221
             +SC+   C      S ++C +   PC Y  DY  +D S++   V      + L+   +
Sbjct: 149 APISCASDTCTKSLPFSLATCPTPGSPCAY--DYRYKDGSAARGTVGTESATIALSGRGR 206

Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
              ++ ++  +++GC    TG   +    DGV+ LG  DVS  S    A      FS C 
Sbjct: 207 EERKAKLK-GLVLGCTSSYTGPSFE--VSDGVLSLGYSDVSFAS--HAASRFAGRFSYCL 261

Query: 282 -----DENDSGSVFFGDQGPATQQS-----------------------TSFLPIGEKYDA 313
                  N +  + FG        S                       T  L        
Sbjct: 262 VDHLSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPF 321

Query: 314 YFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSS 364
           Y V V++  +    L          +G   ++DSG S T L    Y  VV    + L   
Sbjct: 322 YDVAVKAVSVAGQFLKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGL 381

Query: 365 KRISLQGNSWKYCYNASSEEM-LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 423
            R+++  + ++YCYN +S    + +P M + F+           +      G     L  
Sbjct: 382 PRVTM--DPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQE 439

Query: 424 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 459
               G   +IG      H   FD +N +L +  S+C
Sbjct: 440 GPWPG-ISVIGNILQQEHLWEFDIKNRRLKFQRSRC 474


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 89/406 (21%), Positives = 153/406 (37%), Gaps = 88/406 (21%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
           + +GTP   F  A+D  S+L+W  CQ C++C       Y  LD     ++P +S+S   V
Sbjct: 92  LGLGTPQHCFTAAIDTASDLIWTQCQPCVKC-------YKQLD---PVFNPVASTSYAVV 141

Query: 173 SCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
            C+   C        +R      +D C Y   Y    T + G L  D L +         
Sbjct: 142 PCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNAT-TRGILAVDRLAIG-------- 192

Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKAGLIQNSFSICFD- 282
             V   V+ GC     G    G  P   GV+GLG G +S+ S L+        F  C   
Sbjct: 193 DDVFRGVVFGCSSSSVG----GPPPQVSGVVGLGRGALSLVSQLSV-----RRFMYCLPP 243

Query: 283 --ENDSGSVFFGDQGPATQQSTS---FLPI--GEKYDA-YFVGVESYCIGNSCL------ 328
                +G +  G    AT ++ S    +P+  G +Y + Y++ ++   IG+  +      
Sbjct: 244 PVSRSAGRLVLGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRN 303

Query: 329 --------TQSG---------------------FQALVDSGASFTFLPTEIYAEVVVKFD 359
                   T +G                     +  ++D  ++ TFL   +Y E+V   +
Sbjct: 304 RMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLE 363

Query: 360 KLVSSKRISLQGNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF 416
           + +   R S        C+          +  P + L F      + +  +F   E+   
Sbjct: 364 EEIRLPRGSGSDLGLDLCFILPEGVPMSRVYAPPVSLAFEGVWLRLDKEQMFV--EDRAS 421

Query: 417 TVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
            + CL V  TDG   I+G       +++++    ++ +  + CE V
Sbjct: 422 GMMCLMVGKTDG-VSILGNYQQQNMQVMYNLRRGRITFIKTACESV 466


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 149/367 (40%), Gaps = 57/367 (15%)

Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           ++GTP  +FL+ALD  ++  W+PC  C+ C   S++ + S+          +S++ K + 
Sbjct: 95  NVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSV----------TSTTFKTLG 141

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C  P CK   +       C +   Y      S+  L  D + L+        + +     
Sbjct: 142 CDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSN--LTRDTIALS--------TDIVPGYT 191

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSV 289
            GC +K TGS +    P G++GLG G +S   L     L +++FS C       N SG++
Sbjct: 192 FGCIQKTTGSSV---PPQGLLGLGRGPLSF--LSQTQDLYKSTFSYCLPSFRTLNFSGTL 246

Query: 290 FFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVD 338
             G  G P   ++T  L    +   Y+V +    +G   +            +G   + D
Sbjct: 247 RLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFD 306

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS-K 397
           SG  FT L   +Y  V  +F K V +  +S  G  +  CY       +  P M  +FS  
Sbjct: 307 SGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG-FDTCYTGP----IVAPTMTFMFSGM 361

Query: 398 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMGHRIVFDRENLKLA 453
           N +    N +     +   +  CL + +   +      +I       HRI+FD  N ++ 
Sbjct: 362 NVTLPTDNLLI---RSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIG 418

Query: 454 WSHSKCE 460
            +   C 
Sbjct: 419 VAREPCS 425


>gi|306015413|gb|ADM76760.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015419|gb|ADM76763.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015425|gb|ADM76766.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015431|gb|ADM76769.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015433|gb|ADM76770.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015435|gb|ADM76771.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015437|gb|ADM76772.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015439|gb|ADM76773.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015441|gb|ADM76774.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015443|gb|ADM76775.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015447|gb|ADM76777.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015451|gb|ADM76779.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015453|gb|ADM76780.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015459|gb|ADM76783.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015461|gb|ADM76784.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015463|gb|ADM76785.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015465|gb|ADM76786.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015467|gb|ADM76787.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015471|gb|ADM76789.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015473|gb|ADM76790.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015477|gb|ADM76792.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015481|gb|ADM76794.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015483|gb|ADM76795.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015493|gb|ADM76800.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015495|gb|ADM76801.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015497|gb|ADM76802.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015499|gb|ADM76803.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015501|gb|ADM76804.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015503|gb|ADM76805.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015507|gb|ADM76807.1| aspartyl protease-like protein, partial [Picea sitchensis]
          Length = 114

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 8/77 (10%)

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ----SPNPLPTTEQ 487
           IIGQNFM  +R+VFDRENLKL WS S C + +D++   + P P+ Q    +  PL   +Q
Sbjct: 1   IIGQNFMTSYRLVFDRENLKLGWSPSDCYQ-LDENEGAVAPAPSPQNGWKTRTPL---QQ 56

Query: 488 QSTSNGQAAAPPSTAKT 504
           Q TS G+A AP    +T
Sbjct: 57  QQTSPGRAVAPAIAGRT 73


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 91/393 (23%), Positives = 158/393 (40%), Gaps = 80/393 (20%)

Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCI-----QCAPLSASYYTSLDRNLSEYDPS 164
           H   + +GTP     V LD GS+LLW  C  +     Q  P+              +D +
Sbjct: 107 HSLTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPV--------------FDAA 152

Query: 165 SSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
            SSS   + C   LC+    +  +C   K  C Y  DY     +++G L  +     +F 
Sbjct: 153 RSSSFSVLPCDSKLCEAGTFTNKTCTDRK--CAYENDYGI--MTATGVLATETF---TFG 205

Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
            H     V +++  GCG+   G+  + +   G++GL  G +S+   LA        FS C
Sbjct: 206 AH---HGVSANLTFGCGKLANGTIAEAS---GILGLSPGPLSMLKQLAI-----TKFSYC 254

Query: 281 ---FDENDSGSVFFG---DQGP----ATQQSTSFL--PIGEKYDAYFVGVESYCIGNSCL 328
              F +  +  V FG   D G        Q+   L  P+ + Y  Y+V +    +G+  L
Sbjct: 255 LTPFADRKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIY--YYVPMVGMSVGSKRL 312

Query: 329 ----------TQSGFQALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWK 375
                            ++DS  +  +L    + E+   V++  KL  + R     + + 
Sbjct: 313 DVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANR---SVDDYP 369

Query: 376 YCY---NASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMST--DGD 429
            C+      S E ++VP + L F  +    + R++ F  P      + CL VM    +G 
Sbjct: 370 VCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQEPSP---GMMCLAVMQAPFEGA 426

Query: 430 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 462
             +IG        +++D  N K +++ +KC+ +
Sbjct: 427 PNVIGNVQQQNMHVLYDVGNRKFSYAPTKCDSI 459


>gi|306015415|gb|ADM76761.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015421|gb|ADM76764.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015423|gb|ADM76765.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015427|gb|ADM76767.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015429|gb|ADM76768.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015445|gb|ADM76776.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015449|gb|ADM76778.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015455|gb|ADM76781.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015457|gb|ADM76782.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015469|gb|ADM76788.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015475|gb|ADM76791.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015479|gb|ADM76793.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015485|gb|ADM76796.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015487|gb|ADM76797.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015489|gb|ADM76798.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015491|gb|ADM76799.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015505|gb|ADM76806.1| aspartyl protease-like protein, partial [Picea sitchensis]
          Length = 114

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 8/77 (10%)

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ----SPNPLPTTEQ 487
           IIGQNFM  +R+VFDRENLKL WS S C + +D++   + P P+ Q    +  PL   +Q
Sbjct: 1   IIGQNFMTSYRLVFDRENLKLGWSPSDCYQ-LDENEGAVAPAPSPQNGWRTRTPL---QQ 56

Query: 488 QSTSNGQAAAPPSTAKT 504
           Q TS G+A AP    +T
Sbjct: 57  QQTSPGRAVAPAIAGRT 73


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 103/444 (23%), Positives = 171/444 (38%), Gaps = 67/444 (15%)

Query: 34  DEAKERWISKSGNVSVADSWPKKNSVEYLELL---LSNDWKRQKTRVK-LQSNNNSSRNQ 89
           +E  E+W+ K   V   D     NS ++   L   L  D KR  + ++ L S    S   
Sbjct: 127 EEGGEKWMMK---VVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRV 183

Query: 90  LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSA 148
             F ++       G+  Y++    I +G+P  S  + +D+GS+++WV CQ C QC     
Sbjct: 184 DDFGTDVISGMEQGSGEYFVR---IGVGSPPRSQYMVIDSGSDIVWVQCQPCTQC----- 235

Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
             Y   D     +DP+ S+S   VSCS  +C    +       C Y   Y  + + + G 
Sbjct: 236 --YHQSD---PVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYG-DGSYTKGT 289

Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
           L    L   +F +     ++  SV IGCG +  G ++  A   G+ G  +  V       
Sbjct: 290 LA---LETLTFGR-----TMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVG-----Q 336

Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGN 325
             G    +FS C                    S +++P+     A   Y++G+    +G 
Sbjct: 337 LGGQTGGAFSYCL------------------VSAAWVPLVRNPRAPSFYYIGLAGLGVGG 378

Query: 326 S---------CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 375
                      LT+ G   +V D+G + T LPT  Y      F    ++   +     + 
Sbjct: 379 IRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFD 438

Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 435
            CY+      ++VP +   FS      +    F  P ++  T FC     +     I+G 
Sbjct: 439 TCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGT-FCFAFAPSTSGLSILGN 497

Query: 436 NFMMGHRIVFDRENLKLAWSHSKC 459
               G +I FD  N  + +  + C
Sbjct: 498 IQQEGIQISFDGANGYVGFGPNIC 521


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 97/434 (22%), Positives = 167/434 (38%), Gaps = 78/434 (17%)

Query: 65  LLSNDWKRQKTRVK-LQS-------NNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDI 116
           LLS   +R K RV  LQS       +  +    L+  SEG      G            I
Sbjct: 48  LLSRAVRRSKARVAALQSLATTTAADAITVARILVLASEGEYLMSMG------------I 95

Query: 117 GTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
           GTP   +   LD GS+L+W  C  C+ C          +D+    +DP+ S S   + C+
Sbjct: 96  GTPPRYYSAILDTGSDLIWTQCAPCMLC----------VDQPTPFFDPAQSPSYAKLPCN 145

Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
            P+C +       ++ C Y   Y  +  +++G L ++     +F  +  + +V   +  G
Sbjct: 146 SPMCNALYYPLCYRNVCVYQYFYG-DSANTAGVLSNETF---TFGTNDTRVTVP-RIAFG 200

Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFG 292
           CG    GS  +G+   G++G G G +S+ S L         FS C   F       ++FG
Sbjct: 201 CGNLNAGSLFNGS---GMVGFGRGPLSLVSQLGSP-----RFSYCLTSFMSPVPSRLYFG 252

Query: 293 DQGPATQ---------QSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSG 332
                           QST F+        Y++ +    +G   L               
Sbjct: 253 AYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGT 312

Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVS---SKRISLQGNSWKYCY--NASSEEMLK 387
              ++DSG++ T+L    Y  V   F   V    +   SL  +    C+       +++ 
Sbjct: 313 GGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSL-ADVLDTCFVWPPPPRKIVT 371

Query: 388 VPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 446
           +P++   F   N    + N++    +       CL + ++D D  IIG        +++D
Sbjct: 372 MPELAFHFEGANMELPLENYMLIDGDTGN---LCLAIAASD-DGSIIGSFQHQNFHVLYD 427

Query: 447 RENLKLAWSHSKCE 460
            EN  L+++ + C 
Sbjct: 428 NENSLLSFTPATCN 441


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 85/366 (23%), Positives = 140/366 (38%), Gaps = 49/366 (13%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           I +GTP   F V  D GS+  WV     QC P   S Y   DR    +DP+ SS+  NVS
Sbjct: 167 IGLGTPPSRFTVVFDTGSDTTWV-----QCRPCVVSCYKQKDR---LFDPAKSSTYANVS 218

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
           C+ P C    +       C Y   Y  + + + G+   D L +A       Q +++    
Sbjct: 219 CADPACADLDASGCNAGHCLYGIQYG-DGSYTVGFFAKDTLAVA-------QDAIK-GFK 269

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFF- 291
            GCG K  G +   A   G++GLG G  S+      K G    SFS C   + + + +  
Sbjct: 270 FGCGEKNRGLFGQTA---GLLGLGRGPTSITVQAYEKYG---GSFSYCLPASSAATGYLE 323

Query: 292 ----GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN--------SCLTQSGFQALVDS 339
                     +   T+ +   +    Y+VG+    +G         S  + SG   LVDS
Sbjct: 324 FGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSG--TLVDS 381

Query: 340 GASFTFLP--TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
           G   T LP                 S  + +   +    CY+ +    + +P + L+F  
Sbjct: 382 GTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQG 441

Query: 398 NQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFDRENLKLA 453
                +     +++  +++     CL   S   D   GI+G      + +++D     + 
Sbjct: 442 GACLDLDASGIVYAISQSQ----VCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVG 497

Query: 454 WSHSKC 459
           ++   C
Sbjct: 498 FAPGAC 503


>gi|306015417|gb|ADM76762.1| aspartyl protease-like protein, partial [Picea sitchensis]
          Length = 114

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 8/77 (10%)

Query: 432 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ----SPNPLPTTEQ 487
           IIGQNFM  +R+VFDRENLKL WS S C + +D++   + P P+ Q    +  PL   +Q
Sbjct: 1   IIGQNFMTSYRLVFDRENLKLGWSPSDCYQ-LDENEGAVAPAPSPQNGWRTRTPL---QQ 56

Query: 488 QSTSNGQAAAPPSTAKT 504
           Q TS G+A AP    +T
Sbjct: 57  QQTSPGRAVAPAIAGRT 73


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 89/364 (24%), Positives = 142/364 (39%), Gaps = 52/364 (14%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSE-YDPSSSSSSKN 171
           + IGTP       +D GS+L+W+ C  C  C          LD +    +   +SSS K 
Sbjct: 9   LSIGTPPQLIPAMIDTGSDLVWLKCDNCDHC---------DLDHHGETIFFSDASSSYKK 59

Query: 172 VSCSHPLCKSRSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
           + C+   C   SS       ++ C Y  +Y  + + +SG +  D +   S        S 
Sbjct: 60  LPCNSTHCSGMSSAGIGPRCEETCKYKYEYG-DGSRTSGDVGSDRISFRSHGAGEDHRSF 118

Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-- 286
               + GC RK  G   D     G++GLG    S+   L     +   FS C    DS  
Sbjct: 119 FDGFLFGCARKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCLVSYDSPP 173

Query: 287 ---GSVFFGDQGPATQQSTSFLPI--GEKYDA--YFVGVESYCIGNSCLT----QSGF-- 333
                +F G             PI  G+  D   Y+V ++S  IG   +     +SG   
Sbjct: 174 SAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNT 233

Query: 334 --------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASSE 383
                   + ++DSG ++T L   +Y  +    ++ V    +   GNS     C+N+S +
Sbjct: 234 SVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTL---GNSAGLDLCFNSSGD 290

Query: 384 EMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 442
                P +   F+     V+   +IF     +   V CL++ S+ GD  IIG        
Sbjct: 291 TSYGFPSVTFYFANQVQLVLPFENIFQVTSRD---VVCLSMDSSGGDLSIIGNMQQQNFH 347

Query: 443 IVFD 446
           I++D
Sbjct: 348 ILYD 351


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 93/430 (21%), Positives = 163/430 (37%), Gaps = 73/430 (16%)

Query: 74  KTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF---LVALDAG 130
           K  ++L +   +   +LL P  G      G   Y +    + IGTP        V  D G
Sbjct: 92  KKEIQLATAIAAGDKKLLVPLYGRPQ---GGSTYLVQ---LRIGTPTDRISPRYVLFDTG 145

Query: 131 SNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK 189
           S+L W  C+ C  C+  +             +DPS S + + +SC  P+C+    C ++ 
Sbjct: 146 SDLSWTQCEPCTNCSSFTP---------YPPHDPSKSRTFRRLSCFDPMCE---LCTAVV 193

Query: 190 D------PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
           D       C +   Y  +  + SG LV D+ H  + +       ++  V  GC   +   
Sbjct: 194 DGGGGSAGCLFRRRYG-DGGAVSGELVSDVFHFGA-AGDGGGYQLERDVAFGCAHVEDSK 251

Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---------------DENDSGS 288
            + G +  G++ LG+G    PS + + G+  + FS C                +E  +  
Sbjct: 252 AVRGYS-TGILALGIGK---PSFVTQLGV--DRFSYCIPASEITDDDDDDDDDEERSASF 305

Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVG--------------VESYCIGNSCLTQSGFQ 334
           + FG     T +   F   G  Y                    V  Y  G      +   
Sbjct: 306 LRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMP 363

Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVS-SKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
            LVDSG +  +LP  ++  +  + ++ +S ++R  L   S  YCY  +  ++  V  + L
Sbjct: 364 MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSL-YCYLGNMTDVEAV-SVTL 421

Query: 394 IFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 452
            F       +    +F   EN      CL V +  G+  I+G        + +D   +++
Sbjct: 422 GFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINVGYDLSTMEI 479

Query: 453 AWSHSKCEEV 462
           A+   +C+ V
Sbjct: 480 AFDRDQCDRV 489


>gi|331241311|ref|XP_003333304.1| hypothetical protein PGTG_14224 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309312294|gb|EFP88885.1| hypothetical protein PGTG_14224 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 390

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 70/362 (19%), Positives = 145/362 (40%), Gaps = 59/362 (16%)

Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
           + IGTP V  ++  D GS+ LWV    ++ +  +AS +         YDP  SS++K V 
Sbjct: 73  VTIGTPGVEIMLDFDTGSSDLWVWSSDLKASQPTASGHVV-------YDPKKSSTAKEV- 124

Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
                            P         + +S+SG +  D + +     H  +  ++    
Sbjct: 125 -----------------PGGTWKISYGDGSSASGVIYRDDIKIGDL--HCSEQGIE---- 161

Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVS---------VPSLLAKAGLIQNSFSICFDEN 284
               +K + S+L+    DG++GL    ++         + +++ K+   Q  F++C   +
Sbjct: 162 --VAQKLSSSFLNSQGSDGLLGLAWPQINTANPQQKTPMQNMIEKSITDQGLFTVCLKHD 219

Query: 285 DSGSVFF------GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVD 338
             G  F+       ++   +++  ++ PI  K   +        IG+  +  SG  A+ D
Sbjct: 220 TDGKGFYSFGTICAEEAGVSEKDIAYAPIDNKQGFWAFESTKAKIGDEEIELSGNTAIAD 279

Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
           +G +   +   + A +  +    V  +    QG    Y Y   ++    VPD++L   + 
Sbjct: 280 TGTTLALVSEAVTAALYKQIPGAVLDRS---QGG---YVYPVDAQ----VPDVQLAVGEK 329

Query: 399 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 458
              +    +   P  +G  VF       +  + I+G  F+     VFD++N+++ ++   
Sbjct: 330 MYTIPGKSLAYGPPEKGM-VFGGIQSRGNNPFDILGDTFLKSVYAVFDQKNVRIGFAQRN 388

Query: 459 CE 460
            +
Sbjct: 389 AK 390


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.131    0.395 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,533,453,092
Number of Sequences: 23463169
Number of extensions: 363188553
Number of successful extensions: 1309411
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 273
Number of HSP's successfully gapped in prelim test: 2779
Number of HSP's that attempted gapping in prelim test: 1303376
Number of HSP's gapped (non-prelim): 4082
length of query: 538
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 390
effective length of database: 8,886,646,355
effective search space: 3465792078450
effective search space used: 3465792078450
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)