BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 039412
         (433 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  735 bits (1898), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/435 (83%), Positives = 394/435 (90%), Gaps = 6/435 (1%)

Query: 1   MKPQLVFFLAFLFLFSLSEG--LNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESV 58
           MK  L F LAFLF F+L++G  LNP C  QD  S LQVFHV+SPCSPF PSKPL WEESV
Sbjct: 1   MKTHL-FSLAFLF-FTLAQGMHLNPKCGIQDQGSNLQVFHVYSPCSPFWPSKPLKWEESV 58

Query: 59  LEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
           L+M AKDQARLQFLSSL VARKSVVPIASGRQI QSPTYIVRAKIGTPAQT+L+AMDTSN
Sbjct: 59  LQMQAKDQARLQFLSSL-VARKSVVPIASGRQIVQSPTYIVRAKIGTPAQTMLLAMDTSN 117

Query: 119 DAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI 178
           DAAW+PC+GCVGCSSTVFN+ +STTFK +GC+A QCKQVPN  CGG ACAFN+TYGSS+I
Sbjct: 118 DAAWIPCSGCVGCSSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSI 177

Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
           AANLSQD ++LATD +P YTFGC+ +ATG+S+PPQGLLGLGRG +SLL+QTQNLYQSTFS
Sbjct: 178 AANLSQDVVTLATDSIPSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFS 237

Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           YCLPSF++L+FSGSLRLGP+GQPKRIK TPLLKNPRRSSLYYVNL+AIRVGRRVVDIPP 
Sbjct: 238 YCLPSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPS 297

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV 358
           AL FNPTTGAGTI DSGTVFTRLVAPAYTAVRD FR+RVG N TVTSLGGFDTCY+ PIV
Sbjct: 298 ALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVG-NATVTSLGGFDTCYTSPIV 356

Query: 359 APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           APTIT MFSGMNVTLP DNLLIHSTA SITCLAMAAAPDNVNSVLNVIANMQQQNHRIL+
Sbjct: 357 APTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILF 416

Query: 419 DVPNSRLGVARELCT 433
           DVPNSRLGVARE CT
Sbjct: 417 DVPNSRLGVAREPCT 431


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  678 bits (1750), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/419 (80%), Positives = 374/419 (89%), Gaps = 5/419 (1%)

Query: 19  EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
           EGL P CDTQDH STL+VFHVFSPCSPF+P KPLSW ESVL++ AKDQARLQFL+S+ VA
Sbjct: 21  EGLTPKCDTQDHGSTLEVFHVFSPCSPFRPPKPLSWAESVLQLQAKDQARLQFLASM-VA 79

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
            +SVVPIASGRQI QSPTYIVRAKIG+P QTLL+AMDTSNDAAW+PCT C GC+ST+F  
Sbjct: 80  GRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTSTLFAP 139

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
            +STTFKN+ C + QC QVPNP+CG  AC FNLTYGSS+IAAN+ QDT++LATD +P YT
Sbjct: 140 EKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIAANVVQDTVTLATDPIPDYT 199

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 200 FGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 259

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
            QP RIKYTPLLKNPRRSSLYYVNL+AIRVGR+VVDIPP AL FN  TGAGT+ DSGTVF
Sbjct: 260 AQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVF 319

Query: 319 TRLVAPAYTAVRDVFRRRVG----SNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLP 374
           TRLVAPAYTAVRD F+RRV     +NLTVTSLGGFDTCY+VPIVAPTIT MFSGMNVTLP
Sbjct: 320 TRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPIVAPTITFMFSGMNVTLP 379

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           +DN+LIHSTAGS TCLAMA+APDNVNSVLNVIANMQQQNHR+LYDVPNSRLGVARELCT
Sbjct: 380 EDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCT 438


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score =  674 bits (1739), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/419 (79%), Positives = 374/419 (89%), Gaps = 5/419 (1%)

Query: 19  EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
           +GL P CDTQDH STL+VFHVFSPCSPF+PSKPLSW ESVL++ AKDQARLQFL+S+ VA
Sbjct: 20  QGLTPKCDTQDHGSTLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLASM-VA 78

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
            +S+VPIASGRQI QSPTYIVRAKIGTP QTLL+A+DTSNDAAW+PCT C GC+ST+F  
Sbjct: 79  GRSIVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTSTLFAP 138

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
            +STTFKN+ C + +C +VP+P+CG  AC FNLTYGSS+IAAN+ QDT++LATD +PGYT
Sbjct: 139 EKSTTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSSIAANVVQDTVTLATDPIPGYT 198

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 199 FGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 258

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
            QP RIKYTPLLKNPRRSSLYYVNL AIRVGR++VDIPP AL FN  TGAGT+ DSGTVF
Sbjct: 259 AQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVF 318

Query: 319 TRLVAPAYTAVRDVFRRRVG----SNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLP 374
           TRLVAP YTAVRD FRRRV     +NLTVTSLGGFDTCY+VPIVAPTIT MFSGMNVTLP
Sbjct: 319 TRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPIVAPTITFMFSGMNVTLP 378

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           QDN+LIHSTAGS +CLAMA+APDNVNSVLNVIANMQQQNHR+LYDVPNSRLGVARELCT
Sbjct: 379 QDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCT 437


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score =  673 bits (1737), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/433 (78%), Positives = 381/433 (87%), Gaps = 9/433 (2%)

Query: 9   LAFLFLFSLSEGL-NPICDT---QDHS-STLQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
           L    LF++++GL NP CD     DH  STLQVFHVFSPCSPF+PSKP+SWEESVL++ A
Sbjct: 6   LVLFLLFTIAKGLHNPKCDATHQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLKLQA 65

Query: 64  KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
           KDQAR+Q+LSSL VAR+S+VPIASGRQITQSPTYIV+AKIGTPAQTLL+AMDTSNDA+WV
Sbjct: 66  KDQARMQYLSSL-VARRSIVPIASGRQITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWV 124

Query: 124 PCTGCVGCSSTV-FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANL 182
           PCT CVGCS+T  F  A+STTFK +GC A+QCKQV NPTC G ACAFN TYG+S++AA+L
Sbjct: 125 PCTACVGCSTTTPFAPAKSTTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSSVAASL 184

Query: 183 SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
            QDT++LATD VP Y FGCIQK TG+SVPPQGLLGLGRG LSLLAQTQ LYQSTFSYCLP
Sbjct: 185 VQDTVTLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLP 244

Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
           SFK L+FSGSLRLGP+ QPKRIK+TPLLKNPRRSSLYYVNL+AIRVGRR+VDIPP AL F
Sbjct: 245 SFKTLNFSGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAF 304

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVPIVAP 360
           N  TGAGT+ DSGTVFTRLV PAY AVR+ FRRR+     LTVTSLGGFDTCY+ PIVAP
Sbjct: 305 NANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYTAPIVAP 364

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           TIT MFSGMNVTLP DN+LIHSTAGS+TCLAMA APDNVNSVLNVIANMQQQNHR+L+DV
Sbjct: 365 TITFMFSGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDV 424

Query: 421 PNSRLGVARELCT 433
           PNSRLGVARELCT
Sbjct: 425 PNSRLGVARELCT 437


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score =  672 bits (1733), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/411 (83%), Positives = 378/411 (91%), Gaps = 1/411 (0%)

Query: 23  PICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV 82
           P C+T D  STLQV HV+SPCSPF+P +PLSWEESVL+M AKD+ARLQFLSSL VARKSV
Sbjct: 28  PNCETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSL-VARKSV 86

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
           VPIASGRQI Q+PTYIVRAKIGTPAQT+LMAMDTS+D AW+PC GC+GCSST+FNS  ST
Sbjct: 87  VPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPAST 146

Query: 143 TFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI 202
           T+K+LGCQAAQCKQVP PTCGGG C+FNLTYG S++AANLSQDTI+LATD VPGY+FGCI
Sbjct: 147 TYKSLGCQAAQCKQVPKPTCGGGVCSFNLTYGGSSLAANLSQDTITLATDAVPGYSFGCI 206

Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK 262
           QKATG S+P QGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+GQPK
Sbjct: 207 QKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPK 266

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
           RIKYTPLLKNPRR SLY+VNL+A+RVGRRVVD+PPG+  FNP+TGAGTI DSGTVFTRLV
Sbjct: 267 RIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLV 326

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHS 382
            PAY AVRD FR RVG NLTVTSLGGFDTCY+VPI APTIT MF+GMNVTLP DNLLIHS
Sbjct: 327 TPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTVPIAAPTITFMFTGMNVTLPPDNLLIHS 386

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           TAGS TCLAMAAAPDNVNSVLNVIAN+QQQNHR+LYDVPNSRLGVARELCT
Sbjct: 387 TAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 437


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score =  663 bits (1710), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/406 (82%), Positives = 360/406 (88%), Gaps = 1/406 (0%)

Query: 19  EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
           +GLNP CD QD+ STLQV HVFSPCSPF+PSKPLSWEESVL+M AKD  RLQFL SL VA
Sbjct: 16  QGLNPKCDVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSL-VA 74

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
           RKS+VPIASGRQI QSPTYIVRAKIGTP QTLL+AMDTSNDAAW+PCT C GC+ST+F  
Sbjct: 75  RKSIVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAP 134

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
            +STTFKN+ C A +CKQVPNP CG  +  FNLTYGSS+IAANL QDTI+LATD VP YT
Sbjct: 135 EKSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSIAANLVQDTITLATDPVPSYT 194

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 195 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 254

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
            QPKRIKYTPLLKNPRRSSLYYVNL AIRVGR+VVDIPP AL FNPTTGAGTI DSGTVF
Sbjct: 255 AQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVF 314

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
           TRLVAP Y AVRD FRRRVG  LTVTSLGGFDTCY+VPIV PTIT +F+GMNVTLPQDN+
Sbjct: 315 TRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVPIVVPTITFIFTGMNVTLPQDNI 374

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           LIHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+LYDVPNSR
Sbjct: 375 LIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSR 420


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score =  662 bits (1709), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/425 (80%), Positives = 378/425 (88%), Gaps = 15/425 (3%)

Query: 23  PICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV 82
           P C+T D  STLQV HV+SPCSPF+P +PLSWEESVL+M AKD+ARLQFLSSL VARKSV
Sbjct: 28  PNCETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSL-VARKSV 86

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
           VPIASGRQI Q+PTYIVRAKIGTPAQT+LMAMDTS+D AW+PC GC+GCSST+FNS  ST
Sbjct: 87  VPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPAST 146

Query: 143 TFKNLGCQAAQCKQV--------------PNPTCGGGACAFNLTYGSSTIAANLSQDTIS 188
           T+K+LGCQAAQCKQV              P PTCGGG C+FNLTYG S++AANLSQDTI+
Sbjct: 147 TYKSLGCQAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLAANLSQDTIT 206

Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
           LATD VPGY+FGCIQKATG S+P QGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+
Sbjct: 207 LATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN 266

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           FSGSLRLGP+GQPKRIKYTPLLKNPRR SLY+VNL+A+RVGRRVVD+PPG+  FNP+TGA
Sbjct: 267 FSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA 326

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG 368
           GTI DSGTVFTRLV PAY AVRD FR RVG NLTVTSLGGFDTCY+VPI APTIT MF+G
Sbjct: 327 GTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTVPIAAPTITFMFTG 386

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
           MNVTLP DNLLIHSTAGS TCLAMAAAPDNVNSVLNVIAN+QQQNHR+LYDVPNSRLGVA
Sbjct: 387 MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVA 446

Query: 429 RELCT 433
           RELCT
Sbjct: 447 RELCT 451


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score =  662 bits (1708), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/433 (78%), Positives = 378/433 (87%), Gaps = 8/433 (1%)

Query: 1   MKPQLVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
           MK  L F LAFLFL SL +GLN    T+   +T++VFHV+SP SPF+PSKP+SWE+SVL+
Sbjct: 1   MKAYL-FSLAFLFL-SLVQGLN----TRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQ 54

Query: 61  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
           MLA+DQARLQFLSSL V RKS VPIASGRQI QSPTYIV+A +GTPAQT LMA+DTSNDA
Sbjct: 55  MLAEDQARLQFLSSL-VGRKSWVPIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDA 113

Query: 121 AWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAA 180
           AW+PC GCVGCSSTVFNS  STTFK LGC A QCKQVPNPTCGG  C +N TYG STI +
Sbjct: 114 AWIPCNGCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILS 173

Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
           NL++DTI+L+TDIVPGYTFGCIQK TG+SVPPQGLLGLGRG LS L+QTQ+LY+STFSYC
Sbjct: 174 NLTRDTIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYC 233

Query: 241 LPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
           LPSF+ L+FSG+LRLGP GQP RIK TPLLKNPRRSSLYYVNL+ IRVGR++VDIP  AL
Sbjct: 234 LPSFRTLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASAL 293

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAP 360
            FNPTTGAGTI DSGTVFTRLVAP YTAVRD FR+RVG N  V+SLGGFDTCY+ PIVAP
Sbjct: 294 AFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVG-NAIVSSLGGFDTCYTGPIVAP 352

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           T+T MFSGMNVTLP DNLLI STAGS +CLAMAAAPDNVNSVLNVIANMQQQNHRIL+DV
Sbjct: 353 TMTFMFSGMNVTLPTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDV 412

Query: 421 PNSRLGVARELCT 433
           PNSR+GVARE C+
Sbjct: 413 PNSRIGVAREPCS 425


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score =  661 bits (1706), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/428 (78%), Positives = 376/428 (87%), Gaps = 7/428 (1%)

Query: 6   VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
           +F LAFLFL SL +GLN    T+   +T++VFHV+SP SPF+PSKP+SWE+SVL+MLA+D
Sbjct: 5   LFSLAFLFL-SLVQGLN----TRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAED 59

Query: 66  QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
           QARLQFLSSL V RKS VPIASGRQI QSPTYIV+A +GTPAQT LMA+DTSNDAAW+PC
Sbjct: 60  QARLQFLSSL-VGRKSWVPIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPC 118

Query: 126 TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQD 185
            GCVGCSSTVFNS  STTFK LGC A QCKQVPNPTCGG  C +N TYG STI +NL++D
Sbjct: 119 NGCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSNLTRD 178

Query: 186 TISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK 245
           TI+L+TDIVPGYTFGCIQK TG+SVPPQGLLGLGRG LS L+QTQ+LY+STFSYCLPSF+
Sbjct: 179 TIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFR 238

Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
            L+FSG+LRLGP GQP RIK TPLLKNPRRSSLYYVNL+ IRVGR++VDIP  AL FNPT
Sbjct: 239 TLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPT 298

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLM 365
           TGAGTI DSGTVFTRLVAP YTAVRD FR+RVG N  V+SLGGFDTCY+ PIVAPT+T M
Sbjct: 299 TGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVG-NAIVSSLGGFDTCYTGPIVAPTMTFM 357

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           FSGMNVTLP DNLLI STAGS +CLAMAAAPDNVNSVLNVIANMQQQNHRIL+DVPNSR+
Sbjct: 358 FSGMNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRI 417

Query: 426 GVARELCT 433
           GVARE C+
Sbjct: 418 GVAREPCS 425


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/406 (78%), Positives = 370/406 (91%), Gaps = 2/406 (0%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG 88
           D SSTLQVFH+FSPCSPF+PSKPLSW ++VL+M AKDQARLQFLSSL VAR+S VPIAS 
Sbjct: 36  DRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSL-VARRSFVPIASA 94

Query: 89  RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNL 147
           RQ+ QSPT++VRAKIGTPAQTLL+A+DTSNDAAW+PC+GC+GC S+TVF+S +S++F+ L
Sbjct: 95  RQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPL 154

Query: 148 GCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG 207
            CQ+ QC QVPNP+C G AC FNLTYGSST+AA+L QD ++LATD VP YTFGCI+KATG
Sbjct: 155 PCQSPQCNQVPNPSCSGSACGFNLTYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATG 214

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYT 267
           +SVPPQGLLGLGRG LSLL Q+Q+LYQSTFSYCLPSFK+++FSGSLRLGP+ QP RIKYT
Sbjct: 215 SSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYT 274

Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
           PLL+NPRRSSLYYVNL++IRVGR++VDIPP AL FN  TGAGT+IDSGT FTRLVAPAYT
Sbjct: 275 PLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYT 334

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
           AVRD FRRRVG N+TV+SLGGFDTCY+VPI++PTIT MF+GMNVTLP DN LIHSTAGS 
Sbjct: 335 AVRDEFRRRVGRNVTVSSLGGFDTCYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGST 394

Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           TCLAMAAAPDNVNSVLNVIA+MQQQNHRIL+D+PNSR+GVARE C+
Sbjct: 395 TCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCS 440


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  648 bits (1671), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/427 (75%), Positives = 372/427 (87%), Gaps = 3/427 (0%)

Query: 8   FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
           FL  LF+ SL +   P CD QD  STL+VFH+FS CSPFKPSKP+SWEESVL + AKDQA
Sbjct: 10  FLLCLFI-SLVQAQTPKCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQA 68

Query: 68  RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
           R+Q+ SSL VARKSVVPIAS RQI QSPTYIV+AK GTP QTLL+A+DTS+DAAW+PC+G
Sbjct: 69  RMQYFSSL-VARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSG 127

Query: 128 CVGCS-STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDT 186
           CVGCS S  F   +ST+F+N+ C +  CKQVPNPTCGG ACAFN TYGSS+IAA++ QDT
Sbjct: 128 CVGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIAASVVQDT 187

Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           ++LATD +PGYTFGC+ K TG+S P QGLLGLGRG LSLL+Q+QNLY+STFSYCLPSFK+
Sbjct: 188 LTLATDPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKS 247

Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
           ++FSGSLRLGP+ QPKRIKYTPLL+NPRRSSLYYVNL+AI+VGR++VDIPP AL FNPTT
Sbjct: 248 INFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTT 307

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMF 366
           GAGTI DSGTVFTRL  P YTAVR+ FRRRVG  L VT+LGGFDTCY+VPIV PTIT +F
Sbjct: 308 GAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVPIVVPTITFLF 367

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           SGMNVTLP DN++IHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+L+DVPNSR+G
Sbjct: 368 SGMNVTLPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIG 427

Query: 427 VARELCT 433
           +ARELCT
Sbjct: 428 IARELCT 434


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score =  645 bits (1664), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/424 (78%), Positives = 369/424 (87%), Gaps = 10/424 (2%)

Query: 18  SEGL-NPICDT---QDHS-STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL 72
           ++GL NP CD     DH  STLQVFHVFSPCSPF+PSKP+SWEESVL++ AKDQAR+Q+L
Sbjct: 23  AKGLHNPKCDAAYQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLQLQAKDQARMQYL 82

Query: 73  SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS 132
           S+L VAR+S+VPIASGRQITQSPTYIVRAK GTPAQTLL+AMDTSNDAAWVPCT CVGCS
Sbjct: 83  SNL-VARRSIVPIASGRQITQSPTYIVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGCS 141

Query: 133 STV-FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLAT 191
           +T  F   +STTFK +GC A+QCKQV NPTC G ACAFN TYG+S++AA+L QDT++LAT
Sbjct: 142 TTTPFAPPKSTTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSSVAASLVQDTVTLAT 201

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
           D VP YTFGCIQKATG+S+PPQGLLGLGRG LSLLAQTQ LYQSTFSYCLPSFK L+FSG
Sbjct: 202 DPVPAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSG 261

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
              L P+ QP+   Y P  KNPRRSSLYYVNL+AIRVGRR+VDIPP AL FNP TGAGT+
Sbjct: 262 HXDLXPVAQPRDQVY-PSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTV 320

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVPIVAPTITLMFSGM 369
            DSGTVFTRLV PAYTAVR+ FRRRV     LTVTSLGGFDTCY+VPIVAPTIT MFSGM
Sbjct: 321 FDSGTVFTRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTVPIVAPTITFMFSGM 380

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           NVTLP DN+LIHSTAGS+TCLAMA APDNVNSVLNVIANMQQQNHR+L+DVPNSRLGVAR
Sbjct: 381 NVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVAR 440

Query: 430 ELCT 433
           ELCT
Sbjct: 441 ELCT 444


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score =  644 bits (1662), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/427 (75%), Positives = 370/427 (86%), Gaps = 3/427 (0%)

Query: 8   FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
           FL  LF+ SL +   P CD QD  STL+VFH+FS CSPFKPSKP+SWEESVL + AKDQA
Sbjct: 10  FLLCLFI-SLVQAQTPKCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQA 68

Query: 68  RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
           R+Q+ SSL VARKSVVPIAS RQI QSPTYIV+AK GTP QTLL+A+DTS+DAAW+PC+G
Sbjct: 69  RMQYFSSL-VARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSG 127

Query: 128 CVGCS-STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDT 186
           CVGCS S  F   +ST+F+N+ C +  CKQVPNPTCGG ACAFN TYGSS+IAA++ QDT
Sbjct: 128 CVGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIAASVVQDT 187

Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           ++LA D +PGYTFGC+ K TG+S P QGLLGLGRG LSLL+Q+QNLY+STFSYCLPSFK+
Sbjct: 188 LTLAADPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKS 247

Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
           ++FSGSLRLGP+ QPKRIKYTPLL+NPRRSSLYYVNL+AI+VGR++VDIPP AL FNPTT
Sbjct: 248 INFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTT 307

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMF 366
           GAGTI DSGTVFTRL  P YTAVR+ FRRRVG  L VT+LGGFDTCY+VPIV PTIT +F
Sbjct: 308 GAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVPIVVPTITFLF 367

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           SGMNV LP DN++IHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+L+DVPNSR+G
Sbjct: 368 SGMNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIG 427

Query: 427 VARELCT 433
           +ARELCT
Sbjct: 428 IARELCT 434


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score =  641 bits (1654), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/415 (79%), Positives = 355/415 (85%), Gaps = 16/415 (3%)

Query: 19  EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
           +GLNP CD QD+ STLQV HVF               +SVL+M AKD  RLQFL SL VA
Sbjct: 16  QGLNPKCDVQDNGSTLQVIHVF---------------KSVLQMQAKDTTRLQFLDSL-VA 59

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
           RKSVVPIASGRQI QSPTYIVRAKIGTP QTLL+AMDTSNDAAW+PCT C GC+ST+F  
Sbjct: 60  RKSVVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAP 119

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
            +STTFKN+ C A +CKQVPNP CG  +C FNLTYGSS+IAANL QDTI+LATD VP YT
Sbjct: 120 EKSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSSIAANLVQDTITLATDPVPSYT 179

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 180 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 239

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
            QPKRIKYTPLLKNPRRSSLYYVNL AIRVGR+VVDIPP AL FNPTTGAGTI DSGTVF
Sbjct: 240 AQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVF 299

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
           TRLVAP Y AVRD FRRRVG  LTVTSLGGFDTCY+VPIV PTIT +F+GMNVTLPQDN+
Sbjct: 300 TRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVPIVVPTITFIFTGMNVTLPQDNI 359

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           LIHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+LYDVPNSR+GVARELCT
Sbjct: 360 LIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELCT 414


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 317/373 (84%), Positives = 346/373 (92%), Gaps = 1/373 (0%)

Query: 61  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
           M AKD+ARLQFLSSL VARKSVVPIASGRQI Q+PTYIVRAKIGTPAQT+LMAMDTS+D 
Sbjct: 1   MQAKDKARLQFLSSL-VARKSVVPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDV 59

Query: 121 AWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAA 180
           AW+PC GC+GCSST+FNS  STT+K+LGCQAAQCKQVP PTCGGG C+FNLTYG S++AA
Sbjct: 60  AWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFNLTYGGSSLAA 119

Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
           NLSQDTI+LATD VPGY+FGCIQKATG S+P QGLLGLGRG LSLL+QTQNLYQSTFSYC
Sbjct: 120 NLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYC 179

Query: 241 LPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
           LPSFK+L+FSGSLRLGP+GQPKRIKYTPLLKNPRR SLY+VNL+A+RVGRRVVD+PPG+ 
Sbjct: 180 LPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSF 239

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAP 360
            FNP+TGAGTI DSGTVFTRLV PAY AVRD FR RVG NLTVTSLGGFDTCY+VPI AP
Sbjct: 240 TFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTVPIAAP 299

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           TIT MF+GMNVTLP DNLLIHSTAGS TCLAMAAAPDNVNSVLNVIAN+QQQNHR+LYDV
Sbjct: 300 TITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDV 359

Query: 421 PNSRLGVARELCT 433
           PNSRLGVARELCT
Sbjct: 360 PNSRLGVARELCT 372


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score =  602 bits (1553), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 295/372 (79%), Positives = 328/372 (88%), Gaps = 2/372 (0%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           +AKDQARLQFLSSL VA+KSVVPIASGR + QSP+YIV+AK+GTP QTLLMA+D S DAA
Sbjct: 1   MAKDQARLQFLSSL-VAKKSVVPIASGRGVIQSPSYIVKAKVGTPPQTLLMALDNSYDAA 59

Query: 122 WVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN 181
           W+PC GCVGCSSTVFN+ +STTFK LGC A QCKQVPNP CGG  C +N TYGSSTI +N
Sbjct: 60  WIPCKGCVGCSSTVFNTVKSTTFKTLGCGAPQCKQVPNPICGGSTCTWNTTYGSSTILSN 119

Query: 182 LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
           L++DTI+L+ D VP Y FGCIQKATG+SVPPQGLLG GRG LS L+QTQNLY+STFSYCL
Sbjct: 120 LTRDTIALSMDPVPYYAFGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCL 179

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
           PSF+ L+FSGSLRLGP+GQP RIK TPLLKNPRRSSLYYV L  IRVGR++VDIP  AL 
Sbjct: 180 PSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALA 239

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPT 361
           FNPTTGAGTI DSGTVFTRLVAPAY AVR+ FR+RVG N TV+SLGGFDTCYSVPIV PT
Sbjct: 240 FNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVG-NATVSSLGGFDTCYSVPIVPPT 298

Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           IT MFSGMNVT+P +NLLIHSTAG  +CLAMAAAPDNVNSVLNVIA+MQQQNHRIL+DVP
Sbjct: 299 ITFMFSGMNVTMPPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVP 358

Query: 422 NSRLGVARELCT 433
           NSRLGVARE C+
Sbjct: 359 NSRLGVAREQCS 370


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score =  590 bits (1520), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 301/437 (68%), Positives = 359/437 (82%), Gaps = 9/437 (2%)

Query: 5   LVFFLAFLFLFSLSEGLN-PICD---TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
           LV FL    +  L+ GLN P CD   TQD  STL++FH+ SPCSPFK S PLSWE  VL+
Sbjct: 4   LVLFLQLFSILPLALGLNHPNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQ 63

Query: 61  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
            LA+DQARLQ+LSSL VA +SVVPIASGRQ+ QS TYIV+A IGTPAQ LL+AMDTS+D 
Sbjct: 64  TLAQDQARLQYLSSL-VAGRSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDV 122

Query: 121 AWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA 179
           AW+PC+GCVGC S+T F+ A+ST+FKN+ C A QCKQVPNPTCG  AC+FNLTYGSS+IA
Sbjct: 123 AWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSIA 182

Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTF 237
           ANLSQDTI LA D +  +TFGC+ K  G     PPQGLLGLGRG LSL++Q Q++Y+STF
Sbjct: 183 ANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTF 242

Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
           SYCLPSF++L+FSGSLRLGP  QP+R+KYT LL+NPRRSSLYYVNL+AIRVGR+VVD+PP
Sbjct: 243 SYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPP 302

Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP 356
            A+ FNP+TGAGTI DSGTV+TRL  P Y AVR+ FR+RV  +   VTSLGGFDTCYS  
Sbjct: 303 AAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQ 362

Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           +  PTIT MF G+N+T+P DNL++HSTAGS +CLAMAAAP+NVNSV+NVIA+MQQQNHR+
Sbjct: 363 VKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRV 422

Query: 417 LYDVPNSRLGVARELCT 433
           L DVPN RLG+ARE C+
Sbjct: 423 LIDVPNGRLGLARERCS 439


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score =  589 bits (1519), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 301/437 (68%), Positives = 359/437 (82%), Gaps = 9/437 (2%)

Query: 5   LVFFLAFLFLFSLSEGLN-PICD---TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
           LV FL    +  L+ GLN P CD   TQD  STL++FH+ SPCSPFK S PLSWE  VL+
Sbjct: 20  LVLFLQLFSILPLALGLNHPNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQ 79

Query: 61  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
            LA+DQARLQ+LSSL VA +SVVPIASGRQ+ QS TYIV+A IGTPAQ LL+AMDTS+D 
Sbjct: 80  TLAQDQARLQYLSSL-VAGRSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDV 138

Query: 121 AWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA 179
           AW+PC+GCVGC S+T F+ A+ST+FKN+ C A QCKQVPNPTCG  AC+FNLTYGSS+IA
Sbjct: 139 AWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSIA 198

Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTF 237
           ANLSQDTI LA D +  +TFGC+ K  G     PPQGLLGLGRG LSL++Q Q++Y+STF
Sbjct: 199 ANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTF 258

Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
           SYCLPSF++L+FSGSLRLGP  QP+R+KYT LL+NPRRSSLYYVNL+AIRVGR+VVD+PP
Sbjct: 259 SYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPP 318

Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP 356
            A+ FNP+TGAGTI DSGTV+TRL  P Y AVR+ FR+RV  +   VTSLGGFDTCYS  
Sbjct: 319 AAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQ 378

Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           +  PTIT MF G+N+T+P DNL++HSTAGS +CLAMAAAP+NVNSV+NVIA+MQQQNHR+
Sbjct: 379 VKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRV 438

Query: 417 LYDVPNSRLGVARELCT 433
           L DVPN RLG+ARE C+
Sbjct: 439 LIDVPNGRLGLARERCS 455


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  579 bits (1492), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 296/437 (67%), Positives = 354/437 (81%), Gaps = 9/437 (2%)

Query: 5   LVFFLAFLFLFSLSEGLN-PICD---TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
           LV FL    +  L+ GLN P CD    QD  STL++FH+ SPCSPFK   PLSWE  VL+
Sbjct: 4   LVLFLQLFSIVPLALGLNHPNCDLTKNQDQGSTLRIFHIDSPCSPFKSPSPLSWEARVLQ 63

Query: 61  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
            LA+DQARLQ+LSSL VA +SVVPIASGRQ+ QS TYIV+  IGTPAQ LL+AMDTS+D 
Sbjct: 64  TLAQDQARLQYLSSL-VAGRSVVPIASGRQMLQSTTYIVKVLIGTPAQPLLLAMDTSSDV 122

Query: 121 AWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA 179
           AW+PC+GCVGC S+T F+ A+ST+FKN+ C A QCKQVPNP CG  AC+FNLTYGSS+IA
Sbjct: 123 AWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPACGARACSFNLTYGSSSIA 182

Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTF 237
           ANLSQDTI LA D +  +TFGC+ K  G     PPQGLLGLGRG LSL++Q Q++Y+STF
Sbjct: 183 ANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTF 242

Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
           SYCLPSF++L+FSGSLRLGP  QP+R+KYT LL+NPRRSSLYYVNL+AIRVGR+VVD+PP
Sbjct: 243 SYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPP 302

Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP 356
            A+ FNP+TGAGTI DSGTV+TRL  P Y AVR+ FR+RV      VTSLGGFDTCYS  
Sbjct: 303 AAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGGFDTCYSGQ 362

Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           +  PTIT MF G+N+T+P DNL++HSTAGS +CLAMA+AP+NVNSV+NVIA+MQQQNHR+
Sbjct: 363 VKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRV 422

Query: 417 LYDVPNSRLGVARELCT 433
           L DVPN RLG+ARE C+
Sbjct: 423 LIDVPNGRLGLARERCS 439


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  578 bits (1489), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 283/418 (67%), Positives = 344/418 (82%), Gaps = 11/418 (2%)

Query: 18  SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
           SE +N  C+ + HSS L+VFH+ S CSPFK S  +SW +++L+    D+AR  +LSSLA 
Sbjct: 17  SESIN--CNEKSHSSDLRVFHINSQCSPFKTS--VSWADTLLQ----DKARFLYLSSLAG 68

Query: 78  ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-F 136
            RKS VPIASGR I QSPTYIVRA IGTPAQ +L+A+DTSNDAAW+PC+GCVGCSS+V F
Sbjct: 69  VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLF 128

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAANLSQDTISLATDIVP 195
           + ++S++ + L C+A QCKQ PNP+C    +C FN+TYG STI A L+QDT++LA+D++P
Sbjct: 129 DPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTIEAYLTQDTLTLASDVIP 188

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
            YTFGCI KA+G S+P QGL+GLGRG LSL++Q+QNLYQSTFSYCLP+ K+ +FSGSLRL
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           GP  QP RIK TPLLKNPRRSSLYYVNL+ IRVG ++VDIP  AL F+P TGAGTI DSG
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
           TV+TRLV PAY AVR+ FRRRV  N   TSLGGFDTCYS  +V P++T MF+GMNVTLP 
Sbjct: 309 TVYTRLVEPAYVAVRNEFRRRV-KNANATSLGGFDTCYSGSVVFPSVTFMFAGMNVTLPP 367

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           DNLLIHS+AG+++CLAMAAAP NVNSVLNVIA+MQQQNHR+L DVPNSRLG++RE CT
Sbjct: 368 DNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  577 bits (1488), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 288/421 (68%), Positives = 343/421 (81%), Gaps = 14/421 (3%)

Query: 18  SEGLNPICDT---QDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS 74
           SE LN  C+    Q H S L+VFHV SPCSPFK    +SWE ++L    KD+ARLQ+LSS
Sbjct: 17  SESLN--CNENNPQGHPSDLRVFHVNSPCSPFKQPNTVSWESTLL----KDKARLQYLSS 70

Query: 75  LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST 134
           LA  +K  VPIASGR I QSPTYIVRA IGTPAQ +L+A+DTSNDAAWVPC+GCVGC+S+
Sbjct: 71  LA--KKPSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASS 128

Query: 135 V-FNSAQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYGSSTIAANLSQDTISLATD 192
           V F+ ++S++ +NL C A QCKQ PNPTC  G +C FN+TYG STI A+L+QDT++LA D
Sbjct: 129 VLFDPSKSSSSRNLQCDAPQCKQAPNPTCTAGKSCGFNMTYGGSTIEASLTQDTLTLAND 188

Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
           ++  YTFGCI KATG S+P QGL+GLGRG LSL++QTQNLY STFSYCLP+ K+ +FSGS
Sbjct: 189 VIKSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGS 248

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           LRLGP  QP RIK TPLLKNPRRSSLYYVNL+ IRVG ++VDIP  AL F+ +TGAGTI 
Sbjct: 249 LRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIF 308

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVT 372
           DSGTVFTRLV PAY AVR+ FRRR+  N   TSLGGFDTCYS  +V P++T MF+GMNVT
Sbjct: 309 DSGTVFTRLVEPAYVAVRNEFRRRI-KNANATSLGGFDTCYSGSVVYPSVTFMFAGMNVT 367

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP DNLLIHS++GS +CLAMAAAP+NVNSVLNVIA+MQQQNHR+L D+PNSRLG++RE C
Sbjct: 368 LPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETC 427

Query: 433 T 433
           T
Sbjct: 428 T 428


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  577 bits (1487), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 283/418 (67%), Positives = 344/418 (82%), Gaps = 11/418 (2%)

Query: 18  SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
           SE +N  C+ + HSS L+VFH+ S CSPFK S  +SW +++L+    D+AR  +LSSLA 
Sbjct: 17  SESIN--CNEKSHSSDLRVFHINSLCSPFKTS--VSWADTLLQ----DKARFLYLSSLAG 68

Query: 78  ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-F 136
            RKS VPIASGR I QSPTYIVRA IGTPAQ +L+A+DTSNDAAW+PC+GCVGCSS+V F
Sbjct: 69  VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLF 128

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAANLSQDTISLATDIVP 195
           + ++S++ + L C+A QCKQ PNP+C    +C FN+TYG STI A L+QDT++LA+D++P
Sbjct: 129 DPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTIEAYLTQDTLTLASDVIP 188

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
            YTFGCI KA+G S+P QGL+GLGRG LSL++Q+QNLYQSTFSYCLP+ K+ +FSGSLRL
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           GP  QP RIK TPLLKNPRRSSLYYVNL+ IRVG ++VDIP  AL F+P TGAGTI DSG
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
           TV+TRLV PAY AVR+ FRRRV  N   TSLGGFDTCYS  +V P++T MF+GMNVTLP 
Sbjct: 309 TVYTRLVEPAYVAVRNEFRRRV-KNANATSLGGFDTCYSGSVVFPSVTFMFAGMNVTLPP 367

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           DNLLIHS+AG+++CLAMAAAP NVNSVLNVIA+MQQQNHR+L DVPNSRLG++RE CT
Sbjct: 368 DNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  574 bits (1479), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 281/418 (67%), Positives = 342/418 (81%), Gaps = 11/418 (2%)

Query: 18  SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
           SE +N  C+ + HSS L+VFH+ S CSPFK S  +SW +++L+    D+AR  +LSSLA 
Sbjct: 17  SESIN--CNEKSHSSDLRVFHINSQCSPFKTS--VSWADTLLQ----DKARFLYLSSLAG 68

Query: 78  ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-F 136
             KS VPIASGR I QSPTYIVRA IGTPAQ +L+A+DTSNDAAW+PC+GCVGCSS+V F
Sbjct: 69  VTKSSVPIASGRGIVQSPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSVLF 128

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAANLSQDTISLATDIVP 195
           + ++S++ + L C+A QCKQ PNP+C    +C FN+TYG S I A L+QDT++LATD++P
Sbjct: 129 DPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSAIEAYLTQDTLTLATDVIP 188

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
            YTFGCI KA+G S+P QGL+GLGRG LSL++Q+QNLYQSTFSYCLP+ K+ +FSGSLRL
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           GP  QP RIK TPLLKNPRRSSLYYVNL+ IRVG ++VDIP  AL F+P TGAGTI DSG
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
           TV+TRLV PAY A+R+ FRRRV  N   TSLGGFDTCYS  +V P++T MF+GMNVTLP 
Sbjct: 309 TVYTRLVEPAYVAMRNEFRRRV-KNANATSLGGFDTCYSGSVVFPSVTFMFAGMNVTLPP 367

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           DNLLIHS+AG+++CLAMAAAP NVNSVLNVIA+MQQQNHR+L DVPNSRLG++RE CT
Sbjct: 368 DNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score =  570 bits (1470), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 284/364 (78%), Positives = 331/364 (90%), Gaps = 2/364 (0%)

Query: 71  FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
           FLSSL VAR+S VPIAS RQ+ QSPT++VRAKIGTPAQTLL+A+DTSNDAAW+PC+GC+G
Sbjct: 1   FLSSL-VARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIG 59

Query: 131 C-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISL 189
           C S+TVF+S +S++F+ L CQ+ QC QVPNP+C G AC FNLTYGSST+AA+L QD ++L
Sbjct: 60  CPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNLTYGSSTVAADLVQDNLTL 119

Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
           ATD VP YTFGCI+KATG+SVPPQGLLGLGRG LSLL Q+Q+LYQSTFSYCLPSFK+++F
Sbjct: 120 ATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNF 179

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
           SGSLRLGP+ QP RIKYTPLL+NPRRSSLYYVNL++IRVGR++VDIPP AL FN  TGAG
Sbjct: 180 SGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAG 239

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGM 369
           T+IDSGT FTRLVAPAYTAVRD FRRRVG N+TV+SLGGFDTCY+VPI++PTIT MF+GM
Sbjct: 240 TVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIISPTITFMFAGM 299

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           NVTLP DN LIHST+GS TCLAMAAAPDNVNSVLNVIA+MQQQNHRIL+D+PNSR+GVAR
Sbjct: 300 NVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVAR 359

Query: 430 ELCT 433
           E C+
Sbjct: 360 ESCS 363


>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  520 bits (1339), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 270/320 (84%), Positives = 296/320 (92%)

Query: 114 MDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY 173
           MDTS+D AW+PC GC+GCSST+FNS  STT+K+LGCQAAQCKQVP PTCGGG C+FNLTY
Sbjct: 1   MDTSSDVAWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFNLTY 60

Query: 174 GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
           G S++AANLSQDTI+LATD VPGY+FGCIQKATG S+P QGLLGLGRG LSLL+QTQNLY
Sbjct: 61  GGSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLY 120

Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           QSTFSYCLPSFK+L+FSGSLRLGP+GQPKRIKYTPLLKNPRR SLY+VNL+A+RVGRRVV
Sbjct: 121 QSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVV 180

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
           D+PPG+  FNP+TGAGTI DSGTVFTRLV PAY AVRD FR RVG NLTVTSLGGFDTCY
Sbjct: 181 DVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCY 240

Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           +VPI APTIT MF+GMNVTLP DNLLIHSTAGS TCLAMAAAPDNVNSVLNVIAN+QQQN
Sbjct: 241 TVPIAAPTITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQN 300

Query: 414 HRILYDVPNSRLGVARELCT 433
           HR+LYDVPNSRLGVARELCT
Sbjct: 301 HRLLYDVPNSRLGVARELCT 320


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score =  498 bits (1283), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 248/410 (60%), Positives = 319/410 (77%), Gaps = 5/410 (1%)

Query: 29  DHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIAS 87
           D  +TLQV H F PCSP    S   SW   + +  A+D +RL +L SLAV  ++  PIAS
Sbjct: 38  DAGATLQVSHAFGPCSPLGAESAAPSWAGFLADQAARDASRLLYLDSLAVKGRAYAPIAS 97

Query: 88  GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKN 146
           GRQ+ Q+PTY+VRA++GTPAQ LL+A+DTSNDAAW+PC+GC GC +S+ FN A S +++ 
Sbjct: 98  GRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNPAASASYRP 157

Query: 147 LGCQAAQCKQVPNPTCGGGA--CAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQK 204
           + C + QC   PNP+C   A  C F+L+Y  S++ A LSQDT+++A D+V  YTFGC+Q+
Sbjct: 158 VPCGSPQCVLAPNPSCSPNAKSCGFSLSYADSSLQAALSQDTLAVAGDVVKAYTFGCLQR 217

Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
           ATG + PPQGLLGLGRG LS L+QT+++Y +TFSYCLPSFK+L+FSG+LRLG  GQP+RI
Sbjct: 218 ATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRI 277

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
           K TPLL NP RSSLYYVN+  IRVG++VV IP  AL F+P TGAGT++DSGT+FTRLVAP
Sbjct: 278 KTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAP 337

Query: 325 AYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHST 383
            Y A+RD  RRRVG+    V+SLGGFDTCY+  +  P +TL+F GM VTLP++N++IH+T
Sbjct: 338 VYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTTVAWPPVTLLFDGMQVTLPEENVVIHTT 397

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            G+ +CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 398 YGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCT 447


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score =  496 bits (1276), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 256/414 (61%), Positives = 320/414 (77%), Gaps = 11/414 (2%)

Query: 29  DHSSTLQVFHVFSPCSPFKP-SKPLSWEESVLEMLAKDQARLQFLSSLAVARKS--VVPI 85
           D  +TLQV H F PCSP  P +   SW   + +  ++D +RL +L SLA   K+    PI
Sbjct: 39  DAGNTLQVSHAFGPCSPLGPGTTAPSWAGFLADQASRDASRLLYLDSLAARGKARAYAPI 98

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQST 142
           ASGRQ+ Q+PTY+VRA++GTP Q LL+A+DTSNDAAW+PC GC GC   S+  F+ A ST
Sbjct: 99  ASGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAAST 158

Query: 143 TFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
           +++++ C +  C Q PN  C  GG AC F+LTY  S++ A LSQD++++A D V  YTFG
Sbjct: 159 SYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGDAVKTYTFG 218

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
           C+QKATG + PPQGLLGLGRG LS L+QT+++YQ TFSYCLPSFK+L+FSG+LRLG  GQ
Sbjct: 219 CLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQ 278

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           P RIK TPLL NP RSSLYYVN+  IRVGR+VV IPP AL F+P TGAGT++DSGT+FTR
Sbjct: 279 PPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTR 338

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-PTITLMFSGMNVTLPQDNLL 379
           LVAPAY AVRD  RRRVG+   V+SLGGFDTC++   VA P +TL+F GM VTLP++N++
Sbjct: 339 LVAPAYVAVRDEVRRRVGA--PVSSLGGFDTCFNTTAVAWPPVTLLFDGMQVTLPEENVV 396

Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           IHST G+I+CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 397 IHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 450


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score =  496 bits (1276), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 248/409 (60%), Positives = 316/409 (77%), Gaps = 5/409 (1%)

Query: 29  DHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIAS 87
           D  +TLQV H F PCSP    +   SW   + +  ++D +RL +L SLAVA ++  PIAS
Sbjct: 39  DAGATLQVSHAFGPCSPLGNAAAAPSWAGFLADQSSRDASRLLYLDSLAVAGRAYAPIAS 98

Query: 88  GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-FNSAQSTTFKN 146
           GRQ+ Q+PTY+VRA++GTP Q LL+A+DTSNDAAW+PC+GC GC +T  FN A S +++ 
Sbjct: 99  GRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTPFNPAASKSYRA 158

Query: 147 LGCQAAQCKQVPNPTC--GGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQK 204
           + C +  C + PNP+C     +C F+LTY  S++ A LSQD++++A D+V  YTFGC+QK
Sbjct: 159 VPCGSPACSRAPNPSCSLNTKSCGFSLTYADSSLEAALSQDSLAVANDVVKSYTFGCLQK 218

Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
           ATG + PPQGLLGLGRG LS L+QT+++Y+ TFSYCLPSFK+L+FSG+LRLG  GQP RI
Sbjct: 219 ATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQPLRI 278

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
           K TPLL NP RSSLYYV++  IRVG++VV IPP AL F+P TGAGT++DSGT+FTRLVAP
Sbjct: 279 KTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAP 338

Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHSTA 384
           AY AVRD  RRR+     ++SLGGFDTCY+  +  P +T MF+GM VTLP DNL+IHST 
Sbjct: 339 AYVAVRDEVRRRI-RGAPLSSLGGFDTCYNTTVKWPPVTFMFTGMQVTLPADNLVIHSTY 397

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           G+ +CLAMAAAPD VN+VLNVIA+MQQQNHRIL+DVPN R+G ARE CT
Sbjct: 398 GTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQCT 446


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  493 bits (1270), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 244/431 (56%), Positives = 308/431 (71%), Gaps = 9/431 (2%)

Query: 8   FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
           F  F  LFS ++ ++P C TQ  +S L V  ++S CSPF P K  SW  +V+ M +KD  
Sbjct: 10  FFLFALLFSTTKAVDP-CATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPE 68

Query: 68  RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
           RL++LS+LA  + + VPIA G+Q+ +   Y+VR K+GTP Q + M +DTSNDAAWVPC+G
Sbjct: 69  RLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSG 128

Query: 128 CVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYG-SSTIAANLS 183
           C GCSST F    STT  +L C  AQC QV   +C   G  AC FN +YG  S++ A L 
Sbjct: 129 CTGCSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLV 188

Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
           QD I+LA D++PG+TFGCI   +G S+PPQGLLGLGRG +SL++Q   +Y   FSYCLPS
Sbjct: 189 QDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS 248

Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
           FK+  FSGSL+LGP+GQPK I+ TPLL+NP R SLYYVNL  + VGR  V IP   L F+
Sbjct: 249 FKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFD 308

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPT 361
           P TGAGTIIDSGTV TR V P Y A+RD FR++V  N  ++SLG FDTC++      AP 
Sbjct: 309 PNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NGPISSLGAFDTCFAATNEAEAPA 366

Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           ITL F G+N+ LP +N LIHS++GS+ CL+MAAAP+NVNSVLNVIAN+QQQN RI++D  
Sbjct: 367 ITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTT 426

Query: 422 NSRLGVARELC 432
           NSRLG+ARELC
Sbjct: 427 NSRLGIARELC 437


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 245/432 (56%), Positives = 309/432 (71%), Gaps = 10/432 (2%)

Query: 7   FFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQ 66
           FFL  L LFS ++ ++P C TQ  +S L V  ++S CSPF P K  SW  +V+ M +KD 
Sbjct: 10  FFLVAL-LFSTTKAVDP-CATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDP 67

Query: 67  ARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
            RL++LS+LA  + + VPIA G+Q+ +   Y+VR K+GTP Q + M +DTSNDAAWVPC+
Sbjct: 68  ERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCS 127

Query: 127 GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYG-SSTIAANL 182
           GC G SST F    STT  +L C  AQC QV   +C   G  AC FN +YG  S++ A L
Sbjct: 128 GCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATL 187

Query: 183 SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
            QD I+LA D++PG+TFGCI   +G S+PPQGLLGLGRG +SL++Q   +Y   FSYCLP
Sbjct: 188 VQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLP 247

Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
           SFK+  FSGSL+LGP+GQPK I+ TPLL+NP R SLYYVNL  + VGR  V IP   L F
Sbjct: 248 SFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVF 307

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAP 360
           +P TGAGTIIDSGTV TR V P Y A+RD FR++V  N  ++SLG FDTC++      AP
Sbjct: 308 DPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NGPISSLGAFDTCFAATNEAEAP 365

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            ITL F G+N+ LP +N LIHS++GS+ CL+MAAAP+NVNSVLNVIAN+QQQN RI++D 
Sbjct: 366 AITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDT 425

Query: 421 PNSRLGVARELC 432
            NSRLG+ARELC
Sbjct: 426 TNSRLGIARELC 437


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 248/414 (59%), Positives = 316/414 (76%), Gaps = 15/414 (3%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSKPL-SWEESVLEMLAKDQARLQFLSSLAVA--RKSVVPI 85
           D  +TLQV H F PCSP  P     SW   + +  ++D +RL +L SLAV    ++  PI
Sbjct: 41  DAGNTLQVSHAFGPCSPLGPGTAAPSWAGFLADQASRDASRLLYLDSLAVRGRARAYAPI 100

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQST 142
           ASGRQ+ Q+PTY+VRA +GTP Q LL+A+DTSNDA+W+PC GC GC   S+  F+ A S 
Sbjct: 101 ASGRQLLQTPTYVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSA 160

Query: 143 TFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
           +++ + C +  C Q PN  C  GG AC F+LTY  S++ A LSQD++++A + V  YTFG
Sbjct: 161 SYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGNAVKAYTFG 220

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
           C+Q+ATG + PPQGLLGLGRG LS L+QT+++Y++TFSYCLPSFK+L+FSG+LRLG  GQ
Sbjct: 221 CLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQ 280

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           P+RIK TPLL NP RSSLYYVN+  IRVGR+VV IP     F+P TGAGT++DSGT+FTR
Sbjct: 281 PQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIP----AFDPATGAGTVLDSGTMFTR 336

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-PTITLMFSGMNVTLPQDNLL 379
           LVAPAY AVRD  RRRVG+   V+SLGGFDTC++   VA P +TL+F GM VTLP++N++
Sbjct: 337 LVAPAYVAVRDEVRRRVGA--PVSSLGGFDTCFNTTAVAWPPVTLLFDGMQVTLPEENVV 394

Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           IHST G+I+CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 395 IHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  476 bits (1224), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 246/414 (59%), Positives = 315/414 (76%), Gaps = 15/414 (3%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSKPL-SWEESVLEMLAKDQARLQFLSSLAVA--RKSVVPI 85
           D  +TLQV H F PCSP  P     SW   + +  ++D +RL +L SLAV    ++  PI
Sbjct: 41  DAGNTLQVSHAFGPCSPLGPGTAAPSWAGFLADQASRDASRLLYLDSLAVRGRARAYAPI 100

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQST 142
           ASGRQ+ Q+ TY+VRA +GTP Q LL+A+DTSNDA+W+PC GC GC   S+  F+ A S 
Sbjct: 101 ASGRQLLQTLTYVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASA 160

Query: 143 TFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
           +++ + C +  C Q PN  C  GG AC F+LTY  S++ A LSQD++++A + V  YTFG
Sbjct: 161 SYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGNAVKAYTFG 220

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
           C+Q+ATG + PPQGLLGLGRG LS L+QT+++Y++TFSYCLPSFK+L+FSG+LRLG  GQ
Sbjct: 221 CLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQ 280

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           P+RIK TPLL NP RSSLYYVN+  +RVGR+VV IP     F+P TGAGT++DSGT+FTR
Sbjct: 281 PQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIP----AFDPATGAGTVLDSGTMFTR 336

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-PTITLMFSGMNVTLPQDNLL 379
           LVAPAY AVRD  RRRVG+   V+SLGGFDTC++   VA P +TL+F GM VTLP++N++
Sbjct: 337 LVAPAYVAVRDEVRRRVGA--PVSSLGGFDTCFNTTAVAWPPMTLLFDGMQVTLPEENVV 394

Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           IHST G+I+CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 395 IHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 236/437 (54%), Positives = 304/437 (69%), Gaps = 14/437 (3%)

Query: 8   FLAFLFL---FSLSEGLNPICD--TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEML 62
           F AF+FL    S ++  +P     ++   S L V HV+  CSPF   K  SW  +V+ M 
Sbjct: 4   FTAFVFLTLVVSTTKAFDPCASPSSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINMA 63

Query: 63  AKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           +KD AR+ +LSSL  + K+  VPIASG+Q+     Y+VR K+GTP Q + M +DTS DAA
Sbjct: 64  SKDPARVTYLSSLVASPKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAA 123

Query: 122 WVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN---PTCGGGACAFNLTYG-SST 177
           WVPC  C GCSS  F+   S+T+ +L C   QC QV     PT G  AC FN TYG  S+
Sbjct: 124 WVPCADCAGCSSPTFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSS 183

Query: 178 IAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
            +A LSQD++ LA D +P Y+FGC+   +G+++PPQGLLGLGRG +SLL+Q+ +LY   F
Sbjct: 184 FSAMLSQDSLGLAVDTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVF 243

Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
           SYC PSFK+  FSGSLRLGP+GQPK I+ TPLL+NP R +LYYVNL  + VGR +V + P
Sbjct: 244 SYCFPSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAP 303

Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-- 355
             L F+P TGAGTIIDSGTV TR V P Y A+RD FR++V       ++G FDTC++   
Sbjct: 304 ELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPF--ATIGAFDTCFAATN 361

Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
             +AP +T  F+GM++ LP +N LIHS+AGS+ CLAMAAAP+NVNSVLNVIAN+QQQN R
Sbjct: 362 EDIAPPVTFHFTGMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLR 421

Query: 416 ILYDVPNSRLGVARELC 432
           I++DV NSRLG+ARELC
Sbjct: 422 IMFDVTNSRLGIARELC 438


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 225/358 (62%), Positives = 290/358 (81%), Gaps = 4/358 (1%)

Query: 80  KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNS 138
           ++  PIASGRQ+ Q+PTY+VRA++GTPAQ LL+A+DTSNDAAW+PC+GC GC +S+ FN 
Sbjct: 37  RAYAPIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNP 96

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGA--CAFNLTYGSSTIAANLSQDTISLATDIVPG 196
           A S +++ + C + QC   PNP+C   A  C F+L+Y  S++ A LSQDT+++A D+V  
Sbjct: 97  AASASYRPVPCGSPQCVLAPNPSCSPNAKSCGFSLSYADSSLQAALSQDTLAVAGDVVKA 156

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
           YTFGC+Q+ATG + PPQGLLGLGRG LS L+QT+++Y +TFSYCLPSFK+L+FSG+LRLG
Sbjct: 157 YTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLG 216

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             GQP+RIK TPLL NP RSSLYYVN+  IRVG++VV IP  AL F+P TGAGT++DSGT
Sbjct: 217 RNGQPRRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGT 276

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
           +FTRLVAP Y A+RD  RRRVG+    V+SLGGFDTCY+  +  P +TL+F GM VTLP+
Sbjct: 277 MFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTTVAWPPVTLLFDGMQVTLPE 336

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           +N++IH+T G+ +CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 337 ENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCT 394


>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 437

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 246/431 (57%), Positives = 310/431 (71%), Gaps = 9/431 (2%)

Query: 8   FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
           FL F  L S +  L+P C +Q   S L +  ++S CSPF P K      +V++M +KD A
Sbjct: 9   FLLFALLVSSTIALDP-CASQADDSDLSIIPIYSKCSPFIPPKQEPLVNTVIDMASKDPA 67

Query: 68  RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
           RL++LSSLA    + VPIA G+Q+     Y+VR K+GTP Q + M +DTSNDAAWVPC+G
Sbjct: 68  RLKYLSSLAAQMTTAVPIAPGQQVLNIGNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSG 127

Query: 128 CVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYG-SSTIAANLS 183
           C GCSST F++  S+T+ +L C  AQC QV   +C   G  +C FN +YG  S+ +A L 
Sbjct: 128 CTGCSSTTFSTNTSSTYGSLDCSMAQCTQVRGFSCPATGSSSCVFNQSYGGDSSFSATLV 187

Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
           +D++ L  D++P + FGCI   +G SVPPQGLLGLGRG LSL+AQ+ +LY   FSYCLPS
Sbjct: 188 EDSLRLVNDVIPNFAFGCINSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPS 247

Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
           FK+  FSGSL+LGP GQPK I+YTPLL+NP R SLYYVNL  + VGR +V I P  L FN
Sbjct: 248 FKSYYFSGSLKLGPAGQPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFN 307

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPT 361
           P TGAGTIIDSGTV TR V P YTA+RD FR++V      +SLG FDTC++     VAP 
Sbjct: 308 PNTGAGTIIDSGTVITRFVQPIYTAIRDEFRKQVAGPF--SSLGAFDTCFAATNEAVAPA 365

Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           +TL F+G+N+ LP +N LIHS+AGS+ CLAMAAAP+NVNSVLNVIAN+QQQN R+L+DVP
Sbjct: 366 VTLHFTGLNLVLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVP 425

Query: 422 NSRLGVARELC 432
           NSRLG+ARELC
Sbjct: 426 NSRLGIARELC 436


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score =  450 bits (1157), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 231/406 (56%), Positives = 294/406 (72%), Gaps = 9/406 (2%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
           L V  ++  CSPF   K  SW  +V++M +KD AR+++LSSL   +    PIASG+Q+  
Sbjct: 32  LSVIPIYGKCSPFTAPKSESWMNTVIDMASKDPARIRYLSSLTAQKTVAAPIASGQQVLN 91

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ-STTFKNLGCQAA 152
              Y+VR ++GTP QT+ M +DTSNDAAW PC+GC+GCSST   SAQ S+TF  L C   
Sbjct: 92  VGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTTTFSAQNSSTFATLDCSKP 151

Query: 153 QCKQVPN---PTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
           +C Q      PT G   C FN TYG  ST +A L QD++ L  +++P ++FGCI  A+G+
Sbjct: 152 ECTQARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGPNVIPNFSFGCISSASGS 211

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
           S+PPQGL+GLGRG LSL++Q+ +LY   FSYCLPSFK+  FSGSL+LGP+GQPK I+ TP
Sbjct: 212 SIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTTP 271

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           LL NP R SLYYVNL  I VGR +V I P  L F+P TGAGTIIDSGTV TR V   YTA
Sbjct: 272 LLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYTA 331

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
           VRD FR++VG +   + LG FDTC++    + AP ITL  SG+++ LP +N LIHS+AGS
Sbjct: 332 VRDEFRKQVGGSF--SPLGAFDTCFATNNEVSAPAITLHLSGLDLKLPMENSLIHSSAGS 389

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           + CLAMAAAP+NVNSV+NVIAN+QQQNHRIL+D+ NS+LG+ARELC
Sbjct: 390 LACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARELC 435


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 214/358 (59%), Positives = 275/358 (76%), Gaps = 8/358 (2%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--SSTVFNSAQ 140
           VPIA GRQI   P YI RA +GTPAQTLL+A+D SNDAAWVPC+ C GC  SS  F+  Q
Sbjct: 88  VPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQ 147

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTYGSSTIAANLSQDTISLATDIVPGY 197
           S+T++ + C + QC QVP+P+C  G   +C FNLTY +ST  A L QD+++L  ++V  Y
Sbjct: 148 SSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSY 207

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           TFGC++  +GNSVPPQGL+G GRG LS L+QT++ Y S FSYCLP++++ +FSG+L+LGP
Sbjct: 208 TFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGP 267

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           IGQPKRIK TPLL NP R SLYYVN++ IRVG +VV +P  AL FNP TG+GTIID+GT+
Sbjct: 268 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 327

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQD 376
           FTRL AP Y AVRD FR RV + +    LGGFDTCY+V +  PT+T MF+G + VTLP++
Sbjct: 328 FTRLAAPVYAAVRDAFRGRVRTPV-APPLGGFDTCYNVTVSVPTVTFMFAGAVAVTLPEE 386

Query: 377 NLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           N++IHS++G + CLAMAA P D VN+ LNV+A+MQQQN R+L+DV N R+G +RELCT
Sbjct: 387 NVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 444


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 214/358 (59%), Positives = 275/358 (76%), Gaps = 8/358 (2%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--SSTVFNSAQ 140
           VPIA GRQI   P YI RA +GTPAQTLL+A+D SNDAAWVPC+ C GC  SS  F+  Q
Sbjct: 69  VPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQ 128

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTYGSSTIAANLSQDTISLATDIVPGY 197
           S+T++ + C + QC QVP+P+C  G   +C FNLTY +ST  A L QD+++L  ++V  Y
Sbjct: 129 SSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSY 188

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           TFGC++  +GNSVPPQGL+G GRG LS L+QT++ Y S FSYCLP++++ +FSG+L+LGP
Sbjct: 189 TFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGP 248

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           IGQPKRIK TPLL NP R SLYYVN++ IRVG +VV +P  AL FNP TG+GTIID+GT+
Sbjct: 249 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 308

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQD 376
           FTRL AP Y AVRD FR RV + +    LGGFDTCY+V +  PT+T MF+G + VTLP++
Sbjct: 309 FTRLAAPVYAAVRDAFRGRVRTPV-APPLGGFDTCYNVTVSVPTVTFMFAGAVAVTLPEE 367

Query: 377 NLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           N++IHS++G + CLAMAA P D VN+ LNV+A+MQQQN R+L+DV N R+G +RELCT
Sbjct: 368 NVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 425


>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 233/435 (53%), Positives = 307/435 (70%), Gaps = 12/435 (2%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAK 64
           L+   + ++L  ++  ++P C +Q  +S L V  ++S CSPFKP K  +W+  ++ M +K
Sbjct: 9   LIVIFSVMWLMRVN-AIDP-CASQPDNSDLNVIPIYSKCSPFKPPKADTWDNRIINMASK 66

Query: 65  DQARLQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
           D  R+++LS+L V++K+V   PIASG+       Y+VR K+GTP Q L M +DTS D A+
Sbjct: 67  DPVRVKYLSTL-VSQKTVSTAPIASGQAFNIG-NYVVRVKLGTPGQLLFMVLDTSTDEAF 124

Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIA 179
           VPC+GC GCS T F+   ST++  L C   QC QV   +C   G GAC+FN +Y  S+ +
Sbjct: 125 VPCSGCTGCSDTTFSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS 184

Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
           A L QD + LATD++P Y+FGC+   TG SVP QGLLGLGRG LSLL+Q+ + Y   FSY
Sbjct: 185 ATLVQDALRLATDVIPYYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSY 244

Query: 240 CLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
           CLPSFK+  FSGSL+LGP+GQPK I+ TPLL++P R SLYYVN   I VGR +V  P   
Sbjct: 245 CLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEY 304

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPI 357
           L FNP TG+GTIIDSGTV TR V P Y AVR+ FR++VG   T TS+G FDTC+  +   
Sbjct: 305 LGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGT-TFTSIGAFDTCFVKTYET 363

Query: 358 VAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
           +AP ITL F G+++ LP +N LIHS+AGS+ CLAMAAAPDNVNSVLNVIAN QQQN RIL
Sbjct: 364 LAPPITLHFEGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRIL 423

Query: 418 YDVPNSRLGVARELC 432
           +D+ N+++G+ARE+C
Sbjct: 424 FDIVNNKVGIAREVC 438


>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 440

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 235/436 (53%), Positives = 306/436 (70%), Gaps = 13/436 (2%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSK-PLSWEESVLEMLA 63
           ++   + ++L  ++ G++P C +Q  +S L V  ++S CSPFKP K   SW+  ++ M +
Sbjct: 9   IILIFSVIWLMRVN-GIDP-CASQADNSDLNVIPIYSKCSPFKPPKSDSSWDNRIINMAS 66

Query: 64  KDQARLQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           KD  R ++LS+L V +K+V   PIASG Q      Y+VR K+GTP Q L M +DTS D A
Sbjct: 67  KDPLRFKYLSTL-VGQKTVSTAPIASG-QTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEA 124

Query: 122 WVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTI 178
           +VPC+GC GCS T F+   ST++  L C   QC QV   +C   G GAC+FN +Y  S+ 
Sbjct: 125 FVPCSGCTGCSDTTFSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSF 184

Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
           +A L QD++ LATD++P Y+FGC+   TG SVP QGLLGLGRG LSLL+Q+ + Y   FS
Sbjct: 185 SATLVQDSLRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFS 244

Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           YCLPSFK+  FSGSL+LGP+GQPK I+ TPLL++P R SLYYVN   I VGR +V  P  
Sbjct: 245 YCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSE 304

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVP 356
            L FNP TG+GTIIDSGTV TR V P Y AVR+ FR++VG   T TS+G FDTC+  +  
Sbjct: 305 YLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGT-TFTSIGAFDTCFVKTYE 363

Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
            +AP ITL F G+++ LP +N LIHS+AGS+ CLAMAAAPDNVNSVLNVIAN QQQN RI
Sbjct: 364 TLAPPITLHFEGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRI 423

Query: 417 LYDVPNSRLGVARELC 432
           L+D  N+++G+ARE+C
Sbjct: 424 LFDTVNNKVGIAREVC 439


>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
 gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
          Length = 437

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 234/431 (54%), Positives = 298/431 (69%), Gaps = 12/431 (2%)

Query: 9   LAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQAR 68
           + ++   S    ++P C +Q   S L V  ++  CSPF P K  SW+  V+ M +KD AR
Sbjct: 11  ICYVIYISNINAIDP-CASQPDDSDLNVIPMYGKCSPFNPPKADSWDNRVINMASKDPAR 69

Query: 69  LQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
           + +LS+L VA+K+    PIASG+       Y+VR KIGTP Q L M +DTS D A+VP +
Sbjct: 70  MSYLSTL-VAQKTATSAPIASGQTFNIG-NYVVRVKIGTPGQLLFMVLDTSTDEAFVPSS 127

Query: 127 GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIAANLS 183
           GC+GCS+T F    ST+F  L C   QC QV   +C   G GAC+FN +Y  ST +A L 
Sbjct: 128 GCIGCSATTFYPNVSTSFVPLDCSVPQCGQVRGLSCPATGSGACSFNQSYAGSTFSATLV 187

Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
           QD++ LATD++P Y+FG I   +G+SVP QGLLGLGRG LSLL+Q+  +Y   FSYCLPS
Sbjct: 188 QDSLRLATDVIPSYSFGSINAISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCLPS 247

Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
           FK+  FSGSL+LGP+GQPK I+ TPLL NP R SLYYVNL AI VGR  V +P   L FN
Sbjct: 248 FKSYYFSGSLKLGPVGQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSELLAFN 307

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPT 361
           P+TGAGTIIDSGTV TR V P Y AVRD FR++V      +SLG FDTC+  +   +AP 
Sbjct: 308 PSTGAGTIIDSGTVITRFVEPIYNAVRDEFRKQVTGPF--SSLGAFDTCFVKNYETLAPA 365

Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           ITL F+ +++ LP +N LIHS++GS+ CLAMAAAP NVNSVLNVIAN QQQN R+L+D  
Sbjct: 366 ITLHFTDLDLKLPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDTV 425

Query: 422 NSRLGVARELC 432
           N+++G+ARELC
Sbjct: 426 NNKVGIARELC 436


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 222/409 (54%), Positives = 288/409 (70%), Gaps = 24/409 (5%)

Query: 48  PSKPL-SWEESVLEMLAKDQARLQFL----------SSLAVA----RKSVVPIASGRQIT 92
           P  P+ +W  ++    A D AR   L          S++  A    R+S VPIA GRQ+ 
Sbjct: 43  PGTPVTAWAATLAAQTASDAARAATLATGPRDPPPASAVDAAKKGPRRSFVPIAPGRQLL 102

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-VFNSAQSTTFKNLGCQA 151
             P+Y+ RA++GTPAQ LL+A+D SNDAAWVPC  C GC+    F+  +S+T++ + C A
Sbjct: 103 SIPSYVARARLGTPAQALLVAIDPSNDAAWVPCAACAGCARAPSFDPTRSSTYRPVRCGA 162

Query: 152 AQCKQVPNPTCGGG---ACAFNLTYGSSTIAANLSQDTISLATDI--VPGYTFGCIQKAT 206
            QC Q P P+C GG   +CAFNL+Y +ST  A L QD ++L  D+  V  YTFGC+   T
Sbjct: 163 PQCSQAPAPSCPGGLGSSCAFNLSYAASTFQALLGQDALALHDDVDAVAAYTFGCLHVVT 222

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY 266
           G SVPPQGL+G GRG LS  +QT+++Y S FSYCLPS+K+ +FSG+LRLGP GQPKRIK 
Sbjct: 223 GGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFSGTLRLGPAGQPKRIKT 282

Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
           TPLL NP R SLYYVN++ IRVG R V +P  AL F+PT+G GTI+D+GT+FTRL AP Y
Sbjct: 283 TPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVY 342

Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAG 385
            AVRDVFR RV + +    LGGFDTCY+V I  PT+T  F G ++VTLP++N++I S++G
Sbjct: 343 AAVRDVFRSRVRAPV-AGPLGGFDTCYNVTISVPTVTFSFDGRVSVTLPEENVVIRSSSG 401

Query: 386 SITCLAMAAA-PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            I CLAMAA  PD V++ LNV+A+MQQQNHR+L+DV N R+G +RELCT
Sbjct: 402 GIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELCT 450


>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 230/427 (53%), Positives = 296/427 (69%), Gaps = 13/427 (3%)

Query: 14  LFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS 73
             S+S   +P C +Q   S L V  ++  CSPF P K  SW+  VL M +KD AR+ +LS
Sbjct: 16  FMSMSNATDP-CASQPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74

Query: 74  SLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
           SL VA+K+V   PIASG+       YIVR KIGTP Q L M +DTS D A++P +GC+GC
Sbjct: 75  SL-VAQKTVSSAPIASGQAFNIG-NYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC 132

Query: 132 SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIAANLSQDTIS 188
           S+T F+   ST++  L C   QC QV   +C   G GAC+FN +Y  ST +A L QD++ 
Sbjct: 133 SATTFSPNASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLR 192

Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
           LATD++P Y+FG I   +G+S+P QGLLGLGRG LSLL+QT +LY   FSYCLPSFK+  
Sbjct: 193 LATDVIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYY 252

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           FSGSL+LGP+GQPK I+ TPLL+NPRR SLY+VNL  I VG+  V  P   L F+  TG+
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGS 312

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPTITLMF 366
           GTIIDSGTV TR V P Y AVRD FR++V      +SLG FDTC+  +   +AP ITL F
Sbjct: 313 GTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPF--SSLGAFDTCFVKNYETLAPAITLHF 370

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN-SVLNVIANMQQQNHRILYDVPNSRL 425
           + +++ LP +N LIHS++GS+ CLAMA+ P NVN +VLNVIAN QQQN R+L+D  N+++
Sbjct: 371 TDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNKV 430

Query: 426 GVARELC 432
           G+ARELC
Sbjct: 431 GIARELC 437


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 211/417 (50%), Positives = 287/417 (68%), Gaps = 15/417 (3%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSK-PLSWEESVLEMLAKDQARLQFLSSLAVARK--SVVPI 85
           D S  L +  + + CSPF P+    S  ++VL M + D  RL +LSSL   +   + VP+
Sbjct: 34  DGSDDLSIIPINAKCSPFAPTHVSASVIDTVLHMASSDSHRLTYLSSLVAGKPKPTSVPV 93

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT-- 143
           ASG Q+     Y+VRAK+GTP Q + M +DTSNDA W+PC+GC GCS+   +   +++  
Sbjct: 94  ASGNQL-HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSST 152

Query: 144 FKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYG-SSTIAANLSQDTISLATDIVPGY 197
           +  + C  AQC Q    TC   +     C+FN +YG  S+ +A+L QDT++LA D++P +
Sbjct: 153 YSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNF 212

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           +FGCI  A+GNS+PPQGL+GLGRG +SL++QT +LY   FSYCLPSF++  FSGSL+LG 
Sbjct: 213 SFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGL 272

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           +GQPK I+YTPLL+NPRR SLYYVNL  + VG   V + P  L F+  +GAGTIIDSGTV
Sbjct: 273 LGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTV 332

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLPQ 375
            TR   P Y A+RD FR++V  + + ++LG FDTC+S     VAP ITL  + +++ LP 
Sbjct: 333 ITRFAQPVYEAIRDEFRKQVNVS-SFSTLGAFDTCFSADNENVAPKITLHMTSLDLKLPM 391

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +N LIHS+AG++TCL+MA    N N+VLNVIAN+QQQN RIL+DVPNSR+G+A E C
Sbjct: 392 ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 207/417 (49%), Positives = 282/417 (67%), Gaps = 16/417 (3%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSK-PLSWEESVLEMLAKDQARLQFLSSLAVARK--SVVPI 85
           D S  L +  + + CSPF  +    S  ++VL M + D  R  +LSSL   +   + VP+
Sbjct: 35  DGSHDLSIIPINAKCSPFAHTHVSASVIDTVLHMASSDSHRFTYLSSLVAGKSKPTSVPV 94

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT-- 143
           ASG Q+     Y+VRA++GTP Q + M +DTSNDA W+PC+GC GCS+   +   +++  
Sbjct: 95  ASGNQL-HIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSST 153

Query: 144 FKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYG-SSTIAANLSQDTISLATDIVPGY 197
           +  + C   QC Q    TC         C+FN +YG  S+ +ANL QDT++L+ D++P +
Sbjct: 154 YSTVSCSTTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTLSPDVIPNF 213

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           +FGCI  A+GNS+PPQGL+GLGRG +SL++QT +LY   FSYCLPSF++  FSGSL+LG 
Sbjct: 214 SFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGL 273

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           +GQPK I+YTPLL+NPRR SLYYVNL  + VG   V + P  L F+  +GAGTIIDSGTV
Sbjct: 274 LGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGTV 333

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLPQ 375
            TR   P Y A+RD FR++V  N + ++LG FDTC+S     V P ITL  + +++ LP 
Sbjct: 334 ITRFAQPVYEAIRDEFRKQV--NGSFSTLGAFDTCFSADNENVTPKITLHMTSLDLKLPM 391

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +N LIHS+AG++TCL+MA    N N+VLNVIAN+QQQN RIL+DVPNSR+G+A E C
Sbjct: 392 ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score =  400 bits (1028), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 216/407 (53%), Positives = 286/407 (70%), Gaps = 27/407 (6%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
           SW   +    ++D +R+ +LSSLA       P+ASGRQ+  +PTY+VRA +GTP Q LL+
Sbjct: 51  SWTSFIAAQTSRDTSRVLYLSSLASGFGGA-PLASGRQLLHTPTYLVRASLGTPPQRLLL 109

Query: 113 AMDTSNDAAWVPCTGCVGCSSTV--FNSAQSTTFKNLGCQAAQCKQVPNPTC-----GGG 165
           A+DTSNDAAWVPC GC GC +T   FN A S TF+ + C A  C Q PNP+C        
Sbjct: 110 AVDTSNDAAWVPCAGCHGCPTTAPSFNPASSATFRPVPCGAPPCSQAPNPSCTSLAKSKN 169

Query: 166 ACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
           +C F+L+YG S++ A LSQD +++  +  ++ GYTFGC+ K+ G++ P QGLLGLGRG L
Sbjct: 170 SCGFSLSYGDSSLDATLSQDNLAVTANGGVIKGYTFGCLTKSNGSAAPAQGLLGLGRGPL 229

Query: 224 SLLAQTQNLYQSTFSYCLPSF--KALSFSGSLRLGPIGQP--KRIKYTPLLKNPRRSSLY 279
             +AQT+ +Y+ TFSYCLPS+   A +FSGSL LG  GQP  +++K TPLL +P R SLY
Sbjct: 230 GFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGRKGQPAPEKMKTTPLLASPHRPSLY 289

Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
           YV +  +R+G++ V IPP AL F+  TGAGT++DSGT+F RL  PAY AVRD  RRRV  
Sbjct: 290 YVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAG 349

Query: 340 NL----------TVTSLGGFDTCYSVPIVA-PTITLMF-SGMNVTLPQDNLLIHSTAGSI 387
           +L          +V+SLGGFDTCY+V  VA P +TL+F  GM V LP++N++I ST GS 
Sbjct: 350 SLRRRGGGGASVSVSSLGGFDTCYNVSTVAWPAVTLVFGGGMEVRLPEENVVIRSTYGST 409

Query: 388 TCLAMAAAP-DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           +CLAMAA+P D VN+ LNVI ++QQQNHR+L+DVPN+R+G ARE CT
Sbjct: 410 SCLAMAASPADGVNAALNVIGSLQQQNHRVLFDVPNARVGFARERCT 456


>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
          Length = 434

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 224/419 (53%), Positives = 288/419 (68%), Gaps = 13/419 (3%)

Query: 14  LFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS 73
             S+S   +P C +Q   S L V  ++  CSPF P K  SW+  VL M +KD AR+ +LS
Sbjct: 16  FMSMSNATDP-CASQPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74

Query: 74  SLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
           SL VA+K+V   PIASG+       YIVR KIGTP Q L M +DTS D A++P +GC+GC
Sbjct: 75  SL-VAQKTVSSAPIASGQAFNIG-NYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC 132

Query: 132 SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIAANLSQDTIS 188
           S+T F+   ST++  L C   QC QV   +C   G GAC+FN +Y  ST +A L QD++ 
Sbjct: 133 SATTFSPNASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLR 192

Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
           LATD++P Y+FG I   +G+S+P QGLLGLGRG LSLL+QT +LY   FSYCLPSFK+  
Sbjct: 193 LATDVIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYY 252

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           FSGSL+LGP+GQPK I+ TPLL+NPRR SLY+VNL  I VG+  V  P   L F+  TG+
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGS 312

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPTITLMF 366
           GTIIDSGTV TR V P Y AVRD FR++V      +SLG FDTC+  +   +AP ITL F
Sbjct: 313 GTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPF--SSLGAFDTCFVKNYETLAPAITLHF 370

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN-SVLNVIANMQQQNHRILYDVPNSR 424
           + +++ LP +N LIHS++GS+ CLAMA+ P NVN +VLNVIAN QQQN R+L+D  N++
Sbjct: 371 TDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNK 429


>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 451

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 222/442 (50%), Positives = 291/442 (65%), Gaps = 19/442 (4%)

Query: 6   VFFLAFLFLFSLSEGLNPICDTQ--DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
           + F + L   S+    N  C +Q  D  S + +  ++  CSPFK +   SWE  +++M +
Sbjct: 13  ILFTSMLLHLSIIAIANDPCASQHDDDDSDITMIPIYGNCSPFK-NYSTSWENIIIDMAS 71

Query: 64  KDQARLQFLSSL--AVARK--SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
           KD  R+ +LSSL  ++ RK  S  PIASG+      +Y+VR K+G+P Q   M +DTS D
Sbjct: 72  KDPERVVYLSSLDASLRRKPISAAPIASGQAFGIG-SYVVRVKLGSPNQLFFMVLDTSTD 130

Query: 120 AAWVPCTGCVGCSS--TVFNSAQSTTFKN-LGCQAAQCKQ----VPNPTCGGGACAFNLT 172
            AWVPCTGC GCSS  T ++   STT+   + C A +C Q    +P P  G  AC FN +
Sbjct: 131 EAWVPCTGCTGCSSSSTYYSPQASTTYGGAVACYAPRCAQARGALPCPYTGSKACTFNQS 190

Query: 173 YGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
           Y  ST +A L QD++ L  D +P Y FGC+  A+G ++P QGLLGLGRG LSL +Q+  L
Sbjct: 191 YAGSTFSATLVQDSLRLGIDTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKL 250

Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
           Y   FSYCLPSF++  FSGSL+LGP GQP+RI+ TPLL+NPRR SLYYVNL  + VGR  
Sbjct: 251 YSGIFSYCLPSFQSSYFSGSLKLGPTGQPRRIRTTPLLQNPRRPSLYYVNLTGVTVGRVK 310

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
           V +P   L F+P  G+GTI+DSGTV TR V P Y+A+RD FR +V       S GGFDTC
Sbjct: 311 VPLPIEYLAFDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQVKGPF--FSRGGFDTC 368

Query: 353 Y--SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
           +  +   + P I L F+G++VTLP +N LIH+  G + CLAMAAAP+NVNSVLNVIAN Q
Sbjct: 369 FVKTYENLTPLIKLRFTGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQ 428

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQN R+L+D  N+R+G+ARELC
Sbjct: 429 QQNLRVLFDTVNNRVGIARELC 450


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 207/377 (54%), Positives = 270/377 (71%), Gaps = 22/377 (5%)

Query: 78  ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SS 133
           +R + VPIA+GRQI ++P+Y+ RA++GTP QTLL+A+D SNDAAWVPC+ C+GC    SS
Sbjct: 81  SRHTFVPIAAGRQILRTPSYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASS 140

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVP--NPTCGGG---ACAFNLTYGSSTIAANLSQDTIS 188
             F+  QS+T++ + C A QC QVP   P+C  G   +CAFNL+Y SST+ A L QD +S
Sbjct: 141 PSFDPTQSSTYRPVRCGAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHAVLGQDALS 200

Query: 189 LATD---IVPG--YTFGCIQKATGN--SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
           L+      VP   YTFGC++  TG+  SVPPQGL+G GRG LS L+QT+  Y S FSYCL
Sbjct: 201 LSDSNGAAVPDDHYTFGCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCL 260

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
           PS+K+ +FSG+LRLGP GQP+RIK TPLL NP R SLYYV ++ +RV  + V IP  AL 
Sbjct: 261 PSYKSSNFSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALA 320

Query: 302 FNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIV 358
            +  TG  GTI+D+GT+FTRL  PAY A+R+ FRR V S     +LGGFDTCY V     
Sbjct: 321 LDAATGRGGTIVDAGTMFTRLSPPAYAALRNAFRRGV-SAPAAPALGGFDTCYYVNGTKS 379

Query: 359 APTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHRI 416
            P +  +F+ G  VTLP++N++I ST+G + CLAMAA P D VN+ LNV+A+MQQQNHR+
Sbjct: 380 VPAVAFVFAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRV 439

Query: 417 LYDVPNSRLGVARELCT 433
           ++DV N R+G +RELCT
Sbjct: 440 VFDVGNGRVGFSRELCT 456


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 197/376 (52%), Positives = 266/376 (70%), Gaps = 14/376 (3%)

Query: 69  LQFLSSLAVARK--SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
           L +LSSL   +   + VP+ASG Q+     Y+VRAK+GTP Q + M +DTSNDA W+PC+
Sbjct: 1   LTYLSSLVAGKPKPTSVPVASGNQL-HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCS 59

Query: 127 GCVGCSSTVFNSAQSTT--FKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYG-SSTI 178
           GC GCS+   +   +++  +  + C  AQC Q    TC   +     C+FN +YG  S+ 
Sbjct: 60  GCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSF 119

Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
           +A+L QDT++LA D++P ++FGCI  A+GNS+PPQGL+GLGRG +SL++QT +LY   FS
Sbjct: 120 SASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFS 179

Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           YCLPSF++  FSGSL+LG +GQPK I+YTPLL+NPRR SLYYVNL  + VG   V + P 
Sbjct: 180 YCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPV 239

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--P 356
            L F+  +GAGTIIDSGTV TR   P Y A+RD FR++V  + + ++LG FDTC+S    
Sbjct: 240 YLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVS-SFSTLGAFDTCFSADNE 298

Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
            VAP ITL  + +++ LP +N LIHS+AG++TCL+MA    N N+VLNVIAN+QQQN RI
Sbjct: 299 NVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRI 358

Query: 417 LYDVPNSRLGVARELC 432
           L+DVPNSR+G+A E C
Sbjct: 359 LFDVPNSRIGIAPEPC 374


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 190/341 (55%), Positives = 242/341 (70%), Gaps = 8/341 (2%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
           SW  +V+ M +KD  RL++LS+LA  + + VPIA G+Q+ +   Y+VR K+GTP Q + M
Sbjct: 1   SWVNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFM 60

Query: 113 AMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAF 169
            +DTSNDAAWVPC+GC GCSST F    STT  +L C  AQC QV   +C   G  AC F
Sbjct: 61  VLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLF 120

Query: 170 NLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
           N +YG  S++AA L QD I+LA D++PG+TFGCI   +G S+PPQGLLGLGRG +SL++Q
Sbjct: 121 NQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQ 180

Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
              +Y   FSYCLPSFK+  FSGSL+LGP+GQPK I+ TPLL+NP R SLYYVNL  + V
Sbjct: 181 AGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSV 240

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
           GR  V IP   L F+P TGAGTIIDSGTV TR V P Y A+RD FR++V  N  ++SLG 
Sbjct: 241 GRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NGPISSLGA 298

Query: 349 FDTCYSV--PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
           FDTC++      AP +TL F G+N+ LP +N LIHS++GS+
Sbjct: 299 FDTCFAATNEAEAPAVTLHFEGLNLVLPMENSLIHSSSGSV 339


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 190/341 (55%), Positives = 242/341 (70%), Gaps = 8/341 (2%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
           SW  +V+ M +KD  RL++LS+LA  + + VPIA G+Q+ +   Y+VR K+GTP Q + M
Sbjct: 1   SWVNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFM 60

Query: 113 AMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAF 169
            +DTSNDAAWVPC+GC GCSST F    STT  +L C  AQC QV   +C   G  AC F
Sbjct: 61  VLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLF 120

Query: 170 NLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
           N +YG  S++AA L QD I+LA D++PG+TFGCI   +G S+PPQGLLGLGRG +SL++Q
Sbjct: 121 NQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQ 180

Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
              +Y   FSYCLPSFK+  FSGSL+LGP+GQPK I+ TPLL+NP R SLYYVNL  + V
Sbjct: 181 AGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSV 240

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
           GR  V IP   L F+P TGAGTIIDSGTV TR V P Y A+RD FR++V  N  ++SLG 
Sbjct: 241 GRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NGPISSLGA 298

Query: 349 FDTCYSV--PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
           FDTC++      AP +TL F G+N+ LP +N LIHS++GS+
Sbjct: 299 FDTCFAETNEAEAPAVTLHFEGLNLVLPMENSLIHSSSGSV 339


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 203/423 (47%), Positives = 276/423 (65%), Gaps = 34/423 (8%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-LAVARKSVVPIASGRQIT 92
           L V+H   P SP     PL   ES++ +   D ARL FLSS  A A  S  P+ASG+   
Sbjct: 27  LSVYHNVHPSSP----SPL---ESIIALARDDDARLLFLSSKAATAGVSSAPVASGQA-- 77

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQA 151
             P+Y+VRA +G+P+Q LL+A+DTS DA W  C+ C  C SS++F  A S+++ +L C +
Sbjct: 78  -PPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSS 136

Query: 152 AQC-----KQVPNPTCGGGA---------CAFNLTYGSSTIAANLSQDTISLATDIVPGY 197
           + C     +  P P  GG A         CAF+  +  ++  A L+ DT+ L  D +P Y
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGKDAIPNY 196

Query: 198 TFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
           TFGC+   TG   ++P QGLLGLGRG ++LL+Q  +LY   FSYCLPS+++  FSGSLRL
Sbjct: 197 TFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRL 256

Query: 256 GPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           G  G QP+ ++YTP+L+NP RSSLYYVN+  + VGR  V +P G+  F+  TGAGT++DS
Sbjct: 257 GAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGAGTVVDS 316

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGM 369
           GTV TR  AP Y A+R+ FRR+V +    TSLG FDTC++   V    AP +T+ M  G+
Sbjct: 317 GTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGV 376

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           ++ LP +N LIHS+A  + CLAMA AP NVNSV+NVIAN+QQQN R+++DV NSR+G A+
Sbjct: 377 DLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRIGFAK 436

Query: 430 ELC 432
           E C
Sbjct: 437 ESC 439


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 201/421 (47%), Positives = 274/421 (65%), Gaps = 34/421 (8%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-LAVARKSVVPIASGRQITQS 94
           V+H   P SP     PL   ES++ +   D ARL FLSS  A A  S  P+ASG+     
Sbjct: 27  VYHNVHPSSP----SPL---ESIIALARDDDARLLFLSSKAATAGVSSAPVASGQA---P 76

Query: 95  PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQAAQ 153
           P+Y+VRA +G+P+Q LL+A+DTS DA W  C+ C  C SS++F  A S+++ +L C ++ 
Sbjct: 77  PSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSW 136

Query: 154 C-----KQVPNPTCGGGA---------CAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
           C     +  P P  GG A         CAF+  +  ++  A L+ DT+ L  D +P YTF
Sbjct: 137 CPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGKDAIPNYTF 196

Query: 200 GCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           GC+   TG   ++P QGLLGLGRG ++LL+Q  +LY   FSYCLPS+++  FSGSLRLG 
Sbjct: 197 GCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGA 256

Query: 258 IG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            G QP+ ++YTP+L+NP RSSLYYVN+  + VG   V +P G+  F+  TGAGT++DSGT
Sbjct: 257 GGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGT 316

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNV 371
           V TR  AP Y A+R+ FRR+V +    TSLG FDTC++   V    AP +T+ M  G+++
Sbjct: 317 VITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDL 376

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            LP +N LIHS+A  + CLAMA AP NVNSV+NVIAN+QQQN R+++DV NSR+G A+E 
Sbjct: 377 ALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKES 436

Query: 432 C 432
           C
Sbjct: 437 C 437


>gi|217073884|gb|ACJ85302.1| unknown [Medicago truncatula]
          Length = 259

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 190/244 (77%), Positives = 211/244 (86%), Gaps = 1/244 (0%)

Query: 19  EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
           +GLNP CD QD+ STLQV HVFSPCSPF+PSKPLSWEESVL+M AKD  RLQFL SL VA
Sbjct: 16  QGLNPKCDVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSL-VA 74

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
           RKS+VPIASGRQI QSPTYIVRAKIGTP QTLL+AMDTSNDAAW+PCT C GC+ST+F  
Sbjct: 75  RKSIVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAP 134

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
            +STTFKN+ C A +CKQVPNP CG  +C FNLTYGSS+IAANL QDTI+LATD VP YT
Sbjct: 135 EKSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSSIAANLVQDTITLATDPVPSYT 194

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 195 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 254

Query: 259 GQPK 262
             P+
Sbjct: 255 AHPE 258


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 192/358 (53%), Positives = 242/358 (67%), Gaps = 30/358 (8%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--SSTVFNSAQ 140
           VPIA GRQI   P YI RA +GTPAQTLL+A+D SNDAAWVPC+ C GC  SS  F+  Q
Sbjct: 88  VPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQ 147

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTYGSSTIAANLSQDTISLATDIVPGY 197
           S+T++ + C + QC QVP+P+C  G   +C FNLTY +ST  A L QD+++L  ++V  Y
Sbjct: 148 SSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSY 207

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           TFGC++   GNS    G   L   +  LL   Q                        LGP
Sbjct: 208 TFGCLRVVNGNSRAAAGAHRLRPRAALLLVADQG----------------------HLGP 245

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           IGQPKRIK TPLL NP R SLYYVN++ IRVG +VV +P  AL FNP TG+GTIID+GT+
Sbjct: 246 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 305

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQD 376
           FTRL AP Y AVRD FR RV + +    LGGFDTCY+V +  PT+T MF+G + VTLP++
Sbjct: 306 FTRLAAPVYAAVRDAFRGRVRTPV-APPLGGFDTCYNVTVSVPTVTFMFAGAVAVTLPEE 364

Query: 377 NLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           N++IHS++G + CLAMAA P D VN+ LNV+A+MQQQN R+L+DV N R+G +RELCT
Sbjct: 365 NVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 422


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  367 bits (941), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 195/417 (46%), Positives = 266/417 (63%), Gaps = 30/417 (7%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK-SVVPIASGRQITQS 94
           V+H   P S    S PL   ES++ +  +D ARL FLSS A +   S  P+ASG+     
Sbjct: 25  VYHNVHPPS----SSPL---ESIIALAREDDARLLFLSSKAASTGVSSAPVASGQS---P 74

Query: 95  PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--SSTVFNSAQSTTFKNLGCQAA 152
           P+Y+VRA +G+PAQ +L+A+DTS DA W  C+ C  C  S ++F  A ST++  L C + 
Sbjct: 75  PSYVVRAGLGSPAQPILLALDTSADATWAHCSPCGTCPSSGSLFAPANSTSYAPLPCSST 134

Query: 153 QCKQVPNPTCGGG----------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI 202
            C  +    C              CAF   +  ++  A+L+ D + L  D +P Y FGC+
Sbjct: 135 MCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASDWLHLGKDAIPNYAFGCV 194

Query: 203 QKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
              +G   ++P QGLLGLGRG ++LL+Q  N+Y   FSYCLPS+K+  FSGSLRLG  GQ
Sbjct: 195 SAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAAGQ 254

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           P+ ++YTP+LKNP RSSLYYVN+  + VGR  V +P G+  F+P TGAGT++DSGTV TR
Sbjct: 255 PRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITR 314

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI----VAPTITL-MFSGMNVTLPQ 375
              P Y A+R+ FRR V +    TSLG FDTC++       VAP +T+ M  G+++ LP 
Sbjct: 315 WTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPAVTVHMDGGLDLALPM 374

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +N LIHS+A  + CLAMA AP NVN+V+NV+AN+QQQN R+++DV NSR+G ARE C
Sbjct: 375 ENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFARESC 431


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score =  344 bits (882), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 199/420 (47%), Positives = 265/420 (63%), Gaps = 28/420 (6%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--VPIASG 88
           ++ L V+H   P SP     PL   ES++ +   D ARL FLSS A +   V   P+ASG
Sbjct: 21  AADLSVYHNVHPPSP----SPL---ESIIALARADDARLLFLSSKAASSGGVTSAPVASG 73

Query: 89  RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNL 147
           +     P+Y+VRA +GTP Q LL+A+DTS DA W  C  C  C + + F  A S+++ +L
Sbjct: 74  QT---PPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASL 130

Query: 148 GCQAAQCKQVPNPTCGGG--------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
            C +  C       C           ACAF+  +  ++  A+L  DT+ L  D + GY F
Sbjct: 131 PCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRLGKDAIAGYAF 190

Query: 200 GCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           GC+    G   ++P QGLLGLGRG +SLL+QT + Y   FSYCLPS+++  FSGSLRLG 
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRLGA 250

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
            GQP+ ++YTPLL NP R SLYYVN+  + VGR  V +P G+  F+P TGAGT+IDSGTV
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNVT 372
            TR  AP Y A+R+ FRR+V +    TSLG FDTC++   V    AP +TL M  G+++T
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP +N LIHS+A  + CLAMA AP NVN+V+NV+AN+QQQN R++ DV  SR+G ARE C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score =  343 bits (881), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 199/420 (47%), Positives = 265/420 (63%), Gaps = 28/420 (6%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--VPIASG 88
           ++ L V+H   P SP     PL   ES++ +   D ARL FLSS A +   V   P+ASG
Sbjct: 21  AADLSVYHNVHPPSP----SPL---ESIIALARADDARLLFLSSKAASSGGVTSAPVASG 73

Query: 89  RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNL 147
           +     P+Y+VRA +GTP Q LL+A+DTS DA W  C  C  C + + F  A S+++ +L
Sbjct: 74  QT---PPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASL 130

Query: 148 GCQAAQCKQVPNPTCGGG--------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
            C +  C       C           ACAF+  +  ++  A+L  DT+ L  D + GY F
Sbjct: 131 PCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRLGKDAIAGYAF 190

Query: 200 GCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           GC+    G   ++P QGLLGLGRG +SLL+QT + Y   FSYCLPS+++  FSGSLRLG 
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGA 250

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
            GQP+ ++YTPLL NP R SLYYVN+  + VGR  V +P G+  F+P TGAGT+IDSGTV
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNVT 372
            TR  AP Y A+R+ FRR+V +    TSLG FDTC++   V    AP +TL M  G+++T
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP +N LIHS+A  + CLAMA AP NVN+V+NV+AN+QQQN R++ DV  SR+G ARE C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 198/420 (47%), Positives = 265/420 (63%), Gaps = 28/420 (6%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--VPIASG 88
           ++ L V+H   P SP     PL   ES++ +   D ARL FLSS A +   +   P+ASG
Sbjct: 21  AADLSVYHNVHPPSP----SPL---ESIIALARADDARLLFLSSKAASSGGITSAPVASG 73

Query: 89  RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNL 147
           +     P+Y+VRA +GTP Q LL+A+DTS DA W  C  C  C + + F  A S+++ +L
Sbjct: 74  QT---PPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASL 130

Query: 148 GCQAAQCKQVPNPTCGGG--------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
            C +  C       C           ACAF+  +  ++  A+L  DT+ L  D + GY F
Sbjct: 131 PCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRLGKDAIAGYAF 190

Query: 200 GCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           GC+    G   ++P QGLLGLGRG +SLL+QT + Y   FSYCLPS+++  FSGSLRLG 
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGA 250

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
            GQP+ ++YTPLL NP R SLYYVN+  + VGR  V +P G+  F+P TGAGT+IDSGTV
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNVT 372
            TR  AP Y A+R+ FRR+V +    TSLG FDTC++   V    AP +TL M  G+++T
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP +N LIHS+A  + CLAMA AP NVN+V+NV+AN+QQQN R++ DV  SR+G ARE C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 598

 Score =  304 bits (779), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 153/271 (56%), Positives = 197/271 (72%), Gaps = 5/271 (1%)

Query: 167 CAFNLTYGSSTIAANLSQDTISLA--TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
           C   + Y      A L QD ++L    D+V  YTFGC++  TG SVPPQGL+G G G LS
Sbjct: 328 CIIGMIYAYFHPNALLGQDALALHDDVDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLS 387

Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLL 284
             +Q +++Y   FSYCLPS+K+ +FS +LRLGP GQPKRIK TPLL NP R SLYYVN++
Sbjct: 388 FPSQNKDVYGFVFSYCLPSYKSSNFSSTLRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMV 447

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            I VG R + +P  AL F+P +G GTI+D+GT+FTRL AP Y AVRDVFR RV + +T  
Sbjct: 448 GIHVGGRPMLVPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVT-G 506

Query: 345 SLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAP-DNVNSV 402
            LGGFDTCY+V I  PT+T  F G ++VTLP++N++I S++  I CLAMAA P D V++V
Sbjct: 507 PLGGFDTCYNVTISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAV 566

Query: 403 LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           LNV+A+MQQQNHR+L+DV N R+G +RELCT
Sbjct: 567 LNVLASMQQQNHRVLFDVANGRVGFSRELCT 597


>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 537

 Score =  304 bits (779), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 153/271 (56%), Positives = 197/271 (72%), Gaps = 5/271 (1%)

Query: 167 CAFNLTYGSSTIAANLSQDTISLA--TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
           C   + Y      A L QD ++L    D+V  YTFGC++  TG SVPPQGL+G G G LS
Sbjct: 267 CIIGMIYAYFHPNALLGQDALALHDDVDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLS 326

Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLL 284
             +Q +++Y   FSYCLPS+K+ +FS +LRLGP GQPKRIK TPLL NP R SLYYVN++
Sbjct: 327 FPSQNKDVYGFVFSYCLPSYKSSNFSSTLRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMV 386

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            I VG R + +P  AL F+P +G GTI+D+GT+FTRL AP Y AVRDVFR RV + +T  
Sbjct: 387 GIHVGGRPMLVPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVT-G 445

Query: 345 SLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAP-DNVNSV 402
            LGGFDTCY+V I  PT+T  F G ++VTLP++N++I S++  I CLAMAA P D V++V
Sbjct: 446 PLGGFDTCYNVTISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAV 505

Query: 403 LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           LNV+A+MQQQNHR+L+DV N R+G +RELCT
Sbjct: 506 LNVLASMQQQNHRVLFDVANGRVGFSRELCT 536


>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
          Length = 565

 Score =  303 bits (776), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 152/258 (58%), Positives = 194/258 (75%), Gaps = 5/258 (1%)

Query: 180 ANLSQDTISLATDI--VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
           A L QD ++L  D+  +  YTFGC+   TG SVP QGL+G  RG LS  +Q +N+Y S F
Sbjct: 308 ALLGQDALALHDDVDAIAAYTFGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVF 367

Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
           SYCLPS+K+ +FSG+LRLGP GQPKRIK TPLL NP R SLYYVN++ IRVG R V +P 
Sbjct: 368 SYCLPSYKSSNFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVAVPA 427

Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI 357
            AL F+P +G GTI+D+GT+FTRL AP Y AV DVFR RV + +    LGGFDTCY+V I
Sbjct: 428 SALAFDPASGHGTIVDAGTMFTRLSAPVYAAVCDVFRSRVRAPVA-GPLGGFDTCYNVTI 486

Query: 358 VAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHR 415
             PT+T +F G ++VTLP++N++I S+   I CLAMAA P D+V++VLNV+A+MQQQNHR
Sbjct: 487 SVPTVTFLFDGRVSVTLPEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHR 546

Query: 416 ILYDVPNSRLGVARELCT 433
           +L+DV N R+G +RELCT
Sbjct: 547 VLFDVANGRVGFSRELCT 564


>gi|255647724|gb|ACU24323.1| unknown [Glycine max]
          Length = 334

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 165/326 (50%), Positives = 217/326 (66%), Gaps = 10/326 (3%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSK-PLSWEESVLEMLA 63
           ++   + ++L  +  G++P C +Q  +S L V  ++S CSPFKP K   SW+  ++ M +
Sbjct: 9   IILIFSVIWLMRV-NGIDP-CASQADNSDLNVIPIYSKCSPFKPPKSDSSWDNRIINMAS 66

Query: 64  KDQARLQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           KD  R ++LS+L V +K+V   PIASG+       Y+VR K+GTP Q L M +DTS D A
Sbjct: 67  KDPLRFKYLSTL-VGQKTVSTAPIASGQTFNIG-NYVVRVKLGTPGQLLFMVLDTSTDEA 124

Query: 122 WVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTI 178
           +VPC+GC GCS   F+   ST++  L C   QC QV   +C   G GAC+FN +Y  S+ 
Sbjct: 125 FVPCSGCTGCSDATFSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSF 184

Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
           +A L QD++ LATD++P Y+FGC+   TG SVP QGLLGLGRG LSLL+Q+ + Y   FS
Sbjct: 185 SATLVQDSLRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFS 244

Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           YCLPSFK+  FSGSL+L P+GQPK I+ TPLL++P R SLYYVN   I VGR +V  P  
Sbjct: 245 YCLPSFKSYYFSGSLKLRPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSE 304

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAP 324
            L FNP TG+GTIIDSGTV TR V P
Sbjct: 305 YLGFNPNTGSGTIIDSGTVITRFVEP 330


>gi|242044812|ref|XP_002460277.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
 gi|241923654|gb|EER96798.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
          Length = 369

 Score =  281 bits (718), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 140/197 (71%), Positives = 166/197 (84%), Gaps = 5/197 (2%)

Query: 240 CLPSFKALSFSGS--LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
           CLPSFK+L+FSGS  LRLG  GQP+RIK TPLL NP RSSLYYVN+  IRVGR+VV IPP
Sbjct: 173 CLPSFKSLNFSGSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPP 232

Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI 357
            AL F+P TGAGT++DSGT+FTRLVAPAY AVRD  RRRVG+   V+SLGGFDTC++   
Sbjct: 233 PALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGA--PVSSLGGFDTCFNTTA 290

Query: 358 VA-PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           VA P +TL+F GM VTLP++N++IHST G+I+CLAMAAAPD VN+VLNVIA+MQQQNHR+
Sbjct: 291 VAWPPVTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRV 350

Query: 417 LYDVPNSRLGVARELCT 433
           L+DVPN R+G ARE CT
Sbjct: 351 LFDVPNGRVGFARERCT 367


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 173/424 (40%), Positives = 239/424 (56%), Gaps = 24/424 (5%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---------SSLAVARK 80
            +S+L V H+   CSPF+     SW  +V E +  D AR + +         + +     
Sbjct: 50  ETSSLSVMHIQGKCSPFRLLNS-SWWTAVSESIKGDTARYRAMVKGGWSAGKTMVNPQED 108

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--FNS 138
           + +P+ASG+ I+ S  YI++   GTP Q+    +DT ++ AW+PC  C GCSS    F  
Sbjct: 109 ADIPLASGQAISSS-NYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQPFEP 167

Query: 139 AQSTTFKNLGCQAAQCK--QVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVP 195
           ++S+T+  L C + QC+  +V   +     C+    YG  S +   LS +T+S+ +  V 
Sbjct: 168 SKSSTYNYLTCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQQVE 227

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
            + FGC   A G       L+G GR  LS ++QT  LY STFSYCLPS  + +F+GSL L
Sbjct: 228 NFVFGCSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGSLLL 287

Query: 256 GPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           G      + +K+TPLL N R  S YYV L  I VG  +V IP G L  + +TG GTIIDS
Sbjct: 288 GKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDS 347

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSVP---IVAPTITLMF-SGM 369
           GTV TRLV PAY A+RD FR ++ SNLT+ S    FDTCY+ P   +  P ITL F   +
Sbjct: 348 GTVITRLVEPAYNAMRDSFRSQL-SNLTMASPTDLFDTCYNRPSGDVEFPLITLHFDDNL 406

Query: 370 NVTLPQDNLLI-HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
           ++TLP DN+L   +  GS+ CLA    P   + VL+   N QQQ  RI++DV  SRLG+A
Sbjct: 407 DLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIA 466

Query: 429 RELC 432
            E C
Sbjct: 467 SENC 470


>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
 gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
          Length = 408

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 184/419 (43%), Positives = 242/419 (57%), Gaps = 58/419 (13%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV---PIASGRQIT 92
           V+H   P SP     PL   ES++ +   D ARL FLSS A +    V   P+ASG+   
Sbjct: 25  VYHNVHPPSP----SPL---ESIIALARADDARLLFLSSKAASSSGGVTSAPVASGQT-- 75

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQA 151
             P+Y+VRA +GTP Q LL+A+DTS DA W  C  C  C + + F  A S+++ +L C +
Sbjct: 76  -PPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASLPCAS 134

Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
             C     P   G                      +  A D+        +Q A+    P
Sbjct: 135 DWCPLFRRPAVPG------------------EPGRVGAAADVR------LLQAAS--RTP 168

Query: 212 PQGLLGLGR-------------GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
             G+L   R             G +SLL+QT + Y   FSYCLPS+++  FSGSLRLG  
Sbjct: 169 RSGVLAATRCGWARTPSPATRSGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAA 228

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
           GQP+ ++YTPLL NP R SLYYVN+  + VGR +V  P G+  F+P+TGAGT+IDSGTV 
Sbjct: 229 GQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVI 288

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNVTL 373
           TR  AP Y A+RD FRR+V +    TSLG FDTC++   V    AP +TL M  G+++TL
Sbjct: 289 TRWTAPVYAALRDEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTL 348

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           P +N LIHS+A  + CLAMA AP NVNSV+NV+AN+QQQN R++ DV  SR+G ARE C
Sbjct: 349 PMENTLIHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  248 bits (633), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 170/455 (37%), Positives = 254/455 (55%), Gaps = 46/455 (10%)

Query: 5   LVFFLAFLFLFSLSE---GLNPICDTQD-----------HSSTLQVFHVFSPCSPFKPSK 50
           L+  LA  F+  ++E   GLN  C + D           HS +  + H++S CSPF+P  
Sbjct: 13  LILSLAITFMCGVAEIAPGLN--CRSSDKILNRKVGKRSHSVSFPLIHIYSECSPFRPPN 70

Query: 51  PLSWEESVLEMLAKDQARLQFLSSLAVARK----SVVPIASGRQITQSPTYIVRAKIGTP 106
             +WE  + E +  D  RL+FL   + + K    + VP+ SG     S  YI++   GTP
Sbjct: 71  -RTWESLMSEKIRGDANRLRFLKRTSRSSKQDANANVPVRSG-----SGEYIIQVDFGTP 124

Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
            Q++   +DT +D AW+PC  C GC ST  +F+ A+S+++K   C +  C+++     G 
Sbjct: 125 KQSMYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQEISGNCGGN 184

Query: 165 GACAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
             C F ++YG  T +   L+ D I+L +  +P ++FGC +  + ++ P  GL+GLG GSL
Sbjct: 185 SKCQFEVSYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTSPSPGLMGLGGGSL 244

Query: 224 SLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYV 281
           SLL Q  T  L+  TFSYCLPS    S S  L          +K+T L+K+P   + Y+V
Sbjct: 245 SLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPSIPTFYFV 304

Query: 282 NLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL 341
            L AI VG   + +P      N  +G GTIIDSGT  T LV  AYTA+RD FR+++ S+L
Sbjct: 305 TLKAISVGNTRISVP----GTNIASGGGTIIDSGTTITHLVPSAYTALRDAFRQQL-SSL 359

Query: 342 TVTSLGGFDTCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
             T +   DTCY   S  +  PTITL     +++ LP++N+LI   +G + CLA ++   
Sbjct: 360 QPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILITQESG-LACLAFSSTDS 418

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                 ++I N+QQQN RI++DVPNS++G A+E C
Sbjct: 419 R-----SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 163/455 (35%), Positives = 245/455 (53%), Gaps = 46/455 (10%)

Query: 5   LVFFLAFLFLFSLSE---GLNPICDTQD-----------HSSTLQVFHVFSPCSPFKPSK 50
           L+  LA  F+  ++E   GLN  C + D           HS +  + H++S CSPF+P  
Sbjct: 13  LILSLAITFMCGVAEIAPGLN--CRSSDKILNRKVGKRSHSVSFPLIHIYSECSPFRPPN 70

Query: 51  PLSWEESVLEMLAKDQARLQFLSSLAVARK----SVVPIASGRQITQSPTYIVRAKIGTP 106
             +WE  + E +  D  RL+FL   + + K    + VP+ SG     S  YI++   GTP
Sbjct: 71  -RTWESLMSEKIRGDANRLRFLKRTSRSSKEDANANVPVRSG-----SGEYIIQVDFGTP 124

Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
            Q++   +DT +D AW+PC  C GC ST  +F+ A+S+++K   C +  C+++     G 
Sbjct: 125 KQSMYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQEISGNCGGN 184

Query: 165 GACAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKATGN--SVPPQGLLGLGRG 221
             C F + YG  T +   L+ D I+L +  +P ++FGC +  + +  S P    LG G  
Sbjct: 185 SKCQFEVLYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTYSSPGLMGLGGGSL 244

Query: 222 SLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYV 281
           SL   A T  L+  TFSYCLPS    S S  L          +K+T L+K+P   + Y+V
Sbjct: 245 SLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPSFPTFYFV 304

Query: 282 NLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL 341
            L AI VG   + +P      N  +G GTIIDSGT  T LV  AY  +RD FR+++ S+L
Sbjct: 305 TLKAISVGNTRISVPAT----NIASGGGTIIDSGTTITYLVPSAYKDLRDAFRQQL-SSL 359

Query: 342 TVTSLGGFDTCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
             T +   DTCY   S  +  PTITL     +++ LP++N+LI   +G ++CLA ++   
Sbjct: 360 QPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILITQESG-LSCLAFSSTDS 418

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                 ++I N+QQQN RI++DVPNS++G A+E C
Sbjct: 419 R-----SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|222624328|gb|EEE58460.1| hypothetical protein OsJ_09701 [Oryza sativa Japonica Group]
          Length = 360

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 160/405 (39%), Positives = 213/405 (52%), Gaps = 80/405 (19%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-LAVARKSVVPIASGRQITQS 94
           V+H   P SP     PL   ES++ +   D ARL FLSS  A A  S  P+ASG+     
Sbjct: 27  VYHNVHPSSP----SPL---ESIIALARDDDARLLFLSSKAATAGVSSAPVASGQA---P 76

Query: 95  PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC 154
           P+Y+VRA +G+P+Q LL+A+DTS DA         G        A   T           
Sbjct: 77  PSYVVRAGLGSPSQQLLLALDTSADATARRARRRRGGGDAAPPPATLPT----------- 125

Query: 155 KQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG--NSVPP 212
                       CAF+  +  ++  A L+ DT+ L  D +P YTFGC+   TG   ++P 
Sbjct: 126 ------------CAFSKPFADASFQAALASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPR 173

Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKN 272
           QGLLGLGRG ++LL+Q  +LY                    RL            PLL  
Sbjct: 174 QGLLGLGRGPMALLSQAGSLYNG------------------RL------------PLLPP 203

Query: 273 PRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
                     L  I + R     P G+  F+  TGAGT++DSGTV TR  AP Y A+R+ 
Sbjct: 204 ---------ELQVILLLRACSGFPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREE 254

Query: 333 FRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITL-MFSGMNVTLPQDNLLIHSTAGSI 387
           FRR+V +    TSLG FDTC++   VA    P +T+ M  G+++ LP +N LIHS+A  +
Sbjct: 255 FRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPL 314

Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            CLAMA AP NVNSV+NVIAN+QQQN R+++DV NSR+G A+E C
Sbjct: 315 ACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESC 359


>gi|356551755|ref|XP_003544239.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 249

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 128/257 (49%), Positives = 163/257 (63%), Gaps = 16/257 (6%)

Query: 176 STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS 235
           ST +A L QD++ L  D +P Y F C+  A+G ++P Q  L      L     +     S
Sbjct: 8   STFSATLVQDSLRLGIDTLPSYAFRCVNSASGWTLPAQPGLL----GLGRGPLSLPSQSS 63

Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
             SYCLPSF++  FSGSL+LGP GQP+RI+ TPLL+NP+R SLYYVNL  I VGR  V +
Sbjct: 64  XLSYCLPSFQSSYFSGSLKLGPTGQPRRIRTTPLLRNPQRPSLYYVNLTGINVGRVRVSL 123

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
           P   L F+P  G+GTIIDSGTV TR V P Y A+RD FR +V     V +          
Sbjct: 124 PTDYLAFDPNKGSGTIIDSGTVITRFVXPVYNAIRDEFRYQVKGPCFVKTYEN------- 176

Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
             +AP I L F+G++VTLP +N LIH+  G + CLAMAAAP+NVNS L    N QQQN R
Sbjct: 177 --LAPLIKLRFTGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSAL---TNFQQQNLR 231

Query: 416 ILYDVPNSRLGVARELC 432
           +L+D  N+R+G+ARELC
Sbjct: 232 VLFDTVNNRVGIARELC 248


>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
          Length = 216

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 121/215 (56%), Positives = 156/215 (72%), Gaps = 5/215 (2%)

Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVN 282
           +SLL+QT + Y   FSYCLPS+++  FSGSLRLG  GQP+ ++YTPLL NP R SLYYVN
Sbjct: 1   MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVN 60

Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           +  + VGR  V +P G+  F+P TGAGT+IDSGTV TR  AP Y A+R+ FRR+V +   
Sbjct: 61  VTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSG 120

Query: 343 VTSLGGFDTCYSVPIV----APTITL-MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
            TSLG FDTC++   V    AP +TL M  G+++TLP +N LIHS+A  + CLAMA AP 
Sbjct: 121 YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQ 180

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           NVN+V+NV+AN+QQQN R++ DV  SR+G ARE C
Sbjct: 181 NVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215


>gi|147776519|emb|CAN74010.1| hypothetical protein VITISV_003547 [Vitis vinifera]
          Length = 429

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 108/178 (60%), Positives = 136/178 (76%), Gaps = 4/178 (2%)

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
           P+GQPK I+ TPLL+NP R +LYYVNL  + VGR +V + P  L F+P TGAGTIIDSGT
Sbjct: 253 PLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGT 312

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLP 374
           V TR V P Y A+RD FR++V       ++G FDTC++     +AP +T  F+GM++ LP
Sbjct: 313 VITRFVEPVYAAIRDEFRKQVKGPFA--TIGAFDTCFAATNEDIAPPVTFHFTGMDLKLP 370

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            +N LIHS+AGS+ CLAMAAAP+NVNSVLNVIAN+QQQN RI++DV NSRLG+ARELC
Sbjct: 371 LENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 428



 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 46/109 (42%), Positives = 65/109 (59%), Gaps = 6/109 (5%)

Query: 7   FFLAFLFL---FSLSEGLNPICD--TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEM 61
            F AF+FL    S ++  +P     ++   S L V HV+  CSPF   K  SW  +V+ M
Sbjct: 3   IFTAFVFLTLVVSTTKAFDPCASPSSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINM 62

Query: 62  LAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQT 109
            +KD AR+ +LSSL  + K+  VPIASG+Q+     Y+VR K+GTPA+T
Sbjct: 63  ASKDPARVTYLSSLVASPKATSVPIASGQQVLNIGNYVVRVKLGTPAET 111


>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
          Length = 216

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 120/215 (55%), Positives = 156/215 (72%), Gaps = 5/215 (2%)

Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVN 282
           +SLL+QT + Y   FSYCLPS+++  FSGSLRLG  GQP+ +++TPLL NP R SLYYVN
Sbjct: 1   MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRHTPLLTNPHRPSLYYVN 60

Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           +  + VGR  V +P G+  F+P TGAGT+IDSGTV TR  AP Y A+R+ FRR+V +   
Sbjct: 61  VTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSG 120

Query: 343 VTSLGGFDTCYSVPIV----APTITL-MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
            TSLG FDTC++   V    AP +TL M  G+++TLP +N LIHS+A  + CLAMA AP 
Sbjct: 121 YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQ 180

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           NVN+V+NV+AN+QQQN R++ DV  SR+G ARE C
Sbjct: 181 NVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 141/413 (34%), Positives = 211/413 (51%), Gaps = 22/413 (5%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL---AVARKSVVPIASGRQ 90
           +++ H+   CSP +P    SW + V +   +D  RL  + S      +  S +P+  G +
Sbjct: 73  IRLDHIHGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSNLPLQPGSK 132

Query: 91  ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
           +  +  YIV A  GTPA+  L+ +DT +D  W+ C  C  C S V   F   QS+++K+L
Sbjct: 133 V-GTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHL 191

Query: 148 GCQAAQCKQVPNPT-CGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKA 205
            C ++ C ++     C  G C + + YG  S    + SQ+T++L +D  P + FGC    
Sbjct: 192 SCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDSFPSFAFGCGHTN 251

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
           TG      GLLGLGR +LS  +QT++ Y   FSYCLP F + + +GS  +G    P    
Sbjct: 252 TGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSVGQGSIPATAT 311

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
           + PL+ N    S Y+V L  I VG   + IPP  L        GTI+DSGTV TRLV  A
Sbjct: 312 FVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLG-----RGGTIVDSGTVITRLVPQA 366

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF-SGMNVTLPQDNLLI 380
           Y A++  FR +  +  +       DTCY +     +  PTIT  F +  +V +    +L 
Sbjct: 367 YDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNNADVAVSAVGILF 426

Query: 381 H-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              + GS  CLA A+A  ++++  N+I N QQQ  R+ +D    R+G A   C
Sbjct: 427 TIQSDGSQVCLAFASASQSIST--NIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 144/418 (34%), Positives = 211/418 (50%), Gaps = 28/418 (6%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---SSLAVARKSVVPIASGRQ 90
           +++ H+   CSP +P    SW + V +   +D ARL  +   +S      S +P+ SG  
Sbjct: 72  IRLDHIHGACSPLRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMSNLPLQSGTT 131

Query: 91  ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
           +  +  YIV A  GTPA+  L+ +DT +D  W+ C  C  C S V   F   QS+++K L
Sbjct: 132 VG-TGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTL 190

Query: 148 GCQAAQCKQV----PNPT-CGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGC 201
            C +A C ++     NPT C  G C + + YG  S+   + SQ+T++L +D    + FGC
Sbjct: 191 PCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDSFQNFAFGC 250

Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
               TG      GLLGLG+ SLS  +Q+++ Y   F+YCLP F + + +GS  +G    P
Sbjct: 251 GHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSVGKGSIP 310

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-TIIDSGTVFTR 320
               +TPL+ N    + Y+V L  I VG   + IPP  L      G G TI+DSGTV TR
Sbjct: 311 ASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVL------GRGSTIVDSGTVITR 364

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF-SGMNVTLPQ 375
           L+  AY A++  FR +     +       DTCY +     +  PTIT  F +  +V +  
Sbjct: 365 LLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFHFQNNADVAVSD 424

Query: 376 DNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             +L+     GS  CLA A+A        N+I N QQQ  R+ +D    R+G A   C
Sbjct: 425 VGILVPVQNGGSQVCLAFASASQMDG--FNIIGNFQQQRMRVAFDTGAGRIGFASGSC 480


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 138/430 (32%), Positives = 215/430 (50%), Gaps = 48/430 (11%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV---PIASGRQIT 92
           V H   PCSP +       E S  E+L +DQ R+  +  LA AR S     P ++ + ++
Sbjct: 68  VVHRHGPCSPLQAR---GGEPSHAEILDRDQDRVDSIHRLAAARPSSTADDPSSASKGVS 124

Query: 93  ---------QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
                     +  YIV   +GTP + LL+  DT +D +WV C  C GC      +F+ +Q
Sbjct: 125 LPARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQ 184

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-------TD 192
           STT+  + C A +C+++ + +C  G C + + YG  S    NL++DT++L        +D
Sbjct: 185 STTYSAVPCGAQECRRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSD 244

Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
            +  + FGC    TG      GL GLGR  +SL +Q    Y + FSYCLPS  + +  G 
Sbjct: 245 QLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPS--SSTAEGY 302

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           L LG    P   ++T ++      S YY+NL+ I+V  R V + P   +       GT+I
Sbjct: 303 LSLGSAAPPN-ARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRT-----PGTVI 356

Query: 313 DSGTVFTRLVAPAYTAVRDVFR--RRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF 366
           DSGTV TRL + AY A+R  F    R  S     +L   DTCY       +  P++ L+F
Sbjct: 357 DSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLF 416

Query: 367 SG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            G   +N+   +   +++    S  CLA A+  D+ +  + ++ NMQQ+   ++YDV N 
Sbjct: 417 DGGATLNLGFGE---VLYVANKSQACLAFASNGDDTS--IAILGNMQQKTFAVVYDVANQ 471

Query: 424 RLGVARELCT 433
           ++G   + C+
Sbjct: 472 KIGFGAKGCS 481


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 140/429 (32%), Positives = 215/429 (50%), Gaps = 41/429 (9%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL--------SSLAVARKSV 82
           SS L V H   PCSP +  +      S  E+L +DQ R+  +        ++ + ++   
Sbjct: 62  SSALTVVHGHGPCSPQESRRG---APSHTEILGRDQDRVDAIRRKVAAVTTAASSSKPKG 118

Query: 83  VPIASGR-QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
           VP+  G  +   +  Y    ++GTPA  LL+ +DT +D +W+ C  C  C      +F+ 
Sbjct: 119 VPLQVGWGKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDP 178

Query: 139 AQSTTFKNLGCQAAQCKQV----PNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TD 192
           ++S+T+ ++ C + +C+++     +       C + +TY   S    NL++DT++L+ TD
Sbjct: 179 SKSSTYSDITCSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTD 238

Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----LS 248
            VPG+ FGC     G+     GLLGLGRG  SL +Q    Y + FSYCLPS  +    LS
Sbjct: 239 AVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPSATGYLS 298

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           FSG+        P   ++T ++   +  S YY+NL  I V  R + +PP        T A
Sbjct: 299 FSGAAAA----APTNAQFTEMVAG-QHPSFYYLNLTGITVAGRAIKVPPSVF----ATAA 349

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITL 364
           GTIIDSGT F+ L   AY A+R   R  +G      S   FDTCY +     +  P++ L
Sbjct: 350 GTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVAL 409

Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           +F+ G  V L    +L   +  S TCLA    PD+ +  L V+ N QQ+   ++YDV N 
Sbjct: 410 VFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTS--LGVLGNTQQRTLAVIYDVDNQ 467

Query: 424 RLGVARELC 432
           ++G     C
Sbjct: 468 KVGFGANGC 476


>gi|217073830|gb|ACJ85275.1| unknown [Medicago truncatula]
          Length = 267

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 110/254 (43%), Positives = 149/254 (58%), Gaps = 8/254 (3%)

Query: 14  LFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS 73
             S+S   +P C +Q   S L V  ++  CSPF P K  SW+  VL M +KD AR+ +LS
Sbjct: 16  FMSMSNATDP-CASQPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74

Query: 74  SLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
           SL VA+K+V   PIASG+       YIVR KIGTP Q L M +DTS D A++P +GC+GC
Sbjct: 75  SL-VAQKTVSSAPIASGQAFNIG-NYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC 132

Query: 132 SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIAANLSQDTIS 188
           S+T F+   ST++  L C   QC QV   +C   G GAC+FN +Y  ST +A L QD++ 
Sbjct: 133 SATTFSPNASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLR 192

Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
           LATD++P Y+FG I   +G+S+P Q   GLG   +  +   +             F+ L 
Sbjct: 193 LATDVIPSYSFGSINAISGSSIPAQRTFGLGPWPVIFIITNRVTLLGCILLLPSKFQILL 252

Query: 249 FSGSLRLGPIGQPK 262
           F  SL+LGP+G PK
Sbjct: 253 FFRSLKLGPVGHPK 266


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 133/418 (31%), Positives = 206/418 (49%), Gaps = 35/418 (8%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV---------ARKSV-VPI 85
           V H   PCSP         E S  E+L +DQ R+  +  +           A K V +P 
Sbjct: 121 VVHRHGPCSPLLAR---GGEPSHAEILDRDQDRVDSIHRMTAGPWTAGQSSASKGVSLPA 177

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQST 142
             G ++  +  YIV   +GTP + LL+  DT +D +WV   PC  C      +F+ +QST
Sbjct: 178 HRGLRLGTA-NYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQST 236

Query: 143 TFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL--ATDIVPGYTF 199
           T+  + C A +C  + + TC  G C + + YG  S    NL++DT++L  ++D + G+ F
Sbjct: 237 TYSAVPCGAQEC--LDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVF 294

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           GC    TG      GL GLGR  +SL +Q    Y + FSYCLPS  +    G L LG   
Sbjct: 295 GCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPS--SWRAEGYLSLGSAA 352

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
            P   ++T ++      S YY++L+ I+V  R V + P   +       GT+IDSGTV T
Sbjct: 353 APPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFK-----APGTVIDSGTVIT 407

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNVTLPQ 375
           RL + AY+A+R  F   +       +L   DTCY       +  P++ L+F G       
Sbjct: 408 RLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLG 467

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              +++    S  CLA A+  D+ +  + ++ NMQQ+   ++YD+ N ++G   + C+
Sbjct: 468 FGGVLYVANRSQACLAFASNGDDTS--VGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 138/441 (31%), Positives = 211/441 (47%), Gaps = 38/441 (8%)

Query: 18  SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
           S   +P     D  ++L+V H   PCS  +P K  S   S  ++LA+D++R+  + S   
Sbjct: 61  SSACSPSPKGHDQRASLEVVHKHGPCSKLRPHKANS--PSHTQILAQDESRVASIQSRLA 118

Query: 78  ----------ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
                     A K+ +P  S   +  S  Y+V   +G+P + L    DT +D  W  C  
Sbjct: 119 KNLAGGSNLKASKATLPSKSASTLG-SGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEP 177

Query: 128 CVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTI 178
           CVG C      +F+ + S ++ N+ C +  C+++ +     P C    C + + YG  + 
Sbjct: 178 CVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSY 237

Query: 179 AANL-SQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST 236
           +    +++ +SL +TD+   + FGC Q   G      GLLGL R  LSL++QT   Y   
Sbjct: 238 SIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKV 297

Query: 237 FSYCLPSFKALSFSGSLRLGP-IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           FSYCLPS  + S +G L  G   G  K +K+TP   N    S Y+++++ I VG R + I
Sbjct: 298 FSYCLPS--SSSSTGYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPI 355

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
           P         + AGTIIDSGTV +RL    Y++V+ VFR  +     V  +   DTCY +
Sbjct: 356 PKSVF-----STAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDL 410

Query: 356 P----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
                +  P I L FSG          +I+    S  CLA A   D  +  + +I N+QQ
Sbjct: 411 SKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSD--DDEVAIIGNVQQ 468

Query: 412 QNHRILYDVPNSRLGVARELC 432
           +   ++YD    R+G A   C
Sbjct: 469 KTIHVVYDDAEGRVGFAPSGC 489


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 130/400 (32%), Positives = 196/400 (49%), Gaps = 27/400 (6%)

Query: 55  EESVLEMLAKDQARLQFLSS-LAVARKSVVPIASGRQITQ-----SPTYIVRAKIGTPAQ 108
             +VL+++++D AR ++L+S L+ A +      S  ++       S  Y VR  IG+P  
Sbjct: 77  RHAVLDLVSRDNARAEYLASRLSPAYQPTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPT 136

Query: 109 TLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-G 164
              + +D+ +D  WV C  C+ C   +  +F+ A S TF  + C +A C+ +    CG  
Sbjct: 137 EQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAICRTLRTSGCGDS 196

Query: 165 GACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
           G C + ++YG  S     L+ +T++L    V G   GC  +  G  V   GLLGLG G +
Sbjct: 197 GGCEYEVSYGDGSYTKGTLALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPM 256

Query: 224 SLLAQTQNLYQSTFSYCLPSFK-----ALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSS 277
           SL+ Q        FSYCL S       A   +GSL LG     P+   + PL++NP+  S
Sbjct: 257 SLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPS 316

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
            YYV +  I VG   + +  G  Q     G G ++D+GT  TRL   AY A+RD F   V
Sbjct: 317 FYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAV 376

Query: 338 GSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAM 392
           G+      +   DTCY +     +  PT++  F G   +TLP  NLL+    G I CLA 
Sbjct: 377 GALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLE-VDGGIYCLAF 435

Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           A +    +S L+++ N+QQ+  +I  D  N  +G     C
Sbjct: 436 APS----SSGLSILGNIQQEGIQITVDSANGYIGFGPATC 471


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 137/432 (31%), Positives = 206/432 (47%), Gaps = 40/432 (9%)

Query: 28  QDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV---------- 77
            D  ++L+V H   PCS     K  S   S  +ML +D++R+  + S             
Sbjct: 62  DDKRASLEVIHKHGPCSKLSQDKGRS--PSRTQMLDQDESRVNSIRSRLAKNPADGGKLK 119

Query: 78  ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C---SS 133
             K  +P  SG  I  +  Y+V   +GTP + L    DT +D  W  C  C   C     
Sbjct: 120 GSKVTLPSKSGSTI-GTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQE 178

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTI 187
            +FN ++ST++ N+ C +  C ++ +     P+C    C + + YG  + +    +QD +
Sbjct: 179 PIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKL 238

Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           +L +TD+   + FGC Q   G  V   GL+GLGR +LSL++QT   Y   FSYCLPS   
Sbjct: 239 ALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPS--T 296

Query: 247 LSFSGSLRLGP-IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
            S +G L  G   G  K +K+TP L N +  S Y++NL+AI VG R +            
Sbjct: 297 SSSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVF----- 351

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
           + AGTIIDSGTV +RL   AY+ +R  F++++            DTCY       +  P 
Sbjct: 352 STAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPK 411

Query: 362 ITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           I L FS G  + L    +        + CLA A   D  +  + ++ N+QQ+   ++YDV
Sbjct: 412 INLYFSDGAEMDLDPSGIFYILNISQV-CLAFAGNSDATD--IAILGNVQQKTFDVVYDV 468

Query: 421 PNSRLGVARELC 432
              R+G A   C
Sbjct: 469 AGGRIGFAPGGC 480


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 133/427 (31%), Positives = 204/427 (47%), Gaps = 37/427 (8%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
           S+L V H    CS     K  S +   +E+L  DQAR+  + S  +++K      S  Q 
Sbjct: 61  SSLHVTHRHGTCSRLNNGKATSPDH--VEILRLDQARVNSIHS-KLSKKLTTNHVSQSQS 117

Query: 92  TQSP----------TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFN 137
           T  P           YIV   +GTP   L +  DT +D  W  C  CV         +FN
Sbjct: 118 TDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFN 177

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGGACA-----FNLTYGSSTIAAN-LSQDTISL-A 190
            ++ST++ N+ C +A C  + + T   G+C+     + + YG  + +   L++D  +L +
Sbjct: 178 PSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTLTS 237

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
           +D+  G  FGC +   G      GLLGLGR  LS  +QT   Y   FSYCLPS  + S++
Sbjct: 238 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SASYT 295

Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
           G L  G  G  + +K+TP+      +S Y +N++AI VG + + IP         +  G 
Sbjct: 296 GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF-----STPGA 350

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
           +IDSGTV TRL   AY A+R  F+ ++    T + +   DTC+ +     +  P +   F
Sbjct: 351 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 410

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           SG  V       + ++   S  CLA A   D+ N+   +  N+QQQ   ++YD    R+G
Sbjct: 411 SGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAA--IFGNVQQQTLEVVYDGAGGRVG 468

Query: 427 VARELCT 433
            A   C+
Sbjct: 469 FAPNGCS 475


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 132/447 (29%), Positives = 212/447 (47%), Gaps = 60/447 (13%)

Query: 31  SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFLS---SLAVARKSV--- 82
           S+ +++ H   PCSP   +  KP + +E    +LA DQ R++ +    S    R  +   
Sbjct: 68  SARMRIVHQHGPCSPLADAHGKPPAHDE----ILAADQNRVESIQRRVSATTGRDKLTKH 123

Query: 83  --------------------------VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
                                     +P  SGR ++    Y+V   +GTPA    +  DT
Sbjct: 124 AAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTG-NYVVTVGLGTPASKYTVVFDT 182

Query: 117 SNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
            +D  WV C  CV  C      +F+ A+S+T+ N+ C  + C  +    C GG C + + 
Sbjct: 183 GSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQ 242

Query: 173 YGSSTIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
           YG  +      +QDT+++A D + G+ FGC +K  G      GL+GLGRG  SL  Q  N
Sbjct: 243 YGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYN 302

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
            Y   F+YCLP+    + +G L  GP       + TP+L + +  + YYV +  IRVG +
Sbjct: 303 KYGGAFAYCLPAL--TTGTGYLDFGPGSAGNNARLTPMLTD-KGQTFYYVGMTGIRVGGQ 359

Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--F 349
            V +          + AGT++DSGTV TRL A AYTA+   F + + +     + G    
Sbjct: 360 QVPVAESVF-----STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSIL 414

Query: 350 DTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           DTCY       +  PT++L+F G        + ++++ + +  CLA A+  D+ +  + +
Sbjct: 415 DTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNGDDES--VAI 472

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           + N QQ+ + +LYD+    +G A   C
Sbjct: 473 VGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 123/365 (33%), Positives = 184/365 (50%), Gaps = 25/365 (6%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTG-CVGCSSTVFNS 138
           +P ++G  +  +  ++V    GTPAQT  +  DT +D +W+   PC+G C      +F+ 
Sbjct: 122 IPDSTGTSL-DTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDP 180

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPG 196
            +S T+  + C   QC       C  G C + + YG  S+ A  LS +T+SL +T  +PG
Sbjct: 181 TKSATYSVVPCGHPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTRALPG 240

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
           + FGC Q   G+     GL+GLGRG LSL +Q    +  TFSYCLPS       G L +G
Sbjct: 241 FAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTT--HGYLTIG 298

Query: 257 PI--GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           P        ++YT +++     S Y+V L++I +G  ++ +PP        T  GT +DS
Sbjct: 299 PTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLF-----TDDGTFLDS 353

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMN 370
           GT+ T L   AYTA+RD F+  +       +   FDTCY       I  P ++  FS  +
Sbjct: 354 GTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGS 413

Query: 371 V-TLPQDNLLI--HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           V  L    +LI    TA +I CL   A P  +     ++ NMQQ+N  ++YDV   ++G 
Sbjct: 414 VFDLSFFGILIFPDDTAPAIGCLGFVARPSAMP--FTIVGNMQQRNTEVIYDVAAEKIGF 471

Query: 428 ARELC 432
           A   C
Sbjct: 472 ASASC 476


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 133/447 (29%), Positives = 213/447 (47%), Gaps = 60/447 (13%)

Query: 31  SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFLSSLAVA---------- 78
           S+ +++ H   PCSP   +  KP + +E    +LA DQ R++ +     A          
Sbjct: 68  SARMRIVHQHGPCSPLADAHGKPPAHDE----ILAADQNRVESIQRRVSATTGRDKLTKH 123

Query: 79  --------RKS--------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
                   +KS               +P  SGR ++    Y+V   +GTPA    +  DT
Sbjct: 124 AAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTG-NYVVTVGLGTPASKYTVVFDT 182

Query: 117 SNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
            +D  WV C  CV  C      +F+ A+S+T+ N+ C  + C  +    C GG C + + 
Sbjct: 183 GSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQ 242

Query: 173 YGSSTIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
           YG  +      +QDT+++A D + G+ FGC +K  G      GL+GLGRG  SL  Q  N
Sbjct: 243 YGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYN 302

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
            Y   F+YCLP+    + +G L  GP       + TP+L + +  + YYV +  IRVG +
Sbjct: 303 KYGGAFAYCLPAL--TTGTGYLDFGPGSAGNNARLTPMLTD-KGQTFYYVGMTGIRVGGQ 359

Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--F 349
            V +          + AGT++DSGTV TRL A AYTA+   F + + +     + G    
Sbjct: 360 QVPVAESVF-----STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSIL 414

Query: 350 DTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           DTCY       +  PT++L+F G        + ++++ + +  CLA A+  D+ +  + +
Sbjct: 415 DTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNGDDES--VAI 472

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           + N QQ+ + +LYD+    +G A   C
Sbjct: 473 VGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 135/427 (31%), Positives = 213/427 (49%), Gaps = 36/427 (8%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-SLAVARKSVVPIASG 88
           +SS L V H   PCSP +            E+L  DQAR+  +   +A A   V+  A G
Sbjct: 71  NSSALNVVHRQGPCSPLQARGA---PPPHAELLNDDQARVDSIHRKIAAAASPVLDQARG 127

Query: 89  RQITQSPT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTV 135
           ++    P           Y+V   +GTPA+ + +  DT +D +WV CT C  C      +
Sbjct: 128 KKGVTLPAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPL 187

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGS-STIAANLSQDTISLA-TD 192
           F+ A+S+T+  + C + +C+ + + +C     C + + YG  S     L++DT++L  +D
Sbjct: 188 FDPARSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD 247

Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
           ++PG+ FGC ++ TG      GL+GLGR  +SL +Q  + Y + FSYCLPS  + S +G 
Sbjct: 248 VLPGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPS--SPSAAGY 305

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           L LG    P   ++T +       S YYV L+ ++V  R V + P  + F   + AGT+I
Sbjct: 306 LSLGGPA-PANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSP--IVF---SAAGTVI 359

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS----VPIVAPTITLMF 366
           DSGTV TRL    Y A+R  F R +G        +L   DTCY       +  P++ L+F
Sbjct: 360 DSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVF 419

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           +G        + +++    S  CLA A   D  ++   +I N QQ+   ++YDV   ++G
Sbjct: 420 AGGAAVGLDFSGVLYVAKVSQACLAFAPNGDGADA--GIIGNTQQKTLAVVYDVARQKIG 477

Query: 427 VARELCT 433
                C+
Sbjct: 478 FGANGCS 484


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 130/427 (30%), Positives = 204/427 (47%), Gaps = 37/427 (8%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS----------LAVARKS 81
           S+L V H    CS     K  S +   +E+L  DQAR+  + S          ++ ++ +
Sbjct: 32  SSLHVTHRHGTCSRLNNGKATSPDH--VEILRLDQARVNSIHSKLSKKLATDHVSESKST 89

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFN 137
            +P   G  +  S  YIV   +GTP   L +  DT +D  W  C  CV         +FN
Sbjct: 90  DLPAKDGSTL-GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFN 148

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGGACA-----FNLTYGSSTIAAN-LSQDTISLA- 190
            ++ST++ N+ C +A C  + + T   G+C+     + + YG  + +   L+++  +L  
Sbjct: 149 PSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTN 208

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
           +D+  G  FGC +   G      GLLGLGR  LS  +QT   Y   FSYCLPS  + S++
Sbjct: 209 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SASYT 266

Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
           G L  G  G  + +K+TP+      +S Y +N++AI VG + + IP         +  G 
Sbjct: 267 GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF-----STPGA 321

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
           +IDSGTV TRL   AY A+R  F+ ++    T + +   DTC+ +     +  P +   F
Sbjct: 322 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 381

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           SG  V       + +    S  CLA A   D+ N+   +  N+QQQ   ++YD    R+G
Sbjct: 382 SGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAA--IFGNVQQQTLEVVYDGAGGRVG 439

Query: 427 VARELCT 433
            A   C+
Sbjct: 440 FAPNGCS 446


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 130/427 (30%), Positives = 204/427 (47%), Gaps = 37/427 (8%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS----------LAVARKS 81
           S+L V H    CS     K  S +   +E+L  DQAR+  + S          ++ ++ +
Sbjct: 60  SSLHVTHRHGTCSRLNNGKATSPDH--VEILRLDQARVNSIHSKLSKKLATDHVSESKST 117

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFN 137
            +P   G  +  S  YIV   +GTP   L +  DT +D  W  C  CV         +FN
Sbjct: 118 DLPAKDGSTL-GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFN 176

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGGACA-----FNLTYGSSTIAAN-LSQDTISLA- 190
            ++ST++ N+ C +A C  + + T   G+C+     + + YG  + +   L+++  +L  
Sbjct: 177 PSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTN 236

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
           +D+  G  FGC +   G      GLLGLGR  LS  +QT   Y   FSYCLPS  + S++
Sbjct: 237 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SASYT 294

Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
           G L  G  G  + +K+TP+      +S Y +N++AI VG + + IP         +  G 
Sbjct: 295 GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF-----STPGA 349

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
           +IDSGTV TRL   AY A+R  F+ ++    T + +   DTC+ +     +  P +   F
Sbjct: 350 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 409

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           SG  V       + +    S  CLA A   D+ N+   +  N+QQQ   ++YD    R+G
Sbjct: 410 SGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAA--IFGNVQQQTLEVVYDGAGGRVG 467

Query: 427 VARELCT 433
            A   C+
Sbjct: 468 FAPNGCS 474


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 138/426 (32%), Positives = 210/426 (49%), Gaps = 40/426 (9%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQ 90
           ++T+ + H   PCSP  P+K +   E   E L +DQ R  ++                  
Sbjct: 57  AATVPLHHRHGPCSPL-PTKKMPTLE---ERLHRDQLRAAYIQRKFSGGGVNGSRGGAGD 112

Query: 91  ITQS----PT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-- 134
           + QS    PT          Y++  ++G+P ++  M +DT +D +WV C  C  C S   
Sbjct: 113 VQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQAD 172

Query: 135 -VFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGACAFNLTYGS-STIAANLSQDTISLA 190
            +F+ + S+T+    C +A C Q+      C    C + +TYG  S+     S DT++L 
Sbjct: 173 PLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLALG 232

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
           ++ V  + FGC    +G +    GL+GLG G+ SL++QT   + + FSYCLP+    S S
Sbjct: 233 SNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPATS--SSS 290

Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
           G L LG  G    +K TP+L++ +  + Y V + AIRVG R + IP           AGT
Sbjct: 291 GFLTLG-AGTSGFVK-TPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFS------AGT 342

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF 366
           I+DSGTV TRL   AY+A+   F+  +    +    G  DTC+       +  PT+ L+F
Sbjct: 343 IMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALVF 402

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           SG  V     + ++  T+ SI CLA AA  D  +S L +I N+QQ+   +LYDV    +G
Sbjct: 403 SGGAVVDIASDGIMLQTSNSILCLAFAANSD--DSSLGIIGNVQQRTFEVLYDVGGGAVG 460

Query: 427 VARELC 432
                C
Sbjct: 461 FKAGAC 466


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 135/446 (30%), Positives = 204/446 (45%), Gaps = 59/446 (13%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-------SSLAVARKSVVPI- 85
           + V H   PCSP   ++      S  E+LA DQ R +++       +  A  RK   P+ 
Sbjct: 1   MPVVHQHGPCSPLADNRN-GKAPSHAEILAADQRRAEYIHRRVAETTGRARRRKQGAPVE 59

Query: 86  -------------------------ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
                                    AS      +  Y+V  ++GTPA+   +  DT +D 
Sbjct: 60  LRPGTPPSSIVVPSSSSATSTTDLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDT 119

Query: 121 AWVPCTGCVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
            WV C  CV  C      +F+  +S T+ N+ C ++ C  +    C GG C + + YG  
Sbjct: 120 TWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSYCSDLYVSGCSGGHCLYGIQYGDG 179

Query: 177 TIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS 235
           +      +QDT++LA D +  + FGC +K  G      GLLGLGRG  SL  Q  + Y  
Sbjct: 180 SYTIGFYAQDTLTLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGG 239

Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
            F+YCLP+  A   +G L LGP       + TP+L + R  + YYV +  I+VG  V+ I
Sbjct: 240 VFAYCLPATSA--GTGFLDLGPGAPAANARLTPMLVD-RGPTFYYVGMTGIKVGGHVLPI 296

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF---DTC 352
           P         + AGT++DSGTV TRL   AY  +R  F + +   L  ++   F   DTC
Sbjct: 297 PGSVF-----STAGTLVDSGTVITRLPPSAYAPLRSAFSKAM-QGLGYSAAPAFSILDTC 350

Query: 353 YSV------PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           Y +       I  P ++L+F G        + +++    S  CLA A   D+ +  + ++
Sbjct: 351 YDLTGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTD--VAIV 408

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N QQ+ H +LYD+    +G A   C
Sbjct: 409 GNTQQKTHGVLYDIGKKIVGFAPGAC 434


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 135/446 (30%), Positives = 204/446 (45%), Gaps = 59/446 (13%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-------SSLAVARKSVVPI- 85
           + V H   PCSP   ++      S  E+LA DQ R +++       +  A  RK   P+ 
Sbjct: 66  MPVVHQHGPCSPLADNRN-GKAPSHAEILAADQRRAEYIHRRVAETTGRARRRKQGAPVE 124

Query: 86  -------------------------ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
                                    AS      +  Y+V  ++GTPA+   +  DT +D 
Sbjct: 125 LRPGTPPSSIVVPSSSSATSTTDLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDT 184

Query: 121 AWVPCTGCVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
            WV C  CV  C      +F+  +S T+ N+ C ++ C  +    C GG C + + YG  
Sbjct: 185 TWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSYCSDLYVSGCSGGHCLYGIQYGDG 244

Query: 177 TIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS 235
           +      +QDT++LA D +  + FGC +K  G      GLLGLGRG  SL  Q  + Y  
Sbjct: 245 SYTIGFYAQDTLTLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGG 304

Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
            F+YCLP+  A   +G L LGP       + TP+L + R  + YYV +  I+VG  V+ I
Sbjct: 305 VFAYCLPATSA--GTGFLDLGPGAPAANARLTPMLVD-RGPTFYYVGMTGIKVGGHVLPI 361

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF---DTC 352
           P         + AGT++DSGTV TRL   AY  +R  F + +   L  ++   F   DTC
Sbjct: 362 PGSVF-----STAGTLVDSGTVITRLPPSAYAPLRSAFSKAM-QGLGYSAAPAFSILDTC 415

Query: 353 YSV------PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           Y +       I  P ++L+F G        + +++    S  CLA A   D+ +  + ++
Sbjct: 416 YDLTGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTD--VAIV 473

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N QQ+ H +LYD+    +G A   C
Sbjct: 474 GNTQQKTHGVLYDIGKKIVGFAPGAC 499


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 130/391 (33%), Positives = 190/391 (48%), Gaps = 24/391 (6%)

Query: 62  LAKDQARLQFLSSLAVARKSVVP--------IASGRQITQ-SPTYIVRAKIGTPAQTLLM 112
           L +D AR++ L+ LA A     P         +    ++Q S  Y  R  +GTP + L M
Sbjct: 86  LERDAARVKTLTHLAAATNKTRPANPGSGFSSSVVSGLSQGSGEYFTRLGVGTPPKYLYM 145

Query: 113 AMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG--GGAC 167
            +DT +D  W+   PCT C   +  +F+ ++S +F  + C +  C+++ +P C      C
Sbjct: 146 VLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPLCRRLDSPGCSLKNNLC 205

Query: 168 AFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
            + ++YG  +    + S +T++     VP    GC     G  V   GLLGLGRG LS  
Sbjct: 206 QYQVSYGDGSFTFGDFSTETLTFRRAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFP 265

Query: 227 AQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
            QT   + + FSYCL    A +   S+  G     +  ++TPL+KNP+  + YYV LL I
Sbjct: 266 TQTGTRFNNKFSYCLTDRTASAKPSSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGI 325

Query: 287 RVGRR-VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
            VG   V  I     + + T   G IIDSGT  TRL  PAY ++RD FR           
Sbjct: 326 SVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPE 385

Query: 346 LGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
              FDTCY +     +  PT+ L F G +V+LP  N L+        C A A       S
Sbjct: 386 FSLFDTCYDLSGLSEVKVPTVVLHFRGADVSLPAANYLVPVDNSGSFCFAFAG----TMS 441

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            L++I N+QQQ  R+++D+  SR+G A   C
Sbjct: 442 GLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 129/350 (36%), Positives = 180/350 (51%), Gaps = 16/350 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
           S  Y  R  +GTP +   M +DT +D AW+ C  C  C S    +FN + S +F  +GC 
Sbjct: 154 SGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCD 213

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           +A C Q+    C  G C +  +YG  + +  + + +T++  T  V     GC  K  G  
Sbjct: 214 SAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTFGTTSVANVAIGCGHKNVGLF 273

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           +   GLLGLG G+LS   Q       TFSYCL   ++ S SG L+ GP   P    +TPL
Sbjct: 274 IGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRESDS-SGPLQFGPKSVPVGSIFTPL 332

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVD-IPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYT 327
            KNP   + YY+++ AI VG  ++D IPP   + + T+G  G IIDSGTV TRLV  AY 
Sbjct: 333 EKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYD 392

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHS 382
           AVRD F    G      ++  FDTCY +     +  PT+   FS G ++ LP  N LI  
Sbjct: 393 AVRDAFVAGTGQLPRTDAVSIFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKNYLIPM 452

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                 C A A A  +V    +++ N QQQ+ R+ +D  NS +G A + C
Sbjct: 453 DTVGTFCFAFAPAASSV----SIMGNTQQQHIRVSFDSANSLVGFAFDQC 498


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 135/431 (31%), Positives = 209/431 (48%), Gaps = 44/431 (10%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-----------LAVA 78
             S+L+V H   PC     + P     +  EML KDQ+R+ F+ S           L  +
Sbjct: 59  EQSSLEVIHRHGPCGDEVSNAP-----TAAEMLVKDQSRVDFIHSKIAGELESVDRLRGS 113

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV----GCSST 134
           + + +P  SG  I  S  YIV   +GTP + L +  DT +D  W  C  C          
Sbjct: 114 KATKIPAKSGATIG-SGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDP 172

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGG-ACAFNLTYGSSTIAAN-LSQDTI 187
           VF  +QSTT+ N+ C +  C Q+ +     P C    AC + + YG  + +    +++T+
Sbjct: 173 VFVPSQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETL 232

Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           +L +TD++  + FGC Q   G      GL+GLG+  +S++ QT   Y   FSYCLP  K 
Sbjct: 233 TLTSTDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCLP--KT 290

Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
            S +G L  G  G    +KYTP+ K    ++ Y V+++ ++VG     IP  +  F+ + 
Sbjct: 291 SSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGG--TQIPISSSVFSTS- 347

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTI 362
             G IIDSGTV TRL   AY+A++  F + +        L   DTCY +     I  P +
Sbjct: 348 --GAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKV 405

Query: 363 TLMFSGMNVTLPQDNL-LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
             +F G    L  D + +++  + S  CLA A   D   S + +I N+QQ+  +++YDV 
Sbjct: 406 GFVFKGGE-ELDLDGIGIMYGASTSQVCLAFAGNQD--PSTVAIIGNVQQKTLQVVYDVG 462

Query: 422 NSRLGVARELC 432
             ++G     C
Sbjct: 463 GGKIGFGYNGC 473


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  184 bits (467), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 138/387 (35%), Positives = 195/387 (50%), Gaps = 37/387 (9%)

Query: 72  LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
           ++ L+ AR  V P+ S  +   S  YI +  +GTP    L+A+DT++D  W+ C  C  C
Sbjct: 115 VAGLSSARGFVAPVVS--RAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRC 172

Query: 132 ---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG---GACAFNLTYGS-STIAANLSQ 184
              S  VF+   ST+++ +   AA C+ +     G    G C + + YG  ST   +  +
Sbjct: 173 YPQSGPVFDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIE 232

Query: 185 DTISLATDI-VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
           +T++ A  + +P  + GC     G    P  G+LGLGRG +S   Q    +  TFSYCL 
Sbjct: 233 ETLTFAGGVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQID--HNGTFSYCLV 290

Query: 243 SFKALSFSGSLR------LGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-RVVDI 295
            F  LS  GSL        G +     + +TP + N    + YYV L  I VG  RV  +
Sbjct: 291 DF--LSGPGSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGV 348

Query: 296 PPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG----FD 350
               LQ +P TG  G I+DSGT  TRL  PAYTA RD F R V  +L   S+GG    FD
Sbjct: 349 TERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAF-RAVAVDLGQVSIGGPSGFFD 407

Query: 351 TCYSVP----IVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           TCY+V        PT+++ F+G + V L   N LI   +    C A AA  D+    +++
Sbjct: 408 TCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDH---SVSI 464

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           I N+QQQ  RI+YD+   R+G A   C
Sbjct: 465 IGNIQQQGFRIVYDI-GGRVGFAPNSC 490


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 126/395 (31%), Positives = 191/395 (48%), Gaps = 34/395 (8%)

Query: 60  EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT------YIVRAKIGTPAQTLLMA 113
           E + +   R+ F +        + P A G Q  QSP       Y++   +G+P Q+  + 
Sbjct: 2   EAVQRSHERVAFYT------LKLSPDAFGSQEFQSPVKAGNGEYLMTLTLGSPPQSFDVI 55

Query: 114 MDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCK--QVPNPTCGGGACA 168
           +DT +D  WV C  C  C       F+ ++S +F+   C    C    +P   C    C 
Sbjct: 56  VDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANVCQ 115

Query: 169 FNLTYGS-STIAANLSQDTISL----ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
           +  TYG  S    +L+ +TISL     T  VP + FGC  +  G      GL+GLG+G L
Sbjct: 116 YQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPL 175

Query: 224 SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNL 283
           SL +Q  + + + FSYCL S  +LS S  L  G I     I+YT ++ N R  + YYV L
Sbjct: 176 SLNSQLSHTFANKFSYCLVSLNSLSAS-PLTFGSIAAAANIQYTSIVVNARHPTYYYVQL 234

Query: 284 LAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
            +I VG + +++ P     + +TG  GTIIDSGT  T L  PAY+AV   +   V     
Sbjct: 235 NSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRL 294

Query: 343 VTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIH-STAGSITCLAMAAAPD 397
             S  G D C+++  V+    P +   F G +  +  +NL +   T+ +  CLAM  +  
Sbjct: 295 DGSAYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGGSQG 354

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                 ++I N+QQQNH ++YD+   ++G A   C
Sbjct: 355 -----FSIIGNIQQQNHLVVYDLEAKKIGFATADC 384


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 133/392 (33%), Positives = 189/392 (48%), Gaps = 25/392 (6%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPI-ASGRQITQSPT---------YIVRAKIGTPAQTLL 111
           LA+D +R++ L+SLA A  S     A G   + S T         Y  R  +GTPA+ + 
Sbjct: 102 LARDASRVKSLTSLAAAVGSTNRTRARGPGFSSSVTSGLAQGSGEYFTRLGVGTPARYVF 161

Query: 112 MAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-- 166
           M +DT +D  W+ C  C  C S    VFN  +S +F N+ C +  C+++ +P C      
Sbjct: 162 MVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPLCRRLDSPGCSTKKHI 221

Query: 167 CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSL 225
           C + ++YG  +      S +T++     V     GC     G  +   GLLGLGRG LS 
Sbjct: 222 CLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDNEGLFIGAAGLLGLGRGRLSF 281

Query: 226 LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLA 285
            +Q    +   FSYCL    A S    +  G     +  ++TPL+ NP+  + YYV LL 
Sbjct: 282 PSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLG 341

Query: 286 IRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
           + V G RV  I     + + T   G IIDSGT  TRL  PAY A+RD FR    +     
Sbjct: 342 VSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAP 401

Query: 345 SLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
               FDTC+ +     +  PT+ L F G +V+LP  N LI        C A A       
Sbjct: 402 EFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLIPVDNSGSFCFAFAG----TM 457

Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           S L+++ N+QQQ  R++YD+  SR+G A   C
Sbjct: 458 SGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 139/425 (32%), Positives = 210/425 (49%), Gaps = 53/425 (12%)

Query: 32  STLQVFHVFSPCSPF-------KPSKPLSWEESVLEMLAKDQARLQFL-----------S 73
           ++L+V H   PCS         K + P S      ++L +D+ R++++           S
Sbjct: 70  ASLEVVHKHGPCSQLNDHDGKAKSTTPHS------DILNQDKERVKYINSRLSKNLGQDS 123

Query: 74  SLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT----GCV 129
           S+     + +P  SG  I  S  Y V   +GTP + L +  DT +D  W  C      C 
Sbjct: 124 SVEELDSATLPAKSGSLI-GSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCY 182

Query: 130 GCSSTVFNSAQSTTFKNLGCQAAQCKQVP-----NPTCGGG--ACAFNLTYGSSTIAAN- 181
                +F+ ++ST++ N+ C +A C Q+      +P C     AC + + YG S+ +   
Sbjct: 183 KQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGY 242

Query: 182 LSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
            S++ +++ ATD+V  + FGC Q   G      GL+GLGR  +S + QT   Y+  FSYC
Sbjct: 243 FSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYC 302

Query: 241 LPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
           LPS    S +G L  GP    + +KYTP     R SS Y +++ AI VG   V +P  + 
Sbjct: 303 LPS--TSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGG--VKLPVSSS 358

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP---- 356
            F  +TG G IIDSGTV TRL   AY A+R  FR+ +    +   L   DTCY +     
Sbjct: 359 TF--STG-GAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKV 415

Query: 357 IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
              PTI   F+ G+ V LP   +L  ++   + CLA AA  D  +S + +  N+QQ+   
Sbjct: 416 FSIPTIEFSFAGGVTVKLPPQGILFVASTKQV-CLAFAANGD--DSDVTIYGNVQQRTIE 472

Query: 416 ILYDV 420
           ++YDV
Sbjct: 473 VVYDV 477


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 139/427 (32%), Positives = 201/427 (47%), Gaps = 30/427 (7%)

Query: 26  DTQDHSST--LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV 83
           DT + S+T  +Q+ HV +      P      E      L +D AR++ +S LA    +  
Sbjct: 52  DTAESSATFSVQLHHVDALSFNSTP------ETLFTTRLQRDAARVEAISYLAETAGTGK 105

Query: 84  PIASGRQIT-------QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSS 133
            + +G   +        S  Y  R  +GTP + + M +DT +D  W+   PC  C   S 
Sbjct: 106 RVGTGFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSD 165

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTIA-ANLSQDTISLA 190
            VF+  +S +F ++ C++  C ++ +P C      C + ++YG  +    + S +T++  
Sbjct: 166 PVFDPRKSRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFR 225

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
              V     GC     G  V   GLLGLGRG LS  +QT   +   FSYCL    A S  
Sbjct: 226 RTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKP 285

Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAG 309
            S+  G     +  ++TPL+ NP+  + YYV LL I V G RV  I     + + T   G
Sbjct: 286 SSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGG 345

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLM 365
            IIDSGT  TRL  PAY A RD FR    +         FDTC+ +     +  PT+ L 
Sbjct: 346 VIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLH 405

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           F G +V+LP  N LI        CLA A         L++I N+QQQ  R++YD+  SR+
Sbjct: 406 FRGADVSLPASNYLIPVDTSGNFCLAFAGTMGG----LSIIGNIQQQGFRVVYDLAGSRV 461

Query: 426 GVARELC 432
           G A   C
Sbjct: 462 GFAPHGC 468


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 131/384 (34%), Positives = 189/384 (49%), Gaps = 20/384 (5%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDA 120
           L +D  R+  L+S A    S V   SG  ++Q S  Y  R  +GTP + L M +DT +D 
Sbjct: 78  LHRDTLRVHALNSRAAGFSSSV--VSG--LSQGSGEYFTRLGVGTPPRYLYMVLDTGSDV 133

Query: 121 AWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG--GACAFNLTYGS 175
            W+ C+ C  C   S  +FN  +S +F  + C +  C+++ +  C      C + ++YG 
Sbjct: 134 VWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD 193

Query: 176 STI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
            +    + + +T++   + +     GC     G  V   GLLGLGRG LS  +QT   + 
Sbjct: 194 GSFTTGDFATETLTFRGNKIAKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFN 253

Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-RVV 293
             FSYCL    A S   S+  G     +  ++TPL++NP+  + YYV L+ I VG  RV 
Sbjct: 254 HKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVR 313

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
            + P   + +     G IIDSGT  TRL  PAYTA+RD FR              FDTCY
Sbjct: 314 GVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCY 373

Query: 354 SV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
            +     +  PT+ L F G ++ LP  N LI        C A A       S L++I N+
Sbjct: 374 DLSGQSSVKVPTVVLHFRGADMALPATNYLIPVDENGSFCFAFAG----TISGLSIIGNI 429

Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
           QQQ  R++YD+  SR+G A   CT
Sbjct: 430 QQQGFRVVYDLAGSRIGFAPRGCT 453


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 123/408 (30%), Positives = 195/408 (47%), Gaps = 22/408 (5%)

Query: 35  QVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS--LAVARKSVVPIASGRQIT 92
           ++ H   P SP + +   +  E  L  + +   R   LS   LA  R    P+ASG    
Sbjct: 21  ELIHREHPSSPLRSNTSKTTTEIFLAAVKRGAERRAQLSKHILAEGRLFSTPVASGNG-- 78

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGC 149
               Y++    G+P Q   + +DT +D  W   +PC  C   +S +F+  +S+T+  + C
Sbjct: 79  ---EYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSC 135

Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
            +  C  +P  +C   +C ++  YG  S+ +  LS +T+++ T  +P   FGC     G+
Sbjct: 136 ASNFCSSLPFQSCTT-SCKYDYMYGDGSSTSGALSTETVTVGTGTIPNVAFGCGHTNLGS 194

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
                G++GLG+G LSL++Q  ++    FSYCL    +   S  + +G       + YT 
Sbjct: 195 FAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTS-PMLIGDSAAAGGVAYTA 253

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           LL N    + YY +L  I V  + V  P G    + +   G I+DSGT  T L   A+ A
Sbjct: 254 LLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGAFNA 313

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIHSTA 384
           +    +  V       SL G D C+S   VA    PT+T  F G +  LP +N+ +    
Sbjct: 314 LVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELPPENVFVALDT 373

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           G   CLAMAA+     +  +++ N+QQQNH I++D+ N R+G     C
Sbjct: 374 GGSICLAMAAS-----TGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 119/399 (29%), Positives = 199/399 (49%), Gaps = 32/399 (8%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSS--LAVARKSVVPIASGRQITQSPTYIVRAKIGTPA 107
           K L+  E +   +A+ + RL  L++  LA A  +V        +  +  ++++  IG+P 
Sbjct: 317 KNLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPP 376

Query: 108 QTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
           ++    MDT +D  W    PC  C   S+ +F+  QS++F  + C +  C  +P  TC  
Sbjct: 377 RSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSS 436

Query: 165 GACAFNLTYG-SSTIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPP-QGLLG 217
             C +  TYG SS+    L+ +T +          +PG  FGC     G+      GL+G
Sbjct: 437 DGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVG 496

Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PK----RIKYTPLLK 271
           LGRG LSL++Q   L +  F+YCL +    S   SL LG +    PK     +K TPL+K
Sbjct: 497 LGRGPLSLVSQ---LKEQKFAYCLTAIDD-SKPSSLLLGSLANITPKTSKDEMKTTPLIK 552

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           NP + S YY++L  I VG   + IP    + +     G IIDSGT  T +   A+T++++
Sbjct: 553 NPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKN 612

Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
            F  ++   +  +  GG D C+++P     +  P +T  F G ++ LP +N +I  +   
Sbjct: 613 EFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAG 672

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           + CLA+ ++       +++  N+QQQN  +++D+    L
Sbjct: 673 LLCLAIGSSRG-----MSIFGNLQQQNFMVVHDLQEETL 706


>gi|302142046|emb|CBI19249.3| unnamed protein product [Vitis vinifera]
          Length = 191

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 93/172 (54%), Positives = 123/172 (71%), Gaps = 5/172 (2%)

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
            +++TP+L  P     + + +    VGR +V + P  L F+P TGAGTIIDSGTV TR V
Sbjct: 22  HLRFTPMLCAPVHPGPWPL-VPHHGVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFV 80

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLPQDNLLI 380
            P Y A+RD FR++V       ++G FDTC++     +AP +T  F+GM++ LP +N LI
Sbjct: 81  EPVYAAIRDEFRKQVKGPFA--TIGAFDTCFAATNEDIAPPVTFHFTGMDLKLPLENTLI 138

Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           HS+AGS+ CLAMAAAP+NVNSVLNVIAN+QQQN RI++DV NSRLG+ARELC
Sbjct: 139 HSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 190


>gi|242059939|ref|XP_002459115.1| hypothetical protein SORBIDRAFT_03g046190 [Sorghum bicolor]
 gi|241931090|gb|EES04235.1| hypothetical protein SORBIDRAFT_03g046190 [Sorghum bicolor]
          Length = 153

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 88/153 (57%), Positives = 119/153 (77%), Gaps = 3/153 (1%)

Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           ++ IRVG + V +P  AL F+PT+G GTI+D+GT+FTRL AP Y AVRD FRRRV + + 
Sbjct: 1   MVGIRVGGKPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDAFRRRVRAPVA 60

Query: 343 VTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAA-PDNVN 400
              LGGFDTCY+V +  PT+T +F G ++VTLP++N++I S++G I CLAMAA  PD V+
Sbjct: 61  -GPLGGFDTCYNVTVSVPTVTFVFDGPVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVD 119

Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           + LNV+A+MQQQNHR+L+DV N R+G +RELCT
Sbjct: 120 AALNVLASMQQQNHRVLFDVANGRVGFSRELCT 152


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 119/399 (29%), Positives = 199/399 (49%), Gaps = 32/399 (8%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSS--LAVARKSVVPIASGRQITQSPTYIVRAKIGTPA 107
           K L+  E +   +A+ + RL  L++  LA A  +V        +  +  ++++  IG+P 
Sbjct: 62  KNLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPP 121

Query: 108 QTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
           ++    MDT +D  W    PC  C   S+ +F+  QS++F  + C +  C  +P  TC  
Sbjct: 122 RSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSS 181

Query: 165 GACAFNLTYG-SSTIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPP-QGLLG 217
             C +  TYG SS+    L+ +T +          +PG  FGC     G+      GL+G
Sbjct: 182 DGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVG 241

Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PK----RIKYTPLLK 271
           LGRG LSL++Q   L +  F+YCL +    S   SL LG +    PK     +K TPL+K
Sbjct: 242 LGRGPLSLVSQ---LKEQKFAYCLTAIDD-SKPSSLLLGSLANITPKTSKDEMKTTPLIK 297

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           NP + S YY++L  I VG   + IP    + +     G IIDSGT  T +   A+T++++
Sbjct: 298 NPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKN 357

Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
            F  ++   +  +  GG D C+++P     +  P +T  F G ++ LP +N +I  +   
Sbjct: 358 EFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAG 417

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           + CLA+ ++       +++  N+QQQN  +++D+    L
Sbjct: 418 LLCLAIGSSRG-----MSIFGNLQQQNFMVVHDLQEETL 451


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 132/399 (33%), Positives = 193/399 (48%), Gaps = 29/399 (7%)

Query: 53  SWEESVLEMLAKDQARLQFLSS-LAVA-------RKSVVPIASGRQITQSPTYIVRAKIG 104
           S   +VL+++A+D AR ++L+S L+ A         S   + SG     S  Y VR  IG
Sbjct: 76  SRRHAVLDLVARDNARAEYLASRLSPAAYQPTGFSGSESKVVSGLD-EGSGEYFVRVGIG 134

Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
           +P     + +D+ +D  WV C  C+ C   +  +F+ A S TF  + C +A C+ +    
Sbjct: 135 SPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVCRTLRTSG 194

Query: 162 CG-GGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLG 219
           CG  G C + ++YG  S     L+ +T++L    V G   GC  +  G  V   GLLGLG
Sbjct: 195 CGDSGGCDYEVSYGDGSYTKGALALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLG 254

Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSSL 278
            G +SL+ Q        FSYCL S  A    GSL LG     P+   + PL++NP+  S 
Sbjct: 255 WGPMSLVGQLGGAAGGAFSYCLASRGA----GSLVLGRSEAVPEGAVWVPLVRNPQAPSF 310

Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG 338
           YYV L  I VG   + +     Q       G ++D+GT  TRL   AY A+RD F   VG
Sbjct: 311 YYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVG 370

Query: 339 SNLTVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMA 393
           +      +   DTCY +     +  PT++  F G   +TLP  NLL+    G I CLA A
Sbjct: 371 ALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLE-VDGGIYCLAFA 429

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            +    +S  +++ N+QQ+  +I  D  N  +G     C
Sbjct: 430 PS----SSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 203/433 (46%), Gaps = 46/433 (10%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG 88
           + S TL + H+ +  S   P      +E     L +D  R++ +++LA      +P   G
Sbjct: 69  ESSITLNLDHIDALSSNKTP------QELFSSRLQRDSRRVKSIATLAAQ----IP---G 115

Query: 89  RQITQSP------------------TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
           R +T +P                   Y  R  +GTPA+ + M +DT +D  W+ C  C  
Sbjct: 116 RNVTHAPRTGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR 175

Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG--GACAFNLTYGSSTI-AANLSQ 184
           C   S  +F+  +S T+  + C +  C+++ +  C      C + ++YG  +    + S 
Sbjct: 176 CYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFST 235

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           +T++   + V G   GC     G  V   GLLGLG+G LS   QT + +   FSYCL   
Sbjct: 236 ETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDR 295

Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFN 303
            A S   S+  G     +  ++TPLL NP+  + YYV LL I V G RV  +     + +
Sbjct: 296 SASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLD 355

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVA 359
                G IIDSGT  TRL+ PAY A+RD FR    +         FDTC+ +     +  
Sbjct: 356 QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKV 415

Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           PT+ L F G +V+LP  N LI        C A A         L++I N+QQQ  R++YD
Sbjct: 416 PTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYD 471

Query: 420 VPNSRLGVARELC 432
           + +SR+G A   C
Sbjct: 472 LASSRVGFAPGGC 484


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  181 bits (458), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 139/428 (32%), Positives = 213/428 (49%), Gaps = 40/428 (9%)

Query: 38  HVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---LAVA---------------- 78
           H  S  SP++P+   +    V   L +D+ RL  +SS   L VA                
Sbjct: 2   HRDSADSPYRPANA-TVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60

Query: 79  ---RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCS 132
              +    P+ SG     S  Y V   +GTP +T+ M  DT +D  W+   PC  C G +
Sbjct: 61  FLQQDFETPLRSGLS-DGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQT 119

Query: 133 STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLAT 191
             +FN + S+TF+++ C ++ C+Q+    C    C + ++YG  +      S +T+S  +
Sbjct: 120 DPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGS 179

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
           + V     GC     G      GLLGLG+G LS  +Q   LY S FSYCLP+ ++   S 
Sbjct: 180 NAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG-SV 238

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGT 310
            L  G        ++T LL NP+  + YYV ++ I+VG   V+IP G+L  + +TG  G 
Sbjct: 239 PLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGV 298

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----PIVAPTITLM 365
           I+DSGT  TRLV  AY  +RD FR  + S+  +TS    FDTCY +     I+ P ++ +
Sbjct: 299 ILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFV 358

Query: 366 FS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           F+ G  + LP  N+++        CLA A   +N     ++I N+QQQ+ R+ +D   +R
Sbjct: 359 FNGGATMALPAQNIMVPVDNSGTYCLAFAPNSEN----FSIIGNIQQQSFRMSFDSTGNR 414

Query: 425 LGVARELC 432
           +G+    C
Sbjct: 415 VGIGANQC 422


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 133/430 (30%), Positives = 207/430 (48%), Gaps = 39/430 (9%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-----------SSLAVARK 80
           ++L+V H   PCS    +       S  +++  D  R++++           +S+     
Sbjct: 61  ASLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVKELDS 120

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVF 136
           + +P  SG  I  S  Y V   +GTP + L +  DT +D  W  C  C G        +F
Sbjct: 121 TTLPAKSGSLI-GSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIF 179

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGS-STIAANLSQDTISL 189
           + ++S+++ N+ C ++ C Q+ +       +    AC + + YG  ST    LSQ+ +++
Sbjct: 180 DPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTI 239

Query: 190 -ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
            ATDIV  + FGC Q   G      GL+GLGR  +S + QT ++Y   FSYCLPS    S
Sbjct: 240 TATDIVDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLPS--TSS 297

Query: 249 FSGSLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
             G L  G        +KYTPL      ++ Y ++++ I VG     +P  A+  +  + 
Sbjct: 298 SLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGG--TKLP--AVSSSTFSA 353

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTIT 363
            G+IIDSGTV TRL   AY A+R  FR+ +         G FDTCY       I  P I 
Sbjct: 354 GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKID 413

Query: 364 LMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
             F+ G+ V LP   +LI  +A  + CLA AA  ++ +  + +  N+QQ+   ++YDV  
Sbjct: 414 FEFAGGVTVELPLVGILIGRSAQQV-CLAFAANGNDND--ITIFGNVQQKTLEVVYDVEG 470

Query: 423 SRLGVARELC 432
            R+G     C
Sbjct: 471 GRIGFGAAGC 480


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 130/377 (34%), Positives = 184/377 (48%), Gaps = 43/377 (11%)

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFN 137
           S +P ASG        Y     +GTP    L+ +DT +D  W+ C  CV C    S +++
Sbjct: 90  SGLPFASGE-------YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYD 142

Query: 138 SAQSTTFKNLGCQAAQCKQVPNP-TCGG--GACAFNLTYG-SSTIAANLSQDTISLATDI 193
              S+T+    C   QC+   NP TC G  G C + + YG +S+ + NL+ D +  + D 
Sbjct: 143 PRGSSTYAQTPCSPPQCR---NPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDT 199

Query: 194 VPG-YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSG 251
             G  T GC     G      GLLG+ RG+ S   Q  + Y   F+YCL    ++ S S 
Sbjct: 200 SVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSS 259

Query: 252 SLRLGPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPTTG-A 308
            L  G    +P    +TPL  NPRR SLYYV+++   VG   V      +L  +P TG  
Sbjct: 260 YLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRG 319

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFR--------RRVGSNLTVTSLGGFDTCYSVPIV-- 358
           G ++DSGT  TR    AY A+RD F         R+VG  ++V     FD CY +  V  
Sbjct: 320 GVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISV-----FDACYDLRGVAV 374

Query: 359 --APTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
             AP + L F+ G +V LP +N L+   +G   C A+ AA    +  L+VI N+ QQ  R
Sbjct: 375 ADAPGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAG---HDGLSVIGNVLQQRFR 431

Query: 416 ILYDVPNSRLGVARELC 432
           +++DV N R+G     C
Sbjct: 432 VVFDVENERVGFEPNGC 448


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 133/423 (31%), Positives = 207/423 (48%), Gaps = 41/423 (9%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-------SSLAVARKSV 82
           + ++L+V H   PCS            +++E+L +DQ+R+  +       S +     + 
Sbjct: 63  NKASLKVVHKHGPCSQLNQQN--GNAPNLVEILLEDQSRVDSIHAKLSDHSGVKETDAAK 120

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
           +P  SG  +  +  YIV   +G+P + L++  DT +D  W  C+     ++  F+  +ST
Sbjct: 121 LPTKSGMSL-GTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCS-----AAETFDPTKST 174

Query: 143 TFKNLGCQAAQCKQV----PNPT-CGGGACAFNLTYGSSTIAAN-LSQDTISL-ATDIVP 195
           ++ N+ C    C  V     NP+ C    C + + YG  + +   L ++ +++ +TDI  
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTDIFN 234

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
            + FGC Q   G      GLLGLGR  LS+++QT   Y   FSYCLPS  +  F   L  
Sbjct: 235 NFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGF---LSF 291

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           G   Q K  K+TPL   P  SS Y ++L  I VG + + IP         + AGTIIDSG
Sbjct: 292 GS-SQSKSAKFTPLSSGP--SSFYNLDLTGITVGGQKLAIPLSVF-----STAGTIIDSG 343

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMN 370
           TV TRL   AY+A+R  FR+ + S      L   DTCY       I  P I + FS G++
Sbjct: 344 TVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFSGGVD 403

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           V + Q  + + +    + CLA A      ++   +  N QQ+N  ++YDV   ++G A  
Sbjct: 404 VDVDQAGIFVANGLKQV-CLAFAGNTGARDTA--IFGNTQQRNFEVVYDVSGGKVGFAPA 460

Query: 431 LCT 433
            C+
Sbjct: 461 SCS 463


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 139/428 (32%), Positives = 212/428 (49%), Gaps = 40/428 (9%)

Query: 38  HVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---LAVA---------------- 78
           H  S  SP++P+   +    V   L +D+ RL  +SS   L VA                
Sbjct: 2   HRDSADSPYRPANA-TVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60

Query: 79  ---RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCS 132
              +    P+ SG     S  Y V   +GTP +T+ M  DT +D  W+   PC  C G +
Sbjct: 61  FLQQDFETPLRSGLS-DGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQT 119

Query: 133 STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLAT 191
             +FN + S+TF+++ C ++ C+Q+    C    C + ++YG  +      S +T+S  +
Sbjct: 120 DPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGS 179

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
           + V     GC     G      GLLGLG+G LS  +Q   LY S FSYCLP+ ++   S 
Sbjct: 180 NAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG-SV 238

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGT 310
            L  G        ++T LL NP+  + YYV ++ I+VG   V IP G+L  + +TG  G 
Sbjct: 239 PLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGV 298

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----PIVAPTITLM 365
           I+DSGT  TRLV  AY  +RD FR  + S+  +TS    FDTCY +     I+ P ++ +
Sbjct: 299 ILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFV 358

Query: 366 FS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           F+ G  + LP  N+++        CLA A   +N     ++I N+QQQ+ R+ +D   +R
Sbjct: 359 FNGGATMALPAQNIMVPVDNSGTYCLAFAPNSEN----FSIIGNIQQQSFRMSFDSTGNR 414

Query: 425 LGVARELC 432
           +G+    C
Sbjct: 415 VGIGANQC 422


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 115/368 (31%), Positives = 174/368 (47%), Gaps = 30/368 (8%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS---TVFNSAQ 140
           P+A+ R       Y+   ++GTP +   + +DT +D  WV C+ C  C S    +F    
Sbjct: 5   PVAAARG-----EYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNT 59

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN------LSQDTISLATDIV 194
           ST+F  L C +A C  +P P C    C +  +YG  ++         ++ D I+     V
Sbjct: 60  STSFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQV 119

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-L 253
           P + FGC     G+     G+LGLG+G LS  +Q +++Y   FSYCL  + A     S L
Sbjct: 120 PNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPL 179

Query: 254 RLGPIGQP--KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
             G    P    +KY P+L NP+  + YYV L  I VG  +++I       +   GAGTI
Sbjct: 180 LFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTI 239

Query: 312 IDSGTVFTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYS------VPIVAPTITL 364
            DSGT  T+L   AY  V        +  +  +  +   D C S      +P V P +T 
Sbjct: 240 FDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPTV-PAMTF 298

Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
            F G ++ LP  N  I+  +    C AM ++PD     +N+I ++QQQN ++ YD    +
Sbjct: 299 HFEGGDMVLPPSNYFIYLESSQSYCFAMTSSPD-----VNIIGSVQQQNFQVYYDTAGRK 353

Query: 425 LGVARELC 432
           LG   + C
Sbjct: 354 LGFVPKDC 361


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 137/410 (33%), Positives = 194/410 (47%), Gaps = 36/410 (8%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLAVARK--------------SVVPIASGRQITQ----- 93
           S+E  + E L ++ AR++ L    + RK              + V    G ++       
Sbjct: 92  SYERRLEEKLRREAARVRALEQ-RIERKLKLKKDPAGSYENVAGVTAEFGSEVVSGMEQG 150

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
           S  Y  R  IGTP +   M +DT +D  W+ C  C  C S    +FN + S +F  +GC 
Sbjct: 151 SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCD 210

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           +A C Q+    C GG C + ++YG  S    + + +T++  T  +     GC     G  
Sbjct: 211 SAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGHDNVGLF 270

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           V   GLLGLG GSLS  AQ        FSYCL    + S SG+L  GP   P    +TPL
Sbjct: 271 VGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSES-SGTLEFGPESVPIGSIFTPL 329

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVD-IPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYT 327
           + NP   + YY++++AI VG  ++D +P  A + + TTG  G IIDSGT  TRL   AY 
Sbjct: 330 VANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYD 389

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHS 382
           A+RD F            +  FDTCY +     +  P +   FS G    LP  N LI  
Sbjct: 390 ALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPM 449

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            +    C A A A  N    L+++ N+QQQ  R+ +D  NS +G A + C
Sbjct: 450 DSMGTFCFAFAPADSN----LSIMGNIQQQGIRVSFDSANSLVGFAIDQC 495


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 126/350 (36%), Positives = 173/350 (49%), Gaps = 16/350 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
           S  Y  R  IGTP +   M +DT +D  W+ C  C  C S    +FN + S +F  +GC 
Sbjct: 5   SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCD 64

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           +A C Q+    C GG C + ++YG  S    + + +T++  T  +     GC     G  
Sbjct: 65  SAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGHDNVGLF 124

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           V   GLLGLG GSLS  AQ        FSYCL    + S SG+L  GP   P    +TPL
Sbjct: 125 VGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSES-SGTLEFGPESVPIGSIFTPL 183

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVD-IPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYT 327
           + NP   + YY++++AI VG  ++D +P  A + + TTG  G IIDSGT  TRL   AY 
Sbjct: 184 VANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYD 243

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHS 382
           A+RD F            +  FDTCY +     +  P +   FS G    LP  N LI  
Sbjct: 244 ALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPM 303

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            +    C A A A  N    L+++ N+QQQ  R+ +D  NS +G A + C
Sbjct: 304 DSMGTFCFAFAPADSN----LSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 123/359 (34%), Positives = 179/359 (49%), Gaps = 32/359 (8%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTPA+     +DT +D  W    PC  CV   +  F+ A S+T+++LGC A  
Sbjct: 92  YLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAPA 151

Query: 154 CKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATD----IVPGYTFGCIQKATGN 208
           C  +  P C    C +   YG S++ A  L+ +T +  T+     +P  +FGC     G+
Sbjct: 152 CNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLNAGS 211

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPIGQPKRI 264
                G++G GRGSLSL++Q   L    FSYCL SF    ++  + G+           +
Sbjct: 212 LANGSGMVGFGRGSLSLVSQ---LGSPRFSYCLTSFLSPVRSRLYFGAYATLNSTNASTV 268

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVA 323
           + TP + NP   ++Y++N+  I VG   + I P  L  N T G  GTIIDSGT  T L  
Sbjct: 269 QSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAE 328

Query: 324 PAYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSVP------IVAPTITLMFSGMNVTLP 374
           PAY AVR+ F   + S    L VT     DTC+  P      +  P + L F G +  LP
Sbjct: 329 PAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDGADWELP 388

Query: 375 -QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            Q+ +L+  + G + CLAMA + D      ++I + Q QN  +LYD+ NS L      C
Sbjct: 389 LQNYMLVDPSTGGL-CLAMATSSDG-----SIIGSYQHQNFNVLYDLENSLLSFVPAPC 441


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 125/352 (35%), Positives = 179/352 (50%), Gaps = 22/352 (6%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
           S  Y  R  +GTPA+++ M  DT +D +W+ C+ C  C      +FN + S++FK L C 
Sbjct: 78  SGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACA 137

Query: 151 AAQCKQVPNPTCG-GGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGN 208
           ++ C ++    C     C + ++YG  +    + S +T+S     V     GC +   G 
Sbjct: 138 SSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAMGCGRNNQGL 197

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
                GLLGLGRG LS  +QT   Y S FSYCLP  +  + + SL  GP   P++ ++T 
Sbjct: 198 FHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPR-RESAIAASLVFGPSAVPEKARFTK 256

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           LL N R  + YYV L  IRV    V+IPP A         G I+DSGT  +RL  PAYTA
Sbjct: 257 LLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTA 316

Query: 329 VRDVFRRRVGSNLTVTSLGG---FDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLI 380
           +RD FR    S +T  S  G   FDTCY +  +     P + L F  G ++ LP D +L+
Sbjct: 317 LRDAFR----SLVTFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILV 372

Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +       CLA A   +      ++I N+QQQ  RI  D    ++G+A + C
Sbjct: 373 NVDDEGTYCLAFAPEEE----AFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 141/432 (32%), Positives = 207/432 (47%), Gaps = 59/432 (13%)

Query: 33  TLQVFHVFSPCSPFKPSK-PLSWEESVLEMLAKDQARLQFLS---------SLAVARKSV 82
           T+ + H   PCSP   +K P S EE     L +DQ R  ++           +  +  + 
Sbjct: 62  TVPLHHRHGPCSPVPSNKMPASLEE----RLQRDQLRAAYIKRKFSGAKGGDVEQSDAAT 117

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSA 139
           VP   G  ++ +  Y++   IG+PA T  M+MDT +D +WV C  C  C S V   F+ +
Sbjct: 118 VPTTLGTSLS-TLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPS 176

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTY--GSSTIAANLSQDTISLATDI 193
            S+T+    C +A C Q+     G G     C + ++Y  GSST     S DT++L ++ 
Sbjct: 177 ASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTT-GTYSSDTLTLGSNA 235

Query: 194 VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
           + G+ FGC Q  +G  S    GL+GLG  + SL++QT   +   FSYCLP       SG 
Sbjct: 236 IKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTPGS--SGF 293

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           L LG   +   +K TP+L++ +  + Y V L AIRVG + ++IP           AG+++
Sbjct: 294 LTLGAASRSGFVK-TPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFS------AGSVM 346

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG 368
           DSGTV TRL   AY+A+   F+  +         G  DTC+       +  P++ L+FSG
Sbjct: 347 DSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 406

Query: 369 MNVT--------LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
             V         L  DN           CLA AA  D  +S L  I N+QQ+   +LYDV
Sbjct: 407 GAVVNLDFNGIMLELDNW----------CLAFAANSD--DSSLGFIGNVQQRTFEVLYDV 454

Query: 421 PNSRLGVARELC 432
               +G     C
Sbjct: 455 GGGAVGFRAGAC 466


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 130/373 (34%), Positives = 189/373 (50%), Gaps = 28/373 (7%)

Query: 82  VVPIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFN 137
           V P+ SG  + Q S  Y  +  +GTPA   LM +DT +D  W+ C  C  C   S  VF+
Sbjct: 128 VAPVVSG--LAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFD 185

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI- 193
             +S ++  +GC A  C+++ +  C     AC + + YG  ++ A + + +T++ A    
Sbjct: 186 PRRSRSYGAVGCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGAR 245

Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL----PSFKALSF 249
           V     GC     G  V   GLLGLGRGSLS  AQ    Y  +FSYCL     S    S 
Sbjct: 246 VARIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASH 305

Query: 250 SGSLRL--GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT 306
           S ++    G +G      +TP++KNPR  + YYV L+ I V G RV  +    L+ +P++
Sbjct: 306 SSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSS 365

Query: 307 G-AGTIIDSGTVFTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAP 360
           G  G I+DSGT  TRL  PAY+A+RD FR    G  L+      FDTCY +     +  P
Sbjct: 366 GRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVP 425

Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           T+++ F+ G    LP +N LI   +    C A A     V    ++I N+QQQ  R+++D
Sbjct: 426 TVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFD 481

Query: 420 VPNSRLGVARELC 432
               R+G   + C
Sbjct: 482 GDGQRVGFVPKGC 494


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 132/397 (33%), Positives = 184/397 (46%), Gaps = 23/397 (5%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ---------SPTYIVRAKIGT 105
           EE     L +D  R++ LSSL    +++         +          S  Y  R  +GT
Sbjct: 78  EELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGT 137

Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTC 162
           P + + M +DT +D  W+ C  C  C S    VFN  +S +F  + C+   C+++ +P C
Sbjct: 138 PPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGC 197

Query: 163 GG-GACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGR 220
                C + ++YG  S        +T++     V     GC     G  V   GLLGLGR
Sbjct: 198 NQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGR 257

Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY 280
           G LS  +Q    +   FSYCL    A S   S+  G     +  ++TPLL NPR  + YY
Sbjct: 258 GGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYY 317

Query: 281 VNLLAIRVGRR-VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
           V LL I VG   V  I     + + T   G IID GT  TRL  PAY A+RD FR    S
Sbjct: 318 VELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASS 377

Query: 340 NLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
             +      FDTCY +     +  PT+ L F G +V+LP  N LI        C A A  
Sbjct: 378 LKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAG- 436

Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                S L++I N+QQQ  R++YD+ +SR+G +   C
Sbjct: 437 ---TTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 125/352 (35%), Positives = 179/352 (50%), Gaps = 22/352 (6%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
           S  Y  R  +GTPA+++ M  DT +D +W+ C+ C  C      +FN + S++FK L C 
Sbjct: 11  SGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACA 70

Query: 151 AAQCKQVPNPTCG-GGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGN 208
           ++ C ++    C     C + ++YG  +    + S +T+S     V     GC +   G 
Sbjct: 71  SSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAMGCGRNNQGL 130

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
                GLLGLGRG LS  +QT   Y S FSYCLP  +  + + SL  GP   P++ ++T 
Sbjct: 131 FHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPR-RESAIAASLVFGPSAVPEKARFTK 189

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           LL N R  + YYV L  IRV    V+IPP A         G I+DSGT  +RL  PAYTA
Sbjct: 190 LLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTA 249

Query: 329 VRDVFRRRVGSNLTVTSLGG---FDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLI 380
           +RD FR    S +T  S  G   FDTCY +  +     P + L F  G ++ LP D +L+
Sbjct: 250 LRDAFR----SLVTFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILV 305

Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +       CLA A   +      ++I N+QQQ  RI  D    ++G+A + C
Sbjct: 306 NVDDEGTYCLAFAPEEE----AFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 134/392 (34%), Positives = 186/392 (47%), Gaps = 25/392 (6%)

Query: 62  LAKDQARLQFLSSLAVA------RKSVVPIASGRQIT----QSPTYIVRAKIGTPAQTLL 111
           L +D AR++ L SLA         ++  P  S   I+     S  Y  R  +GTPA+ + 
Sbjct: 100 LVRDAARVKSLISLAATVGGTNLTRARGPGFSSSVISGLAQGSGEYFTRLGVGTPARYVY 159

Query: 112 MAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-- 166
           M +DT +D  W+ C  C+ C S    VF+  +S +F N+ C +  C+++  P C      
Sbjct: 160 MVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPLCRRLDYPGCSTKKQI 219

Query: 167 CAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSL 225
           C + ++YG  +      S +T++     V     GC     G  V   GLLGLGRG LS 
Sbjct: 220 CLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSF 279

Query: 226 LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLA 285
            +Q    + S FSYCL    A S   S+  G     +  ++TPLL NP+  + YYV LL 
Sbjct: 280 PSQIGRRFNSKFSYCLGDRSASSRPSSIVFGDSAISRTTRFTPLLSNPKLDTFYYVELLG 339

Query: 286 IRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
           I V G RV  I     + + T   G IIDSGT  TRL   AY A+RD F     +     
Sbjct: 340 ISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAP 399

Query: 345 SLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
               FDTC+ +     +  PT+ L F G +V LP  N LI        C A A       
Sbjct: 400 EFSLFDTCFDLSGKTEVKVPTVVLHFRGADVPLPASNYLIPVDNSGSFCFAFAGTA---- 455

Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           S L++I N+QQQ  R++YD+  SR+G A   C
Sbjct: 456 SGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 139/427 (32%), Positives = 206/427 (48%), Gaps = 39/427 (9%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESV--LEMLAKDQARLQFLSSL-------------AVA 78
           L V H   PCSP + ++P     +V   E+L +DQAR+  +                A A
Sbjct: 71  LGVVHRHGPCSPVQ-ARPRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARA 129

Query: 79  RKSVVPIASGRQIT-QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SST 134
            +  V + + R I+  +  Y+V   +GTPA+   +  DT +D +WV C  C  C      
Sbjct: 130 SEQGVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDP 189

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-CAFNLTYGS-STIAANLSQDTISL-AT 191
           +F+ + S+T+  + C A +C+++    C   + C + + YG  S    NL +DT++L A+
Sbjct: 190 LFDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSAS 249

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
           D +PG+ FGC  +  G      GL GLGR  +SL +Q    Y   F+YCLPS  + S  G
Sbjct: 250 DTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS--SSSGRG 307

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            L LG  G P        L +    S YY++L+ I+VG R + IP            GT+
Sbjct: 308 YLSLG--GAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIP----ATAFAAAGGTV 361

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS 367
           IDSGTV TRL   AY  +R  F R +       +L   DTCY          PT+ L F+
Sbjct: 362 IDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFA 421

Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
            G  V+L    +L  S   S  CLA   AP+  +S + ++ N QQ+   + YDV N R+G
Sbjct: 422 GGATVSLDFTGVLYVSKV-SQACLAF--APNADDSSIAILGNTQQKTFAVAYDVANQRIG 478

Query: 427 VARELCT 433
              + C+
Sbjct: 479 FGAKGCS 485


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 116/348 (33%), Positives = 173/348 (49%), Gaps = 14/348 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y +R  +G+P +   + +D+ +D  WV   PCT C   +  VF+ A S +F  + C 
Sbjct: 139 SGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCS 198

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           ++ C+++ N  C  G C + + YG  S     L+ +T++    +V     GC  +  G  
Sbjct: 199 SSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMF 258

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           V   GLLGLG GS+SL+ Q        FSYCL S +    +GSL  G    P    + PL
Sbjct: 259 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTDSAGSLEFGRGAMPVGAAWIPL 317

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           ++NPR  S YY+ L  + VG   V I     Q N     G ++D+GT  TR+   AY A 
Sbjct: 318 IRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAF 377

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
           RD F  + G+    + +  FDTCY+    V +  PT++  F+G  + TLP  N LI    
Sbjct: 378 RDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVDD 437

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A AA+P    S L++I N+QQ+  +I +D  N  +G    +C
Sbjct: 438 VGTFCFAFAASP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 136/416 (32%), Positives = 195/416 (46%), Gaps = 54/416 (12%)

Query: 56  ESVLEMLAKDQARLQ----FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLL 111
           E V + L +D  R Q    F   LA +  + V   + + +     Y++   IGTP  +  
Sbjct: 47  EFVRDALRRDMHRQQSRSLFGRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPPLSYP 106

Query: 112 MAMDTSNDAAW---VPCTG--CVGCSSTVFNSAQSTTFKNLGCQAA--QCKQV-----PN 159
              DT +D  W    PC+G  C    + ++N A STTF  L C ++   C  V     P 
Sbjct: 107 AIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAGKAPP 166

Query: 160 PTCGGGACAFNLTYGSSTIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQG 214
           P C   AC +N TYG+   A     +T +  +       VPG  FGC   ++ +     G
Sbjct: 167 PGC---ACMYNQTYGTGWTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNGSAG 223

Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLLKN 272
           L+GLGRGSLSL++Q   L    FSYCL  F+  + + +L LGP        ++ TP + +
Sbjct: 224 LVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVAS 280

Query: 273 PRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           P +   S+ YY+NL  I +G + + I P A         G IIDSGT  T LV  AY  V
Sbjct: 281 PAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTITSLVNAAYQQV 340

Query: 330 RDVFRRRV------GSNLTVTSLGGFDTCYSVPI------VAPTITLMFSGMNVTLPQDN 377
           R   +  V      GS+ T     G D CY++P         P++TL F G ++ LP D+
Sbjct: 341 RAAVQSLVTLPAIDGSDST-----GLDLCYALPTPTSAPPAMPSMTLHFDGADMVLPADS 395

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            +I  +   + CLAM    +  +  ++   N QQQN  ILYDV N  L  A   C+
Sbjct: 396 YMISGSG--VWCLAMR---NQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAPAKCS 446


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 140/452 (30%), Positives = 210/452 (46%), Gaps = 43/452 (9%)

Query: 16  SLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF---- 71
           S  E    +   + +S  LQV H     S    S     +E + E L +D AR+      
Sbjct: 52  SAQEWSETVQGEEKNSIVLQVVH---RDSLSSSSNTSLVKEILQERLKRDAARVDSINAR 108

Query: 72  --LSSLAVARKSVVPIASGRQITQ------------------SPTYIVRAKIGTPAQTLL 111
             L+++ V++  + P+ +G  I                    S  Y  R  +GTP +   
Sbjct: 109 VQLAAMGVSKAEMKPL-NGSSIDARFDAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTY 167

Query: 112 MAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-C 167
           M +DT +D  W+   PC  C G +  +FN A S+T++ + C    CK++    C     C
Sbjct: 168 MVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCATPLCKKLDISGCRNKRYC 227

Query: 168 AFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
            + ++YG  +    + S +T++    ++     GC     G  +   GLLGLGRGSLS  
Sbjct: 228 EYQVSYGDGSFTVGDFSTETLTFRGQVIRRVALGCGHDNEGLFIGAAGLLGLGRGSLSFP 287

Query: 227 AQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
           +QT   +   FSYCL    A   + SL  G    PK   +TPLL NP+  + YYV L+ I
Sbjct: 288 SQTGAQFSKRFSYCLVDRSASGTASSLIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGI 347

Query: 287 RV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
            V GRR+  IP    + + T   G IIDSGT  TRLV  AY+ +RD FR   G+  +   
Sbjct: 348 SVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGG 407

Query: 346 LGGFDTCYSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
              FDTCY +     +  PT+   F  G +++LP  N LI   + +  C A A       
Sbjct: 408 FSLFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGG-- 465

Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             L++I N+QQQ +R+++D   +R+G     C
Sbjct: 466 --LSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 119/370 (32%), Positives = 174/370 (47%), Gaps = 27/370 (7%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
           PI SG     +  Y     +GTP + + + +DT +D  W+   PCT C      +FN + 
Sbjct: 4   PIFSGLAF-GTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSS 62

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYT- 198
           S++FK L C ++ C  +    C    C +   YG  +     L  D + L     PG   
Sbjct: 63  SSSFKVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVV 122

Query: 199 -----FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL-SFSGS 252
                 GC     G      G+LGLGRG LS         ++ FSYCLP  ++  +   +
Sbjct: 123 LTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHKST 182

Query: 253 LRLGPIGQPK----RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV-DIPPGALQFNPTTG 307
           L  G    P      +K+ P L+NPR ++ YYV +  I VG  ++ +IP    Q +    
Sbjct: 183 LVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGN 242

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTIT 363
            GTI DSGT  TRL A AYTAVRD FR       +      FDTCY       I  PT+T
Sbjct: 243 GGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGMNSISVPTVT 302

Query: 364 LMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
             F G +++ LP  N ++  +  +I C A AA+        +VI N+QQQ+ R++YD  +
Sbjct: 303 FHFQGDVDMRLPPSNYIVPVSNNNIFCFAFAASMGP-----SVIGNVQQQSFRVIYDNVH 357

Query: 423 SRLGVARELC 432
            ++G+  + C
Sbjct: 358 KQIGLLPDQC 367


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 134/408 (32%), Positives = 192/408 (47%), Gaps = 48/408 (11%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQITQSP--------------------TYIVRA 101
           L +D  R++ L+SLA        +++GR +T+ P                     Y +R 
Sbjct: 88  LQRDSLRVESLTSLAA-------VSAGRNVTKRPPRSAGGFSGVVISGLSQGSGEYFMRL 140

Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVP 158
            +GTPA  + M +DT +D  W+ C+ C  C   S  VFN A+S TF  + C +  C+++ 
Sbjct: 141 GVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRLD 200

Query: 159 NPT-C---GGGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ 213
           + + C      AC + ++YG  +    + S +T++     V     GC     G  V   
Sbjct: 201 DSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVALGCGHDNEGLFVGAA 260

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCL----PSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           GLLGLGRG LS  +QT+N Y   FSYCL     S  +     ++  G    PK   +TPL
Sbjct: 261 GLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKTAVFTPL 320

Query: 270 LKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L NP+  + YY+ LL I V G RV  +     + + T   G IIDSGT  TRL   AY A
Sbjct: 321 LTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVA 380

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTA 384
           +RD FR          S   FDTC+ +     +  PT+   F+G  V+LP  N LI    
Sbjct: 381 LRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTGGEVSLPASNYLIPVNN 440

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A A    +    L++I N+QQQ  R+ YD+  SR+G     C
Sbjct: 441 QGRFCFAFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 136/414 (32%), Positives = 197/414 (47%), Gaps = 35/414 (8%)

Query: 48  PSKPLSWEESVL-EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTP 106
           P  P +   S+L + LA D AR  + S +    +   P+ SG    +S  Y     +GTP
Sbjct: 39  PPPPGAKRGSLLRQRLAADAAR--YASLVDATGRLHSPVFSGIPF-ESGEYFALVGVGTP 95

Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC- 162
           +   ++ +DT +D  W+ C+ C  C      VF+  +S+T++ + C + QC+ +  P C 
Sbjct: 96  STKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCD 155

Query: 163 ----GGGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKATGNSVPPQGLL 216
                GG C + + YG  S+   +L+ D ++ A D  V   T GC +   G      GLL
Sbjct: 156 SGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLL 215

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGPIGQPKRIKYTPLLKNPRR 275
           G+GRG +S+  Q    Y S F YCL    + S   S L  G   +P    +T LL NPRR
Sbjct: 216 GVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRR 275

Query: 276 SSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVF 333
            SLYYV++    V G RV      +L  +  TG  G ++DSGT  +R    AY A+RD F
Sbjct: 276 PSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAF 335

Query: 334 RRRVGSNLTVTSLGG---FDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLI----- 380
             R  +       G    FD CY +       AP I L F+ G ++ LP +N  +     
Sbjct: 336 DARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGG 395

Query: 381 -HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
               A    CL   AA D     L+VI N+QQQ  R+++DV   R+G A + CT
Sbjct: 396 RRRAASYRRCLGFEAADDG----LSVIGNVQQQGFRVVFDVEKERIGFAPKGCT 445


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 142/429 (33%), Positives = 201/429 (46%), Gaps = 34/429 (7%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD-----------QARLQFLSSLAVARKS 81
           ++QV H  S       +   S+E  + E L +D           + RL+     A + ++
Sbjct: 115 SVQVVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHEN 174

Query: 82  VVPIAS--GRQITQ-----SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST 134
           V  +A+  G ++       S  Y  R  +GTP +   M +DT +D  W+ C  C  C S 
Sbjct: 175 VAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQ 234

Query: 135 V---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA 190
           V   FN + S +F  LGC +A C  +    C GG C + ++YG  S    + + + ++  
Sbjct: 235 VDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVSYGDGSYTIGSFATEMLTFG 294

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
           T  V     GC     G  V   GLLGLG G LS  +Q        FSYCL    + S S
Sbjct: 295 TTSVRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCLVDRFSES-S 353

Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD-IPPGALQFNPTTG-A 308
           G+L  GP   P     TPLL NP   + YYV L++I VG  ++D +PP   + + T+G  
Sbjct: 354 GTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRG 413

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS---VPIV-APTITL 364
           G I+DSGT  TRL  P Y AVRD F            +  FDTCY    +P+V  PT+  
Sbjct: 414 GFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVF 473

Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            FS G ++ LP  N +I        C A A A     S L+++ N+QQQ  R+ +D  NS
Sbjct: 474 HFSNGASLILPAKNYMIPMDFMGTFCFAFAPA----TSDLSIMGNIQQQGIRVSFDTANS 529

Query: 424 RLGVARELC 432
            +G A   C
Sbjct: 530 LVGFALRQC 538


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 131/432 (30%), Positives = 213/432 (49%), Gaps = 50/432 (11%)

Query: 26  DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPI 85
           D   ++ ++ + H   PC+P   S   S E S+ E L + +AR +++ S A      +P 
Sbjct: 53  DEGSNTVSVPLVHRHGPCAPSTRS---SDEPSLSERLRRSRARSKYIMSRASKSNVSIPT 109

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQ 140
             G  +  S  Y+V   +GTPA + ++ +DT +D +WV C     T C      +F+ ++
Sbjct: 110 HLGGSV-DSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSR 168

Query: 141 STTFKNLGCQAAQCKQVPNP---------TCGGGACAFNLTYGSSTIAANL-SQDTISLA 190
           S+T+  + C    C+ +            + GG  C + +TYG  +    + S +T+++A
Sbjct: 169 SSTYAPIPCNTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMA 228

Query: 191 TDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
             + V  + FGC     G +    GLLGLG    SL+ QT ++Y   FSYCLP+  A   
Sbjct: 229 PGVTVKDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPA--ANDQ 286

Query: 250 SGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           +G L LG P+       +TP+++   + + Y VN+  I VG   +D+PP A         
Sbjct: 287 AGFLALGAPVNDASGFVFTPMVR--EQQTFYVVNMTGITVGGEPIDVPPSAFS------G 338

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITL 364
           G IIDSGTV T L   AY A++  FR+ + +   + + G  DTCY+      +  P + L
Sbjct: 339 GMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPN-GELDTCYNFTGHSNVTVPRVAL 397

Query: 365 MFSG---MNVTLPQDNLLIHSTAGSITCLAM-AAAPDNVNSVLNVIANMQQQNHRILYDV 420
            FSG   +++ +P D +L+ +      CLA   A PDN   +L    N+ Q+   +LYDV
Sbjct: 398 TFSGGATVDLDVP-DGILLDN------CLAFQEAGPDNQPGIL---GNVNQRTLEVLYDV 447

Query: 421 PNSRLGVARELC 432
            + R+G   + C
Sbjct: 448 GHGRVGFGADAC 459


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 133/433 (30%), Positives = 198/433 (45%), Gaps = 46/433 (10%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGR 89
           V H   PCSP +       +  +LE    DQAR+  +  +     +VV      P   G 
Sbjct: 22  VMHRHGPCSPLQTPDDAPSDADLLE---HDQARVDSIHRMIANETAVVGQDVSLPAERGI 78

Query: 90  QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCT--GCVGCSSTVFNSAQSTTF 144
            +     Y+V   +GTPA+ L +  DT +D +WV   PC+  GC      +F  + S+TF
Sbjct: 79  SVGTG-NYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTF 137

Query: 145 KNLGCQAAQC---KQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT--------- 191
             + C   +C   +Q  + + G   C + + YG  S    +L  DT++L T         
Sbjct: 138 SAVRCGEPECPRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASEN 197

Query: 192 --DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
             + +PG+ FGC +  TG      GL GLGRG +SL +Q    Y   FSYCLPS  + + 
Sbjct: 198 NSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPS-SSSNA 256

Query: 250 SGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
            G L LG P   P   ++TP+L      S YYV L+ IRV  R + +      +     A
Sbjct: 257 HGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWP----A 312

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVPIVA------P 360
           G I+DSGTV TRL   AY+A+R  F   +G         L   DTCY     A      P
Sbjct: 313 GLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIP 372

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            + L+F+G        + +++    +  CLA   AP+       ++ N QQ+   ++YDV
Sbjct: 373 AVALVFAGGATISVDFSGVLYVAKVAQACLAF--APNGNGRSAGILGNTQQRTVAVVYDV 430

Query: 421 PNSRLGVARELCT 433
              ++G A + C+
Sbjct: 431 GRQKIGFAAKGCS 443


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 136/435 (31%), Positives = 203/435 (46%), Gaps = 63/435 (14%)

Query: 31  SSTLQVFHVFSPCSPFKP--SKPLSWEESVLEMLAKDQARLQFL----SSLAVARKSVVP 84
           ++ + + H   PCSP     SKP S +E    +LA DQ R + +    S+ A +R    P
Sbjct: 88  TTRMTIVHRHGPCSPLAAAHSKPPSHDE----ILAADQNRAESIQHRVSTTATSRGQ--P 141

Query: 85  IASGRQ------------------ITQSP-------TYIVRAKIGTPAQTLLMAMDTSND 119
             S RQ                  +  SP        Y+V   +GTPA    +  DT +D
Sbjct: 142 KRSRRQQPSSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYTVVFDTGSD 201

Query: 120 AAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS 175
             WV C  CV         +F+ A+S+T+ N+ C A  C  +    C GG C + + YG 
Sbjct: 202 TTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPACSDLDTRGCSGGHCLYGVQYGD 261

Query: 176 STIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
            + +    + DT++L++ D V G+ FGC ++  G      GLLGLGRG  SL  QT + Y
Sbjct: 262 GSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKY 321

Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
              F++CLP+    + +G L  G      R+  TP+L +    + YYV L  IRVG R++
Sbjct: 322 GGVFAHCLPARS--TGTGYLDFGAGSPAARLTTTPMLVD-NGPTFYYVGLTGIRVGGRLL 378

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF----RRRVGSNLTVTSLGGF 349
            IP           AGTI+DSGTV TRL   AY+++R  F      R        SL   
Sbjct: 379 YIPQSVFAT-----AGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSL--L 431

Query: 350 DTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           DTCY       +  PT++L+F G        + ++++ + S  CLA AA  D  +  + +
Sbjct: 432 DTCYDFAGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGD--VGI 489

Query: 406 IANMQQQNHRILYDV 420
           + N Q +   + YD+
Sbjct: 490 VGNTQLKTFGVAYDI 504


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 123/349 (35%), Positives = 168/349 (48%), Gaps = 14/349 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
           S  Y  R  +GTP + + M +DT +D  W+ C  C  C S    VFN  +S +F  + C+
Sbjct: 39  SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCR 98

Query: 151 AAQCKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
              C+++ +P C     C + ++YG  S        +T++     V     GC     G 
Sbjct: 99  TPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGL 158

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
            V   GLLGLGRG LS  +Q    +   FSYCL    A S   S+  G     +  ++TP
Sbjct: 159 FVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTP 218

Query: 269 LLKNPRRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
           LL NPR  + YYV LL I VG   V  I     + + T   G IID GT  TRL  PAY 
Sbjct: 219 LLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYI 278

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHST 383
           A+RD FR    S  +      FDTCY +     +  PT+ L F G +V+LP  N LI   
Sbjct: 279 ALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVD 338

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                C A A       S L++I N+QQQ  R++YD+ +SR+G +   C
Sbjct: 339 GSGRFCFAFAG----TTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 132/457 (28%), Positives = 218/457 (47%), Gaps = 59/457 (12%)

Query: 22  NPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK- 80
           NP    +   ++L+V +   PC+    ++  +   ++ E+LA DQAR+  + +    +  
Sbjct: 60  NPATKGKRRGASLEVVNRQGPCTLL--NQKGAKAPTLTEILAHDQARVDSIQARITDQSY 117

Query: 81  ---------------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
                                + +P  SG  +  +  YIV   +GTP + L +  DT +D
Sbjct: 118 DLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLG-TGNYIVNVGLGTPKKDLSLIFDTGSD 176

Query: 120 AAWVPCTGCV-GCSST---VFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFN 170
             W  C  CV  C +    +F+ + S T+ N+ C +A C  + +     P C    C + 
Sbjct: 177 LTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYG 236

Query: 171 LTYGSSTIAANL-SQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
           + YG S+      ++D ++L   D+  G+ FGC Q   G      GL+GLGR  LS++ Q
Sbjct: 237 IQYGDSSFTIGFFAKDKLTLTQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQ 296

Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIK----YTPLLKNPRRSSLYYV 281
           T   +   FSYCLP+ +    +G L  G    +   K +K    +TP   + + ++ Y++
Sbjct: 297 TAQKFGKYFSYCLPTSRGS--NGHLTFGNGNGVKASKAVKNGITFTP-FASSQGTAYYFI 353

Query: 282 NLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL 341
           ++L I VG + + I P   Q      AGTIIDSGTV TRL + AY +++  F++ +    
Sbjct: 354 DVLGISVGGKALSISPMLFQ-----NAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYP 408

Query: 342 TVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAP 396
           T  +L   DTCY +     I  P I+  F+G  NV L  + +LI + A  + CLA A   
Sbjct: 409 TAPALSLLDTCYDLSNYTSISIPKISFNFNGNANVELDPNGILITNGASQV-CLAFAGNG 467

Query: 397 DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           D  +  + +  N+QQQ   ++YDV   +LG   + C+
Sbjct: 468 D--DDSIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 135/419 (32%), Positives = 205/419 (48%), Gaps = 42/419 (10%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEES-VLEMLAKDQARLQFL-----------SSLAVAR 79
           ++L+V H   PCS        +  ++   E+L +D+ R++++           SS++   
Sbjct: 69  ASLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELD 128

Query: 80  KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT----GCVGCSSTV 135
              +P  SG  I  S  Y V   +GTP + L +  DT +D  W  C      C      +
Sbjct: 129 SVTLPAKSGSLI-GSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAI 187

Query: 136 FNSAQSTTFKNLGCQAAQCKQVP-----NPTCGGG--ACAFNLTYGSSTIAAN-LSQDTI 187
           F+ ++ST++ N+ C +  C Q+       P C     AC + + YG S+ +    S++ +
Sbjct: 188 FDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERL 247

Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           S+ ATDIV  + FGC Q   G      GL+GLGR  +S + QT  +Y+  FSYCLP+   
Sbjct: 248 SVTATDIVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCLPA--T 305

Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
            S +G L  G       +KYTP     R SS Y +++  I VG     +P  +  F  +T
Sbjct: 306 SSSTGRLSFGTT-TTSYVKYTPFSTISRGSSFYGLDITGISVGG--AKLPVSSSTF--ST 360

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTI 362
           G G IIDSGTV TRL   AYTA+R  FR+ +    +   L   DTCY +        P I
Sbjct: 361 G-GAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKI 419

Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
              F+ G+ V LP   +L  ++A  + CLA AA  D  +S + +  N+QQ+   ++YDV
Sbjct: 420 DFSFAGGVTVQLPPQGILYVASAKQV-CLAFAANGD--DSDVTIYGNVQQKTIEVVYDV 475


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 126/378 (33%), Positives = 176/378 (46%), Gaps = 33/378 (8%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQ 140
           P+ SG     S  Y     +G P    L+ +DT +D  W+ C  C  C   V   ++   
Sbjct: 76  PVMSGVPF-DSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRS 134

Query: 141 STTFKNLGCQAAQCKQVPN-PTCGG--GACAFNLTYGS-STIAANLSQDTISLATDI-VP 195
           S+T + + C + +C+ V   P C    G C + + YG  S  + +L+ D +    D  V 
Sbjct: 135 SSTHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVH 194

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS--L 253
             T GC     G      GLLG+GRG LS   Q    Y   FSYCL    + + +GS  L
Sbjct: 195 NVTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYL 254

Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTI 311
             G   +P    +TPL  NPRR SLYYV+++   V G RV      +L  NP TG  G +
Sbjct: 255 VFGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIV 314

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG----FDTCYSV--------PIVA 359
           +DSGT  +R    AY AVRD F     +  T+  L      FD CY +         +  
Sbjct: 315 VDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRV 374

Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAG---SITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
           P+I L F+ G ++ LPQ N LI    G   +  CL + AA D     LNV+ N+QQQ   
Sbjct: 375 PSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDG----LNVLGNVQQQGFG 430

Query: 416 ILYDVPNSRLGVARELCT 433
           +++DV   R+G     C+
Sbjct: 431 LVFDVERGRIGFTPNGCS 448


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 137/426 (32%), Positives = 203/426 (47%), Gaps = 37/426 (8%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEE-SVLEMLAKDQARLQFLSSL-------------AVAR 79
           L V H   PCSP +  +       +  E+L +DQAR+  +                A A 
Sbjct: 71  LGVVHRHGPCSPVQARRRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARAS 130

Query: 80  KSVVPIASGRQIT-QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTV 135
           +  V + + R I+  +  Y+V   +GTPA+   +  DT +D +WV C  C  C      +
Sbjct: 131 EQGVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPL 190

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-CAFNLTYGS-STIAANLSQDTISL-ATD 192
           F+ + S+T+  + C A +C+++    C   + C + + YG  S    NL +DT++L A+D
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250

Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
            +PG+ FGC  +  G      GL GLGR  +SL +Q    Y   F+YCLPS  + S  G 
Sbjct: 251 TLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS--SSSGRGY 308

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           L LG  G P        L +    S YY++L+ I+VG R + IP            GT+I
Sbjct: 309 LSLG--GAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIP----ATAFAAAGGTVI 362

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS- 367
           DSGTV TRL   AY  +R  F R +       +L   DTCY          PT+ L F+ 
Sbjct: 363 DSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAG 422

Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           G  V+L    +L  S   S  CLA   AP+  +S + ++ N QQ+   + YDV N R+G 
Sbjct: 423 GATVSLDFTGVLYVSKV-SQACLAF--APNADDSSIAILGNTQQKTFAVTYDVANQRIGF 479

Query: 428 ARELCT 433
             + C+
Sbjct: 480 GAKGCS 485


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 133/403 (33%), Positives = 195/403 (48%), Gaps = 33/403 (8%)

Query: 55  EESVL-EMLAKDQARLQFLSSLA-VARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
           EE +L   L +  AR+  L SLA +A    +  A    +     Y++   IGTP +    
Sbjct: 46  EEQLLSRALRRSSARVATLQSLAALAPGDAITAARILVLASDGEYLMEMGIGTPTRYYSA 105

Query: 113 AMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAF 169
            +DT +D  W    PC  CV   +  F+ A+S T+++LGC +  C  +  P C    C +
Sbjct: 106 ILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVCVY 165

Query: 170 NLTYG-SSTIAANLSQDTISLATDI----VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
              YG S++ A  L+ +T +  T+     +PG +FGC     G+     G++G GRGSLS
Sbjct: 166 QYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGSLANGSGMVGFGRGSLS 225

Query: 225 LLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKR-IKYTPLLKNPRRSSL 278
           L++Q   L    FSYCL SF +     L F     L         ++ TP + NP   ++
Sbjct: 226 LVSQ---LGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTM 282

Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
           Y++N+  I VG  ++ I P     N T G  GTIIDSGT  T L  PAY AVR  F  ++
Sbjct: 283 YFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQI 342

Query: 338 G-SNLTVTSLGGFDTCYSVP------IVAPTITLMFSGMNVTLP-QDNLLIHSTAGSITC 389
               L VT     DTC+  P      +  P + L F G +  LP Q+ +L+  + G   C
Sbjct: 343 TLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLC 402

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LAMA++     S  ++I + Q QN  +LYD+ NS +      C
Sbjct: 403 LAMASS-----SDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 169/373 (45%), Gaps = 34/373 (9%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
           P+ASG        Y+    +GTPA+   +  DT +D  W+   PC  C      +F+   
Sbjct: 32  PVASG-----GGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEG 86

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATD-----IV 194
           S+++  + C    C  +P  +C    C ++  YG  S     LS +T++L +        
Sbjct: 87  SSSYTTMSCGDTLCDSLPRKSCSP-DCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAA 145

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
               FGC     G+     GL+GLGRG+LS ++Q  +L+   FSYCL P   A S +  +
Sbjct: 146 KNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205

Query: 254 RLGPI------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
             G        G+     +TP++ NP   S YYV L  I +  R + IP G+    P   
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-------PIVAP 360
            G I DSGT  T L    Y  V    R ++       S  G D CY V        +  P
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASYKMKIP 325

Query: 361 TITLMFSGMNVTLPQDNLLIHST-AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
            +   F G +  LP +N  I +  AG+I CLAM ++    N  + +  NM QQN R++YD
Sbjct: 326 AMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSS----NMDIGIYGNMMQQNFRVMYD 381

Query: 420 VPNSRLGVARELC 432
           + +S++G A   C
Sbjct: 382 IGSSKIGWAPSQC 394


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 126/401 (31%), Positives = 193/401 (48%), Gaps = 36/401 (8%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQ 108
           K L+  E +   + +   RLQ L ++      V  P+ +G        Y++   IGTPAQ
Sbjct: 52  KNLTKFELLERAVERGSRRLQRLEAMLNGPSGVETPVYAGDG-----EYLMNLSIGTPAQ 106

Query: 109 TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
                MDT +D  W    PCT C   S+ +FN   S++F  L C +  C+ + +PTC   
Sbjct: 107 PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSNN 166

Query: 166 ACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPPQGLLGLGRGSL 223
           +C +   YG  S    ++  +T++  +  +P  TFGC +   G       GL+G+GRG L
Sbjct: 167 SCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPL 226

Query: 224 SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI------GQPKRIKYTPLLKNPRRSS 277
           SL +Q   L  + FSYC+    + S S +L LG +      G P     T L+++ +  +
Sbjct: 227 SLPSQ---LDVTKFSYCMTPIGS-SNSSTLLLGSLANSVTAGSPN----TTLIQSSQIPT 278

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRR 336
            YY+ L  + VG   + I P   + N   G  G IIDSGT  T  V  AY AVR  F  +
Sbjct: 279 FYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQ 338

Query: 337 VGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLA 391
           +  ++   S  GFD C+ +P     +  PT  + F G ++ LP +N  I  + G I CLA
Sbjct: 339 MNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENYFISPSNGLI-CLA 397

Query: 392 MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           M ++       +++  N+QQQN  ++YD  NS +      C
Sbjct: 398 MGSSSQG----MSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 135/420 (32%), Positives = 206/420 (49%), Gaps = 37/420 (8%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---------LAVARKSV 82
           S+L+V H+   CS       +  +E    ++ +DQAR++ + S         ++ A+ + 
Sbjct: 63  SSLRVVHMHGACSHLSSDARVDHDE----IIRRDQARVESIYSKLSKNSANEVSEAKSTE 118

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-CSSTV---FNS 138
           +P  SG  +  S  YIV   IGTP   L +  DT +D  W  C  C+G C S     FN 
Sbjct: 119 LPAKSGITL-GSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 177

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPG 196
           + S+T++N+ C +  C+     +C    C +++ YG  +     L+++  +L  +D++  
Sbjct: 178 SSSSTYQNVSCSSPMCEDAE--SCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLED 235

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
             FGC +   G      GLLGLG G LSL AQT   Y + FSYCLPSF + S +G L  G
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNS-TGHLTFG 294

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             G  + +K+TP+   P   + Y ++++ I VG + + I P     N  +  G IIDSGT
Sbjct: 295 SAGISESVKFTPISSFPSAFN-YGIDIIGISVGDKELAITP-----NSFSTEGAIIDSGT 348

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVT 372
           VFTRL    Y  +R VF+ ++ S  + +  G FDTCY       +  PTI   F+G  V 
Sbjct: 349 VFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVV 408

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               + +      S  CLA A   D    +  +  N+QQ    ++YDV   R+G A   C
Sbjct: 409 ELDGSGISLPIKISQVCLAFAGNDD----LPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 136/465 (29%), Positives = 218/465 (46%), Gaps = 73/465 (15%)

Query: 25  CDT---QDHSST-----LQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL-- 72
           CDT    +H ++     + + H   PCSP   +  KP S +E    +LA DQ R++ +  
Sbjct: 73  CDTPREHEHGASSSGTRMTIVHRHGPCSPLADAHGKPPSHDE----ILAADQNRVESIHH 128

Query: 73  --SSLAVARKS----------------------------VVPIASGRQITQSPTYIVRAK 102
             S+ A  R                               +P +SGR +     Y+V   
Sbjct: 129 RVSTTATVRGKPKRRPSPSRRQQQPSAPAPAASLSSSTASLPASSGRALGTG-NYVVTIG 187

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCV----GCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
           +GTPA    +  DT +D  WV C  CV         +F+ A+S+T+ N+ C A  C  + 
Sbjct: 188 LGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAPACSDLY 247

Query: 159 NPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLL 216
              C GG C +++ YG  + +    + DT++L++ D V G+ FGC ++  G      GLL
Sbjct: 248 TRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLL 307

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI---KYTPLLKNP 273
           GLGRG  SL  QT + Y   F++CLP+    S +G L  GP G P  +   + TP+L + 
Sbjct: 308 GLGRGKTSLPVQTYDKYGGVFAHCLPARS--SGTGYLDFGP-GSPAAVGARQTTPMLTD- 363

Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
              + YYV +  IRVG +++ IP         + AGTI+DSGTV TRL   AY+++R  F
Sbjct: 364 NGPTFYYVGMTGIRVGGQLLSIPQSVF-----STAGTIVDSGTVITRLPPAAYSSLRSAF 418

Query: 334 RRRVGSN--LTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
              + +       +L   DTCY       +  P ++L+F G        + ++++ + S 
Sbjct: 419 ASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASLSQ 478

Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            CL  AA  D+ +  + ++ N Q +   ++YD+    +G +   C
Sbjct: 479 VCLGFAANEDDDD--VGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 136/430 (31%), Positives = 210/430 (48%), Gaps = 48/430 (11%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS-------- 81
           + STL + H   PCSP    +  S EE+    L +DQ R  ++ +   +R +        
Sbjct: 56  NGSTLALSHRHGPCSPVISKEKPSHEET----LRRDQLRAAYIQAKVSSRYNNVAKELQQ 111

Query: 82  ---VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--CSS--- 133
               +P +SG  +  +  Y++   IGTPA T +M++DT +D +WV C  C    CSS   
Sbjct: 112 SAVTIPTSSGYSLGTTE-YVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKD 170

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGACAFNLTYGS-STIAANLSQDTISL- 189
            +F+ A S T+    C +AQC Q+ +    C    C + + YG  S  A     DT+SL 
Sbjct: 171 KLFDPAMSATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLT 230

Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
           ++D V  + FGC  +A G      GL+GLG  + SL++QT   Y   FSYCLP   + S 
Sbjct: 231 SSDAVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPP-PSSSG 289

Query: 250 SGSLRLGPIG--QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
            G L LG  G     R  +TP+++     + Y V L  I V   ++++P         +G
Sbjct: 290 GGFLTLGAAGGASSSRYSHTPMVRF-SVPTFYGVFLQGITVAGTMLNVPASVF-----SG 343

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTIT 363
           A +++DSGTV T+L   AY A+R  F++ + +  +   +G  DTC+       I  PT+T
Sbjct: 344 A-SVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVT 402

Query: 364 LMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
           L FS G  + L    +L    AG   CLA  A   + ++   ++ N+QQ+   +L+DV  
Sbjct: 403 LTFSRGAAMDLDISGILY---AG---CLAFTATAHDGDT--GILGNVQQRTFEMLFDVGG 454

Query: 423 SRLGVARELC 432
             +G     C
Sbjct: 455 RTIGFRSGAC 464


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 121/379 (31%), Positives = 178/379 (46%), Gaps = 32/379 (8%)

Query: 72  LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
           +SS AVA    VP+ +G        +++   IGTPA      +DT +D  W  C  CV C
Sbjct: 82  MSSKAVAPALQVPVHAGNG-----EFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVEC 136

Query: 132 ---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTI 187
              S+ VF+ + S+T+  L C +  C  +P+  C    C +  TYG SS+    L+ +T 
Sbjct: 137 FNQSTPVFDPSSSSTYAALPCSSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETF 196

Query: 188 SLATDIVPGYTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           +LA   +P   FGC     G+      GL+GLGRG LSL++Q   L  + FSYCL S   
Sbjct: 197 TLAKTKLPDVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGLNKFSYCLTSLDD 253

Query: 247 LSFSGSLRLGPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
            S S  L LG +            ++ TPL++NP + S YYVNL  + VG   + +P  A
Sbjct: 254 TSKS-PLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSA 312

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP--- 356
                    G I+DSGT  T L    Y A++  F  ++       S  G DTC+  P   
Sbjct: 313 FAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASG 372

Query: 357 ---IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
              +  P +     G ++ LP +N ++  +     CL +  +       L++I N QQQN
Sbjct: 373 VDQVEVPKLVFHLDGADLDLPAENYMVLDSGSGALCLTVMGSRG-----LSIIGNFQQQN 427

Query: 414 HRILYDVPNSRLGVARELC 432
            + +YDV  + L  A   C
Sbjct: 428 IQFVYDVGENTLSFAPVQC 446


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 145/442 (32%), Positives = 199/442 (45%), Gaps = 47/442 (10%)

Query: 26  DTQDHSSTLQ--VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-SSLAVARKSV 82
           D     +TL   V H  +   P + + P S+        A   A+L+ L S+ A A    
Sbjct: 22  DATQRPTTLHIPVVHRDAVFPPRRGAPPGSFR---CRHAAPHTAQLESLHSATAAADLLR 78

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSA 139
            P+ SG     S  Y     +G P    L+ +DT +D  W+ C  C  C   V   ++  
Sbjct: 79  SPVMSGVPF-DSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPR 137

Query: 140 QSTTFKNLGCQAAQCKQVPN-PTCGG--GACAFNLTYGS-STIAANLSQDTISLATDI-V 194
            S T + + C + QC+ V   P C    G C + + YG  S  + +L+ DT+ L  D  V
Sbjct: 138 NSKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTRV 197

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF--KALSFSGS 252
              T GC     G      GLLG GRG LS   Q    Y   FSYCL     +A + S  
Sbjct: 198 HNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSY 257

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGT 310
           L  G   +     +TPL  NPRR SLYYV+++   V G RV      +L  NP TG  G 
Sbjct: 258 LVFGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGV 317

Query: 311 IIDSGTVFTRLVAPAYTAVRDVF--------RRRVGSNLTVTSLGGFDTCYSVP------ 356
           ++DSGT  +R    AY AVRD F         RR+ +  +V     FDTCY V       
Sbjct: 318 VVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSV-----FDTCYDVHGNGPGT 372

Query: 357 -IVAPTITLMF-SGMNVTLPQDNLLIHSTAG---SITCLAMAAAPDNVNSVLNVIANMQQ 411
            +  P+I L F +  ++ LPQ N LI    G   +  CL + AA D     LNV+ N+QQ
Sbjct: 373 GVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDG----LNVLGNVQQ 428

Query: 412 QNHRILYDVPNSRLGVARELCT 433
           Q   +++DV   R+G     C+
Sbjct: 429 QGFGVVFDVERGRIGFTPNGCS 450


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 115/389 (29%), Positives = 187/389 (48%), Gaps = 26/389 (6%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
           E +   + + + RLQ LS+   + +S V  P+ +G        ++++  IGTPA+T    
Sbjct: 59  ERLQRAMKRGKLRLQRLSAKTASFESSVEAPVHAGNG-----EFLMKLAIGTPAETYSAI 113

Query: 114 MDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
           MDT +D  W    PC  C    + +F+  +S++F  L C +  C  +P  +C  G C + 
Sbjct: 114 MDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSDG-CEYL 172

Query: 171 LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQ 228
            +YG  S+    L+ +T +     V    FGC +   G+      GL+GLGRG LSL++Q
Sbjct: 173 YSYGDYSSTQGVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQ 232

Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
              L +  FSYCL S        SL +G     K    TPL++NP + S YY++L  I V
Sbjct: 233 ---LGEPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGISV 289

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
           G  ++ I             G IIDSGT  T L   A+ A++  F  ++  ++  +   G
Sbjct: 290 GDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTG 349

Query: 349 FDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            D C+++P     +  P +   F G ++ LP +N +I  +   + CL M ++     S +
Sbjct: 350 LDLCFTLPPDASTVDVPQLVFHFEGADLKLPAENYIIADSGLGVICLTMGSS-----SGM 404

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
           ++  N QQQN  +L+D+    +  A   C
Sbjct: 405 SIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 137/451 (30%), Positives = 208/451 (46%), Gaps = 70/451 (15%)

Query: 34  LQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS------ 81
           + + H   PCSP   +  KP S E+    +LA DQ R + +    S+ A AR +      
Sbjct: 86  MTIVHRHGPCSPLAAAHGKPPSHED----ILAADQNRAESIQHRVSTTATARGNPKRSRR 141

Query: 82  -----------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
                                   +P +SGR +     Y+V   +GTPA    +  DT +
Sbjct: 142 APSRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPASRYTVVFDTGS 200

Query: 119 DAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG 174
           D  WV C  CV         +F+ A+S+T+ N+ C A  C  +    C GG C + + YG
Sbjct: 201 DTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAPACFDLDTRGCSGGHCLYGVQYG 260

Query: 175 SSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
             + +    + DT++L++ D V G+ FGC ++  G      GLLGLGRG  SL  QT + 
Sbjct: 261 DGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDK 320

Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY---TPLLKNPRRSSLYYVNLLAIRVG 289
           Y   F++CLP+    S +G L  GP G P        TP+L +    + YYV +  IRVG
Sbjct: 321 YGGVFAHCLPARS--SGTGYLDFGP-GSPAAAGARLTTPMLTD-NGPTFYYVGMTGIRVG 376

Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF----RRRVGSNLTVTS 345
            +++ IP           AGTI+DSGTV TRL  PAY+++R  F      R        S
Sbjct: 377 GQLLSIPQSVFAT-----AGTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVS 431

Query: 346 LGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
           L   DTCY       +  PT++L+F G  +     + ++++ + S  CL  AA  D  + 
Sbjct: 432 L--LDTCYDFTGMSQVAIPTVSLLFQGGAILDVDASGIMYAASVSQVCLGFAANEDGGD- 488

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            + ++ N Q +   + YD+    +G +   C
Sbjct: 489 -VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 135/420 (32%), Positives = 206/420 (49%), Gaps = 37/420 (8%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---------LAVARKSV 82
           S+L+V H+   CS       +  +E    ++ +DQAR++ + S         ++ A+ + 
Sbjct: 63  SSLRVVHMHGACSHLSSDARVDHDE----IIRRDQARVESIYSKLSKNSANEVSEAKSTE 118

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-CSSTV---FNS 138
           +P  SG  +  S  YIV   IGTP   L +  DT +D  W  C  C+G C S     FN 
Sbjct: 119 LPAKSGITL-GSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 177

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPG 196
           + S+T++N+ C +  C+     +C    C +++ YG  +     L+++  +L  +D++  
Sbjct: 178 SSSSTYQNVSCSSPMCEDAE--SCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSDVLED 235

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
             FGC +   G      GLLGLG G LSL AQT   Y + FSYCLPSF + S +G L  G
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNS-TGHLTFG 294

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             G  + +K+TP+   P   + Y ++++ I VG + + I P     N  +  G IIDSGT
Sbjct: 295 SAGISESVKFTPISSFPSAFN-YGIDIIGISVGDKELAITP-----NSFSTEGAIIDSGT 348

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVT 372
           VFTRL    Y  +R VF+ ++ S  + +  G FDTCY       +  PTI   F+G  V 
Sbjct: 349 VFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVV 408

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               + +      S  CLA A   D    +  +  N+QQ    ++YDV   R+G A   C
Sbjct: 409 ELDGSGISLPIKISQVCLAFAGNDD----LPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 131/467 (28%), Positives = 217/467 (46%), Gaps = 71/467 (15%)

Query: 18  SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
           S   N     +   ++L+V +   PC+    ++  +   ++ E+LA DQAR+  + +   
Sbjct: 56  SSSCNTATKGKRRGASLEVVNRQGPCTQL--NQKGAKAPTLTEILAHDQARVDSIQARVT 113

Query: 78  AR----------------------------KSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
            +                            +S +P+ +G        YIV   +GTP + 
Sbjct: 114 DQSYDLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGN-------YIVNVGLGTPKKD 166

Query: 110 LLMAMDTSNDAAWVPCTGCV-GCSST---VFNSAQSTTFKNLGCQAAQCKQVPN-----P 160
           L +  DT +D  W  C  CV  C +    +F+ + S T+ N+ C +  C  + +     P
Sbjct: 167 LSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTACSGLKSATGNSP 226

Query: 161 TCGGGACAFNLTYGSSTIAANL-SQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGL 218
            C    C + + YG S+      ++DT++L   D+  G+ FGC Q   G      GL+GL
Sbjct: 227 GCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDVFDGFMFGCGQNNRGLFGKTAGLIGL 286

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIK----YTPLLK 271
           GR  LS++ QT   +   FSYCLP+ +    +G L  G    +   K +K    +TP   
Sbjct: 287 GRDPLSIVQQTAQKFGKYFSYCLPTSRGS--NGHLTFGNGNGVKTSKAVKNGITFTP-FA 343

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           + + ++ Y++++L I VG + + I P   Q      AGTIIDSGTV TRL +  Y +++ 
Sbjct: 344 SSQGATFYFIDVLGISVGGKALSISPMLFQ-----NAGTIIDSGTVITRLPSTVYGSLKS 398

Query: 332 VFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDNLLIHSTAGS 386
            F++ +    T  +L   DTCY +     I  P I+  F+G  NV L  + +LI + A  
Sbjct: 399 TFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNFNGNANVDLEPNGILITNGASQ 458

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           + CLA A   D  +  + +  N+QQQ   ++YDV   +LG   + C+
Sbjct: 459 V-CLAFAGNGD--DDTIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 135/414 (32%), Positives = 195/414 (47%), Gaps = 35/414 (8%)

Query: 48  PSKPLSWEESVL-EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTP 106
           P  P +   S+L + LA D AR  + S +    +   P+ SG    +S  Y     +GTP
Sbjct: 39  PPPPGAKRGSLLRQRLAADAAR--YASLVDATGRLHSPVFSGIPF-ESGEYFALVGVGTP 95

Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC- 162
           +   ++ +DT +D  W+ C+ C  C      VF+  +S+T++ + C + QC+ +  P C 
Sbjct: 96  STKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCD 155

Query: 163 ----GGGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKATGNSVPPQGLL 216
                GG C + + YG  S+    L+ D ++ A D  V   T GC +   G      GLL
Sbjct: 156 SGGAAGGGCRYMVAYGDGSSSTGELATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLL 215

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGPIGQPKRIKYTPLLKNPRR 275
           G+ RG +S+  Q    Y S F YCL    + S   S L  G   +P    +T LL NPRR
Sbjct: 216 GVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRR 275

Query: 276 SSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVF 333
            SLYYV++    V G RV      +L  +  TG  G ++DSGT  +R    AY A+RD F
Sbjct: 276 PSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAF 335

Query: 334 RRRVGSNLTVTSLGG---FDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLI----- 380
             R  +       G    FD CY +       AP I L F+ G ++ LP +N  +     
Sbjct: 336 DARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGG 395

Query: 381 -HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
               A    CL   AA D     L+VI N+QQQ  R+++DV   R+G A + CT
Sbjct: 396 RRRAASYRRCLGFEAADDG----LSVIGNVQQQGFRVVFDVEKERIGFAPKGCT 445


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 115/373 (30%), Positives = 168/373 (45%), Gaps = 34/373 (9%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
           P+ASG        Y+    +GTPA+   +  DT +D  W+   PC  C      +F+   
Sbjct: 32  PVASG-----GGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEG 86

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATD-----IV 194
           S+++  + C    C  +P  +C    C ++  YG  S     LS +T++L +        
Sbjct: 87  SSSYTTMSCGDTLCDSLPRKSCSPN-CDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAA 145

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
               FGC     G+     GL+GLGRG+LS ++Q  +L+   FSYCL P   A S +  +
Sbjct: 146 KNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205

Query: 254 RLGPI------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
             G        G+     +TP++ NP   S YYV L  I +  R + IP G+    P   
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-------PIVAP 360
            G I DSGT  T L    Y  V    R +V       S  G D CY V           P
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASYKKKIP 325

Query: 361 TITLMFSGMNVTLPQDNLLIHST-AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
            +   F G +  LP +N  I +  AG+I CLAM ++    N  + +  NM QQN R++YD
Sbjct: 326 AMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSS----NMDIGIYGNMMQQNFRVMYD 381

Query: 420 VPNSRLGVARELC 432
           + +S++G A   C
Sbjct: 382 IGSSKIGWAPSQC 394


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 133/403 (33%), Positives = 194/403 (48%), Gaps = 33/403 (8%)

Query: 55  EESVL-EMLAKDQARLQFLSSLA-VARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
           EE +L   L +  AR+  L SLA +A    +  A    +     Y++   IGTP +    
Sbjct: 46  EEQLLSRALRRSSARVATLQSLAALAPGDAITAARILVLASDGEYLMEMGIGTPTRYYSA 105

Query: 113 AMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAF 169
            +DT +D  W    PC  CV   +  F+ A+S T+++LGC +  C  +  P C    C +
Sbjct: 106 ILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVCVY 165

Query: 170 NLTYG-SSTIAANLSQDTISLATDI----VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
              YG S++ A  L+ +T +  T+     +PG +FGC     G      G++G GRGSLS
Sbjct: 166 QYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGLLANGSGMVGFGRGSLS 225

Query: 225 LLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKR-IKYTPLLKNPRRSSL 278
           L++Q   L    FSYCL SF +     L F     L         ++ TP + NP   ++
Sbjct: 226 LVSQ---LGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTM 282

Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
           Y++N+  I VG  ++ I P     N T G  GTIIDSGT  T L  PAY AVR  F  ++
Sbjct: 283 YFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQI 342

Query: 338 G-SNLTVTSLGGFDTCYSVP------IVAPTITLMFSGMNVTLP-QDNLLIHSTAGSITC 389
               L VT     DTC+  P      +  P + L F G +  LP Q+ +L+  + G   C
Sbjct: 343 TLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLC 402

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LAMA++     S  ++I + Q QN  +LYD+ NS +      C
Sbjct: 403 LAMASS-----SDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 131/430 (30%), Positives = 205/430 (47%), Gaps = 47/430 (10%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS---SLAVARKSVVPIASG 88
           +T+ + H   PCSP   SK    EE   E+L +DQ R + +    ++  A      +   
Sbjct: 52  TTVALNHRHGPCSPVPSSKKRPTEE---ELLKRDQLRAEHIQRKFAMNAAVDGAGDLQQS 108

Query: 89  RQITQSPT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-----VGCSS 133
           +  +  PT          Y++   +GTPA T  + +DT +D +WV C  C        + 
Sbjct: 109 KVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTG 168

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYGS-STIAANLSQDTIS 188
            +F+ A+S+T++ + C AA+C Q+     G GA    C + + YG  ST     S+DT++
Sbjct: 169 ALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLT 228

Query: 189 L--ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           L  A+D V G+ FGC    +G S    GL+GLG G+ SL++QT   Y ++FSYCLP    
Sbjct: 229 LSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSG 288

Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
            S   +L     G       T +L++ +  + Y   L  I VG + + + P         
Sbjct: 289 SSGFLTLGG--GGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVF------ 340

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTI 362
            AG+++DSGT+ TRL   AY+A+   F+  +    +  +    DTC+       I  PT+
Sbjct: 341 AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTV 400

Query: 363 TLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
            L+FSG        N +++       CLA AA  D  +    +I N+QQ+   +LYDV +
Sbjct: 401 ALVFSGGAAIDLDPNGIMYG-----NCLAFAATGD--DGTTGIIGNVQQRTFEVLYDVGS 453

Query: 423 SRLGVARELC 432
           S LG     C
Sbjct: 454 STLGFRSGAC 463


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 124/392 (31%), Positives = 190/392 (48%), Gaps = 36/392 (9%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQ 108
           K L+  E +   + +   RLQ L ++      V  P+ +G        Y++   IGTPAQ
Sbjct: 52  KNLTKFELLERAVERGSRRLQRLEAMLNGPSGVETPVYAGDG-----EYLMNLSIGTPAQ 106

Query: 109 TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
                MDT +D  W    PCT C   S+ +FN   S++F  L C +  C+ + +PTC   
Sbjct: 107 PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSNN 166

Query: 166 ACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPPQGLLGLGRGSL 223
           +C +   YG  S    ++  +T++  +  +P  TFGC +   G       GL+G+GRG L
Sbjct: 167 SCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPL 226

Query: 224 SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI------GQPKRIKYTPLLKNPRRSS 277
           SL +Q   L  + FSYC+    + S S +L LG +      G P     T L+++ +  +
Sbjct: 227 SLPSQ---LDVTKFSYCMTPIGS-STSSTLLLGSLANSVTAGSPN----TTLIESSQIPT 278

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRR 336
            YY+ L  + VG   + I P   + N   G  G IIDSGT  T     AY AVR  F  +
Sbjct: 279 FYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQ 338

Query: 337 VGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLA 391
           +  ++   S  GFD C+ +P     +  PT  + F G ++ LP +N  I  + G I CLA
Sbjct: 339 MNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENYFISPSNGLI-CLA 397

Query: 392 MAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           M ++       +++  N+QQQN  ++YD  NS
Sbjct: 398 MGSSSQG----MSIFGNIQQQNLLVVYDTGNS 425


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  171 bits (433), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 132/433 (30%), Positives = 206/433 (47%), Gaps = 53/433 (12%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS---SLAVARKSVVPIASG 88
           +T+ + H   PCSP   SK    EE   E+L +DQ R + +    ++  A      +   
Sbjct: 52  TTVALNHRHGPCSPVPSSKKRPTEE---ELLKRDQLRAEHIQRKFAMNAAVDGAGDLQQS 108

Query: 89  RQITQSPT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-----VGCSS 133
           +  +  PT          Y++   +GTPA T  + +DT +D +WV C  C        + 
Sbjct: 109 KVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTG 168

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYGS-STIAANLSQDTIS 188
            +F+ A+S+T++ + C AA+C Q+     G GA    C + + YG  ST     S+DT++
Sbjct: 169 ALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLT 228

Query: 189 L--ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           L  A+D V G+ FGC    +G S    GL+GLG G+ SL++QT   Y ++FSYCLP    
Sbjct: 229 LSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLP---- 284

Query: 247 LSFSGSLRLGPIGQPKRIKY---TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
              SGS     +G          T +L++ +  + Y   L  I VG + + + P      
Sbjct: 285 -PTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVF--- 340

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVA 359
               AG+++DSGT+ TRL   AY+A+   F+  +    +  +    DTC+       I  
Sbjct: 341 ---AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397

Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           PT+ L+FSG        N +++       CLA AA  D  +    +I N+QQ+   +LYD
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG-----NCLAFAATGD--DGTTGIIGNVQQRTFEVLYD 450

Query: 420 VPNSRLGVARELC 432
           V +S LG     C
Sbjct: 451 VGSSTLGFRSGAC 463


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  171 bits (433), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 132/434 (30%), Positives = 199/434 (45%), Gaps = 51/434 (11%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGR 89
           V H   PCSP +         S  ++L +DQAR+  +  +     S V      P   G 
Sbjct: 91  VMHRHGPCSPLQTPGD---APSDADLLDQDQARVDSILGMITNETSAVGPGVSLPAERGI 147

Query: 90  QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCT--GCVGCSSTVFNSAQSTTF 144
            +     Y+V   +GTPA+ L +  DT +D +WV   PC+  GC      +F  + S+TF
Sbjct: 148 SVGTG-NYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTF 206

Query: 145 KNLGCQAAQCKQVPNPTCGGG----ACAFNLTYGS-STIAANLSQDTISLAT-------- 191
             + C A +C+     +CGG      C + + YG  S    +L  DT++L T        
Sbjct: 207 SAVRCGARECRA--RQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASA 264

Query: 192 ---DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
              + +PG+ FGC +  TG      GL GLGRG +SL +Q    +   FSYCLPS  + +
Sbjct: 265 ENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSA 324

Query: 249 FSGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
             G L LG P+  P   ++TP+L      S YYV L+ IRV  R + +       +P   
Sbjct: 325 -PGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVS------SPRVA 377

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCY------SVPIVA 359
              I+DSGTV TRL   AY A+R  F   +G         L   DTCY      +  +  
Sbjct: 378 LPLIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSI 437

Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           P + L+F+G        + +++    +  CLA   AP+       ++ N QQ+   ++YD
Sbjct: 438 PAVALVFAGGATISVDFSGVLYVAKVAQACLAF--APNGDGRSAGILGNTQQRTLAVVYD 495

Query: 420 VPNSRLGVARELCT 433
           V   ++G A + C+
Sbjct: 496 VARQKIGFAAKGCS 509


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  171 bits (433), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 123/360 (34%), Positives = 180/360 (50%), Gaps = 25/360 (6%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
           S  Y  +  +GTPA   LM +DT +D  W+ C  C  C   S  VF+  +S ++  +GC 
Sbjct: 137 SGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCA 196

Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
           A  C+++ +  C     AC + + YG  ++ A + + +T++ A    V     GC     
Sbjct: 197 APLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGHDNE 256

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL----PSFKALSFSGSLRL--GPIGQ 260
           G  V   GLLGLGRGSLS   Q    Y  +FSYCL     S    S S ++    G +G 
Sbjct: 257 GLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFGSGAVGS 316

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVF 318
                +TP++KNPR  + YYV L+ I V G RV  +    L+ +P++G  G I+DSGT  
Sbjct: 317 TVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGVIVDSGTSV 376

Query: 319 TRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVT 372
           TRL  PAY+A+RD FR    G  L+      FDTCY +     +  PT+++ F+ G    
Sbjct: 377 TRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAA 436

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP +N LI   +    C A A     V    ++I N+QQQ  R+++D    R+    + C
Sbjct: 437 LPPENYLIPVDSKGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFDGDGQRVAFTPKGC 492


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 122/386 (31%), Positives = 178/386 (46%), Gaps = 20/386 (5%)

Query: 62  LAKDQARLQFLS-SLAVARKSVVPIASGRQITQ-----SPTYIVRAKIGTPAQTLLMAMD 115
           + +D  R+  L   LA  + +    A G  +       S  Y VR  +G+P +   + +D
Sbjct: 93  MQRDTKRVAALRRHLAAGKPTYAEEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVID 152

Query: 116 TSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
           + +D  WV   PCT C   S  VFN A S+++  + C +  C  V N  C  G C + ++
Sbjct: 153 SGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCHEGRCRYEVS 212

Query: 173 YGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
           YG  S     L+ +T++    ++     GC     G  V   GLLGLG G +S + Q   
Sbjct: 213 YGDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGG 272

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
               TFSYCL S + +  SG L+ G    P    + PL+ NPR  S YYV L  + VG  
Sbjct: 273 QAGGTFSYCLVS-RGIQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGL 331

Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT 351
            V I     + +     G ++D+GT  TRL   AY A RD F  +  +    + +  FDT
Sbjct: 332 RVPISEDVFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDT 391

Query: 352 CYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           CY     V +  PT++  FSG  + TLP  N LI        C A A +    +S L++I
Sbjct: 392 CYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPS----SSGLSII 447

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N+QQ+   I  D  N  +G    +C
Sbjct: 448 GNIQQEGIEISVDGANGFVGFGPNVC 473


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 126/404 (31%), Positives = 192/404 (47%), Gaps = 34/404 (8%)

Query: 53  SWEESVLEMLAKDQARLQFLS--------SLAVARKSVVPIASGRQITQSPTYIVRAKIG 104
           S   ++L + A+D AR+++L         +  V  + V  I+ G     S  Y VR  +G
Sbjct: 86  STRHAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVSGISEG-----SGEYFVRVGVG 140

Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
           +P     + +D+ +D  W+ C  C  C   +  +F+ A S +F  + C +  C+ +P  +
Sbjct: 141 SPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCDSGVCRTLPGGS 200

Query: 162 CG---GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLL 216
            G    GAC + ++YG  +     L+ +T++      V G   GC  +  G  V   GLL
Sbjct: 201 SGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTPVQGVAIGCGHRNRGLFVGAAGLL 260

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRR 275
           GLG G +SL+ Q        FSYCL S  A + +GSL  G     P    + PLL+N ++
Sbjct: 261 GLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQ 320

Query: 276 SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
            S YYV L  + VG   + +  G        G G ++D+GT  TRL   AY A+RD F  
Sbjct: 321 PSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFAS 380

Query: 336 RVGSNL-TVTSLGGFDTCYSV----PIVAPTITLMF--SGMNVTLPQDNLLIHSTAGSIT 388
            +G +L     +   DTCY +     +  PT+ L F   G  +TLP  NLL+    G + 
Sbjct: 381 TIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFGRDGAALTLPARNLLVE-MGGGVY 439

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           CLA AA+     S L+++ N+QQQ  +I  D  N  +G     C
Sbjct: 440 CLAFAAS----ASGLSILGNIQQQGIQITVDSANGYVGFGPSTC 479


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  171 bits (432), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 128/404 (31%), Positives = 195/404 (48%), Gaps = 34/404 (8%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAK 102
           S    V+ ++A+D AR++ L    VA  S          VVP         S  Y VR  
Sbjct: 80  SRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVD----DGSGEYFVRVG 135

Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
           +G+P     + +D+ +D  WV   PC  C   +  +F+ A S++F  + C +A C+ +  
Sbjct: 136 VGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSG 195

Query: 160 PTCGGGA----CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQG 214
             CGGG     C +++TYG  S     L+ +T++L    V G   GC  + +G  V   G
Sbjct: 196 TGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAG 255

Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-GQPKRIKYTPLLKNP 273
           LLGLG G++SL+ Q        FSYCL S +    +GSL LG     P    + PL++N 
Sbjct: 256 LLGLGWGAMSLIGQLGGAAGGVFSYCLAS-RGAGGAGSLVLGRTEAVPVGAVWVPLVRNN 314

Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
           + SS YYV L  I VG   + +  G  Q       G ++D+GT  TRL   AY A+R  F
Sbjct: 315 QASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 374

Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSIT 388
              +G+     ++   DTCY +     +  PT++  F  G  +TLP  NLL+    G++ 
Sbjct: 375 DGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVE-VGGAVF 433

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           CLA A +    +S ++++ N+QQ+  +I  D  N  +G     C
Sbjct: 434 CLAFAPS----SSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  171 bits (432), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 126/373 (33%), Positives = 184/373 (49%), Gaps = 27/373 (7%)

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
           V P+ SG     S  Y  +  +GTP    LM +DT +D  W+ C  C  C   S  +F+ 
Sbjct: 133 VAPVVSG-LAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP 191

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-V 194
             S ++  + C A  C+++ +  C     AC + + YG  ++ A + + +T++ A+   V
Sbjct: 192 RASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARV 251

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG--- 251
           P    GC     G  V   GLLGLGRGSLS  +Q    +  +FSYCL    + S S    
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSR 311

Query: 252 ----SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT 306
               +   G +G      +TP++KNPR  + YYV L+ I V G RV  +    L+ +P+T
Sbjct: 312 SSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPST 371

Query: 307 G-AGTIIDSGTVFTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAP 360
           G  G I+DSGT  TRL  PAY A+RD FR    G  L+      FDTCY +     +  P
Sbjct: 372 GRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVP 431

Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           T+++ F+ G    LP +N LI   +    C A A     V    ++I N+QQQ  R+++D
Sbjct: 432 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFD 487

Query: 420 VPNSRLGVARELC 432
               RLG   + C
Sbjct: 488 GDGQRLGFVPKGC 500


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 139/421 (33%), Positives = 214/421 (50%), Gaps = 39/421 (9%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-------SLAVARKSVVPI 85
           T+ + H + PCSP  PSK +   E   E L +DQ R  ++         +  +  + VP 
Sbjct: 56  TVPLHHRYDPCSPV-PSKKVPTLE---ERLRRDQLRAAYIKRKFSGAGDIEQSDAATVPT 111

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQST 142
             G  ++ +  Y++   IG+PA T  M+MDT +D +WV C  C  C S V   F+ + S+
Sbjct: 112 TLGTSLS-TLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSS 170

Query: 143 TFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYG-SSTIAANLSQDTISLATDIVPGY 197
           T+    C +A C Q+     G G     C + + YG SS+     S DT++L +  +  +
Sbjct: 171 TYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTLGSSAMTDF 230

Query: 198 TFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
            FGC Q  +G  +    GL+GLG G+ SL +QT   + + FSYCLP       SG L LG
Sbjct: 231 QFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSG--SSGFLTLG 288

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             G    +K TP+L++ +  + Y V L +I+VG + +++P           AG+++DSGT
Sbjct: 289 -TGSSGFVK-TPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFS------AGSLMDSGT 340

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNV 371
           + TRL   AY+A+   F+  +      T  G  DTC+       I  PT+TL+FS G  V
Sbjct: 341 IITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAV 400

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            L  D +++  ++ SI CLA    P+  +S L +I N+QQ+   +LYDV    +G     
Sbjct: 401 DLAFDGIMLEISS-SIRCLAF--TPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGA 457

Query: 432 C 432
           C
Sbjct: 458 C 458


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 114/348 (32%), Positives = 171/348 (49%), Gaps = 14/348 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  +G+P ++  M +D+ +D  WV   PCT C   +  +F+ A S +F  + C 
Sbjct: 40  SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCS 99

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           +A C QV N  C  G C + ++YG  S+    L+ +T++L   +V     GC     G  
Sbjct: 100 SAVCDQVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTLGRTVVQNVAIGCGHMNQGMF 159

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           V   GLLGLG GS+S + Q      + FSYCL S +  + +G L  G    P    + PL
Sbjct: 160 VGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVS-RVTNSNGFLEFGSEAMPVGAAWIPL 218

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           ++NP   S YY+ L  + VG   V I     +       G ++D+GT  TR    AY A 
Sbjct: 219 IRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAF 278

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
           RD F  + G+    + +  FDTCY+    + +  PT++  FSG  + TLP +N LI    
Sbjct: 279 RDAFIDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDD 338

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A A +P    S L+++ N+QQ+  +I  D  N  +G    +C
Sbjct: 339 AGTFCFAFAPSP----SGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 128/427 (29%), Positives = 203/427 (47%), Gaps = 36/427 (8%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-----------LAVARK 80
           ++L+V H   PCS    S       S  +++  D  R++++ S           +     
Sbjct: 65  ASLEVVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDS 124

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVF 136
           + +P  SGR I  +  Y+V   +GTP + L +  DT +   W  C  C G        +F
Sbjct: 125 TTLPAKSGRLIGSADYYVV-VGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIF 183

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCGG---GACAFNLTYGSSTIAAN-LSQDTISL-AT 191
           + ++S+++ N+ C ++ C Q  +  C      +C +++ YG ++I+   LSQ+ +++ AT
Sbjct: 184 DPSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITAT 243

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
           DIV  + FGC Q   G      GL+GL R  +S + QT ++Y   FSYCLPS    S  G
Sbjct: 244 DIVHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPS--TPSSLG 301

Query: 252 SLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
            L  G        +KYTP       +S Y ++++ I VG     +P  A+  +  +  G+
Sbjct: 302 HLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGG--TKLP--AVSSSTFSAGGS 357

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF 366
           IIDSGTV TRL   AY A+R  FR+ +            DTCY       I  P I   F
Sbjct: 358 IIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEF 417

Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           + G+ V LP   +L   +A  + CLA AA  +  +  + +  N+QQ+   ++YDV   R+
Sbjct: 418 AGGVKVELPLVGILYGESAQQL-CLAFAANGNGND--ITIFGNVQQKTLEVVYDVEGGRI 474

Query: 426 GVARELC 432
           G     C
Sbjct: 475 GFGAAGC 481


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 112/348 (32%), Positives = 171/348 (49%), Gaps = 14/348 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  +G+P ++  M +D+ +D  WV   PCT C   +  +F+ A S +F  + C 
Sbjct: 40  SGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCS 99

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           +A C +V N  C  G C + ++YG  S     L+ +T++    +V     GC     G  
Sbjct: 100 SAVCDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHSNRGMF 159

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           V   GLLGLG GS+S + Q      + FSYCL S +  + +G L  G    P    + PL
Sbjct: 160 VGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVS-RGTNTNGFLEFGSEAMPVGAAWIPL 218

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           ++NPR  S YY+ LL + VG   V +     Q N     G ++D+GT  TR    AY A 
Sbjct: 219 VRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAF 278

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
           R+ F  +  +    + +  FDTCY+    + +  PT++  FSG  + T+P +N LI    
Sbjct: 279 RNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVDD 338

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A A +P    S L+++ N+QQ+  +I  D  N  +G    +C
Sbjct: 339 AGTFCFAFAPSP----SGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 143/411 (34%), Positives = 205/411 (49%), Gaps = 42/411 (10%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVA-----RKSVV-PIASGRQITQ-SPTYIVRAKIGTPAQ 108
           E +   L +D+ R   +S  A A     RK V  P+ SG  + Q S  Y  +  +GTPA 
Sbjct: 83  ELLKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVVSG--LAQGSGEYFTKIGVGTPAT 140

Query: 109 TLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-- 163
             LM +DT +D  WV C  C  C   S  VF+  +S+++  +GC AA C+++ +  C   
Sbjct: 141 QALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLR 200

Query: 164 GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRG 221
            GAC + + YG  ++ A +   +T++ A    V     GC     G  V   GLLGLGRG
Sbjct: 201 RGACMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRG 260

Query: 222 SLSLLAQTQNLYQSTFSYCL---PSFKALSFSGSLR-------LGPIGQPKRIKYTPLLK 271
            LS   Q    Y  +FSYCL    S  A +  GS R        G +G      +TP+++
Sbjct: 261 GLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASS-ASFTPMVR 319

Query: 272 NPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAV 329
           NPR  + YYV L+ I V G RV  +    L+ +P+TG  G I+DSGT  TRL   +Y+A+
Sbjct: 320 NPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSAL 379

Query: 330 RDVFRRRVGSNLTVTSLGG---FDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIH 381
           RD FR      L + S GG   FDTCY +     +  PT+++ F+ G    LP +N LI 
Sbjct: 380 RDAFRAAAAGGLRL-SPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIP 438

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             +    C A A     V    ++I N+QQQ  R+++D    R+G A + C
Sbjct: 439 VDSRGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 125/424 (29%), Positives = 201/424 (47%), Gaps = 37/424 (8%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-----SSLAVAR------KSV 82
           + + H   PCSP   +       S  E+LA DQ R + +     ++  V+R      +  
Sbjct: 89  MPIVHRHGPCSPLADAHDGKLP-SHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPS 147

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV----GCSSTVFNS 138
           +P +SG  +  +  Y+V   +GTPA    +  DT +D  WV C  CV         +F+ 
Sbjct: 148 LPASSGSAL-GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDP 206

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPG 196
           A+S+T+ N+ C A  C  +    C GG C + + YG  + +    + DT++L++ D + G
Sbjct: 207 ARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKG 266

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
           + FGC ++  G      GLLGLGRG  SL  Q  + Y   F++C P+    S +G L  G
Sbjct: 267 FRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARS--SGTGYLDFG 324

Query: 257 PIGQPKRIKY--TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           P   P       TP+L +    + YYV L  IRVG +++ IP         T +GTI+DS
Sbjct: 325 PGSLPAVSAKLTTPMLVD-NGPTFYYVGLTGIRVGGKLLSIPQSVF-----TTSGTIVDS 378

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSV----PIVAPTITLMFSG 368
           GTV TRL   AY+++R  F   +         +L   DTCY       +  PT++L+F G
Sbjct: 379 GTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQG 438

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
                   + +I++ + S  CL  A   ++ +  + ++ N Q +   ++YD+    +G  
Sbjct: 439 GASLDVHASGIIYAASVSQACLGFAGNKEDDD--VGIVGNTQLKTFGVVYDIGKKVVGFC 496

Query: 429 RELC 432
              C
Sbjct: 497 PGAC 500


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 129/408 (31%), Positives = 188/408 (46%), Gaps = 48/408 (11%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQITQ--------------------SPTYIVRA 101
           L +D  R++ ++SLA        +++GR  T+                    S  Y +R 
Sbjct: 87  LQRDSLRVKSITSLAA-------VSTGRNATKRTPRTAGGFSGAVISGLSQGSGEYFMRL 139

Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVP 158
            +GTPA  + M +DT +D  W+ C+ C  C   +  +F+  +S TF  + C +  C+++ 
Sbjct: 140 GVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLD 199

Query: 159 NP----TCGGGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ 213
           +     T     C + ++YG  +    + S +T++     V     GC     G  V   
Sbjct: 200 DSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAA 259

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCL----PSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           GLLGLGRG LS  +QT+N Y   FSYCL     S  +     ++  G    PK   +TPL
Sbjct: 260 GLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPL 319

Query: 270 LKNPRRSSLYYVNLLAIRVG-RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L NP+  + YY+ LL I VG  RV  +     + + T   G IIDSGT  TRL  PAY A
Sbjct: 320 LTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVA 379

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTA 384
           +RD FR          S   FDTC+ +     +  PT+   F G  V+LP  N LI    
Sbjct: 380 LRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNT 439

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A A    +    L++I N+QQQ  R+ YD+  SR+G     C
Sbjct: 440 EGRFCFAFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 117/349 (33%), Positives = 167/349 (47%), Gaps = 14/349 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
           S  Y  R  +GTP + + M +DT +D  W+ C  C  C S    VF+  +S +F ++ C+
Sbjct: 144 SGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCR 203

Query: 151 AAQCKQVPNPTCGG-GACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGN 208
           +  C ++ +P C    +C + + YG  +      S +T++     VP    GC     G 
Sbjct: 204 SPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTRVPKVALGCGHDNEGL 263

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
            V   GLLGLGRG LS   QT   +   FSYCL    A S   S+  G     +   +TP
Sbjct: 264 FVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFGQSAVSRTAVFTP 323

Query: 269 LLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
           L+ NP+  + YY+ L  I V G RV  I     + +     G IIDSGT  TRL   AY 
Sbjct: 324 LITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYV 383

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHST 383
           ++RD FR              FDTC+ +     +  PT+ + F G +V+LP  N LI   
Sbjct: 384 SLRDAFRAGAADLKRAPDYSLFDTCFDLSGKTEVKVPTVVMHFRGADVSLPATNYLIPVD 443

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              + C A A       S L++I N+QQQ  R+++DV  SR+G A   C
Sbjct: 444 TNGVFCFAFAG----TMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGC 488


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 138/467 (29%), Positives = 209/467 (44%), Gaps = 73/467 (15%)

Query: 25  CDT---QDHSST-----LQVFHVFSPCSPFKPS---KPLSWEESVLEMLAKDQARLQFL- 72
           CDT     H +T     + + H   PCSP   +   KP S EE    +L  DQ R + + 
Sbjct: 73  CDTPREHKHGATSSGTRMPIVHRHGPCSPLADAHGGKPPSHEE----ILDADQNRAESIQ 128

Query: 73  ----SSLAVAR---KSVVPIASGRQITQSP--------------------------TYIV 99
               ++   AR   K   P  S RQ   S                            Y+V
Sbjct: 129 RRVSTTTTAARGKPKRNRPSPSRRQQPSSSAPAPGASLSSSAASLPASSGRALGTGNYVV 188

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCK 155
              +GTPA    +  DT +D  WV C  CV         +F+ A+S+T  N+ C A  C 
Sbjct: 189 TIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAPACS 248

Query: 156 QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQ 213
            +    C GG C + + YG  + +    + DT++L++ D + G+ FGC ++  G      
Sbjct: 249 DLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNEGLFGEAA 308

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY--TPLLK 271
           GLLGLGRG  SL  Q  + Y   F++C P+    S +G L  GP   P       TP+L 
Sbjct: 309 GLLGLGRGKTSLPVQAYDKYGGVFAHCFPARS--SGTGYLDFGPGSSPAVSTKLTTPMLV 366

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           +    + YYV L  IRVG +++ IPP        T AGTI+DSGTV TRL   AY+++R 
Sbjct: 367 D-NGLTFYYVGLTGIRVGGKLLSIPPSVF-----TTAGTIVDSGTVITRLPPAAYSSLRS 420

Query: 332 VFRRRVGSN--LTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAG 385
            F   + +       +L   DTCY       +  PT++L+F G        + +I++ + 
Sbjct: 421 AFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASGIIYAASV 480

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           S  CL  AA  ++ +  + ++ N Q +   ++YD+    +G +   C
Sbjct: 481 SQACLGFAANEEDDD--VGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 116/348 (33%), Positives = 168/348 (48%), Gaps = 14/348 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  +G+P ++  M +D+ +D  WV   PCT C   S  VF+ A S +F  + C 
Sbjct: 137 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCS 196

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           ++ C ++ N  C  G C + ++YG  S     L+ +T++    +V     GC  +  G  
Sbjct: 197 SSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMF 256

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           V   GLLGLG GS+S + Q        FSYCL S +    SGSL  G    P    + PL
Sbjct: 257 VGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS-RGTDSSGSLVFGREALPAGAAWVPL 315

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           ++NPR  S YY+ L  + VG   V I     +       G ++D+GT  TRL   AY A 
Sbjct: 316 VRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAF 375

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
           RD F  +  +    T +  FDTCY     V +  PT++  FSG  + TLP  N LI    
Sbjct: 376 RDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDD 435

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A A +     S L+++ N+QQ+  +I +D  N  +G    +C
Sbjct: 436 AGTFCFAFAPS----TSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 133/449 (29%), Positives = 207/449 (46%), Gaps = 66/449 (14%)

Query: 34  LQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS------ 81
           + + H   PCSP   +  KP S E+    +LA DQ R + +    S+ A  R +      
Sbjct: 87  MTIVHRHGPCSPLADAHGKPPSHED----ILAADQNRAESIQHRVSTTATGRGNPKRSRR 142

Query: 82  -----------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
                                   +P +SGR +     Y+V   +GTPA    +  DT +
Sbjct: 143 APSRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPASRYTVVFDTGS 201

Query: 119 DAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG 174
           D  WV C  CV         +F+ A+S+T+ N+ C A  C  +    C GG C + + YG
Sbjct: 202 DTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAPACSDLDTRGCSGGNCLYGVQYG 261

Query: 175 SSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
             + +    + DT++L++ D V G+ FGC ++  G      GLLGLGRG  SL  QT + 
Sbjct: 262 DGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDK 321

Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY---TPLLKNPRRSSLYYVNLLAIRVG 289
           Y   F++CLP+    S +G L  GP G P        TP+L +    + YYV +  IRVG
Sbjct: 322 YGGVFAHCLPARS--SGTGYLDFGP-GSPAAAGARLTTPMLTD-NGPTFYYVGMTGIRVG 377

Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLG 347
            +++ IP         T AGTI+DSGTV TRL   AY+++R  F   + +       ++ 
Sbjct: 378 GQLLSIPQSVF-----TTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVS 432

Query: 348 GFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
             DTCY       +  PT++L+F G        + ++++ + S  CL  AA  D  +  +
Sbjct: 433 LLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASVSQVCLGFAANEDGGD--V 490

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
            ++ N Q +   + YD+    +G +   C
Sbjct: 491 GIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 130/424 (30%), Positives = 194/424 (45%), Gaps = 39/424 (9%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS----------- 81
           +L + H+ +  S   P      E+     L +D  R++ + +LA   +S           
Sbjct: 63  SLHLHHIDALSSNKTP------EQLFQLRLQRDAKRVEGVVALAALNQSHARRSGSSFSS 116

Query: 82  --VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
             +  +A G     S  Y  R  +GTPA+ + M +DT +D  W+ C  C  C   +  VF
Sbjct: 117 SIISGLAQG-----SGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVF 171

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTIA-ANLSQDTISLATDI 193
           +  +S T+  + C A  C+++ +P C      C + ++YG  +    + S +T++     
Sbjct: 172 DPTKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTR 231

Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
           V     GC     G  +   GLLGLGRG LS   QT   +   FSYCL    A +   S+
Sbjct: 232 VTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSV 291

Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPTTGAGTII 312
             G     +  ++TPL+KNP+  + YY+ LL I VG   V  +     + +     G II
Sbjct: 292 VFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVII 351

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG 368
           DSGT  TRL  PAY A+RD FR              FDTC+ +     +  PT+ L F G
Sbjct: 352 DSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRG 411

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            +V+LP  N LI        C A A       S L++I N+QQQ  R+ +D+  SR+G A
Sbjct: 412 ADVSLPATNYLIPVDNSGSFCFAFAG----TMSGLSIIGNIQQQGFRVSFDLAGSRVGFA 467

Query: 429 RELC 432
              C
Sbjct: 468 PRGC 471


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 121/371 (32%), Positives = 175/371 (47%), Gaps = 35/371 (9%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
           VP+ +G        +++   IGTPA      +DT +D  W  C  CV C   S+ VF+ +
Sbjct: 109 VPVHAGNG-----EFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPS 163

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA--CAFNLTYG-SSTIAANLSQDTISLATDIVPG 196
            S+T+  L C ++ C  +P  TC   A  C +  TYG +S+    L+ +T +LA   +PG
Sbjct: 164 SSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTKLPG 223

Query: 197 YTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
             FGC     G+      GL+GLGRG LSL++Q   L    FSYCL S    S S  L L
Sbjct: 224 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGLGKFSYCLTSLDDTSKS-PLLL 279

Query: 256 GPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           G +            I+ TPL+KNP + S YYV L A+ VG   + +P  A         
Sbjct: 280 GSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTG 339

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP------IVAPTI 362
           G I+DSGT  T L    Y  ++  F  ++   +   S  G D C+  P      +  P +
Sbjct: 340 GVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDVEVPKL 399

Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
            L F  G ++ LP +N ++  +A    CL +  +       L++I N QQQN + +YDV 
Sbjct: 400 VLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRG-----LSIIGNFQQQNIQFVYDVD 454

Query: 422 NSRLGVARELC 432
              L  A   C
Sbjct: 455 KDTLSFAPVQC 465


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 116/353 (32%), Positives = 172/353 (48%), Gaps = 30/353 (8%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTPAQ     MDT +D  W    PCT C   S+ +FN   S++F  L C +  
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154

Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVP 211
           C+ + +PTC    C +   YG  S    ++  +T++  +  +P  TFGC +   G     
Sbjct: 155 CQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGN 214

Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI------GQPKRIK 265
             GL+G+GRG LSL +Q   L  + FSYC+    + S   +L LG +      G P    
Sbjct: 215 GAGLVGMGRGPLSLPSQ---LDVTKFSYCMTPIGS-STPSNLLLGSLANSVTAGSPN--- 267

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAP 324
            T L+++ +  + YY+ L  + VG   + I P A   N   G  G IIDSGT  T  V  
Sbjct: 268 -TTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNN 326

Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLL 379
           AY +VR  F  ++   +   S  GFD C+  P     +  PT  + F G ++ LP +N  
Sbjct: 327 AYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYF 386

Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           I  + G I CLAM ++       +++  N+QQQN  ++YD  NS +  A   C
Sbjct: 387 ISPSNGLI-CLAMGSSSQG----MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  167 bits (424), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 127/404 (31%), Positives = 194/404 (48%), Gaps = 34/404 (8%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAK 102
           S    V+ ++A+D AR++ L    VA  S          VVP         S  Y VR  
Sbjct: 80  SRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVD----DGSGEYFVRVG 135

Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
           +G+P     + +D+ +D  WV   PC  C   +  +F+ A S++F  + C +A C+ +  
Sbjct: 136 VGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSG 195

Query: 160 PTCGGGA----CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQG 214
             CGGG     C +++TYG  S     L+ +T++L    V G   GC  + +G  V   G
Sbjct: 196 TGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAG 255

Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-GQPKRIKYTPLLKNP 273
           LLGLG G++SL+ Q        FSYCL S +    +GSL LG     P    + PL++N 
Sbjct: 256 LLGLGWGAMSLVGQLGGAAGGVFSYCLAS-RGAGGAGSLVLGRTEAVPVGAVWVPLVRNN 314

Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
           + SS YYV L  I VG   + +     Q       G ++D+GT  TRL   AY A+R  F
Sbjct: 315 QASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 374

Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSIT 388
              +G+     ++   DTCY +     +  PT++  F  G  +TLP  NLL+    G++ 
Sbjct: 375 DGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVE-VGGAVF 433

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           CLA A +    +S ++++ N+QQ+  +I  D  N  +G     C
Sbjct: 434 CLAFAPS----SSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  167 bits (424), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 141/441 (31%), Positives = 205/441 (46%), Gaps = 58/441 (13%)

Query: 29  DHSST---LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARL-QFLSSL--------- 75
           +HSS+   L + H   PCSP     P S       +L  D AR+  F + L         
Sbjct: 37  NHSSSAVHLPLHHPRGPCSPLSADIPFS------AVLTHDAARIASFAARLAKKSSPSSA 90

Query: 76  ------AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC- 128
                 A +  + VP+  G  +     Y+ R  +GTPA+  +M +DT +   W+ C+ C 
Sbjct: 91  SATTQAAGSSLASVPLTPGTSVGVG-NYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCR 149

Query: 129 VGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSSTIA 179
           V C   S  VF+   S+++  + C + QC  +   T     C+      +  +YG S+ +
Sbjct: 150 VSCHRQSGPVFDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFS 209

Query: 180 AN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
              LS+DT+S   + VP + +GC Q   G      GL+GL R  LSLL Q       +FS
Sbjct: 210 VGYLSKDTVSFGANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFS 269

Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           YCLPS    S SG L +G    P    YTP++ N    SLY+++L  + V  +     P 
Sbjct: 270 YCLPS---TSSSGYLSIGSY-NPGGYSYTPMVSNTLDDSLYFISLSGMTVAGK-----PL 320

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY---- 353
           A+  +  T   TIIDSGTV TRL    YTA+ + V     GS     +    DTC+    
Sbjct: 321 AVSSSEYTSLPTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQA 380

Query: 354 SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
           S     P +++ FS G  + L   NLL+    G+ TCLA A A         +I N QQQ
Sbjct: 381 SKLRAVPAVSMAFSGGATLKLSAGNLLVD-VDGATTCLAFAPARSAA-----IIGNTQQQ 434

Query: 413 NHRILYDVPNSRLGVARELCT 433
              ++YDV ++R+G A   C+
Sbjct: 435 TFSVVYDVKSNRIGFAAAGCS 455


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  167 bits (423), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 126/404 (31%), Positives = 185/404 (45%), Gaps = 36/404 (8%)

Query: 57  SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT------YIVRAKIGTPAQTL 110
           S L++L +   R     S  VAR + V   +G    Q P       +++   IGTPA + 
Sbjct: 54  SRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQVPVHAGNGEFLMDVAIGTPALSY 113

Query: 111 LMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC-GGGA 166
              +DT +D  W  C  CV C   S+ VF+ + S+T+  + C +A C  +P  TC     
Sbjct: 114 AAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALCSDLPTSTCTSASK 173

Query: 167 CAFNLTYG-SSTIAANLSQDTISLATDI--VPGYTFGCIQKATGNS-VPPQGLLGLGRGS 222
           C +  TYG +S+    L+ +T +L  +   +PG  FGC     G+      GL+GLGRG 
Sbjct: 174 CGYTYTYGDASSTQGVLASETFTLGKEKKKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGP 233

Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-------IKYTPLLKNPRR 275
           LSL++Q   L    FSYCL S         L LG              ++ TPL+KNP +
Sbjct: 234 LSLVSQ---LGLDKFSYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQ 290

Query: 276 SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
            S YYV+L  + VG   + +P  A         G I+DSGT  T L    Y A++  F  
Sbjct: 291 PSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVA 350

Query: 336 RVGSNLTVTSLGGFDTCYSVP------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSIT 388
           ++       S  G D C+  P      +  P + L F  G ++ LP +N ++  +A    
Sbjct: 351 QMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSASGAL 410

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           CL +A +       L++I N QQQN + +YDV    L  A   C
Sbjct: 411 CLTVAPS-----RGLSIIGNFQQQNFQFVYDVAGDTLSFAPVQC 449


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  167 bits (423), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 144/470 (30%), Positives = 231/470 (49%), Gaps = 61/470 (12%)

Query: 4   QLVFFLAFLFLFSLSE--GLNPICDTQDHSSTL-QVFHVFSPCSPFKPSKPLSWEESVLE 60
           +LV FL F+ + + S    L     + + SS L  ++HV    S  +P+   S+     +
Sbjct: 10  KLVCFLTFMIVLATSSFAKLEEYKLSANQSSILLNLYHVHGDASSLEPNSSSSF----CD 65

Query: 61  MLAKDQARLQFLSSLAVARKSV--------------------VPIASGRQITQSPTYIVR 100
           +L++D+  ++FLSS  + +K V                    +P+  G  I  S  Y ++
Sbjct: 66  ILSRDEEHVKFLSS-RLRKKDVQGASFSRHKSGHLLEPNSANIPLNPGLSIG-SGNYYLK 123

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCKQ 156
             +G+P +   M +DT +  +W+ C  CV  C S V   F  + S T++ L C +++C  
Sbjct: 124 LGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSL 183

Query: 157 VP-----NPTC-GGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATGN 208
           +      +P C   G C +  +YG ++ +   LS+D ++L  +  +P +T+GC Q   G 
Sbjct: 184 LKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQTLPSFTYGCGQDNEGL 243

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
                G++GL R  LS+LAQ    Y   FSYCLP+  + S  G L +G I  P   K+TP
Sbjct: 244 FGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTS-SGGGFLSIGKI-SPSSYKFTP 301

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           +++N +  SLY++ L AI V  R V +     Q        TIIDSGTV TRL    Y A
Sbjct: 302 MIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVP------TIIDSGTVVTRLPISIYAA 355

Query: 329 VRDVFRRRVGSNLT-VTSLGGFDTCYSVPIV----APTITLMF-SGMNVTLPQDNLLIHS 382
           +R+ F + +        +    DTC+   +     AP I ++F  G +++L   N+LI +
Sbjct: 356 LREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEA 415

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             G I CLA A++     + + +I N QQQ + I YDV  S++G A   C
Sbjct: 416 DKG-IACLAFASS-----NQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 122/361 (33%), Positives = 178/361 (49%), Gaps = 23/361 (6%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG  +  S  Y  R  +G+PA+ L M +DT +D  WV C  C  C   S  VF+ + 
Sbjct: 151 PVVSGVGLG-SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSL 209

Query: 141 STTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLSQDTISLATDI-VPG 196
           ST++ ++ C   +C  +    C    GAC + + YG  S    + + +T++L     V  
Sbjct: 210 STSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSS 269

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
              GC     G  V   GLL LG G LS  +Q   +  +TFSYCL    + S S +L+ G
Sbjct: 270 VAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISATTFSYCLVDRDSPS-SSTLQFG 325

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
                +     PL+++PR S+ YYV L  I VG +++ IPP A   + T   G I+DSGT
Sbjct: 326 DAADAEVT--APLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGT 383

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNV 371
             TRL + AY A+RD F R   S    + +  FDTCY +     +  P ++L F+ G  +
Sbjct: 384 AVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGEL 443

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            LP  N LI        CLA A      N+ +++I N+QQQ  R+ +D   S +G     
Sbjct: 444 RLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSNK 499

Query: 432 C 432
           C
Sbjct: 500 C 500


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 131/415 (31%), Positives = 196/415 (47%), Gaps = 47/415 (11%)

Query: 53  SWEESVLEMLAKDQARLQFLSS-LAVARK------SVVPIASGRQITQSPTYIVRAKIGT 105
           S   +VL+++A+D AR ++L++ L+ A +      S   + SG     S  Y+VR  +G+
Sbjct: 121 SLRHAVLDLVARDNARAEYLATRLSPAYQPPGFSGSESKVVSGLD-EGSGEYLVRVSVGS 179

Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC 162
           P     + +D+ +D  WV C  C+ C   +  +F+ A S TF  + C +A C+ +P   C
Sbjct: 180 PPTEQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAICRILPTSAC 239

Query: 163 GGG---ACAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLG 217
           G G    C + ++Y  GS T  A L+ +T++L    V G   GC  +  G  V   GL+G
Sbjct: 240 GDGELGGCEYEVSYADGSYTKGA-LALETLTLGGTAVEGVVIGCGHRNRGLFVGAAGLMG 298

Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSF------KALSFSGSLRLG-PIGQPKRIKYTPLL 270
           LG G +SL+ Q        FSYCL S        A   +G L LG     P+   + PL+
Sbjct: 299 LGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLV 358

Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
           +NPR  S YYV L  I VG   + +  G  Q         ++D+GT  TRL   AY A+R
Sbjct: 359 RNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALR 418

Query: 331 DVFR--------RRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDN 377
           D F         R  G + +V      DTCY +     +  PT++  F G   + L   N
Sbjct: 419 DAFVGALAGAVPRAQGVSSSV-----LDTCYDLSGYASVRVPTVSFCFDGDARLILAARN 473

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +L+    G I CLA A +    +S L+++ N QQ   +I  D  N  +G     C
Sbjct: 474 VLLEVDMG-IYCLAFAPS----SSGLSIMGNTQQAGIQITVDSANGYIGFGPANC 523


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 115/347 (33%), Positives = 172/347 (49%), Gaps = 24/347 (6%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAA 152
           Y++    GTP +   +  DT ++  W+ C  CV  C      +F+   S+T++N+ C +A
Sbjct: 16  YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISCTSA 75

Query: 153 QCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSV 210
            C  + +  C G  C + +TYG  S+    L+ +T +LA  ++   + FGC Q   G   
Sbjct: 76  ACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGNVFNNFIFGCGQNNQGLFT 135

Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI-KYTPL 269
              GL+GLGR   SL +Q      + FSYCLPS    S +G L    IG P R   YT +
Sbjct: 136 GAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTS--SATGYLN---IGNPLRTPGYTAM 190

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           L N R  +LY+++L+ I VG   + +     Q       GTIIDSGTV TRL   AY A+
Sbjct: 191 LTNSRAPTLYFIDLIGISVGGTRLALSSTVFQ-----SVGTIIDSGTVITRLPPTAYGAL 245

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAG 385
           R  FR  +       +    DTCY    +  +  PTI L ++G++VT+P   +  +  + 
Sbjct: 246 RTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVF-YVISS 304

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           S  CLA A   D+    + +I N+QQ+   + YD    R+G A   C
Sbjct: 305 SQVCLAFAGNSDSTQ--IGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 124/386 (32%), Positives = 179/386 (46%), Gaps = 39/386 (10%)

Query: 77  VARKSVVPI----ASGRQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
           VAR + VP+    A+G    Q P       +++   IGTPA      +DT +D  W  C 
Sbjct: 75  VARATGVPMTSSKAAGGGDLQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK 134

Query: 127 GCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAAN 181
            CV C   S+ VF+ + S+T+  + C +A C  +P   C     C +  TYG SS+    
Sbjct: 135 PCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGV 194

Query: 182 LSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
           L+ +T +LA   +PG  FGC     G+      GL+GLGRG LSL++Q   L    FSYC
Sbjct: 195 LATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQ---LGLDKFSYC 251

Query: 241 LPSFKALSFSGSLRLGPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           L S    + S  L LG +            ++ TPL+KNP + S YYV+L AI VG   +
Sbjct: 252 LTSLDDTNNS-PLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRI 310

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
            +P  A         G I+DSGT  T L    Y A++  F  ++       S  G D C+
Sbjct: 311 SLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCF 370

Query: 354 SVP------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
             P      +  P +   F  G ++ LP +N ++        CL +  +       L++I
Sbjct: 371 RAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-----LSII 425

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N QQQN + +YDV +  L  A   C
Sbjct: 426 GNFQQQNFQFVYDVGHDTLSFAPVQC 451


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 133/420 (31%), Positives = 200/420 (47%), Gaps = 41/420 (9%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
           T+ + H   PCS    + P +   ++ +ML +DQ R  +++              G  +T
Sbjct: 58  TVPLHHRHGPCS----TVPSTNAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVEGSDVT 113

Query: 93  QSPT---------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS---TVFNSAQ 140
              T         Y++   +G+PA    M +DT +D +WV C  C  C S   ++F+ + 
Sbjct: 114 VPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSS 173

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
           S+T+    C +A C Q+    C    C + + YG  ST +   S DT++L +  V  + F
Sbjct: 174 SSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLALGSSTVENFQF 233

Query: 200 GCIQKATGNSVPPQGLLGLGRGSL--SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           GC Q  +GN +  Q    +G G    SL  QT   +   FSYCLP       SG L LG 
Sbjct: 234 GCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPG--SSGFLTLGA 291

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
                 +K TP+L++ +  S Y V L AIRVG R ++IP  A        AG+I+DSGT+
Sbjct: 292 STSGFVVK-TPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFS------AGSIMDSGTI 344

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVT- 372
            TRL   AY+A+   F+  +        +G FDTC+       +  PT+ L+FSG  V  
Sbjct: 345 ITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGAVVD 404

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           L  D +++ S      CLA AA  D+ +  L +I N+QQ+   +LYDV    +G     C
Sbjct: 405 LASDGIILGS------CLAFAANSDDTS--LGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 124/386 (32%), Positives = 179/386 (46%), Gaps = 39/386 (10%)

Query: 77  VARKSVVPI----ASGRQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
           VAR + VP+    A+G    Q P       +++   IGTPA      +DT +D  W  C 
Sbjct: 65  VARATGVPMTSSKAAGGGDLQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK 124

Query: 127 GCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAAN 181
            CV C   S+ VF+ + S+T+  + C +A C  +P   C     C +  TYG SS+    
Sbjct: 125 PCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGV 184

Query: 182 LSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
           L+ +T +LA   +PG  FGC     G+      GL+GLGRG LSL++Q   L    FSYC
Sbjct: 185 LATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQ---LGLDKFSYC 241

Query: 241 LPSFKALSFSGSLRLGPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           L S    + S  L LG +            ++ TPL+KNP + S YYV+L AI VG   +
Sbjct: 242 LTSLDDTNNS-PLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRI 300

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
            +P  A         G I+DSGT  T L    Y A++  F  ++       S  G D C+
Sbjct: 301 SLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCF 360

Query: 354 SVP------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
             P      +  P +   F  G ++ LP +N ++        CL +  +       L++I
Sbjct: 361 RAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-----LSII 415

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N QQQN + +YDV +  L  A   C
Sbjct: 416 GNFQQQNFQFVYDVGHDTLSFAPVQC 441


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 126/427 (29%), Positives = 209/427 (48%), Gaps = 39/427 (9%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF----LSSLAV--ARKSVV 83
           +S +L+V H   PC      +  +   S +E+L +D+ R+      LSS  V   +++ +
Sbjct: 61  NSLSLEVVHRSGPCIQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGVFQEKQATL 120

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ--- 140
           P+ SG  I  S  Y V   +GTP +   +  DT +D  W   T C  C+ T +   +   
Sbjct: 121 PVQSGASIG-SGDYAVTVGLGTPKKEFTLIFDTGSDLTW---TQCEPCAKTCYKQKEPRL 176

Query: 141 ----STTFKNLGCQAAQCKQVP---NPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT- 191
               ST++KN+ C +A CK +      +C    C + + YG  + +    + +T++L++ 
Sbjct: 177 DPTKSTSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSS 236

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
           ++   + FGC Q+ +G      GLLGLGR  LSL +QT   Y+  FSYCLP+  + S  G
Sbjct: 237 NVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPA--SSSSKG 294

Query: 252 SLRLGPIGQ-PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
            L  G  GQ  K +K+TPL ++ + +  Y +++  + VG   + I          + +GT
Sbjct: 295 YLSFG--GQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIF-----STSGT 347

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
           +IDSGTV TRL + AY+A+   F++ +    +      FDTCY       I  P + + F
Sbjct: 348 VIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKIPKVGVSF 407

Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
             G+ + +    +L         CLA A   D+V +   +  N QQ+ ++++YD    R+
Sbjct: 408 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAA--IFGNTQQKTYQVVYDDAKGRV 465

Query: 426 GVARELC 432
           G A   C
Sbjct: 466 GFAPSGC 472


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 140/441 (31%), Positives = 215/441 (48%), Gaps = 64/441 (14%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL--------AVARK----- 80
           L+++H+ S  SP     P S       M AKD+ R+++  S         A ++K     
Sbjct: 33  LKLYHMTSLKSP-----PNSTSLLFAYMFAKDEERIRYFHSRLAKNSDANASSKKVGPKL 87

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC---SSTVF 136
           + +P+ SG  +  S  Y V+  +G+P +   M +DT +  +W+ C  C + C      VF
Sbjct: 88  AGIPLKSGLSMG-SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVF 146

Query: 137 NSAQSTTFKNLGCQAAQCKQ-----VPNPTCG--GGACAFNLTYGSSTIA-ANLSQDTIS 188
           N + S T+K + C ++QC       +  PTC     AC +  +YG S+ +   LSQD ++
Sbjct: 147 NPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLT 206

Query: 189 LA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
           L  +  +  + +GC Q   G      G++GL    LS+L+Q    Y + FSYCLP+    
Sbjct: 207 LTPSQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPT---- 262

Query: 248 SFS-------GSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           SFS       G L +G   +      K+TPLLKNP   SLY+++L +I V  R + +   
Sbjct: 263 SFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAAS 322

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYS--- 354
           + +        TIIDSGTV TRL  P YT +++ +   +         +   DTC+    
Sbjct: 323 SYK------VPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSL 376

Query: 355 --VPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
             +  VAP I ++F  G ++ L   N L+    G ITCLAMA +     S + +I N QQ
Sbjct: 377 AGISEVAPDIRIIFKGGADLQLKGHNSLVELETG-ITCLAMAGS-----SSIAIIGNYQQ 430

Query: 412 QNHRILYDVPNSRLGVARELC 432
           Q  ++ YDV NSR+G A   C
Sbjct: 431 QTVKVAYDVGNSRVGFAPGGC 451


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 121/361 (33%), Positives = 178/361 (49%), Gaps = 23/361 (6%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG  +  S  Y  R  +G+PA+ L M +DT +D  WV C  C  C   S  VF+ + 
Sbjct: 155 PVVSGVGL-GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSL 213

Query: 141 STTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLSQDTISLATDI-VPG 196
           ST++ ++ C   +C  +    C    GAC + + YG  S    + + +T++L     V  
Sbjct: 214 STSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSS 273

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
              GC     G  V   GLL LG G LS  +Q   +  +TFSYCL    + S S +L+ G
Sbjct: 274 VAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISATTFSYCLVDRDSPS-SSTLQFG 329

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
                +     PL+++PR S+ YYV L  + VG +++ IPP A   + T   G I+DSGT
Sbjct: 330 DAADAEVTA--PLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGT 387

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNV 371
             TRL + AY A+RD F R   S    + +  FDTCY +     +  P ++L F+ G  +
Sbjct: 388 AVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGEL 447

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            LP  N LI        CLA A      N+ +++I N+QQQ  R+ +D   S +G     
Sbjct: 448 RLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNK 503

Query: 432 C 432
           C
Sbjct: 504 C 504


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 138/454 (30%), Positives = 199/454 (43%), Gaps = 60/454 (13%)

Query: 16  SLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL 75
           S+SE    +     H   L  F   SP   FK              L +D  R++ ++SL
Sbjct: 56  SVSESTTSLSVHLSHVDALSSFSDASPVDLFKL------------RLQRDSLRVKSITSL 103

Query: 76  AVARKSVVPIASGRQITQ--------------------SPTYIVRAKIGTPAQTLLMAMD 115
           A        +++GR  T+                    S  Y +R  +GTPA  + M +D
Sbjct: 104 AA-------VSTGRNATKRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLD 156

Query: 116 TSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP----TCGGGACA 168
           T +D  W+ C+ C  C   S  +F+  +S TF  + C +  C+++ +     T     C 
Sbjct: 157 TGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCL 216

Query: 169 FNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLA 227
           + ++YG  +    + S +T++     V     GC     G  V   GLLGLGRG LS  +
Sbjct: 217 YQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPS 276

Query: 228 QTQNLYQSTFSYCL----PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNL 283
           QT++ Y   FSYCL     S  +     ++  G    PK   +TPLL NP+  + YY+ L
Sbjct: 277 QTKSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQL 336

Query: 284 LAIRVG-RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           L I VG  RV  +     + + T   G IIDSGT  TRL   AY A+RD FR        
Sbjct: 337 LGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKR 396

Query: 343 VTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
             S   FDTC+ +     +  PT+   F G  V+LP  N LI        C A A    +
Sbjct: 397 APSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGS 456

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               L++I N+QQQ  R+ YD+  SR+G     C
Sbjct: 457 ----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 131/455 (28%), Positives = 199/455 (43%), Gaps = 68/455 (14%)

Query: 26  DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLA--VARKSVV 83
           D    ++ + + H   PCSP   +       S  E+LA DQ+R + +           V 
Sbjct: 81  DATSSTTRMTIVHRHGPCSPLAAAH--GEPPSHGEILAADQSRAESIQHRVSTTTTDRVN 138

Query: 84  PIASGRQITQ--------------------SP-------TYIVRAKIGTPAQTLLMAMDT 116
           P  S  +  Q                    SP        Y+V   +GTPA    +  DT
Sbjct: 139 PKRSRHRQQQPPSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYTVVFDT 198

Query: 117 SNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
            +D  WV C  CV  C      +F+ A S+T+ N+ C A  C  +    C GG C + + 
Sbjct: 199 GSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQ 258

Query: 173 YGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
           YG  + +    + DT++L++ D V G+ FGC ++  G      GLLGLGRG  SL  QT 
Sbjct: 259 YGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTY 318

Query: 231 NLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
             Y   F++CLP+    + +G L  G  G P     TP+L      + YYV +  IRVG 
Sbjct: 319 GKYGGVFAHCLPARS--TGTGYLDFG-AGSPPATTTTPMLTG-NGPTFYYVGMTGIRVGG 374

Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV---------FRRRVGSNL 341
           R++ I P          AGTI+DSGTV TRL   AY+++R           +R+    +L
Sbjct: 375 RLLPIAPSVF-----AAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL 429

Query: 342 TVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
                   DTCY       +  PT++L+F G        + ++++ + S  CLA A   D
Sbjct: 430 -------LDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNED 482

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             +  + ++ N Q +   + YD+    +G +   C
Sbjct: 483 GGD--VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 118/371 (31%), Positives = 173/371 (46%), Gaps = 34/371 (9%)

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
           +VP+ +G        +++   IGTPA      +DT +D  W  C  CV C   S+ VF+ 
Sbjct: 64  LVPVHAGNG-----EFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDP 118

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPG 196
           + S+T+  + C +A C  +P   C     C +  TYG SS+    L+ +T +LA   +PG
Sbjct: 119 SSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPG 178

Query: 197 YTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
             FGC     G+      GL+GLGRG LSL++Q   L    FSYCL S    + S  L L
Sbjct: 179 VVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQ---LGLDKFSYCLTSLDDTNNS-PLLL 234

Query: 256 GPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           G +            ++ TPL+KNP + S YYV+L AI VG   + +P  A         
Sbjct: 235 GSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTG 294

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP------IVAPTI 362
           G I+DSGT  T L    Y A++  F  ++       S  G D C+  P      +  P +
Sbjct: 295 GVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 354

Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
              F  G ++ LP +N ++        CL +  +       L++I N QQQN + +YDV 
Sbjct: 355 VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-----LSIIGNFQQQNFQFVYDVG 409

Query: 422 NSRLGVARELC 432
           +  L  A   C
Sbjct: 410 HDTLSFAPVQC 420


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 178/352 (50%), Gaps = 25/352 (7%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTG-CVGCSSTVFNSAQSTTFKNLGCQAA 152
           ++V    GTPAQT  +  DT +D +W+   PC+G C      +F+  +S T+  + C   
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHP 179

Query: 153 QCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSV 210
           QC          G C + + YG  S+ A  LS +T+SL +   +PG+ FGC +   G+  
Sbjct: 180 QCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTSARALPGFAFGCGETNLGDFG 239

Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIKYT 267
              GL+GLGRG LSL +Q    + + FSYCLPS+   +  G L +G   P      ++YT
Sbjct: 240 DVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYN--TSHGYLTIGTTTPASGSDGVRYT 297

Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
            +++     S Y+V+L++I VG  V+ +PP  + F   T  GT++DSGTV T L   AYT
Sbjct: 298 AMIQKQDYPSFYFVDLVSIVVGGFVLPVPP--ILF---TRDGTLLDSGTVLTYLPPEAYT 352

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLI-- 380
           A+RD F+  +       +   FDTCY       I  P ++  FS G +  L    +LI  
Sbjct: 353 ALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFDLSPFGVLIFP 412

Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             TA +  CLA    P  +     ++ N QQ+N  ++YDV   ++G     C
Sbjct: 413 DDTAPATGCLAFVPRPSTMP--FTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 139/431 (32%), Positives = 199/431 (46%), Gaps = 25/431 (5%)

Query: 17  LSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLA 76
           LSE L+   +T   S  L + H+ +  S   P      E+     L +D  R++ L +  
Sbjct: 40  LSETLSEPQETLSLSLHLHLHHIDALSSNKTP------EQLFHLRLQRDAKRVEALLNQI 93

Query: 77  VARKSVVPIASGRQITQ----SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC- 131
            AR+S     S   I+     S  Y  R  +GTPA+ + M +DT +D  W+ C  C  C 
Sbjct: 94  HARRSAGSSFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCY 153

Query: 132 --SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTIA-ANLSQDT 186
             +  VF+  +S T+  + C A  C+++ +P C      C + ++YG  +    + S +T
Sbjct: 154 TQTDHVFDPTKSRTYAGIPCGAPLCRRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTET 213

Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           ++   + V     GC     G      GLLGLGRG LS   QT   +   FSYCL    A
Sbjct: 214 LTFRRNRVTRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSA 273

Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPT 305
            +   S+  G     +   +TPL+KNP+  + YY+ LL I VG   V  +     + +  
Sbjct: 274 SAKPSSVIFGDSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAA 333

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPT 361
              G IIDSGT  TRL  PAY A+RD FR              FDTC+ +     +  PT
Sbjct: 334 GNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSLFDTCFDLSGLTEVKVPT 393

Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           + L F G +V+LP  N LI        C A A       S L++I N+QQQ  RI YD+ 
Sbjct: 394 VVLHFRGADVSLPATNYLIPVDNSGSFCFAFAG----TMSGLSIIGNIQQQGFRISYDLT 449

Query: 422 NSRLGVARELC 432
            SR+G A   C
Sbjct: 450 GSRVGFAPRGC 460


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 168/356 (47%), Gaps = 27/356 (7%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS---TVFNSAQSTTFKNLGCQAAQ 153
           Y+   ++GTP +   + +DT +D  WV C+ C  C S   ++F    ST+F  L C    
Sbjct: 3   YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTEL 62

Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAAN------LSQDTISLATDIVPGYTFGCIQKATG 207
           C  +P P C    C +  +YG  +++        ++ D I+     VP + FGC     G
Sbjct: 63  CNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEG 122

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGPIGQPK--RI 264
           +     G+LGLG+G LS  +Q + ++   FSYCL  + A     S L  G    P    +
Sbjct: 123 SFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTFPGV 182

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
           KY  LL NP+  + YYV L  I VG ++++I   A   +    AGTI DSGT  T+L   
Sbjct: 183 KYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVTQLAGE 242

Query: 325 AYTAV--------RDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQD 376
            +  V         D  R+   S+     LGGF     +P V P++T  F G ++ LP  
Sbjct: 243 VHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEG-QLPTV-PSMTFHFEGGDMELPPS 300

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           N  I   +    C +M ++PD     + +I ++QQQN ++ YD    ++G   + C
Sbjct: 301 NYFIFLESSQSYCFSMVSSPD-----VTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 186/389 (47%), Gaps = 26/389 (6%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
           E +   + + + RLQ LS+   + +  V  P+ +G        +++   IGTPA+T    
Sbjct: 59  ERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNG-----EFLMNLAIGTPAETYSAI 113

Query: 114 MDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
           MDT +D  W    PC  C    + +F+  +S++F  L C +  C  +P  +C  G C + 
Sbjct: 114 MDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG-CEYR 172

Query: 171 LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQ 228
            +YG  S+    L+ +T +     V    FGC +   G +     GL+GLGRG LSL++Q
Sbjct: 173 YSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQ 232

Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
              L    FSYCL S        +L +G     K    TPL++NP R S YY++L  I V
Sbjct: 233 ---LGVPKFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISV 289

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
           G  ++ I             G IIDSGT  T L   A+ A++  F  ++  ++  +    
Sbjct: 290 GDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTE 349

Query: 349 FDTCYSV-----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            + C+++     P+  P +   F G+++ LP++N +I  +A  + CL M ++     S +
Sbjct: 350 LELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKENYIIEDSALRVICLTMGSS-----SGM 404

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
           ++  N QQQN  +L+D+    +  A   C
Sbjct: 405 SIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 131/412 (31%), Positives = 187/412 (45%), Gaps = 45/412 (10%)

Query: 58  VLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT---------YIVRAKIGTPAQ 108
           V + L +D  R Q   S    R   +  + GR    + T         Y++   IGTP  
Sbjct: 65  VRDALRRDMHR-QRSRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPPL 123

Query: 109 TLLMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQAA--QCKQVPNPTC 162
                 DT +D  W  C    T C    + ++N A STTF  L C ++   C        
Sbjct: 124 PYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAA 183

Query: 163 GGGACA--FNLTYGSSTIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQGL 215
               CA  +N TYG+   A     +T +  +       VPG  FGC   ++ +     GL
Sbjct: 184 PPPGCACMYNQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGL 243

Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLLKNP 273
           +GLGRGSLSL++Q   L    FSYCL  F+  + + +L LGP        ++ TP + +P
Sbjct: 244 VGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASP 300

Query: 274 RR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
            R   S+ YY+NL  I +G + + I PGA    P    G IIDSGT  T L   AY  VR
Sbjct: 301 ARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVR 360

Query: 331 DVFRRRVGSNLTV--TSLGGFDTCYSV-------PIVAPTITLMFSGMNVTLPQDNLLIH 381
              +  V +  TV  +   G D C+++       P V P++TL F G ++ LP D+ +I 
Sbjct: 361 AAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDGADMVLPADSYMIS 420

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            +   + CLAM    +  +  ++   N QQQN  ILYDV    L  A   C+
Sbjct: 421 GSG--VWCLAMR---NQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 467


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 123/363 (33%), Positives = 171/363 (47%), Gaps = 38/363 (10%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTP +     +DT +D  W    PC  CV   +  F+ AQS ++  L C +  
Sbjct: 89  YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPM 148

Query: 154 CKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATD----IVPGYTFGCIQKATGN 208
           C  +  P C    C +   YG S+  A  LS +T +  T+     VP   FGC     G+
Sbjct: 149 CNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCGNLNAGS 208

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRL----GPIG 259
                G++G GRG LSL++Q   L    FSYCL SF +     L F     L       G
Sbjct: 209 LFNGSGMVGFGRGPLSLVSQ---LGSPRFSYCLTSFMSPVPSRLYFGAYATLNSTSASTG 265

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVF 318
           +P  ++ TP + NP   ++YY+N+  I VG  ++ I P     N   G  G IIDSG+  
Sbjct: 266 EP--VQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTI 323

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLT-VTSLGG-FDTCYSVP------IVAPTITLMFSGMN 370
           T L   AY  V   F  +VG  LT  TSL    DTC+  P      +  P +   F G N
Sbjct: 324 TYLARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHFEGAN 383

Query: 371 VTLPQDN-LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           + LP +N +LI    G++ CLA+AA+ D      ++I + Q QN  +LYD  NS L    
Sbjct: 384 MELPLENYMLIDGDTGNL-CLAIAASDDG-----SIIGSFQHQNFHVLYDNENSLLSFTP 437

Query: 430 ELC 432
             C
Sbjct: 438 ATC 440


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 123/394 (31%), Positives = 184/394 (46%), Gaps = 32/394 (8%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           +A+ +AR+  L SLA A  ++       + ++   Y++   IG+P +     +DT +D  
Sbjct: 51  VARSRARVAALQSLATAADAITAARILLRFSEG-EYLMDVGIGSPPRYFSAMIDTGSDLI 109

Query: 122 W---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI 178
           W    PC  CV   +  F  A+ST++ +L C +A C  + +P C   AC +   YG S  
Sbjct: 110 WTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYSPLCFQNACVYQAFYGDSAS 169

Query: 179 AAN-LSQDTISLATD----IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
           +A  L+ +T +  T+     VP  +FGC     G      G++G GRG+LSL++Q   L 
Sbjct: 170 SAGVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQ---LG 226

Query: 234 QSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVNLLAI 286
              FSYCL SF +     L F     L          ++ TP + NP   ++Y++N+  I
Sbjct: 227 SPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGI 286

Query: 287 RVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVT 344
            V   ++ I P     N T G  G IIDSGT  T L  PAY  V+  F   VG      T
Sbjct: 287 SVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANAT 346

Query: 345 SLGGFDTCYSVP------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
               FDTC+  P      +  P + L F G ++ LP +N ++        CLAM  + D 
Sbjct: 347 PSDTFDTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDG 406

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                ++I + Q QN  +LYD+ NS L      C
Sbjct: 407 -----SIIGSFQHQNFHMLYDLENSLLSFVPAPC 435


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 122/400 (30%), Positives = 192/400 (48%), Gaps = 38/400 (9%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--------VPIASGRQITQSPTYIVRA 101
           K L+  E V   + + ++RLQ L+++ +A  S          PI +G     +  Y++  
Sbjct: 58  KNLTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHAG-----NGEYLIEL 112

Query: 102 KIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
            IGTP  +    +DT +D  W    PCT C    + +F+  +S++F  + C ++ C  +P
Sbjct: 113 AIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALP 172

Query: 159 NPTCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI----VPGYTFGCIQKATGNSVP-P 212
           + TC  G C +  +YG  ++    L+ +T +         V    FGC +   G+     
Sbjct: 173 SSTCSDG-CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQA 231

Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL--RLGPIGQPKRIKYTPLL 270
            GL+GLGRG LSL++Q   L +  FSYCL        S  L   LG +   K +  TPLL
Sbjct: 232 SGLVGLGRGPLSLVSQ---LKEQRFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLL 288

Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
           KNP + S YY++L AI VG   + I     +       G IIDSGT  T +   AY A++
Sbjct: 289 KNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALK 348

Query: 331 DVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAG 385
             F  +    L  TS  G D C+S+P     +  P +   F G ++ LP +N +I  +  
Sbjct: 349 KEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGDLELPAENYMIGDSNL 408

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
            + CLAM A+     S +++  N+QQQN  + +D+    +
Sbjct: 409 GVACLAMGAS-----SGMSIFGNVQQQNILVNHDLEKETI 443


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 123/394 (31%), Positives = 184/394 (46%), Gaps = 32/394 (8%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           +A+ +AR+  L SLA A  ++       + ++   Y++   IG+P +     +DT +D  
Sbjct: 54  VARSRARVAALQSLATAADAITAARILLRFSEG-EYLMDVGIGSPPRYFSAMIDTGSDLI 112

Query: 122 W---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI 178
           W    PC  CV   +  F  A+ST++ +L C +A C  + +P C   AC +   YG S  
Sbjct: 113 WTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYSPLCFQNACVYQAFYGDSAS 172

Query: 179 AAN-LSQDTISLATD----IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
           +A  L+ +T +  T+     VP  +FGC     G      G++G GRG+LSL++Q   L 
Sbjct: 173 SAGVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQ---LG 229

Query: 234 QSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVNLLAI 286
              FSYCL SF +     L F     L          ++ TP + NP   ++Y++N+  I
Sbjct: 230 SPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGI 289

Query: 287 RVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVT 344
            V   ++ I P     N T G  G IIDSGT  T L  PAY  V+  F   VG      T
Sbjct: 290 SVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANAT 349

Query: 345 SLGGFDTCYSVP------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
               FDTC+  P      +  P + L F G ++ LP +N ++        CLAM  + D 
Sbjct: 350 PSDTFDTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDG 409

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                ++I + Q QN  +LYD+ NS L      C
Sbjct: 410 -----SIIGSFQHQNFHMLYDLENSLLSFVPAPC 438


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 129/455 (28%), Positives = 200/455 (43%), Gaps = 68/455 (14%)

Query: 26  DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL----SSLAVAR-- 79
           D    ++ + + H   PCSP   +       S  E+LA DQ+R + +    S+    R  
Sbjct: 85  DATSSTTRMTIVHRHGPCSPLAAAH--GEPPSHGEILAADQSRAESIQHRVSTTTTGRVN 142

Query: 80  -----------------------KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
                                   +    AS  +   +  Y+V   +GTPA    +  DT
Sbjct: 143 PKRRRHRQQQPPSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYTVVFDT 202

Query: 117 SNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
            +D  WV C  CV  C      +F+ A S+T+ N+ C A  C  +    C GG C + + 
Sbjct: 203 GSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQ 262

Query: 173 YGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
           YG  + +    + DT++L++ D V G+ FGC ++  G      GLLGLGRG  SL  QT 
Sbjct: 263 YGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTY 322

Query: 231 NLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
             Y   F++CLP+    + +G L  G  G P     TP+L      + YYV +  IRVG 
Sbjct: 323 GKYGGVFAHCLPARS--TGTGYLDFG-AGSPPATTTTPMLTG-NGPTFYYVGMTGIRVGG 378

Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV---------FRRRVGSNL 341
           R++ I P          AGTI+DSGTV TRL   AY+++R           +R+    +L
Sbjct: 379 RLLPIAPSVF-----AAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL 433

Query: 342 TVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
                   DTCY       +  PT++L+F G        + ++++ + S  CLA A   D
Sbjct: 434 -------LDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNED 486

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             +  + ++ N Q +   + YD+    +G +   C
Sbjct: 487 GGD--VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 133/457 (29%), Positives = 201/457 (43%), Gaps = 72/457 (15%)

Query: 26  DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL----SSLAVARKS 81
           D    ++ + + H   PCSP   +       S  E+LA DQ+R + +    S+    R  
Sbjct: 82  DATSSTTRMTIVHRHGPCSPLAAAH--GEPPSHGEILAADQSRAESIQHRVSTTTTGR-- 137

Query: 82  VVPIASGRQITQ--------------------SP-------TYIVRAKIGTPAQTLLMAM 114
           V P  S  +  Q                    SP        Y+V   +GTPA    +  
Sbjct: 138 VNPKRSRHRQQQPPSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYTVVF 197

Query: 115 DTSNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
           DT +D  WV C  CV  C      +F+ A S+T+ N+ C A  C  +    C GG C + 
Sbjct: 198 DTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYG 257

Query: 171 LTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
           + YG  + +    + DT++L++ D V G+ FGC ++  G      GLLGLGRG  SL  Q
Sbjct: 258 VQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQ 317

Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
           T   Y   F++CLP     + +G L  G  G P     TP+L      + YYV +  IRV
Sbjct: 318 TYGKYGGVFAHCLPPRS--TGTGYLDFG-AGSPPATTTTPMLTG-NGPTFYYVGMTGIRV 373

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV---------FRRRVGS 339
           G R++ I P          AGTI+DSGTV TRL   AY+++R           +R+    
Sbjct: 374 GGRLLPIAPSVF-----AAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAV 428

Query: 340 NLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
           +L        DTCY       +  PT++L+F G        + ++++ + S  CLA A  
Sbjct: 429 SL-------LDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGN 481

Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            D  +  + ++ N Q +   + YD+    +G +   C
Sbjct: 482 EDGGD--VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 186/389 (47%), Gaps = 26/389 (6%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
           E +   + + + RLQ LS+   + +  V  P+ +G        +++   IGTPA+T    
Sbjct: 59  ERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNG-----EFLMNLAIGTPAETYSAI 113

Query: 114 MDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
           MDT +D  W    PC  C    + +F+  +S++F  L C +  C  +P  +C  G C + 
Sbjct: 114 MDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG-CEYR 172

Query: 171 LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQ 228
            +YG  S+    L+ +T +     V    FGC +   G +     GL+GLGRG LSL++Q
Sbjct: 173 YSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQ 232

Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
              L    FSYCL S        +L +G     K    TPL++NP R S YY++L  I V
Sbjct: 233 ---LGVPKFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISV 289

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
           G  ++ I             G IIDSGT  T L   A+ A++  F  ++  ++  +    
Sbjct: 290 GDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTE 349

Query: 349 FDTCYSV-----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            + C+++     P+  P +   F G+++ LP++N +I  +A  + CL M ++     S +
Sbjct: 350 LELCFTLPPDGSPVEVPQLVFHFEGVDLKLPKENYIIEDSALRVICLTMGSS-----SGM 404

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
           ++  N QQQN  +L+D+    +  A   C
Sbjct: 405 SIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 127/413 (30%), Positives = 189/413 (45%), Gaps = 52/413 (12%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP------------------T 96
           +E     L +D  R++ +++LA      +P   GR +T +P                   
Sbjct: 89  DELFSSRLQRDSRRVKSIATLAAQ----IP---GRNVTHAPRPGGFSSSVVSGLSQGSGE 141

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ---------STTFKNL 147
           Y  R  +GTPA+ + M +DT +D  W+ C  C          +Q         S T+  +
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR------CYSQSDPIFDPRKSKTYATI 195

Query: 148 GCQAAQCKQVPNPTCGG--GACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQK 204
            C +  C+++ +  C      C + ++YG  +    + S +T++   + V G   GC   
Sbjct: 196 PCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHD 255

Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
             G  V   GLLGLG+G LS   QT + +   FSYCL    A S   S+  G     +  
Sbjct: 256 NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA 315

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           ++TPLL NP+  + YYV LL I V G RV  +     + +     G IIDSGT  TRL+ 
Sbjct: 316 RFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIR 375

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLL 379
           PAY A+RD FR    +         FDTC+ +     +  PT+ L F G +V+LP  N L
Sbjct: 376 PAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYL 435

Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           I        C A A         L++I N+QQQ  R++YD+ +SR+G A   C
Sbjct: 436 IPVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 123/451 (27%), Positives = 198/451 (43%), Gaps = 60/451 (13%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------SSLAVARKSV-- 82
           ++ + + H   PCSP    K      S  E+L  DQ R++++      ++  V R+    
Sbjct: 64  ATRMPIVHQHGPCSPLADDKHGKKAPSHTEILVADQRRVEYIHRRVSETTGRVRRQKHSA 123

Query: 83  ----------------------------VPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
                                       +P  SG  +  +  Y+V  ++GTPA    +  
Sbjct: 124 PVVELRPGTPSSTRSSSSSLSSSATSTNLPAKSGLSL-NTGNYVVPIRLGTPAARFTVVF 182

Query: 115 DTSNDAAWVPCTGCVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
           DT +D  WV C  CV  C      +F   +S T+ N+ C ++ C  +    C GG C + 
Sbjct: 183 DTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSSYCSDLDTRGCSGGHCLYA 242

Query: 171 LTYGSSTIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
           + YG  +      +QDT++L  D V  + FGC +K  G      GL+GLGRG  S+  Q 
Sbjct: 243 VQYGDGSYTVGFYAQDTLTLGYDTVKDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQA 302

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRL-GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
            + Y   F+YC+P+    S +G L            + TP+L +    + YYV +  I+V
Sbjct: 303 YDKYSGVFAYCIPATS--SGTGFLDFGPGAPAAANARLTPMLVD-NGPTFYYVGMTGIKV 359

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSL 346
           G  ++ IP         + AG ++DSGTV TRL   AY  +R  F + +      T  + 
Sbjct: 360 GGHLLSIPATVF-----SDAGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPAF 414

Query: 347 GGFDTCYSV-----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
              DTCY +      I  P ++L+F G        + +++    S  CLA AA  D+ + 
Sbjct: 415 SILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAANDDDTD- 473

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            + ++ N QQ+ + +LYD+    +G A   C
Sbjct: 474 -MTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 113/348 (32%), Positives = 168/348 (48%), Gaps = 14/348 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  +G+P +   M +D+ +D  WV   PC+ C   S  VF+ A S++F  + C 
Sbjct: 140 SGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCG 199

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           +  C ++ N  C  G C + ++YG  S     L+ +T+++   ++     GC     G  
Sbjct: 200 SDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLTVGQVMIRDVAIGCGHTNQGMF 259

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           +   GLLGLG GS+S + Q        FSYCL S +    +G+L  G    P    +  L
Sbjct: 260 IGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVS-RGTGSTGALEFGRGALPVGATWISL 318

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           ++NPR  S YY+ L  I VG   V +P    Q       G ++D+GT  TR    AY A 
Sbjct: 319 IRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTRFPTAAYVAF 378

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTA 384
           RD F  +  +      +  FDTCY +     +  PT++  FS G  +TLP  N LI    
Sbjct: 379 RDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPARNFLIPVDG 438

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           G   CLA A +P    S L++I N+QQ+  +I +D  N  +G    +C
Sbjct: 439 GGTFCLAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 133/423 (31%), Positives = 201/423 (47%), Gaps = 39/423 (9%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---------SSLAVAR-K 80
           ++T+ + H   PCSP  P+K +   E   E L +DQ R  ++         +   V R  
Sbjct: 57  AATVPLHHRHGPCSPL-PTKKMPTLE---ETLHRDQLRAAYIQRKFSGGGGAGGDVQRSD 112

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFN 137
           + VP A G  +  +  Y++   +G+PA +  M +DT +D +WV C  C  C S    +F+
Sbjct: 113 ATVPTALGTSL-NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 171

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATDI 193
            + S+T+    C +A C Q+     G      C + +TYG  S+     S DT++L +  
Sbjct: 172 PSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 231

Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
           V  + FGC    +G +    GL+GLG G+ SL++QT       FSYCLP   + S   +L
Sbjct: 232 VKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTL 291

Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
                        TP+L++ +  + Y V L AIRVG R + IP           AGT++D
Sbjct: 292 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMD 345

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGM 369
           SGTV TRL   AY+A+   F+  +         G  DTC+       +  P++ L+FSG 
Sbjct: 346 SGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGG 405

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
            V     + +I S      CLA AA  D  +S L +I N+QQ+   +LYDV    +G   
Sbjct: 406 AVVSLDASGIILS-----NCLAFAANSD--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 458

Query: 430 ELC 432
             C
Sbjct: 459 GAC 461


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  164 bits (416), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 124/372 (33%), Positives = 185/372 (49%), Gaps = 29/372 (7%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG     S  Y  +  +GTP+   LM +DT +D  W+ C  C  C   S  VF+  +
Sbjct: 128 PVVSG-LAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRR 186

Query: 141 STTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPG 196
           S+++  + C A  C+++ +  C     AC + + YG  ++ A + + +T++ A    V  
Sbjct: 187 SSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGARVAR 246

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL--------PSFKALS 248
              GC     G  V   GLLGLGRGSLS   Q    Y  +FSYCL            + S
Sbjct: 247 VALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRS 306

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG 307
            S ++  GP        +TP+++NPR  + YYV L+ I V G RV  +    L+ +P+TG
Sbjct: 307 RSSTVTFGPP-SASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTG 365

Query: 308 -AGTIIDSGTVFTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
             G I+DSGT  TRL  P+Y+A+RD FR    G  L+      FDTCY +     +  PT
Sbjct: 366 RGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPT 425

Query: 362 ITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           +++ F+ G    LP +N LI   +    C A A     V    ++I N+QQQ  R+++D 
Sbjct: 426 VSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFDG 481

Query: 421 PNSRLGVARELC 432
              R+G A + C
Sbjct: 482 DGQRVGFAPKGC 493


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  164 bits (416), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 129/432 (29%), Positives = 199/432 (46%), Gaps = 53/432 (12%)

Query: 28  QDHSSTLQVFHVFSPCSPFKPS-KPLSWEESVLEMLAKDQARLQF---------LSSLAV 77
            + SS+L++ H F PC+P + S  P S   S  E+L +D+ R+           L+S   
Sbjct: 57  NEGSSSLKLVHRFGPCNPHRTSTAPAS---SFNEILRRDKLRVDSIIQARRSMNLTSSVE 113

Query: 78  ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-- 135
             KS VP     +IT S  YIV   IGTP + + +  DT +   W  C  C  C   V  
Sbjct: 114 HMKSSVPFYGLSKITAS-DYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKVPV 172

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY-GSSTIAANLSQDTISLATDIV 194
           F+  +S +FK L C +  C+ +    C    C +   Y  +S+    L+ +TIS +    
Sbjct: 173 FDPTKSASFKGLPCSSKLCQSI-RQGCSSPKCTYLTAYVDNSSSTGTLATETISFSH--- 228

Query: 195 PGYTF-----GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA--- 246
             Y F     GC  + +G S+   G++GL R  +SL +QT N+Y   FSYC+PS      
Sbjct: 229 LKYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGSTG 288

Query: 247 -LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
            L+F G +       P  ++++P+ K    SS Y + +  I VG R + I   A +   T
Sbjct: 289 HLTFGGKV-------PNDVRFSPVSKTA-PSSDYDIKMTGISVGGRKLLIDASAFKIAST 340

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
                 IDSG V TRL   AY+A+R VFR  +     +      DTCY       +  P+
Sbjct: 341 ------IDSGAVLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPS 394

Query: 362 ITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           I++ F  G+ + +    ++       + CLA A   D V    ++  N QQ+ + +++D 
Sbjct: 395 ISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEV----SIFGNFQQKTYTVVFDG 450

Query: 421 PNSRLGVARELC 432
              R+G A   C
Sbjct: 451 AKERIGFAPGGC 462


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  164 bits (416), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 131/414 (31%), Positives = 199/414 (48%), Gaps = 59/414 (14%)

Query: 61  MLAKDQARLQFLSSLAVARKSV-------------VPIASGRQITQSPTYIVRAKIGTPA 107
           M AKD+ R+++  S                     +P+ SG  +  S  Y V+  +G+P 
Sbjct: 55  MFAKDEERIRYFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSMG-SGNYYVKMGLGSPT 113

Query: 108 QTLLMAMDTSNDAAWVPCTGC-VGC---SSTVFNSAQSTTFKNLGCQAAQCKQ-----VP 158
           +   M +DT +  +W+ C  C + C      VFN + S T+K + C ++QC       + 
Sbjct: 114 KYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLN 173

Query: 159 NPTCG--GGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQG 214
            PTC     AC +  +YG S+ +   LSQD ++L  +  +  + +GC Q   G      G
Sbjct: 174 EPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDG 233

Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-------GSLRLG--PIGQPKRIK 265
           ++GL    LS+L+Q    Y + FSYCLP+    SFS       G L +G   +      K
Sbjct: 234 IIGLANNELSMLSQLSGKYGNAFSYCLPT----SFSTPNSPKEGFLSIGTSSLTPSSSYK 289

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
           +TPLLKNP   SLY+++L +I V  R + +   + +        TIIDSGTV TRL  P 
Sbjct: 290 FTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYK------VPTIIDSGTVITRLPTPV 343

Query: 326 YTAVRDVFRRRVGSNL-TVTSLGGFDTCYS-----VPIVAPTITLMFS-GMNVTLPQDNL 378
           YT +++ +   +         +   DTC+      +  VAP I ++F  G ++ L   N 
Sbjct: 344 YTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGGADLQLKGHNS 403

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           L+    G ITCLAMA +     S + +I N QQQ  ++ YDV NSR+G A   C
Sbjct: 404 LVELETG-ITCLAMAGS-----SSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 124/404 (30%), Positives = 196/404 (48%), Gaps = 38/404 (9%)

Query: 58  VLEMLAKDQARLQFLSS---LAVA---RKSVVPIASGRQITQ-------------SPTYI 98
           +L  LA+D AR++ +++   LAV+   +  +VP+ +     Q             S  Y 
Sbjct: 102 MLSRLARDSARVKAINTKLQLAVSGTDKSDLVPMDTEILHPQDFSTPVTSGTSQGSGEYF 161

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCK 155
           +R  IG P++T  M +DT +D  W+ C  C  C   V   F+ A S++F  LGCQ  QC+
Sbjct: 162 LRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQCR 221

Query: 156 QVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQ 213
            +    C   +C + ++YG  S    + + +T+S   +  V     GC     G  V   
Sbjct: 222 NLDVFACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVDKVAIGCGHDNEGLFVGAA 281

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNP 273
           GL+GLG G LSL +Q +    S+FSYCL +  ++  S +L      +P      P+ KN 
Sbjct: 282 GLIGLGGGPLSLTSQIK---ASSFSYCLVNRDSVD-SSTLEFNS-AKPSDSVTAPIFKNS 336

Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
           +  + YYV +  + VG   + IPP   + + +   G I+D GT  TRL   AY A+RD F
Sbjct: 337 KVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQTQAYNALRDTF 396

Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSIT 388
            +      + +    FDTCY++     +  PT+  +F  G ++ LP  N LI   +    
Sbjct: 397 VKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPPSNYLIPVDSAGTF 456

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           CLA A       + L++I N+QQQ  R+ YD+ NS++  +   C
Sbjct: 457 CLAFAP----TTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 185/370 (50%), Gaps = 38/370 (10%)

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCT-GCVGCSSTVFNSAQSTTFKNLG 148
            S  Y+V   IGTPA+   +  DT +D  WV   PCT  C      +F+ ++S+T+ ++ 
Sbjct: 122 HSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVP 181

Query: 149 CQAAQCK--QVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVP--GYTFGCIQ 203
           C   QCK     + TCGG  C +++ YG  ++   NL+Q+  +L+    P  G  FGC  
Sbjct: 182 CGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSH 241

Query: 204 ------KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ-STFSYCLPSFKALSFSGSLRLG 256
                 K     +   GLLGLGRG  S+L+QT+       FSYCLP     S +G L +G
Sbjct: 242 EYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRG--SSAGYLTIG 299

Query: 257 PIGQPK-RIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
               P+  + +TPL+  N + SS+Y VNL+ I V    + I   A         GT+IDS
Sbjct: 300 AAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI------GTVIDS 353

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVP----IVAPTITLMF-S 367
           GTV T + A AY  +RD FRR +G  + L    +   DTCY V     + AP + L F  
Sbjct: 354 GTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVALEFGG 413

Query: 368 GMNVTLPQDNLL----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           G  + +    +L    + ++  S+T   +A  P N+   + +I NMQQ+ + +++DV   
Sbjct: 414 GARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFV-IIGNMQQRAYNVVFDVEGR 472

Query: 424 RLGVARELCT 433
           R+G     C+
Sbjct: 473 RIGFGANGCS 482


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 131/403 (32%), Positives = 196/403 (48%), Gaps = 47/403 (11%)

Query: 62  LAKDQARLQFLSSLAVARKSVV-PIASGRQIT--QSPTYIVRAKIGTPAQTLLMAMDTSN 118
           +A+ +AR+  L S AV+   V  PI + R +    S  Y+V   IGTP       MDT +
Sbjct: 51  IARSKARVAALQSAAVSPAPVADPITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGS 110

Query: 119 DAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG- 174
           D  W  C  C+ C++     F+  +S T++ L C++++C  + +P+C    C +   YG 
Sbjct: 111 DLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSSRCAALSSPSCFKKMCVYQYYYGD 170

Query: 175 SSTIAANLSQDTISL----ATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
           +++ A  L+ +T +     +T +     +FGC     G      G++G GRG LSL++Q 
Sbjct: 171 TASTAGVLANETFTFGAASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQ- 229

Query: 230 QNLYQSTFSYCLPSFKA-----LSFSGSLRLGPI----GQPKRIKYTPLLKNPRRSSLYY 280
             L  S FSYCL S+ +     L F     L       G P  ++ TP + NP   ++Y+
Sbjct: 230 --LGPSRFSYCLTSYLSPTPSRLYFGVFANLNSTNTSSGSP--VQSTPFVINPALPNMYF 285

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           +++  I +G + + I P     N     G IIDSGT  T L   AY AV    RR + S 
Sbjct: 286 LSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV----RRGLAST 341

Query: 341 LTVTSLG----GFDTCYSVP------IVAPTITLMFSGMNVTLPQDN-LLIHSTAGSITC 389
           + + ++     G DTC+  P      +  P     F G N+TLP +N +LI ST G + C
Sbjct: 342 IPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFDGANMTLPPENYMLIASTTGYL-C 400

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LAMA       SV  +I N QQQN  +LYD+ NS L      C
Sbjct: 401 LAMAP-----TSVGTIIGNYQQQNLHLLYDIANSFLSFVPAPC 438


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 130/409 (31%), Positives = 186/409 (45%), Gaps = 42/409 (10%)

Query: 60  EMLAKDQARLQFLSSLAVARKSV----VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMD 115
           + LA D  RL FLS   + RK +     P+ SG   + S  Y V  +IG P Q+LL+  D
Sbjct: 47  QALALDTRRLHFLS---LRRKPIPFVKSPVVSG-AASGSGQYFVDLRIGQPPQSLLLIAD 102

Query: 116 TSNDAAWVPCTGCVGCS----STVFNSAQSTTFKNLGCQAAQCKQVPNP--------TCG 163
           T +D  WV C+ C  CS    +TVF    S+TF    C    C+ VP P        T  
Sbjct: 103 TGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRI 162

Query: 164 GGACAFNLTYGSSTIAANL-SQDTISLATDI-----VPGYTFGCIQKATGNSVP------ 211
              C +   Y   ++ + L +++T SL T       +    FGC  + +G SV       
Sbjct: 163 HSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNG 222

Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIGQP-KRIKYTPL 269
             G++GLGRG +S  +Q    + + FSYCL  +  +   +  L +G  G    ++ +TPL
Sbjct: 223 ANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPL 282

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           L NP   + YYV L ++ V    + I P   + + +   GT++DSGT    L  PAY +V
Sbjct: 283 LTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSV 342

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLLIHST 383
               RRRV   +      GFD C +V  V       P +   FSG  V +P        T
Sbjct: 343 IAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIET 402

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              I CLA+ +    V    +VI N+ QQ     +D   SRLG +R  C
Sbjct: 403 EEQIQCLAIQSVDPKVG--FSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  164 bits (415), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 129/416 (31%), Positives = 200/416 (48%), Gaps = 38/416 (9%)

Query: 49  SKPLSWEESVLEMLAKDQARLQFLSS-LAVARKSVV---------------------PIA 86
           +  LS+ E + + L +D AR+  ++S L +A   +                      P+ 
Sbjct: 76  NNELSYAERMQQRLKRDAARVAAINSRLELAVNGIKRSSLKPDSSSSFTMAESDFQSPVV 135

Query: 87  SGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTT 143
           SG     S  Y  R  +G P +  LM +DT +D  W+   PC+ C   S  ++N A S++
Sbjct: 136 SGMD-QGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSS 194

Query: 144 FKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGC 201
           +K +GCQA  C+Q+    C   G+C + ++YG  S    N + +T++L    +     GC
Sbjct: 195 YKLVGCQANLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGAPLQNVAIGC 254

Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
                G  V   GLLGLG GSLS  +Q  +     FSYCL    + S S +L+ G    P
Sbjct: 255 GHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDRDSES-SSTLQFGRAAVP 313

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
                 P+LKN R  + YYV+L  I VG +++ I       + +   G I+DSGT  TRL
Sbjct: 314 NGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTRL 373

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQD 376
              AY ++RD FR    +  +   +  FDTCY +     +  PT+   FS G +++LP  
Sbjct: 374 QTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPAK 433

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           N L+   +    C A A      +S L+++ N+QQQ  R+ +D  N+++G A   C
Sbjct: 434 NYLVPVDSMGTFCFAFAP----TSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  164 bits (415), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 137/417 (32%), Positives = 207/417 (49%), Gaps = 44/417 (10%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVARKS---VVPIASGRQ-----ITQSPT---YIVRAKIG 104
           E +   L +D+ R  ++ S A A  +   VV +++GR      ++++PT   Y+ +  +G
Sbjct: 82  ELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVAPVVSRAPTSGEYMAKIAVG 141

Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
           TPA   L+A+DT++D  W+ C  C  C   S  VF+   ST++  +   A  C+ +    
Sbjct: 142 TPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRSG 201

Query: 162 CGG---GACAFNLTYG-----SSTIAANLSQDTISLATDIVPGY-TFGCIQKATG-NSVP 211
            G    G C + + YG     +ST   +L ++T++ A  +   Y + GC     G    P
Sbjct: 202 GGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAP 261

Query: 212 PQGLLGLGRGSLSLLAQTQNL-YQSTFSYCLPSFKALSFSGSLRL----GPIGQPKRIKY 266
             G+LGLGRG +S+  Q   L Y ++FSYCL  F +   S S  L    G +       +
Sbjct: 262 AAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASF 321

Query: 267 TPLLKNPRRSSLYYVNLLAIRVGR-RVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAP 324
           TP + N    + YYV L+ + VG  RV  +    LQ +P TG  G I+DSGT  TRL  P
Sbjct: 322 TPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARP 381

Query: 325 AYTAVRDVFRRRVGSNLTVTSLGG----FDTCYSV----PIVAPTITLMFS-GMNVTLPQ 375
           AY A RD FR    ++L   S GG    FDTCY+V     +  P +++ F+ G+ V+L  
Sbjct: 382 AYVAFRDAFRAAA-TSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQP 440

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            N LI   +    C A A   D     ++VI N+ QQ  R++YD+   R+G A   C
Sbjct: 441 KNYLIPVDSRGTVCFAFAGTGDR---SVSVIGNILQQGFRVVYDLAGQRVGFAPNNC 494


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 132/406 (32%), Positives = 198/406 (48%), Gaps = 54/406 (13%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQIT--QSPTYIVRAKIGTPAQTLLMAMDTSND 119
           +A+ +AR+  L S AV    V PI + R +    S  Y+V   IGTP       MDT +D
Sbjct: 52  IARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSD 111

Query: 120 AAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-S 175
             W  C  C+ C+   +  F+  +S T++ L C++++C  + +P+C    C +   YG +
Sbjct: 112 LIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQYYYGDT 171

Query: 176 STIAANLSQDTISL---------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
           ++ A  L+ +T +          AT+I     FGC     G+     G++G GRG LSL+
Sbjct: 172 ASTAGVLANETFTFGAANSTKVRATNIA----FGCGSLNAGDLANSSGMVGFGRGPLSLV 227

Query: 227 AQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPI----GQPKRIKYTPLLKNPRRSS 277
           +Q   L  S FSYCL S+ +     L F     L       G P  ++ TP + NP   +
Sbjct: 228 SQ---LGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSP--VQSTPFVINPALPN 282

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
           +Y+++L AI +G +++ I P     N     G IIDSGT  T L   AY AV    RR +
Sbjct: 283 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV----RRGL 338

Query: 338 GSNLTVTSLG----GFDTCYSVP------IVAPTITLMFSGMNVT-LPQDNLLIHSTAGS 386
            S + + ++     G DTC+  P      +  P +   F   N+T LP++ +LI ST G 
Sbjct: 339 VSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGY 398

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           + CL M  AP  V +   +I N QQQN  +LYD+ NS L      C
Sbjct: 399 L-CLVM--APTGVGT---IIGNYQQQNLHLLYDIGNSFLSFVPAPC 438


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 121/399 (30%), Positives = 192/399 (48%), Gaps = 37/399 (9%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-------VPIASGRQITQSPTYIVRAK 102
           K L+  E V   + + ++RLQ L+++ +A  ++        PI +G     +  Y++   
Sbjct: 59  KNLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAG-----NGEYLMELA 113

Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
           IGTP  +    +DT +D  W    PCT C    + +F+  +S++F  + C ++ C  VP+
Sbjct: 114 IGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAVPS 173

Query: 160 PTCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI----VPGYTFGCIQKATGNSVP-PQ 213
            TC  G C +  +YG  ++    L+ +T +         V    FGC +   G+      
Sbjct: 174 STCSDG-CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQAS 232

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL--RLGPIGQPKRIKYTPLLK 271
           GL+GLGRG LSL++Q   L +  FSYCL        S  L   LG +   K +  TPLLK
Sbjct: 233 GLVGLGRGPLSLVSQ---LKEPRFSYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLK 289

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           NP + S YY++L  I VG   + I     +       G IIDSGT  T +   A+ A++ 
Sbjct: 290 NPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKK 349

Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
            F  +    L  TS  G D C+S+P     +  P I   F G ++ LP +N +I  +   
Sbjct: 350 EFISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGDLELPAENYMIGDSNLG 409

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           + CLAM A+     S +++  N+QQQN  + +D+    +
Sbjct: 410 VACLAMGAS-----SGMSIFGNVQQQNILVNHDLEKETI 443


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 131/420 (31%), Positives = 196/420 (46%), Gaps = 45/420 (10%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV------VPI 85
           +T+ + H + PCSP     P +   ++LE+L  DQ R +++         +      VP 
Sbjct: 63  TTVPLNHRYGPCSP----APSAKVPTILELLEHDQLRAKYIQRKLSGTDGLQPLDLTVPT 118

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFK 145
             G  +  +  Y++   IG+PA T  M +DT +D +WV C    G   T+F+ ++STT+ 
Sbjct: 119 TLGSAL-DTMEYVITVGIGSPAVTQTMMIDTGSDVSWVRCNSTDGL--TLFDPSKSTTYA 175

Query: 146 NLGCQAAQCKQVPN--PTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPGYTFGC 201
              C +A C Q+ N    C    C + + YG  S      S DT++L A+D V  + FGC
Sbjct: 176 PFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASDTVTDFHFGC 235

Query: 202 IQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIG 259
                  +     GL+GLG  + SL++QT   Y  +FSYCLP       SG L  G P G
Sbjct: 236 SHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPTNRT--SGFLTFGAPNG 293

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
                  TP+L+ P+  +LY V L  I VG   + I P  L        G+++DSGTV T
Sbjct: 294 TSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLS------NGSVMDSGTVIT 347

Query: 320 RLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNVT- 372
            L   AY+A+   FR  +    +     LG  DTCY     V +  P ++L+  G  V  
Sbjct: 348 WLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSIPAVSLVLDGGAVVD 407

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           L  + ++I        CLA AA   +     ++I N+QQ+   +L+DV     G     C
Sbjct: 408 LDGNGIMIQD------CLAFAATSGD-----SIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 132/441 (29%), Positives = 201/441 (45%), Gaps = 51/441 (11%)

Query: 27  TQDHSSTLQVFHVFSPCSPFKPS----KPLSWEESVLEMLAKDQARLQFLSSLAVA---- 78
           + D +S++ + H + PCSP  P+    +P   E    + L  D  R +F  S   A    
Sbjct: 55  SSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGED 114

Query: 79  ---RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC------TGCV 129
               K  VP   G  +  +  Y++   +G+PA T  + +DT +D +WV C      + C 
Sbjct: 115 GQSSKVSVPTTLGSSL-DTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCH 173

Query: 130 GCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYGS-STIAANLS 183
             +  +F+ A S+T+    C AA C Q+ +     G      C + + YG  S      S
Sbjct: 174 AHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYS 233

Query: 184 QDTISLA-TDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYC 240
            D ++L+ +D+V G+ FGC     G  +  +  GL+GLG  + SL++QT   Y  +FSYC
Sbjct: 234 SDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYC 293

Query: 241 LPSFKALSFSGSLRLGPIGQPK-----RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           LP+  A S  G L LG           R   TP+L++ +  + Y+  L  I VG + + +
Sbjct: 294 LPATPASS--GFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGL 351

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
            P          AG+++DSGTV TRL   AY A+   FR  +        LG  DTC++ 
Sbjct: 352 SPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNF 405

Query: 356 ----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
                +  PT+ L+F+G  V     +L  H    S  CLA   AP   +     I N+QQ
Sbjct: 406 TGLDKVSIPTVALVFAGGAVV----DLDAHGIV-SGGCLAF--APTRDDKAFGTIGNVQQ 458

Query: 412 QNHRILYDVPNSRLGVARELC 432
           +   +LYDV     G     C
Sbjct: 459 RTFEVLYDVGGGVFGFRAGAC 479


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 118/389 (30%), Positives = 183/389 (47%), Gaps = 32/389 (8%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQ 108
           K L+  E +   + + + R++ ++++  +   +  P+ +G        Y++   IGTP  
Sbjct: 53  KNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDG-----EYLMNVAIGTPDS 107

Query: 109 TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
           +    MDT +D  W    PCT C    + +FN   S++F  L C++  C+ +P+ TC   
Sbjct: 108 SFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN 167

Query: 166 ACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQK----ATGNSVPPQGLLGLGR 220
            C +   YG  ST    ++ +T +  T  VP   FGC +       GN     GL+G+G 
Sbjct: 168 ECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGA---GLIGMGW 224

Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI--GQPKRIKYTPLLKNPRRSSL 278
           G LSL +Q   L    FSYC+ S+ + S S +L LG    G P+    T L+ +    + 
Sbjct: 225 GPLSLPSQ---LGVGQFSYCMTSYGSSSPS-TLALGSAASGVPEGSPSTTLIHSSLNPTY 280

Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG 338
           YY+ L  I VG   + IP    Q       G IIDSGT  T L   AY AV   F  ++ 
Sbjct: 281 YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN 340

Query: 339 SNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
                 S  G  TC+  P     +  P I++ F G  + L + N+LI    G I CLAM 
Sbjct: 341 LPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAEGVI-CLAMG 399

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPN 422
           ++       +++  N+QQQ  ++LYD+ N
Sbjct: 400 SSS---QLGISIFGNIQQQETQVLYDLQN 425


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 131/439 (29%), Positives = 212/439 (48%), Gaps = 56/439 (12%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------SLAVARKSV- 82
            S+TL++ H    CS     K + W + +   L  D  R+Q L       + +   +SV 
Sbjct: 67  ESTTLEMKHR-ELCS----GKTIDWGKKMRRALLLDNIRVQSLQLRIKAMTSSTTEQSVS 121

Query: 83  ---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VF 136
              +P+ SG ++ ++  YIV  ++G    +L++  DT +D  WV C  C  C +    ++
Sbjct: 122 ETQIPLTSGIKL-ETLNYIVTVELGGKNMSLIV--DTGSDLTWVQCQPCRSCYNQQGPLY 178

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-----CGG------GACAFNLTYGS-STIAANLSQ 184
           + + S+++K + C ++ C+ +   T     CGG        C + ++YG  S    +L+ 
Sbjct: 179 DPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLAS 238

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           ++I L    +    FGC +   G      GL+GLGR S+SL++QT   +   FSYCLPS 
Sbjct: 239 ESIVLGDTKLENLVFGCGRNNKGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL 298

Query: 245 KALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
           +  + SG+L  G           + YTPL++NP+  S Y +NL    +G     +    L
Sbjct: 299 EDGA-SGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIG----GVELKTL 353

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----P 356
            F    G G +IDSGTV TRL    Y AV+  F ++     +       DTC+++     
Sbjct: 354 SF----GRGILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYED 409

Query: 357 IVAPTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           I  PTI ++F G N  L  D   +        S+ CLA+A+   +  + + +I N QQ+N
Sbjct: 410 ISIPTIKMIFEG-NAELEVDVTGVFYFVKPDASLVCLALASL--SYENEVGIIGNYQQKN 466

Query: 414 HRILYDVPNSRLGVARELC 432
            R++YD    RLG+A E C
Sbjct: 467 QRVIYDTTQERLGIAGENC 485


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 137/432 (31%), Positives = 199/432 (46%), Gaps = 38/432 (8%)

Query: 21  LNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARL-QFLSSLAVAR 79
           +N  C+       +Q+ HV          + LS  E +  M  + +AR  + LSS A A 
Sbjct: 24  INSCCNAAAAPVRMQLTHV-------DAGRGLSGRELMRRMALRSKARAPRLLSSSATAP 76

Query: 80  KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
            S      G  +T+   Y++   IGTP Q + + +DT +D  W  C  C  C   S   +
Sbjct: 77  VSPGAYDDGVPMTE---YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYY 133

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-CGG---GACAFNLTYGS-STIAANLSQDTIS-LA 190
           ++++S+TF    C + QCK  P+ T C       CAF+ +YG  S     L  +T+S +A
Sbjct: 134 DASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVA 193

Query: 191 TDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
              VPG  FGC    TG     + G+ G GRG LSL +Q   L    FS+C  +      
Sbjct: 194 GASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVSGRKP 250

Query: 250 SGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           S  L   P    K     ++ TPL+KNP   + YY++L  I VG   + +P  A      
Sbjct: 251 STVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNG 310

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-----P 360
           TG GTIIDSGT FT L    Y  V D F   V   +  ++  G   C+S P +      P
Sbjct: 311 TG-GTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVP 369

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            + L F G  + LP++N +  +  G    + +A     +   + +I N QQQN  +LYD+
Sbjct: 370 KLVLHFEGATMHLPRENYVFEAKDGGNCSICLAI----IEGEMTIIGNFQQQNMHVLYDL 425

Query: 421 PNSRLGVARELC 432
            NS+L   R  C
Sbjct: 426 KNSKLSFVRAKC 437


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 132/423 (31%), Positives = 200/423 (47%), Gaps = 39/423 (9%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---------SSLAVAR-K 80
           ++T+ + H   PCSP  P+K +   E   E L +DQ R  ++         +   V R  
Sbjct: 127 AATVPLHHRHGPCSPL-PTKKMPTLE---ETLHRDQLRAAYIQRKFSGGGGAGGDVQRSD 182

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFN 137
           + VP A G  +  +  Y++   +G+PA +  M +DT +D +WV C  C  C S    +F+
Sbjct: 183 ATVPTALGTSL-NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 241

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATDI 193
            + S+T+    C +A C Q+     G      C + +TYG  S+     S DT++L +  
Sbjct: 242 PSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 301

Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
           V  + FGC    +G +    GL+GLG G+ SL++QT       FSYCLP   + S   +L
Sbjct: 302 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTL 361

Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
                        TP+L++ +  + Y V L AIRVG R + IP           AGT++D
Sbjct: 362 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMD 415

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGM 369
           SGTV TRL   AY+A+   F+  +         G  DTC+       +  P++ L+FSG 
Sbjct: 416 SGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGG 475

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
            V     + +I S      CLA A   D  +S L +I N+QQ+   +LYDV    +G   
Sbjct: 476 AVVSLDASGIILS-----NCLAFAGNSD--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 528

Query: 430 ELC 432
             C
Sbjct: 529 GAC 531


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 132/423 (31%), Positives = 200/423 (47%), Gaps = 39/423 (9%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---------SSLAVAR-K 80
           ++T+ + H   PCSP  P+K +   E   E L +DQ R  ++         +   V R  
Sbjct: 57  AATVPLHHRHGPCSPL-PTKKMPTLE---ETLHRDQLRAAYIQRKFSGGGGAGGDVQRSD 112

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFN 137
           + VP A G  +  +  Y++   +G+PA +  M +DT +D +WV C  C  C S    +F+
Sbjct: 113 ATVPTALGTSL-NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 171

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATDI 193
            + S+T+    C +A C Q+     G      C + +TYG  S+     S DT++L +  
Sbjct: 172 PSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 231

Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
           V  + FGC    +G +    GL+GLG G+ SL++QT       FSYCLP   + S   +L
Sbjct: 232 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTL 291

Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
                        TP+L++ +  + Y V L AIRVG R + IP           AGT++D
Sbjct: 292 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMD 345

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGM 369
           SGTV TRL   AY+A+   F+  +         G  DTC+       +  P++ L+FSG 
Sbjct: 346 SGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGG 405

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
            V     + +I S      CLA A   D  +S L +I N+QQ+   +LYDV    +G   
Sbjct: 406 AVVSLDASGIILS-----NCLAFAGNSD--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 458

Query: 430 ELC 432
             C
Sbjct: 459 GAC 461


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 138/426 (32%), Positives = 208/426 (48%), Gaps = 49/426 (11%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-LAVARK-----SVVPIASGR 89
           + H  SPCSP     PLS +      +  D AR+  L+S LA   K     S VP+ASG 
Sbjct: 46  LHHPQSPCSP----APLSSDLPFSAFITHDAARIAGLASRLATKDKDWVAASSVPLASGA 101

Query: 90  QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC---SSTVFNSAQSTTFK 145
            +     YI R  +GTP  T +M +D+ +   W+ C  C V C   +  +++   S+T+ 
Sbjct: 102 SVGVG-NYITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYA 160

Query: 146 NLGCQAAQCKQV------PNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD-IVPGY 197
            + C A QC ++      P+   G G C +  +YG  + +   LS+DT+SL++    PG+
Sbjct: 161 AVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGSFPGF 220

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
            +GC Q   G      GL+GL R  LSLL+Q      ++F+YCLP+  A S +G L  G 
Sbjct: 221 YYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAAS-AGYLSFGS 279

Query: 258 IGQ---PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
                 P +  YT ++ +   +SLY+V+L  + V    + +P       P     TIIDS
Sbjct: 280 NSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEYGSLP-----TIIDS 334

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD---TCYSVPIV---APTITLMFS- 367
           GTV TRL  P YTA+     + VG+ L   S   +    TC+   +     P + + F+ 
Sbjct: 335 GTVITRLPTPVYTAL----SKAVGAALAAPSAPAYSILQTCFKGQVAKLPVPAVNMAFAG 390

Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           G  + L   N+L+     + TCLA   AP +  +   +I N QQQ   ++YDV  SR+G 
Sbjct: 391 GATLRLTPGNVLVDVNE-TTTCLAF--APTDSTA---IIGNTQQQTFSVVYDVKGSRIGF 444

Query: 428 ARELCT 433
           A   C+
Sbjct: 445 AAGGCS 450


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 117/390 (30%), Positives = 196/390 (50%), Gaps = 25/390 (6%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           K L+  + +   + +   RL+ L+++ +A  S   I S   ++ +  +++   IGTP +T
Sbjct: 54  KNLTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINS-PVLSGNGEFLMNLAIGTPPET 112

Query: 110 LLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA 166
               MDT +D  W    PCT C    S +F+  +S++F  L C +  CK +P  +C   +
Sbjct: 113 YSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCSD-S 171

Query: 167 CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLS 224
           C +  TYG  S+    ++ +T +     +P   FGC +   G+      GL+GLGRG LS
Sbjct: 172 CEYLYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLS 231

Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI----GQPKRIKYTPLLKNPRRSSLYY 280
           L++Q   L ++ FSYCL S      S +L +G +    G    I+ TPL++NP + S YY
Sbjct: 232 LVSQ---LKEAKFSYCLTSIDDTKTS-TLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYY 287

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           ++L  I VG   + I     Q       G IIDSGT  T L   A+  V+  F  ++G  
Sbjct: 288 LSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLP 347

Query: 341 LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
           +  +   G + CY++P     +  P + L F+G ++ LP +N +I  ++  + CLAM ++
Sbjct: 348 VDNSGATGLELCYNLPSDTSELEVPKLVLHFTGADLELPGENYMIADSSMGVICLAMGSS 407

Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
                  +++  N+QQQN  + +D+    L
Sbjct: 408 -----GGMSIFGNVQQQNMFVSHDLEKETL 432


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 135/431 (31%), Positives = 204/431 (47%), Gaps = 44/431 (10%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA-----RKSVVPIA 86
           S+L V H   PCSP +     S   S  E+L +DQ R+  +     A     +  V  +A
Sbjct: 71  SSLTVVHRHGPCSPLRSRG--SGAPSHTEILRRDQDRVDAIRRKVTASSNKPKGGVSLLA 128

Query: 87  SGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTT 143
           +  +   +  Y+   ++GTPA  L++ +DT +D +WV C  C  C      VF+   S+T
Sbjct: 129 NWGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASST 188

Query: 144 FKNLGCQAAQCKQVPNPTCGGGA-------CAFNLTYGS-STIAANLSQDTISLA----- 190
           +  + C A +C+++ + +            C + ++Y   S    +L++DT++L+     
Sbjct: 189 YSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSP 248

Query: 191 --TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
              D VPG+ FGC     G      GLLGLG G  SL +Q    Y + FSYCLPS  + S
Sbjct: 249 SPADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPS--SPS 306

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
            +G L  G        ++T ++     +S YY+NL  I V  R + +P  A      T A
Sbjct: 307 AAGYLSFGGAAARANAQFTEMVTGQDPTS-YYLNLTGIVVAGRAIKVPASAF----ATAA 361

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSV----PIVAPTI 362
           GTIIDSGT F+RL   AY A+R  FR  +G      +     FDTCY       +  P +
Sbjct: 362 GTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAV 421

Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
            L+F+ G  V L    +L      + TCLA        N  L ++ N QQ+   ++YDV 
Sbjct: 422 ELVFADGATVHLHPSGVLYTWNDVAQTCLAFVP-----NHDLGILGNTQQRTLAVIYDVG 476

Query: 422 NSRLGVARELC 432
           + R+G  R+ C
Sbjct: 477 SQRIGFGRKGC 487


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 117/368 (31%), Positives = 183/368 (49%), Gaps = 28/368 (7%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTG-CVGCSSTVFNS 138
           +P ++G  +  +  ++V    G+PAQ   +++DT +D +W+   PC+G C      VF+ 
Sbjct: 148 IPDSTGTSL-DTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDP 206

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPG 196
            +S T+  + C   QC          G C + +TYG  S+ A  LS +T+SL+ T  +PG
Sbjct: 207 TKSATYSAVPCGHPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRDLPG 266

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
           + FGC Q   G      GL+GLGRG+LSL +Q    + +TFSYCLPS+      G L +G
Sbjct: 267 FAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDTT--HGYLTMG 324

Query: 257 PI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
                       ++YT +++     SLY+V +++I +G  ++ +PP        T  GT+
Sbjct: 325 STTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVF-----TRDGTL 379

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS 367
            DSGT+ T L   AY ++RD F+  +       +   FDTCY       I  P +   FS
Sbjct: 380 FDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFS 439

Query: 368 -GMNVTLPQDNLLIH--STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
            G    L    +LI+   TA +  CLA    P  +    N+I N QQ+   ++YDV   +
Sbjct: 440 DGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMP--FNIIGNTQQRGTEVIYDVAAEK 497

Query: 425 LGVARELC 432
           +G  +  C
Sbjct: 498 IGFGQFTC 505


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 125/407 (30%), Positives = 185/407 (45%), Gaps = 37/407 (9%)

Query: 60  EMLAKDQARLQFL-SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
           + L+ D  RL F  S+L   +    P+ SG   T S  Y V  ++GTP Q LL+  DT +
Sbjct: 52  QALSFDSHRLSFFFSALHTPQSLKSPVVSGAS-TGSGQYFVDLRLGTPPQKLLLVADTGS 110

Query: 119 DAAWVPCTGCVGCS----STVFNSAQSTTFKNLGCQAAQCKQVPNPT---CGGG----AC 167
           D  WV C+ C  C+     + F +  STTF    C  + C+ VP P    C        C
Sbjct: 111 DLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPC 170

Query: 168 AFNLTYGS-STIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVP------PQGL 215
            +  +YG  S  +   S++T +L T       + G  FGC  + +G SV         G+
Sbjct: 171 RYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGV 230

Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           +GLGRG +SL +Q  + + + FSYCL      PS  +    GS +       +R+++TPL
Sbjct: 231 MGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPL 290

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
             NP   + YY+ + ++ V    + I P     +     GTI+DSGT  T L  PAY  +
Sbjct: 291 HINPLSPTFYYIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQI 350

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIHSTAG 385
             V +RRV          GFD C +V  +     P ++    G +V  P        T  
Sbjct: 351 LTVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDE 410

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            + CLA+ A      S  +VI N+ QQ   + +D   +RLG +R  C
Sbjct: 411 DVKCLALQAV--MTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 135/434 (31%), Positives = 194/434 (44%), Gaps = 48/434 (11%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSKPLSWE----------ESVLEMLAKDQARLQFLSSLAVA 78
           ++ + L+V H   PCS  +       +          +S+   L+KD      LS +   
Sbjct: 80  ENKAFLKVVHKHGPCSDLRQGHKAEAQYILLQDQSRVDSIHSKLSKDSG----LSDVKAT 135

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SST 134
             + +P   G  I  S  Y V   +GTP +   +  DT +D  W  C  CV         
Sbjct: 136 AATTLPAKDG-SIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEA 194

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPT-----CGGGACAFNLTYGSSTIAANL-SQDTIS 188
           +FN +QST++ N+ C +  C  + + T     C    C + + YG S+ +     ++ +S
Sbjct: 195 IFNPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLS 254

Query: 189 L-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA- 246
           L ATD+   + FGC Q   G      GLLGLGR  LSL++QT   Y   FSYCLPS  + 
Sbjct: 255 LTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLPSSSSS 314

Query: 247 ---LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
              L+F GS         K   +TPL      SS Y ++L  I VG R + I P      
Sbjct: 315 TGFLTFGGS-------TSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVF--- 364

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVA 359
             + AGTIIDSGTV TRL   AY+A+   FR+ +       +L   DTC+       I  
Sbjct: 365 --STAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISV 422

Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           P I L FSG  V       + +    +  CLA A   D   S + +  N+QQ+   ++YD
Sbjct: 423 PKIGLFFSGGVVVDIDKTGIFYVNDLTQVCLAFAGNSD--ASDVAIFGNVQQKTLEVVYD 480

Query: 420 VPNSRLGVARELCT 433
               R+G A   C+
Sbjct: 481 GAAGRVGFAPAGCS 494


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 127/427 (29%), Positives = 205/427 (48%), Gaps = 38/427 (8%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQ-FLSSLAVARKSVVPIAS 87
           + +S+L+V + + PC P   +       S  E L +DQ R++ F   L++   S V    
Sbjct: 66  NRASSLKVVNKYGPCIPVTGAPKTINVPSTAEFLLQDQLRVKSFQVRLSMNPSSGVFKEM 125

Query: 88  GRQITQS--PT---YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C---SSTVFNS 138
              I  S  PT   Y+V   +GTP +   ++ DT +D  W  C  C+G C   +   F+ 
Sbjct: 126 QTTIPASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDP 185

Query: 139 AQSTTFKNLGCQAAQCKQV-----PNPTCGGGACAFNLTYGSSTIAANLSQDTISLAT-D 192
             ST++KN+ C +  CK +     P   C    C + + YGS      L+ +T+++A+ D
Sbjct: 186 TTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSGYTIGFLATETLAIASSD 245

Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
           +   + FGC +++ G      GLLGLGR  ++L +QT N Y++ FSYCLP+  + S +G 
Sbjct: 246 VFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPA--SPSSTGH 303

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           L  G +   +  K TP+  +P+   LY +N + I V  R + I  G++       + TII
Sbjct: 304 LSFG-VEVSQAAKSTPI--SPKLKQLYGLNTVGISVRGRELPI-NGSI-------SRTII 352

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP------IVAPTITLMF 366
           DSGT FT L +P Y+A+   FR  + +         F  CY         +  P I++ F
Sbjct: 353 DSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFF 412

Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
             G+ V +    ++I        CLA A      +S   +  N QQ+ + ++YDV    +
Sbjct: 413 EGGVEVEIDVSGIMIPVNGLKEVCLAFADT--GSDSDFAIFGNYQQKTYEVIYDVAKGMV 470

Query: 426 GVARELC 432
           G A + C
Sbjct: 471 GFAPKGC 477


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 131/434 (30%), Positives = 199/434 (45%), Gaps = 54/434 (12%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV----------- 82
           L++ H  SPCSP     P+  +     +L  D AR+  L++      S            
Sbjct: 45  LELHHPRSPCSP----APVPADLPFTAVLTHDDARISSLAARLAKTPSARATSLDADADA 100

Query: 83  --------VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC-- 131
                   VP++ G  +     Y+ R  +GTPA   +M +DT +   W+ C+ C V C  
Sbjct: 101 GLAGSLASVPLSPGASVGVG-NYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHR 159

Query: 132 -SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSSTIAAN-LS 183
            S  VFN   S+T+ ++GC A QC  +P+ T    AC+      +  +YG S+ +   LS
Sbjct: 160 QSGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLS 219

Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
           +DT+S  +  +P + +GC Q   G      GL+GL R  LSLL Q       +F+YCLPS
Sbjct: 220 KDTVSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPS 279

Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
             +  +           P +  YTP++ +    SLY++ L  + V    + +   A    
Sbjct: 280 SSSSGYLSLGSY----NPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSL 335

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SVPIVAP 360
           P     TIIDSGTV TRL    Y+A+       +      ++    DTC+   +  + AP
Sbjct: 336 P-----TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAP 390

Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
            +T+ F+ G  + L   NLL+     S TCLA A A         +I N QQQ   ++YD
Sbjct: 391 AVTMSFAGGAALKLSAQNLLVD-VDDSTTCLAFAPARSAA-----IIGNTQQQTFSVVYD 444

Query: 420 VPNSRLGVARELCT 433
           V +SR+G A   C+
Sbjct: 445 VKSSRIGFAAGGCS 458


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 133/422 (31%), Positives = 196/422 (46%), Gaps = 49/422 (11%)

Query: 50  KPLSWEESVLEMLAKDQAR-----LQFLSSLAVARK-----SVVPIASGRQITQSPTYIV 99
            P + +  +  +LA D++R     L+  +  A A       + VP+ SG +  Q+  Y+ 
Sbjct: 129 DPAAHDRYLRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRF-QTLNYVT 187

Query: 100 RAKIG-----TPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
              +G     +PA  L + +DT +D  WV   PC+ C      +F+ A S T+  + C A
Sbjct: 188 TIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNA 247

Query: 152 AQCKQVPNP------TCGGG--ACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCI 202
           + C            +CGGG   C + L YG  + +   L+ DT++L    + G+ FGC 
Sbjct: 248 SACAASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCG 307

Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK 262
               G      GL+GLGR  LSL++QT   Y   FSYCLP+  +   SGSL LG      
Sbjct: 308 LSNRGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSY 367

Query: 263 R----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
           R    + YT ++ +P +   Y++N+    VG         AL       +  +IDSGTV 
Sbjct: 368 RNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGT-------ALAAQGLGASNVLIDSGTVI 420

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSV----PIVAPTITLMFS-GMNV 371
           TRL    Y  VR  F R+  +    T+ G    DTCY +     +  P +TL    G  V
Sbjct: 421 TRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEV 480

Query: 372 TLPQDNLL-IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           T+    +L +    GS  CLAMA+   +      +I N QQ+N R++YD   SRLG A E
Sbjct: 481 TVDAAGMLFVVRKDGSQVCLAMASL--SYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADE 538

Query: 431 LC 432
            C
Sbjct: 539 DC 540


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 131/415 (31%), Positives = 186/415 (44%), Gaps = 49/415 (11%)

Query: 52  LSWEESVLEMLAKDQAR-LQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTL 110
           +S  E V + L +D  R  +F   LA +    V   + + +     YI+   IGTP  + 
Sbjct: 42  VSATEFVRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEYIMTLAIGTPPLSY 101

Query: 111 LMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQ------AAQCKQVPNP 160
               DT +D  W  C    + C   +   +N + STTF  L C       AA     P P
Sbjct: 102 PAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPPP 161

Query: 161 TCGGGACAFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGL 215
            C   +C +N TYG+   A   S +T +  +       VPG  FGC   ++ +     GL
Sbjct: 162 GC---SCMYNQTYGTGWTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSDDWNGSAGL 218

Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLLKNP 273
           +GLGRGS+SL++Q   L    FSYCL  F+  + + +L LGP        +  TP + +P
Sbjct: 219 VGLGRGSMSLVSQ---LGAGMFSYCLTPFQDANSTSTLLLGPSAALNGTGVLTTPFVASP 275

Query: 274 RR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
            +   S+ YY+NL  I +G   + IPP A         G IIDSGT  T LV  AY  VR
Sbjct: 276 SKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQVR 335

Query: 331 DVFRRRV------GSNLTVTSLGGFDTCY------SVPIVAPTITLMFSGMNVTLPQDNL 378
                 V      GS+ T     G D C+      S P   P++T  F G ++ LP DN 
Sbjct: 336 AAIESLVTLPVADGSDST-----GLDLCFALTSETSTPPSMPSMTFHFDGADMVLPVDNY 390

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           +I  +   + CLAM    +     ++   N QQQN  +LYD+    L  A   C+
Sbjct: 391 MILGSG--VWCLAMR---NQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKCS 440


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 126/415 (30%), Positives = 200/415 (48%), Gaps = 45/415 (10%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAV---------ARKSVVPIASGRQITQSPTYIVR 100
           K   W + + + L  D  R++ L S            A  S +P++SG ++ Q+  YIV 
Sbjct: 12  KSTDWNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGVRL-QTLNYIVT 70

Query: 101 AKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV 157
            +IG   + + + +DT +D  WV   PC  C      +FN + S +++ + C ++ C+ +
Sbjct: 71  VEIG--GRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSL 128

Query: 158 PNPTCGGGACAFN-------LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
              T   G C  N       + YG  S    +L  + ++L T  V  + FGC +   G  
Sbjct: 129 QYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIFGCGRNNKGLF 188

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IK 265
               GL+GLG+  LSL++QT  +++  FSYCLP+  A   SGSL LG      +    I 
Sbjct: 189 GGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPT-TAADASGSLILGGNSSVYKNTTPIS 247

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
           YT ++ NP+  + Y++NL  I +G         ALQ      +G +IDSGTV TRL  P 
Sbjct: 248 YTRMIANPQLPTFYFLNLTGISIGGV-------ALQAPNYRQSGILIDSGTVITRLPPPV 300

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIH 381
           Y  ++  F ++     +       DTC+++     +  PTI + F G N  L  D   I 
Sbjct: 301 YRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEG-NAELTVDVTGIF 359

Query: 382 ---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
               T  S  CLA+A+   + +  + +I N QQ+N R++Y+   S+LG A E C+
Sbjct: 360 YFVKTDASQVCLALASL--SFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACS 412


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 104/352 (29%), Positives = 164/352 (46%), Gaps = 17/352 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
           S  Y+++  +GTP Q     +DT +D  WV C  C  C      +F    S+++ N  C 
Sbjct: 5   SGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCT 64

Query: 151 AAQCKQVPNPTCG-GGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
            + C  +P PTC     C ++ +YG  S    + + +T++L    +    FGC     G 
Sbjct: 65  DSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTLARIGFGCGHNQEGT 124

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
                GL+GLG+G LSL +Q  + +   FSYCL           +  G   +  R  +TP
Sbjct: 125 FAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRASFTP 184

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           LL+N    S YYV + +I VG R V  PP A + +     G I+DSGT  T     A+  
Sbjct: 185 LLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRLAAFIP 244

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLLIH- 381
           +    RR++       +  G + CY +  V+      P++T+  + ++  +P  NL +  
Sbjct: 245 ILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVDFEIPVSNLWVLV 304

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              G   C AM+ +        ++I N+QQQN+ I+ DV NSR+G     C+
Sbjct: 305 DNFGETVCTAMSTSDQ-----FSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 135/440 (30%), Positives = 204/440 (46%), Gaps = 55/440 (12%)

Query: 23  PICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV 82
           P+       +T+ + H   PCSP     P + E ++ E+L +DQ R +++     A+ SV
Sbjct: 44  PVTPPSSSGTTVPLSHRHGPCSP----APSTVEPTMAELLRRDQLRAKYIQ----AKLSV 95

Query: 83  VPIASGRQITQS-----PT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
              +    + QS     PT          Y++   IGTPA T  + +DT +D +WV C  
Sbjct: 96  NSGSGTDGVQQSAAITLPTTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCHA 155

Query: 128 CVGCSSTV-FNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANL 182
             G  S++ F+  +S+T+    C +A C ++     G      C + + YG  S      
Sbjct: 156 RAGAGSSLFFDPGKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTY 215

Query: 183 SQDTISL-ATDIVPGYTFGCIQKAT-GNSVPP---QGLLGLGRGSLSLLAQTQNLYQSTF 237
             DT++L +T+ V  + FGC + +  G  +      GL+GLG G+ SL++QT   Y S F
Sbjct: 216 GSDTLALNSTEKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAF 275

Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
           SYCLP+      SG L LG          TP+ ++ R  + Y+V L  I VG   V I P
Sbjct: 276 SYCLPA--TTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISP 333

Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP- 356
                     AG+I+DSGT+ TRL   AY+A+   FR  +       +    DTC+    
Sbjct: 334 TVFA------AGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTG 387

Query: 357 ---IVAPTITLMFSGMNVT-LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
              +  P + L+FSG  V  L  D ++  S      CLA A A   + S   +I N+QQ+
Sbjct: 388 QDNVSIPAVELVFSGGAVVDLDADGIMYGS------CLAFAPATGGIGS---IIGNVQQR 438

Query: 413 NHRILYDVPNSRLGVARELC 432
              +L+DV  S LG     C
Sbjct: 439 TFEVLHDVGQSVLGFRPGAC 458


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 139/486 (28%), Positives = 224/486 (46%), Gaps = 73/486 (15%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQDHS----STLQVFHVFSPCSPFKPS----------- 49
           L FFL+F+FL+ +    N  C+ +         LQ  H F       P            
Sbjct: 9   LPFFLSFVFLYFIIA--NGGCELEQKKMFKVQMLQRNHQFGSKGCILPESRKEKGAIVLE 66

Query: 50  ---------KPLSWEESVLEMLAKDQARLQFLSSLAVARKS-----------VVPIASGR 89
                    + ++W   + + L  D  R++ + +   A+ S            +P+ASG 
Sbjct: 67  MKDRGYCSERKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSSEIQIPLASGI 126

Query: 90  QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKN 146
            + ++  YIV   IG   Q + + +DT +D  WV C  C+ C S    VFN + S+++ +
Sbjct: 127 NL-ETLNYIV--TIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNS 183

Query: 147 LGCQAAQCKQVPNPTCGGGACAFN--------LTYGSSTIA-ANLSQDTISLATDIVPGY 197
           L C ++ C+ +   T    AC  N        ++YG  +     L  + +S     V  +
Sbjct: 184 LLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGISVSNF 243

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
            FGC +   G      G++GLGR +LS+++QT   +   FSYCLP+  + + SGSL +G 
Sbjct: 244 VFGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGA-SGSLVIGN 302

Query: 258 IGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
                +    I YT ++ NP+ S+ Y +NL  I VG         A+Q       G +ID
Sbjct: 303 ESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGV-------AIQDTSFGNGGILID 355

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGM 369
           SGTV TRL    Y A++  F ++        +L   DTC+++     +  PT+++ F   
Sbjct: 356 SGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFEN- 414

Query: 370 NVTLPQD--NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           NV L  D   +L     GS  CLA+A+  D  +  + +I N QQ+N R++YD   S++G 
Sbjct: 415 NVDLNVDAVGILYMPKDGSQVCLALASLSDEND--MAIIGNYQQRNQRVIYDAKQSKIGF 472

Query: 428 ARELCT 433
           ARE C+
Sbjct: 473 AREDCS 478


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 127/376 (33%), Positives = 184/376 (48%), Gaps = 26/376 (6%)

Query: 71  FLSSLAVARKSVVPIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
           F +SLA A +   P+ SG  + Q S  Y  R  IG+PA+ L M +DT +D  WV C  C 
Sbjct: 146 FGASLAAAIQG--PVVSG--VGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCA 201

Query: 130 GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLS 183
            C   S  VF+ + S ++  + C + +C+ +    C    GAC + + YG  S    + +
Sbjct: 202 DCYQQSDPVFDPSLSASYAAVSCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFA 261

Query: 184 QDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
            +T++L     V     GC     G  V   GLL LG G LS  +Q   +  STFSYCL 
Sbjct: 262 TETLTLGDSTPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISASTFSYCLV 318

Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
              + + S +L+ G  G        PL+++PR  + YYV L  I VG + + IP  A   
Sbjct: 319 DRDSPAAS-TLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAM 377

Query: 303 NPTTGAG-TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PI 357
           + T+G+G  I+DSGT  TRL + AY A+RD F R   S    + +  FDTCY +     +
Sbjct: 378 DATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSV 437

Query: 358 VAPTITLMFSGMN-VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
             P ++L F G   + LP  N LI        CLA A      N+ +++I N+QQQ  R+
Sbjct: 438 EVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRV 493

Query: 417 LYDVPNSRLGVARELC 432
            +D     +G     C
Sbjct: 494 SFDTAKGVVGFTPNKC 509


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 126/413 (30%), Positives = 189/413 (45%), Gaps = 52/413 (12%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP------------------T 96
           +E     L +D  R++ +++LA      +P   GR +T +P                   
Sbjct: 89  QELFSSRLQRDSRRVRSIATLAAQ----IP---GRNVTHAPRPGGFSSSVVSGLSQGSGE 141

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ---------STTFKNL 147
           Y  R  +GTPA+ + M +DT +D  W+ C  C          +Q         S T+  +
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR------CYSQSDPIFDPRKSKTYATI 195

Query: 148 GCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQK 204
            C +  C+++ +  C      C + ++YG  +    + S +T++   + V G   GC   
Sbjct: 196 PCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHD 255

Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
             G  V   GLLGLG+G LS   QT + +   FSYCL    A S   S+  G     +  
Sbjct: 256 NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA 315

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           ++TPLL NP+  + YYV LL I V G RV  +     + +     G IIDSGT  TRL+ 
Sbjct: 316 RFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIR 375

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLL 379
           PAY A+RD FR    +     +   FDTC+ +     +  PT+ L F   +V+LP  N L
Sbjct: 376 PAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFRRADVSLPATNYL 435

Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           I        C A A         L++I N+QQQ  R++YD+ +SR+G A   C
Sbjct: 436 IPVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 115/350 (32%), Positives = 164/350 (46%), Gaps = 29/350 (8%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
           IGTPA      +DT +D  W  C  CV C   S+ VF+ + S+T+  + C +A C  +P 
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232

Query: 160 PTC-GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLL 216
             C     C +  TYG SS+    L+ +T +LA   +PG  FGC     G+      GL+
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLV 292

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG-------QPKRIKYTPL 269
           GLGRG LSL++Q   L    FSYCL S    + S  L LG +            ++ TPL
Sbjct: 293 GLGRGPLSLVSQ---LGLDKFSYCLTSLDDTNNS-PLLLGSLAGISEASAAASSVQTTPL 348

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           +KNP + S YYV+L AI VG   + +P  A         G I+DSGT  T L    Y A+
Sbjct: 349 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 408

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVP------IVAPTITLMFS-GMNVTLPQDNLLIHS 382
           +  F  ++       S  G D C+  P      +  P +   F  G ++ LP +N ++  
Sbjct: 409 KKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLD 468

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                 CL +       +  L++I N QQQN + +YDV +  L  A   C
Sbjct: 469 GGSGALCLTVMG-----SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 513


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 120/409 (29%), Positives = 185/409 (45%), Gaps = 39/409 (9%)

Query: 53  SWEESVLEMLAKDQARLQFLSS-LAVARKSV-------------------VPIASGRQIT 92
           +++  VL  LA+D AR+  L++ L +A  S+                    P++SG    
Sbjct: 94  NYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPTETELLRPEDLSTPVSSG-TAQ 152

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGC 149
            S  Y  R  +G P++   M +DT +D  W+   PC+ C   S  +F+   S+++  L C
Sbjct: 153 GSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTC 212

Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGN 208
            A QC+ +    C  G C + ++YG  +        +T+S     V     GC     G 
Sbjct: 213 DAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSFGAGSVNRVAIGCGHDNEGL 272

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
            V   G  GL       L+ T  +  ++FSYCL   +    S +L      +P      P
Sbjct: 273 FV---GSAGLLGLGGGPLSLTSQIKATSFSYCLVD-RDSGKSSTLEFNS-PRPGDSVVAP 327

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           LLKN + ++ YYV L  + VG  +V +PP     + +   G I+DSGT  TRL   AY +
Sbjct: 328 LLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYNS 387

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNV-TLPQDNLLIHST 383
           VRD F+R+  +      +  FDTCY +     +  PT++  FSG     LP  N LI   
Sbjct: 388 VRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWALPAKNYLIPVD 447

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                C A A       S +++I N+QQQ  R+ +D+ NS +G +   C
Sbjct: 448 GAGTYCFAFAP----TTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 123/390 (31%), Positives = 189/390 (48%), Gaps = 32/390 (8%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           L+K+  R   +  L     + +P  SG  I  S  Y+V   +GTP + L +  DT +D  
Sbjct: 15  LSKNLGRENTVKDL---DSTTLPAESGSLI-GSANYVVVVGLGTPKRDLSLVFDTGSDLT 70

Query: 122 WVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNP-------TCGGGACAFN 170
           W  C  C G        +F+ ++S+++ N+ C ++ C Q+ +        +    +C ++
Sbjct: 71  WTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDASCIYD 130

Query: 171 LTYG-SSTIAANLSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
             YG +ST    LSQ+ +++ ATDIV  + FGC Q   G      GL+GLGR  +S++ Q
Sbjct: 131 AKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFNGSAGLMGLGRHPISIVQQ 190

Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIR 287
           T + Y   FSYCLP+    S  G L  G        + YTPL      +S Y +++++I 
Sbjct: 191 TSSNYNKIFSYCLPATS--SSLGHLTFGASAATNASLIYTPLSTISGDNSFYGLDIVSIS 248

Query: 288 VGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG 347
           VG     +P  A+  +  +  G+IIDSGTV TRL    Y A+R  FRR +         G
Sbjct: 249 VGG--TKLP--AVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAG 304

Query: 348 GFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV 402
             DTCY +     I  P I   FS G+ V L    +L   +   + CLA AA  +  ++ 
Sbjct: 305 LLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQV-CLAFAA--NGSDND 361

Query: 403 LNVIANMQQQNHRILYDVPNSRLGVARELC 432
           + V  N+QQ+   ++YDV   R+G     C
Sbjct: 362 ITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 120/369 (32%), Positives = 188/369 (50%), Gaps = 29/369 (7%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGCSST---VFNS 138
           +P+  G  I  S  Y V+  +GTP +   M +DT +  +W+ C  C V C +    +++ 
Sbjct: 112 IPLNPGLSIG-SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDP 170

Query: 139 AQSTTFKNLGCQAAQCKQVP-----NPTC--GGGACAFNLTYGSSTIA-ANLSQDTISL- 189
           + S T+K L C + +C ++      +P C     AC +  +YG ++ +   LSQD ++L 
Sbjct: 171 SVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLT 230

Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
           ++  +P +T+GC Q   G      G++GL R  LS+LAQ    Y   FSYCLP+  + S 
Sbjct: 231 SSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSS 290

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
            G         P   K+TP+L + +  SLY++ L AI V  R +D+   A+   P     
Sbjct: 291 GGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDL-AAAMYRVP----- 344

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCYSVPI----VAPTITL 364
           T+IDSGTV TRL    Y A+R  F + + +      +    DTC+   +      P I +
Sbjct: 345 TLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKM 404

Query: 365 MF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           +F  G ++TL   ++LI +  G ITCLA A +  +  + + +I N QQQ + I YDV  S
Sbjct: 405 IFQGGADLTLRAPSILIEADKG-ITCLAFAGS--SGTNQIAIIGNRQQQTYNIAYDVSTS 461

Query: 424 RLGVARELC 432
           R+G A   C
Sbjct: 462 RIGFAPGSC 470


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 148/470 (31%), Positives = 217/470 (46%), Gaps = 58/470 (12%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQDHSST-LQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
           L +FL F    +L+  L      Q      L ++HV    S    + P S+ +    M+ 
Sbjct: 3   LFWFLVFSAHLALASSLVEFQGMQKQEGMQLNLYHVKGLDSSQTSTSPFSFSD----MIT 58

Query: 64  KDQARLQFLSSLAVARKSV----------------VPIASGRQITQSPTYIVRAKIGTPA 107
           KD+ R++FL S    ++S                  P+ SG  I  S  Y V+  +GTPA
Sbjct: 59  KDEERVRFLHSRLTNKESASNSATTDKLGGPSLVSTPLKSGLSI-GSGNYYVKIGVGTPA 117

Query: 108 QTLLMAMDTSNDAAWVPCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCK-------Q 156
           +   M +DT +  +W+ C  CV  C   V   F  + S T+K L C ++QC         
Sbjct: 118 KYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSSQCSSLKSSTLN 177

Query: 157 VPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVP--GYTFGCIQKATGNSVPPQ 213
            P  +   GAC +  +YG ++ +   LSQD ++L     P  G+ +GC Q   G      
Sbjct: 178 APGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSAAPSSGFVYGCGQDNQGLFGRSA 237

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKAL---SFSGSLRLGPIGQPKR-IKYTP 268
           G++GL    LS+L Q  N Y + FSYCLP SF A    S SG L +G         K+TP
Sbjct: 238 GIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVSGFLSIGASSLSSSPYKFTP 297

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L+KNP+  SLY++ L  I V  + + +   A  +N      TIIDSGTV TRL    Y A
Sbjct: 298 LVKNPKIPSLYFLGLTTITVAGKPLGV--SASSYN----VPTIIDSGTVITRLPVAIYNA 351

Query: 329 VRDVFRRRVGSNLT-VTSLGGFDTCYSVPI----VAPTITLMF-SGMNVTLPQDNLLIHS 382
           ++  F   +             DTC+   +      P I ++F  G  + L   N L+  
Sbjct: 352 LKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEI 411

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             G+ TCLA+AA+ + +    ++I N QQQ   + YDV NS++G A   C
Sbjct: 412 EKGT-TCLAIAASSNPI----SIIGNYQQQTFTVAYDVANSKIGFAPGGC 456


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 137/424 (32%), Positives = 206/424 (48%), Gaps = 51/424 (12%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVARKS----VVPIASGRQ-----ITQSPT---YIVRAKI 103
           E +   L +D+ R  ++ S A A  +    VV +++GR      ++++PT   YI +  +
Sbjct: 88  ELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLVAPVVSRAPTSGDYIAKIAV 147

Query: 104 GTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP 160
           GTPA   L+A+DT++D  W+ C  C  C   S  VF+   ST++  +   A  C+ +   
Sbjct: 148 GTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRS 207

Query: 161 TCGG---GACAFNLTYG-------SSTIAANLSQDTISLATDIVPGY-TFGCIQKATG-N 208
             G    G C + + YG       +ST   +L ++T++ A  +   Y + GC     G  
Sbjct: 208 GGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNKGLF 267

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNL-YQSTFSYCLPSFKALSFSGSLRL----GPIGQPKR 263
             P  G+LGL RG +S+  Q   L Y ++FSYCL  F +   S S  L    G +     
Sbjct: 268 GAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPP 327

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGR-RVVDIPPGALQFNPTTG-AGTIIDSGTVFTRL 321
             +TP + N    + YYV L+ + VG  RV  +    LQ +P TG  G I+DSGT  TRL
Sbjct: 328 ASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHGGVILDSGTTVTRL 387

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGG----FDTCYSVP--------IVAPTITLMFS-G 368
             PAYTA RD FR    + L   S GG    FDTCY+V         +  P +++ F+ G
Sbjct: 388 ARPAYTAFRDAFRAAA-TGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGG 446

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
           + ++L   N LI   +    C A A   D     ++VI N+ QQ  R++YD+   R+G A
Sbjct: 447 VELSLQPKNYLITVDSRGTVCFAFAGTGDR---SVSVIGNILQQGFRVVYDIGGQRVGFA 503

Query: 429 RELC 432
              C
Sbjct: 504 PNSC 507


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 120/383 (31%), Positives = 178/383 (46%), Gaps = 38/383 (9%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS----STVFNSA 139
           P+ SG   + S  Y V  +IGTP QTLL+  DT +D  WV C+ C  CS     + F + 
Sbjct: 74  PVISGAS-SGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFAR 132

Query: 140 QSTTFKNLGCQAAQCKQVPNP-------TCGGGACAFNLTYG-SSTIAANLSQDTISLAT 191
            STT+  + C + QC+ VP+P       T     C +  TY  SST     S++ ++L T
Sbjct: 133 HSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNT 192

Query: 192 DI-----VPGYTFGCIQKATGNSVP------PQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
                  + G +FGC  + +G S+        QG++GLGR  +S  +Q    + S FSYC
Sbjct: 193 STGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYC 252

Query: 241 L-------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           L       P    L+  G+  +  + +   + +TPLL NP   + YY+ +  + V    +
Sbjct: 253 LMDYTLSPPPTSFLTIGGAQNVA-VSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKL 311

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
            I P     +     GTIIDSGT  T +  PAYT +   F++RV          GFD C 
Sbjct: 312 PINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCM 371

Query: 354 SVPIVA----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
           +V  V     P ++   +G +V  P        T   I CLA+   P + +   +V+ N+
Sbjct: 372 NVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAV--QPVSQDGGFSVLGNL 429

Query: 410 QQQNHRILYDVPNSRLGVARELC 432
            QQ   + +D   SRLG  R  C
Sbjct: 430 MQQGFLLEFDRDKSRLGFTRRGC 452


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 110/351 (31%), Positives = 165/351 (47%), Gaps = 28/351 (7%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST----VFNSAQSTTFKNLGCQAAQCKQ-- 156
           +GTP Q   + +D  +D  W  C+  VG ++     VF++A+S++F  L C +  C+   
Sbjct: 113 VGTPPQPSKVILDLGSDLLWTQCS-LVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAGT 171

Query: 157 VPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSVPPQG 214
             N TC    CA+   YG  T    L+ +T +      +    TFGC + A G      G
Sbjct: 172 FTNKTCTDRKCAYENDYGIMTATGVLATETFTFGAHHGVSANLTFGCGKLANGTIAEASG 231

Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-----KALSFSGSLRLGPIGQPKRIKYTPL 269
           +LGL  G LS+L Q   L  + FSYCL  F       + F     LG      +++  PL
Sbjct: 232 ILGLSPGPLSMLKQ---LAITKFSYCLTPFADRKTSPVMFGAMADLGKYKTTGKVQTIPL 288

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           LKNP     YYV ++ + VG + +D+P   L   P    GT++DS T    LV PA+T +
Sbjct: 289 LKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTEL 348

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITLMFSG-MNVTLPQDNLLIH 381
           +      +   +   S+  +  C+ +P       +  P + L F G   ++LP+DN    
Sbjct: 349 KKAVMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQE 408

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            + G + CLA+  AP       NVI N+QQQN  +LYDV N +   A   C
Sbjct: 409 PSPG-MMCLAVMQAP--FEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKC 456


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 126/403 (31%), Positives = 191/403 (47%), Gaps = 41/403 (10%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAK 102
           S    V+ ++A+D AR++ L    VA  S          VVP         S  Y VR  
Sbjct: 80  SRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVD----DGSGEYFVRVG 135

Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
           +G+P     + +D+ +D  WV   PC  C   +  +F+ A S++F  + C +A C+ +  
Sbjct: 136 VGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSG 195

Query: 160 PTCGGGA----CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQG 214
             CGGG     C +++TYG  S     L+ +T++L    V G   GC  + +G  V   G
Sbjct: 196 TGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAG 255

Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
           LLGLG G++SL+ Q        FSYCL S +    +GSL LG      R +  P  +  R
Sbjct: 256 LLGLGWGAMSLVGQLGGAAGGVFSYCLAS-RGAGGAGSLVLG------RTEAVP--RGRR 306

Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
            SS YYV L  I VG   + +     Q       G ++D+GT  TRL   AY A+R  F 
Sbjct: 307 ASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFD 366

Query: 335 RRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITC 389
             +G+     ++   DTCY +     +  PT++  F  G  +TLP  NLL+    G++ C
Sbjct: 367 GAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVE-VGGAVFC 425

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LA A +    +S ++++ N+QQ+  +I  D  N  +G     C
Sbjct: 426 LAFAPS----SSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 111/348 (31%), Positives = 171/348 (49%), Gaps = 14/348 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  +G+P +   M +D+ +D  WV   PC  C   S  VF+ A+S ++  + C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           ++ C ++ N  C  G C + + YG  S     L+ +T++ A  +V     GC  +  G  
Sbjct: 188 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 247

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           +   GLLG+G GS+S + Q        F YCL S +    +GSL  G    P    + PL
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 306

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           ++NPR  S YYV L  + VG   + +P G      T   G ++D+GT  TRL   AY A 
Sbjct: 307 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAF 366

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFS-GMNVTLPQDNLLIHSTA 384
           RD F+ +  +    + +  FDTCY     V +  PT++  F+ G  +TLP  N L+    
Sbjct: 367 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 426

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A AA+P    + L++I N+QQ+  ++ +D  N  +G    +C
Sbjct: 427 SGTYCFAFAASP----TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 118/365 (32%), Positives = 171/365 (46%), Gaps = 36/365 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQAA 152
           Y++   IGTP        DT +D  W  C    T C    + ++N A STTF  L C ++
Sbjct: 114 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 173

Query: 153 --QCKQVPNPTCGGGACA--FNLTYGSSTIAANLSQDTISLATDI-----VPGYTFGCIQ 203
              C            CA  +  TYG+   A     +T +  +       VPG  FGC  
Sbjct: 174 LSMCAGALAGAAPPPGCACMYYQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSN 233

Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QP 261
            ++ +     GL+GLGRGSLSL++Q   L    FSYCL  F+  + + +L LGP      
Sbjct: 234 ASSSDWNGSAGLVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLLGPSAALNG 290

Query: 262 KRIKYTPLLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
             ++ TP + +P R   S+ YY+NL  I +G + + I PGA    P    G IIDSGT  
Sbjct: 291 TGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTI 350

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTV---TSLGGFDTCYSV-------PIVAPTITLMFSG 368
           T L   AY  VR   + ++ + L     +   G D C+++       P V P++TL F G
Sbjct: 351 TSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDG 410

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            ++ LP D+ +I  +   + CLAM    +  +  ++   N QQQN  ILYDV    L  A
Sbjct: 411 ADMVLPADSYMISGSG--VWCLAMR---NQTDGAMSTFGNYQQQNMHILYDVREETLSFA 465

Query: 429 RELCT 433
              C+
Sbjct: 466 PAKCS 470


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 140/455 (30%), Positives = 211/455 (46%), Gaps = 56/455 (12%)

Query: 13  FLFSLSEGLNP--ICDTQD-----HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
           ++   S  L P  +C  Q      + +TL + H   PCSP    +  S EE+    L +D
Sbjct: 33  YMVVASSSLEPSEVCSGQKVTSSKNGATLPLVHRHGPCSPVMSKEKPSHEET----LGRD 88

Query: 66  QARL-QFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
           Q R     + L+  R S           +P +SG  +  +P Y++   +GTPA T +M++
Sbjct: 89  QLRAANIHAKLSSPRNSSAKELQQSGVTIPTSSGYSL-GTPEYVITVSLGTPAVTQVMSI 147

Query: 115 DTSNDAAWVPCTGCVG--CSS---TVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGAC 167
           DT +D +WV C  C    CSS    +F+ A+S T+    C +AQC Q+      C    C
Sbjct: 148 DTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSHC 207

Query: 168 AFNLTY-GSSTIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSL 225
            + + Y   S        DT+ L T D V  + FGC  +A G      GL+GLG  + SL
Sbjct: 208 QYIVKYVDHSNTTGTYGSDTLGLTTSDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESL 267

Query: 226 LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG---QPKRIKYTPLLKNPRRSSLYYVN 282
           ++QT   Y   FSYCLP   + S  G L LG         R   TPL++     + Y V 
Sbjct: 268 VSQTAATYGKAFSYCLPP-SSSSAGGFLTLGAAAGGTSSSRYSRTPLVRF-NVPTFYGVF 325

Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           L AI V    +++P         +GA +++DSGTV T+L   AY A+R  F++ + +  +
Sbjct: 326 LQAITVAGTKLNVPASVF-----SGA-SVVDSGTVITQLPPTAYQALRTAFKKEMKAYPS 379

Query: 343 VTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
              +G  DTC+       +  P +TL FS G  + L    +     AG   CLA  A   
Sbjct: 380 AAPVGILDTCFDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIFY---AG---CLAFTATAQ 433

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           + ++   ++ N+QQ+   +L+DV  S LG     C
Sbjct: 434 DGDT--GILGNVQQRTFEMLFDVGGSTLGFRPGAC 466


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 111/348 (31%), Positives = 171/348 (49%), Gaps = 14/348 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  +G+P +   M +D+ +D  WV   PC  C   S  VF+ A+S ++  + C 
Sbjct: 129 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 188

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           ++ C ++ N  C  G C + + YG  S     L+ +T++ A  +V     GC  +  G  
Sbjct: 189 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 248

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           +   GLLG+G GS+S + Q        F YCL S +    +GSL  G    P    + PL
Sbjct: 249 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 307

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           ++NPR  S YYV L  + VG   + +P G      T   G ++D+GT  TRL   AY A 
Sbjct: 308 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTGAYAAF 367

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFS-GMNVTLPQDNLLIHSTA 384
           RD F+ +  +    + +  FDTCY     V +  PT++  F+ G  +TLP  N L+    
Sbjct: 368 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 427

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A AA+P    + L++I N+QQ+  ++ +D  N  +G    +C
Sbjct: 428 SGTYCFAFAASP----TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 471


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 134/409 (32%), Positives = 194/409 (47%), Gaps = 47/409 (11%)

Query: 57  SVLEMLAKDQARLQFLSSLAVARKSVVPI--ASGRQITQSPT---YIVRAKIGTPAQT-- 109
           S  ++LA+   R     +  +  K+  P    +G  +T +PT   YI +  +GTP +   
Sbjct: 81  SAADLLARRLQR-DMRRAAWIITKAATPADPENGTVVTGAPTSGEYIAKITVGTPYENDS 139

Query: 110 ---LLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG 163
               L++ D  +D  W+ C  C  C      V+N  +S++  ++GC A  C+ +   + G
Sbjct: 140 SFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVGCYAPACRALG--SSG 197

Query: 164 G-----GACAFNLTYGS-STIAANLSQDTISLATDI-VPGYTFGCIQKATG-NSVPPQGL 215
           G       C + + YG  S+ A +   +T++    + VPG   GC     G    P  G+
Sbjct: 198 GCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPPGVRVPGVAIGCGSDNQGLFPAPAAGI 257

Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP-----LL 270
           LGLGRGSLS  +Q    Y  +FSYCL        S +L  G          TP     +L
Sbjct: 258 LGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSSTLTFGSGASATTTTTTPPSFTPML 317

Query: 271 KNPRRSSLYYVNLLAIRVGR-RVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTA 328
            N R  + YYV L+ I VG  RV  +    L+ +P+TG  G I+DSGT  TRL  PAY A
Sbjct: 318 TNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAA 377

Query: 329 VRDVFRRRVGSNLTVTSLGG----FDTCYS-----VPIVAPTITLMFS-GMNVTLPQDNL 378
            RD FR      L   S GG    FDTCYS     V    P +++ F+ G+ V LP  N 
Sbjct: 378 FRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNY 437

Query: 379 LI--HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           LI   S  G++ C A A + D     +++I N+Q Q  R++YDV   R+
Sbjct: 438 LIPVDSNKGTM-CFAFAGSGDR---GVSIIGNIQLQGFRVVYDVDGQRV 482


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  160 bits (406), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 121/392 (30%), Positives = 193/392 (49%), Gaps = 29/392 (7%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           K L+  E +   + + + RLQ L ++A+   S   I +   +  +  ++++  IGTP +T
Sbjct: 51  KNLTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEA-PVLPGNGEFLMKLAIGTPPET 109

Query: 110 LLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA 166
               +DT +D  W    PCT C   S+ +F+  +S++F  L C +  C+ +P  +C  G 
Sbjct: 110 YSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCNNG- 168

Query: 167 CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLS 224
           C +  +YG  S+    L+ +T++     VP   FGC     G+      GL+GLGRG LS
Sbjct: 169 CEYLYSYGDYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLS 228

Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ----PKRIKYTPLLKNPRRSSLYY 280
           L++Q   L +  FSYCL +      S +L +G +         IK TPL+ +P   S YY
Sbjct: 229 LVSQ---LKEPKFSYCLTTVDDTKTS-TLLMGSLASVNASSSAIKTTPLIHSPAHPSFYY 284

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           ++L  I VG   + I             G IIDSGT  T L   A+  V   F  ++  N
Sbjct: 285 LSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI--N 342

Query: 341 LTVTSLG--GFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
           L V S G  G D C+++P     I  P +   F G ++ LP +N +I  ++  + CLAM 
Sbjct: 343 LPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDGADLELPAENYMIGDSSMGVACLAMG 402

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           ++     S +++  N+QQQN  +L+D+    L
Sbjct: 403 SS-----SGMSIFGNVQQQNMLVLHDLEKETL 429


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  160 bits (406), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 174/366 (47%), Gaps = 23/366 (6%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG  +  S  Y +R  +GTP + + + MDT +D  W+ C  CV C      VF+  +
Sbjct: 25  PVISGLSL-GSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYK 83

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYT- 198
           S+T+  LGC + QC  +    C G  C + + YG  + +    + D +SL +    G   
Sbjct: 84  SSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVV 143

Query: 199 -----FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-GS 252
                 GC     G  V   GLLGLG+G LS   Q  +     FSYCL      S    S
Sbjct: 144 LNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSS 203

Query: 253 LRLGPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
           L  G     P  +++TP   N R S+ YY+ +  I VG  ++ IP  A Q +     G I
Sbjct: 204 LIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVI 263

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF- 366
           IDSGT  TRL   AY ++R+ FR      +  T    FDTCY++     +  PT+TL F 
Sbjct: 264 IDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQ 323

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
            G ++ LP  N L+     S  CLA A       +  ++I N+QQQ  R++YD  ++++G
Sbjct: 324 GGADLKLPASNYLVPVDNSSTFCLAFAGT-----TGPSIIGNIQQQGFRVIYDNLHNQVG 378

Query: 427 VARELC 432
                C
Sbjct: 379 FVPSQC 384


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  160 bits (406), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 124/417 (29%), Positives = 203/417 (48%), Gaps = 45/417 (10%)

Query: 49  SKPLSWEESVLEMLAKDQARLQFLSSL---------AVARKSVVPIASGRQITQSPTYIV 99
            K + W   + + L  D  R++ + +            A ++ +P++SG  + Q+  YIV
Sbjct: 9   EKKIDWNRRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINL-QTLNYIV 67

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQ 156
              +G+   T+++  DT +D  WV C  C+ C +    +F  + S++++++ C ++ C+ 
Sbjct: 68  TMGLGSKNMTVII--DTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125

Query: 157 VPNPTCGGGACA--------FNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG 207
           +   T   GAC         + + YG  S     L  + +S     V  + FGC +   G
Sbjct: 126 LQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVSVSDFVFGCGRNNKG 185

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP----IGQPKR 263
                 GL+GLGR  LSL++QT   +   FSYCLP+ +A S SGSL +G           
Sbjct: 186 LFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGS-SGSLVMGNESSVFKNANP 244

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           I YT +L NP+ S+ Y +NL  I VG   +  P   L F      G +IDSGTV TRL +
Sbjct: 245 ITYTRMLSNPQLSNFYILNLTGIDVGGVALKAP---LSFG---NGGILIDSGTVITRLPS 298

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG---MNVTLPQD 376
             Y A++  F ++     +       DTC+++     +  PTI+L F G   +NV     
Sbjct: 299 SVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEGNAQLNVDATGT 358

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             ++   A  + CLA+A+  D  ++   +I N QQ+N R++YD   S++G A E C+
Sbjct: 359 FYVVKEDASQV-CLALASLSDAYDTA--IIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 129/395 (32%), Positives = 186/395 (47%), Gaps = 49/395 (12%)

Query: 78  ARKSVVPIASGRQIT----QSPT---YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTG 127
           ARK  +  +SG  ++     SPT   Y++   IGTP        DT +D  W    PCT 
Sbjct: 6   ARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS 65

Query: 128 -CVGCSSTVFNSAQSTTFKNLGCQAA---------QCKQVPNPTCGGGACAFNLTYGSST 177
            C    + ++N + STTF  L C ++              P P C   AC +N+TYGS  
Sbjct: 66  QCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSGW 122

Query: 178 IAANLSQDTISLATDI-----VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQN 231
            +     +T +  +       VPG  FGC   ++G N+    GL+GLGRG LSL++Q   
Sbjct: 123 TSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ--- 179

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS---SLYYVNLLA 285
           L    FSYCL  ++  + + +L LGP   +     +  TP + +P  +   + YY+NL  
Sbjct: 180 LGVPKFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTG 239

Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
           I +G   + IPP A   N     G IIDSGT  T L   AY  VR      V    T  S
Sbjct: 240 ISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGS 299

Query: 346 LG-GFDTCY------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
              G D C+      S P   P++TL F+G ++ LP D+ ++   +G + CLAM    + 
Sbjct: 300 ADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSG-LWCLAMQ---NQ 355

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            +  +N++ N QQQN  ILYD+    L  A   C+
Sbjct: 356 TDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 390


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 120/354 (33%), Positives = 167/354 (47%), Gaps = 22/354 (6%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  IG+P +   + MDT +D  W+ C+ C  C   +  VF+   S++F+ L C 
Sbjct: 11  SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCS 70

Query: 151 AAQCKQVPNPTCGG--GACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATG 207
             QCK +    C      C + ++YG  +    +L+ D+ S++        FGC     G
Sbjct: 71  TPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTSPVVFGCGHDNEG 130

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIGQP--KRI 264
             V   GLLGLG G LS  +Q   L    FSYCL S    +  S +L  G    P     
Sbjct: 131 LFVGAAGLLGLGAGKLSFPSQ---LSSRKFSYCLVSRDNGVRASSALLFGDSALPTSASF 187

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVA 323
            YT LLKNP+  + YY  L  I +G  ++ IP  A + + +TG  G IIDSGT  TRL  
Sbjct: 188 AYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPT 247

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFS-GMNVTLPQDNL 378
            AYT +RD FR              FDTCY       +  PT++  F  G +V LP  N 
Sbjct: 248 YAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNY 307

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           L+        C A +    +    L++I N+QQQ  R+  D+ +SR+G A   C
Sbjct: 308 LVPVDTSGTFCFAFSKTSLD----LSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 138/435 (31%), Positives = 205/435 (47%), Gaps = 59/435 (13%)

Query: 21  LNPICDTQDH--SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------ 72
           L+P+   + +  S+ L++ H   PC+P + S   +   SV + L  DQ R +++      
Sbjct: 52  LDPVAQRRRNGTSAVLRLTHKHGPCAPSRASSLAT--PSVADTLRADQRRAEYILRRVSG 109

Query: 73  -------SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
                   S A A  + VP   G  I  +  Y+V   +GTP     + +DT +D +WV C
Sbjct: 110 RGTPQLWDSKAEAATATVPANWGFNI-GTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQC 168

Query: 126 TGCVG--CSST---VFNSAQSTTFKNLGCQAAQCKQ--VPNPTCGGGACAFNLTYGSSTI 178
           T C    C S    +F+ AQS+++  + C    C    +   +C    C + ++YG  + 
Sbjct: 169 TPCAAPACYSQKDPLFDPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGDGSK 228

Query: 179 AANL-SQDTISLA-TDIVPGYTFGCIQKA---TGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
              + S DT++L+  D V G+ FGC       TGN     GLLGLGR   SL+ QT   Y
Sbjct: 229 TTGVYSSDTLTLSPNDAVRGFFFGCGHAQSGFTGN----DGLLGLGREEASLVEQTAGTY 284

Query: 234 QSTFSYCLPSFKALSFSGSLRL-GPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
              FSYCLP+    S +G L L GP G  P     T LL +P  ++ Y V L  I VG +
Sbjct: 285 GGVFSYCLPTRP--STTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQ 342

Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGF 349
            + +P            GT++D+GTV TRL   AY A+R  FR  + S    +  + G  
Sbjct: 343 QLSVPSSVFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGIL 396

Query: 350 DTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
           DTCY+      +  P + L FS G  VTL  D +L      S  CLA   AP   +  + 
Sbjct: 397 DTCYNFSGYGTVTLPNVALTFSGGATVTLGADGIL------SFGCLAF--APSGSDGGMA 448

Query: 405 VIANMQQQNHRILYD 419
           ++ N+QQ++  +  D
Sbjct: 449 ILGNVQQRSFEVRID 463


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 116/378 (30%), Positives = 188/378 (49%), Gaps = 37/378 (9%)

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFN 137
           S +PI+SG ++ Q+  YIV   IG    TL++  DT +D  WV   PC  C      +FN
Sbjct: 130 SQIPISSGARL-QTLNYIVTVGIGGQNSTLIV--DTGSDLTWVQCLPCRLCYNQQEPLFN 186

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGG---------ACAFNLTYGSSTIA-ANLSQDTI 187
            + S++F +L C +  C  +  PT G           +C + + YG  + +   L  + +
Sbjct: 187 PSNSSSFLSLPCNSPTCVAL-QPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKL 245

Query: 188 SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
           +L    +  + FGC +   G      GL+GL R  LSL++QT +L+ S FSYCLP+   +
Sbjct: 246 TLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPT-TGV 304

Query: 248 SFSGSLRLG-----PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
             SGSL LG            I YT +++NP+ S+ Y++NL  I +G   +++P    + 
Sbjct: 305 GSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP----RL 360

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIV 358
           +   G  +++DSGTV TRL    Y A +  F ++     T       +TC+++     + 
Sbjct: 361 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 420

Query: 359 APTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
            PT+  +F G   M V +      + S A  I CLA A+      ++  +I N QQ+N R
Sbjct: 421 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTM--IIGNYQQKNQR 477

Query: 416 ILYDVPNSRLGVARELCT 433
           ++Y+   S++G A E C+
Sbjct: 478 VIYNSKESKVGFAGEPCS 495


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 129/395 (32%), Positives = 186/395 (47%), Gaps = 49/395 (12%)

Query: 78  ARKSVVPIASGRQIT----QSPT---YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTG 127
           ARK  +  +SG  ++     SPT   Y++   IGTP        DT +D  W    PCT 
Sbjct: 66  ARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS 125

Query: 128 -CVGCSSTVFNSAQSTTFKNLGCQAA---------QCKQVPNPTCGGGACAFNLTYGSST 177
            C    + ++N + STTF  L C ++              P P C   AC +N+TYGS  
Sbjct: 126 QCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSGW 182

Query: 178 IAANLSQDTISLATD-----IVPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQN 231
            +     +T +  +       VPG  FGC   ++G N+    GL+GLGRG LSL++Q   
Sbjct: 183 TSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ--- 239

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS---SLYYVNLLA 285
           L    FSYCL  ++  + + +L LGP   +     +  TP + +P  +   + YY+NL  
Sbjct: 240 LGVPKFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTG 299

Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
           I +G   + IPP A   N     G IIDSGT  T L   AY  VR      V    T  S
Sbjct: 300 ISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGS 359

Query: 346 LG-GFDTCY------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
              G D C+      S P   P++TL F+G ++ LP D+ ++   +G + CLAM    + 
Sbjct: 360 ADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSG-LWCLAMQ---NQ 415

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            +  +N++ N QQQN  ILYD+    L  A   C+
Sbjct: 416 TDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 450


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 100/351 (28%), Positives = 170/351 (48%), Gaps = 32/351 (9%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  +++   IG PA      +DT +D  W    PCT C    + +F+  +S+++  +GC 
Sbjct: 105 SGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCS 164

Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKAT 206
           +  C  +P   C     +C +  TYG  S+    L+ +T +   +  + G  FGC  +  
Sbjct: 165 SGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENE 224

Query: 207 GNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG------ 259
           G+      GL+GLGRG LSL++Q   L ++ FSYCL S +    S SL +G +       
Sbjct: 225 GDGFSQGSGLVGLGRGPLSLISQ---LKETKFSYCLTSIEDSEASSSLFIGSLASGIVNK 281

Query: 260 -----QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
                  +  K   LL+NP + S YY+ L  I VG + + +     + +     G IIDS
Sbjct: 282 TGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDS 341

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGM 369
           GT  T L   A+  +++ F  R+   +  +   G D C+ +P     I  P +   F G 
Sbjct: 342 GTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFKGA 401

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           ++ LP +N ++  ++  + CLAM ++     + +++  N+QQQN  +L+D+
Sbjct: 402 DLELPGENYMVADSSTGVLCLAMGSS-----NGMSIFGNVQQQNFNVLHDL 447


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 121/392 (30%), Positives = 191/392 (48%), Gaps = 29/392 (7%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           K L+  E +   + + + RLQ   ++A+   S   I     +  +  ++++  IGTP +T
Sbjct: 51  KNLTKFERIQHGVKRGRHRLQRFKAMALVASSNSEI-DAPVLPGNGEFLMKLAIGTPPET 109

Query: 110 LLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA 166
               MDT +D  W    PCT C    + +F+  +S++F  L C +  C+ +P  TC  G 
Sbjct: 110 YSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCSDG- 168

Query: 167 CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLS 224
           C +   YG  S+    L+ +T++     VP   FGC +   G+      GL+GLGRG LS
Sbjct: 169 CEYLYGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLS 228

Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK----RIKYTPLLKNPRRSSLYY 280
           L++Q   L +  FSYCL S      S +L +G +   K     IK TPL++N  + S YY
Sbjct: 229 LVSQ---LKEPKFSYCLTSVDDTKAS-TLLMGSLASVKASDSEIKTTPLIQNSAQPSFYY 284

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           ++L  I VG   + I             G IIDSGT  T L   A+  V   F  ++  N
Sbjct: 285 LSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQI--N 342

Query: 341 LTVTSLG--GFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
           L V + G  G + C+++P     I  P +   F G ++ LP +N +I   +  + CLAM 
Sbjct: 343 LPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGADLELPAENYMIADASMGVACLAMG 402

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           ++     S +++  N+QQQN  +L+D+    L
Sbjct: 403 SS-----SGMSIFGNIQQQNMLVLHDLEKETL 429


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 101/351 (28%), Positives = 169/351 (48%), Gaps = 32/351 (9%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  +++   IG PA      +DT +D  W    PCT C    + +F+  +S+++  +GC 
Sbjct: 104 SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCS 163

Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKAT 206
           +  C  +P   C     AC +  TYG  S+    L+ +T +   +  + G  FGC  +  
Sbjct: 164 SGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENE 223

Query: 207 GNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG------ 259
           G+      GL+GLGRG LSL++Q   L ++ FSYCL S +    S SL +G +       
Sbjct: 224 GDGFSQGSGLVGLGRGPLSLISQ---LKETKFSYCLTSIEDSEASSSLFIGSLASGIVNK 280

Query: 260 -----QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
                  +  K   LL+NP + S YY+ L  I VG + + +     +       G IIDS
Sbjct: 281 TGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDS 340

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGM 369
           GT  T L   A+  +++ F  R+   +  +   G D C+ +P     I  P +   F G 
Sbjct: 341 GTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGA 400

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           ++ LP +N ++  ++  + CLAM ++     + +++  N+QQQN  +L+D+
Sbjct: 401 DLELPGENYMVADSSTGVLCLAMGSS-----NGMSIFGNVQQQNFNVLHDL 446


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 129/395 (32%), Positives = 186/395 (47%), Gaps = 49/395 (12%)

Query: 78  ARKSVVPIASGRQIT----QSPT---YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTG 127
           ARK  +  +SG  ++     SPT   Y++   IGTP        DT +D  W    PCT 
Sbjct: 64  ARKLALAASSGATVSAPTQNSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS 123

Query: 128 -CVGCSSTVFNSAQSTTFKNLGCQAA---------QCKQVPNPTCGGGACAFNLTYGSST 177
            C    + ++N + STTF  L C ++              P P C   AC +N+TYGS  
Sbjct: 124 QCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSGW 180

Query: 178 IAANLSQDTISLAT-----DIVPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQN 231
            +     +T +  +       VPG  FGC   ++G N+    GL+GLGRG LSL++Q   
Sbjct: 181 TSVFQGSETFTFGSTPAGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ--- 237

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS---SLYYVNLLA 285
           L    FSYCL  ++  + + +L LGP   +     +  TP + +P  +   + YY+NL  
Sbjct: 238 LGVPKFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTG 297

Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
           I +G   + IPP A   N     G IIDSGT  T L   AY  VR      V    T  S
Sbjct: 298 ISLGTTALSIPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGS 357

Query: 346 LG-GFDTCY------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
              G D C+      S P   P++TL F+G ++ LP D+ ++   +G + CLAM    + 
Sbjct: 358 AATGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSG-LWCLAMQ---NQ 413

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            +  +N++ N QQQN  ILYD+    L  A   C+
Sbjct: 414 TDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 448


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 116/379 (30%), Positives = 188/379 (49%), Gaps = 37/379 (9%)

Query: 80  KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVF 136
            S +PI+SG ++ Q+  YIV   IG    TL++  DT +D  WV   PC  C      +F
Sbjct: 50  DSQIPISSGARL-QTLNYIVTVGIGGQNSTLIV--DTGSDLTWVQCLPCRLCYNQQEPLF 106

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCGGG---------ACAFNLTYGSSTIA-ANLSQDT 186
           N + S++F +L C +  C  +  PT G           +C + + YG  + +   L  + 
Sbjct: 107 NPSNSSSFLSLPCNSPTCVAL-QPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEK 165

Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           ++L    +  + FGC +   G      GL+GL R  LSL++QT +L+ S FSYCLP+   
Sbjct: 166 LTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPT-TG 224

Query: 247 LSFSGSLRLG-----PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
           +  SGSL LG            I YT +++NP+ S+ Y++NL  I +G   +++P    +
Sbjct: 225 VGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP----R 280

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PI 357
            +   G  +++DSGTV TRL    Y A +  F ++     T       +TC+++     +
Sbjct: 281 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 340

Query: 358 VAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNH 414
             PT+  +F G   M V +      + S A  I CLA A+      ++  +I N QQ+N 
Sbjct: 341 NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTM--IIGNYQQKNQ 397

Query: 415 RILYDVPNSRLGVARELCT 433
           R++Y+   S++G A E C+
Sbjct: 398 RVIYNSKESKVGFAGEPCS 416


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 129/446 (28%), Positives = 199/446 (44%), Gaps = 59/446 (13%)

Query: 31  SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS--- 81
           ++ + + H   PCSP   +  KP S  E    +LA DQ R + +    S+ A  R     
Sbjct: 89  TTRMTIVHRHGPCSPLAAAHRKPPSHGE----ILAADQNRAESIQHRVSTTATGRGKPKR 144

Query: 82  ---------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
                                 +P +SGR +     Y+V   +GTPA    +  DT +D 
Sbjct: 145 SRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPASRYTVVFDTGSDT 203

Query: 121 AWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
            WV C  CV         +F+ A+S+T+ N+ C A  C  +    C GG C + + YG  
Sbjct: 204 TWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYGDG 263

Query: 177 TIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
           + +    + DT++L++ D V G+ FGC ++  G      GLLGLGRG  SL  QT + Y 
Sbjct: 264 SYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYG 323

Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
             F++CLP+    +       G +   +    TP+L      + YYV +  IRVG +++ 
Sbjct: 324 GVFAHCLPARSTGTGYLDFGAGSLAAARARLTTPMLTE-NGPTFYYVGMTGIRVGGQLLS 382

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFD 350
           IP           AGTI+DSGTV TRL   AY+++R         R        SL   D
Sbjct: 383 IPQSVFAT-----AGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSL--LD 435

Query: 351 TCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           TCY       +  PT++L+F G        + ++++ + S  CLA AA  D  +  + ++
Sbjct: 436 TCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGD--VGIV 493

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N Q +   + YD+    +G     C
Sbjct: 494 GNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 123/416 (29%), Positives = 202/416 (48%), Gaps = 45/416 (10%)

Query: 49  SKPLSWEESVLEMLAKDQARLQFL---------SSLAVARKSVVPIASGRQITQSPTYIV 99
            K + W   + + L  D  R++ +         S    A ++ +P++SG  + Q+  YIV
Sbjct: 9   EKKIDWNRRLQKQLISDDLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINL-QTLNYIV 67

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQ 156
              +G+   T+++  DT +D  WV C  C+ C +    +F  + S++++++ C ++ C+ 
Sbjct: 68  TMGLGSTNMTVII--DTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125

Query: 157 VPNPTCGGGACAFN-------LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
           +   T   GAC  N       + YG  S     L  + +S     V  + FGC +   G 
Sbjct: 126 LQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGGVSVSDFVFGCGRNNKGL 185

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----I 264
                GL+GLGR  LSL++QT   +   FSYCLP+ ++ + SGSL +G      +    I
Sbjct: 186 FGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGA-SGSLVMGNESSVFKNVTPI 244

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
            YT +L NP+ S+ Y +NL  I       D+   ALQ       G +IDSGTV TRL + 
Sbjct: 245 TYTRMLPNPQLSNFYILNLTGI-------DVDGVALQVPSFGNGGVLIDSGTVITRLPSS 297

Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQD---N 377
            Y A++ +F ++     +       DTC+++     +  PTI++ F G N  L  D    
Sbjct: 298 VYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEG-NAELKVDATGT 356

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             +     S  CLA+A+  D  ++   +I N QQ+N R++YD   S++G A E C+
Sbjct: 357 FYVVKEDASQVCLALASLSDAYDTA--IIGNYQQRNQRVIYDTKQSKVGFAEESCS 410


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 128/408 (31%), Positives = 183/408 (44%), Gaps = 52/408 (12%)

Query: 67  ARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
           AR Q   S A A    V   + + +     YI+   IGTP  +     DT +D  W  C 
Sbjct: 57  AREQLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCA 116

Query: 127 -----------GCVGCSSTVFNSAQSTTFKNLGCQ------AAQCKQVPNPTCGGGACAF 169
                       C   S  ++N + STTF  L C       AA     P P C   AC +
Sbjct: 117 PCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGC---ACMY 173

Query: 170 NLTYGSSTIAANLSQDTISLATDI------VPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
           N TYG+   A   S +T +  +        VP   FGC   ++ +     GL+GLGRGS+
Sbjct: 174 NQTYGTGWTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSM 233

Query: 224 SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-----IGQPKRIKYTPLLKNPRR--- 275
           SL++Q   L    FSYCL  F+  + + +L LGP     +     ++ TP +  P +   
Sbjct: 234 SLVSQ---LGAGAFSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPM 290

Query: 276 SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
           S+ YY+NL  I VG   + IPP A         G IIDSGT  T LV  AY  VR   R 
Sbjct: 291 STYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRS 350

Query: 336 RVGSNLTVT----SLGGFDTCYSV-----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAG 385
            + + L +        G D C+++     P   P++TL F  G ++ LP +N +I  +  
Sbjct: 351 LLVTRLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMILGSG- 409

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            + CLAM    +     ++++ N QQQN  +LYDV    L  A  +C+
Sbjct: 410 -VWCLAMR---NQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVCS 453


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 132/429 (30%), Positives = 198/429 (46%), Gaps = 55/429 (12%)

Query: 48  PSKPLSWEESVLEMLAKDQARLQFL---------SSLAVARKSVVPIASGRQITQSPTYI 98
           P  P++ +  +  +LA D++R             S+   +  + VP+ SG ++ Q+  Y+
Sbjct: 87  PEDPVARDRYLRRLLAADESRANSFQPRRNKDRASASTQSASAEVPLTSGIRL-QTLNYV 145

Query: 99  VRAKIG----TPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
               +G    +PA  L + +DT +D  WV   PC+ C      +F+ A S T+  + C A
Sbjct: 146 TTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNA 205

Query: 152 AQCKQ-------VPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTF 199
           + C          P      GA    C + L YG  + +   L+ DT++L    + G+ F
Sbjct: 206 SACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVF 265

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           GC     G      GL+GLGR  LSL++QT + Y   FSYCLP+  +   SGSL LG   
Sbjct: 266 GCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGD 325

Query: 260 QPKR-------IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
                      + YT ++ +P +   Y++N+    VG         AL       +  +I
Sbjct: 326 DAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGT-------ALAAQGLGASNVLI 378

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF---DTCYSV----PIVAPTITLM 365
           DSGTV TRL    Y AVR  F R+ G+     +  GF   DTCY +     +  P +TL 
Sbjct: 379 DSGTVITRLAPSVYRAVRAEFMRQFGA-AGYPAAPGFSILDTCYDLTGHDEVKVPLLTLR 437

Query: 366 FS-GMNVTLPQDNLL-IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
              G +VT+    +L +    GS  CLAMA+   +      +I N QQ+N R++YD   S
Sbjct: 438 LEGGADVTVDAAGMLFVVRKDGSQVCLAMASL--SYEDETPIIGNYQQKNKRVVYDTLGS 495

Query: 424 RLGVARELC 432
           RLG A E C
Sbjct: 496 RLGFADEDC 504


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 130/428 (30%), Positives = 199/428 (46%), Gaps = 27/428 (6%)

Query: 23  PICDTQDHSS----TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
           PI +  ++SS     L++FH       F P  P  ++E +    ++D  R+  L  L  +
Sbjct: 58  PIFELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERI----SRDSKRVSSLLRLLSS 113

Query: 79  RKSVVPIASGRQITQ-----SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVG 130
                    G  +       S  Y VR  +G+P ++  + +D+ +D  WV   PC+ C  
Sbjct: 114 GSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQ 173

Query: 131 CSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL 189
            S  VF+ A S T+  + C ++ C ++ N  C  G C + ++YG  S     L+ +T++ 
Sbjct: 174 QSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTF 233

Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
              ++     GC     G  +   GLLGLG G++S + Q        FSYCL S +    
Sbjct: 234 GRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVS-RGTES 292

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
           +G+L  G    P    + PL++NPR  S YYV L  + VG   V IP    +       G
Sbjct: 293 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 352

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLM 365
            ++D+GT  TRL APAY A RD F  +  +      +  FDTCY+    V +  PT++  
Sbjct: 353 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 412

Query: 366 FSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           FSG  + TLP  N LI        C A AA+     S L++I N+QQ+  +I  D  N  
Sbjct: 413 FSGGPILTLPARNFLIPVDGEGTFCFAFAASA----SGLSIIGNIQQEGIQISIDGSNGF 468

Query: 425 LGVARELC 432
           +G    +C
Sbjct: 469 VGFGPTIC 476


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 132/426 (30%), Positives = 198/426 (46%), Gaps = 47/426 (11%)

Query: 22  NPICDTQDHSST-LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR- 79
           +P+   Q+ + T L++ H   PC+P + S   +   SV + L  DQ R + +      R 
Sbjct: 53  DPVAPQQNDTFTVLRLTHRHGPCAPLRASSLAA--PSVADTLRADQRRAEHILRRVSGRG 110

Query: 80  ----------KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
                      + VP   G  I  S  Y+V A +GTP     + +DT +D +WV C  C 
Sbjct: 111 APQLWDYKAAAATVPANWGYDIGTS-NYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCA 169

Query: 130 GCS-----STVFNSAQSTTFKNLGCQAAQCKQ--VPNPTCGGGACAFNLTYG-SSTIAAN 181
             S       +F+ AQS+++  + C  + C    +    C    C + ++YG  S     
Sbjct: 170 APSCYRQKDPLFDPAQSSSYAAVPCGRSACAGLGIYASACSAAQCGYVVSYGDGSNTTGV 229

Query: 182 LSQDTISLATD-IVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSY 239
            S DT++LA +  V G+ FGC    +G       GLLG GR   SL+ QT   Y   FSY
Sbjct: 230 YSSDTLTLAANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSY 289

Query: 240 CLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           CLP+    S +G L LG P G       T LL +P   + Y V L  I VG + + +P  
Sbjct: 290 CLPTKS--STTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPAS 347

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-- 356
           A        AGT++D+GTV TRL   AY A+R  FR  + S  +   +G  DTCYS    
Sbjct: 348 AFA------AGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGY 401

Query: 357 --IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
             +   ++ L F SG  +TL  D ++      S  CLA A++    +  + ++ N+QQ++
Sbjct: 402 GTVNLTSVALTFSSGATMTLGADGIM------SFGCLAFASS--GSDGSMAILGNVQQRS 453

Query: 414 HRILYD 419
             +  D
Sbjct: 454 FEVRID 459


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 125/437 (28%), Positives = 207/437 (47%), Gaps = 46/437 (10%)

Query: 24  ICDTQDH----SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR 79
           +CD  +     +S+L+V   + PC+     K      S  E+L +DQ R++ + +     
Sbjct: 53  VCDHSNKVLNKASSLKVVSKYGPCTVTGDPKTF---PSAAEILRRDQLRVKSIRAKHSMN 109

Query: 80  KSVVPIASGRQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C- 131
            S   + +    T+ PT      Y V   +GTP +   +  DT +D  W  C  C G C 
Sbjct: 110 SSTTGVFN-EMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCF 168

Query: 132 --SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG----GACAFNLTYGSSTIAANLSQD 185
             +   F+  +ST++KNL C +  CK +   +  G     +C + + YG+      L+ +
Sbjct: 169 PQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYGTGYTVGFLATE 228

Query: 186 TISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           T+++  +D+   +  GC ++  G      GLLGLGR  ++L +QT + Y++ FSYCLP+ 
Sbjct: 229 TLTITPSDVFENFVIGCGERNGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCLPA- 287

Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
            + S +G L  G  G  +  K+TP+    +   LY +++  I VG R + I P   +   
Sbjct: 288 -SSSSTGHLSFGG-GVSQAAKFTPITS--KIPELYGLDVSGISVGGRKLPIDPSVFRT-- 341

Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSVP------I 357
              AGTIIDSGT  T L + A++A+   F+  + +N T+T    G   CY         I
Sbjct: 342 ---AGTIIDSGTTLTYLPSTAHSALSSAFQEMM-TNYTLTKGTSGLQPCYDFSKHANDNI 397

Query: 358 VAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV-LNVIANMQQQNHR 415
             P I++ F  G+ V +    + I +      CLA     DN N   + +  N+QQ+ + 
Sbjct: 398 TIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFK---DNGNDTDVAIFGNVQQKTYE 454

Query: 416 ILYDVPNSRLGVARELC 432
           ++YDV    +G A   C
Sbjct: 455 VVYDVAKGMVGFAPGGC 471


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 125/410 (30%), Positives = 199/410 (48%), Gaps = 34/410 (8%)

Query: 41  SPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIV 99
           SP SPF P   +S  E     + + Q RL+ L       K+V  P+ +G        +++
Sbjct: 64  SPLSPFSPGN-ISSTERFKRAIKRSQDRLEKLQMSVDEVKAVEAPVYAGNG-----EFLM 117

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ 156
           +  IGTP+ +    +DT +D  W    PCT C    + +++ +QS+T+  + C ++ C+ 
Sbjct: 118 KMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMCQA 177

Query: 157 VPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGL 215
           +P  +C G  C +  +YG  S+    LS ++ +L +  +P   FGC Q+  G      G 
Sbjct: 178 LPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQSLPHIAFGCGQENEGGGFSQGGG 237

Query: 216 LGLGRGS-LSLLAQTQNLYQSTFSYCLPSF-KALSFSGSLRLGPIG--QPKRIKYTPLLK 271
           L       LSL++Q      + FSYCL S   + S +  L +G       K +  TPL++
Sbjct: 238 LVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLNAKTVSSTPLVQ 297

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           +  R + YY++L  I VG +++DI  G          G IIDSGT  T L    Y    D
Sbjct: 298 SRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTYLEQSGY----D 353

Query: 332 VFRRRVGSNLTVTSLG----GFDTCY-----SVPIVAPTITLMFSGMNVTLPQDNLLIHS 382
           V ++ V S++ +  +     G D C+     S     PTIT  F G +  LP++N +   
Sbjct: 354 VVKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTSHFPTITFHFEGADFNLPKENYIYTD 413

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           ++G I CLAM   P N    +++  N+QQQN++ILYD   + L  A  +C
Sbjct: 414 SSG-IACLAM--LPSN---GMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 131/428 (30%), Positives = 202/428 (47%), Gaps = 49/428 (11%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-------------SLAV 77
           ++T+ + H   PCSP    K  S E+     L +DQ R  ++              +  V
Sbjct: 56  ATTVPLHHRHGPCSPLPTKKMPSLED----RLHRDQLRAAYIKRKFSGDVKKDGQGAGGV 111

Query: 78  ARKSV-VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV- 135
            +  V VP   G  +  +  Y++  ++G+PA+T  + +D+ +D +WV C  C+ C S V 
Sbjct: 112 EQSHVTVPTTLGTSL-NTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVD 170

Query: 136 --FNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTY--GSSTIAANLSQDTIS 188
             F+ + S+T+    C +A C Q+     G      C + + Y  GSST     S DT++
Sbjct: 171 PLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTG-TYSSDTLA 229

Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
           L ++ +  + FGC    +G +    GL+GLG G+ SL +QT   + + FSYCLP     S
Sbjct: 230 LGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLP--PTPS 287

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
            SG L LG  G    +K TP+L++    + Y V L AIRVG   + IP           A
Sbjct: 288 SSGFLTLG-AGTSGFVK-TPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFS------A 339

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITL 364
           G ++DSGT+ TRL   AY+A+   F+  +            DTC+       +  P++ L
Sbjct: 340 GMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVAL 399

Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           +FSG  V     N +I        CLA AA  D  +S   ++ N+QQ+   +LYDV    
Sbjct: 400 VFSGGAVVNLDANGIILG-----NCLAFAANSD--DSSPGIVGNVQQRTFEVLYDVGGGA 452

Query: 425 LGVARELC 432
           +G     C
Sbjct: 453 VGFKAGAC 460


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 130/445 (29%), Positives = 202/445 (45%), Gaps = 64/445 (14%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA-----------RK-- 80
           L + H  SPCSP     PL  +     +L  D AR+  L+S   A           RK  
Sbjct: 46  LTLHHPQSPCSP----APLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQK 101

Query: 81  -----------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
                            + VP++ G  +     Y+ +  +GTP+ +  M +DT +   W+
Sbjct: 102 KAAGGASGGHHLDDDSLASVPLSPGTSVGVG-NYVTQLGLGTPSTSYAMVVDTGSSLTWL 160

Query: 124 PCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCKQV------PNPTCGGGACAFNLTY 173
            C+ CV  C   V   F+   S+T+ ++ C A+QC ++      P+       C +  +Y
Sbjct: 161 QCSPCVVSCHRQVGPLFDPRASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQASY 220

Query: 174 GSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
           G S+ +  +LS DT+S  +   P + +GC Q   G      GL+GL R  LSLL Q    
Sbjct: 221 GDSSFSVGSLSTDTVSFGSTRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPS 280

Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
              +FSYCLP+  +   +G L +GP        YTP+  +   +SLY++ L  + VG   
Sbjct: 281 LGYSFSYCLPTAAS---TGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSP 337

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
           + + P      P     TIIDSGTV TRL    +TA+     + +       +    DTC
Sbjct: 338 LAVSPSEYSSLP-----TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTC 392

Query: 353 Y---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
           +   +  +  PT+ + F+ G ++ L   N+LI     S TCLA   AP +  +   +I N
Sbjct: 393 FEGQASQLRVPTVAMAFAGGASMKLTTRNVLI-DVDDSTTCLAF--APTDSTA---IIGN 446

Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
            QQQ   ++YDV  SR+G +   C+
Sbjct: 447 TQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 121/362 (33%), Positives = 168/362 (46%), Gaps = 34/362 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT----GCVGCSSTVFNSAQSTTFKNLGCQAA 152
           +++   IGTP    L   DT +D  W  C      C    + ++N + STTF  L C ++
Sbjct: 85  FLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSS 144

Query: 153 QCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDI------VPGYTFGCIQKAT 206
               +  P C   AC +N+TYGS         +T +  +        VPG  FGC   ++
Sbjct: 145 --LGLCAPAC---ACMYNMTYGSGWTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASS 199

Query: 207 G-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP---IGQPK 262
           G N+    GL+GLGRGSLSL++Q   L    FSYCL  ++  + + +L LGP   +    
Sbjct: 200 GFNASSASGLVGLGRGSLSLVSQ---LGAPKFSYCLTPYQDTNSTSTLLLGPSASLNDTG 256

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
            +  TP + +P  S  YY+NL  I +G   + IPP A         G IIDSGT  T L 
Sbjct: 257 VVSSTPFVASPS-SIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLG 315

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLG-GFDTCY------SVPIVAPTITLMFSGMNVTLPQ 375
             AY  VR      V    T  S   G D C+      S P   P++TL F G ++ LP 
Sbjct: 316 NTAYQQVRAAVLSLVTLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFDGADMVLPA 375

Query: 376 DNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
           DN ++  +         CLAM    D    V++++ N QQQN  ILYDV    L  A   
Sbjct: 376 DNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAK 435

Query: 432 CT 433
           C+
Sbjct: 436 CS 437


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 135/443 (30%), Positives = 211/443 (47%), Gaps = 78/443 (17%)

Query: 46  FKPSKPLSWEESVLEMLAKDQARLQFL---------------SSLAV-ARKSVVPIASGR 89
           F P+   S EE    +L+ D AR+  L               + +AV A K+ VP++SG 
Sbjct: 77  FSPAPANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVTASKAQVPVSSGA 136

Query: 90  QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKN 146
           ++ ++  Y+    +G    T+++  DT+++  WV C  C  C      +F+ + S ++  
Sbjct: 137 RL-RTLNYVATVGLGGGEATVIV--DTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAA 193

Query: 147 LGCQAAQCKQVPN----------PTCGGG---ACAFNLTYGSSTIAAN-LSQDTISLATD 192
           + C +  C  +            P C  G   AC++ L+Y   + +   L+ D +SLA +
Sbjct: 194 VPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE 253

Query: 193 IVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
           ++ G+ FGC    T N  PP     GL+GLGR  LSL++QT + +   FSYCLP  +   
Sbjct: 254 VIDGFVFGC---GTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESD 310

Query: 249 FSGSLRLGPIGQPKR----IKYT-------PLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
            SGSL LG      R    + YT       PLL+ P     Y VNL  I VG + V+   
Sbjct: 311 ASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGP----FYLVNLTGITVGGQEVE--- 363

Query: 298 GALQFNPTTG--AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
                  +TG  A  I+DSGTV T LV   Y AVR  F  ++            DTC+++
Sbjct: 364 -------STGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNM 416

Query: 356 ----PIVAPTITLMFS-GMNVTLPQDNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANM 409
                +  P++TL+F  G  V +    +L   S+  S  CLA+A+      +  ++I N 
Sbjct: 417 TGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDET--SIIGNY 474

Query: 410 QQQNHRILYDVPNSRLGVARELC 432
           QQ+N R+++D   S++G A+E C
Sbjct: 475 QQKNLRVVFDTSASQVGFAQETC 497


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 135/432 (31%), Positives = 198/432 (45%), Gaps = 38/432 (8%)

Query: 21  LNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARL-QFLSSLAVAR 79
           +N  C+       +Q+ HV          + LS  E +  M  + +AR  + LSS A A 
Sbjct: 24  INSCCNAAAAPVRMQLTHV-------DAGRGLSGRELMRRMALRSKARAPRLLSSSATAP 76

Query: 80  KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
            S      G  +T+   Y++   IGTP Q + + +DT +   W  C  C  C   S   +
Sbjct: 77  VSPGAYDDGVPMTE---YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYY 133

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-CGGGA---CAFNLTYGS-STIAANLSQDTIS-LA 190
           ++++S+TF    C + QCK  P+ T C       CA++ +YG  S     L  +T+S +A
Sbjct: 134 DASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVA 193

Query: 191 TDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
              VPG  FGC    TG     + G+ G GRG LSL +Q   L    FS+C  +      
Sbjct: 194 GASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVSGRKP 250

Query: 250 SGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           S  L   P    K     ++ TPL+KNP   + YY++L  I VG   + +P  A      
Sbjct: 251 STVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNG 310

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-----P 360
           TG GTIIDSGT FT L    Y  V D F   V   +  ++  G   C+S P +      P
Sbjct: 311 TG-GTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVP 369

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            + L F G  + LP++N +  +  G    + +A     +   + +I N QQQN  +LYD+
Sbjct: 370 KLVLHFEGATMHLPRENYVFEAKDGGNCSICLAI----IEGEMTIIGNFQQQNMHVLYDL 425

Query: 421 PNSRLGVARELC 432
            NS+L   R  C
Sbjct: 426 KNSKLSFVRAKC 437


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 117/358 (32%), Positives = 167/358 (46%), Gaps = 19/358 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           PI SG     S  Y  R  IG P+  + M +DT +D  W+ C  C  C   +  +F  A 
Sbjct: 132 PIISGTS-QGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPAS 190

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
           ST++  L C   QC+ +    C    C + ++YG  S    +   +TI+L +  V     
Sbjct: 191 STSYSPLSCDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSASVDNVAI 250

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           GC     G  +   GLLGLG G LS  +Q   +  S+FSYCL    + S S +L      
Sbjct: 251 GCGHNNEGLFIGAAGLLGLGGGKLSFPSQ---INASSFSYCLVDRDSDSAS-TLEFNSAL 306

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
            P  I   PLL+N    + YYV +  + VG  ++ IP    + + +   G IIDSGT  T
Sbjct: 307 LPHAIT-APLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVT 365

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVT-LP 374
           RL   AY A+RD F +        + +  FDTCY +     +  PT+T   +G  V  LP
Sbjct: 366 RLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLP 425

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             N LI   +    C A A      +S L++I N+QQQ  R+ +D+ NS +G     C
Sbjct: 426 ATNYLIPVDSDGTFCFAFAP----TSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 123/386 (31%), Positives = 176/386 (45%), Gaps = 20/386 (5%)

Query: 62  LAKDQARL-QFLSSLAVARKSVVPIASGRQITQ-----SPTYIVRAKIGTPAQTLLMAMD 115
           + +D  R    L  LA  + +    A G  +       S  Y VR  +G+P +   + MD
Sbjct: 95  MQRDTKRAASLLRRLAAGKPTYAAEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVMD 154

Query: 116 TSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
           + +D  WV   PCT C   S  VFN A S++F  + C +  C  V N  C  G C + ++
Sbjct: 155 SGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCASTVCSHVDNAACHEGRCRYEVS 214

Query: 173 YGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
           YG  S     L+ +TI+    ++     GC     G  V   GLLGLG G +S + Q   
Sbjct: 215 YGDGSYTKGTLALETITFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGG 274

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
                FSYCL S + +  SG L  G    P    + PL+ NPR  S YY+ L  + VG  
Sbjct: 275 QTGGAFSYCLVS-RGIESSGLLEFGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGL 333

Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT 351
            V I     + +     G ++D+GT  TRL   AY A RD F  +  +    + +  FDT
Sbjct: 334 RVSISEDVFKLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDT 393

Query: 352 CYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           CY     V +  PT++  FSG  + TLP  N LI        C A A +    +S L++I
Sbjct: 394 CYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPS----SSGLSII 449

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N+QQ+  +I  D  N  +G    +C
Sbjct: 450 GNIQQEGIQISVDGANGFVGFGPNVC 475


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 130/445 (29%), Positives = 201/445 (45%), Gaps = 64/445 (14%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA-----------RK-- 80
           L + H  SPCSP     PL  +     +L  D AR+  L+S   A           RK  
Sbjct: 46  LTLHHPQSPCSP----APLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQK 101

Query: 81  -----------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
                            + VP++ G  +     Y+ +  +GTP+ +  M +DT +   W+
Sbjct: 102 KAAGGASGGHHLDDDSLASVPLSPGTSVGVG-NYVTQLGLGTPSTSYAMVVDTGSSLTWL 160

Query: 124 PCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCKQV------PNPTCGGGACAFNLTY 173
            C+ CV  C   V   F+   S+T+ ++ C A+QC ++      P+       C +  +Y
Sbjct: 161 QCSPCVVSCHRQVGPLFDPRASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQASY 220

Query: 174 GSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
           G S+ +   LS DT+S  +   P + +GC Q   G      GL+GL R  LSLL Q    
Sbjct: 221 GDSSFSVGYLSTDTVSFGSTSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPS 280

Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
              +FSYCLP+  +   +G L +GP        YTP+  +   +SLY++ L  + VG   
Sbjct: 281 LGYSFSYCLPTAAS---TGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSP 337

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
           + + P      P     TIIDSGTV TRL    +TA+     + +       +    DTC
Sbjct: 338 LAVSPSEYSSLP-----TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTC 392

Query: 353 Y---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
           +   +  +  PT+ + F+ G ++ L   N+LI     S TCLA   AP +  +   +I N
Sbjct: 393 FEGQASQLRVPTVVMAFAGGASMKLTTRNVLI-DVDDSTTCLAF--APTDSTA---IIGN 446

Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
            QQQ   ++YDV  SR+G +   C+
Sbjct: 447 TQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 100/346 (28%), Positives = 166/346 (47%), Gaps = 32/346 (9%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
           +   IG PA      +DT +D  W    PCT C    + +F+  +S+++  +GC +  C 
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60

Query: 156 QVPNPTCG--GGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKATGNSVP 211
            +P   C     AC +  TYG  S+    L+ +T +   +  + G  FGC  +  G+   
Sbjct: 61  ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFS 120

Query: 212 P-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG----------- 259
              GL+GLGRG LSL++Q   L ++ FSYCL S +    S SL +G +            
Sbjct: 121 QGSGLVGLGRGPLSLISQ---LKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASL 177

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
             +  K   LL+NP + S YY+ L  I VG + + +     +       G IIDSGT  T
Sbjct: 178 DGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTIT 237

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLP 374
            L   A+  +++ F  R+   +  +   G D C+ +P     I  P +   F G ++ LP
Sbjct: 238 YLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLELP 297

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            +N ++  ++  + CLAM ++     + +++  N+QQQN  +L+D+
Sbjct: 298 GENYMVADSSTGVLCLAMGSS-----NGMSIFGNVQQQNFNVLHDL 338


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 128/409 (31%), Positives = 184/409 (44%), Gaps = 42/409 (10%)

Query: 60  EMLAKDQARLQFLSSLAVARKSV----VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMD 115
           + LA D  RL FLS   + RK V     P+ SG   + S  Y V  +IG P Q+LL+  D
Sbjct: 46  QALALDTRRLHFLS---LRRKPVPFVKSPVVSGAS-SGSGQYFVDLRIGQPPQSLLLIAD 101

Query: 116 TSNDAAWVPCTGCVGCS----STVFNSAQSTTFKNLGCQAAQCKQVPNP--------TCG 163
           T +D  WV C+ C  CS    +TVF    S+TF    C    C+ VP P        T  
Sbjct: 102 TGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRI 161

Query: 164 GGACAFNLTYGSSTIAANL-SQDTISLATD-----IVPGYTFGCIQKATGNSVP------ 211
              C +   Y   ++ + L +++T SL T       +    FGC  + +G SV       
Sbjct: 162 HSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNG 221

Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIGQP-KRIKYTPL 269
             G++GLGRG +S  +Q    + + FSYCL  +  +   +  L +G  G    ++ +TPL
Sbjct: 222 ANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPL 281

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           L NP   + YYV L ++ V    + I P   + + +   GT++DSGT    L  PAY  V
Sbjct: 282 LTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLV 341

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLLIHST 383
               ++R+          GFD C +V  V       P +   FSG  V +P        T
Sbjct: 342 IAAVKQRIKLPNADELTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIET 401

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              I CLA+ +    V    +VI N+ QQ     +D   SRLG +R  C
Sbjct: 402 EEQIQCLAIQSVDPKVG--FSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 114/348 (32%), Positives = 178/348 (51%), Gaps = 15/348 (4%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-TVFNSAQSTTFKNLGCQAA 152
           S  Y+++  IGTPA +L   MDT +D  W  C  C  CS+ ++++ + S+T+  + CQ++
Sbjct: 39  SGEYLIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSSIYDPSSSSTYSKVLCQSS 98

Query: 153 QCKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSV 210
            C+     +C   G C +   YG  S+ +  LS +T S+++  +P  TFGC     G   
Sbjct: 99  LCQPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSISSQSLPNITFGCGHDNQGFD- 157

Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTP 268
              GL+G GRGSLSL++Q      + FSYCL S    S +  L +G     +   +  TP
Sbjct: 158 KVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTP 217

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L+++   ++ YY++L  I VG + + IP G          G IIDSGT  T L   AY A
Sbjct: 218 LVQS-SSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDA 276

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIHSTA 384
           V++     +  NL     G  D C++    +    P++T  F G +  +P++N L   + 
Sbjct: 277 VKEAMVSSI--NLPQAD-GQLDLCFNQQGSSNPGFPSMTFHFKGADYDVPKENYLFPDST 333

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             I CLAM     N+ + + +  N+QQQN++ILYD  N+ L  A   C
Sbjct: 334 SDIVCLAMMPTNSNLGN-MAIFGNVQQQNYQILYDNENNVLSFAPTAC 380


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 119/354 (33%), Positives = 166/354 (46%), Gaps = 22/354 (6%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  IG+P +   + MDT +D  W+ C+ C  C   +  VF+   S++F+ L C 
Sbjct: 11  SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCS 70

Query: 151 AAQCKQVPNPTCGG--GACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATG 207
             QCK +    C      C + ++YG  +    +L+ D+  ++        FGC     G
Sbjct: 71  TPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTSPVVFGCGHDNEG 130

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIGQP--KRI 264
             V   GLLGLG G LS  +Q   L    FSYCL S    +  S +L  G    P     
Sbjct: 131 LFVGAAGLLGLGAGKLSFPSQ---LSSRKFSYCLVSRDNGVRASSALLFGDSALPTSASF 187

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVA 323
            YT LLKNP+  + YY  L  I +G  ++ IP  A + + +TG  G IIDSGT  TRL  
Sbjct: 188 AYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPT 247

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFS-GMNVTLPQDNL 378
            AYT +RD FR              FDTCY       +  PT++  F  G +V LP  N 
Sbjct: 248 YAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNY 307

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           L+        C A +    +    L++I N+QQQ  R+  D+ +SR+G A   C
Sbjct: 308 LVPVDTSGTFCFAFSKTSLD----LSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  157 bits (397), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 116/387 (29%), Positives = 186/387 (48%), Gaps = 33/387 (8%)

Query: 52  LSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQTL 110
           L+  E +   + + + R++ ++++  +   +  P+ +G     S  Y++   IGTPA +L
Sbjct: 55  LTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAG-----SGEYLMNVAIGTPASSL 109

Query: 111 LMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGAC 167
              MDT +D  W    PCT C    + +FN   S++F  L C++  C+ +P+ +C    C
Sbjct: 110 SAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSESCYN-DC 168

Query: 168 AFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQK----ATGNSVPPQGLLGLGRGS 222
            +   YG  S+    ++ +T +  T  VP   FGC +       GN     GL+G+G G 
Sbjct: 169 QYTYGYGDGSSTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGA---GLIGMGWGP 225

Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI--GQPKRIKYTPLLKNPRRSSLYY 280
           LSL +Q   L    FSYC+ +    S   +L LG    G P+    T L+ +    + YY
Sbjct: 226 LSLPSQ---LGVGQFSYCM-TSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYY 281

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           + L  I VG   + IP    Q       G IIDSGT  T L   AY AV   F  ++  +
Sbjct: 282 ITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLS 341

Query: 341 LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
               S  G  TC+ +P     +  P I++ F G  + L ++N+LI S A  + CLAM ++
Sbjct: 342 PVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNLGEENVLI-SPAEGVICLAMGSS 400

Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPN 422
                  +++  N+QQQ  ++LYD+ N
Sbjct: 401 SQQ---GISIFGNIQQQETQVLYDLQN 424


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  157 bits (397), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 122/371 (32%), Positives = 186/371 (50%), Gaps = 40/371 (10%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GC---SSTVFNSA 139
           P+  G  I  S  Y V+  +G+PA+   M +DT +  +W+ C  CV  C   +  +F+ +
Sbjct: 1   PLNPGASI-GSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPS 59

Query: 140 QSTTFKNLGCQAAQCKQV-----PNPTC--GGGACAFNLTYGSSTIAAN-LSQDTISLA- 190
            S T+K+L C ++QC  +      NP C      C +  +YG S+ +   LSQD ++LA 
Sbjct: 60  ASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAP 119

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
           +  +PG+ +GC Q + G      G+LGLGR  LS+L Q  + +   FSYCLP+     F 
Sbjct: 120 SQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGF- 178

Query: 251 GSLRLGPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
             L +G         K+TP+  +P   SLY++ L AI VG R + +   A Q+       
Sbjct: 179 --LSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGV--AAAQYR----VP 230

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF---DTCYSVPI----VAPTI 362
           TIIDSGTV TRL    YT  +  F + + S        GF   DTC+   +      P +
Sbjct: 231 TIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAP--GFSILDTCFKGNLKDMQSVPEV 288

Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
            L+F  G ++ L   N+L+    G +TCLA A      N+ + +I N QQQ  ++ +D+ 
Sbjct: 289 RLIFQGGADLNLRPVNVLLQVDEG-LTCLAFAG-----NNGVAIIGNHQQQTFKVAHDIS 342

Query: 422 NSRLGVARELC 432
            +R+G A   C
Sbjct: 343 TARIGFATGGC 353


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/380 (32%), Positives = 182/380 (47%), Gaps = 50/380 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG---------CSST-VFNSAQSTTFKN 146
           Y+V    GTP Q +L+  DT +D  W+ C+             CS    F +++S T   
Sbjct: 53  YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 112

Query: 147 LGCQAAQCKQVPNPTCGGGAC----------AFNLTYGSSTIAANLSQDTISLATDI--- 193
           + C AAQC  VP P   G AC          A++   GSST    L++DT +++      
Sbjct: 113 VPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTG-FLARDTATISNGTSGG 171

Query: 194 --VPGYTFGCIQKATGNSVPPQG-LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
             V G  FGC  +  G S    G ++GLG+G LS  AQ+ +L+  TFSYCL   +     
Sbjct: 172 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 231

Query: 251 GSLRLGPIGQPKR---IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
            S     +G+P+R     YTPL+ NP   + YYV ++AIRVG RV+ +P      +    
Sbjct: 232 RSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGN 291

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRR-----RVGSNLTVTSLGGFDTCYSVPIVA--- 359
            GT+IDSG+  T L   AY  +   F       R+ S+ T     G + CY+V   +   
Sbjct: 292 GGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATF--FQGLELCYNVSSSSSSA 349

Query: 360 ------PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
                 P +T+ F+ G+++ LP  N L+   A  + CLA+   P       NV+ N+ QQ
Sbjct: 350 PANGGFPRLTIDFAQGLSLELPTGNYLV-DVADDVKCLAI--RPTLSPFAFNVLGNLMQQ 406

Query: 413 NHRILYDVPNSRLGVARELC 432
            + + +D  ++R+G AR  C
Sbjct: 407 GYHVEFDRASARIGFARTEC 426


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 128/446 (28%), Positives = 199/446 (44%), Gaps = 59/446 (13%)

Query: 31  SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS--- 81
           ++ + + H   PCSP   +  KP S  E    +LA DQ R + +    S+ A  R     
Sbjct: 87  TTRMTIVHRHGPCSPLAAAHRKPPSHGE----ILAADQNRAESIQHRVSTTATGRGKPKR 142

Query: 82  ---------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
                                 +P +SGR +     Y+V   +GTPA    +  DT +D 
Sbjct: 143 SRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPASRYTVVFDTGSDT 201

Query: 121 AWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
            WV C  CV         +F+  +S+T+ N+ C A  C  +    C GG C + + YG  
Sbjct: 202 TWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYGDG 261

Query: 177 TIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
           + +    + DT++L++ D V G+ FGC ++  G      GLLGLGRG  SL  QT + Y 
Sbjct: 262 SYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYG 321

Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
             F++CLP+    +       G          TP+L +    + YY+ +  IRVG +++ 
Sbjct: 322 GVFAHCLPARSTGTGYLDFGAGSPAAASARLTTPMLTD-NGPTFYYIGMTGIRVGGQLLS 380

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFD 350
           IP           AGTI+DSGTV TRL  PAY+++R         R        SL   D
Sbjct: 381 IPQSVFAT-----AGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSL--LD 433

Query: 351 TCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           TCY       +  PT++L+F G        + ++++ + S  CLA AA  D  +  + ++
Sbjct: 434 TCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGD--VGIV 491

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N Q +   + YD+    +G    +C
Sbjct: 492 GNTQLKTFGVAYDIGKKVVGFYPGVC 517


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/361 (33%), Positives = 178/361 (49%), Gaps = 26/361 (7%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y  +  +GTPA T LM +DT +D  W+   PC  C   S  VF+  +S ++  + C 
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCV 178

Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
           A  C+++ +  C     +C + + YG  ++ A + + +T++ A    V     GC     
Sbjct: 179 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNE 238

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------G 259
           G  +   GLLGLGRG LS  +Q    +  +FSYCL    +     S R   +        
Sbjct: 239 GLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 298

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTV 317
                 +TP+ +NPR ++ YYV+LL   V G RV  +    L+ NPTTG  G I+DSGT 
Sbjct: 299 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTS 358

Query: 318 FTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNV 371
            TRL  P Y AVRD FR   VG  ++      FDTCY++     +  PT+++  + G +V
Sbjct: 359 VTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASV 418

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            LP +N LI        C AMA     V    ++I N+QQQ  R+++D    R+G   + 
Sbjct: 419 ALPPENYLIPVDTSGTFCFAMAGTDGGV----SIIGNIQQQGFRVVFDGDAQRVGFVPKS 474

Query: 432 C 432
           C
Sbjct: 475 C 475


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 170/366 (46%), Gaps = 23/366 (6%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG  +  S  Y +R  +GTP + + + MDT +D  W+ C  CV C   S  +F+  +
Sbjct: 46  PVVSGLSL-GSGEYFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYK 104

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD------I 193
           S+T+  LGC   QC  +   TC    C + + YG  +        D +SL +       +
Sbjct: 105 SSTYSTLGCSTRQCLNLDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVV 164

Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS- 252
           +     GC     G  V   GLLGLG+G LS   Q        FSYCL   +  S  GS 
Sbjct: 165 LNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSS 224

Query: 253 LRLGPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
           L  G     P   ++TP   N R  + YY+ +  I VG  ++ IP  A Q +     G I
Sbjct: 225 LVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVI 284

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMF- 366
           IDSGT  TRL   AY ++RD FR              FDTCY +  +A    PT+TL F 
Sbjct: 285 IDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQ 344

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
            G ++ LP  N LI     +  CLA A       +  ++I N+QQQ  R++YD  ++++G
Sbjct: 345 GGTDLKLPASNYLIPVDNSNTFCLAFAGT-----TGPSIIGNIQQQGFRVIYDNLHNQVG 399

Query: 427 VARELC 432
                C
Sbjct: 400 FVPSQC 405


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 113/345 (32%), Positives = 170/345 (49%), Gaps = 29/345 (8%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAW----VPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
           TY+V   IGTP   L   +DT +D  W     PC  C    + ++  A+S T+ N+ C++
Sbjct: 91  TYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRS 150

Query: 152 AQCK--QVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKA 205
             C+  Q P   C      CA+  +YG  T     L+ +T +L +D  V G  FGC  + 
Sbjct: 151 PMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTEN 210

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRI 264
            G++    GL+G+GRG LSL++Q   L  + FSYC   F A + S  L LG   +     
Sbjct: 211 LGSTDNSSGLVGMGRGPLSLVSQ---LGVTRFSYCFTPFNATAAS-PLFLGSSARLSSAA 266

Query: 265 KYTPLLKNP-----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
           K TP + +P     RRSS YY++L  I VG  ++ I P   +  P    G IIDSGT FT
Sbjct: 267 KTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFT 326

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQ 375
            L   A+ A+      RV   L   +  G   C++      +  P + L F G ++ L +
Sbjct: 327 ALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRR 386

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           ++ ++   +  + CL M +A       ++V+ +MQQQN  ILYD+
Sbjct: 387 ESYVVEDRSAGVACLGMVSARG-----MSVLGSMQQQNTHILYDL 426


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/361 (33%), Positives = 178/361 (49%), Gaps = 26/361 (7%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y  +  +GTPA T LM +DT +D  W+   PC  C   S  VF+  +S ++  + C 
Sbjct: 125 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCV 184

Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
           A  C+++ +  C     +C + + YG  ++ A + + +T++ A    V     GC     
Sbjct: 185 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNE 244

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------G 259
           G  +   GLLGLGRG LS  +Q    +  +FSYCL    +     S R   +        
Sbjct: 245 GLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 304

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTV 317
                 +TP+ +NPR ++ YYV+LL   V G RV  +    L+ NPTTG  G I+DSGT 
Sbjct: 305 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTS 364

Query: 318 FTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNV 371
            TRL  P Y AVRD FR   VG  ++      FDTCY++     +  PT+++  + G +V
Sbjct: 365 VTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASV 424

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            LP +N LI        C AMA     V    ++I N+QQQ  R+++D    R+G   + 
Sbjct: 425 ALPPENYLIPVDTSGTFCFAMAGTDGGV----SIIGNIQQQGFRVVFDGDAQRVGFVPKS 480

Query: 432 C 432
           C
Sbjct: 481 C 481


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 127/374 (33%), Positives = 181/374 (48%), Gaps = 42/374 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y V  ++GTPA  +++ MDT +D +W   VPC  CV      FN   S++F  L C ++ 
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 198

Query: 154 CKQV---PNPTCG--GGACAFNLTYGSSTIAANL-SQDTISLAT----DIVP----GYTF 199
           C  V     P C   G  C F++ YG  ++++ L + +TI+  T    D  P      T 
Sbjct: 199 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 258

Query: 200 GCIQ-KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGP 257
           GC      G      GLLG+ R  +S  +Q  + Y   FS+C P   A L+ SG +  G 
Sbjct: 259 GCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGLVFFGE 318

Query: 258 --IGQPKRIKYTPLLKNPRRSS----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-T 310
             I  P  ++YTPL++NP   S     YYV L+ I V    + +       +  TG+G T
Sbjct: 319 SDIISP-YLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGSGGT 377

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--------PIVAPTI 362
           IIDSGT FT L  PA+ A+R  F  R      V    GF  CY++          + P+I
Sbjct: 378 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALESTILPSI 437

Query: 363 TLMF-SGMNVTLPQDNLLI---HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           TL F  G++V LP++++LI    S   +  CLA   + D      N+I N QQQN  + Y
Sbjct: 438 TLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGD---IPFNIIGNYQQQNLWVEY 494

Query: 419 DVPNSRLGVARELC 432
           D+   RLG+A   C
Sbjct: 495 DLEKLRLGIAPAQC 508


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 126/415 (30%), Positives = 196/415 (47%), Gaps = 45/415 (10%)

Query: 53  SWEESVLEMLAKDQARLQFL------------SSLAVARK-SVVPIASGRQITQSPTYIV 99
           S  E    +LA D AR+  L            S  A A K + VP+ SG ++ ++  Y+ 
Sbjct: 57  SRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASASKLAQVPVTSGARL-RTLNYVA 115

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQ 156
              IG    T+++  DT+++  WV C  C  C      +F+ + S ++  + C ++ C  
Sbjct: 116 TVGIGGGEATVIV--DTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDA 173

Query: 157 VPNPTCGGG--------ACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATG 207
           +   T   G        AC++ L+Y   + +   L+ D +SLA + + G+ FGC     G
Sbjct: 174 LRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGEDIQGFVFGCGTSNQG 233

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR---- 263
                 GL+GLGR  LSL++QT + +   FSYCLP  ++ S SGSL LG      R    
Sbjct: 234 PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGS-SGSLVLGDDASVYRNSTP 292

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           I YT ++ +P +   Y  NL  I VG   V  P     F+   G   I+DSGT+ T LV 
Sbjct: 293 IVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPG----FSAGGGGKAIVDSGTIITSLVP 348

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNL 378
             Y AVR  F  ++            DTC+ +     +  P++ L+F  G  V +    +
Sbjct: 349 SVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQVPSLKLVFDGGAEVEVDSKGV 408

Query: 379 LIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           L   T   S  CLA+A+     ++   +I N QQ+N R+++D   S++G A+E C
Sbjct: 409 LYVVTGDASQVCLALASLKSEYDT--PIIGNYQQKNLRVIFDTVGSQIGFAQETC 461


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 124/417 (29%), Positives = 195/417 (46%), Gaps = 44/417 (10%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-------------SLAV 77
           S+ L++ H   PC+P   +  L    S L+ L  DQ R +++               LA 
Sbjct: 53  SAVLRLTHRHGPCAPAGKASALGSPPSFLDTLRADQRRAEYIQRRVSGAAAAAPGMQLAG 112

Query: 78  ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--CSST- 134
           ++ + VP   G  I  +  Y+V   +GTPA    + +DT +D +WV C  C    C S  
Sbjct: 113 SKAATVPANLGFSI-GTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQR 171

Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVP--NPTCGGGACAFNLTYGS-STIAANLSQDTISL 189
             +F+  +S+++  + C AA C Q+   +  C GG C + ++YG  ST     S DT++L
Sbjct: 172 DPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTL 231

Query: 190 -ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
             ++ + G+ FGC     G      GLLGLGR   SL++Q  + Y   FSYCLP  +  +
Sbjct: 232 TGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQ--N 289

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
             G + LG          TPLL      + Y V L  I VG + + I            +
Sbjct: 290 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------S 343

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSV----PIVAPTI 362
           G ++D+GTV TRL   AY+A+R  FR  +      +  + G  DTCY       +  PTI
Sbjct: 344 GAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTI 403

Query: 363 TLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           ++ F G          +   T+G +T   +A AP   +S  +++ N+QQ++  + +D
Sbjct: 404 SIAFGGGAA-------MDLGTSGILTSGCLAFAPTGGDSQASILGNVQQRSFEVRFD 453


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 113/345 (32%), Positives = 170/345 (49%), Gaps = 29/345 (8%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAW----VPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
           TY+V   IGTP   L   +DT +D  W     PC  C    + ++  A+S T+ N+ C++
Sbjct: 91  TYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRS 150

Query: 152 AQCK--QVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKA 205
             C+  Q P   C      CA+  +YG  T     L+ +T +L +D  V G  FGC  + 
Sbjct: 151 PMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTEN 210

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRI 264
            G++    GL+G+GRG LSL++Q   L  + FSYC   F A + S  L LG   +     
Sbjct: 211 LGSTDNSSGLVGMGRGPLSLVSQ---LGVTRFSYCFTPFNATAAS-PLFLGSSARLSSAA 266

Query: 265 KYTPLLKNP-----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
           K TP + +P     RRSS YY++L  I VG  ++ I P   +  P    G IIDSGT FT
Sbjct: 267 KTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFT 326

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQ 375
            L   A+ A+      RV   L   +  G   C++      +  P + L F G ++ L +
Sbjct: 327 ALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRR 386

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           ++ ++   +  + CL M +A       ++V+ +MQQQN  ILYD+
Sbjct: 387 ESYVVEDRSAGVACLGMVSARG-----MSVLGSMQQQNTHILYDL 426


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 127/374 (33%), Positives = 181/374 (48%), Gaps = 42/374 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y V  ++GTPA  +++ MDT +D +W   VPC  CV      FN   S++F  L C ++ 
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 197

Query: 154 CKQV---PNPTCG--GGACAFNLTYGSSTIAANL-SQDTISLAT----DIVP----GYTF 199
           C  V     P C   G  C F++ YG  ++++ L + +TI+  T    D  P      T 
Sbjct: 198 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 257

Query: 200 GCIQ-KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGP 257
           GC      G      GLLG+ R  +S  +Q  + Y   FS+C P   A L+ SG +  G 
Sbjct: 258 GCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGLVFFGE 317

Query: 258 --IGQPKRIKYTPLLKNPRRSS----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-T 310
             I  P  ++YTPL++NP   S     YYV L+ I V    + +       +  TG+G T
Sbjct: 318 SDIISP-YLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGSGGT 376

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--------PIVAPTI 362
           IIDSGT FT L  PA+ A+R  F  R      V    GF  CY++          + P+I
Sbjct: 377 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALESTILPSI 436

Query: 363 TLMF-SGMNVTLPQDNLLI---HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           TL F  G++V LP++++LI    S   +  CLA   + D      N+I N QQQN  + Y
Sbjct: 437 TLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGD---IPFNIIGNYQQQNLWVEY 493

Query: 419 DVPNSRLGVARELC 432
           D+   RLG+A   C
Sbjct: 494 DLEKLRLGIAPAQC 507


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/361 (33%), Positives = 177/361 (49%), Gaps = 26/361 (7%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y  +  +GTPA T LM +DT +D  W+   PC  C   S  VF+  +S ++  + C 
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCV 178

Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
           A  C+++ +  C     +C + + YG  ++ A + + +T++ A    V     GC     
Sbjct: 179 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNE 238

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------G 259
           G  +   GLLGLGRG LS   Q    +  +FSYCL    +     S R   +        
Sbjct: 239 GLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 298

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTV 317
                 +TP+ +NPR ++ YYV+LL   V G RV  +    L+ NPTTG  G I+DSGT 
Sbjct: 299 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTS 358

Query: 318 FTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNV 371
            TRL  P Y AVRD FR   VG  ++      FDTCY++     +  PT+++  + G +V
Sbjct: 359 VTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASV 418

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            LP +N LI        C AMA     V    ++I N+QQQ  R+++D    R+G   + 
Sbjct: 419 ALPPENYLIPVDTSGTFCFAMAGTDGGV----SIIGNIQQQGFRVVFDGDAQRVGFVPKS 474

Query: 432 C 432
           C
Sbjct: 475 C 475


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 124/417 (29%), Positives = 195/417 (46%), Gaps = 44/417 (10%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-------------SLAV 77
           S+ L++ H   PC+P   +  L    S L+ L  DQ R +++               LA 
Sbjct: 64  SAVLRLTHRHGPCAPAGKASALGSPPSFLDTLRADQRRAEYIQRRVSGAAAAAPGMQLAG 123

Query: 78  ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--CSST- 134
           ++ + VP   G  I  +  Y+V   +GTPA    + +DT +D +WV C  C    C S  
Sbjct: 124 SKAATVPANLGFSI-GTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQR 182

Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVP--NPTCGGGACAFNLTYGS-STIAANLSQDTISL 189
             +F+  +S+++  + C AA C Q+   +  C GG C + ++YG  ST     S DT++L
Sbjct: 183 DPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTL 242

Query: 190 -ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
             ++ + G+ FGC     G      GLLGLGR   SL++Q  + Y   FSYCLP  +  +
Sbjct: 243 TGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQ--N 300

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
             G + LG          TPLL      + Y V L  I VG + + I            +
Sbjct: 301 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------S 354

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSV----PIVAPTI 362
           G ++D+GTV TRL   AY+A+R  FR  +      +  + G  DTCY       +  PTI
Sbjct: 355 GAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTI 414

Query: 363 TLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           ++ F G          +   T+G +T   +A AP   +S  +++ N+QQ++  + +D
Sbjct: 415 SIAFGGGAA-------MDLGTSGILTSGCLAFAPTGGDSQASILGNVQQRSFEVRFD 464


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 118/363 (32%), Positives = 165/363 (45%), Gaps = 29/363 (7%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           PI SG     S  Y  R  IG P     + +DT +D  WV C  C  C   +  +F  A 
Sbjct: 137 PIISGTS-QGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPAS 195

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
           S +F  L C   QC+ +    C    C + ++YG  S    +   +TI+L +  V     
Sbjct: 196 SASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAI 255

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSFKALSFSGSLR 254
           GC     G  V   GLLGLG GSLS  +Q   +  ++FSYCL      S   L F+ +L 
Sbjct: 256 GCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INATSFSYCLVDRDSESASTLEFNSTL- 311

Query: 255 LGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
                 P      PLL+N    + YYV L  + VG  +V IP  A Q + +   G I+DS
Sbjct: 312 ------PPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDS 365

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF-SGM 369
           GT  TRL    Y ++RD F +R     +   +  FDTCY +     +  PT++  F  G 
Sbjct: 366 GTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGK 425

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
            + LP  N L+   +    C A A       S L++I N+QQQ  R++YD+ N  +G   
Sbjct: 426 ELPLPAKNYLVPLDSEGTFCFAFAPTA----SSLSIIGNVQQQGTRVVYDLVNHLVGFVP 481

Query: 430 ELC 432
             C
Sbjct: 482 NKC 484


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 176/360 (48%), Gaps = 21/360 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
           P+ SG     S  Y  R  +G PA+   M +DT +D  W+   PCT C   +  +F+   
Sbjct: 149 PVTSGTS-QGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTA 207

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYT 198
           S+T+  + CQ+ QC  +   +C  G C + + YG  +    + + +++S   +  V    
Sbjct: 208 SSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVA 267

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
            GC     G  V   GLLGLG G LSL   T  L  ++FSYCL + +  + S +L     
Sbjct: 268 LGCGHDNEGLFVGAAGLLGLGGGPLSL---TNQLKATSFSYCLVN-RDSAGSSTLDFNSA 323

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
                    PL+KN +  + YYV L  + VG ++V IP    + + +   G I+D GT  
Sbjct: 324 QLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAI 383

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----PIVAPTITLMFS-GMNVT 372
           TRL   AY  +RD F  R+  NL +TS +  FDTCY +     +  PT++  F+ G +  
Sbjct: 384 TRLQTQAYNPLRDAF-VRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWN 442

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP  N LI   +    C A A       S L++I N+QQQ  R+ +D+ N+R+G +   C
Sbjct: 443 LPAANYLIPVDSAGTYCFAFAP----TTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 121/380 (31%), Positives = 182/380 (47%), Gaps = 50/380 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG---------CSST-VFNSAQSTTFKN 146
           Y+V    GTP Q +L+  DT +D  W+ C+             CS    F +++S T   
Sbjct: 54  YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 113

Query: 147 LGCQAAQCKQVPNPTCGGGAC----------AFNLTYGSSTIAANLSQDTISLATDI--- 193
           + C AAQC  VP P   G +C          A++   GSST    L++DT +++      
Sbjct: 114 VPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTG-FLARDTATISNGTSGG 172

Query: 194 --VPGYTFGCIQKATGNSVPPQG-LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
             V G  FGC  +  G S    G ++GLG+G LS  AQ+ +L+  TFSYCL   +     
Sbjct: 173 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 232

Query: 251 GSLRLGPIGQPKR---IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
            S     +G+P+R     YTPL+ NP   + YYV ++AIRVG RV+ +P      +    
Sbjct: 233 RSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGN 292

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRR-----RVGSNLTVTSLGGFDTCYSVPIVA--- 359
            GT+IDSG+  T L   AY  +   F       R+ S+ T     G + CY+V   +   
Sbjct: 293 GGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATF--FQGLELCYNVSSSSSLA 350

Query: 360 ------PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
                 P +T+ F+ G+++ LP  N L+   A  + CLA+   P       NV+ N+ QQ
Sbjct: 351 PANGGFPRLTIDFAQGLSLELPTGNYLV-DVADDVKCLAI--RPTLSPFAFNVLGNLMQQ 407

Query: 413 NHRILYDVPNSRLGVARELC 432
            + + +D  ++R+G AR  C
Sbjct: 408 GYHVEFDRASARIGFARTEC 427


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 117/381 (30%), Positives = 184/381 (48%), Gaps = 31/381 (8%)

Query: 73  SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC- 131
           S +A + ++ VP+ SG +  Q+  YIV   +G+  Q + + +DT +D  WV C  C  C 
Sbjct: 99  SQIADSSETQVPLTSGIKF-QTLNYIVTMGLGS--QNMSVIVDTGSDLTWVQCEPCRSCY 155

Query: 132 --SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----GACAFNLTYGS-STIAANLS 183
             +  +F  + S +++ + C +  C+ +    CG        C + + YG  S  +  L 
Sbjct: 156 NQNGPLFKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELG 215

Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
            + +      V  + FGC +   G      GL+GLGR  LS+++QT   +   FSYCLPS
Sbjct: 216 IEKLGFGGISVSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSYCLPS 275

Query: 244 FKALSFSGSLRLG-PIGQPKR---IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
                 SGSL +G   G  K    I YT +L N + S+ Y +NL  I VG   + +    
Sbjct: 276 TDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHV---- 331

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---- 355
            Q +     G I+DSGTV +RL    Y A++  F  +     +       DTC+++    
Sbjct: 332 -QASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYD 390

Query: 356 PIVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
            +  PTI++ F G   +NV       L+   A  + CLA+A+  D     + +I N QQ+
Sbjct: 391 QVNIPTISMYFEGNAELNVDATGIFYLVKEDASRV-CLALASLSDEYE--MGIIGNYQQR 447

Query: 413 NHRILYDVPNSRLGVARELCT 433
           N R+LYD   S++G A+E CT
Sbjct: 448 NQRVLYDAKLSQVGFAKEPCT 468


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 110/348 (31%), Positives = 161/348 (46%), Gaps = 33/348 (9%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  +G+P ++  M +D+ +D  WV   PCT C   S  VF+ A S +F  + C 
Sbjct: 198 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCS 257

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           ++ C ++ N  C  G C + ++YG  S     L+ +T++    +V     GC  +  G  
Sbjct: 258 SSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMF 317

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           V   GLLGLG GS+S + Q        FSYCL S                      + PL
Sbjct: 318 VGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS--------------------AAWVPL 357

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           ++NPR  S YY+ L  + VG   V I     +       G ++D+GT  TRL   AY A 
Sbjct: 358 VRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAF 417

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
           RD F  +  +    T +  FDTCY     V +  PT++  FSG  + TLP  N LI    
Sbjct: 418 RDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDD 477

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A A +     S L+++ N+QQ+  +I +D  N  +G    +C
Sbjct: 478 AGTFCFAFAPS----TSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 121/363 (33%), Positives = 177/363 (48%), Gaps = 24/363 (6%)

Query: 84  PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
           P+ SG  + Q S  Y  R  IG+PA+ L M +DT +D  WV C  C  C   S  VF+ +
Sbjct: 154 PVVSG--VGQGSGEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPS 211

Query: 140 QSTTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLSQDTISLATDI-VP 195
            S ++  + C + +C+ +    C    GAC + + YG  S    + + +T++L     V 
Sbjct: 212 LSASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVG 271

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
               GC     G  V   GLL LG G LS  +Q   +  STFSYCL    + + S +L+ 
Sbjct: 272 NVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISASTFSYCLVDRDSPAAS-TLQF 327

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-TIIDS 314
           G           PL+++PR S+ YYV L  I VG + + IP  A   + T+G+G  I+DS
Sbjct: 328 GDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDS 387

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMN 370
           GT  TRL + AY A+RD F +   S    + +  FDTCY +     +  P ++L F G  
Sbjct: 388 GTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGG 447

Query: 371 -VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
            + LP  N LI        CLA A      N+ +++I N+QQQ  R+ +D     +G   
Sbjct: 448 ALRLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRVSFDTARGAVGFTP 503

Query: 430 ELC 432
             C
Sbjct: 504 NKC 506


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 141/476 (29%), Positives = 216/476 (45%), Gaps = 64/476 (13%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQD-----HSSTLQVFHVFSPCSPFKPSKPLSWEESVL 59
           L +F+ F     L+  L    D  +         L ++HV    S    + P S+ +   
Sbjct: 3   LFWFIVFSAHLVLASSLVEFQDNDNPRQKQEGMQLNLYHVKGLDSSQTSTSPFSFSD--- 59

Query: 60  EMLAKDQARLQFLSSLAVARKSV------------------VPIASGRQITQSPTYIVRA 101
            M+ KD+ R++FL S    ++SV                   P+ SG  I  S  Y V+ 
Sbjct: 60  -MITKDEERVRFLHSRLTNKESVRNSATTDKLRGGPSLVSTTPLKSGLSI-GSGNYYVKI 117

Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCK-- 155
            +GTPA+   M +DT +  +W+ C  CV  C   V   F  + S T+K L C ++QC   
Sbjct: 118 GLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSSQCSSL 177

Query: 156 -----QVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVP--GYTFGCIQKATG 207
                  P  +   GAC +  +YG ++ +   LSQD ++L     P  G+ +GC Q   G
Sbjct: 178 KSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPSSGFVYGCGQDNQG 237

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP----SFKALSFSGSLRLGPIGQPKR 263
                 G++GL    +S+L Q    Y + FSYCLP    +  + S SG L +G       
Sbjct: 238 LFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLSIGASSLTSS 297

Query: 264 -IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
             K+TPL+KN +  SLY+++L  I V  + + +   A  +N      TIIDSGTV TRL 
Sbjct: 298 PYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGV--SASSYN----VPTIIDSGTVITRLP 351

Query: 323 APAYTAVRDVFRRRVGSNLT-VTSLGGFDTCYSVPI----VAPTITLMF-SGMNVTLPQD 376
              Y A++  F   +             DTC+   +      P I ++F  G  + L   
Sbjct: 352 VAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLELKAH 411

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           N L+    G+ TCLA+AA+ + +    ++I N QQQ  ++ YDV N ++G A   C
Sbjct: 412 NSLVEIEKGT-TCLAIAASSNPI----SIIGNYQQQTFKVAYDVANFKIGFAPGGC 462


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 128/435 (29%), Positives = 205/435 (47%), Gaps = 56/435 (12%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV----VPIASG 88
           ++ + H + PC+P + S   +   S+ E L + +AR  ++ S A     +     P    
Sbjct: 56  SMSLVHRYGPCAPSQYSNVPT--PSISETLRRSRARTNYIMSQASKSMGMGMASTPDDDD 113

Query: 89  RQIT---------QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGC---SST 134
             +T          S  Y+V    GTP+   ++ MDT +D +WV CT C    C      
Sbjct: 114 AAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDP 173

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNP-----TCGGGACAFNLTYGSSTIAANL-SQDTIS 188
           +F+ ++S+T+  + C    C+++ +      T GG  C +++ Y   + +  + S +T++
Sbjct: 174 LFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLT 233

Query: 189 LATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
           LA  I V  + FGC +   G S    GLLGLG   +SL+ QT ++Y   FSYCLP+    
Sbjct: 234 LAPGITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALN-- 291

Query: 248 SFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           S +G L LG  P G      +TP+   P  ++ Y V +  I VG + + IP  A +    
Sbjct: 292 SEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFR---- 347

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
              G IIDSGTV T L   AY A+    R+ + +   V S   FDTCY+      I  P 
Sbjct: 348 --GGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPS-DDFDTCYNFTGYSNITVPR 404

Query: 362 ITLMFSG---MNVTLPQDNLLIHSTAGSITCLAM-AAAPDNVNSVLNVIANMQQQNHRIL 417
           +   FSG   +++ +P + +L++       CLA   + PD+    L +I N+ Q+   +L
Sbjct: 405 VAFTFSGGATIDLDVP-NGILVND------CLAFQESGPDD---GLGIIGNVNQRTLEVL 454

Query: 418 YDVPNSRLGVARELC 432
           YD     +G     C
Sbjct: 455 YDAGRGNVGFRAGAC 469


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 136/416 (32%), Positives = 190/416 (45%), Gaps = 59/416 (14%)

Query: 58  VLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT---YIVRAKIGTPAQTLLMAM 114
           V + L +D  R      LA +  +   +++  QI  SPT   Y++   IGTP  +     
Sbjct: 47  VRDALRRDMHR-HNARQLAASSSNGTTVSAPTQI--SPTAGEYLMTLAIGTPPVSYQAIA 103

Query: 115 DTSNDAAWVPCTGCVGCSSTVF-------NSAQSTTFKNLGCQ-------AAQCKQVPNP 160
           DT +D  W   T C  CSS  F       N + STTF  L C        AA     P P
Sbjct: 104 DTGSDLIW---TQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTTPPP 160

Query: 161 TCGGGACAFNLTYGSSTIAANLSQDTISLATDI------VPGYTFGCIQKATG-NSVPPQ 213
            C    C +N+TYGS   +     +T +  +        VPG  FGC   + G N+    
Sbjct: 161 GC---TCMYNMTYGSGWTSVYQGSETFTFGSSTPANQTGVPGIAFGCSNASGGFNTSSAS 217

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLL 270
           GL+GLGRGSLSL++Q   L    FSYCL  ++  + + +L LGP   +     +  TP +
Sbjct: 218 GLVGLGRGSLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLLLGPSASLNDTGGVSSTPFV 274

Query: 271 KNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
            +P     S+ YY+NL  I +G   + IP  AL        G IIDSGT  T L   AY 
Sbjct: 275 ASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGGFIIDSGTTITLLGNTAYQ 334

Query: 328 AVRDVFRRRVGSNLTVTSLG----GFDTCY------SVPIVAPTITLMFSGMNVTLPQDN 377
            VR      V   L  T  G    G D C+      S P   P++TL F G ++ LP D+
Sbjct: 335 QVRAAVVSLV--TLPTTDGGSAATGLDLCFELPSSTSAPPTMPSMTLHFDGADMVLPADS 392

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            ++  +  ++ CLAM    +  +  ++++ N QQQN  ILYDV    L  A   C+
Sbjct: 393 YMMLDS--NLWCLAMQ---NQTDGGVSILGNYQQQNMHILYDVGQETLTFAPAKCS 443


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 164/339 (48%), Gaps = 31/339 (9%)

Query: 110 LLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPN--PTCGG 164
           + + +DT +D  W+ C  C  C     ++F  A S T+K L C +  C+Q+ +   +C  
Sbjct: 1   MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLN 60

Query: 165 GACAFNLTYGS-STIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQGLLGL 218
            +C + ++YG  ST   + + +T++L +D      VP + FGC     G      GL+GL
Sbjct: 61  SSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAAGLMGL 120

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRIKYTPLLKNPRRSS 277
           G+ S+   AQT   +   FSYCLPS  +   SG L  G        +++TPL+ +    S
Sbjct: 121 GKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPS 180

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
            Y+V++  I VG  ++ I            A  ++DSGTV +R    AY  +RD F + +
Sbjct: 181 QYFVSMTGINVGDELLPI-----------SATVMVDSGTVISRFEQSAYERLRDAFTQIL 229

Query: 338 GSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
               T  S+  FDTC+ V  V     P ITL F            +++     + C A A
Sbjct: 230 PGLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMCFAFA 289

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            +    +S  +V+ N QQQN R +YD+P SRLG++   C
Sbjct: 290 PS----SSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 134/472 (28%), Positives = 220/472 (46%), Gaps = 62/472 (13%)

Query: 11  FLFLFSLSEG-----------------LNPICDTQ--DHSS------TLQVFHVFSPCSP 45
           FL LFSL +G                 +N +  T   +HSS      +L+V H   PC  
Sbjct: 2   FLLLFSLEKGYAVEENEATKSYLHIIKVNSLLPTTACNHSSKVSNSLSLEVVHRHGPCIG 61

Query: 46  FKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR-------KSVVPIASGRQITQSPTYI 98
               +  +   S +E+  +DQ R+  + +   +R        + +P+ SG  I  +  Y+
Sbjct: 62  IVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPEKQATTLPVQSGASI-GAGDYV 120

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQC 154
           V   +GTP +   +  DT +D  W  C  CV           N + ST++KN+ C +A C
Sbjct: 121 VTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALC 180

Query: 155 KQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATG 207
           K V +      +C    C + + YG  + +    + +T++L++ ++   + FGC Q+  G
Sbjct: 181 KLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNG 240

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRIKY 266
                 GLLGLGR  L+L +QT   Y+  FSYCLP+  + S  G L LG  GQ  K +K+
Sbjct: 241 LFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPA--SSSSKGYLSLG--GQVSKSVKF 296

Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
           TPL  +   +  Y +++  + VG R + I   A        AGT+IDSGTV TRL   AY
Sbjct: 297 TPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS------AGTVIDSGTVITRLSPTAY 350

Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIH 381
           + +   F+  +    + +    FDTCY       +  P + + F  G+ + +    +L  
Sbjct: 351 SELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYP 410

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
                  CLA A   D+ ++  ++  N+QQ+ ++++YD    R+G A   C+
Sbjct: 411 VNGLKKVCLAFAGNDDDSDT--SIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 186/383 (48%), Gaps = 46/383 (12%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSA 139
           VP+ SG ++ ++  Y+    +G    T+++  DT+++  WV C  C  C      +F+ +
Sbjct: 140 VPVTSGAKL-RTLNYVATVGLGGGEATVIV--DTASELTWVQCAPCESCHDQQDPLFDPS 196

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCG--GGA------------CAFNLTYGSSTIAAN-LSQ 184
            S ++  + C ++ C  +   T G  GGA            C++ L+Y   + +   L+ 
Sbjct: 197 SSPSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAH 256

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
           D +SLA +++ G+ FGC    T N  PP     GL+GLGR  LSL++QT + +   FSYC
Sbjct: 257 DRLSLAGEVIDGFVFGC---GTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYC 313

Query: 241 LPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           LP  K    SGSL +G      R    I Y  ++ +P +   Y+VNL  I VG + V+  
Sbjct: 314 LP-LKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESS 372

Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV- 355
             +           IIDSGTV T LV   Y AV+  F  +             DTC+++ 
Sbjct: 373 GFSSGGGGGK---AIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMT 429

Query: 356 ---PIVAPTITLMFSGMNVTLPQDN---LLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
               +  P++ L+F G  V +  D+   L   S+  S  CLAM  AP       N+I N 
Sbjct: 430 GLREVQVPSLKLVFDG-GVEVEVDSGGVLYFVSSDSSQVCLAM--APLKSEYETNIIGNY 486

Query: 410 QQQNHRILYDVPNSRLGVARELC 432
           QQ+N R+++D   S++G A+E C
Sbjct: 487 QQKNLRVIFDTSGSQVGFAQETC 509


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 131/446 (29%), Positives = 203/446 (45%), Gaps = 58/446 (13%)

Query: 15  FSLSEGLNPIC------DTQDHSSTLQVFHVFSPCSPFKPS----KPLSWEESVLEMLAK 64
           F L E L P C       + D +S++ + H + PCSP  P+    +P   E    + L  
Sbjct: 11  FGLCEEL-PACGAATIPSSSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRA 69

Query: 65  DQARLQFLSSLAVA-------RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTS 117
           D  R +F  S   A        K  VP   G  +  +  Y++   +G+PA T  + +DT 
Sbjct: 70  DYIRRKFSGSNGTAAGEDGQSSKVSVPTTLGSSL-DTLEYVISVGLGSPAVTQRVVIDTG 128

Query: 118 NDAAWVPC------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----- 166
           +D +WV C      + C   +  +F+ A S+T+    C AA C Q+ +     G      
Sbjct: 129 SDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSR 188

Query: 167 CAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGS 222
           C + + YG  S      S D ++L+ +D+V G+ FGC     G  +  +  GL+GLG  +
Sbjct: 189 CQYIVKYGDGSNTTGTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDA 248

Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK-----RIKYTPLLKNPRRSS 277
            S ++QT   Y  +F YCLP+  A S  G L LG           R   TP+L++ +  +
Sbjct: 249 QSPVSQTAARYGKSFFYCLPATPASS--GFLTLGAPASGGGGGASRFATTPMLRSKKVPT 306

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
            Y+  L  I VG + + + P          AG+++DSGTV TRL   AY A+   FR  +
Sbjct: 307 YYFAALEDIAVGGKKLGLSPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGM 360

Query: 338 GSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
                   LG  DTC++      +  PT+ L+F+G  V     +L  H   G ++   +A
Sbjct: 361 TRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVV----DLDAH---GIVSGGCLA 413

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYD 419
            AP   +     I N+QQ+   +LYD
Sbjct: 414 FAPTRDDKAFGTIGNVQQRTFEVLYD 439


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 144/452 (31%), Positives = 205/452 (45%), Gaps = 67/452 (14%)

Query: 24  ICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL----------- 72
           + + + HSST  V           P  P + E  +  +LA D AR   L           
Sbjct: 107 VLELKHHSSTATV-----------PDHPAARERYLKHLLAADSARAASLQLRKPKPASST 155

Query: 73  -SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTP-AQTLLMAMDTSNDAAWVPCTGCVG 130
            ++ A A  + VP+ SG +  Q+  Y+    +G   A+ L + +DT +D  WV C  C G
Sbjct: 156 TTTQASAAAAEVPLGSGIRY-QTLNYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPG 214

Query: 131 CS-----STVFNSAQSTTFKNLGCQAAQCK-QVPNPTCGGGACA-----------FNLTY 173
            S       +F+ A S TF  + C +  C   + + T   G+CA           + L+Y
Sbjct: 215 SSCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSY 274

Query: 174 GSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
           G  + +   L+QDT+ L T   + G+ FGC     G      GL+GLGR  LSL++QT  
Sbjct: 275 GDGSFSRGVLAQDTLGLGTTTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAA 334

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
            +   FSYCLP+    + S SL  GP      + YT ++ +P +   Y++N+    VG  
Sbjct: 335 RFGGVFSYCLPATTTSTGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGG 394

Query: 292 VVDIPPGALQFNPTTGAGTI-IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF- 349
                PG        GAG + +DSGTV TRL    Y AVR  F RR        +  GF 
Sbjct: 395 AALTAPG-------FGAGNVLVDSGTVITRLAPSVYKAVRAEFARR----FEYPAAPGFS 443

Query: 350 --DTCYSV----PIVAPTITLMFS-GMNVTLPQDNLL-IHSTAGSITCLAMAAAPDNVNS 401
             D CY +     +  P +TL    G  VT+    +L +    GS  CLAMA+ P     
Sbjct: 444 ILDACYDLTGRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLP--YED 501

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              +I N QQ+N R++YD   SRLG A E CT
Sbjct: 502 QTPIIGNYQQRNKRVVYDTVGSRLGFADEDCT 533


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 174/360 (48%), Gaps = 21/360 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
           P+ SG     S  Y  R  +G PA+   M +DT +D  W+   PCT C   +  +F+   
Sbjct: 8   PVTSGTS-QGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTA 66

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYT 198
           S+T+  + CQ+ QC  +   +C  G C + + YG  +    + + +++S   +  V    
Sbjct: 67  SSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVA 126

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
            GC     G  V   GLLGLG G LSL   T  L  ++FSYCL +  +   S +L     
Sbjct: 127 LGCGHDNEGLFVGAAGLLGLGGGPLSL---TNQLKATSFSYCLVNRDSAG-SSTLDFNSA 182

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
                    PL+KN +  + YYV L  + VG ++V IP    + + +   G I+D GT  
Sbjct: 183 QLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAI 242

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----PIVAPTITLMFS-GMNVT 372
           TRL   AY  +RD F R    NL +TS +  FDTCY +     +  PT++  F+ G +  
Sbjct: 243 TRLQTQAYNPLRDAFVRMT-QNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWN 301

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP  N LI   +    C A A       S L++I N+QQQ  R+ +D+ N+R+G +   C
Sbjct: 302 LPAANYLIPVDSAGTYCFAFAP----TTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 124/416 (29%), Positives = 198/416 (47%), Gaps = 47/416 (11%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSS----------LAVARKSVVPIASGRQITQSPTYIV 99
           K L W + + + L  D  +L+ L S          +  +  + +P+ SG ++ QS  YIV
Sbjct: 10  KILDWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNIDDSVDTQIPLTSGIRL-QSLNYIV 68

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ 156
             ++G    T+++  DT +D +WV   PC  C      VFN ++S +++ + C +  C+ 
Sbjct: 69  TVELGGRKMTVIV--DTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRS 126

Query: 157 VPNPTCGGGACAFN-------LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
           +   T   G C  N       + YG  S  +  +  + ++L    V  + FGC +K  G 
Sbjct: 127 LQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTTVNNFIFGCGRKNQGL 186

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----I 264
                GL+GLGR  LSL++Q   ++   FSYCLP+ +A + SGSL +G      +    I
Sbjct: 187 FGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEA-SGSLVMGGNSSVYKNTTPI 245

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
            YT ++ NP     Y++NL  I VG   V  P     F        IIDSGTV +RL   
Sbjct: 246 SYTRMIHNPLL-PFYFLNLTGITVGGVEVQAP----SFGKDR---MIIDSGTVISRLPPS 297

Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG---MNVTLPQDN 377
            Y A++  F ++     +  S    D+C+++     +  P I + F G   +NV +    
Sbjct: 298 IYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELNVDVTGVF 357

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             + + A  + CLA+A+ P      + +I N QQ+N RI+YD   S LG A E C+
Sbjct: 358 YSVKTDASQV-CLAIASLP--YEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 136/411 (33%), Positives = 202/411 (49%), Gaps = 39/411 (9%)

Query: 55  EESVLEMLAKDQARLQFLSS---LAVARKSVV-------PIASGRQITQSPTYIVRAKIG 104
           E+ +LE L +D+ R++++ S   LA  +K          P+ SG  +  S  Y VR  +G
Sbjct: 78  EQLLLETLQRDEQRVRWIESKAQLAGKKKDEASSTDLNGPVTSGL-LYGSGEYFVRLGVG 136

Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
           TPA++L M +DT +D  W+ C  C  C   +  +F+   S++F+ + C +  CK +   +
Sbjct: 137 TPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKALEIHS 196

Query: 162 CGG--GA---CAFNLTYGSSTIA-ANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQG 214
           C G  GA   C++ + YG  + +  + S D  +L T        FGC     G      G
Sbjct: 197 CSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAG 256

Query: 215 LLGLGRGSLSLLAQ-----TQNLYQSTFSYCL--PSFKALSFSGSLRLGPIGQPKRIKYT 267
           LLGLG G LS  +Q     T +   ++FSYCL   S      S SL  G    P     +
Sbjct: 257 LLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGAAAIPSTAALS 316

Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
           PLLKNP+  + YY  ++ + VG   + I   +LQ + +   G IIDSGT  TR     Y 
Sbjct: 317 PLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYA 376

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMF-SGMNVTLPQDNLLIH- 381
            +RD FR    +  +      FDTCY+      +  P + L F +G ++ LP  N LI  
Sbjct: 377 TIRDAFRNATTNLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPI 436

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +TAGS  CLA   AP ++   L +I N+QQQ+ RI +D+  S L  A + C
Sbjct: 437 NTAGSF-CLAF--APTSME--LGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 482


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 116/377 (30%), Positives = 173/377 (45%), Gaps = 55/377 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y+V   +GTP + + + +DT +D  W  C  C  C      + + A S+T+  L C A +
Sbjct: 92  YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCGAPR 151

Query: 154 CKQVPNPTCGGG----------ACAFNLTYGSSTI-AANLSQDTISLATDIVPG------ 196
           C+ +P  +CGGG          +CA+   YG  ++    ++ D  +   D   G      
Sbjct: 152 CRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRLPT 211

Query: 197 --YTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGS 252
              TFGC     G     + G+ G GRG  SL +Q   L  +TFSYC  S F++ S   +
Sbjct: 212 RRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQ---LNVTTFSYCFTSMFESKSSLVT 268

Query: 253 LRLGPIGQ---------PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
           L   P               ++ TPLLKNP + SLY+++L  I VG+  + +P   L+  
Sbjct: 269 LGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAKLR-- 326

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCYSVPIVA--- 359
                 TIIDSG   T L    Y AV+  F  +VG   T V      D C+++P+ A   
Sbjct: 327 -----STIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPVTALWR 381

Query: 360 ----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
               P++TL   G +  LP+ N +    A  + C+ + AAP +      VI N QQQN  
Sbjct: 382 RPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGD----QTVIGNFQQQNTH 437

Query: 416 ILYDVPNSRLGVARELC 432
           ++YD+ N  L  A   C
Sbjct: 438 VVYDLENDWLSFAPARC 454


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 134/488 (27%), Positives = 218/488 (44%), Gaps = 82/488 (16%)

Query: 3   PQLVFFLAFLFLFSLSEGLN---------------PIC-------DTQDHSSTLQVFHVF 40
           P LV  +   + +SL+ G N               P+C       D   ++ ++ + H  
Sbjct: 5   PLLVCIILCTYEYSLAHGGNEHGFVAVPTTASEPEPVCSTSGVTLDPGSNTVSVPLVHRH 64

Query: 41  SPCSPFKPS--KPLSWEESVLEMLAKDQARLQFLSS------LAVARKSVVPIASGRQIT 92
            PC+P + S  KP S+ +     L +++AR +++ S      +       +P   G  + 
Sbjct: 65  GPCAPTQLSSDKPSSFTD----RLRRNRARSKYIMSRVSKGMMGDDADVSIPTHLGGSV- 119

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFKNL 147
            S  Y+V   +GTP+ + ++ +DT +D +WV C     T C      +F+ ++S+T+  +
Sbjct: 120 DSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPI 179

Query: 148 GCQAAQCKQVPNPTCGGGA--------CAFNLTYGSSTIAANL-SQDTISLATDI-VPGY 197
            C    C+ + +   GGG         C F +TYG  +    + S +T++LA  + V  +
Sbjct: 180 PCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKDF 239

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-----ALSFSGS 252
            FGC     G +    GLLGLG    SL+ QT ++Y   FSYCLP+            G 
Sbjct: 240 RFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLALGGGG 299

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
              G +       +TP+++     + Y VN+  I VG   +D+PP A         G II
Sbjct: 300 APSGGVVNTSGFVFTPMIR--EEETFYVVNMTGITVGGEPIDVPPSAFS------GGMII 351

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSG 368
           DSGTV T L   AY A++  FR+ + +   V + G  DTCY       +  P + L FSG
Sbjct: 352 DSGTVVTELQHTAYNALQAAFRKAMAAYPLVRN-GELDTCYDFSGYSNVTLPKVALTFSG 410

Query: 369 ---MNVTLPQDNLLIHSTAGSITCLAM-AAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
              +++ +P   LL         CLA   + PD+   +L    N+ Q+   +LYD    R
Sbjct: 411 GATIDLDVPNGILLDD-------CLAFQESGPDDQPGIL---GNVNQRTLEVLYDAGRGR 460

Query: 425 LGVARELC 432
           +G    +C
Sbjct: 461 VGFRAAVC 468


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  154 bits (389), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 122/425 (28%), Positives = 204/425 (48%), Gaps = 37/425 (8%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR-------KSVVPI 85
           +L+V H   PC      +  +   S +E+  +DQ R+  + +   +R        + +P+
Sbjct: 1   SLEVVHRHGPCIGIVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPEKQATTLPV 60

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNSAQS 141
            SG  I  +  Y+V   +GTP +   +  DT +D  W  C  CV           N + S
Sbjct: 61  QSGASIG-AGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTS 119

Query: 142 TTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIV 194
           T++KN+ C +A CK V +      +C    C + + YG  + +    + +T++L++ ++ 
Sbjct: 120 TSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSNVF 179

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLR 254
             + FGC Q+  G      GLLGLGR  L+L +QT   Y+  FSYCLP+  + S  G L 
Sbjct: 180 KNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPA--SSSSKGYLS 237

Query: 255 LGPIGQ-PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
           LG  GQ  K +K+TPL  +   +  Y +++  + VG R + I   A        AGT+ID
Sbjct: 238 LG--GQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFS------AGTVID 289

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-G 368
           SGTV TRL   AY+ +   F+  +    + +    FDTCY       +  P + + F  G
Sbjct: 290 SGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGG 349

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
           + + +    +L         CLA A   D+ ++  ++  N+QQ+ ++++YD    R+G A
Sbjct: 350 VEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDT--SIFGNVQQRTYQVVYDGAKGRVGFA 407

Query: 429 RELCT 433
              C+
Sbjct: 408 PGGCS 412


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  154 bits (389), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 123/428 (28%), Positives = 206/428 (48%), Gaps = 37/428 (8%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR-------KSV 82
           +S +L+V H   PC      +  +   S +E+  +DQ R+  + +   +R        + 
Sbjct: 58  NSLSLEVVHRHGPCIGIVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPEKQATT 117

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNS 138
           +P+ SG  I  +  Y+V   +GTP +   +  DT +D  W  C  CV           N 
Sbjct: 118 LPVQSGASI-GAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNP 176

Query: 139 AQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTISLAT- 191
           + ST++KN+ C +A CK V +      +C    C + + YG  + +    + +T++L++ 
Sbjct: 177 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 236

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
           ++   + FGC Q+  G      GLLGLGR  L+L +QT   Y+  FSYCLP+  + S  G
Sbjct: 237 NVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPA--SSSSKG 294

Query: 252 SLRLGPIGQ-PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
            L LG  GQ  K +K+TPL  +   +  Y +++  + VG R + I   A        AGT
Sbjct: 295 YLSLG--GQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS------AGT 346

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
           +IDSGTV TRL   AY+ +   F+  +    + +    FDTCY       +  P + + F
Sbjct: 347 VIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 406

Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
             G+ + +    +L         CLA A   D+ ++  ++  N+QQ+ ++++YD    R+
Sbjct: 407 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDT--SIFGNVQQRTYQVVYDGAKGRV 464

Query: 426 GVARELCT 433
           G A   C+
Sbjct: 465 GFAPGGCS 472


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  154 bits (389), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 128/396 (32%), Positives = 189/396 (47%), Gaps = 39/396 (9%)

Query: 60  EMLAKDQARLQFL---------SSLAVAR-KSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           E L +DQ R  ++         +   V R  + VP A G  +  +  Y++   +G+PA +
Sbjct: 6   ETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDATVPTALGTSL-NTLEYLITVGLGSPATS 64

Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG--- 163
             M +DT +D +WV C  C  C S    +F+ + S+T+    C +A C Q+     G   
Sbjct: 65  QTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGCSS 124

Query: 164 GGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
              C + +TYG  S+     S DT++L +  V  + FGC    +G +    GL+GLG G+
Sbjct: 125 SSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGA 184

Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY--TPLLKNPRRSSLYY 280
            SL++QT       FSYCLP     S SG L LG  G      +  TP+L++ +  + Y 
Sbjct: 185 QSLVSQTAGTLGRAFSYCLP--PTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYG 242

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           V L AIRVG R + IP           AGT++DSGTV TRL   AY+A+   F+  +   
Sbjct: 243 VRLQAIRVGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQY 296

Query: 341 LTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAP 396
                 G  DTC+       +  P++ L+FSG  V     + +I S      CLA A   
Sbjct: 297 PPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIILS-----NCLAFAGNS 351

Query: 397 DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           D  +S L +I N+QQ+   +LYDV    +G     C
Sbjct: 352 D--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 127/392 (32%), Positives = 184/392 (46%), Gaps = 31/392 (7%)

Query: 61  MLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
           M  + +AR  + LSS A A  S      G  +T+   Y++   IGTP Q + + +DT + 
Sbjct: 1   MALRSKARAPRLLSSSATAPVSPGAYDDGVPMTE---YLLHLAIGTPPQPVQLTLDTGSV 57

Query: 120 AAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT-CGGGA---CAFNLT 172
             W  C  C  C   S   +++++S+TF    C + QCK  P+ T C       CA++ +
Sbjct: 58  LVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSYS 117

Query: 173 YGS-STIAANLSQDTIS-LATDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQT 229
           YG  S     L  +T+S +A   VPG  FGC    TG     + G+ G GRG LSL +Q 
Sbjct: 118 YGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ- 176

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLA 285
             L    FS+C  +      S  L   P    K     ++ TPL+KNP   + YY++L  
Sbjct: 177 --LKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKG 234

Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
           I VG   + +P  A      TG GTIIDSGT FT L    Y  V D F   V   +  ++
Sbjct: 235 ITVGSTRLPVPESAFALKNGTG-GTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSN 293

Query: 346 LGGFDTCYSVPIVA-----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
             G   C+S P +      P + L F G  + LP++N +  +  G    + +A     + 
Sbjct: 294 ETGPLLCFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFEAKDGGNCSICLAI----IE 349

Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             + +I N QQQN  +LYD+ NS+L   R  C
Sbjct: 350 GEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 123/418 (29%), Positives = 193/418 (46%), Gaps = 44/418 (10%)

Query: 44  SPFKPSKPLSWEESVLEMLAKDQARLQFLSSL------AVARKSVVPIASGRQITQ---- 93
           +P K  K L     VL  L +D +R+Q +++        V++  + P+ +  Q       
Sbjct: 93  TPHKDYKAL-----VLSRLHRDSSRVQAITTRLQLILNGVSKSDLKPLQTEIQPQDLSTP 147

Query: 94  --------SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQST 142
                   S  Y  R  +G PA++  M +DT +D  W+   PC+ C   S  +F  A S+
Sbjct: 148 VSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASS 207

Query: 143 TFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFG 200
           ++  L C + QC  +   +C  G C + + YG  +    +   +T+S   +  V     G
Sbjct: 208 SYSPLTCDSQQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIALG 267

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIG 259
           C     G  V   GLLGLG G LSL +Q   L  ++FSYCL +   A S +      P+G
Sbjct: 268 CGHDNEGLFVGAAGLLGLGGGPLSLTSQ---LKATSFSYCLVNRDSAASSTLDFNSAPVG 324

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
                   PLLK+ +  + YYV L  + VG  ++ IP    + + +   G I+D GT  T
Sbjct: 325 DSV---IAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAIT 381

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLP 374
           RL + AY ++RD F        + + +  FDTCY +     +  PT++  F  G +  LP
Sbjct: 382 RLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSWDLP 441

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             N LI   +    C A A       S L++I N+QQQ  R+ +D+ N+R+G +   C
Sbjct: 442 AANYLIPVDSAGTYCFAFAP----TTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 131/439 (29%), Positives = 208/439 (47%), Gaps = 53/439 (12%)

Query: 28  QDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD-------QARLQFLSSLAVARK 80
           ++ ++ L++ H  S CS     K L W + + + L  D       Q+R++ + S      
Sbjct: 62  ENGATILEMKHKDS-CS----GKILDWNKKLKKHLIMDDFQLRSLQSRMKSIISGRNIDD 116

Query: 81  SV---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST 134
           SV   +P+ SG ++ Q+  YIV  ++G    T+++  DT +D +WV   PC  C      
Sbjct: 117 SVDAPIPLTSGIRL-QTLNYIVTVELGGRKMTVIV--DTGSDLSWVQCQPCKRCYNQQDP 173

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN-------LTYGS-STIAANLSQDT 186
           VFN + S +++ + C +  C+ + + T   G C  N       + YG  S     L  + 
Sbjct: 174 VFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEH 233

Query: 187 ISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK 245
           + L     V  + FGC +   G      GL+GLGR SLSL++QT  ++   FSYCLP   
Sbjct: 234 LDLGNSTAVNNFIFGCGRNNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFSYCLP-IT 292

Query: 246 ALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
               SGSL +G      +    I YT ++ NP+    Y++NL  I VG   V  P     
Sbjct: 293 ETEASGSLVMGGNSSVYKNTTPISYTRMIPNPQL-PFYFLNLTGITVGSVAVQAPSFGKD 351

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PI 357
                  G +IDSGTV TRL    Y A++D F ++     +  +    DTC+++     +
Sbjct: 352 -------GMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEV 404

Query: 358 VAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNH 414
             P I + F G   +NV +      + + A  + CLA+A+   +  + + +I N QQ+N 
Sbjct: 405 EIPNIKMHFEGNAELNVDVTGVFYFVKTDASQV-CLAIASL--SYENEVGIIGNYQQKNQ 461

Query: 415 RILYDVPNSRLGVARELCT 433
           R++YD   S LG A E CT
Sbjct: 462 RVIYDTKGSMLGFAAEACT 480


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 117/374 (31%), Positives = 170/374 (45%), Gaps = 48/374 (12%)

Query: 84  PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSA 139
           P+ SG  ++Q S  Y  R  +GTPA+ + + +DT +D  W+   PC+ C   S  VFN  
Sbjct: 150 PVVSG--VSQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPT 207

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
            S+T+K+L C A QC  +    C    C + ++YG  +           LATD V   TF
Sbjct: 208 SSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVG------ELATDTV---TF 258

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQ--------------TQNLYQSTFSYCLPSFK 245
           G   K   N V     LG G  +  L                 T  +  ++FSYCL   +
Sbjct: 259 GNSGKI--NDVA----LGCGHDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCLVD-R 311

Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
               S SL    +         PLL+N +  + YYV L    VG + V +P      + +
Sbjct: 312 DSGKSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDAS 371

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT--VTSLGGFDTCYSV----PIVA 359
              G I+D GT  TRL   AY ++RD F  ++ +NL    +S+  FDTCY       +  
Sbjct: 372 GSGGVILDCGTAVTRLQTQAYNSLRDAF-LKLTTNLKKGTSSISLFDTCYDFSSLSSVKV 430

Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           PT+   F+ G ++ LP  N LI        C A A      +S L++I N+QQQ  RI Y
Sbjct: 431 PTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAP----TSSSLSIIGNVQQQGTRITY 486

Query: 419 DVPNSRLGVARELC 432
           D+ N  +G++   C
Sbjct: 487 DLANKIIGLSGNKC 500


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 128/424 (30%), Positives = 192/424 (45%), Gaps = 39/424 (9%)

Query: 38  HVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL------AVARKSVVPIASGRQI 91
           H+ S  S  KPS    ++   L  LA+D AR++ L +        V+   + P  S  + 
Sbjct: 71  HLRSRASIQKPSH-RDYKSLTLSRLARDSARVKSLQTRLDLVLKRVSNSDLHPAESNAEF 129

Query: 92  T----QSP----------TYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST 134
                Q P           Y +R  IG P     + +DT +D +W+   PC+ C   S  
Sbjct: 130 EANALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDP 189

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDI 193
           +F+   S ++  + C A QCK +    C  G C + ++YG  S      + +T++L T  
Sbjct: 190 IFDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGTAA 249

Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
           V     GC     G  V   GLLGLG G LS  AQ      ++FSYCL +  + + S   
Sbjct: 250 VENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVN---ATSFSYCLVNRDSDAVSTLE 306

Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
              P+  P+ +   PL +NP   + YY+ L  I VG   + IP    + +   G G IID
Sbjct: 307 FNSPL--PRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIID 364

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF-SG 368
           SGT  TRL +  Y A+RD F +          +  FDTCY +     +  PT++  F  G
Sbjct: 365 SGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPEG 424

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
             + LP  N LI   +    C A A       S L+++ N+QQQ  R+ +D+ NS +G +
Sbjct: 425 RELPLPARNYLIPVDSVGTFCFAFAP----TTSSLSIMGNVQQQGTRVGFDIANSLVGFS 480

Query: 429 RELC 432
            + C
Sbjct: 481 ADSC 484


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 117/362 (32%), Positives = 176/362 (48%), Gaps = 25/362 (6%)

Query: 84  PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
           P+ SG  + Q S  Y  R  +G PA+ L M +DT +D  W+ C  C  C   S  V++ +
Sbjct: 151 PVVSG--VGQGSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPS 208

Query: 140 QSTTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLSQDTISLATDI-VP 195
            ST++  +GC + +C+ +    C    G+C + + YG  S    + + +T++L     V 
Sbjct: 209 VSTSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDSAPVS 268

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
               GC     G  V   GLL LG G LS  +Q   +  +TFSYCL    + S S +L+ 
Sbjct: 269 NVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISATTFSYCLVDRDSPS-SSTLQF 324

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           G   QP      PL+++PR ++ YYV L  I VG   + IP  A   +     G I+DSG
Sbjct: 325 GDSEQPAVT--APLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSG 382

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMN 370
           T  TRL + AY A+R+ F +   S    + +  FDTCY +     +  P + L F  G  
Sbjct: 383 TAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWFEGGGE 442

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           + LP  N LI   A    CLA A      +  +++I N+QQQ  R+ +D   + +G   +
Sbjct: 443 LKLPAKNYLIPVDAAGTYCLAFA----GTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTAD 498

Query: 431 LC 432
            C
Sbjct: 499 KC 500


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 122/416 (29%), Positives = 198/416 (47%), Gaps = 53/416 (12%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLA------------VARKSVVPIASGRQITQSPTYIVR 100
           S EE +  + + D AR+  L   A             A    VP+ SG ++ ++  Y+  
Sbjct: 71  SREEELGGLFSSDAARVSSLQRRAGGGSWAEDEAAAAAATGRVPVTSGARL-RTLNYVAT 129

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCK-- 155
             +G    T+++  DT+++  WV C  C  C      +F+ A S ++  L C ++ C   
Sbjct: 130 VGLGGGEATVIV--DTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDAL 187

Query: 156 QVPNPTCGGG-------ACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATG 207
           QV   +  G        +C++ L+Y   + +   L+ D +SLA +++ G+ FGC     G
Sbjct: 188 QVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGTSNQG 247

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR---- 263
                 GL+GLGR  LSL++QT + +   FSYCLP  K    SGSL LG      R    
Sbjct: 248 PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLP-LKESESSGSLVLGDDTSVYRNSTP 306

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           I YT ++ +P +   Y+VNL  I +G + V+   G +          I+DSGT+ T LV 
Sbjct: 307 IVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKV----------IVDSGTIITSLVP 356

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDN-- 377
             Y AV+  F  +             DTC+++     +  P++  +F G NV +  D+  
Sbjct: 357 SVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEG-NVEVEVDSSG 415

Query: 378 -LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            L   S+  S  CLA+A+      +  ++I N QQ+N R+++D   S++G A+E C
Sbjct: 416 VLYFVSSDSSQVCLALASLKSEYET--SIIGNYQQKNLRVIFDTLGSQIGFAQETC 469


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 122/416 (29%), Positives = 198/416 (47%), Gaps = 53/416 (12%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLA------------VARKSVVPIASGRQITQSPTYIVR 100
           S EE +  + + D AR+  L   A             A    VP+ SG ++ ++  Y+  
Sbjct: 72  SREEELGGLFSSDAARVSSLQRRAGGGSWAEDEAAAAAATGRVPVTSGARL-RTLNYVAT 130

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCK-- 155
             +G    T+++  DT+++  WV C  C  C      +F+ A S ++  L C ++ C   
Sbjct: 131 VGLGGGEATVIV--DTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDAL 188

Query: 156 QVPNPTCGGG-------ACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATG 207
           QV   +  G        +C++ L+Y   + +   L+ D +SLA +++ G+ FGC     G
Sbjct: 189 QVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGTSNQG 248

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR---- 263
                 GL+GLGR  LSL++QT + +   FSYCLP  K    SGSL LG      R    
Sbjct: 249 PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLP-LKESESSGSLVLGDDTSVYRNSTP 307

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           I YT ++ +P +   Y+VNL  I +G + V+   G +          I+DSGT+ T LV 
Sbjct: 308 IVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKV----------IVDSGTIITSLVP 357

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDN-- 377
             Y AV+  F  +             DTC+++     +  P++  +F G NV +  D+  
Sbjct: 358 SVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEG-NVEVEVDSSG 416

Query: 378 -LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            L   S+  S  CLA+A+      +  ++I N QQ+N R+++D   S++G A+E C
Sbjct: 417 VLYFVSSDSSQVCLALASLKSEYET--SIIGNYQQKNLRVIFDTLGSQIGFAQETC 470


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 133/419 (31%), Positives = 204/419 (48%), Gaps = 44/419 (10%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI----TQSPTYIVRAKIGTPAQTLL 111
           E V E L++ Q+++Q   +  +  +   P +  R +         + ++  IG+  + L 
Sbjct: 55  EQVRESLSRIQSQVQDNQNNHLDLRGNRPTSGVRSVVTPLEDYALFSMQLGIGSLQKNLS 114

Query: 112 MAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----- 166
             +DT ++A  V C      S  VF+ A S +++ + C +  C  V   T  G +     
Sbjct: 115 AIIDTGSEAVLVQCGSR---SRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVN 171

Query: 167 ----CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYT-------FGCIQKATGNSVP--P 212
               C ++L+YG S  +  + SQD I L +    G         FGC     G  V    
Sbjct: 172 SSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGS 231

Query: 213 QGLLGLGRGSLSLLAQTQN-LYQSTFSYCLPSFKAL-SFSGSLRLGPIGQPK-RIKYTPL 269
            G++G  RG+LSL +Q ++ L  S FSYC PS       +G + LG  G  K ++ YTPL
Sbjct: 232 LGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVGYTPL 291

Query: 270 LKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPA 325
           L NP    RS LYYV L +I V  + + IP  A + +P+TG  GT++DSGT FTR+V  A
Sbjct: 292 LDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDA 351

Query: 326 YTAVRDVF--RRRVGSNLTVTSLGGFDTCY------SVPIVAPTITLMFSGMNVTLPQDN 377
           YTA R+ F    R G    V +  GFD CY      S+P V      + + + + L  ++
Sbjct: 352 YTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEH 411

Query: 378 LLIH-STAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           L +  S AG+    CLA+ ++  +    +NV+ N QQ N+ + YD   SR+G  R  C+
Sbjct: 412 LFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADCS 470


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 128/446 (28%), Positives = 198/446 (44%), Gaps = 59/446 (13%)

Query: 31  SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS--- 81
           ++ + + H   PCSP   +  KP S  E    +LA DQ R + +    S+ A  R     
Sbjct: 89  TTRMTIVHRHGPCSPLAAAHRKPPSHGE----ILAADQNRAESIQHRVSTTATGRGKPKR 144

Query: 82  ---------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
                                 +P +SGR +     Y+V   +GTP     +  DT +D 
Sbjct: 145 SRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPVSRYTVVFDTGSDT 203

Query: 121 AWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
            WV C  CV         +F+ A+S+T+ N+ C A  C  +    C GG C + + YG  
Sbjct: 204 TWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYGDG 263

Query: 177 TIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
           + +    + DT++L++ D V G+ FGC ++  G      GLLGLGRG  SL  QT + Y 
Sbjct: 264 SYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYG 323

Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
             F++CLP+    +       G +        TP+L +    + YYV +  IRVG +++ 
Sbjct: 324 GVFAHCLPARSTGTGYLDFGAGSLAAASARLTTPMLTD-NGPTFYYVGMTGIRVGGQLLS 382

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFD 350
           IP           AGTI+DSGTV TRL   AY+++R         R        SL   D
Sbjct: 383 IPQSVFAT-----AGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSL--LD 435

Query: 351 TCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           TCY       +  PT++L+F G        + ++++ + S  CLA AA  D  +  + ++
Sbjct: 436 TCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGD--VGIV 493

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N Q +   + YD+    +G     C
Sbjct: 494 GNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 133/434 (30%), Positives = 200/434 (46%), Gaps = 52/434 (11%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
           L + H  SPCSP     PL  +     +L  D AR+  L++      S  P    R  + 
Sbjct: 43  LTLHHPRSPCSP----APLPADVPFSAVLTHDHARIASLAARLAKTPSSRPTKLRRGSSS 98

Query: 94  SP-------------------TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC-- 131
           SP                    Y+ R  +GTPA++ +M +DT +   W+ C+ C V C  
Sbjct: 99  SPDAESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHR 158

Query: 132 -SSTVFNSAQSTTFKNLGCQAAQCKQVP----NP-TCG-GGACAFNLTYGSSTIAAN-LS 183
            S  VFN   S+++ ++ C A QC  +     NP TC     C +  +YG S+ +   LS
Sbjct: 159 QSGPVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLS 218

Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
           +DT+S  +  VP + +GC Q   G      GL+GL R  LSLL Q       +FSYCLP+
Sbjct: 219 KDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPT 278

Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
             + S   S+       P +  YTP+ K+    SLY++ +  I V  + + +   A    
Sbjct: 279 SSSSSGYLSIG---SYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSL 335

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SVPIVAP 360
           P     TIIDSGTV TRL    Y+A+       +      ++    DTC+   +  +  P
Sbjct: 336 P-----TIIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQASRLRVP 390

Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
            +++ F+ G  + L   NLL+   + + TCLA A A         +I N QQQ   ++YD
Sbjct: 391 QVSMAFAGGAALKLKATNLLVDVDSAT-TCLAFAPARSAA-----IIGNTQQQTFSVVYD 444

Query: 420 VPNSRLGVARELCT 433
           V NS++G A   C+
Sbjct: 445 VKNSKIGFAAGGCS 458


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 129/434 (29%), Positives = 199/434 (45%), Gaps = 52/434 (11%)

Query: 31  SSTLQV--FHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG 88
           S+TL V   H + PC+  + S   +   S  E L   +AR  ++ S A    +  P  + 
Sbjct: 52  SATLSVPLVHRYGPCAASQYSDMPT--PSFSETLRHSRARTNYIKSRASTGMASTPDDAA 109

Query: 89  RQI-------TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVF 136
             +         S  Y+V    GTP+   ++ MDT +D +WV C     T C      +F
Sbjct: 110 VTVPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLF 169

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNP-----TCGGGACAFNLTYGS-STIAANLSQDTISLA 190
           + ++S+T+  + C A  C ++ +      T GG  C + + YG  S+     S +TI+ A
Sbjct: 170 DPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFA 229

Query: 191 TDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
             I V  + FGC     G S    GLLGLG    SL+ QT ++Y   FSYCLP+    S 
Sbjct: 230 PGITVKDFHFGCGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALN--SE 287

Query: 250 SGSLRLG----PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           +G L LG             +TP+   P  ++ Y VN+  I VG + +DIP  A +    
Sbjct: 288 AGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFR---- 343

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
              G +IDSGT+ T L   AY A+    R+   +   V S   FDTCY+      +  P 
Sbjct: 344 --GGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASE-DFDTCYNFTGYSNVTVPR 400

Query: 362 ITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           + L FSG   +++ +P + +L+        CLA   +  +V   L +I N+ Q+   +LY
Sbjct: 401 VALTFSGGATIDLDVP-NGILVKD------CLAFRESGPDVG--LGIIGNVNQRTLEVLY 451

Query: 419 DVPNSRLGVARELC 432
           D  + ++G     C
Sbjct: 452 DAGHGKVGFRAGAC 465


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 115/366 (31%), Positives = 165/366 (45%), Gaps = 32/366 (8%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG     S  Y  R  +GTPA+ + + +DT +D  W+ C  C  C   S  VFN   
Sbjct: 150 PVVSGAS-QGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTS 208

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
           S+T+K+L C A QC  +    C    C + ++YG  +           LATD V   TFG
Sbjct: 209 SSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVG------ELATDTV---TFG 259

Query: 201 CIQK----ATGNSVPPQGLL----GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
              K    A G     +GL     GL      +L+ T  +  ++FSYCL   +    S S
Sbjct: 260 NSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVD-RDSGKSSS 318

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           L    +         PLL+N +  + YYV L    VG   V +P      + +   G I+
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378

Query: 313 DSGTVFTRLVAPAYTAVRDVF-RRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS 367
           D GT  TRL   AY ++RD F +  V      +S+  FDTCY       +  PT+   F+
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFT 438

Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
            G ++ LP  N LI        C A A      +S L++I N+QQQ  RI YD+  + +G
Sbjct: 439 GGKSLDLPAKNYLIPVDDSGTFCFAFAP----TSSSLSIIGNVQQQGTRITYDLSKNVIG 494

Query: 427 VARELC 432
           ++   C
Sbjct: 495 LSGNKC 500


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 122/348 (35%), Positives = 176/348 (50%), Gaps = 33/348 (9%)

Query: 112 MAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG--GGA 166
           M +DT +D  WV C  C  C   S  VF+  +S+++  +GC AA C+++ +  C    GA
Sbjct: 1   MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60

Query: 167 CAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
           C + + YG  ++ A +   +T++ A    V     GC     G  V   GLLGLGRG LS
Sbjct: 61  CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLS 120

Query: 225 LLAQTQNLYQSTFSYCL---PSFKALSFSGSLR-------LGPIGQPKRIKYTPLLKNPR 274
              Q    Y  +FSYCL    S  A +  GS R        G +G      +TP+++NPR
Sbjct: 121 FPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASS-ASFTPMVRNPR 179

Query: 275 RSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDV 332
             + YYV L+ I V G RV  +    L+ +P+TG  G I+DSGT  TRL   +Y+A+RD 
Sbjct: 180 METFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDA 239

Query: 333 FRRRVGSNLTVTSLGG---FDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTA 384
           FR      L + S GG   FDTCY +     +  PT+++ F+ G    LP +N LI   +
Sbjct: 240 FRAAAAGGLRL-SPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDS 298

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A A     V    ++I N+QQQ  R+++D    R+G A + C
Sbjct: 299 RGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 116/384 (30%), Positives = 171/384 (44%), Gaps = 38/384 (9%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS----STVFNSA 139
           P+ SG   T S  Y V  ++GTP Q+LL+  DT +D  WV C+ C  CS    S+ F   
Sbjct: 76  PLISGAS-TGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPR 134

Query: 140 QSTTFKNLGCQAAQCKQVP-------NPTCGGGACAFNLTYGSSTIAANL-SQDTISL-- 189
            S++F    C    C+ +P       N T     C F  +Y   ++++   S++T +L  
Sbjct: 135 HSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKS 194

Query: 190 --ATDI-VPGYTFGCIQKATGNSVP------PQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
              ++I + G +FGC  + +G SV        +G++GLGRGS+S  +Q    + + FSYC
Sbjct: 195 LSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYC 254

Query: 241 L-------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           L       P    L   G L   P+    +I YTPL  NP   + YY+ + +I +    +
Sbjct: 255 LMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKL 314

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
            I P   + +     GT++DSGT  T L   AY  V    RRRV          GFD C 
Sbjct: 315 PINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCV 374

Query: 354 SVPIVA-----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
           +    +     P +     G  V  P        T   + CLA+ A      +  +VI N
Sbjct: 375 NASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAV--ESGNGFSVIGN 432

Query: 409 MQQQNHRILYDVPNSRLGVARELC 432
           + QQ   + +D   SRLG  R  C
Sbjct: 433 LMQQGFLLEFDKEESRLGFTRRGC 456


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 136/444 (30%), Positives = 200/444 (45%), Gaps = 63/444 (14%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---------------LAVA 78
           L + H  SPCSP     PL  +     ++  D AR+  L+S               L   
Sbjct: 45  LTLHHPQSPCSP----APLPSDLPFSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGH 100

Query: 79  RK-------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
           RK             S VP+  G  +     Y+ R  +GTPA + +M +DT +   W+ C
Sbjct: 101 RKKKAGGVGGSQASSSSVPLTPGASVAVG-NYVTRLGLGTPATSYVMVVDTGSSLTWLQC 159

Query: 126 TGC-VGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGS 175
           + C V C   +  VF+   S T+  + C +++C ++   T    AC+      +  +YG 
Sbjct: 160 SPCSVSCHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGD 219

Query: 176 STIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
           S+ +   LS+DT+S  +   PG+ +GC Q   G      GL+GL +  LSLL Q      
Sbjct: 220 SSYSVGYLSKDTVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLG 279

Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
             FSYCLP+  A   +G L +G    P +  YTP+  +   +SLY+V L  I V    + 
Sbjct: 280 YAFSYCLPTSSAA--AGYLSIGSY-NPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLA 336

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY 353
           +PP   +  P     TIIDSGTV TRL    YTA+ R V      +     +    DTC+
Sbjct: 337 VPPSEYRSLP-----TIIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCF 391

Query: 354 ---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
              +  +  P + + F+ G  + L   N+LI     S TCLA A           +I N 
Sbjct: 392 RGSAAGLRVPRVDMAFAGGATLALSPGNVLI-DVDDSTTCLAFAPTGGTA-----IIGNT 445

Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
           QQQ   ++YDV  SR+G A   C+
Sbjct: 446 QQQTFSVVYDVAQSRIGFAAGGCS 469


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 115/366 (31%), Positives = 165/366 (45%), Gaps = 32/366 (8%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG     S  Y  R  +GTPA+ + + +DT +D  W+ C  C  C   S  VFN   
Sbjct: 150 PVVSGAS-QGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTS 208

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
           S+T+K+L C A QC  +    C    C + ++YG  +           LATD V   TFG
Sbjct: 209 SSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVG------ELATDTV---TFG 259

Query: 201 CIQK----ATGNSVPPQGLL----GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
              K    A G     +GL     GL      +L+ T  +  ++FSYCL   +    S S
Sbjct: 260 NSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVD-RDSGKSSS 318

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           L    +         PLL+N +  + YYV L    VG   V +P      + +   G I+
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378

Query: 313 DSGTVFTRLVAPAYTAVRDVF-RRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS 367
           D GT  TRL   AY ++RD F +  V      +S+  FDTCY       +  PT+   F+
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFT 438

Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
            G ++ LP  N LI        C A A      +S L++I N+QQQ  RI YD+  + +G
Sbjct: 439 GGKSLDLPAKNYLIPVDDSGTFCFAFAP----TSSSLSIIGNVQQQGTRITYDLSKNVIG 494

Query: 427 VARELC 432
           ++   C
Sbjct: 495 LSGNKC 500


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 136/411 (33%), Positives = 202/411 (49%), Gaps = 39/411 (9%)

Query: 55  EESVLEMLAKDQARLQFLSS---LAVARKSVV-------PIASGRQITQSPTYIVRAKIG 104
           E+ +LE L +D+ R++++ S   LA  +K          P+ SG  +  S  Y VR  +G
Sbjct: 3   EQLLLETLQRDERRVRWIESKAKLAGKKKDEASSTDLNGPVTSGL-LYGSGEYFVRLGLG 61

Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
           TPA++L M +DT +D  W+ C  C  C   +  +F+   S++F+ + C +  CK +   +
Sbjct: 62  TPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKALEVHS 121

Query: 162 CGG--GA---CAFNLTYGSSTIA-ANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQG 214
           C G  GA   C++ + YG  + +  + S D  +L T        FGC     G      G
Sbjct: 122 CSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAG 181

Query: 215 LLGLGRGSLSLLAQ-----TQNLYQSTFSYCL--PSFKALSFSGSLRLGPIGQPKRIKYT 267
           LLGLG G LS  +Q     T +   ++FSYCL   S      S SL  G    P     +
Sbjct: 182 LLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAAIPSTAALS 241

Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
           PLLKNP+  + YY  ++ + VG   + I   +LQ + +   G IIDSGT  TR     Y 
Sbjct: 242 PLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYA 301

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMF-SGMNVTLPQDNLLIH- 381
            +RD FR    +  +      FDTCY+      +  P + L F +G ++ LP  N LI  
Sbjct: 302 TIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPI 361

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +TAGS  CLA   AP ++   L +I N+QQQ+ RI +D+  S L  A + C
Sbjct: 362 NTAGSF-CLAF--APTSME--LGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 123/435 (28%), Positives = 191/435 (43%), Gaps = 33/435 (7%)

Query: 8   FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
           F + LF F  ++        ++H S           SP +     +  E  +  + +   
Sbjct: 15  FCSVLFCFVFNQVFRAELIYREHQS-----------SPLRSETLKTPSEIFIAAVKRGHE 63

Query: 68  RLQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
           R   L+   +A   +   P+ASG        Y++    G P Q     +DT +D  WV C
Sbjct: 64  RRARLAKHVLAGDQLFETPVASGNG-----EYLIDISYGNPPQKSTAIVDTGSDLNWVQC 118

Query: 126 TGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAAN 181
             C  C  T+   F+ ++S ++K LGC +  C+ +P  +C   +C ++  YG  S+ +  
Sbjct: 119 LPCKSCYETLSAKFDPSKSASYKTLGCGSNFCQDLPFQSC-AASCQYDYMYGDGSSTSGA 177

Query: 182 LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
           LS D +++ T  +P   FGC     G      GL+GLG+G LSL++Q        FSYCL
Sbjct: 178 LSTDDVTIGTGKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCL 237

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
               +   S  L +G       + YTP+L N    + YY  L  I V  + V+ P     
Sbjct: 238 VPLGSTKTS-PLYIGDSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFD 296

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-- 359
              T   G I+DSGT  T L   A+  +    +  +       S  G + C+S   VA  
Sbjct: 297 IAATGRGGLILDSGTTLTYLDVDAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVANP 356

Query: 360 --PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
             PT+   F+G +V L  DN  I       TCLAMA++     +  ++  N+QQ NH I+
Sbjct: 357 TYPTVVFHFNGADVALAPDNTFIALDFEGTTCLAMASS-----TGFSIFGNIQQLNHVIV 411

Query: 418 YDVPNSRLGVARELC 432
           +D+ N R+G     C
Sbjct: 412 HDLVNKRIGFKSANC 426


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 111/368 (30%), Positives = 169/368 (45%), Gaps = 47/368 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y+VR  +GTP + + + +DT +D  W  C  C  C      V + A S+T+  L C AA+
Sbjct: 84  YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAAR 143

Query: 154 CKQVPNPTCG------GGACAFNLTYGSSTI-AANLSQDTISLATDIVPG-------YTF 199
           C+ +P  +CG        +C +   YG  ++    ++ D  +       G        TF
Sbjct: 144 CRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLTF 203

Query: 200 GC--IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLG 256
           GC  + K    S    G+ G GRG  SL +Q   L  ++FSYC  S F++ S   +L   
Sbjct: 204 GCGHLNKGVFQS-NETGIAGFGRGRWSLPSQ---LNVTSFSYCFTSMFESKSSLVTLGGS 259

Query: 257 PI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
           P           ++ TP+LKNP + SLY+++L  I VG+  + +P    +        TI
Sbjct: 260 PAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR-------STI 312

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-------PTITL 364
           IDSG   T L    Y AV+  F  +VG   +       D C+++P+ A       P++TL
Sbjct: 313 IDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTL 372

Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
              G +  LP+ N +       + C+ + AAP        VI N QQQN  ++YD+ N R
Sbjct: 373 HLEGADWELPRSNYVFEDLGARVMCIVLDAAPGE----QTVIGNFQQQNTHVVYDLENDR 428

Query: 425 LGVARELC 432
           L  A   C
Sbjct: 429 LSFAPARC 436


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 120/389 (30%), Positives = 179/389 (46%), Gaps = 27/389 (6%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
           + S L+ +  D+ R Q             P+ SG     S  Y  R  +GTPA+ + + +
Sbjct: 130 DRSDLKPVDIDETRFQ-------PEDLTTPVVSGTS-QGSGEYFSRIGVGTPAKEMYVVL 181

Query: 115 DTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNL 171
           DT +D  W+   PC+ C   S  +F+   S+TFK+L C   +C  +    C    C + +
Sbjct: 182 DTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPKCASLDVSACRSNKCLYQV 241

Query: 172 TYGSSTI-AANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
           +YG  +    N + DT++   +  V     GC     G      GLLGLG G+LS+   T
Sbjct: 242 SYGDGSFTVGNYATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSM---T 298

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG 289
             +   +FSYCL    +   S SL    +         PLL+N +  + YYV L    VG
Sbjct: 299 NQIKAKSFSYCLVDRDSAK-SSSLDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVG 357

Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGG 348
            + V IP    + + +   G I+D GT  TRL   AY ++RD F +        TS +  
Sbjct: 358 GQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISL 417

Query: 349 FDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
           FDTCY       +  PT+T  F+ G ++ LP  N LI        C A A      +S L
Sbjct: 418 FDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAP----TSSSL 473

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
           ++I N+QQQ  RI YD+ N+ +G++   C
Sbjct: 474 SIIGNVQQQGTRITYDLANNLIGLSANKC 502


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 126/385 (32%), Positives = 187/385 (48%), Gaps = 59/385 (15%)

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS-----STVFNSAQSTTFKNL 147
           QS  Y+V   IGTP +   +  DT +D  WV C  C   S       +F+ ++S+T+ ++
Sbjct: 118 QSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDV 177

Query: 148 GCQAAQCK--QVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLA--TDIVP---GYTF 199
            C A +C    V    CG  +C +++ YG  S    +L+++T +L+  + + P   G  F
Sbjct: 178 PCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVF 237

Query: 200 GCIQK------ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS---TFSYCLPSFKALSFS 250
           GC  +       TG  V   GLLGLGRG  S+L+QT+    S    FSYCLP     S +
Sbjct: 238 GCSHEYISVFNDTGMGV--AGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRG--SST 293

Query: 251 GSLRLG-----PIGQPKRIKYTPLLKN-PRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
           G L +G     P  Q   + +TPL+    +  S Y VNL  + V    VDIP  A     
Sbjct: 294 GYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL-- 351

Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSVP----IV 358
               G +IDSGTV T + A AY  +RD FR  +GS   L   S+   DTCY V     + 
Sbjct: 352 ----GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVT 407

Query: 359 APTITLMF----------SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
           AP + L F          SG+ + LP ++     +  S+T   +A  P N ++ L ++ N
Sbjct: 408 APRVALEFGGGARIDVDASGILLVLPAED----GSGQSLTLACLAFLPTN-SAGLVIVGN 462

Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
           MQQ+ + +++DV   R+G     C+
Sbjct: 463 MQQRAYNVVFDVDGGRIGFGPNGCS 487


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 132/444 (29%), Positives = 198/444 (44%), Gaps = 64/444 (14%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL------AVARK------- 80
           L + H  SPCSP     PL  +     +L  D AR   L+S       A +R+       
Sbjct: 47  LTLHHPQSPCSP----APLPSDLPFSTVLTHDDARAAHLASRLATTSNAPSRRPTTSLRK 102

Query: 81  ----------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVP 124
                           + VP+  G  +     Y+    +GTPA +  M +DT +   W+ 
Sbjct: 103 PKAAAGASGGPLDDSLASVPLTPGTSVGVG-NYVTELGLGTPATSYAMVVDTGSSLTWLQ 161

Query: 125 CTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYG 174
           C+ CV  C   V   ++   S+T+  + C A+QC ++   T    AC+      +  +YG
Sbjct: 162 CSPCVVSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASYG 221

Query: 175 SSTIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
            S+ +   LS+DT+S  +   P + +GC Q   G      GL+GL R  LSLL Q     
Sbjct: 222 DSSFSVGYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSL 281

Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
             +FSYCLP+  +   +G L +GP        YTP+  +   +SLY+V L  + VG   +
Sbjct: 282 GYSFSYCLPTPAS---TGYLSIGPYTS-GHYSYTPMASSSLDASLYFVTLSGMSVGGSPL 337

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
            + P      P     TIIDSGTV TRL    YTA+       +    +  +    DTC+
Sbjct: 338 AVSPAEYSSLP-----TIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTCF 392

Query: 354 ---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
              +  +  P + + F+ G  + L   N+LI     S TCLA   AP +  +   +I N 
Sbjct: 393 QGQASQLRVPAVAMAFAGGATLKLATQNVLID-VDDSTTCLAF--APTDSTT---IIGNT 446

Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
           QQQ   ++YDV  SR+G A   C+
Sbjct: 447 QQQTFSVVYDVAQSRIGFAAGGCS 470


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 168/358 (46%), Gaps = 19/358 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           PI SG     S  Y  R  IG P   + M +DT +D +WV C  C  C   +  +F    
Sbjct: 139 PIVSGAS-QGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTS 197

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
           S +F +L C+  QCK +    C  G C + ++YG  S    +   +T++L +  +     
Sbjct: 198 SASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAI 257

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           GC     G  +   GLLGLG GSLS  +Q   L  S+FSYCL    + S S      PI 
Sbjct: 258 GCGHNNEGLFIGAAGLLGLGGGSLSFPSQ---LNASSFSYCLVDRDSDSTSTLDFNSPI- 313

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
            P  +   PL +NP   + +Y+ L  + VG  V+ IP  + Q +     G I+DSGT  T
Sbjct: 314 TPDAVT-APLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVT 372

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLP 374
           RL    Y  +RD F +      T   +  FDTCY +     +  PT++  F+ G  + LP
Sbjct: 373 RLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLP 432

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             N LI   +    C A A      +S L+++ N QQQ  R+ +D+ NS +G +   C
Sbjct: 433 AKNYLIPVDSEGTFCFAFAP----TDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 111/340 (32%), Positives = 174/340 (51%), Gaps = 28/340 (8%)

Query: 112 MAMDTSNDAAWVPCTGC-VGCSST---VFNSAQSTTFKNLGCQAAQCKQVP-----NPTC 162
           M +DT +  +W+ C  C V C +    +++ + S T+K L C + +C ++      +P C
Sbjct: 1   MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60

Query: 163 --GGGACAFNLTYGSSTIA-ANLSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
                AC +  +YG ++ +   LSQD ++L ++  +P +T+GC Q   G      G++GL
Sbjct: 61  ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGL 120

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL 278
            R  LS+LAQ    Y   FSYCLP+  + S  G         P   K+TP+L + +  SL
Sbjct: 121 ARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSL 180

Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG 338
           Y++ L AI V  R +D+   A+   P     T+IDSGTV TRL    Y A+R  F + + 
Sbjct: 181 YFLRLTAITVSGRPLDL-AAAMYRVP-----TLIDSGTVITRLPMSMYAALRQAFVKIMS 234

Query: 339 SNLTVT-SLGGFDTCYSVPI----VAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM 392
           +      +    DTC+   +      P I ++F  G ++TL   ++LI +  G ITCLA 
Sbjct: 235 TKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKG-ITCLAF 293

Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           A +  +  + + +I N QQQ + I YDV  SR+G A   C
Sbjct: 294 AGS--SGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 118/374 (31%), Positives = 174/374 (46%), Gaps = 42/374 (11%)

Query: 84  PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
           P+ SG  + Q S  Y  R  IG+PA+ L M +DT +D  W+ C  C  C   S  +F+ A
Sbjct: 184 PVVSG--VGQGSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPA 241

Query: 140 QSTTFKNLGCQAAQCKQVPNPTC------GGGACAFNLTYGS-STIAANLSQDTISLATD 192
            S+++  + C +  C+ +    C      G  +C + + YG  S    + + +T++L  D
Sbjct: 242 LSSSYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGD 301

Query: 193 ---IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSF 244
               V     GC     G  V   GLL LG G LS  +Q   +  + FSYCL     PS 
Sbjct: 302 GSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISATEFSYCLVDRDSPSA 358

Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV-DIPPGALQFN 303
             L F  S               PL+++PR ++ YYV L  I VG   + DIPP A   +
Sbjct: 359 STLQFGAS--------DSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMD 410

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVA 359
                G I+DSGT  TRL + AY+A+RD F R   +    + +  FDTCY +     +  
Sbjct: 411 EQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQV 470

Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           P ++L F  G  + LP  N LI        CLA AA        ++++ N+QQQ  R+ +
Sbjct: 471 PAVSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFAA----TGGAVSIVGNVQQQGIRVSF 526

Query: 419 DVPNSRLGVARELC 432
           D   + +G +   C
Sbjct: 527 DTAKNTVGFSPNKC 540


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  151 bits (381), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 127/443 (28%), Positives = 192/443 (43%), Gaps = 62/443 (13%)

Query: 36  VFHVFSPCSPF---KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
           + H   PCSP       KP S  E    +LA DQ R++   SL     S      G+  T
Sbjct: 77  IVHRHGPCSPLAGAHAGKPPSHAE----ILAADQNRVE---SLHHRVSSTTTGLGGKPRT 129

Query: 93  QSPT-----------------------------YIVRAKIGTPAQTLLMAMDTSNDAAWV 123
           +  T                             Y+V   +GTP     +  DT +D  WV
Sbjct: 130 KKKTPGHSSVPASSSSSSSSVPASSGLSLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWV 189

Query: 124 PCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA 179
            C  CV  C      +F+ A+S+T+ N+ C    C  +    C  G C + + YG  +  
Sbjct: 190 QCRPCVVSCYKQKDRLFDPAKSSTYANVSCADPACADLDASGCNAGHCLYGIQYGDGSYT 249

Query: 180 ANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
               ++DT+++A D + G+ FGC +K  G      GLLGLGRG  S+  Q    Y  +FS
Sbjct: 250 VGFFAKDTLAVAQDAIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFS 309

Query: 239 YCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           YCLP+  A   +G L      P       K TP+L + +  + YYV L  IRVG + +  
Sbjct: 310 YCLPASSAA--TGYLEFGPLSPSSSGSNAKTTPMLTD-KGPTFYYVGLTGIRVGGKQLGA 366

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRL--VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
            P ++  N    +GT++DSGTV TRL   A A  +                +    DTCY
Sbjct: 367 IPESVFSN----SGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCY 422

Query: 354 SV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
                  +  PT++L+F G        + ++++ + S  CL  A+  D+ +  + ++ N 
Sbjct: 423 DFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCLGFASNGDDES--VGIVGNT 480

Query: 410 QQQNHRILYDVPNSRLGVARELC 432
           QQ+ + +LYDV    +G A   C
Sbjct: 481 QQRTYGVLYDVSKKVVGFAPGAC 503


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  151 bits (381), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 126/371 (33%), Positives = 182/371 (49%), Gaps = 48/371 (12%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC 162
           IG+  + L   +DT ++A  V C      S  VF+ A S +++ + C +  C  V   T 
Sbjct: 5   IGSLQKNLSAIIDTGSEAVLVQCGSR---SRPVFDPAASQSYRQVPCISQLCLAVQQQTS 61

Query: 163 GG---------GACAFNLTYGSS-TIAANLSQDTISLAT-----------DIVPGYTFGC 201
            G          AC ++L+YG S     + SQD I L +           D+     FGC
Sbjct: 62  NGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVA----FGC 117

Query: 202 IQKATGNSVP--PQGLLGLGRGSLSLLAQTQN-LYQSTFSYCLPSFKAL-SFSGSLRLGP 257
                G  V     G++G  RG+LSL +Q ++ L  S FSYC PS       +G + LG 
Sbjct: 118 AHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGD 177

Query: 258 IGQPK-RIKYTPLLKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTII 312
            G  K ++ YTPLL NP    RS LYYV L +I V  + + IP  A + +P+TG  GT++
Sbjct: 178 SGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVL 237

Query: 313 DSGTVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDTCY------SVPIVAPTITL 364
           DSGT FTR+V  AYTA R+ F    R G    V +  GFD CY      S+P V      
Sbjct: 238 DSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLS 297

Query: 365 MFSGMNVTLPQDNLLIH-STAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           + + + + L  ++L +  S AG+    CLA+ ++  +    +NV+ N QQ N+ + YD  
Sbjct: 298 LQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNE 357

Query: 422 NSRLGVARELC 432
            SR+G  R  C
Sbjct: 358 RSRVGFERADC 368


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  150 bits (380), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 105/350 (30%), Positives = 173/350 (49%), Gaps = 19/350 (5%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           ++V   +GTP Q  ++ +DT +D  W+   PC  C   +  +F+ ++S+T+  + C ++ 
Sbjct: 25  FLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIACSSSA 84

Query: 154 CKQV-PNPTCGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATG--N 208
           C  +    TC   A C +   YG  ++     S++TI+          FG     TG   
Sbjct: 85  CADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVKFGASVYNTGTFG 144

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-KALSFSGSLRLGPIGQPK-RIKY 266
               +G+LGLG+G +S+ +Q  ++  + FSYCL  +  A S + ++  G    P   ++Y
Sbjct: 145 DTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFGDAAVPSGEVQY 204

Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
           TP++ N    + YY+ +  I VG  ++DI     + +     GTIIDSGT  T L    +
Sbjct: 205 TPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVF 264

Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHS 382
            A+   +  +V    T TS  G D C++       V P +T+   G+++ LP  N  I S
Sbjct: 265 NALVAAYTSQV-RYPTTTSATGLDLCFNTRGTGSPVFPAMTIHLDGVHLELPTANTFI-S 322

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              +I CLA A+A D     + +  N+QQQN  I+YD+ N R+G A   C
Sbjct: 323 LETNIICLAFASALD---FPIAIFGNIQQQNFDIVYDLDNMRIGFAPADC 369


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 121/403 (30%), Positives = 184/403 (45%), Gaps = 54/403 (13%)

Query: 53  SWEESVLEMLAKDQARLQFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAK 102
           S    V+ ++A+D AR++ L    VA  S          VVP         S  Y VR  
Sbjct: 80  SRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVD----DGSGEYFVRVG 135

Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
           +G+P     + +D+ +D  WV   PC  C   +  +F+ A S++F  + C +A C+ +  
Sbjct: 136 VGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSG 195

Query: 160 PTCGGGA----CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQG 214
             CGGG     C +++TYG  S     L+ +T++L    V G   GC  + +G  V   G
Sbjct: 196 TGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAG 255

Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
           LLGLG G++SL+ Q        FSYCL S +    +GSL                     
Sbjct: 256 LLGLGWGAMSLVGQLGGAAGGVFSYCLAS-RGAGGAGSLA-------------------- 294

Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
            SS YYV L  I VG   + +     Q       G ++D+GT  TRL   AY A+R  F 
Sbjct: 295 -SSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFD 353

Query: 335 RRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITC 389
             +G+     ++   DTCY +     +  PT++  F  G  +TLP  NLL+    G++ C
Sbjct: 354 GAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVE-VGGAVFC 412

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LA A +    +S ++++ N+QQ+  +I  D  N  +G     C
Sbjct: 413 LAFAPS----SSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 135/429 (31%), Positives = 207/429 (48%), Gaps = 55/429 (12%)

Query: 28  QDHSSTLQV--FHVFSPCSPFKPSKPLSWE-ESVLEMLAKDQARLQFLSSLAVARKSVVP 84
           + + ST+ V   H   PC+P  PS  LS +  S  ++  + +AR  ++      +K  VP
Sbjct: 48  EQNGSTVYVPLVHRHGPCAP-APS--LSTDTRSFADIFRRSRARPSYI---VRGKKVSVP 101

Query: 85  IASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--C---SSTVFNSA 139
              G  +  S  Y+VR   GTPA   ++ +DT +D +W+ C  C    C      +++ +
Sbjct: 102 AHLGTSV-MSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPS 160

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTY--GSSTIAANLSQDTISLATD 192
            S+T+  + C +  CK++     G G      C F ++Y  G+ST+ A  SQD ++LA  
Sbjct: 161 HSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGA-YSQDKLTLAPG 219

Query: 193 -IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
            IV  + FGC            G+LGLGR   SL A+    Y   FSYCLPS    S  G
Sbjct: 220 AIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVS--SKPG 273

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            L LG    P    +TP+   P + +   V L  I VG + +D+ P A         G I
Sbjct: 274 FLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------GGMI 327

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS 367
           +DSGTV T L + AY A+R  FR+ + +   + + G  DTCY++     +V P I L F+
Sbjct: 328 VDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKIALTFT 386

Query: 368 G---MNVTLPQDNLLIHSTAGSITCLAMA-AAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           G   +N+ +P + +L++       CLA A + PD    VL    N+ Q+   +L+D   S
Sbjct: 387 GGATINLDVP-NGILVNG------CLAFAESGPDGSAGVL---GNVNQRAFEVLFDTSTS 436

Query: 424 RLGVARELC 432
           + G   + C
Sbjct: 437 KFGFRAKAC 445


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 112/346 (32%), Positives = 168/346 (48%), Gaps = 30/346 (8%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGC-VGC---SSTVFNSAQSTTFKNLGCQAAQCKQVP 158
           +GTPA   +M +DT +   W+ C+ C V C   S  VFN   S+T+ ++GC A QC  +P
Sbjct: 3   LGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDLP 62

Query: 159 NPTCGGGACA------FNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVP 211
           + T    AC+      +  +YG S+ +   LS+DT+S  +  +P + +GC Q   G    
Sbjct: 63  SATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSLPNFYYGCGQDNEGLFGR 122

Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLK 271
             GL+GL R  LSLL Q       +F+YCLPS  +  +           P +  YTP++ 
Sbjct: 123 SAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSY----NPGQYSYTPMVS 178

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           +    SLY++ L  + V    + +   A    P     TIIDSGTV TRL    Y+A+  
Sbjct: 179 SSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLP-----TIIDSGTVITRLPTSVYSALSK 233

Query: 332 VFRRRVGSNLTVTSLGGFDTCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSI 387
                +      ++    DTC+   +  + AP +T+ F+ G  + L   NLL+     S 
Sbjct: 234 AVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQNLLVD-VDDST 292

Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           TCLA A A         +I N QQQ   ++YDV +SR+G A   C+
Sbjct: 293 TCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 167/358 (46%), Gaps = 19/358 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           PI SG     S  Y  R  IG P   + M +DT +D +WV C  C  C   +   F    
Sbjct: 139 PIVSGAS-QGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTS 197

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
           S +F +L C+  QCK +    C  G C + ++YG  S    +   +T++L +  +     
Sbjct: 198 SASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAI 257

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           GC     G  +   GLLGLG GSLS  +Q   L  S+FSYCL    + S S      PI 
Sbjct: 258 GCGHNNEGLFIGAAGLLGLGGGSLSFPSQ---LNASSFSYCLVDRDSDSTSTLDFNSPI- 313

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
            P  +   PL +NP   + +Y+ L  + VG  V+ IP  + Q +     G I+DSGT  T
Sbjct: 314 TPDAVT-APLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVT 372

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLP 374
           RL    Y  +RD F +      T   +  FDTCY +     +  PT++  F+ G  + LP
Sbjct: 373 RLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLP 432

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             N LI   +    C A A      +S L+++ N QQQ  R+ +D+ NS +G +   C
Sbjct: 433 AKNYLIPVDSEGTFCFAFAP----TDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 131/434 (30%), Positives = 198/434 (45%), Gaps = 68/434 (15%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS---VVPIAS 87
           S  L +   + PCS    S+P S +E    +  +D++R+ F++S      S        +
Sbjct: 63  SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKCNQYTSGNLKNHAHN 118

Query: 88  GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTF 144
                +   ++V    GTP   + + +DT +   W  C  CV C   S+  F+S+ S+T+
Sbjct: 119 NNLFDEDGNFLVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTY 178

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCI 202
               C  +  +             +N+TYG  ST   N   DT++L  +D+   + FGC 
Sbjct: 179 SFGSCIPSTVEN-----------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCG 227

Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IG 259
           +   G+      G+LGLG+G LS ++QT + +   FSYCLP   ++   GSL  G     
Sbjct: 228 RNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSI---GSLLFGEKATS 284

Query: 260 QPKRIKYTPLLKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
           Q   +K+T L+  P   + S  Y+VNL  I VG   ++IP            GTIIDS T
Sbjct: 285 QSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRT 339

Query: 317 VFTRLVAPAYTAVRDVF------------RRRVGSNLTVTSLGGFDTCYSV----PIVAP 360
           V TRL   AY+A++  F            RR+ G  L        DTCY++     ++ P
Sbjct: 340 VITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL--------DTCYNLSGRKDVLLP 391

Query: 361 TITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
            I L F  G +V L   N++  S A S  CLA A       S L +I N QQ +  +LYD
Sbjct: 392 EIVLHFGGGADVRLNGTNIVWGSDA-SRLCLAFAGT-----SELTIIGNRQQLSLTVLYD 445

Query: 420 VPNSRLGVARELCT 433
           +   R+G     C+
Sbjct: 446 IQGRRIGFGGNGCS 459


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 113/358 (31%), Positives = 169/358 (47%), Gaps = 19/358 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
           P+ SG     S  Y +R  IG P     + +DT +D +W+   PC+ C   S  +F+   
Sbjct: 137 PVVSGTS-QGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPIS 195

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
           S ++  + C   QCK +    C  G C + ++YG  S      + +T++L +  V     
Sbjct: 196 SNSYSPIRCDEPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGSAAVENVAI 255

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           GC     G  V   GLLGLG G LS  AQ      ++FSYCL +  + + S      P+ 
Sbjct: 256 GCGHNNEGLFVGAAGLLGLGGGKLSFPAQVN---ATSFSYCLVNRDSDAVSTLEFNSPL- 311

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
            P+     PL++NP   + YY+ L  I VG   + IP  + + +   G G IIDSGT  T
Sbjct: 312 -PRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVT 370

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLP 374
           RL +  Y A+RD F +          +  FDTCY +     +  PT++  F  G  + LP
Sbjct: 371 RLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLP 430

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             N LI   +    C A A       S L++I N+QQQ  R+ +D+ NS +G + + C
Sbjct: 431 ARNYLIPVDSVGTFCFAFAP----TTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 135/429 (31%), Positives = 207/429 (48%), Gaps = 55/429 (12%)

Query: 28  QDHSSTLQV--FHVFSPCSPFKPSKPLSWE-ESVLEMLAKDQARLQFLSSLAVARKSVVP 84
           + + ST+ V   H   PC+P  PS  LS +  S  ++  + +AR  ++      +K  VP
Sbjct: 14  EQNGSTVYVPLVHRHGPCAP-APS--LSTDTRSFADIFRRSRARPSYI---VRGKKVSVP 67

Query: 85  IASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--C---SSTVFNSA 139
              G  +  S  Y+VR   GTPA   ++ +DT +D +W+ C  C    C      +++ +
Sbjct: 68  AHLGTSV-MSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPS 126

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTY--GSSTIAANLSQDTISLATD 192
            S+T+  + C +  CK++     G G      C F ++Y  G+ST+ A  SQD ++LA  
Sbjct: 127 HSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGA-YSQDKLTLAPG 185

Query: 193 -IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
            IV  + FGC            G+LGLGR   SL A+    Y   FSYCLPS    S  G
Sbjct: 186 AIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVS--SKPG 239

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            L LG    P    +TP+   P + +   V L  I VG + +D+ P A         G I
Sbjct: 240 FLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------GGMI 293

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS 367
           +DSGTV T L + AY A+R  FR+ + +   + + G  DTCY++     +V P I L F+
Sbjct: 294 VDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKIALTFT 352

Query: 368 G---MNVTLPQDNLLIHSTAGSITCLAMA-AAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           G   +N+ +P + +L++       CLA A + PD    VL    N+ Q+   +L+D   S
Sbjct: 353 GGATINLDVP-NGILVNG------CLAFAESGPDGSAGVL---GNVNQRAFEVLFDTSTS 402

Query: 424 RLGVARELC 432
           + G   + C
Sbjct: 403 KFGFRAKAC 411


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 127/430 (29%), Positives = 201/430 (46%), Gaps = 45/430 (10%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQ-------ARLQFLSSLAVARKS 81
           D +S+LQV H + PC        +  + S +E L +DQ       ARL  +S   +  + 
Sbjct: 65  DKASSLQVLHKYGPC------MQVLNDRSHVEFLLQDQLRVDSIQARLSKISGHGIFEEM 118

Query: 82  V--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTV 135
           V  +P  SG  I  +  Y+V   +GTP +   +  DT +   W  C  C+G         
Sbjct: 119 VTKLPAQSGIAIG-TGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQK 177

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYGSSTIAANL-SQDTISLA 190
           F+  +ST++ N+ C +A C  +P    G  A    C + + YG  + +    + +T++++
Sbjct: 178 FDPTKSTSYNNVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTIS 237

Query: 191 T-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
           + D+   + FGC Q   G      GLLGL   S+SL +QT   YQ  FSYCLPS    S 
Sbjct: 238 SSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPS--TPSS 295

Query: 250 SGSLRLGPIGQPKRIK-YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           +G L  G  G+  +   +TP+  +P  SS Y ++++ I V    + I P        T +
Sbjct: 296 TGYLNFG--GKVSQTAGFTPI--SPAFSSFYGIDIVGISVAGSQLPIDPSIF-----TTS 346

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITL 364
           G IIDSGTV TRL   AY A+++ F  ++ +          DTCY       +  P +++
Sbjct: 347 GAIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSV 406

Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            F  G+ V +    +L       + CLA AA  D  +S   +  N QQ+ + ++YD    
Sbjct: 407 SFKGGVEVDIDASGILYLVNGVKMVCLAFAANKD--DSEFGIFGNHQQKTYEVVYDGAKG 464

Query: 424 RLGVARELCT 433
            +G A   C+
Sbjct: 465 MIGFAAGACS 474


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/352 (31%), Positives = 168/352 (47%), Gaps = 30/352 (8%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC---SSTVFNSAQSTTFKNLGCQAA 152
           Y+ R  +GTPA+  +M +DT +   W+ C+ C V C   S  VF+   S+++  + C   
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTP 196

Query: 153 QCKQVPNPTCGGGACA------FNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKA 205
           QC  +   T    AC+      +  +YG S+ +   LS+DT+S  ++ VP + +GC Q  
Sbjct: 197 QCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFGSNSVPNFYYGCGQDN 256

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            G      GL+GL R  LSLL Q       +FSYCLPS  +  +           P +  
Sbjct: 257 EGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYLSIGSY----NPGQYS 312

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
           YTP++ +    SLY++ L  + V  +     P A+  +  +   TIIDSGTV TRL    
Sbjct: 313 YTPMVSSTLDDSLYFIKLSGMTVAGK-----PLAVSSSEYSSLPTIIDSGTVITRLPTTV 367

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIH 381
           Y A+       +       +    DTC+   +  +  P +++ FS G  + L   NLL+ 
Sbjct: 368 YDALSKAVAGAMKGTKRADAYSILDTCFVGQASSLRVPAVSMAFSGGAALKLSAQNLLVD 427

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             + S TCLA A A         +I N QQQ   ++YDV ++R+G A   CT
Sbjct: 428 VDS-STTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSNRIGFAAGGCT 473


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 124/395 (31%), Positives = 173/395 (43%), Gaps = 46/395 (11%)

Query: 71  FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
           F   L    +S V + SG        Y +   +GTP +   + +DT +D  W+ C  C  
Sbjct: 176 FSGQLVATLESGVSLGSGE-------YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYA 228

Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT----CGGG--ACAFNLTYGSS----- 176
           C   +   ++   S++FKN+ C   +C+ V +P     C G   +C +   YG S     
Sbjct: 229 CFEQNGPYYDPKDSSSFKNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTG 288

Query: 177 -----TIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
                T   NL+         IV    FGC     G      GLLGLGRG LS   Q Q+
Sbjct: 289 DFALETFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQS 348

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLL---------KNPRRSSLYYVN 282
           LY  +FSYCL    + S   S  +   G+ K +   P L         +NP   + YYV 
Sbjct: 349 LYGHSFSYCLVDRNSNSSVSSKLI--FGEDKELLSHPNLNFTSFVGGKENPV-DTFYYVL 405

Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           + +I VG  V+ IP      +   G GTIIDSGT  T    PAY  +++ F R++     
Sbjct: 406 IKSIMVGGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL 465

Query: 343 VTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
           V +      CY+V  V     P   ++F+ G     P +N  I      + CLA+   P 
Sbjct: 466 VETFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTP- 524

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              S L++I N QQQN  ILYD+  SRLG A   C
Sbjct: 525 --RSALSIIGNYQQQNFHILYDLKKSRLGYAPMKC 557


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 138/430 (32%), Positives = 198/430 (46%), Gaps = 68/430 (15%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
           S+ L++ H   PC+P + S   +   SV + L  DQ R +++             S A A
Sbjct: 65  SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST- 134
             + VP + G  I  +  Y+V A +GTP     M +DT +D +WV   PC+    C S  
Sbjct: 123 AAATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQK 181

Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----------GACAFNLTYG-SSTIAA 180
             +F+ AQS+++  + C          P C G             C + ++YG  S    
Sbjct: 182 DPLFDPAQSSSYAAVPCG--------GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTG 233

Query: 181 NLSQDTISL-ATDIVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
             S DT++L A+  V G+ FGC    +G  N V   GLLGLGR   SL+ QT   Y   F
Sbjct: 234 VYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVF 291

Query: 238 SYCLPSFKALSFSGSLRL-GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           SYCLP+  + +   +L L GP G       T LL +P   + Y V L  I VG + + +P
Sbjct: 292 SYCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVP 351

Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGFDTCYS 354
             A         GT++D+GTV TRL   AY A+R  FR  + S    T  S G  DTCY+
Sbjct: 352 ASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYN 405

Query: 355 VP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
                 +  P + L F SG  V L  D +L      S  CLA   AP   +  + ++ N+
Sbjct: 406 FAGYGTVTLPNVALTFGSGATVMLGADGIL------SFGCLAF--APSGSDGGMAILGNV 457

Query: 410 QQQNHRILYD 419
           QQ++  +  D
Sbjct: 458 QQRSFEVRID 467


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 165/359 (45%), Gaps = 20/359 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
           PI SG     S  Y  R  +G PA+   M +DT +D  W+   PCT C   +  +F+   
Sbjct: 143 PIISGTS-QGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRS 201

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLA-TDIVPGYT 198
           S++F +L C++ QC+ +    C    C + ++YG  +        +T++   + ++    
Sbjct: 202 SSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVTETLTFGNSGMINDVA 261

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
            GC     G  V   G  GL       L+ T  +  S+FSYCL   +  S S  L     
Sbjct: 262 VGCGHDNEGLFV---GSAGLLGLGGGPLSLTSQMKASSFSYCLVD-RDSSSSSDLEFNSA 317

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
             P      PLLK+ +  + YYV L  + VG +++ IPP   Q + +   G I+DSGT  
Sbjct: 318 A-PSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAI 376

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTL 373
           TRL   AY  +RD F  R            FDTCY +     +  PT++  F+ G ++ L
Sbjct: 377 TRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQL 436

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           P  N LI   +    C A A       S L++I N+QQQ  R+ YD+ NS +G +   C
Sbjct: 437 PPKNYLIPVDSVGTFCFAFAP----TTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/368 (29%), Positives = 171/368 (46%), Gaps = 22/368 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG  +  S  Y V   +GTP Q   + +D+ +D  WV C  C+ C    + ++  + 
Sbjct: 53  PVVSGSTLG-SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSN 111

Query: 141 STTFKNLGCQAAQCKQVPNPTCG-------GGACAFNLTYGSSTIAANLSQDTISLATDI 193
           S+TF  + C + +C  +P  T G        GACA+   Y  ++++  +     +   D+
Sbjct: 112 SSTFNPVPCLSPECLLIP-ATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDV 170

Query: 194 -VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-KALSFSG 251
            +    FGC +   G+     G+LGLG+G LS  +Q    Y + F+YCL ++    S S 
Sbjct: 171 RIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSS 230

Query: 252 SLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
            L  G   I     +++TP++ N R  +LYYV +  + VG   + I   A   +     G
Sbjct: 231 WLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGG 290

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLM 365
           +I DSGT  T  + PAY  +   F + V       S+ G D C  V  V     P+ T++
Sbjct: 291 SIFDSGTTVTYWLPPAYRNILAAFDKNV-RYPRAASVQGLDLCVDVTGVDQPSFPSFTIV 349

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
             G  V  PQ        A ++ CLAMA  P +V    N I N+ QQN  + YD   +R+
Sbjct: 350 LGGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGG-FNTIGNLLQQNFLVQYDREENRI 408

Query: 426 GVARELCT 433
           G A   C+
Sbjct: 409 GFAPAKCS 416


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 117/371 (31%), Positives = 165/371 (44%), Gaps = 37/371 (9%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
           S  Y +   IGTP +   + +DT +D  W+ C  C  C   +   ++  +S++FKN+GC 
Sbjct: 189 SGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCH 248

Query: 151 AAQCKQVPNP------TCGGGACAFNLTYGSS----------TIAANLSQDTISLATDIV 194
             +C  V +P            C +   YG S          T   NL+          V
Sbjct: 249 DPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRV 308

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSL 253
               FGC     G      GLLGLGRG LS  +Q Q+LY  +FSYCL    +  + S  L
Sbjct: 309 ENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKL 368

Query: 254 RLGP----IGQPKRIKYTPLL---KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
             G     +  P+ + +T L+   +NP   + YYV + +I VG  V+ IP      +P  
Sbjct: 369 IFGEDKDLLNHPE-VNFTSLVAGKENPV-DTFYYVQIKSIMVGGEVLKIPEETWHLSPEG 426

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTI 362
             GTI+DSGT  +    P+Y  ++D F ++V     +      D CY+V  V     P  
Sbjct: 427 AGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCYNVSGVEKMELPEF 486

Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
            ++F  G     P +N  I      I CLA+   P    S L++I N QQQN  ILYD  
Sbjct: 487 RILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTP---RSALSIIGNYQQQNFHILYDTK 543

Query: 422 NSRLGVARELC 432
            SRLG A   C
Sbjct: 544 KSRLGYAPMKC 554


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 126/381 (33%), Positives = 176/381 (46%), Gaps = 45/381 (11%)

Query: 84  PIASGRQITQSPT--YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNS 138
           P++ G      PT  Y+V   IGTP Q + + +DT +D  W  C  CV C       F++
Sbjct: 20  PVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDT 79

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTC------GGGACAFNLTYGSSTIAAN-LSQDTIS-LA 190
           ++S+T   L C++ QCK  P  T           CA+  +YG +++    L+ D  + +A
Sbjct: 80  SRSSTNALLPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVA 139

Query: 191 TDIVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC-------L 241
              +PG TFGC    TG  NS    G+ G GRG LSL +Q   L    FS+C       +
Sbjct: 140 GTSLPGVTFGCGLNNTGVFNSN-ETGIAGFGRGPLSLPSQ---LKVGNFSHCFTTITGAI 195

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLL---KNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           PS   L     L     G    ++ TPL+   KN    +LYY++L  I VG   + +P  
Sbjct: 196 PSTVLLDLPADLFSNGQG---AVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPES 252

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV 358
           A      TG GTIIDSGT  T L    Y  VRD F  ++   +   +  G  TC+S P  
Sbjct: 253 AFALTNGTG-GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQ 311

Query: 359 A----PTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
           A    P + L F G  + LP++N +         SI CLA+     N      +I N QQ
Sbjct: 312 AKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAI-----NKGDETTIIGNFQQ 366

Query: 412 QNHRILYDVPNSRLGVARELC 432
           QN  +LYD+ N+ L      C
Sbjct: 367 QNMHVLYDLQNNMLSFVAAQC 387


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 21/359 (5%)

Query: 84  PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
           P+ SG   TQ S  Y  R  IG PA+ + M +DT +D  W+ CT C  C   +  +F  +
Sbjct: 139 PLISG--TTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPS 196

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT 198
            S++++ L C   QC  +    C    C + ++YG  S    + + +T+++ + +V    
Sbjct: 197 SSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVA 256

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
            GC     G  V   GLLGLG G L+L +Q   L  ++FSYCL    + S S ++  G  
Sbjct: 257 VGCGHSNEGLFVGAAGLLGLGGGLLALPSQ---LNTTSFSYCLVDRDSDSAS-TVEFGTS 312

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
             P  +   PLL+N +  + YY+ L  I VG  ++ IP  + + + +   G IIDSGT  
Sbjct: 313 LPPDAV-VAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAV 371

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF-SGMNVTL 373
           TRL    Y ++RD F +          +  FDTCY++     I  PT+   F  G  + L
Sbjct: 372 TRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLAL 431

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           P  N +I   +    CLA A       S L +I N+QQQ  R+ +D+ NS +G +   C
Sbjct: 432 PAKNYMIPVDSVGTFCLAFAPTA----SSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 165/359 (45%), Gaps = 20/359 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
           PI SG     S  Y  R  +G PA+   M +DT +D  W+   PCT C   +  +F+   
Sbjct: 143 PIISGTS-QGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRS 201

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLA-TDIVPGYT 198
           S++F +L C++ QC+ +    C    C + ++YG  +        +T++   + ++    
Sbjct: 202 SSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVIETLTFGNSGMINNVA 261

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
            GC     G  V   G  GL       L+ T  +  S+FSYCL   +  S S  L     
Sbjct: 262 VGCGHDNEGLFV---GSAGLLGLGGGSLSLTSQMKASSFSYCLVD-RDSSSSSDLEFNSA 317

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
             P      PLLK+ +  + YYV L  + VG +++ IPP   Q + +   G I+DSGT  
Sbjct: 318 A-PSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAI 376

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTL 373
           TRL   AY  +RD F  R            FDTCY +     +  PT++  F+ G ++ L
Sbjct: 377 TRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQL 436

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           P  N LI   +    C A A       S L++I N+QQQ  R+ YD+ NS +G +   C
Sbjct: 437 PPKNYLIPVDSVGTFCFAFAP----TTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 117/379 (30%), Positives = 185/379 (48%), Gaps = 23/379 (6%)

Query: 66  QARLQFLSSLAVARKSVV--PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
           +A L+ +S++    +  +  P+ SG   TQ S  Y  R  IG PA+ + M +DT +D  W
Sbjct: 116 KADLKPISTMYTTEEQDIEAPLISG--TTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNW 173

Query: 123 VPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STI 178
           + CT C  C   +  +F  + S++++ L C   QC  +    C    C + ++YG  S  
Sbjct: 174 LQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYT 233

Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
             + + +T+++ + +V     GC     G  V   GLLGLG G L+L +Q   L  ++FS
Sbjct: 234 VGDFATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQ---LNTTSFS 290

Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           YCL    + S S ++  G    P  +   PLL+N +  + YY+ L  I VG  ++ IP  
Sbjct: 291 YCLVDRDSDSAS-TVDFGTSLSPDAV-VAPLLRNHQLDTFYYLGLTGISVGGELLQIPQS 348

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-- 356
           + + + +   G IIDSGT  TRL    Y ++RD F +          +  FDTCY++   
Sbjct: 349 SFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAK 408

Query: 357 --IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
             +  PT+   F  G  + LP  N +I   +    CLA A       S L +I N+QQQ 
Sbjct: 409 TTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTA----SSLAIIGNVQQQG 464

Query: 414 HRILYDVPNSRLGVARELC 432
            R+ +D+ NS +G +   C
Sbjct: 465 TRVTFDLANSLIGFSSNKC 483


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 124/408 (30%), Positives = 192/408 (47%), Gaps = 39/408 (9%)

Query: 54  WEESVLEMLAKDQARLQFLSS---LAVA----------------RKSVVPIASGRQITQS 94
           ++  VL  L +D  R++ L++   LA+A                     P+ SG     S
Sbjct: 94  YKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALETPLVSGAS-QGS 152

Query: 95  PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQA 151
             Y  R  IG+P + + M +DT +D  WV C  C  C   +  +F  + S+++  L C+ 
Sbjct: 153 GEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCET 212

Query: 152 AQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPGYTFGCIQKATGNS 209
            QCK +    C   +C + ++YG  S    + + +TI+L  +  +     GC     G  
Sbjct: 213 HQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGCGHDNEGLF 272

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           V   GLLGLG GSLS  +Q   +  S+FSYCL +    S S      PI  P      PL
Sbjct: 273 VGAAGLLGLGGGSLSFPSQ---INASSFSYCLVNRDTDSASTLEFNSPI--PSHSVTAPL 327

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           L+N +  + YY+ +  I VG +++ IP  + + + +   G I+DSGT  TRL +  Y ++
Sbjct: 328 LRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSDVYNSL 387

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLPQDNLLIHSTA 384
           RD F R      + + +  FDTCY +     +  PT++  F  G  + LP  N LI   +
Sbjct: 388 RDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLALPAKNYLIPVDS 447

Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               C A A       S L++I N+QQQ  R+ YD+ NS +G +   C
Sbjct: 448 AGTFCFAFAP----TTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 114/352 (32%), Positives = 171/352 (48%), Gaps = 27/352 (7%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGCSST---VFNSAQSTTFKNLGC 149
           S  Y++    GTP +T  +  DT +D  W+ C  C V C +    +F+ + S+T++N+ C
Sbjct: 13  SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSC 72

Query: 150 QAAQCKQVPNPTCGGGACAFNLTYG--SSTIAANLSQDTISLA-TDIVPGYTFGCIQKAT 206
               C  +    C    C + + YG  SSTI   L+ DT  L        + FGC Q  T
Sbjct: 73  TEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGF-LAMDTFMLTPAQKFKNFIFGCGQNNT 131

Query: 207 GNSVPPQGLLGLGRGS-LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI- 264
           G      GL+GLGR S  SL +Q      + FSYCLPS    S +G L    IG P+   
Sbjct: 132 GLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTS--SATGYLN---IGNPQNTP 186

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
            YT +L + R  +LY+++L+ I VG   + +     Q       GTIIDSGTV TRL   
Sbjct: 187 GYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIIDSGTVITRLPPT 241

Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFSGMNVTLPQDNLLI 380
           AY+A++   R  +       ++   DTCY    +  +V P I L F+G++V +P   +  
Sbjct: 242 AYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPATGVFF 301

Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              +  + CLA A   D  ++++ +I N+QQ    + YD    R+G +   C
Sbjct: 302 VFNSSQV-CLAFAGNTD--STMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 130/429 (30%), Positives = 201/429 (46%), Gaps = 50/429 (11%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS---VVPIAS 87
           S  L +   + PCS    S+P S +E    +  +D++R+ F++S      S        +
Sbjct: 62  SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKCNQYTSGNLKNHAHN 117

Query: 88  GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTF 144
                +   ++V    GTP Q   + +DT +   W  C  CV C   S   F+S  S+T+
Sbjct: 118 NNLFDEDGNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTY 177

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCI 202
               C       +P+ T G     +N+TYG  ST   N   DT++L  +D+   + FGC 
Sbjct: 178 SFGSC-------IPS-TVGN---TYNMTYGDKSTSVGNYGCDTMTLEPSDVFQKFQFGCG 226

Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IG 259
           +   G+      G+LGLG+G LS ++QT + ++  FSYCLP   ++   GSL  G     
Sbjct: 227 RNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSI---GSLLFGEKATS 283

Query: 260 QPKRIKYTPLLKNP-----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           Q   +K+T L+  P       S  Y+V LL I VG + ++IP            GTIIDS
Sbjct: 284 QSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDS 338

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVG----SNLTVTSLGGFDTCYSV----PIVAPTITLMF 366
           GTV TRL   AY+A++  F++ +     SN         DTCY++     ++ P   L F
Sbjct: 339 GTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHF 398

Query: 367 -SGMNVTLPQDNLLIHSTAGSITCLAMAA-APDNVNSVLNVIANMQQQNHRILYDVPNSR 424
             G +V L    ++  + A  + CLA A  +   +N  L +I N QQ +  +LYD+   R
Sbjct: 399 GDGADVRLNGKRVVWGNDASRL-CLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRR 457

Query: 425 LGVARELCT 433
           +G     C+
Sbjct: 458 IGFGGNGCS 466


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 133/382 (34%), Positives = 193/382 (50%), Gaps = 39/382 (10%)

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
           V P+ S R  T S  Y+ +  +GTPA   L+AMDT +D  W+ C  C  C   S  VF+ 
Sbjct: 120 VAPVVS-RAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDP 178

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGG-----ACAFNLTYGS--STIAANLSQDTISLAT 191
             ST+++ +G  A  C+ +     GGG      C + + YG   ST   +  ++T++ A 
Sbjct: 179 RHSTSYREMGYDAPDCQALGR--SGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAG 236

Query: 192 DI-VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNL-YQST-FSYCLPSF--- 244
            + VP  + GC     G  + P  G+LGLGRG +S  +Q   L Y  T FSYCL  F   
Sbjct: 237 GVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLS 296

Query: 245 -KALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP-PGA 299
               S S +L +G     G P    +TP ++N   ++ YYV L+ + VG   V       
Sbjct: 297 SPGRSVSSTLTIGDGAAAGSPPP-SFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDD 355

Query: 300 LQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG----FDTCYS 354
           L+ +P TG  G I+DSGT  TRL   AY A RD FR     +L   S+GG    FDTCY+
Sbjct: 356 LKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAA-VDLGQVSIGGPSGFFDTCYT 414

Query: 355 V---PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
           +    +  PT+++ F+ G+ +TLP  N LI   +    C A A   D     +++I N+Q
Sbjct: 415 MGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDR---SVSIIGNIQ 471

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQ  R++Y++   R+G A   C
Sbjct: 472 QQGFRVVYNIGGGRVGFAPNSC 493


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 126/438 (28%), Positives = 191/438 (43%), Gaps = 56/438 (12%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK-------SVVP 84
           +++ + H   PC+P   S     + S  E L  D+AR   +   A  R+       + +P
Sbjct: 54  ASVPLAHRHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRMMSEGGGASIP 113

Query: 85  IASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSA 139
              G     S  Y+V   IGTPA    + +DT +D +WV C     + C      +F+ +
Sbjct: 114 TYLG-GFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPS 172

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA----------CAFNLTYGSSTIAANL-SQDTIS 188
           +S+TF  + C +  CKQ+P      G           C + + YG+  I   + S +T++
Sbjct: 173 KSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLA 232

Query: 189 LATD-IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
           L +  +V  + FGC     G      GLLGLG    SL++QT ++Y   FSYCLP     
Sbjct: 233 LGSSAVVKSFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLN-- 290

Query: 248 SFSGSLRLGPIGQPKR----IKYTPLLK-NPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
           S +G L LG             +TP+   +P+ ++ Y V L  I VG + +DIPP     
Sbjct: 291 SGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFA- 349

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSV----PI 357
                 G I+DSGTV T +   AY A+R  FR  +    L   +    DTCY+      +
Sbjct: 350 -----KGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTV 404

Query: 358 VAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNH 414
             P + L F G   +++ +P   L+         CLA A A D       +I N+  +  
Sbjct: 405 TVPKVALTFVGGATVDLDVPSGVLVED-------CLAFADAGDG---SFGIIGNVNTRTI 454

Query: 415 RILYDVPNSRLGVARELC 432
            +LYD     LG     C
Sbjct: 455 EVLYDSGKGHLGFRAGAC 472


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 99/346 (28%), Positives = 158/346 (45%), Gaps = 25/346 (7%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS---TVFNSAQSTTFKNLG 148
           T +  ++V+  +G P Q   M  D   D  W+ C  C+ C     ++F+ +QS+++  L 
Sbjct: 182 TGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYTLLS 241

Query: 149 CQAAQCKQVPNPTCGG-GACAFNLTYGSSTIAAN-LSQDTISL-ATDIVPGYTFGCIQKA 205
           C+   C  +PN +C   G C +N+TY   T     L  +T+S  ++  V   + GC  K 
Sbjct: 242 CETKHCNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSGWVDRVSLGCSNKN 301

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            G  V   G  GLGRGSLS  ++      S+ SYCL   K    S +L          +K
Sbjct: 302 QGPFVGSDGTFGLGRGSLSFPSRIN---ASSMSYCLVESKDGYSSSTLEFNSPPCSGSVK 358

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
              LL+NP+  +LYYV L  I+VG   +D+P      +P    G I+ S ++ T L    
Sbjct: 359 -AKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLITMLENDT 417

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYS--------VPIVAPTITLMFSGMNVTLPQDN 377
           Y  VRD F  +      + +   FDTCY+        +PI+   +     G +  LP+++
Sbjct: 418 YNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEFEVN---DGKSWLLPKES 474

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            L         C A A +  +     +++  +QQ   R+ +D+ NS
Sbjct: 475 YLYAVDKNGTFCFAFAPSKGS----FSILGTLQQYGTRVTFDLVNS 516


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 126/437 (28%), Positives = 192/437 (43%), Gaps = 54/437 (12%)

Query: 26  DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL----SSLAVAR-- 79
           D+    +T+ + H   PCSP    K    + +  E+L +DQ R  ++    S     R  
Sbjct: 52  DSSSSGATVPLNHRHGPCSPVPSGK--KKQPTFTELLRRDQLRANYIQRQFSDEHYPRTG 109

Query: 80  -----KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST 134
                ++ VPIA G  +  +  Y++   IG+PA    M +DT +D +W+ C       S 
Sbjct: 110 GLQQSEATVPIALG-SLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRC------KSR 162

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLA 190
           +++   S+T+    C A  C Q+     G   G  C +++ YG  S        DT++LA
Sbjct: 163 LYDPGTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLA 222

Query: 191 ---TDIVPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
                ++ G+ FGC     G       GL+GLG  + S ++QT   Y S FSYCLP    
Sbjct: 223 GTSEPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLP--PT 280

Query: 247 LSFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
            + SG L LG            TP+L++ + ++ Y + L  I VG + ++IP        
Sbjct: 281 WNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFS--- 337

Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVP------ 356
              AG+I+DSGTV TRL   AY A+   FR  +           G  DTC+         
Sbjct: 338 ---AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEGN 394

Query: 357 -IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
               P++ L+  G  V     N ++        CLA AA  D+  +   +I N+QQ+   
Sbjct: 395 NFTVPSVALVLDGGAVVDLHPNGIVQD-----GCLAFAATDDDGRT--GIIGNVQQRTFE 447

Query: 416 ILYDVPNSRLGVARELC 432
           +LYDV  S  G     C
Sbjct: 448 VLYDVGQSVFGFRPGAC 464


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 117/362 (32%), Positives = 181/362 (50%), Gaps = 19/362 (5%)

Query: 85  IASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQS 141
           I+SG  +  S  Y  R  IG P ++  + +DT +D  W+ C  C  C S V   ++ + S
Sbjct: 1   ISSGLSLG-SGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNS 59

Query: 142 TTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISL---ATDIVPGY 197
           ++++ + C +A C+ +    C G  C++ + YG SS  + +L  ++  L   ++  +   
Sbjct: 60  SSYRRVYCGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNI 119

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKAL-SFSGSLRL 255
            FGC    +G      GLLG+G G+LS  +Q        FSYCL   +  L S S  L  
Sbjct: 120 AFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIF 179

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           G    P   ++TPLLKNPR ++ YY  L  I VG   + IPP           G I+DSG
Sbjct: 180 GRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSG 239

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS---VPIVA-PTITLMF-SGMN 370
           T  TR+V PAY  +RD +R    +      +   DTC++   +P V  P++ L F +G++
Sbjct: 240 TSVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVD 299

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           + LP  N+LI        CLA A +    +  ++VI N+QQQ  RI +D+  S + +A  
Sbjct: 300 MVLPGGNILIPVDRSGTFCLAFAPS----SMPISVIGNVQQQTFRIGFDLQRSLIAIAPR 355

Query: 431 LC 432
            C
Sbjct: 356 EC 357


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 132/425 (31%), Positives = 198/425 (46%), Gaps = 74/425 (17%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-----LAVARKSVVPI 85
           S  L +   + PCS    S+P S +E    +  +D++R+ F++S          K   P 
Sbjct: 97  SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKFNQYAPENLKDHTP- 151

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQST 142
            + +   +   ++V    GTP Q   + +DT +   W  C  CV C   S   F+ + S 
Sbjct: 152 -NNKLFDEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASL 210

Query: 143 TFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFG 200
           T+    C       +P+ T G     +N+TYG  ST   N   DT++L  +D+ P + FG
Sbjct: 211 TYSLGSC-------IPS-TVGN---TYNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQFG 259

Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-- 257
           C +   G+      G+LGLG+G LS ++QT + ++  FSYCLP   ++   GSL  G   
Sbjct: 260 CGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSI---GSLLFGEKA 316

Query: 258 IGQPKRIKYTPLLKNP-----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
             Q   +K+T L+  P       S  Y+V LL I VG + ++IP            GTII
Sbjct: 317 TSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTII 371

Query: 313 DSGTVFTRLVAPAYTAVRDVF------------RRRVGSNLTVTSLGGFDTCYSV----P 356
           DSGTV TRL   AY+A++  F            RR+ G  L        DTCY++     
Sbjct: 372 DSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL--------DTCYNLSGRKD 423

Query: 357 IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
           ++ P I L F  G +V L    ++  + A  + CLA A      NS L +I N QQ +  
Sbjct: 424 VLLPEIVLHFGEGADVRLNGKRVIWGNDASRL-CLAFAG-----NSELTIIGNRQQVSLT 477

Query: 416 ILYDV 420
           +LYD+
Sbjct: 478 VLYDI 482


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 172/373 (46%), Gaps = 30/373 (8%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG  +  S  Y V   +GTP Q   + +DT +D A+V C  C  C      ++  + 
Sbjct: 22  PLVSGTTLG-SGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSN 80

Query: 141 STTFKNLGCQAAQCKQVPNPT---CGG--------GACAFNLTYG--SSTIAANLSQDTI 187
           S+TF  + C +A+C  +P P    C          GAC++   YG  SST+    + +T 
Sbjct: 81  SSTFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGV-FAYETA 139

Query: 188 SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-KA 246
           ++    V    FGC  +  G+ V   G+LGLG+G+LS  +Q    +++ F+YCL S+   
Sbjct: 140 TVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSP 199

Query: 247 LSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
            S   SL  G   +     +++TPL+ NP   S+YYV ++ I  G   + IP  A + + 
Sbjct: 200 TSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDS 259

Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-----PIVA 359
               GTI DSGT  T     AY  +   F + V       S  G   C +V     PI  
Sbjct: 260 VGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGIDHPIY- 318

Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           P+ T+ F       P         + +I CLAM    ++ +   NVI N+ QQN+ + YD
Sbjct: 319 PSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAML---ESSSDGFNVIGNIIQQNYLVQYD 375

Query: 420 VPNSRLGVARELC 432
               R+G A   C
Sbjct: 376 REEHRIGFAHANC 388


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 123/382 (32%), Positives = 176/382 (46%), Gaps = 46/382 (12%)

Query: 84  PIASGRQITQSP--TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNS 138
           PI + R +  +    Y++   IGTP       +DT +D  W  C  CV C+   +  F  
Sbjct: 77  PITAARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRP 136

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAANLSQDTISLATD---- 192
           A+S T++ + C++  C  +P P C     C +   YG  ++ A  L+ +T +        
Sbjct: 137 ARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSK 196

Query: 193 -IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----- 246
            +V    FGC    +G      G++GLGRG LSL++Q   L  S FSYCL SF +     
Sbjct: 197 VMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQ---LGPSRFSYCLTSFLSPEPSR 253

Query: 247 LSFSGSLRLGPI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
           L+F     L        G P  ++ TPL+ N    SLY+++L  I +G++ + I P    
Sbjct: 254 LNFGVFATLNGTNASSSGSP--VQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRD----VFRRRVGSNLTVTSLGGFDTCY---- 353
            N     G  IDSGT  T L   AY AVR     V R    +N T     G +TC+    
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEI---GLETCFPWPP 368

Query: 354 --SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
             SV +  P + L F  G N+T+P +N ++   A    CLAM  + D       +I N Q
Sbjct: 369 PPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-----TIIGNYQ 423

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQN  ILYD+ NS L      C
Sbjct: 424 QQNMHILYDIANSLLSFVPAPC 445


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 123/382 (32%), Positives = 176/382 (46%), Gaps = 46/382 (12%)

Query: 84  PIASGRQITQSP--TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNS 138
           PI + R +  +    Y++   IGTP       +DT +D  W  C  CV C+   +  F  
Sbjct: 77  PITAARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRP 136

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAANLSQDTISLATD---- 192
           A+S T++ + C++  C  +P P C     C +   YG  ++ A  L+ +T +        
Sbjct: 137 ARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSK 196

Query: 193 -IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----- 246
            +V    FGC    +G      G++GLGRG LSL++Q   L  S FSYCL SF +     
Sbjct: 197 VMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQ---LGPSRFSYCLTSFLSPEPSR 253

Query: 247 LSFSGSLRLGPI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
           L+F     L        G P  ++ TPL+ N    SLY+++L  I +G++ + I P    
Sbjct: 254 LNFGVFATLNGTNASSSGSP--VQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFDTCY---- 353
            N     G  IDSGT  T L   AY AVR     V R    +N T     G +TC+    
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEI---GLETCFPWPP 368

Query: 354 --SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
             SV +  P + L F  G N+T+P +N ++   A    CLAM  + D       +I N Q
Sbjct: 369 PPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-----TIIGNYQ 423

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQN  ILYD+ NS L      C
Sbjct: 424 QQNMHILYDIANSLLSFVPAPC 445


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 130/439 (29%), Positives = 209/439 (47%), Gaps = 56/439 (12%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------SLAVARKSV- 82
            S+TL++ H    CS     K +   + +   L  D  R+Q L       + +   +SV 
Sbjct: 16  ESTTLEMKHR-ELCS----GKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVS 70

Query: 83  ---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VF 136
              +P+ SG ++ +S  YIV  ++G    +L++  DT +D  WV C  C  C +    ++
Sbjct: 71  ETQIPLTSGIKL-ESLNYIVTVELGGKNMSLIV--DTGSDLTWVQCQPCRSCYNQQGPLY 127

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-----CGGGA------CAFNLTYGS-STIAANLSQ 184
           + + S+++K + C ++ C+ +   T     CGG        C + ++YG  S    +L+ 
Sbjct: 128 DPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLAS 187

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           ++I L    +  + FGC +   G      GL+GLGR S+SL++QT   +   FSYCLPS 
Sbjct: 188 ESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL 247

Query: 245 KALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
           +    SGSL  G           + YTPL++NP+  S Y +NL    +G        G  
Sbjct: 248 ED-GASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIG--------GVE 298

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----P 356
             + + G G +IDSGTV TRL    Y AV+  F ++     T       DTC+++     
Sbjct: 299 LKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYED 358

Query: 357 IVAPTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           I  P I ++F G N  L  D   +        S+ CLA+A+   +  + + +I N QQ+N
Sbjct: 359 ISIPIIKMIFQG-NAELEVDVTGVFYFVKPDASLVCLALASL--SYENEVGIIGNYQQKN 415

Query: 414 HRILYDVPNSRLGVARELC 432
            R++YD    RLG+  E C
Sbjct: 416 QRVIYDTTQERLGIVGENC 434


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 130/433 (30%), Positives = 191/433 (44%), Gaps = 56/433 (12%)

Query: 19  EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF------- 71
           E   P+   +  + TL V  +   C P   S       S  E+   D+ R+++       
Sbjct: 57  EPAGPVIAPRQRNGTLAVLRLAHRCGPSTASA------SFAEVQRADEQRVEYIQRRVSG 110

Query: 72  ---------LSSLAV-ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
                    L  LA  +R + VP   G    Q   Y+V   +GTP  +  + +DT +D +
Sbjct: 111 GGARGAKGALQQLATGSRSATVPTTMGVGTFQ---YVVTVSLGTPGVSQTVEVDTGSDVS 167

Query: 122 WVPCTGCVG--CSST---VFNSAQSTTFKNLGCQAAQCKQ--VPNPTCGGGACAFNLTYG 174
           WV C  C    C+S    +F+ A+S+T+  + C A  C +  +    C G  C + ++YG
Sbjct: 168 WVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYG 227

Query: 175 -SSTIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
             S        DT++LA  + V  + FGC     G      GLL LGR S+SL +Q    
Sbjct: 228 DGSNTTGVYGSDTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGA 287

Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
           Y   FSYCLPS +  S +G L LG          T LL      + Y V L  I VG + 
Sbjct: 288 YGGVFSYCLPSKQ--SAAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQ 345

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFD 350
           V +P  A         GT++D+GTV TRL   AY A+R  FR  +      +  + G  D
Sbjct: 346 VAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILD 399

Query: 351 TCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           TCY       +  PT+ L FSG   TL  +   I S+     CLA   AP+  +    ++
Sbjct: 400 TCYDFSRYGVVTLPTVALTFSG-GATLALEAPGILSSG----CLAF--APNGGDGDAAIL 452

Query: 407 ANMQQQNHRILYD 419
            N+QQ++  + +D
Sbjct: 453 GNVQQRSFAVRFD 465


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 120/379 (31%), Positives = 173/379 (45%), Gaps = 42/379 (11%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
           VP+ +G     +  +++   +GTPA      +DT +D  W  C  CV C   ++ VF+ A
Sbjct: 107 VPVHAG-----NGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPA 161

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGAC--------AFNLTYG-SSTIAANLSQDTISLA 190
            S+T+  L C +A C  +P  TC   +          +  TYG +S+    L+ +T +LA
Sbjct: 162 ASSTYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLA 221

Query: 191 TDIVPGYTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
              VPG  FGC     G+      GL+GLGRG LSL++Q   L    FSYCL S    + 
Sbjct: 222 RQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGIDRFSYCLTSLDDAAG 278

Query: 250 SGSLRLGPIGQPKRI------KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
              L LG              + TPL+KNP + S YYV+L  + VG   + +P  A    
Sbjct: 279 RSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQ 338

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP------- 356
                G I+DSGT  T L   AY A+R  F   +       S  G D C+  P       
Sbjct: 339 DDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVDQD 398

Query: 357 --IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
             +  P + L F  G ++ LP +N ++  +A    CL + A     +  L++I N QQQN
Sbjct: 399 VQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMA-----SRGLSIIGNFQQQN 453

Query: 414 HRILYDVPNSRLGVARELC 432
            + +YDV    L  A   C
Sbjct: 454 FQFVYDVAGDTLSFAPAEC 472


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 121/395 (30%), Positives = 192/395 (48%), Gaps = 29/395 (7%)

Query: 62  LAKDQARLQFL-----SSLAVARKS-----VVPIASGRQITQSPTYIVRAKIGTPAQTLL 111
           + +D+ARL+++     SS    R+         ++SG  +  S  Y  R  IG+P ++  
Sbjct: 1   MERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLG-SGEYFARMGIGSPQRSYY 59

Query: 112 MAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA 168
           + +DT +D  W+ C  C  C S V   ++ + S++++ + C +A C+ +    C G  C+
Sbjct: 60  LELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMGCS 119

Query: 169 FNLTYG-SSTIAANLSQDTISL---ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
           + + YG SS  + +L  ++  L   ++  +    FGC    +G      GLLG+G G+LS
Sbjct: 120 YRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLS 179

Query: 225 LLAQTQNLYQSTFSYCL-PSFKAL-SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVN 282
             +Q        FSYCL   +  L S S  L  G    P   ++TPLLKNPR  + YY  
Sbjct: 180 FFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDTFYYAI 239

Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           L  I VG   + IPP           G I+DSGT  TR+V  AY  +RD +R    +   
Sbjct: 240 LTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPP 299

Query: 343 VTSLGGFDTCYS---VPIVA-PTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
              +   DTC++   +P V  P++ L F   +++ LP  N+LI        CLA A +  
Sbjct: 300 APGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPS-- 357

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             +  ++VI N+QQQ  RI +D+  S + +A   C
Sbjct: 358 --SMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 130/439 (29%), Positives = 208/439 (47%), Gaps = 56/439 (12%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------SLAVARKSV- 82
            S+TL++ H    CS     K +   + +   L  D  R+Q L       + +   +SV 
Sbjct: 64  ESTTLEMKHR-ELCS----GKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVS 118

Query: 83  ---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VF 136
              +P+ SG ++ +S  YIV  ++G    +L++  DT +D  WV C  C  C +    ++
Sbjct: 119 ETQIPLTSGIKL-ESLNYIVTVELGGKNMSLIV--DTGSDLTWVQCQPCRSCYNQQGPLY 175

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-----CGGGA------CAFNLTYGS-STIAANLSQ 184
           + + S+++K + C ++ C+ +   T     CGG        C + ++YG  S    +L+ 
Sbjct: 176 DPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLAS 235

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           ++I L    +  + FGC +   G      GL+GLGR S+SL++QT   +   FSYCLPS 
Sbjct: 236 ESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL 295

Query: 245 KALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
           +    SGSL  G           + YTPL++NP+  S Y +NL    +G        G  
Sbjct: 296 ED-GASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIG--------GVE 346

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----P 356
             + + G G +IDSGTV TRL    Y AV+  F ++     T       DTC+++     
Sbjct: 347 LKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYED 406

Query: 357 IVAPTITLMFSGMNVTLPQD---NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           I  P I ++F G N  L  D            S+ CLA+A+   +  + + +I N QQ+N
Sbjct: 407 ISIPIIKMIFQG-NAELEVDVTGVFYFVKPDASLVCLALASL--SYENEVGIIGNYQQKN 463

Query: 414 HRILYDVPNSRLGVARELC 432
            R++YD    RLG+  E C
Sbjct: 464 QRVIYDTTQERLGIVGENC 482


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 114/356 (32%), Positives = 158/356 (44%), Gaps = 28/356 (7%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   +GTP  T  +  DT +D  W    PCT C    +  F  A S+TF  L C ++ 
Sbjct: 86  YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145

Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKATGNSV 210
           C+ +PN   TC    C +N  YGS   A  L+ +T+ +     P   FGC  +   GNS 
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTAGYLATETLKVGDASFPSVAFGCSTENGVGNST 205

Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTP 268
              G+ GLGRG+LSL+ Q   L    FSYCL S  A   S  +  G +       ++ TP
Sbjct: 206 --SGIAGLGRGALSLIPQ---LGVGRFSYCLRSGSAAGAS-PILFGSLANLTDGNVQSTP 259

Query: 269 LLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAP 324
            + NP    S YYVNL  I VG    D+P     F  T    G GTI+DSGT  T L   
Sbjct: 260 FVNNPAVHPSYYYVNLTGITVGE--TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 317

Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYS------VPIVAPTITLMFS-GMNVTLPQDN 377
            Y  V+  F  +     TV    G D C+         I  P++ L F  G    +P   
Sbjct: 318 GYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYF 377

Query: 378 LLIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             + + + GS+T   +   P   +  ++VI N+ Q +  +LYD+       A   C
Sbjct: 378 AGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 433


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 130/433 (30%), Positives = 191/433 (44%), Gaps = 56/433 (12%)

Query: 19  EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF------- 71
           E   P+   +  + TL V  +   C P   S       S  E+   D+ R+++       
Sbjct: 57  EPAGPVIAPRQRNGTLAVLRLAHRCGPSTASA------SFAEVQRADEQRVEYIQRRVSG 110

Query: 72  ---------LSSLAV-ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
                    L  LA  +R + VP   G    Q   Y+V   +GTP  +  + +DT +D +
Sbjct: 111 GGARGAKGALQQLATGSRSATVPTTMGVGTFQ---YVVTVSLGTPGVSQTVEVDTGSDVS 167

Query: 122 WVPCTGCVG--CSST---VFNSAQSTTFKNLGCQAAQCKQ--VPNPTCGGGACAFNLTYG 174
           WV C  C    C+S    +F+ A+S+T+  + C A  C +  +    C G  C + ++YG
Sbjct: 168 WVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYG 227

Query: 175 -SSTIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
             S        DT++LA  + V  + FGC     G      GLL LGR S+SL +Q    
Sbjct: 228 DGSNTTGVYGSDTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGA 287

Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
           Y   FSYCLPS +  S +G L LG          T LL      + Y V L  I VG + 
Sbjct: 288 YGGVFSYCLPSKQ--SAAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQ 345

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFD 350
           V +P  A         GT++D+GTV TRL   AY A+R  FR  +      +  + G  D
Sbjct: 346 VAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILD 399

Query: 351 TCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           TCY       +  PT+ L FSG   TL  +   I S+     CLA   AP+  +    ++
Sbjct: 400 TCYDFSRYGVVTLPTVALTFSG-GATLALEAPGILSSG----CLAF--APNGGDGDAAIL 452

Query: 407 ANMQQQNHRILYD 419
            N+QQ++  + +D
Sbjct: 453 GNVQQRSFAVRFD 465


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 130/439 (29%), Positives = 208/439 (47%), Gaps = 56/439 (12%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------SLAVARKSV- 82
            S+TL++ H    CS     K +   + +   L  D  R+Q L       + +   +SV 
Sbjct: 64  ESTTLEMKHR-ELCS----GKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVS 118

Query: 83  ---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VF 136
              +P+ SG ++ +S  YIV  ++G    +L++  DT +D  WV C  C  C +    ++
Sbjct: 119 ETQIPLTSGIKL-ESLNYIVTVELGGKNMSLIV--DTGSDLTWVQCQPCRSCYNQQGPLY 175

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-----CGGGA------CAFNLTYGS-STIAANLSQ 184
           + + S+++K + C ++ C+ +   T     CGG        C + ++YG  S    +L+ 
Sbjct: 176 DPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLAS 235

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           ++I L    +  + FGC +   G      GL+GLGR S+SL++QT   +   FSYCLPS 
Sbjct: 236 ESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL 295

Query: 245 KALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
           +    SGSL  G           + YTPL++NP+  S Y +NL    +G        G  
Sbjct: 296 ED-GASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIG--------GVE 346

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----P 356
             + + G G +IDSGTV TRL    Y AV+  F ++     T       DTC+++     
Sbjct: 347 LKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYED 406

Query: 357 IVAPTITLMFSGMNVTLPQD---NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           I  P I ++F G N  L  D            S+ CLA+A+   +  + + +I N QQ+N
Sbjct: 407 ISIPIIKMIFQG-NAELEVDVTGVFYFVKPDASLVCLALASL--SYENEVGIIGNYQQKN 463

Query: 414 HRILYDVPNSRLGVARELC 432
            R++YD    RLG+  E C
Sbjct: 464 QRVIYDSTQERLGIVGENC 482


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 121/395 (30%), Positives = 173/395 (43%), Gaps = 43/395 (10%)

Query: 70  QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
           ++ S L    +S V + SG        Y +   IGTP +   + +DT +D  W+ C  C+
Sbjct: 172 EYSSQLVATLESGVSLGSGE-------YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCI 224

Query: 130 GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT----CGGG--ACAFNLTYGSS---- 176
            C   S   ++  +S++F+N+ C   +CK V +P     C      C +   YG S    
Sbjct: 225 ACFEQSGPYYDPKESSSFENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTT 284

Query: 177 ------TIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
                 T   NL+          V    FGC     G      GLLGLGRG LS  +Q Q
Sbjct: 285 GDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQ 344

Query: 231 NLYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLLKNPRRS--SLYYVNL 283
           ++Y  +FSYCL    +  S S  L  G     +  P  + +T  +     S  + YYV +
Sbjct: 345 SIYGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHP-NLNFTSFVGGEENSVDTFYYVGI 403

Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
            +I V   V+ IP      +   G GTIIDSGT  T    PAY  +++ F +++     V
Sbjct: 404 KSIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELV 463

Query: 344 TSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
                   CY+V  +     P   ++FS G     P +N  I      + CLA+   P  
Sbjct: 464 EGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQ-IEPDLVCLAILGTP-- 520

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             S L++I N QQQN  ILYD+  SRLG A   CT
Sbjct: 521 -KSALSIIGNYQQQNFHILYDMKKSRLGYAPMKCT 554


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  144 bits (363), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 134/457 (29%), Positives = 218/457 (47%), Gaps = 71/457 (15%)

Query: 27  TQDHSSTLQVFH----VFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV 82
           T+  S+ L++ H     FSP  P +PSK  S  E    +L+ D AR+  L     + +S 
Sbjct: 35  TESGSTILELRHHISSSFSP-GPNRPSK-TSRGEVDGGVLSSDAARVSSLQRRIESYRSS 92

Query: 83  --------------VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC 128
                         VPI SG  + ++  Y+  A +G  A    + +DT+++  WV C  C
Sbjct: 93  SEGEEEEASKLALQVPITSGANL-RTLNYV--ATVGLGAAEATVVVDTASELTWVQCQPC 149

Query: 129 VGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG-----------ACAFNLTYG 174
             C      +F+ + S ++  + C ++ C  +      G            AC++ L+Y 
Sbjct: 150 ESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYR 209

Query: 175 SSTIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQT 229
             + +   L++D + LA   + G+ FGC    T N   P     GL+GLGR  +SL++QT
Sbjct: 210 DGSYSRGVLARDKLRLAGQDIEGFVFGC---GTSNQGAPFGGTSGLMGLGRSHVSLVSQT 266

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKN--PRRSSLYYVNL 283
            + +   FSYCLP  ++ S SGSL LG      R    I YT ++ +  P +   Y++NL
Sbjct: 267 MDQFGGVFSYCLPMRESGS-SGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNL 325

Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAG-TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
             I VG + V+        +P   AG  IIDSGT+ T LV   Y AVR  F  ++     
Sbjct: 326 TGITVGGQEVE--------SPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQ 377

Query: 343 VTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDN---LLIHSTAGSITCLAMAAA 395
             +    DTC+++     +  P++  +F G +V +  D+   L   S+  S  CLA+A+ 
Sbjct: 378 APAFSILDTCFNLTGLKEVQVPSLKFVFEG-SVEVEVDSKGVLYFVSSDASQVCLALASL 436

Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               ++  ++I N QQ+N R+++D   S++G A+E C
Sbjct: 437 KSEYDT--SIIGNYQQKNLRVIFDTLGSQIGFAQETC 471


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  144 bits (363), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 113/376 (30%), Positives = 186/376 (49%), Gaps = 34/376 (9%)

Query: 76  AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTG-CVGC 131
           A A  + +P  +G  + ++P ++V    G+PAQT     DT +D +W+   PC+G C   
Sbjct: 92  AEAPSATIPDHTGTNL-KTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQ 150

Query: 132 SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL- 189
              VF+ A+S+++  + C   +C       C G  C + + YG  S+    L+++T++  
Sbjct: 151 HDPVFDPAKSSSYAVVPCGTTECAAA-GGECNGTTCVYGVEYGDGSSTTGVLARETLTFS 209

Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
           ++    G+ FGC +   G+     GLLGLGRGSLSL +Q    +   FSYCLPS+     
Sbjct: 210 SSSEFTGFIFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTPG 269

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
             S+   P+     ++YT ++  P   S Y++ L++I +G  V+ +PP        T  G
Sbjct: 270 YLSIGATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEF-----TKTG 324

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLM 365
           T++DSGT+ T L  PAYTA+RD F+  +  +         DTCY       I+ P ++  
Sbjct: 325 TLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFN 384

Query: 366 FSGMNV---------TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           FS   V         T P D      T  ++ CLA  + P ++    +V+ +  Q++  +
Sbjct: 385 FSDGAVFNLNFFGIMTFPDD------TKPAVGCLAFVSRPADMP--FSVVGSTTQRSAEV 436

Query: 417 LYDVPNSRLGVARELC 432
           +YDVP  ++G     C
Sbjct: 437 IYDVPAQKIGFIPASC 452


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 122/394 (30%), Positives = 174/394 (44%), Gaps = 44/394 (11%)

Query: 71  FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
           F   L    +S V + SG        Y +   +GTP +   + +DT +D  W+ C  C  
Sbjct: 162 FSGQLIATLESGVSLGSGE-------YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYE 214

Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGSS----- 176
           C   +   ++  QS++++N+GC  ++C  V +P            C +   YG S     
Sbjct: 215 CFEQNGPHYDPGQSSSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTG 274

Query: 177 -----TIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
                T   NL+  +       V    FGC     G      GLLGLGRG LS  +Q Q+
Sbjct: 275 DFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQS 334

Query: 232 LYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL---KNPRRSSLYYVNL 283
           LY  +FSYCL    +  + S  L  G     +  P+ + +T L+   +NP   + YYV +
Sbjct: 335 LYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPE-LNFTTLVAGKENPV-DTFYYVQI 392

Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
            +I VG  VV+IP    Q       GTIIDSGT  +    PAY  +++ F  +V     V
Sbjct: 393 KSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVV 452

Query: 344 TSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
                 + CY+V  V     P   ++FS G     P +N  I      + CLA+   P  
Sbjct: 453 KDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPP- 511

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             S L++I N QQQN  ILYD   SRLG A   C
Sbjct: 512 --SALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 543


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 123/416 (29%), Positives = 180/416 (43%), Gaps = 47/416 (11%)

Query: 57  SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ--------SPTYIVRAKIGTPAQ 108
           S L+ L K+Q +  F    A A  S  P+ SG+ +          S  Y +   +GTP +
Sbjct: 148 SRLQRLQKEQPKQSFKPVFAPAASSTSPV-SGQLVATLESGVSLGSGEYFMDVFVGTPPK 206

Query: 109 TLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQV-----PNP 160
              + +DT +D  W+ C  C+ C   S   ++   S++F+N+ C   +C+ V     PNP
Sbjct: 207 HFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSSPDPPNP 266

Query: 161 -TCGGGACAFNLTYGS----------STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
                 +C +   YG            T   NL+          V    FGC     G  
Sbjct: 267 CKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLF 326

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
               GLLGLG+G LS  +Q Q+LY  +FSYCL    + +   S  +   G+ K +   P 
Sbjct: 327 HGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLI--FGEDKELLSHPN 384

Query: 270 L--------KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
           L        K+    + YYV + ++ V   V+ IP      +     GTIIDSGT  T  
Sbjct: 385 LNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYF 444

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQD 376
             PAY  +++ F R++     V  L     CY+V  +     P   ++F+ G     P +
Sbjct: 445 AEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGILFADGAVWNFPVE 504

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           N  I      + CLA+   P    S L++I N QQQN  ILYD+  SRLG A   C
Sbjct: 505 NYFIQIDP-DVVCLAILGNP---RSALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 556


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 115/360 (31%), Positives = 183/360 (50%), Gaps = 28/360 (7%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C---SSTVFNS 138
           +P  +G  +  +  ++V    GTPAQT  + +DT +D +W+ C  C G C       F+ 
Sbjct: 124 IPDHTGTNL-DTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDP 182

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPG 196
           A+S+++  + C    C       C G  C + + YG  S+    LS+DT++  ++    G
Sbjct: 183 AKSSSYAAVPCGTPVCAAA-GGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFTG 241

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
           +TFGC +K  G+     GLLGLGRG LSL +Q    +   FSYCLPS+      G L +G
Sbjct: 242 FTFGCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTT--PGYLNIG 299

Query: 257 PIGQPKR---IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
              +P     ++YT ++K P+  S Y++ L++I +G  ++ +PP        T  GT++D
Sbjct: 300 AT-KPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVF-----TKTGTLLD 353

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-G 368
           SGT+ T L  PAYT++RD F+  +  N         DTCY       IV P ++  FS G
Sbjct: 354 SGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDG 413

Query: 369 MNVTLPQDNLLIHSTAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
               L    ++I        I CLA  + P  +    +++ N QQ+   ++YDVP+ ++G
Sbjct: 414 AVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMP--FSIVGNTQQRAAEVIYDVPSQKIG 471


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 112/343 (32%), Positives = 157/343 (45%), Gaps = 27/343 (7%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   +GTP  T  +  DT +D  W    PCT C    +  F  A S+TF  L C ++ 
Sbjct: 86  YNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145

Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKATGNSV 210
           C+ +PN   TC    C +N  YGS   A  L+ +T+ +     P   FGC  +   GNS 
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTAGYLATETLKVGDASFPSVAFGCSTENGVGNST 205

Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTP 268
              G+ GLGRG+LSL+ Q   L    FSYCL S  A   S  +  G +       ++ TP
Sbjct: 206 --SGIAGLGRGALSLIPQ---LGVGRFSYCLRSGSAAGAS-PILFGSLANLTDGNVQSTP 259

Query: 269 LLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAP 324
            + NP    S YYVNL  I VG    D+P     F  T    G GTI+DSGT  T L   
Sbjct: 260 FVNNPAVHPSYYYVNLTGITVGE--TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 317

Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFS-GMNVTLPQDNL 378
            Y  V+  F  +  +  TV    G D C+        I  P++ L F  G    +P    
Sbjct: 318 GYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFA 377

Query: 379 LIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            + + + GS+T   +   P   +  ++VI N+ Q +  +LYD+
Sbjct: 378 GVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDL 420


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 131/416 (31%), Positives = 178/416 (42%), Gaps = 42/416 (10%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           + LS  E +  M A+ +AR   L S   A   V P +    +  +  Y+V   IGTP Q 
Sbjct: 65  RGLSTRELLHRMAARSKARSARLLSGRAASARVDPGSYTDGVPDT-EYLVHMAIGTPPQP 123

Query: 110 LLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---- 162
           + + +DT +D  W  C  CV C   S   FN ++S TF  L C    C+ +   +C    
Sbjct: 124 VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 183

Query: 163 -GGGACAFNLTYGSSTI-AANLSQDTISLATDI-------VPGYTFGCIQKATGNSVPPQ 213
            G G C +   Y   +I   +L  DT S A+         VP  TFGC     G  V  +
Sbjct: 184 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNE 243

Query: 214 -GLLGLGRGSLSLLAQTQNLYQSTFSYCL-------PSFKALSFSGSLRLGPIGQPKRIK 265
            G+ G  RG+LS+ AQ   L    FSYC        PS   L    +L     G    + 
Sbjct: 244 TGIAGFSRGALSMPAQ---LKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV 300

Query: 266 YTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
            +  L     S L  YY++L  + VG   + IP            GTI+DSGT  T L  
Sbjct: 301 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 360

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLL 379
             Y  V D F  +    +  ++      C+SVP  A    P + L F G  + LP++N +
Sbjct: 361 AVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYM 420

Query: 380 IH-STAGSI--TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                AG I  TCLA+ A  D     L+VI N QQQN  +LYD+ N  L      C
Sbjct: 421 FEIEEAGGIRLTCLAINAGED-----LSVIGNFQQQNMHVLYDLANDMLSFVPARC 471


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 114/351 (32%), Positives = 163/351 (46%), Gaps = 43/351 (12%)

Query: 106 PAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNP 160
           P    LM +DT++D AWV C     + C   +  +++ ++S + ++  C +  C+Q+  P
Sbjct: 178 PGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQL-GP 236

Query: 161 TCGG--------GACAFNLTY-GSSTIAANLSQDTISLA-TDIVPGYTFGCIQKATGN-- 208
              G        G C + + Y   ST +  L  D +SL+ T  VP + FGC   A G+  
Sbjct: 237 YANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFEFGCSHAARGSFS 296

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYT 267
                G++ LGRG  SL++QT   Y   FSYC P     S  G   LG P     R   T
Sbjct: 297 RSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFP--PTASHKGFFVLGVPRRSSSRYAVT 354

Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
           P+LK P    LY V L AI V  + +D+PP          AG  +DS TV TRL   AY 
Sbjct: 355 PMLKTPM---LYQVRLEAIAVAGQRLDVPPTVF------AAGAALDSRTVITRLPPTAYQ 405

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF--SGMNVTLPQDNLLIH 381
           A+R  FR ++       + G  DTCY       I+ PTI+L+F  +G  V L    +L  
Sbjct: 406 ALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLFG 465

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           S      CLA A+   + +    +I  +Q Q   +LY+V    +G  R  C
Sbjct: 466 S------CLAFASTAGD-DRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 175/406 (43%), Gaps = 59/406 (14%)

Query: 83  VPIASGRQ--ITQSPTYIVRAKIGT-PAQTLLMAMDTSNDAAWVPC-------------- 125
           +P  S RQ    +   Y +   +G+ P+Q++ + MDT +D  W PC              
Sbjct: 3   LPSPSRRQPISNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNA 62

Query: 126 --------TGCVGCSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYG 174
                   +  V C S   ++A S+   +  C  A+C    +    C    C  F   YG
Sbjct: 63  TKPLNITRSHRVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYG 122

Query: 175 SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL-- 232
             +  A+L +DT+S++   +  +TFGC   A      P G+ G GRG LSL AQ   L  
Sbjct: 123 DGSFIAHLHRDTLSMSQLFLKNFTFGCAHTALAE---PTGVAGFGRGLLSLPAQLATLSP 179

Query: 233 -YQSTFSYCLPSF----KALSFSGSLRLGPIGQ--PKRIK--YTPLLKNPRRSSLYYVNL 283
              + FSYCL S     + +     L LG       +R++  YT +L+NP+ S  Y V L
Sbjct: 180 NLGNRFSYCLVSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGL 239

Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG----S 339
             I VG+R +  P    + +     G ++DSGT FT L A  Y +V   F RRVG     
Sbjct: 240 TGISVGKRTILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKR 299

Query: 340 NLTVTSLGGFDTCYSVP--IVAPTITLMFSG--MNVTLPQDNLLIHSTAGS------ITC 389
              V    G   CY +   +  PT+T  F G   NV LP+ N       G       + C
Sbjct: 300 ASEVEEKTGLGPCYFLEGLVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGC 359

Query: 390 LAMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNSRLGVARELC 432
           L +    D+         ++ N QQQ   ++YD+ N R+G A+  C
Sbjct: 360 LMLMNGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 114/352 (32%), Positives = 173/352 (49%), Gaps = 52/352 (14%)

Query: 114 MDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
           MDT +D  W  C  C+ C+   +  F+  +S T++ L C++++C  + +P+C    C + 
Sbjct: 1   MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQ 60

Query: 171 LTYG-SSTIAANLSQDTISL---------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGR 220
             YG +++ A  L+ +T +          AT+I     FGC     G+     G++G GR
Sbjct: 61  YYYGDTASTAGVLANETFTFGAANSTKVRATNIA----FGCGSLNAGDLANSSGMVGFGR 116

Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPI----GQPKRIKYTPLLK 271
           G LSL++Q   L  S FSYCL S+ +     L F     L       G P  ++ TP + 
Sbjct: 117 GPLSLVSQ---LGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSP--VQSTPFVI 171

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           NP   ++Y+++L AI +G +++ I P     N     G IIDSGT  T L   AY AV  
Sbjct: 172 NPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV-- 229

Query: 332 VFRRRVGSNLTVTSLG----GFDTCYSVP------IVAPTITLMFSGMNVT-LPQDNLLI 380
             RR + S + + ++     G DTC+  P      +  P +   F   N+T LP++ +LI
Sbjct: 230 --RRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLI 287

Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            ST G + CL M  AP  V +   +I N QQQN  +LYD+ NS L      C
Sbjct: 288 ASTTGYL-CLVM--APTGVGT---IIGNYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 127/448 (28%), Positives = 188/448 (41%), Gaps = 75/448 (16%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT------------------ 96
           +ES +E   +D AR+Q L +  + +K+   I+  ++  + P                   
Sbjct: 10  KESFVESTNRDLARIQTLHTRIIEKKNQNDISRLKKDKERPEKQIKTVVATAASPESYGT 69

Query: 97  --------------------YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSS 133
                               Y +   IGTP +   + +DT +D  W   VPC  C   + 
Sbjct: 70  GLSGQLMATLESGVTLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNG 129

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPT------CGGGACAFNLTYGSS----------T 177
             ++  +S++F+N+GC   +C  V +P            C +   YG S          T
Sbjct: 130 PYYDPKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATET 189

Query: 178 IAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
              NL+  T       V    FGC     G      GLLGLGRG LS  +Q Q+LY  +F
Sbjct: 190 FTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSF 249

Query: 238 SYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL---KNPRRSSLYYVNLLAIRVG 289
           SYCL    +  + S  L  G     +  P+ + +T L+   +NP   + YYV + +I VG
Sbjct: 250 SYCLVDRNSDTNVSSKLIFGEDKDLLNHPE-LNFTTLVGGKENPV-DTFYYVQIKSIMVG 307

Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF 349
             V++IP            GTI+DSGT  +    PAY  ++D F ++V     V      
Sbjct: 308 GEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPIL 367

Query: 350 DTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
           D CY+V     I  P   ++F+ G     P +N  I      + CLA+   P    S L+
Sbjct: 368 DPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTP---RSALS 424

Query: 405 VIANMQQQNHRILYDVPNSRLGVARELC 432
           +I N QQQN  +LYD   SRLG A   C
Sbjct: 425 IIGNYQQQNFHVLYDTKKSRLGYAPMNC 452


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 130/407 (31%), Positives = 178/407 (43%), Gaps = 48/407 (11%)

Query: 60  EMLAKDQARLQFLSS--LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTS 117
           E+L +  ARL F +S   A AR    P A+G   T+   Y+V   IGTP Q + + +DT 
Sbjct: 379 EVLHRMAARLLFSASGRAASARVDPGPYANGVPDTE---YLVHLAIGTPPQPVQLILDTG 435

Query: 118 NDAAWVPCTGCVGCSSTVF---NSAQSTTFKNLGCQAAQCKQVPNPTCG-----GGACAF 169
           +D  W  C  C  C S      + + S+TF  L C +  C  +   +CG        C +
Sbjct: 436 SDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWGNQTCVY 495

Query: 170 NLTYGSSTIA-ANLSQDTISLATD------IVPGYTFGCIQKATGNSVPPQ-GLLGLGRG 221
              Y   +I   +L  +T + A         VP   FGC     G     + G+ G GRG
Sbjct: 496 VYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCGLFNNGIFTSNETGIAGFGRG 555

Query: 222 SLSLLAQTQNLYQSTFSYCL-------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
           +LSL +Q   L    FS+C        PS   L    +L     G    ++ TPL++N  
Sbjct: 556 ALSLPSQ---LKVDNFSHCFTAITGSEPSSVLLGLPANLYSDADGA---VQSTPLVQNFS 609

Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
               YY++L  I VG   + IP            GTIIDSGT  T L   AY  V D F 
Sbjct: 610 SLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFT 669

Query: 335 RRVG---SNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLL--IHSTAG 385
            +V     N T +SL      +SVP  A    P + L F G  + LP++N +       G
Sbjct: 670 AQVRLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFEGATLDLPRENYMFEFEDAGG 729

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           S+TCLA+ A  D     L +I N QQQN  +LYD+  + L      C
Sbjct: 730 SVTCLAINAGDD-----LTIIGNYQQQNLHVLYDLVRNMLSFVPAQC 771


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 132/442 (29%), Positives = 200/442 (45%), Gaps = 49/442 (11%)

Query: 24  ICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS--------- 74
           +    D+ ++L+V H   PCS     +  S   +  E+L +DQ+R++ + S         
Sbjct: 66  VLSNNDNKASLKVVHKHGPCSKLSQDEA-SAAPTHTEILLQDQSRVKSIHSRLSNSKTSG 124

Query: 75  ---LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT----G 127
              + V   + +P   G  +  S  YIV   +GTP + L +  DT +D  W  C      
Sbjct: 125 GKDVKVTDSTTIPAKDGSTV-GSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARS 183

Query: 128 CVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL 182
           C      +F+ +QST++ N+ C ++ C  + +     P C   AC + + YG S+ +   
Sbjct: 184 CYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVYGIQYGDSSFSVGF 243

Query: 183 -SQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
              + ++L +TD      FGC Q   G      GLLGLGR  LS+++QT   Y   FSYC
Sbjct: 244 FGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYC 303

Query: 241 LPSFKA----LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           LPS  +    L+F GS         K  K+TPL       S Y ++   I VG + + I 
Sbjct: 304 LPSSSSSTGFLTFGGS-------ASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAIS 356

Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-- 354
                    + AG IIDSGTV TRL   AY+A+R  FR  +       +L   DTCY   
Sbjct: 357 ASVF-----STAGAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFS 411

Query: 355 --VPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
               I  P I   F SG+ V +    +L  S+   + CLA A   D  +  + +  N+QQ
Sbjct: 412 SYTTISVPKIGFSFSSGIEVDIDATGILYASSLSQV-CLAFAGNSDATD--VFIFGNVQQ 468

Query: 412 QNHRILYDVPNSRLGVARELCT 433
           +   + YD    ++G A   C+
Sbjct: 469 KTLEVFYDGSAGKVGFAPGGCS 490


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 116/365 (31%), Positives = 174/365 (47%), Gaps = 56/365 (15%)

Query: 5   LVFFLAFLFLFSLSE----------GLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSW 54
           LV+FL + +L + +            L P  +       + + HV  P S   P  P+S+
Sbjct: 3   LVWFLGWFYLLATASSFVEKENEAVALGPRVNQSGGVVQMTIHHVHGPGSSLAPQPPVSF 62

Query: 55  EESVLEMLAKDQARLQFLSSLAVAR-----------------KSV-VPIASGRQITQSPT 96
            +    +LA D AR++ L+S    +                 KSV VP+  G  I  S  
Sbjct: 63  SD----VLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASI-GSGN 117

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAA 152
           Y V+   G+PA+   M +DT +  +W+ C  CV  C   +  +F+ + S T+K+L C ++
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177

Query: 153 QCKQV-----PNPTC--GGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQ 203
           QC  +      NP C      C +  +YG S+ +   LSQD ++LA +  +PG+ +GC Q
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQ 237

Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG-QPK 262
            + G      G+LGLGR  LS+L Q  + +   FSYCLP+     F   L +G       
Sbjct: 238 DSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGF---LSIGKASLAGS 294

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
             K+TP+  +P   SLY++ L AI VG R + +   A Q+       TIIDSGTV TRL 
Sbjct: 295 AYKFTPMTTDPGNPSLYFLRLTAITVGGRALGV--AAAQYR----VPTIIDSGTVITRLP 348

Query: 323 APAYT 327
              YT
Sbjct: 349 MSVYT 353


>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 121/418 (28%), Positives = 177/418 (42%), Gaps = 71/418 (16%)

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSS--- 133
           R+  +P++ G   T S     +A+    AQ + + MDT +D  W PC    C+ C     
Sbjct: 58  RQLSLPLSPGSDYTLSFNLGPQAQ----AQPITLYMDTGSDLVWFPCAPFKCILCEGKPN 113

Query: 134 -------TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA----------------FN 170
                  T    + + + K+  C AA     P+  C    C                 F 
Sbjct: 114 EPNASPPTNITQSVAVSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFY 173

Query: 171 LTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
             YG  ++ A L +DT+SL++  +  +TFGC          P G+ G GRG LSL AQ  
Sbjct: 174 YAYGDGSLIARLYRDTLSLSSLFLRNFTFGCAHTTLAE---PTGVAGFGRGLLSLPAQLA 230

Query: 231 NL---YQSTFSYCLPSF----KALSFSGSLRLGPIGQPKRIK---------YTPLLKNPR 274
            L     + FSYCL S     + +     L LG   + ++ K         YT +L+NP+
Sbjct: 231 TLSPQLGNRFSYCLVSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPK 290

Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
               Y V+L+ I VG+R +  P    + N     G ++DSGT FT L A  Y +V D F 
Sbjct: 291 HPYFYTVSLIGIAVGKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFD 350

Query: 335 RRVGSN----LTVTSLGGFDTCYSVPIVA--PTITLMFSG---MNVTLPQDNLLIHSTAG 385
           RRVG +      +    G   CY +  VA  P +TL F+G    +V LP+ N     + G
Sbjct: 351 RRVGRDNKRARKIEEKTGLAPCYYLNSVADVPALTLRFAGGKNSSVVLPRKNYFYEFSDG 410

Query: 386 S--------ITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           S        + CL +    D  +        + N QQQ   + YD+   R+G AR  C
Sbjct: 411 SDGAKGKRKVGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 130/416 (31%), Positives = 178/416 (42%), Gaps = 42/416 (10%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           + LS  E +  M A+ +AR   L S   A   + P +    +  +  Y+V   IGTP Q 
Sbjct: 65  RGLSTRELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQP 123

Query: 110 LLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---- 162
           + + +DT +D  W  C  CV C   S   FN ++S TF  L C    C+ +   +C    
Sbjct: 124 VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 183

Query: 163 -GGGACAFNLTYGSSTI-AANLSQDTISLATDI-------VPGYTFGCIQKATGNSVPPQ 213
            G G C +   Y   +I   +L  DT S A+         VP  TFGC     G  V  +
Sbjct: 184 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNE 243

Query: 214 -GLLGLGRGSLSLLAQTQNLYQSTFSYCL-------PSFKALSFSGSLRLGPIGQPKRIK 265
            G+ G  RG+LS+ AQ   L    FSYC        PS   L    +L     G    + 
Sbjct: 244 TGIAGFSRGALSMPAQ---LKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV 300

Query: 266 YTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
            +  L     S L  YY++L  + VG   + IP            GTI+DSGT  T L  
Sbjct: 301 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 360

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLL 379
             Y  V D F  +    +  ++      C+SVP  A    P + L F G  + LP++N +
Sbjct: 361 AVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYM 420

Query: 380 IH-STAGSI--TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                AG I  TCLA+ A  D     L+VI N QQQN  +LYD+ N  L      C
Sbjct: 421 FEIEEAGGIRLTCLAINAGED-----LSVIGNFQQQNMHVLYDLANDMLSFVPARC 471


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 130/416 (31%), Positives = 178/416 (42%), Gaps = 42/416 (10%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           + LS  E +  M A+ +AR   L S   A   + P +    +  +  Y+V   IGTP Q 
Sbjct: 39  RGLSTRELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQP 97

Query: 110 LLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---- 162
           + + +DT +D  W  C  CV C   S   FN ++S TF  L C    C+ +   +C    
Sbjct: 98  VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 157

Query: 163 -GGGACAFNLTYGSSTI-AANLSQDTISLATDI-------VPGYTFGCIQKATGNSVPPQ 213
            G G C +   Y   +I   +L  DT S A+         VP  TFGC     G  V  +
Sbjct: 158 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNE 217

Query: 214 -GLLGLGRGSLSLLAQTQNLYQSTFSYCL-------PSFKALSFSGSLRLGPIGQPKRIK 265
            G+ G  RG+LS+ AQ   L    FSYC        PS   L    +L     G    + 
Sbjct: 218 TGIAGFSRGALSMPAQ---LKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV 274

Query: 266 YTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
            +  L     S L  YY++L  + VG   + IP            GTI+DSGT  T L  
Sbjct: 275 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 334

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLL 379
             Y  V D F  +    +  ++      C+SVP  A    P + L F G  + LP++N +
Sbjct: 335 AVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYM 394

Query: 380 IH-STAGSI--TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                AG I  TCLA+ A  D     L+VI N QQQN  +LYD+ N  L      C
Sbjct: 395 FEIEEAGGIRLTCLAINAGED-----LSVIGNFQQQNMHVLYDLANDMLSFVPARC 445


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 115/370 (31%), Positives = 170/370 (45%), Gaps = 58/370 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS-----STVFNSAQSTTFKNLGCQA 151
           Y++   IGTP Q +   +DT +D  W+ C  C  C       T+F S  S+++K L C +
Sbjct: 5   YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64

Query: 152 AQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATD--------IVPGYTF 199
             C  + +   G      C +   YG  S  + ++  D IS  +            G+ F
Sbjct: 65  THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGS- 252
           GC +K  G+    QGL+GLG+ S SL+ Q  +     FSYCL      PS K+  F GS 
Sbjct: 125 GCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLGSS 184

Query: 253 --LRLGPIGQPKRIKYTPLLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-- 307
             LR         +  TP+L       +LYYV+L +I +G   V +       N + G  
Sbjct: 185 AALR------GHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPF 238

Query: 308 --AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV---------GSNLTVTSLGGFDTCYSVP 356
               T+IDSGT +T L  P Y A+R     +V         G +L   S G  DT Y   
Sbjct: 239 LANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLDLCFNSSG--DTSYGF- 295

Query: 357 IVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
              P++T  F+  + + LP +N+    T+  + CL+M    D+    L++I NMQQQN  
Sbjct: 296 ---PSVTFYFANQVQLVLPFENIF-QVTSRDVVCLSM----DSSGGDLSIIGNMQQQNFH 347

Query: 416 ILYDVPNSRL 425
           ILYD+  S++
Sbjct: 348 ILYDLVASQI 357


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 117/391 (29%), Positives = 181/391 (46%), Gaps = 42/391 (10%)

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGCS----STV 135
           S  P+ SG   + S  Y V  ++G+P QTLL+  DT +D  WV C+ C   CS     + 
Sbjct: 68  SKSPLMSGAS-SGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGST 126

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPT---CGG----GACAFNLTYGS-STIAANLSQDTI 187
           F +  STTF    C ++ C+ VP P    C        C +   Y   S  +   S++T 
Sbjct: 127 FLARHSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETT 186

Query: 188 SLATD-----IVPGYTFGCIQKATGNSV------PPQGLLGLGRGSLSLLAQTQNLYQST 236
           +L T       +    FGC   A+G S+         G++GLGRG +S  +Q    +  +
Sbjct: 187 TLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRS 246

Query: 237 FSYCLPSFK-ALSFSGSLRLGPIGQPKR-----IKYTPLLKNPRRSSLYYVNLLAIRVGR 290
           FSYCL  +  +   +  L +G +   K+     + +TPLL NP   + YY+++  + V  
Sbjct: 247 FSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDG 306

Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG----SNLTVTSL 346
             + I P     +     GT+IDSGT  T L  PAY  +   F+R V     +    ++ 
Sbjct: 307 VKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTR 366

Query: 347 GGFDTCYSVPIVA----PTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
            GFD C +V  V+    P ++L   G ++ + P  N  I  + G I CLA+    +  + 
Sbjct: 367 SGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEG-IKCLAIQPV-EAESG 424

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             +VI N+ QQ   + +D   SRLG +R  C
Sbjct: 425 RFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 128/422 (30%), Positives = 176/422 (41%), Gaps = 68/422 (16%)

Query: 72  LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCV 129
           LS+    R+  +P++ G   T S     RA+    AQ + + MDT +D  W PC    C+
Sbjct: 29  LSAKRFRRQLSLPLSPGSDYTLSFNLGPRAQ----AQPITLYMDTGSDLVWFPCAPFKCI 84

Query: 130 GC-----SSTVFNSAQSTTF--KNLGCQAAQCKQVPNPTCGGGACA-------------- 168
            C     +S   N+ +S     K+  C AA     P+  C    C               
Sbjct: 85  LCEGKPNASPPVNTTRSVAVSCKSPACSAAHNLASPSDLCAAARCPLESIETSDCANFKC 144

Query: 169 --FNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
             F   YG  ++ A L +DT+SL++  +  +TFGC   A      P G+ G GRG LSL 
Sbjct: 145 PPFYYAYGDGSLIARLYRDTLSLSSLFLRNFTFGC---AYTTLAEPTGVAGFGRGLLSLP 201

Query: 227 AQTQNL---YQSTFSYCLPSF----KALSFSGSLRLGPI----------GQPKRIKYTPL 269
           AQ   L     + FSYCL S     + +     L LG            G      YTP+
Sbjct: 202 AQLATLSPQLGNRFSYCLVSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPM 261

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           L+NP+    Y V L+ I VG+R+V  P    + N     G ++DSGT FT L A  Y +V
Sbjct: 262 LENPKHPYFYTVGLIGISVGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSV 321

Query: 330 RDVFRRRVG----SNLTVTSLGGFDTCYSVPIVA--PTITLMFSGMN--VTLPQDNLLIH 381
            D F R VG        +    G   CY +  VA  P +TL F+G N  V LP+ N    
Sbjct: 322 VDEFDRGVGRVNERARKIEEKTGLAPCYYLNSVAEVPVLTLRFAGGNSSVVLPRKNYFYE 381

Query: 382 STAG--------SITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDVPNSRLGVARE 430
              G         + CL +    D           + N QQQ   + YD+   R+G AR 
Sbjct: 382 FLDGRDAAKGKRRVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARR 441

Query: 431 LC 432
            C
Sbjct: 442 QC 443


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 136/447 (30%), Positives = 209/447 (46%), Gaps = 61/447 (13%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS----------------- 74
           +T+ + H   PCSP  P+K +   E   E L +D+ R  ++                   
Sbjct: 62  ATVPLHHRHGPCSPL-PNKKMPTLE---ERLHRDKLRAAYIHRKLSRGKKQGGGGAGGDV 117

Query: 75  -LAVARKSVVPIASGRQITQSPTYIVRAKIGT-PAQTLLMAMDTSNDAAWVPCTGCV-GC 131
            +  +    VP   G  +  +  Y++  ++G+ P ++  M +DT +D +WV C  C   C
Sbjct: 118 VVQQSHAMTVPTTLGTSL-DTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQC 176

Query: 132 SSTV---FNSAQSTTFKNLGCQAAQCKQV-----PNPTCGGGACAFNLTYGSSTIA--AN 181
              V   F+ + S+T+    C +A C Q+      N     G C +   YG  ++     
Sbjct: 177 RPQVDPLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGT 236

Query: 182 LSQDTISLATD----IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST- 236
            S DT++L ++    +V  + FGC    TG +    GL+GLG G+ SL++QT   + +T 
Sbjct: 237 YSSDTLALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTA 296

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKR-IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           FSYCLP     S SG L LG  G        TP+L++ +  + Y V L AIRVG R + I
Sbjct: 297 FSYCLP--PTPSSSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSI 354

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGF-DTC 352
           P           AG I+DSGTV TRL   AY+++   F+  +       S   GGF DTC
Sbjct: 355 PTTVFS------AGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTC 408

Query: 353 YSV----PIVAPTITLMFSGMN---VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           + +     +  PT+ L+FSG     V L    +L+     SI CLA  A  D+ ++   +
Sbjct: 409 FDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGST--GI 466

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           I N+QQ+  ++LYDV    +G     C
Sbjct: 467 IGNVQQRTFQVLYDVAGGAVGFKAGAC 493


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 116/376 (30%), Positives = 165/376 (43%), Gaps = 46/376 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-------AQSTTFKNLGC 149
           Y +    GTP QTL   MDT +   W PCT    C++  F S         S++ K +GC
Sbjct: 77  YSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGC 136

Query: 150 QAAQCKQVPNPTCGGGACAFN------------LTYGSSTIAANLSQDTISLATDIVPGY 197
           +  +C  +         C  N            + YGS T       +T+ L   IVP +
Sbjct: 137 KNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTGGVALSETLHLHGLIVPNF 196

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRL 255
             GC   +  +S  P G+ G GRG  SL +Q   L  + FSYCL S  F     S SL L
Sbjct: 197 LVGC---SVFSSRQPAGIAGFGRGPSSLPSQ---LGLTKFSYCLLSHKFDDTQESSSLVL 250

Query: 256 GPIGQPKR----IKYTPLLKNPR------RSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
                  +    + YTPL+KNP+       S  YYV+L  I +G R V IP   L  +  
Sbjct: 251 DSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKD 310

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSV----PIV 358
              GTIIDSGT FT +   A+  + + F  +V +    L V +L G   C++V     + 
Sbjct: 311 GNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVSGAKELE 370

Query: 359 APTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAA-APDNVNSVLNVIANMQQQNHRI 416
            P + L F  G +V LP +N      +  + C  +     +  +    ++ N Q QN  +
Sbjct: 371 LPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGNFQMQNFYV 430

Query: 417 LYDVPNSRLGVARELC 432
            YD+ N RLG  +E C
Sbjct: 431 EYDLQNERLGFKKESC 446


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 131/421 (31%), Positives = 181/421 (42%), Gaps = 61/421 (14%)

Query: 58  VLEMLAKD-QARLQFLSSLAVARKSVVPIAS-----GRQITQSPTYIVRAKIGTPAQTLL 111
           V + L +D   R +F   LA +  S  P  +      + +     YI+   IGTP Q+  
Sbjct: 47  VRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQSYP 106

Query: 112 MAMDTSNDAAWVPCT----GCVGCSSTVFNSAQSTTFKNLGCQ------AAQCK---QVP 158
              DT +D  W  C      C    S ++N + S TF+ L C       AA+ +     P
Sbjct: 107 AIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGATP 166

Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ 213
            P C   AC +N TYG+   +     +T +  +       VPG  FGC   ++ +     
Sbjct: 167 PPGC---ACRYNQTYGTGWTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDW---N 220

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-----IKYTP 268
           G  GL       L+    L    FSYCL  F+      +L LGP           ++ TP
Sbjct: 221 GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTP 280

Query: 269 LLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            + +P +   S+ YY+NL  I VG   + IPPGA         G IIDSGT  T LV  A
Sbjct: 281 FVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAA 340

Query: 326 YTAVRDVFRRRV------GSNLTVTSLGGFDTCYSV------PIVAPTITLMF-SGMNVT 372
           Y  VR   R  V      GSN T     G D C+++      P   P++TL F  G ++ 
Sbjct: 341 YKRVRAAVRSLVKLPVTDGSNAT-----GLDLCFALPSSSAPPATLPSMTLHFGGGADMV 395

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP +N +I    G + CLAM +  D     L+ + N QQQN  ILYDV    L  A   C
Sbjct: 396 LPVENYMILD--GGMWCLAMRSQTDG---ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450

Query: 433 T 433
           +
Sbjct: 451 S 451


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 116/370 (31%), Positives = 170/370 (45%), Gaps = 58/370 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS-----STVFNSAQSTTFKNLGCQA 151
           Y++   IGTP Q +   +DT +D  W+ C  C  C       T+F S  S+++K L C +
Sbjct: 5   YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64

Query: 152 AQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATD--------IVPGYTF 199
             C  + +   G      C +   YG  S  + ++  D IS  +            G+ F
Sbjct: 65  THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGS- 252
           GC +K  G+    QGL+GLG+ S SL+ Q  +     FSYCL      PS K+  F GS 
Sbjct: 125 GCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLGSS 184

Query: 253 --LRLGPIGQPKRIKYTPLLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-- 307
             LR         +  TP+L       +LYYV+L +I VG   V +       N + G  
Sbjct: 185 AALR------GHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPF 238

Query: 308 --AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV---------GSNLTVTSLGGFDTCYSVP 356
               T+IDSGT +T L  P Y A+R     +V         G +L   S G  DT Y   
Sbjct: 239 LANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLDLCFNSSG--DTSYGF- 295

Query: 357 IVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
              P++T  F+  + + LP +N+    T+  + CL+M    D+    L++I NMQQQN  
Sbjct: 296 ---PSVTFYFANQVQLVLPFENIF-QVTSRDVVCLSM----DSSGGDLSIIGNMQQQNFH 347

Query: 416 ILYDVPNSRL 425
           ILYD+  S++
Sbjct: 348 ILYDLVASQI 357


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 123/404 (30%), Positives = 182/404 (45%), Gaps = 50/404 (12%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP------TYIVRAKIGTPAQT 109
           E++  ++AK  AR+++++  A A  S     +G    +SP       Y++   +GTP + 
Sbjct: 10  EAIRGLVAKSHARVRWMA--ARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKR 67

Query: 110 LLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC--GG 164
                DT +D  WV   PCTGC G   T+F+  QS+TF+ + C +  C ++P  +C  G 
Sbjct: 68  FRAIADTGSDLVWVQSEPCTGCSG--GTIFDPRQSSTFREMDCSSQLCTELPG-SCEPGS 124

Query: 165 GACAFNLTYGSSTIAANLSQDTISLAT-----DIVPGYTFGCIQKATG-NSVPPQGLLGL 218
            AC+++  YGS       ++DTISL T        P +  GC    +G + V   GL+GL
Sbjct: 125 SACSYSYEYGSGETEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGFDGV--DGLVGL 182

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLLKNPRRS 276
           G+G +SL +Q      S FSYCL    + S S  L  GP        I+ T +       
Sbjct: 183 GQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTY 242

Query: 277 SLYY---VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
             YY   VN +A+           G    +P T   TIIDSGT  T + +  Y  V    
Sbjct: 243 PTYYLLTVNGIAV----------AGQTMGSPGT---TIIDSGTTLTYVPSGVYGRVLSRM 289

Query: 334 RRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDN-LLIHSTAGSIT 388
              V       S  G D CY          P +T+  +G  +T P  N  L+   +G   
Sbjct: 290 ESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVDDSGDTV 349

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           CLAM +A       +++I N+ QQ + ILYD  +S L   +  C
Sbjct: 350 CLAMGSAG---GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 131/421 (31%), Positives = 181/421 (42%), Gaps = 61/421 (14%)

Query: 58  VLEMLAKD-QARLQFLSSLAVARKSVVPIAS-----GRQITQSPTYIVRAKIGTPAQTLL 111
           V + L +D   R +F   LA +  S  P  +      + +     YI+   IGTP Q+  
Sbjct: 52  VRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQSYP 111

Query: 112 MAMDTSNDAAWVPCT----GCVGCSSTVFNSAQSTTFKNLGCQ------AAQCK---QVP 158
              DT +D  W  C      C    S ++N + S TF+ L C       AA+ +     P
Sbjct: 112 AIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGATP 171

Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ 213
            P C   AC +N TYG+   +     +T +  +       VPG  FGC   ++ +     
Sbjct: 172 PPGC---ACRYNQTYGTGWTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDW---N 225

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-----IKYTP 268
           G  GL       L+    L    FSYCL  F+      +L LGP           ++ TP
Sbjct: 226 GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTP 285

Query: 269 LLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            + +P +   S+ YY+NL  I VG   + IPPGA         G IIDSGT  T LV  A
Sbjct: 286 FVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAA 345

Query: 326 YTAVRDVFRRRV------GSNLTVTSLGGFDTCYSV------PIVAPTITLMF-SGMNVT 372
           Y  VR   R  V      GSN T     G D C+++      P   P++TL F  G ++ 
Sbjct: 346 YKRVRAAVRSLVKLPVTDGSNAT-----GLDLCFALPSSSAPPATLPSMTLHFGGGADMV 400

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP +N +I    G + CLAM +  D     L+ + N QQQN  ILYDV    L  A   C
Sbjct: 401 LPVENYMILD--GGMWCLAMRSQTDG---ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 455

Query: 433 T 433
           +
Sbjct: 456 S 456


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 131/421 (31%), Positives = 181/421 (42%), Gaps = 61/421 (14%)

Query: 58  VLEMLAKD-QARLQFLSSLAVARKSVVPIAS-----GRQITQSPTYIVRAKIGTPAQTLL 111
           V + L +D   R +F   LA +  S  P  +      + +     YI+   IGTP Q+  
Sbjct: 47  VRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQSYP 106

Query: 112 MAMDTSNDAAWVPCT----GCVGCSSTVFNSAQSTTFKNLGCQ------AAQCK---QVP 158
              DT +D  W  C      C    S ++N + S TF+ L C       AA+ +     P
Sbjct: 107 AIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGATP 166

Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ 213
            P C   AC +N TYG+   +     +T +  +       VPG  FGC   ++ +     
Sbjct: 167 PPGC---ACRYNQTYGTGWTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDW---N 220

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-----IKYTP 268
           G  GL       L+    L    FSYCL  F+      +L LGP           ++ TP
Sbjct: 221 GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTP 280

Query: 269 LLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            + +P +   S+ YY+NL  I VG   + IPPGA         G IIDSGT  T LV  A
Sbjct: 281 FVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAA 340

Query: 326 YTAVRDVFRRRV------GSNLTVTSLGGFDTCYSV------PIVAPTITLMF-SGMNVT 372
           Y  VR   R  V      GSN T     G D C+++      P   P++TL F  G ++ 
Sbjct: 341 YKRVRAAVRSLVKLPVTDGSNAT-----GLDLCFALPSSSAPPATLPSMTLHFGGGADMV 395

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LP +N +I    G + CLAM +  D     L+ + N QQQN  ILYDV    L  A   C
Sbjct: 396 LPVENYMILD--GGMWCLAMRSQTDG---ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450

Query: 433 T 433
           +
Sbjct: 451 S 451


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 123/410 (30%), Positives = 183/410 (44%), Gaps = 50/410 (12%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP------TYIVRAKI 103
           K +   E++  ++AK  AR+++++  A A  S     +G    +SP       Y++   +
Sbjct: 4   KGVKRSEAIRALVAKSHARVRWMA--ARANSSSWSSMAGTTDVESPLHPDGGGYVMDISV 61

Query: 104 GTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNP 160
           GTP +      DT +D  WV   PCTGC G   T+F+  QS+TF+ + C +  C ++P  
Sbjct: 62  GTPGKRFRAIADTGSDLVWVQSEPCTGCSG--GTIFDPRQSSTFREMDCSSQLCAELPG- 118

Query: 161 TC--GGGACAFNLTYGSSTIAANLSQDTISLAT-----DIVPGYTFGCIQKATG-NSVPP 212
           +C  G   C+++  YGS       ++DTISL T        P +  GC    +G + V  
Sbjct: 119 SCEPGSSTCSYSYEYGSGETEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGFDGV-- 176

Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLL 270
            GL+GLG+G +SL +Q      S FSYCL    + S S  L  GP        I+ T + 
Sbjct: 177 DGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKIT 236

Query: 271 KNPRRSSLYY---VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
                   YY   VN +A+           G    +P T   TIIDSGT  T + +  Y 
Sbjct: 237 PPSDTYPTYYLLTVNGIAV----------AGQTMGSPGT---TIIDSGTTLTYVPSGVYG 283

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDN-LLIHS 382
            V       V       S  G D CY          P +T+  +G  +T P  N  L+  
Sbjct: 284 RVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVD 343

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            +G   CLAM +A       +++I N+ QQ + ILYD  +S L   +  C
Sbjct: 344 DSGDTVCLAMGSAS---GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 128/449 (28%), Positives = 199/449 (44%), Gaps = 72/449 (16%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--------- 82
           +++ + H   PC+P   S     + S+ E L +D+AR  ++ + A   ++          
Sbjct: 17  ASVPLVHRHGPCAP---SAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAG 73

Query: 83  ----VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSS 133
               +P   G  +  S  Y+V   IGTPA    + +DT +D +WV C       C     
Sbjct: 74  GGTSIPTFLGDSV-NSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKD 132

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-------GGA---CAFNLTYGS-STIAANL 182
            +F+ + S+++ ++ C +  C+++     G       GGA   C + + YG+ +T     
Sbjct: 133 PLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVY 192

Query: 183 SQDTISLATDIV-PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
           S +T++L   +V   + FGC     G      GLLGLG    SL++QT + +   FSYCL
Sbjct: 193 STETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 252

Query: 242 PSFKALSFSGSLRLGPIGQPKR---------IKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
           P       SG      +G P           + +TP+ + P   + Y V L  I VG   
Sbjct: 253 P-----PTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAP 307

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGG-FD 350
           + IPP A        +G +IDSGTV T L A AY A+R  FR  +    L   S GG  D
Sbjct: 308 LAIPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLD 361

Query: 351 TCYS----VPIVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
           TCY       +  PTI+L FSG   +++  P   L+         CLA A A    ++ +
Sbjct: 362 TCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG-------CLAFAGA--GTDNAI 412

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
            +I N+ Q+   +LYD     +G     C
Sbjct: 413 GIIGNVNQRTFEVLYDSGKGTVGFRAGAC 441


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 110/349 (31%), Positives = 157/349 (44%), Gaps = 28/349 (8%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y VR  IG+PA    M +D+ +D  W+   PC  C   +  +FN A S +F  + C 
Sbjct: 126 SGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACS 185

Query: 151 AAQCKQVPNPT-CGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
           +  C Q+ +   C  G C + + YG  S     L+ +TI++   ++     GC     G 
Sbjct: 186 SNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRTVIQDTAIGCGHWNEGM 245

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
            V   GLLGLG G +S + Q        F YCL S          R  P+G      + P
Sbjct: 246 FVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVS----------RAMPVGA----MWVP 291

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L+ NP   S YYV+L  + VG   V I     Q       G ++D+GT  TRL   AY A
Sbjct: 292 LIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLPTVAYNA 351

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHST 383
            RD F  +  +      +  FDTCY     V +  PT++  FSG  + T P  N LI + 
Sbjct: 352 FRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPAD 411

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                C A A +P    S L++I N+QQ+  ++  D  N  +G    +C
Sbjct: 412 DVGTFCFAFAPSP----SGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 111/350 (31%), Positives = 172/350 (49%), Gaps = 35/350 (10%)

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV--- 157
           A  G+ +  + + +DT+ D  W+ C  C       ++  +S+T+    C ++ CKQ+   
Sbjct: 154 ATDGSSSPPVTVVLDTAGDVPWMRCVPCTFAQCADYDPTRSSTYSAFPCNSSACKQLGRY 213

Query: 158 PNPTCGGGACAFNL-TYGSS-TIAANLSQDTISLAT-DIVPGYTFGCIQKATGN-SVPPQ 213
            N     G C + + T G S T +   S D +++ + D V G+ FGC Q   G+      
Sbjct: 214 ANGCDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSGDRVEGFRFGCSQNEQGSFENQAD 273

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKN 272
           G++ LGRG  SL+AQT + Y   FSYCLP  +  +  G  ++G PIG   R   TP+LK 
Sbjct: 274 GIMALGRGVQSLMAQTSSTYGDAFSYCLPPTE--TTKGFFQIGVPIGASYRFVTTPMLKE 331

Query: 273 PRRSS-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
              +S     LY   LLAI V  + +++P           AGT++DS T+ TRL   AY 
Sbjct: 332 RGGASAAAATLYRALLLAITVDGKELNVPAEVF------AAGTVMDSRTIITRLPVTAYG 385

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMN-VTLPQDNLLIHS 382
           A+R  FR R+   +        DTCY +  V     P I L+F G   V + +  +L++ 
Sbjct: 386 ALRAAFRNRMRYRV-APPQEELDTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSGILLNG 444

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                 CLA A+  D  +S  +++ N+QQQ  ++L+DV   R+G     C
Sbjct: 445 ------CLAFASNDD--DSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 124/426 (29%), Positives = 187/426 (43%), Gaps = 58/426 (13%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT------------Y 97
           K LS  E +   + + +AR   LS  AV  ++     SG+   Q+P             Y
Sbjct: 42  KQLSRPELIRRAMRRSKARAAALS--AVRNRARF---SGKNEQQTPAGVLPVRPSGDLEY 96

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQC 154
           +V   IGTP Q +   +DT +D  W  C  C  C S    +F   QS +++ + C    C
Sbjct: 97  VVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLC 156

Query: 155 KQVPNPTCGG-GACAFNLTYGSSTIAANL---------SQDTISLATDIVPGYTFGCIQK 204
             + + +C     C +   YG  T+   +         S     L T  VP   FGC   
Sbjct: 157 SDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP-LGFGCGSV 215

Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS----GSLRLGPIGQ 260
             G+     G++G GR  LSL++Q   L    FSYCL S+ +   S    GSL  G  G 
Sbjct: 216 NVGSLNNGSGIVGFGRNPLSLVSQ---LSIRRFSYCLTSYASRRQSTLLFGSLSDGVYGD 272

Query: 261 PK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
              R++ TPLL++P+  + YYV+   + VG R + IP  A    P    G I+DSGT  T
Sbjct: 273 ATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALT 332

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSVP-----------IVAPTITLMF 366
            L A     V   FR+++   L   + G  +   C+ VP           +  P + L F
Sbjct: 333 LLPAAVLAEVVRAFRQQL--RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHF 390

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
            G ++ LP+ N ++        CL +A + D+ ++    I N+ QQ+ R+LYD+    L 
Sbjct: 391 QGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGST----IGNLVQQDMRVLYDLEAETLS 446

Query: 427 VARELC 432
           +A   C
Sbjct: 447 IAPARC 452


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 112/372 (30%), Positives = 165/372 (44%), Gaps = 50/372 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAA 152
           Y++   +GTP + + + +DT +D  W  C  C+ C    ++ V + A S+T   L C A 
Sbjct: 90  YLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPCDAP 149

Query: 153 QCKQVPNPTCGG-----GACAFNLTYGSSTI-AANLSQDTISLATDIVPG------YTFG 200
            C+ +P  +CGG      +C +   YG  ++    L+ D+ +   D   G       TFG
Sbjct: 150 LCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVTFG 209

Query: 201 CIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-- 257
           C     G       G+ G GRG  SL +Q   L  ++FSYC  S      S  + LG   
Sbjct: 210 CGHINKGIFQANETGIAGFGRGRWSLPSQ---LNVTSFSYCFTSMFDTKSSSVVTLGAAA 266

Query: 258 --------IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
                         ++ T L+KNP + SLY+V L  I VG   V +P   L+      + 
Sbjct: 267 AELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLR------SS 320

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-------PTI 362
           TIIDSG   T L    Y AV+  F  +VG           D C+++P+ A       P +
Sbjct: 321 TIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWRRPAVPAL 380

Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAM-AAAPDNVNSVLNVIANMQQQNHRILYDV 420
           TL    G +  LP+ N +    A  + C+ + AAA + V     VI N QQQN  ++YD+
Sbjct: 381 TLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQV-----VIGNYQQQNTHVVYDL 435

Query: 421 PNSRLGVARELC 432
            N  L  A   C
Sbjct: 436 ENDVLSFAPARC 447


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 122/436 (27%), Positives = 196/436 (44%), Gaps = 66/436 (15%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVL--EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
           VFH    CS   P+  L  +  V+  E + +   +L F  ++++                
Sbjct: 30  VFHSIHLCSSLNPALVLPLKTQVIPPESVRRSPDKLPFRHNISLT--------------- 74

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSSTVFNSAQSTTFKNLGCQA 151
                V   +GTP Q + M +DT ++ +W+ C        SS+ FN   S+++  + C +
Sbjct: 75  -----VSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCSS 129

Query: 152 AQC----KQVP-NPTCGGGA-CAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQK 204
           + C    +  P  P+C     C   L+Y  +S+   NL+ DT  + +  +P   FGC+  
Sbjct: 130 STCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDS 189

Query: 205 ATGNSVPPQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--I 258
              ++        GL+G+ RGSLS ++Q   +    FSYC+  +    FSG L LG    
Sbjct: 190 IFSSNSEEDSKNTGLMGMNRGSLSFVSQ---MGFPKFSYCISEYD---FSGLLLLGDANF 243

Query: 259 GQPKRIKYTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
                + YTPL++     P    + Y V L  I+V  +++ IP    + + T    T++D
Sbjct: 244 SWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVD 303

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PT 361
           SGT FT L+ PAYTA+RD F  +   +L V         G  D CY VP         P+
Sbjct: 304 SGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPS 363

Query: 362 ITLMFSGMNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           +TL+F G  +T+  D +L           SI C     + D +     VI ++ QQN  +
Sbjct: 364 VTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNS-DLLGVEAFVIGHLHQQNVWM 422

Query: 417 LYDVPNSRLGVARELC 432
            +D+  SR+G+A   C
Sbjct: 423 EFDLKKSRIGLAEIRC 438


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  140 bits (354), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 128/449 (28%), Positives = 199/449 (44%), Gaps = 72/449 (16%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--------- 82
           +++ + H   PC+P   S     + S+ E L +D+AR  ++ + A   ++          
Sbjct: 97  ASVPLVHRHGPCAPSAAS---GGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAG 153

Query: 83  ----VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSS 133
               +P   G  +  S  Y+V   IGTPA    + +DT +D +WV C       C     
Sbjct: 154 GGTSIPTFLGDSV-NSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKD 212

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-------GGA---CAFNLTYGS-STIAANL 182
            +F+ + S+++ ++ C +  C+++     G       GGA   C + + YG+ +T     
Sbjct: 213 PLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVY 272

Query: 183 SQDTISLATDIV-PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
           S +T++L   +V   + FGC     G      GLLGLG    SL++QT + +   FSYCL
Sbjct: 273 STETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 332

Query: 242 PSFKALSFSGSLRLGPIGQPKR---------IKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
           P       SG      +G P           + +TP+ + P   + Y V L  I VG   
Sbjct: 333 P-----PTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAP 387

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGG-FD 350
           + IPP A        +G +IDSGTV T L A AY A+R  FR  +    L   S GG  D
Sbjct: 388 LAIPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLD 441

Query: 351 TCYS----VPIVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
           TCY       +  PTI+L FSG   +++  P   L+         CLA A A    ++ +
Sbjct: 442 TCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG-------CLAFAGA--GTDNAI 492

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
            +I N+ Q+   +LYD     +G     C
Sbjct: 493 GIIGNVNQRTFEVLYDSGKGTVGFRAGAC 521


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  140 bits (354), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 109/334 (32%), Positives = 161/334 (48%), Gaps = 21/334 (6%)

Query: 112 MAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC--GGGA 166
           M +DT +D  WV C  C  C   S  VF+ + S ++  + C + +C+ +    C    GA
Sbjct: 1   MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60

Query: 167 CAFNLTYGS-STIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
           C + + YG  S    + + +T++L     V     GC     G  V   GLL LG G LS
Sbjct: 61  CLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLS 120

Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLL 284
             +Q   +  STFSYCL    + + S +L+ G           PL+++PR S+ YYV L 
Sbjct: 121 FPSQ---ISASTFSYCLVDRDSPAAS-TLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALS 176

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAG-TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
            I VG + + IP  A   + T+G+G  I+DSGT  TRL + AY A+RD F +   S    
Sbjct: 177 GISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRT 236

Query: 344 TSLGGFDTCYSV----PIVAPTITLMFSGMN-VTLPQDNLLIHSTAGSITCLAMAAAPDN 398
           + +  FDTCY +     +  P ++L F G   + LP  N LI        CLA A     
Sbjct: 237 SGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAP---- 292

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            N+ +++I N+QQQ  R+ +D     +G     C
Sbjct: 293 TNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 127/440 (28%), Positives = 196/440 (44%), Gaps = 57/440 (12%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS------------LAVARK- 80
           L + H  SPCSP     PL  +     +LA D AR+  L++            L  +R  
Sbjct: 45  LTLHHPQSPCSP----APLPADLPFSAVLAHDGARVASLAARLAKTPSSRPTLLDESRAG 100

Query: 81  -------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
                        + VP+  G  +     Y+ R  +GTPA++ +M +DT +   W+ C+ 
Sbjct: 101 SSSSSSPDDESSLASVPLGPGTSVGVG-NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSP 159

Query: 128 CV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSST 177
           CV  C   S  VFN   S+++ ++ C A QC  +   T    +C+      +  +YG S+
Sbjct: 160 CVVSCHRQSGPVFNPKASSSYTSVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSS 219

Query: 178 IAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST 236
            +   LS+DT+S  +  VP + +GC Q   G      GL+GL R  LSLL Q       +
Sbjct: 220 FSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYS 279

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           FSYCLP+  + S            P +  YTP+  +    SLY++ +  I+V  + + + 
Sbjct: 280 FSYCLPTSSSSSSGYLSIG--SYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVS 337

Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--- 353
             A    P     TIIDSGTV TRL    Y+A+       +      ++    DTC+   
Sbjct: 338 SSAYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQ 392

Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           +  +  P +T+ F+G          L+     + TCLA A A         +I N QQQ 
Sbjct: 393 AARLRVPEVTMAFAGGAALKLAARNLLVDVDSATTCLAFAPARSAA-----IIGNTQQQT 447

Query: 414 HRILYDVPNSRLGVARELCT 433
             ++YDV NS++G A   C+
Sbjct: 448 FSVVYDVKNSKIGFAAGGCS 467


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 116/361 (32%), Positives = 161/361 (44%), Gaps = 36/361 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y+V   IGTP Q + + +DT +D  W  C  C  C       F+ + S+T     C +  
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141

Query: 154 CKQVPNPTCG------GGACAFNLTYGSSTIAAN-LSQDTISL--ATDIVPGYTFGCIQK 204
           C+ +P  +CG         C +  +YG  ++    L  D  +   A   VPG  FGC   
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201

Query: 205 ATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
             G     + G+ G GRG LSL +Q   L    FS+C  +   L  S  L   P    K 
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKPSTVLLDLPADLYKS 258

Query: 264 ----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
               ++ TPL++NP   + YY++L  I VG   + +P         TG GTIIDSGT  T
Sbjct: 259 GRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTG-GTIIDSGTAMT 317

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPIVA----PTITLMFSGMNVTL 373
            L    Y  VRD F  +V   L V S    D   C S P+ A    P + L F G  + L
Sbjct: 318 SLPTRVYRLVRDAFAAQV--KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDL 375

Query: 374 PQDNLL--IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
           P++N +  +     SI CLA+    +     +  I N QQQN  +LYD+ NS+L      
Sbjct: 376 PRENYVFEVEDAGSSILCLAIIEGGE-----VTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430

Query: 432 C 432
           C
Sbjct: 431 C 431


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 127/440 (28%), Positives = 196/440 (44%), Gaps = 57/440 (12%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS------------LAVARK- 80
           L + H  SPCSP     PL  +     +LA D AR+  L++            L  +R  
Sbjct: 45  LTLHHPQSPCSP----APLPADLPFSAVLAHDGARVASLAARLAKTPSSRPTLLDESRAG 100

Query: 81  -------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
                        + VP+  G  +     Y+ R  +GTPA++ +M +DT +   W+ C+ 
Sbjct: 101 SSSSSSPDDESSLASVPLGPGTSVGVG-NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSP 159

Query: 128 CV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSST 177
           CV  C   S  VFN   S+++ ++ C A QC  +   T    +C+      +  +YG S+
Sbjct: 160 CVVSCHRQSGPVFNPKASSSYTSVSCSAQQCSDLTTATLSPASCSTSNVCIYQASYGDSS 219

Query: 178 IAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST 236
            +   LS+DT+S  +  VP + +GC Q   G      GL+GL R  LSLL Q       +
Sbjct: 220 FSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYS 279

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           FSYCLP+  + S            P +  YTP+  +    SLY++ +  I+V  + + + 
Sbjct: 280 FSYCLPTSSSSSSGYLSIG--SYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVS 337

Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--- 353
             A    P     TIIDSGTV TRL    Y+A+       +      ++    DTC+   
Sbjct: 338 SSAYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQ 392

Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           +  +  P +T+ F+G          L+     + TCLA A A         +I N QQQ 
Sbjct: 393 AARLRVPEVTMAFAGGAALKLAARNLLVDVDSATTCLAFAPARSAA-----IIGNTQQQT 447

Query: 414 HRILYDVPNSRLGVARELCT 433
             ++YDV NS++G A   C+
Sbjct: 448 FSVVYDVKNSKIGFAAGGCS 467


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 116/361 (32%), Positives = 161/361 (44%), Gaps = 36/361 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y+V   IGTP Q + + +DT +D  W  C  C  C       F+ + S+T     C +  
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141

Query: 154 CKQVPNPTCG------GGACAFNLTYGSSTIAAN-LSQDTISL--ATDIVPGYTFGCIQK 204
           C+ +P  +CG         C +  +YG  ++    L  D  +   A   VPG  FGC   
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201

Query: 205 ATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
             G     + G+ G GRG LSL +Q   L    FS+C  +   L  S  L   P    K 
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKPSTVLLDLPADLYKS 258

Query: 264 ----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
               ++ TPL++NP   + YY++L  I VG   + +P         TG GTIIDSGT  T
Sbjct: 259 GRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTG-GTIIDSGTAMT 317

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPIVA----PTITLMFSGMNVTL 373
            L    Y  VRD F  +V   L V S    D   C S P+ A    P + L F G  + L
Sbjct: 318 SLPTRVYRLVRDAFAAQV--KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDL 375

Query: 374 PQDNLL--IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
           P++N +  +     SI CLA+    +     +  I N QQQN  +LYD+ NS+L      
Sbjct: 376 PRENYVFEVEDAGSSILCLAIIEGGE-----VTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430

Query: 432 C 432
           C
Sbjct: 431 C 431


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 123/397 (30%), Positives = 198/397 (49%), Gaps = 36/397 (9%)

Query: 61  MLAKDQARLQFL---------SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLL 111
           ML +DQ R++ +          S     ++ +P+ SG  +  +  Y+V+  +GTP  +L 
Sbjct: 1   MLLQDQLRVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLG-AGNYLVKMALGTPKLSLS 59

Query: 112 MAMDTSNDAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNP----TCG 163
           +A+DT +D  W  C  CVG     + T F+  +S+++KN+ C ++ C+ + +      C 
Sbjct: 60  LALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCV 119

Query: 164 GGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRG 221
              C + + YG  + +    + + ++++ +D++  + FGC Q+  G      GLLGLGRG
Sbjct: 120 SSTCIYKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIAGLLGLGRG 179

Query: 222 SLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRIKYTPLLKNPRRSSLYY 280
            LSL  QT   Y + F+YCLPSF + S +G L LG  GQ PK +K+TPL    + +  Y 
Sbjct: 180 KLSLALQTSEKYNNLFTYCLPSFSSSS-TGHLTLG--GQVPKSVKFTPLSPAFKNTPFYG 236

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           +++  + VG  V+ I          + AG IIDSGTV TRL    Y+A+   F++ +   
Sbjct: 237 IDIKGLSVGGHVLPIDASVF-----SNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDY 291

Query: 341 LTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAA 395
                    DTCY       I  P I+  F  G+ V +    +L    A    CLA A  
Sbjct: 292 PKTDGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPN 351

Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            D+ + V  V  N QQQ + +++D+   R+G A   C
Sbjct: 352 DDDGDFV--VFGNSQQQTYDVVHDLAKGRIGFAPSGC 386


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 127/438 (28%), Positives = 196/438 (44%), Gaps = 55/438 (12%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS------------LAVARK- 80
           L + H  SPCSP     PL  +     +LA D AR+  L++            L  +R  
Sbjct: 45  LTLHHPQSPCSP----APLPADLPFSAVLAHDGARIASLAARLAKTPSSRPTLLDESRAG 100

Query: 81  -----------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
                      + VP+  G  +     Y+ R  +GTPA++ +M +DT +   W+ C+ CV
Sbjct: 101 SSSSSPDDESLASVPLGPGTSVGVG-NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCV 159

Query: 130 -GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSSTIA 179
             C   S  VFN   S+++ ++ C A QC  +   T    +C+      +  +YG S+ +
Sbjct: 160 VSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFS 219

Query: 180 AN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
              LS+DT+S  +  VP + +GC Q   G      GL+GL R  LSLL Q       +FS
Sbjct: 220 VGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFS 279

Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           YCLP+  + S            P +  YTP+  +    SLY++ +  I+V  + + +   
Sbjct: 280 YCLPTSSSSSSGYLSIG--SYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSS 337

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SV 355
           A    P     TIIDSGTV TRL    Y+A+       +      ++    DTC+   + 
Sbjct: 338 AYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAA 392

Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
            +  P +T+ F+G          L+     + TCLA A A         +I N QQQ   
Sbjct: 393 RLRVPEVTMAFAGGAALKLAARNLLVDVDSATTCLAFAPARSAA-----IIGNTQQQTFS 447

Query: 416 ILYDVPNSRLGVARELCT 433
           ++YDV NS++G A   C+
Sbjct: 448 VVYDVKNSKIGFAAAGCS 465


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 127/438 (28%), Positives = 196/438 (44%), Gaps = 55/438 (12%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS------------LAVARK- 80
           L + H  SPCSP     PL  +     +LA D AR+  L++            L  +R  
Sbjct: 45  LTLHHPQSPCSP----APLPADLPFSAVLAHDGARIASLAARLAKTPSSRPTLLDESRAG 100

Query: 81  -----------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
                      + VP+  G  +     Y+ R  +GTPA++ +M +DT +   W+ C+ CV
Sbjct: 101 SSSSSPDDESLASVPLGPGTSVGVG-NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCV 159

Query: 130 -GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSSTIA 179
             C   S  VFN   S+++ ++ C A QC  +   T    +C+      +  +YG S+ +
Sbjct: 160 VSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFS 219

Query: 180 AN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
              LS+DT+S  +  VP + +GC Q   G      GL+GL R  LSLL Q       +FS
Sbjct: 220 VGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFS 279

Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           YCLP+  + S            P +  YTP+  +    SLY++ +  I+V  + + +   
Sbjct: 280 YCLPTSSSSSSGYLSIG--SYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSS 337

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SV 355
           A    P     TIIDSGTV TRL    Y+A+       +      ++    DTC+   + 
Sbjct: 338 AYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAA 392

Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
            +  P +T+ F+G          L+     + TCLA A A         +I N QQQ   
Sbjct: 393 RLRVPEVTMAFAGGAALKLAARNLLVDVDSATTCLAFAPARSAA-----IIGNTQQQTFS 447

Query: 416 ILYDVPNSRLGVARELCT 433
           ++YDV NS++G A   C+
Sbjct: 448 VVYDVKNSKIGFAAGGCS 465


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 173/400 (43%), Gaps = 52/400 (13%)

Query: 76  AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV 135
           A ARK+VV  A    +     Y+V+  IGTP      A+DT++D  W  C  C GC   V
Sbjct: 70  ASARKAVV--AETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQV 127

Query: 136 ---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTIS 188
              FN   S+T+  L C +  C ++    CG     +C +  TY G++T    L+ D + 
Sbjct: 128 DPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLV 187

Query: 189 LATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           +  D   G  FGC   +TG + PPQ  G++GLGRG LSL++Q   L    F+YCLP   A
Sbjct: 188 IGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQ---LSVRRFAYCLPP-PA 243

Query: 247 LSFSGSLRLGPIGQPKRIK----YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG---- 298
               G L LG      R        P+ ++PR  S YY+NL  + +G R + +PP     
Sbjct: 244 SRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTT 303

Query: 299 -------------------ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
                              A+        G IID  +  T L A  Y  + +     +  
Sbjct: 304 ATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRL 363

Query: 340 NLTVTSLGGFDTCYSVP-------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM 392
                S  G D C+ +P       +  P + L F G  + L +  L        + CL +
Sbjct: 364 PRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMV 423

Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             A     SV +++ N QQQN ++LY++   R+   +  C
Sbjct: 424 GRA--EAGSV-SILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 128/435 (29%), Positives = 193/435 (44%), Gaps = 62/435 (14%)

Query: 55  EESVLEMLAKDQARLQFL-----------------SSLAVARKSVVPIASGRQITQSPTY 97
           EES+L++  KD  R++ +                    A++ + V  + SG  +  S  Y
Sbjct: 93  EESLLDLAEKDAVRIETMYRRAARSGGGRMPASSSPRRALSERMVATVESGVAVG-SGEY 151

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC 154
           ++   +GTP +   M MDT +D  W+ C  C+ C      VF+ A S++++N+ C   +C
Sbjct: 152 LMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHRC 211

Query: 155 KQVPNP---------TC---GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVP 195
             V  P         TC   G   C +   YG  S    +L+ ++ ++      A+  V 
Sbjct: 212 GHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVD 271

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF------KALSF 249
           G  FGC  +  G      GLLGLGRG LS  +Q + +Y  TFSYCL         K +  
Sbjct: 272 GVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVGSKVVFG 331

Query: 250 SGSLRLGPIGQPKRIKYTPL----LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
                L     P+ +KYT        +    + YYV L  + VG  +++I          
Sbjct: 332 EDDDALALAAHPQ-LKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGKD 390

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYSVPIVA----P 360
              GTIIDSGT  +  V PAY  +R  F  R+  +   V        CY+V  V     P
Sbjct: 391 GSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLSPCYNVSGVERPEVP 450

Query: 361 TITLMFS-GMNVTLPQDNLLIH--STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
            ++L+F+ G     P +N  I      GSI CLA+   P    + +++I N QQQN  ++
Sbjct: 451 ELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTP---RTGMSIIGNFQQQNFHVV 507

Query: 418 YDVPNSRLGVARELC 432
           YD+ N+RLG A   C
Sbjct: 508 YDLQNNRLGFAPRRC 522


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 173/400 (43%), Gaps = 52/400 (13%)

Query: 76  AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV 135
           A ARK+VV  A    +     Y+V+  IGTP      A+DT++D  W  C  C GC   V
Sbjct: 70  ASARKAVV--AETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQV 127

Query: 136 ---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTIS 188
              FN   S+T+  L C +  C ++    CG     +C +  TY G++T    L+ D + 
Sbjct: 128 DPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLV 187

Query: 189 LATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           +  D   G  FGC   +TG + PPQ  G++GLGRG LSL++Q   L    F+YCLP   A
Sbjct: 188 IGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQ---LSVRRFAYCLPP-PA 243

Query: 247 LSFSGSLRLGPIGQPKRIK----YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG---- 298
               G L LG      R        P+ ++PR  S YY+NL  + +G R + +PP     
Sbjct: 244 SRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTT 303

Query: 299 -------------------ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
                              A+        G IID  +  T L A  Y  + +     +  
Sbjct: 304 ATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRL 363

Query: 340 NLTVTSLGGFDTCYSVP-------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM 392
                S  G D C+ +P       +  P + L F G  + L +  L        + CL +
Sbjct: 364 PRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMV 423

Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             A     SV +++ N QQQN ++LY++   R+   +  C
Sbjct: 424 GRA--EAGSV-SILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 119/399 (29%), Positives = 165/399 (41%), Gaps = 66/399 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCV------------------------G 130
           Y +   +G  +Q + + MDT +D  W PCT   C+                         
Sbjct: 75  YTLSFNLGPHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDPSPPTNISHSTPIS 134

Query: 131 CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYGSSTIAANLSQDTI 187
           C+S   + A S+T  +  C  A C    +    CG   C  F   YG  ++ A+L +DT+
Sbjct: 135 CNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLIASLYRDTL 194

Query: 188 SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSYCL--P 242
           SL+T  +  +TFGC          P G+ G GRG LSL AQ         + FSYCL   
Sbjct: 195 SLSTLQLTNFTFGCAHTTFSE---PTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSH 251

Query: 243 SFKA--LSFSGSLRLGPIGQPKR--------IKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
           SF++  +     L LG     K+          YT +L+NP+ S  Y V L  I VG++ 
Sbjct: 252 SFRSERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGLKGISVGKKT 311

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV-GSNL---TVTSLGG 348
           V  P    + N     G ++DSGT FT L    Y +V + F RR   SN     +    G
Sbjct: 312 VPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQKTG 371

Query: 349 FDTCY--SVPIVAPTITLMFSGMN--VTLPQDNLLIHSTAGS--------ITCLAMAAAP 396
              CY  +   + P +TL F GMN  V LP+ N       G         + CL      
Sbjct: 372 LSPCYYLNTAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLMFMNGG 431

Query: 397 DNVN---SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           D          V+ N QQQ   + YD+   R+G AR  C
Sbjct: 432 DEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 114/370 (30%), Positives = 169/370 (45%), Gaps = 42/370 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-----SSTVFNSAQSTTFKNLGCQA 151
           Y +   +GTP     + +DT ++  W  C  C  C      + V   A+S+TF  L C  
Sbjct: 91  YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150

Query: 152 AQCKQVPNP----TCGG-GACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKA 205
           + C+ +P      TC    ACA+N TYGS   A  L+ +T+++     P   FGC  +  
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSGYTAGYLATETLTVGDGTFPKVAFGCSTENG 210

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-- 263
             NS    G++GLGRG LSL++Q   L    FSYCL S  A   +  +  G + +     
Sbjct: 211 VDNS---SGIVGLGRGPLSLVSQ---LAVGRFSYCLRSDMADGGASPILFGSLAKLTERS 264

Query: 264 -IKYTPLLKNP--RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT-GAGTIIDSGTVFT 319
            ++ TPLLKNP  +RS+ YYVNL  I V    + +      F  T  G GTI+DSGT  T
Sbjct: 265 VVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLT 324

Query: 320 RLVAPAYTAVRDVFRRRVGS-NLTVTSLGG---FDTCYS-------VPIVAPTITLMFS- 367
            L    Y  V+  F+ ++ + N T  + G     D CY          +  P + L F+ 
Sbjct: 325 YLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAG 384

Query: 368 GMNVTLPQDNLLIHSTAGS-----ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
           G    +P  N      A S     + CL +  A D++   +++I N+ Q +  +LYD+  
Sbjct: 385 GAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PISIIGNLMQMDMHLLYDIDG 442

Query: 423 SRLGVARELC 432
                A   C
Sbjct: 443 GMFSFAPADC 452


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 109/371 (29%), Positives = 173/371 (46%), Gaps = 44/371 (11%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--FNSAQSTTFKNLGCQAAQCKQ 156
           V   +GTP Q + M +DT ++ +W+ C      +S    FN  +S +++ + C ++ C  
Sbjct: 33  VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPTTFNQTRSISYRPIPCSSSTCTN 92

Query: 157 ------VPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
                 +P        C   L+Y  +S+   NL+ DT  +    +PG  FGC+     ++
Sbjct: 93  QTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASDIPGMVFGCMDSVFSSN 152

Query: 210 VPPQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKR 263
                   GL+G+ RGSLS ++Q   +    FSYC+       FSG L LG         
Sbjct: 153 SDEDSKNTGLMGMNRGSLSFVSQ---MGFPKFSYCI---SGTDFSGMLLLGESNFTWAVP 206

Query: 264 IKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
           + YTPL++     P    + Y V L  I+V  R++ IP    + + T    T++DSGT F
Sbjct: 207 LNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDSGTQF 266

Query: 319 TRLVAPAYTAVRDVFRR------RVGSNLTVTSLGGFDTCYSVPIVA------PTITLMF 366
           T L+ PAYTA+R  F        RV  +      G  D CY VPI        PT++L+F
Sbjct: 267 TFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPTVSLVF 326

Query: 367 SGMNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           +G  +T+  + +L           S+ CL+   + D +     VI +  QQN  + +D+ 
Sbjct: 327 NGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNS-DLLGVEAYVIGHHHQQNVWMEFDLE 385

Query: 422 NSRLGVARELC 432
            SR+G+A+  C
Sbjct: 386 RSRIGLAQVRC 396


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 119/419 (28%), Positives = 182/419 (43%), Gaps = 31/419 (7%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
           T+ + H  SP SPF  S+    +  +   L +  +R+     +A A  SV P A+   +T
Sbjct: 33  TVDLIHRDSPLSPFYNSEETDLQR-INNALRRSISRVHHFDPIAAA--SVSPKAAESDVT 89

Query: 93  QS-PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLG 148
            +   Y++   +GTP   ++   DT +D  W  C  C  C   V   F+   S T+++  
Sbjct: 90  SNRGEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFS 149

Query: 149 CQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDI-----VPGYTFGCI 202
           C A QC  +   TC G  C +  +YG  S    N++ DTI+L +        P    GC 
Sbjct: 150 CDARQCSLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCG 209

Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP--- 257
            +  G  S    G++GLG G LSL++Q  +     FSYCL P       S  L  G    
Sbjct: 210 HENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAV 269

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT-IIDSGT 316
           +  P  ++ TPLL +   SS Y++ L A+ VG   +     +L     TG G  IIDSGT
Sbjct: 270 VSGPG-VQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLG----TGEGNIIIDSGT 324

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLP 374
             T +    ++ +      +V         G    CYS    +  P IT  F+G +V L 
Sbjct: 325 TLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSDLKVPAITAHFTGADVKLK 384

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             N  +   +  + CLA A+      S +++  N+ Q N  + Y++    L      CT
Sbjct: 385 PINTFVQ-VSDDVVCLAFAS----TTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDCT 438


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 114/370 (30%), Positives = 169/370 (45%), Gaps = 42/370 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-----SSTVFNSAQSTTFKNLGCQA 151
           Y +   +GTP     + +DT ++  W  C  C  C      + V   A+S+TF  L C  
Sbjct: 91  YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150

Query: 152 AQCKQVPNP----TCGG-GACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKA 205
           + C+ +P      TC    ACA+N TYGS   A  L+ +T+++     P   FGC  +  
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSGYTAGYLATETLTVGDGTFPKVAFGCSTENG 210

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-- 263
             NS    G++GLGRG LSL++Q   L    FSYCL S  A   +  +  G + +     
Sbjct: 211 VDNS---SGIVGLGRGPLSLVSQ---LAVGRFSYCLRSDMADGGASPILFGSLAKLTEGS 264

Query: 264 -IKYTPLLKNP--RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT-GAGTIIDSGTVFT 319
            ++ TPLLKNP  +RS+ YYVNL  I V    + +      F  T  G GTI+DSGT  T
Sbjct: 265 VVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLT 324

Query: 320 RLVAPAYTAVRDVFRRRVGS-NLTVTSLGG---FDTCY-------SVPIVAPTITLMFS- 367
            L    Y  V+  F+ ++ + N T  + G     D CY          +  P + L F+ 
Sbjct: 325 YLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAG 384

Query: 368 GMNVTLPQDNLLIHSTAGS-----ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
           G    +P  N      A S     + CL +  A D++   +++I N+ Q +  +LYD+  
Sbjct: 385 GAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PISIIGNLMQMDMHLLYDIDG 442

Query: 423 SRLGVARELC 432
                A   C
Sbjct: 443 GMFSFAPADC 452


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 160/372 (43%), Gaps = 38/372 (10%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   +GTP + + + +DT +D +W+ C  C  C   + + +    S+T++N+ C   +
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPR 230

Query: 154 CK-----------QVPNPTC---GGGACAFNLT--YGSSTIAANLSQDTISLATDIVPGY 197
           C+           +  N TC      A   N T  + S T   NL+          V   
Sbjct: 231 CQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDV 290

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLG 256
            FGC     G      GLLGLGRG +S  +Q Q++Y  +FSYCL   F   S S  L  G
Sbjct: 291 MFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLIFG 350

Query: 257 PIGQ---PKRIKYTPLLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPT-----T 306
              +      + +T LL        + YY+ + +I VG  V+DI      ++        
Sbjct: 351 EDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADA 410

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPT 361
           G GTIIDSG+  T     AY  +++ F +++              CY+V      +  P 
Sbjct: 411 GGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQVELPD 470

Query: 362 ITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
             + F+   V   P +N         + CLA+   P+  +S L +I N+ QQN  ILYDV
Sbjct: 471 FGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPN--HSHLTIIGNLLQQNFHILYDV 528

Query: 421 PNSRLGVARELC 432
             SRLG +   C
Sbjct: 529 KRSRLGYSPRRC 540


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 128/417 (30%), Positives = 193/417 (46%), Gaps = 49/417 (11%)

Query: 36  VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP 95
           + H   PC+P   S       S+ EM  +  ARL ++ S    +K  VP   G  + +S 
Sbjct: 58  LLHRHGPCAP---SLSTDTPPSMSEMFRRSHARLSYIVS---GKKVSVPAHLGTSV-KSL 110

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--CS---STVFNSAQSTTFKNLGCQ 150
            Y+     GTPA   ++ +DT +D  W+ C  C    CS     +F+ + S+T+  + C 
Sbjct: 111 EYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCA 170

Query: 151 AAQCKQVPNPTCGGGA-----CAFNLTY--GSSTIAANLSQDTISLATD-IVPGYTFGCI 202
           + +CK++     G G      C F ++Y  G+ST+     +D ++LA   IV  + FGC 
Sbjct: 171 SGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGV-YGKDKLTLAPGAIVKDFYFGCG 229

Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK 262
              +       GLLGLGR S SL AQ        FSYCLP+    S  G L  G    P 
Sbjct: 230 HSKSSLPGLFDGLLGLGRLSESLGAQYGG--GGGFSYCLPAVN--SKPGFLAFGAGRNPS 285

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
              +TP+ + P + +   V L  I VG + +D+ P A         G I+DSGTV T L 
Sbjct: 286 GFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFS------GGMIVDSGTVVTVLQ 339

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSG---MNVTLPQ 375
           +  Y A+R  FR  + +   V   G  DTCY +     +V P I L FSG   +N+ +P 
Sbjct: 340 STVYRALRAAFREAMKAYRLVH--GDLDTCYDLTGYKNVVVPKIALTFSGGATINLDVP- 396

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           + +L++       CLA A      +    V+ N+ Q+   +L+D   S+ G   + C
Sbjct: 397 NGILVNG------CLAFAET--GKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 107/340 (31%), Positives = 158/340 (46%), Gaps = 23/340 (6%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IG P    +   DT +D  W  C  C  C    + V++ + S+TF  L C +A 
Sbjct: 71  YLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSAT 130

Query: 154 CKQVPNPTCGGGA-CAFNLTYGSSTIAAN-LSQDTISLATDIVP----GYTFGCIQKATG 207
           C  + +  C   + C +   YG    +A  L  +T++L     P    G  FGC     G
Sbjct: 131 CLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDNGG 190

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ----PKR 263
           +S+   G +GLGRG+LSLLAQ   L    FSYCL  F   +      LG + +    P  
Sbjct: 191 DSLNSTGTVGLGRGTLSLLAQ---LGVGKFSYCLTDFFNSALDSPFLLGTLAELAPGPST 247

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           ++ TPLL++P+  S Y+V+L  I +G   + IP G          G I+DSGT FT L  
Sbjct: 248 VQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGTTFTILAE 307

Query: 324 PAYTAVRDVFRRRVGS-NLTVTSLGG--FDTCYSVPIVAPTITLMFS-GMNVTLPQDNLL 379
             +  V     R +G   +  +SL    F      P   P + L F+ G ++ L +DN +
Sbjct: 308 SGFREVVGRVARVLGQPPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYM 367

Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
            ++   S  CL +A       SVL    N QQQN ++L+D
Sbjct: 368 SYNEEDSSFCLNIAGTTPESTSVL---GNFQQQNIQMLFD 404


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  138 bits (347), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 108/370 (29%), Positives = 173/370 (46%), Gaps = 43/370 (11%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ- 156
           IV   +GTP Q + M +DT ++ +W+ C   +   +T F+  +ST+++ + C +  C   
Sbjct: 32  IVSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYPTT-FDPTRSTSYQTIPCSSPTCTNR 90

Query: 157 -----VPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA----T 206
                +P        C   L+Y  +S+   NL+ D   + +  + G  FGC+       +
Sbjct: 91  TQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSDISGLVFGCMDSVFSSNS 150

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRI 264
                  GL+G+ RGSLS ++Q   L    FSYC+       FSG L LG   +     +
Sbjct: 151 DEDSKSTGLMGMNRGSLSFVSQ---LGFPKFSYCI---SGTDFSGLLLLGESNLTWSVPL 204

Query: 265 KYTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
            YTPL++     P    + Y V L  I+V  +++ IP    + + T    T++DSGT FT
Sbjct: 205 NYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQFT 264

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPI------VAPTITLMFS 367
            L+ P Y A+R  F  +  S L V         G  D CY VP+      + PT+TL+F 
Sbjct: 265 FLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTLVFR 324

Query: 368 GMNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
           G  +T+  D +L           S+ CL+   + D +     VI +  QQN  + +D+  
Sbjct: 325 GAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNS-DLLGVEAYVIGHHHQQNVWMEFDLEK 383

Query: 423 SRLGVARELC 432
           SR+G+A+  C
Sbjct: 384 SRIGLAQVRC 393


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  138 bits (347), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 114/406 (28%), Positives = 183/406 (45%), Gaps = 40/406 (9%)

Query: 58  VLEMLAKDQARLQFLS------------------SLAVARKSVVPIASGRQITQSPTYIV 99
           V   L +D AR+QFL+                     +      P+ SG+       Y+ 
Sbjct: 91  VRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLA 150

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------VFNSAQSTTFKNLGCQAAQ 153
           +  +G P +   +  DT +D  W+ C  C   ++       +F+   S+++  L C + Q
Sbjct: 151 QIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQ 210

Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSVP 211
           CK +    C    C + + YG  +     L+ +T+S   ++ +P    GC     G    
Sbjct: 211 CKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAG 270

Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLK 271
             GL+GLG G++SL +Q   L  S+FSYCL +  + S S +L       P     +PL+K
Sbjct: 271 GAGLIGLGGGAISLSSQ---LKASSFSYCLVNLDSDS-SSTLEFNS-NMPSDSLTSPLVK 325

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           N R  S  YV ++ I VG + + I P   + + +   G I+DSGT+ +RL +  Y ++R+
Sbjct: 326 NDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLRE 385

Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGS 386
            F +   S      +  FDTCY+      +  PTI  + S G ++ LP  N LI      
Sbjct: 386 AFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAG 445

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             CLA         S L++I + QQQ  R+ YD+ NS +G +   C
Sbjct: 446 TYCLAFI----KTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  138 bits (347), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 106/355 (29%), Positives = 159/355 (44%), Gaps = 38/355 (10%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAW----VPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
           TY+V   IGTP   L   +DT +D  W     PC  C    + ++  A+S T+ N+ C +
Sbjct: 99  TYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGS 158

Query: 152 AQCKQVPN-------------PTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPG 196
             C  +P+             P    G C +  +YG  S+    L+ +T +  A   V  
Sbjct: 159 RLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGAGTTVHD 218

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
             FGC     G +    GL+G+GRG LSL++Q   L  + FSYC   F   + S  L LG
Sbjct: 219 LAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQ---LGVTKFSYCFTPFNDTTTSSPLFLG 275

Query: 257 PIGQ----PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
                    K   + P    PRRSS YY++L  I VG  ++ I P   +   +   G II
Sbjct: 276 SSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASGRGGLII 335

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITLM 365
           DSGT FT L   A+  +      RV   L   +  G   C++ P       +  P + L 
Sbjct: 336 DSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGRGPEAVDVPRLVLH 395

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           F G ++ LP+ + ++      + CL + +A       ++V+ +MQQQN  + YDV
Sbjct: 396 FDGADMELPRSSAVVEDRVAGVACLGIVSA-----RGMSVLGSMQQQNMHVRYDV 445


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 125/424 (29%), Positives = 182/424 (42%), Gaps = 52/424 (12%)

Query: 55  EESVLEMLAKDQARLQFL-----------------SSLAVARKSVVPIASGRQITQSPTY 97
           +ES L+   KD AR+  +                    A+A + V  + SG  +  S  Y
Sbjct: 94  KESFLDSAGKDVARIHTMLRRVAGAGGGRAATNSTPRRALAERIVATVESGVAVG-SGEY 152

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC 154
           +V   +GTP +   M MDT +D  W+ C  C+ C      VF+ A S +++N+ C   +C
Sbjct: 153 LVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPRC 212

Query: 155 KQVPNPTC-------GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVPGYTFG 200
             V  PT            C +   YG  S    +L+ +  ++      A+  V    FG
Sbjct: 213 GLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFG 272

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--- 257
           C     G      GLLGLGRG+LS  +Q + +Y   FSYCL    + S    +  G    
Sbjct: 273 CGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGS-SVGSKIVFGDDDA 331

Query: 258 -IGQPKRIKYTPLLKNPRR--SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
            +G P R+ YT    +      + YYV L  + VG   ++I P           GTIIDS
Sbjct: 332 LLGHP-RLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDS 390

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYSVP----IVAPTITLMFS-G 368
           GT  +    PAY  +R  F  R+      V        CY+V     +  P  +L+F+ G
Sbjct: 391 GTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADG 450

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
                P +N  +      I CLA+   P    S +++I N QQQN  +LYD+ N+RLG A
Sbjct: 451 AVWDFPAENYFVRLDPDGIMCLAVLGTP---RSAMSIIGNFQQQNFHVLYDLQNNRLGFA 507

Query: 429 RELC 432
              C
Sbjct: 508 PRRC 511


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 119/381 (31%), Positives = 166/381 (43%), Gaps = 44/381 (11%)

Query: 84  PIASGRQITQSPT--YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNS 138
           P++ G      PT  Y+V   IGTP Q + + +DT +D  W  C  C  C       F+ 
Sbjct: 20  PVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDP 79

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCG------GGACAFNLTYGSSTIAAN-LSQDTISL-- 189
           + S+T     C +  C+ +P  +CG         C +  +YG  ++    L  D  +   
Sbjct: 80  STSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVG 139

Query: 190 ATDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYC-------L 241
           A   VPG  FGC     G     + G+ G GRG LSL +Q   L    FS+C       +
Sbjct: 140 AGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTTITGAI 196

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLL---KNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           PS   L     L     G    ++ TPL+   KN    +LYY++L  I VG   + +P  
Sbjct: 197 PSTVLLDLPADLFSNGQG---AVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPES 253

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV 358
           A      TG GTIIDSGT  T L    Y  VRD F  ++   +   +  G  TC+S P  
Sbjct: 254 AFALTNGTG-GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQ 312

Query: 359 A----PTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
           A    P + L F G  + LP++N +         SI CLA+     N      +I N QQ
Sbjct: 313 AKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAI-----NKGDETTIIGNFQQ 367

Query: 412 QNHRILYDVPNSRLGVARELC 432
           QN  +LYD+ N+ L      C
Sbjct: 368 QNMHVLYDLQNNMLSFVAAQC 388


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 125/424 (29%), Positives = 182/424 (42%), Gaps = 52/424 (12%)

Query: 55  EESVLEMLAKDQARLQFL-----------------SSLAVARKSVVPIASGRQITQSPTY 97
           +ES L+   KD AR+  +                    A+A + V  + SG  +  S  Y
Sbjct: 94  KESFLDSAGKDVARIHTMLRRVAGAGGGRAATNSTPRRALAERIVATVESGVAVG-SGEY 152

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC 154
           +V   +GTP +   M MDT +D  W+ C  C+ C      VF+ A S +++N+ C   +C
Sbjct: 153 LVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPRC 212

Query: 155 KQVPNPTC-------GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVPGYTFG 200
             V  PT            C +   YG  S    +L+ +  ++      A+  V    FG
Sbjct: 213 GLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFG 272

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--- 257
           C     G      GLLGLGRG+LS  +Q + +Y   FSYCL    + S    +  G    
Sbjct: 273 CGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGS-SVGSKIVFGDDDA 331

Query: 258 -IGQPKRIKYTPLLKNPRR--SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
            +G P R+ YT    +      + YYV L  + VG   ++I P           GTIIDS
Sbjct: 332 LLGHP-RLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDS 390

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYSVP----IVAPTITLMFS-G 368
           GT  +    PAY  +R  F  R+      V        CY+V     +  P  +L+F+ G
Sbjct: 391 GTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADG 450

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
                P +N  +      I CLA+   P    S +++I N QQQN  +LYD+ N+RLG A
Sbjct: 451 AVWDFPAENYFVRLDPDGIMCLAVLGTP---RSAMSIIGNFQQQNFHVLYDLQNNRLGFA 507

Query: 429 RELC 432
              C
Sbjct: 508 PRRC 511


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 124/452 (27%), Positives = 196/452 (43%), Gaps = 68/452 (15%)

Query: 26  DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---SSLAVARKSV 82
           ++  + +++ + H   PC+P   S     + S+ E L +D+AR  ++   ++      + 
Sbjct: 37  NSDPNRASVPLVHRHGPCAP---SAASGGKPSLAERLRRDRARANYIVTKAAGGRTAATA 93

Query: 83  VPIASGRQITQSPT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT-----G 127
           V  A G   T  PT          Y+V   IGTPA   ++ +DT +D +WV C       
Sbjct: 94  VSDAVGGGGTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGE 153

Query: 128 CVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-------CAFNLTYGS-STIA 179
           C      +F+ + S+++ ++ C +  C+++     G G        C + + YG+ +T  
Sbjct: 154 CYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTT 213

Query: 180 ANLSQDTISLATDIV-PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
              S +T++L   +V   + FGC     G      GLLGLG    SL++QT + +   FS
Sbjct: 214 GVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 273

Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIK---------YTPLLKNPRRSSLYYVNLLAIRVG 289
           YCLP       SG      +G P             +TP+ + P   + Y V L  I VG
Sbjct: 274 YCLP-----PTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVG 328

Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG- 348
              + +PP A        +G +IDSGTV T L A AY A+R  FR  +     +    G 
Sbjct: 329 GAPLAVPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGA 382

Query: 349 -FDTCYSVP----IVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
             DTCY       +  PTI L FSG   +++  P   L+         CLA A A    +
Sbjct: 383 VLDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVLVDG-------CLAFAGA--GTD 433

Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             + +I N+ Q+   +LYD     +G     C
Sbjct: 434 DTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 114/406 (28%), Positives = 183/406 (45%), Gaps = 40/406 (9%)

Query: 58  VLEMLAKDQARLQFLS------------------SLAVARKSVVPIASGRQITQSPTYIV 99
           V   L +D AR+QFL+                     +      P+ SG+       Y+ 
Sbjct: 91  VRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLA 150

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------VFNSAQSTTFKNLGCQAAQ 153
           +  +G P +   +  DT +D  W+ C  C   ++       +F+   S+++  L C + Q
Sbjct: 151 QIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQ 210

Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSVP 211
           CK +    C    C + + YG  +     L+ +T+S   ++ +P    GC     G    
Sbjct: 211 CKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAG 270

Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLK 271
             GL+GLG G++SL +Q   L  S+FSYCL +  + S S +L       P     +PL+K
Sbjct: 271 GAGLIGLGGGAISLSSQ---LKASSFSYCLVNLDSDS-SSTLEFNSY-MPSDSLTSPLVK 325

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           N R  S  YV ++ I VG + + I P   + + +   G I+DSGT+ +RL +  Y ++R+
Sbjct: 326 NDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLRE 385

Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGS 386
            F +   S      +  FDTCY+      +  PTI  + S G ++ LP  N LI      
Sbjct: 386 AFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAG 445

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             CLA         S L++I + QQQ  R+ YD+ NS +G +   C
Sbjct: 446 TYCLAFI----KTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 110/371 (29%), Positives = 163/371 (43%), Gaps = 38/371 (10%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
           S  Y +   +GTP +   + +DT +D  W+ C  C+ C   S   ++   S++F+N+ C 
Sbjct: 194 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCH 253

Query: 151 AAQCKQV--PNP----TCGGGACAFNLTYGS----------STIAANLSQDTISLATDIV 194
             +C+ V  P+P         +C +   YG            T   NL+    +     V
Sbjct: 254 DPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHV 313

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLR 254
               FGC     G      GLLGLG+G LS  +Q Q+LY  +FSYCL    + +   S  
Sbjct: 314 ENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKL 373

Query: 255 LGPIGQPKRIKYTPLL--------KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
           +   G+ K +   P L        K+    + YYV + ++ V   V+ IP      +   
Sbjct: 374 I--FGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSEG 431

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTI 362
             GTIIDSGT  T    PAY  +++ F R++     V  L     CY+V  +     P  
Sbjct: 432 AGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPDF 491

Query: 363 TLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
            ++F+   V   P +N  I      + CLA+   P    S L++I N QQQN  ILYD+ 
Sbjct: 492 GILFADEAVWNFPVENYFIW-IDPEVVCLAILGNP---RSALSIIGNYQQQNFHILYDMK 547

Query: 422 NSRLGVARELC 432
            SRLG A   C
Sbjct: 548 KSRLGYAPMKC 558


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 124/437 (28%), Positives = 192/437 (43%), Gaps = 63/437 (14%)

Query: 42  PCSPFKPSKPLSWEESVLEMLAKDQARLQF----LSSLAVARKSVVPI------ASGRQI 91
           PCS             + +ML  DQ R  +    LS+        +P+       S R  
Sbjct: 75  PCSSTSSRASEDMGIDIDDMLMWDQLRTSYIRTQLSTHVGVVGGGMPVIARSTTVSNRDY 134

Query: 92  TQSPTYIVRAKIGT----------------PAQTLLMAMDTSNDAAWVPCTGCV--GC-- 131
           T S T  V    GT                 A +  + +DTS+D  WV C  C    C  
Sbjct: 135 TPSSTASVGTNSGTSKTIEKSDQTATNEHQDAVSQTVVVDTSSDIPWVQCLPCPIPQCHL 194

Query: 132 -SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----GACAFNLTYGS-STIAANLSQ 184
               +++ A+S+TF  + C +  CK++ +    G       C + + YG           
Sbjct: 195 QKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTTDECKYIVNYGDGKATTGTYVT 254

Query: 185 DTISLA-TDIVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
           DT++++ T +V  + FGC     G+ S    G+L LG G  SLL QT + Y + FSYC+P
Sbjct: 255 DTLTMSPTIVVKDFRFGCSHAVRGSFSNQNAGILALGGGRGSLLEQTADAYGNAFSYCIP 314

Query: 243 SFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
              +  F   L LG P+    +  YTPL+KN    + Y V+L AI V  + + +PP A  
Sbjct: 315 KPSSAGF---LSLGGPVEASLKFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAF- 370

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----P 356
                  G ++DSG V T+L    Y A+R  FR  + +   + + +   DTCY       
Sbjct: 371 -----ATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRNLDTCYDFTRFPD 425

Query: 357 IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
           +  P ++L+F+ G  + L   ++++        CLA AA P   +  +  I N+QQQ + 
Sbjct: 426 VKVPKVSLVFAGGATLDLEPASIILDG------CLAFAATPGEES--VGFIGNVQQQTYE 477

Query: 416 ILYDVPNSRLGVARELC 432
           +LYDV   ++G  R  C
Sbjct: 478 VLYDVGGGKVGFRRGAC 494


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 129/454 (28%), Positives = 201/454 (44%), Gaps = 55/454 (12%)

Query: 26  DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------------ 73
           + +D S +L++    S  SP + +   + ++S LE   KD  R+  +             
Sbjct: 62  EQKDRSPSLKLH--MSRRSPAEATAGRTRKDSFLESAQKDGVRIATMHRRVALQAQAQPG 119

Query: 74  --------SLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
                     A++ + V  + SG  +  S  Y+V   +GTP +   M MDT +D  W+ C
Sbjct: 120 RRSASSSPRRALSERLVATVESGVAVG-SGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQC 178

Query: 126 TGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP----TCGGGA---CAFNLTYGS 175
             C+ C      VF+   ST+++N+ C   +C  V  P    TC       C +   YG 
Sbjct: 179 APCLDCFDQRGPVFDPMASTSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGD 238

Query: 176 -STIAANLSQDTISL-----ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
            S    +L+ +  ++     ++  V G   GC  +  G      GLLGLGRG LS  +Q 
Sbjct: 239 QSNTTGDLALEAFTVNLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQL 298

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLA 285
           + +Y   FSYCL    + +    +  G     +  P+ + YT    +   ++ YYV L  
Sbjct: 299 RAVYGHAFSYCLVDHGS-AVGSKIVFGDDNVLLSHPQ-LNYTAFAPSAAENTFYYVQLKG 356

Query: 286 IRVGRRVVDIPPGALQFNPTTGA-GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TV 343
           I VG  ++DIP      +   G+ GTIIDSGT  +    PAY A+R  F  R+      +
Sbjct: 357 ILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLI 416

Query: 344 TSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
                   CY+V     +  P  +L+F+ G     P +N  I      I CLA+   P  
Sbjct: 417 ADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTP-- 474

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             S +++I N QQQN  +LYD+ ++RLG A   C
Sbjct: 475 -RSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRC 507


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 118/371 (31%), Positives = 171/371 (46%), Gaps = 47/371 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   IGTP  T  +  DT +   W    PCT C    +  F  A S+TF  L C ++ 
Sbjct: 90  YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149

Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKATGNSV 210
           C+ + +P  TC    C +   YG    A  L+ +T+ +     PG  FGC  +   GNS 
Sbjct: 150 CQFLTSPYLTCNATGCVYYYPYGMGFTAGYLATETLHVGGASFPGVAFGCSTENGVGNS- 208

Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTP 268
              G++GLGR  LSL++Q        FSYCL S  A +    +  G + +     ++ TP
Sbjct: 209 -SSGIVGLGRSPLSLVSQVG---VGRFSYCLRS-DADAGDSPILFGSLAKVTGGNVQSTP 263

Query: 269 LLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA------GTIIDSGTVFTR 320
           LL+NP    SS YYVNL  I VG    D+P  +  F  T GA      GTI+DSGT  T 
Sbjct: 264 LLENPEMPSSSYYYVNLTGITVG--ATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTY 321

Query: 321 LVAPAYTAVRDVFRRRVG-SNLTVTSLG---GFDTCYS---------VPIVAPTITLMFS 367
           LV   Y  V+  F  ++  +NLT T  G   GFD C+          VP+  PT+ L F+
Sbjct: 322 LVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPV--PTLVLRFA 379

Query: 368 GMNVTLPQDNLLIHSTA------GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           G      +    +   A       ++ CL +  A + ++  +++I N+ Q +  +LYD+ 
Sbjct: 380 GGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLS--ISIIGNVMQMDLHVLYDLD 437

Query: 422 NSRLGVARELC 432
                 A   C
Sbjct: 438 GGMFSFAPADC 448


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 119/441 (26%), Positives = 184/441 (41%), Gaps = 84/441 (19%)

Query: 18  SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
           S   +P     D  ++L+V H   PCS  +P K  S   S  ++LA+D++R+  + S   
Sbjct: 3   SSACSPSPKGHDQRASLEVVHKHGPCSKLRPHKANS--PSHTQILAQDESRVASIQSRLA 60

Query: 78  ----------ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
                     A K+ +P  S   +  S  Y+V   +G+P + L    DT +D  W  C  
Sbjct: 61  KNLAGGSNLKASKATLPSKSASTLG-SGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEP 119

Query: 128 CVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTI 178
           CVG C      +F+ + S ++ N+ C +  C+++ +     P C    C + + YG  + 
Sbjct: 120 CVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSY 179

Query: 179 AANL-SQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST 236
           +    +++ +SL +TD+   + FGC Q   G      GLLGL R  LSL++QT   Y   
Sbjct: 180 SIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKV 239

Query: 237 FSYCLPSFKALSFSGSLRLGP-IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           FSYCLPS  + S +G L  G   G  K +K+TP                           
Sbjct: 240 FSYCLPS--SSSSTGYLSFGSGDGDSKAVKFTP--------------------------- 270

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
                                   RL    Y++V+ VFR  +     V  +   DTCY +
Sbjct: 271 ------------------------RLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDL 306

Query: 356 P----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
                +  P I L FSG          +I+    S  CLA A   D  +  + +I N+QQ
Sbjct: 307 SKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSD--DDEVAIIGNVQQ 364

Query: 412 QNHRILYDVPNSRLGVARELC 432
           +   ++YD    R+G A   C
Sbjct: 365 KTIHVVYDDAEGRVGFAPSGC 385


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 129/422 (30%), Positives = 189/422 (44%), Gaps = 51/422 (12%)

Query: 57  SVLEMLAKDQARLQFLS--------------SLAVARKSVVPIASGRQITQSPTYIVRAK 102
           S L++  KD  R++ +               +L+ + + V  + SG  +  S  Y++   
Sbjct: 93  SFLDLAEKDAVRVEAMHRRVASSSSSPRRGRALSESERVVATVESGVAVG-SAEYLMDVY 151

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
           +GTP +   M MDT +D  W+ C  C+ C      VF+ A S++++NL C   +C  V  
Sbjct: 152 VGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPRCGHVAP 211

Query: 160 PTC---------GGGACAFNLTYG---SSTIAANLSQDTISL----ATDIVPGYTFGCIQ 203
           P           G   C +   YG   +ST    L   T++L    A+  V G  FGC  
Sbjct: 212 PEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVDGVVFGCGH 271

Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS-TFSYCLPSFKA-----LSFSGSLRLGP 257
           +  G      GLLGLGRG LS  +Q + +Y   TFSYCL    +     + F     L  
Sbjct: 272 RNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLVDHGSDVASKVVFGEDDALAL 331

Query: 258 IGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
              P R+KYT     +    + YYV L  + VG  +++I       +     GTIIDSGT
Sbjct: 332 AAHP-RLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWDASEGGSGGTIIDSGT 390

Query: 317 VFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMN 370
             +  V PAY  +R  F  R+ GS   V        CY+V  V     P ++L+F+ G  
Sbjct: 391 TLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVERPEVPELSLLFADGAV 450

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
              P +N  I      I CLA+   P    + +++I N QQQN  + YD+ N+RLG A  
Sbjct: 451 WDFPAENYFIRLDPDGIMCLAVLGTP---RTGMSIIGNFQQQNFHVAYDLHNNRLGFAPR 507

Query: 431 LC 432
            C
Sbjct: 508 RC 509


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 127/440 (28%), Positives = 195/440 (44%), Gaps = 64/440 (14%)

Query: 48  PSKPLSWEESVLEMLAKDQAR---LQFLSSLAVARKS-------------VVPIASGRQI 91
           P  P + E  +  +LA D+AR   LQ  +  A  +                VP+ SG + 
Sbjct: 93  PDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAEVPLTSGIRF 152

Query: 92  TQSPTYIVRAKIGTPAQ------TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQST 142
            Q+  Y+    +G           L + +DT +D  WV   PC+ C      +F+ + S 
Sbjct: 153 -QTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSA 211

Query: 143 TFKNLGCQAAQCKQVPNPTCG-GGACA---------------FNLTYGSSTIAAN-LSQD 185
           ++  + C A+ C+       G  G+CA               ++L YG  + +   L+ D
Sbjct: 212 SYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATD 271

Query: 186 TISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK 245
           T++L    V G+ FGC     G      GL+GLGR  LSL++QT   +   FSYCLP+  
Sbjct: 272 TVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAAT 331

Query: 246 ALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
           +   +GSL LG      R    + YT ++ +P +   Y++N+    VG   V        
Sbjct: 332 SGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAV-------A 384

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS--LGGFDTCYSV---- 355
                 A  ++DSGTV TRL    Y AVR  F R+ G+     +      D CY++    
Sbjct: 385 AAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHD 444

Query: 356 PIVAPTITLMFS-GMNVTLPQDNLLIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQN 413
            +  P +TL    G ++T+    +L  +   GS  CLAMA+   +      +I N QQ+N
Sbjct: 445 EVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASL--SFEDQTPIIGNYQQKN 502

Query: 414 HRILYDVPNSRLGVARELCT 433
            R++YD   SRLG A E C+
Sbjct: 503 KRVVYDTVGSRLGFADEDCS 522


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 118/386 (30%), Positives = 168/386 (43%), Gaps = 57/386 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTV---------FNSAQSTTF 144
           Y V    GTP QTL   MDT +D  W PCT    C  CS +          F   +S++ 
Sbjct: 67  YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSS 126

Query: 145 KNLGCQAAQCKQVPNPT------CGGGAC------AFNLTYGSSTIAANLSQDTISLATD 192
           K LGC+  +C  + +        C   +C       + + YGS T       +T+ L + 
Sbjct: 127 KLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALSETLHLHSL 186

Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-----AL 247
             P +  GC   +  +S  P G+ G GRG  SL +Q   L    FSYCL S +       
Sbjct: 187 SKPNFLVGC---SVFSSHQPAGIAGFGRGLSSLPSQ---LGLGKFSYCLLSHRFDDDTKK 240

Query: 248 SFSGSLRLGPIGQPKR---IKYTPLLKNPR---RSSL---YYVNLLAIRVGRRVVDIPPG 298
           S S  L +  +   K+   + YTP +KNP+   +SS    YY+ L  I VG   V +P  
Sbjct: 241 SSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVKVPYK 300

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG---GFDTCYSV 355
            L        G IIDSGT FT +   A+  + D F R++     V  +    G   C++V
Sbjct: 301 YLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRPCFNV 360

Query: 356 P----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLNVI 406
                +  P + L F  G +V LP +N       G + CL +     A P+ V     ++
Sbjct: 361 SDAKTVSFPELRLYFKGGADVALPVENYFAF-VGGEVACLTVVTDGVAGPERVGGPGMIL 419

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N Q QN  + YD+ N RLG  +E C
Sbjct: 420 GNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 106/368 (28%), Positives = 169/368 (45%), Gaps = 22/368 (5%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
           P+ SG  +  S  Y V   +GTP Q   + +D+ +D  WV C+ C  C    S ++  + 
Sbjct: 52  PVVSGSTLG-SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSN 110

Query: 141 STTFKNLGCQAAQCKQVPNPTCG-------GGACAFNLTYGSSTIAANL-SQDTISLATD 192
           S+TF  + C ++ C  +P  T G        GACA+   Y  ++ +  + + ++ ++   
Sbjct: 111 SSTFSPVPCLSSDCLLIP-ATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDGV 169

Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-KALSFSG 251
            +    FGC     G+     G+LGLG+G LS  +Q    Y + F+YCL ++    S S 
Sbjct: 170 RIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSS 229

Query: 252 SLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
           SL  G   I     ++YTP++ NP+  +LYYV +  + VG + + I   A + +     G
Sbjct: 230 SLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGG 289

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLM 365
           +I DSGT  T     AY+ +   F   V       S+ G D C  +  V     P+ T+ 
Sbjct: 290 SIFDSGTTLTYWFPSAYSHILAAFDSGV-HYPRAESVQGLDLCVELTGVDQPSFPSFTIE 348

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           F    V  P+        A ++ CLAMA     +    N I N+ QQN  + YD   + +
Sbjct: 349 FDDGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGG-FNTIGNLLQQNFFVQYDREENLI 407

Query: 426 GVARELCT 433
           G A   C+
Sbjct: 408 GFAPAKCS 415


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 127/441 (28%), Positives = 195/441 (44%), Gaps = 65/441 (14%)

Query: 48  PSKPLSWEESVLEMLAKDQAR---LQFLSSLAVARKS--------------VVPIASGRQ 90
           P  P + E  +  +LA D+AR   LQ  +  A  +                 VP+ SG +
Sbjct: 93  PDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAAGAEVPLTSGIR 152

Query: 91  ITQSPTYIVRAKIGTPAQ------TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQS 141
             Q+  Y+    +G           L + +DT +D  WV   PC+ C      +F+ + S
Sbjct: 153 F-QTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGS 211

Query: 142 TTFKNLGCQAAQCKQVPNPTCG-GGACA---------------FNLTYGSSTIAAN-LSQ 184
            ++  + C A+ C+       G  G+CA               ++L YG  + +   L+ 
Sbjct: 212 ASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLAT 271

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           DT++L    V G+ FGC     G      GL+GLGR  LSL++QT   +   FSYCLP+ 
Sbjct: 272 DTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAA 331

Query: 245 KALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
            +   +GSL LG      R    + YT ++ +P +   Y++N+    VG   V       
Sbjct: 332 TSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAV------- 384

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS--LGGFDTCYSV--- 355
                  A  ++DSGTV TRL    Y AVR  F R+ G+     +      D CY++   
Sbjct: 385 AAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGH 444

Query: 356 -PIVAPTITLMFS-GMNVTLPQDNLLIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQ 412
             +  P +TL    G ++T+    +L  +   GS  CLAMA+   +      +I N QQ+
Sbjct: 445 DEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASL--SFEDQTPIIGNYQQK 502

Query: 413 NHRILYDVPNSRLGVARELCT 433
           N R++YD   SRLG A E C+
Sbjct: 503 NKRVVYDTVGSRLGFADEDCS 523


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 103/349 (29%), Positives = 165/349 (47%), Gaps = 32/349 (9%)

Query: 102 KIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCK 155
           ++G P Q     +DT +D  W+   PC G  GC   +   F+   S+++  + C + QC+
Sbjct: 2   RVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQ 61

Query: 156 QVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQ 213
            +    C   +C + + YG  +     L+ +T++   ++ +P  + GC     G  V   
Sbjct: 62  LLDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGAD 121

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSFKALSFSGSLRLGPIGQPKRIKYTP 268
           GL+GLG G++S+ +Q   L  S+FSYCL     PSF  L F+          P     +P
Sbjct: 122 GLIGLGGGAISISSQ---LKASSFSYCLVDIDSPSFSTLDFN-------TDPPSDSLISP 171

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L+KN R  S  YV ++ + VG + + I     + + +   G I+DSGT  T+L +  Y  
Sbjct: 172 LVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEV 231

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMN-VTLPQDNLLIHST 383
           +R+ F     +      +  FDTCY +     +  PTI  +  G N + LP  N LI   
Sbjct: 232 LREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVD 291

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +    CLA  +A       L++I N QQQ  R+ YD+ NS +G +   C
Sbjct: 292 SAGTFCLAFVSA----TFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
 gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
          Length = 495

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 143/459 (31%), Positives = 205/459 (44%), Gaps = 84/459 (18%)

Query: 24  ICDTQDHSSTLQVFHVFSPCSPF------KPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
           I     + + L + H  SPCSP       K  KP     S+ E+L +D  RLQ+LS +  
Sbjct: 44  ITSGHTNGNKLPLVHRLSPCSPVTGGGAQKKGKP-----SLQEILHRDGLRLQYLSQVQA 98

Query: 78  ARKSV------------------VPIASGRQITQSP---TYIVRAKIGTPAQTLLMAMDT 116
           A  +                   VP A+   I+  P    Y V A  GTPAQ L +  D 
Sbjct: 99  ATAAAAPAAAPAPSATTPASGLSVP-ATQNIISSLPGVFEYTVLAGYGTPAQQLPLFFDV 157

Query: 117 S--NDAAWVPC-TGCVGCSST-----VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA 168
           S  ++    PC +G  G  +T      F+ + S++F+++ C +  C    +    GG+C 
Sbjct: 158 SGMSNMRCKPCFSGSSGGETTTTCDVAFDPSMSSSFRSVLCGSPDCGG--HSCSAGGSCT 215

Query: 169 FNL-----TYGSSTIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGL------L 216
           F L      +G+ TI      DT++L+ +     +  GC+Q    N +   G+      L
Sbjct: 216 FTLQNSTFVFGNGTIV----MDTLTLSPSATFENFAVGCMQ--LDNDLFTDGVAVGNIDL 269

Query: 217 GLGRGSLSL-LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP----IGQPKRIKYTPLLK 271
            L R SL+  +  +     + FSYCLP+       G L + P          +KY PL+ 
Sbjct: 270 SLSRHSLATRVLNSSPPGMAAFSYCLPA--DTDTHGFLTIAPALSDYSDHAGVKYVPLVT 327

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           NP   + YYV+L+AI +    + IPP        TG GT+IDS + FT L  P Y A+RD
Sbjct: 328 NPTGPNFYYVDLVAIAINGEDLPIPPALF-----TGNGTMIDSQSAFTYLNPPIYAALRD 382

Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLI----HS 382
            FR+ +     V + GG DTCY+      I  P ITL FS G  + L     +     H 
Sbjct: 383 EFRKAMLQYQPVPAFGGLDTCYNFTLAENIYLPDITLRFSNGETMDLDDRQFMYFFREHL 442

Query: 383 TAG-SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           T G    CLA AAAPD  N   N + +  Q+   I+YDV
Sbjct: 443 TDGFPFGCLAFAAAPDQ-NFPWNYLGSQVQRTKEIVYDV 480


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 112/400 (28%), Positives = 173/400 (43%), Gaps = 68/400 (17%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGC---------SSTVFNSAQSTT 143
           Y++   IGTP Q + + MDT +D  WVPC      C+ C         SS++F+   S++
Sbjct: 11  YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSS 70

Query: 144 FKNLGCQAAQCKQV-----PNPTCGGGAC---------------AFNLTYGSSTIAAN-L 182
                C ++ C ++     P   C    C               +F  TYG   + +  L
Sbjct: 71  SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGIL 130

Query: 183 SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
           ++D +   T  VP ++FGC+   T     P G+ G GRG LSL +Q   L +  FS+C  
Sbjct: 131 TRDILKARTRDVPRFSFGCV---TSTYHEPIGIAGFGRGLLSLPSQLGFL-EKGFSHCFL 186

Query: 243 SFKAL---SFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV--V 293
            FK +   + S  L LG     I     +++TP+L  P   + YY+ L +I +G  +   
Sbjct: 187 PFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLESITIGTNITPT 246

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDT 351
            +P    QF+     G ++DSGT +T L  P Y+ +  + +  +         S  GFD 
Sbjct: 247 QVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRATETESRTGFDL 306

Query: 352 CYSVP--------------IVAPTITLMF-SGMNVTLPQDNLLIHSTAGS----ITCLAM 392
           CY VP              +V P+IT  F +   + LPQ N     +A S    + CL  
Sbjct: 307 CYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLF 366

Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               D       V  + QQQN +++YD+   R+G     C
Sbjct: 367 QNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 406


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 121/395 (30%), Positives = 171/395 (43%), Gaps = 45/395 (11%)

Query: 71  FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
           F   L    +S V + SG        Y +   IG+P +   + +DT +D  W+ C  C  
Sbjct: 177 FSGQLMATLESGVSLGSGE-------YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFD 229

Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGSS----- 176
           C   +   ++   S +F+N+ C   +C+ V +P           +C +   YG S     
Sbjct: 230 CFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTG 289

Query: 177 -----TIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
                T   NL+  T   +    V    FGC     G      GLLGLGRG LS  +Q Q
Sbjct: 290 DFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 349

Query: 231 NLYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL---KNPRRSSLYYVN 282
           +LY  +FSYCL    +  S S  L  G     +  P+ + +T L+   +NP   + YY+ 
Sbjct: 350 SLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPE-LNFTSLIAGKENPV-DTFYYLQ 407

Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           + +I VG   + IP      +     GTIIDSGT  +    PAY  +++ F R+V     
Sbjct: 408 IKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKL 467

Query: 343 VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
           V        CY+V     +  P   + F+ G     P +N  I      I CLAM   P 
Sbjct: 468 VEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTP- 526

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              S L++I N QQQN  ILYD  NSRLG A   C
Sbjct: 527 --KSALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 121/395 (30%), Positives = 171/395 (43%), Gaps = 45/395 (11%)

Query: 71  FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
           F   L    +S V + SG        Y +   IG+P +   + +DT +D  W+ C  C  
Sbjct: 177 FSGQLMATLESGVSLGSGE-------YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFD 229

Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGSS----- 176
           C   +   ++   S +F+N+ C   +C+ V +P           +C +   YG S     
Sbjct: 230 CFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTG 289

Query: 177 -----TIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
                T   NL+  T   +    V    FGC     G      GLLGLGRG LS  +Q Q
Sbjct: 290 DFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 349

Query: 231 NLYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL---KNPRRSSLYYVN 282
           +LY  +FSYCL    +  S S  L  G     +  P+ + +T L+   +NP   + YY+ 
Sbjct: 350 SLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPE-LNFTSLIAGKENPV-DTFYYLQ 407

Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           + +I VG   + IP      +     GTIIDSGT  +    PAY  +++ F R+V     
Sbjct: 408 IKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKL 467

Query: 343 VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
           V        CY+V     +  P   + F+ G     P +N  I      I CLAM   P 
Sbjct: 468 VEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTP- 526

Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              S L++I N QQQN  ILYD  NSRLG A   C
Sbjct: 527 --KSALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559


>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
          Length = 435

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 111/352 (31%), Positives = 169/352 (48%), Gaps = 32/352 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS--STVFNSAQSTTFKNLGCQAAQC 154
           Y V A  G PAQ   +A DT+   + + C  CVG +     F  ++S++F  + C + +C
Sbjct: 88  YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCDPAFEPSRSSSFAAIPCGSPEC 147

Query: 155 KQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSV-- 210
                  C G +C F + +G+ T+A   L +DT++L  +    G+TFGCI+         
Sbjct: 148 AV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTFD 203

Query: 211 PPQGLLGLGRGSLSL----LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKR 263
              GL+ L R S SL    ++       + FSYCLPS  A S  G L +G   P      
Sbjct: 204 GAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGD 263

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           IKY P+  NP   + Y+V L+ I VG   + +PP     +     GT++++ T FT L  
Sbjct: 264 IKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVFAAH-----GTLLEAATEFTFLAP 318

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNL 378
            AY A+RD FRR +            DTCY++     +  PT+ L F+ G  + L    +
Sbjct: 319 AAYAALRDAFRRDMAPYPAAPPFRVLDTCYNLTGLASLAVPTVALRFAGGTELELDVRQM 378

Query: 379 LI----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           +      S   S+ CLA AAAP     V +VI  + Q++  ++YD+   R+G
Sbjct: 379 MYFADPSSVFSSVACLAFAAAPLPAFPV-SVIGTLAQRSTEVVYDLRGGRVG 429


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 114/423 (26%), Positives = 183/423 (43%), Gaps = 40/423 (9%)

Query: 47  KPSKPLSWEESVLEMLAKDQARLQ--FLSSLAVARKS-----VVPIASGRQITQSPTYIV 99
           K +K +SW++ V  +  + Q  L    ++SL  ++       +  + SG  +  +  Y +
Sbjct: 114 KDTKSMSWKQEVKVITIQQQNNLANAVVASLKSSKDEFSGNIMATLESGASLG-TGEYFI 172

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQ 156
              +GTP + + + +DT +D +W+ C  C  C   +   +N  +S++++N+ C   +C+ 
Sbjct: 173 DMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQL 232

Query: 157 VPNP------TCGGGACAFNLTYGS----------STIAANLSQDTISLATDIVPGYTFG 200
           V +P            C +   Y             T   NL+          V    FG
Sbjct: 233 VSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMFG 292

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIG 259
           C     G      GLLGLGRG LS  +Q Q++Y  +FSYCL   F   S S  L  G   
Sbjct: 293 CGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDK 352

Query: 260 Q---PKRIKYTPLLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           +      + +T LL        + YY+ + +I VG  V+DIP     ++     GTIIDS
Sbjct: 353 ELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDS 412

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GM 369
           G+  T     AY  +++ F +++              CY+V     +  P   + F+ G 
Sbjct: 413 GSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYGIHFADGA 472

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
               P +N         + CLA+   P+  +S L +I N+ QQN  ILYDV  SRLG + 
Sbjct: 473 VWNFPAENYFYQYEPDEVICLAILKTPN--HSHLTIIGNLLQQNFHILYDVKRSRLGYSP 530

Query: 430 ELC 432
             C
Sbjct: 531 RRC 533


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 121/371 (32%), Positives = 180/371 (48%), Gaps = 38/371 (10%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQC- 154
           ++ KIGTP + +L+ +DT+++  WV  T C  CS T    FN   S++F +  C ++ C 
Sbjct: 1   MQTKIGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCL 60

Query: 155 ---KQVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISL-----ATDIVPGYTFGCIQ 203
              K      C    G+C+F + Y   + A   ++++  SL     A   +    FGC  
Sbjct: 61  GRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCAS 120

Query: 204 KATGNSVP-PQGLLGLGRGSLSLLAQTQNLYQS----TFSYCLPS-FKALSFSGSLRLGP 257
           K     V    G LGL RGS S  AQ  +  +S     FSYC P+  + L+ SG +  G 
Sbjct: 121 KDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGD 180

Query: 258 IGQP-KRIKYTPLLKNPRRSSL---YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
            G P    +Y  L + P  +S+   YYV L  I VG  ++ IP  A + +     GT  D
Sbjct: 181 SGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFD 240

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF--DTCYSVPI------VAPTITLM 365
           SGT  + LV PA+TA+ + F RRV  +L  TS   F  + CY V         AP +TL 
Sbjct: 241 SGTTVSFLVEPAHTALVEAFGRRV-LHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLH 299

Query: 366 F-SGMNVTLPQDNLLI--HSTAGSIT-CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           F + +++ L + ++ +    T   +T CLA   A       +NVI N QQQ++ I +D+ 
Sbjct: 300 FKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLE 359

Query: 422 NSRLGVARELC 432
            SR+G A   C
Sbjct: 360 RSRIGFAPANC 370


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 119/435 (27%), Positives = 186/435 (42%), Gaps = 77/435 (17%)

Query: 63  AKDQARLQF-LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           ++ Q R++  LSS+ V  + +  +  G        Y++   IGTP Q + + +DT +D  
Sbjct: 56  SQTQERIKKPLSSVDVVMEPLREVRDG--------YLITLNIGTPPQAVQVYLDTGSDLT 107

Query: 122 WVPCTG----CVGC---------SSTVFNSAQSTTFKNLGCQAAQCKQV-----PNPTCG 163
           WVPC      C+ C         S +VF+   S+T     C ++ C ++     P   C 
Sbjct: 108 WVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCA 167

Query: 164 GGAC---------------AFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG 207
              C               +F  TYG    I+  L++D +   T  VP ++FGC+   T 
Sbjct: 168 VAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCV---TS 224

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL---SFSGSLRLGP----IGQ 260
               P G+ G GRG LSL +Q   L +  FS+C   FK +   + S  L LG     I  
Sbjct: 225 TYREPIGIAGFGRGLLSLPSQLGFL-EKGFSHCFLPFKFVNNPNISSPLILGASALSINL 283

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV--VDIPPGALQFNPTTGAGTIIDSGTVF 318
              +++TP+L  P   + YY+ L +I +G  +    +P    QF+     G ++DSGT +
Sbjct: 284 TDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTY 343

Query: 319 TRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVP--------------IVAPTI 362
           T L  P Y+ +    +  +         S  GFD CY VP              ++ P+I
Sbjct: 344 THLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSI 403

Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGS----ITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
           T  F +   + LPQ N     +A S    + CL      D       V  + QQQN +++
Sbjct: 404 TFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVV 463

Query: 418 YDVPNSRLGVARELC 432
           YD+   R+G     C
Sbjct: 464 YDLEKERIGFQAMDC 478


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 104/349 (29%), Positives = 165/349 (47%), Gaps = 22/349 (6%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTV---FNSAQSTTFKNLGCQ 150
           Y  R  +G P Q+     DT +D +W+   PC G  GC   +   F+   S+++  L C 
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISL-ATDIVPGYTFGCIQKATGN 208
           + QC  +    C   +C + + YG  +     L+ +T S   ++ +P    GC     G 
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGL 303

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
            V   GL+GLG G++SL +Q   L  ++FSYCL    + S S +L      QP     +P
Sbjct: 304 FVGADGLIGLGGGAISLSSQ---LEATSFSYCLVDLDSES-SSTLDFN-ADQPSDSLTSP 358

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L+KN R  +  YV ++ + VG + + I   + + + +   G I+DSGT  T + +  Y  
Sbjct: 359 LVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDV 418

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMN-VTLPQDNLLIHST 383
           +RD F     +      +  FDTCY +     +  PTI  +  G N + LP  N LI   
Sbjct: 419 LRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVD 478

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +    CLA   +       L++I N+QQQ  R+ YD+ NS +G + + C
Sbjct: 479 SAGTFCLAFLPS----TFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 123/399 (30%), Positives = 181/399 (45%), Gaps = 47/399 (11%)

Query: 64  KDQARLQFLSSLAVAR-KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
           +D A  + LS   VA  +S VP+ SG        Y+V   +GTP +   M MDT +D  W
Sbjct: 122 RDSAPRRALSERVVATVESGVPVGSGE-------YLVDVYLGTPPRRFRMIMDTGSDLNW 174

Query: 123 VPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG---------GACAFN 170
           + C  C+ C   S  +F+ A S +++N+ C   +C+ V  P               C + 
Sbjct: 175 LQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYY 234

Query: 171 LTYGS-STIAANLSQDTISL-----ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
             YG  S    +L+ +  ++      T  V G  FGC  +  G      GLLGLGRG LS
Sbjct: 235 YWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLS 294

Query: 225 LLAQTQNLY-QSTFSYCLPSFKALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLY 279
             +Q + +Y    FSYCL    + + S  +  G     +  P+ + YT         + Y
Sbjct: 295 FASQLRGVYGGHAFSYCLVEHGSAAGS-KIIFGHDDALLAHPQ-LNYTAFAPTTDADTFY 352

Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG- 338
           Y+ L +I VG   V+I    L     +  GTIIDSGT  +    PAY A+R  F  R+  
Sbjct: 353 YLQLKSILVGGEAVNISSDTL-----SAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSP 407

Query: 339 SNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMA 393
           S   +        CY+V     +  P ++L+F+ G     P +N  I      I CLA+ 
Sbjct: 408 SYPLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVL 467

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             P    S +++I N QQQN  +LYD+ ++RLG A   C
Sbjct: 468 GTP---RSGMSIIGNYQQQNFHVLYDLEHNRLGFAPRRC 503


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 118/394 (29%), Positives = 172/394 (43%), Gaps = 62/394 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST------VFNSAQSTTFKNL 147
           Y   A +GTP Q L + +DT +   WVPCT    C  CSS       VF+   S++ + +
Sbjct: 99  YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 158

Query: 148 GCQ-------------AAQCKQVPN-------PTCGGGACA-FNLTYGSSTIAANLSQDT 186
           GC+             A +C++ P        P      C  + + YGS + A  L  DT
Sbjct: 159 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 218

Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK- 245
           +      VPG+  GC   +     PP GL G GRG+ S+ AQ   L    FSYCL S + 
Sbjct: 219 LRAPGRAVPGFVLGCSLVSVHQ--PPSGLAGFGRGAPSVPAQ---LGLPKFSYCLLSRRF 273

Query: 246 --ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPG 298
               + SGSL LG  G  + ++Y PL+K+     L     YY+ L  + VG + V +P  
Sbjct: 274 DDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPAR 333

Query: 299 ALQFNPTTGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS 354
           A   N     GTI+DSGT FT L      P   AV      R   +       G   C++
Sbjct: 334 AFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLGLHPCFA 393

Query: 355 VP-----IVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN---- 404
           +P     +  P ++  F G  V  LP +N  + +  G++  + +A   D           
Sbjct: 394 LPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFGGGSGAGNEG 453

Query: 405 -----VIANMQQQNHRILYDVPNSRLGVARELCT 433
                ++ + QQQN+ + YD+   RLG  R+ CT
Sbjct: 454 SGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCT 487


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 114/439 (25%), Positives = 188/439 (42%), Gaps = 74/439 (16%)

Query: 52  LSWEESVLEMLAKDQARLQFLSS--LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           L+  E +   + + + RL  ++   L  + ++ V +A    ++    Y+V+  +GTP   
Sbjct: 41  LTDHELLRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHC 100

Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG--- 163
              A+DT++D  W  C  CV C      VFN   ST++  + C +  C ++    C    
Sbjct: 101 FTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARDG 160

Query: 164 ----GGACAFNLTY-GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ--GLL 216
                 AC +  +Y G++T    L+ D +++  D+  G  FGC   + G   PPQ  G++
Sbjct: 161 DSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVFRGVVFGCSSSSVGGP-PPQVSGVV 219

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-----IGQPKRIKYTPLLK 271
           GLGRG+LSL++Q   L    F YCLP   + S +G L LG      +         P+  
Sbjct: 220 GLGRGALSLVSQ---LSVRRFMYCLPPPVSRS-AGRLVLGADAAATVRNASERVVVPMST 275

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDI---------PPGALQFNPTT---------------- 306
             R  S YY+NL  I +G R +            PG     P +                
Sbjct: 276 GSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPD 335

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV------GSNLTVTSLGGFDTCYSVP---- 356
             G IID  +  T L    Y  + D     +      GS+L      G D C+ +P    
Sbjct: 336 AYGMIIDIASTITFLEESLYEEMVDDLEEEIRLPRGSGSDL------GLDLCFILPEGVP 389

Query: 357 ---IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
              + AP ++L F G+ + L ++ + +   A  + CL M    D V    +++ N QQQN
Sbjct: 390 MSRVYAPPVSLAFEGVWLRLDKEQMFVEDRASGMMCL-MVGKTDGV----SILGNYQQQN 444

Query: 414 HRILYDVPNSRLGVARELC 432
            +++Y++   R+   +  C
Sbjct: 445 MQVMYNLRRGRITFIKTAC 463


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 124/424 (29%), Positives = 191/424 (45%), Gaps = 53/424 (12%)

Query: 55  EESVLEMLAKDQARLQFLSSLA----VAR-------------KSVVPIASGRQITQSPTY 97
           +ES L+   KD  R++ +   A    VAR             + V  + SG  +  S  Y
Sbjct: 91  KESFLDKAEKDAVRIETMHRRAARSGVARMPASSSPRRALSERMVATVESGVAVG-SGEY 149

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC 154
           ++   +GTP +   M MDT +D  W+ C  C+ C      VF+ A S++++N+ C   +C
Sbjct: 150 LIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQRC 209

Query: 155 KQVPNPTC-------GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVPGYTFG 200
             V  P            +C +   YG  S    +L+ ++ ++      A+  V G  FG
Sbjct: 210 GLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFG 269

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--- 257
           C  +  G      GLLGLGRG LS  +Q + +Y  TFSYCL    + + S  +  G    
Sbjct: 270 CGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVEHGSDAGS-KVVFGEDYL 328

Query: 258 -IGQPKRIKYTPLLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
            +  P+ +KYT          + YYV L  + VG  +++I             GTIIDSG
Sbjct: 329 VLAHPQ-LKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSG 387

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLT--VTSLGGFDTCYSVPIVA----PTITLMFS-G 368
           T  +  V PAY  +R  F   + S L   +      + CY+V  V     P ++L+F+ G
Sbjct: 388 TTLSYFVEPAYQVIRQAFVDLM-SRLYPLIPDFPVLNPCYNVSGVERPEVPELSLLFADG 446

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
                P +N  +      I CLA+   P    + +++I N QQQN  ++YD+ N+RLG A
Sbjct: 447 AVWDFPAENYFVRLDPDGIMCLAVRGTP---RTGMSIIGNFQQQNFHVVYDLQNNRLGFA 503

Query: 429 RELC 432
              C
Sbjct: 504 PRRC 507


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 171/369 (46%), Gaps = 43/369 (11%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK--- 155
           V   +G+P QT+ M +DT ++ +W+ C       S VF+  +S+++  + C +  C+   
Sbjct: 58  VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS-VFDPLRSSSYSPIPCTSPTCRTRT 116

Query: 156 ---QVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA----TG 207
               +P        C   ++Y  +S+I  NL+ DT  +    +P   FGC+       + 
Sbjct: 117 RDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSD 176

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIK 265
                 GL+G+ RGSLS + Q   +    FSYC+    +   SG L  G       K +K
Sbjct: 177 EDSKTTGLIGMNRGSLSFVTQ---MGLQKFSYCISGQDS---SGILLFGESSFSWLKALK 230

Query: 266 YTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           YTPL++     P    + Y V L  I+V   ++ +P      + T    T++DSGT FT 
Sbjct: 231 YTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTF 290

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSG 368
           L+ P YTA+++ F R+  ++L V         G  D CY VP+        PT+TLMF G
Sbjct: 291 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRG 350

Query: 369 MNVTLPQDNLL-----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
             +++  + L+     +   + S+ C     + + +     +I +  QQN  + +D+  S
Sbjct: 351 AEMSVSAERLMYRVPGVIRGSDSVYCFTFGNS-ELLGVESYIIGHHHQQNVWMEFDLAKS 409

Query: 424 RLGVARELC 432
           R+G A   C
Sbjct: 410 RVGFAEVRC 418


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 171/369 (46%), Gaps = 43/369 (11%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK--- 155
           V   +G+P QT+ M +DT ++ +W+ C       S VF+  +S+++  + C +  C+   
Sbjct: 65  VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS-VFDPLRSSSYSPIPCTSPTCRTRT 123

Query: 156 ---QVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA----TG 207
               +P        C   ++Y  +S+I  NL+ DT  +    +P   FGC+       + 
Sbjct: 124 RDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSD 183

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIK 265
                 GL+G+ RGSLS + Q   +    FSYC+    +   SG L  G       K +K
Sbjct: 184 EDSKTTGLIGMNRGSLSFVTQ---MGLQKFSYCISGQDS---SGILLFGESSFSWLKALK 237

Query: 266 YTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           YTPL++     P    + Y V L  I+V   ++ +P      + T    T++DSGT FT 
Sbjct: 238 YTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTF 297

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSG 368
           L+ P YTA+++ F R+  ++L V         G  D CY VP+        PT+TLMF G
Sbjct: 298 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRG 357

Query: 369 MNVTLPQDNLL-----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
             +++  + L+     +   + S+ C     + + +     +I +  QQN  + +D+  S
Sbjct: 358 AEMSVSAERLMYRVPGVIRGSDSVYCFTFGNS-ELLGVESYIIGHHHQQNVWMEFDLAKS 416

Query: 424 RLGVARELC 432
           R+G A   C
Sbjct: 417 RVGFAEVRC 425


>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
          Length = 435

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 110/358 (30%), Positives = 170/358 (47%), Gaps = 32/358 (8%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS--STVFNSAQSTTFKNLGCQAAQC 154
           Y V A  G PAQ   +A DT+   + + C  CVG +     F  ++S++F  + C + +C
Sbjct: 88  YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCDPAFEPSRSSSFAAIPCGSPEC 147

Query: 155 KQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSV-- 210
                  C G +C F + +G+ T+A   L +DT++L  +    G+TFGCI+         
Sbjct: 148 AV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTFD 203

Query: 211 PPQGLLGLGRGSLSL----LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKR 263
              GL+ L R S SL    ++       + FSYCLPS  A S  G L +G   P      
Sbjct: 204 GAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGD 263

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           IKY P+  NP   + Y+V+L+ I VG   + +PP     +     GT++++ T FT L  
Sbjct: 264 IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAH-----GTLLEAATEFTFLAP 318

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNL 378
            AY A+RD FR+ +            DTCY++     +  P + L F+ G  + L    +
Sbjct: 319 AAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQM 378

Query: 379 LI----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +      S   S+ CLA AAAP     V +VI  + Q++  ++YD+   R+G     C
Sbjct: 379 MYFADPSSVFSSVACLAFAAAPLPAFPV-SVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 109/397 (27%), Positives = 169/397 (42%), Gaps = 65/397 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSSTVFNSAQSTTFKNLG---- 148
           Y++   IGTP Q + + MDT +D  WVPC      C+ C     N   +T   +      
Sbjct: 82  YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSY 141

Query: 149 --------CQAAQCKQVPNPTCGGGAC---------------AFNLTYGSSTIAAN-LSQ 184
                   C        P  TC    C               +F  TYG+  +    L++
Sbjct: 142 RASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTR 201

Query: 185 DTISL------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
           DT+ +          +P + FGC+  A      P G+ G GRG+LS+++Q   L Q  FS
Sbjct: 202 DTLRVNGSSPGVAKEIPKFCFGCVGSAYRE---PIGIAGFGRGTLSMVSQLGFL-QKGFS 257

Query: 239 YCLPSFKAL---SFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVNLLAIRVGR-RV 292
           +C  +FK     + S  L +G I    +  +++TP+L +P   + YYV L AI VG    
Sbjct: 258 HCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGNVSA 317

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFD 350
            ++P    +F+     G  IDSGT +T L  P Y+ V  + +  +    +  +    GFD
Sbjct: 318 TEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQTGFD 377

Query: 351 TCYSVP----------IVAPTITLMF-SGMNVTLPQDNLLIHSTA----GSITCLAMAAA 395
            CY VP           + P+IT  F + +++ LPQ N     +A      + CL   + 
Sbjct: 378 LCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMFQST 437

Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            D  +    V  + QQQN  ++YD+   R+G     C
Sbjct: 438 DDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 114/395 (28%), Positives = 180/395 (45%), Gaps = 47/395 (11%)

Query: 52  LSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLL 111
           + + ++ L   A  ++RLQ LS             S R  +    Y++   IGTP    +
Sbjct: 29  IGFTKTELMRRAAHRSRLQALSGYDAN--------SPRLHSVQVEYLMELAIGTPPVPFV 80

Query: 112 MAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC------KQVPNPTC 162
              DT +D  W  C  C  C    + V++ + S+TF  + C +A C      +   NP+ 
Sbjct: 81  ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPTWRSRNCSNPS- 139

Query: 163 GGGACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYT-------FGCIQKATGNSVPPQG 214
               C +  +Y     +   L  +T+++ +  VPG T       FGC     G+S+   G
Sbjct: 140 --SPCRYIYSYSDGAYSVGILGTETLTIGSS-VPGQTVSVGSVAFGCGTDNGGDSLNSTG 196

Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ----PKRIKYTPLL 270
            +GLGRG+LSLLAQ   L    FSYCL  F   +      LG + +    P  ++ TPLL
Sbjct: 197 TVGLGRGTLSLLAQ---LGVGKFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLL 253

Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
           ++P   S Y+VNL  I +G   + IP G          G ++DSGT FT L    +  V 
Sbjct: 254 QSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVV 313

Query: 331 DVFRRRVGS-NLTVTSLGGFDTCYSVPI---VAPTITLMFS-GMNVTLPQDNLLIHSTAG 385
           D   + +G   +  +SL     C+  P      P + L F+ G ++ L +DN + ++   
Sbjct: 314 DRVAQLLGQPPVNASSLD--SPCFPSPDGEPFMPDLVLHFAGGADMRLHRDNYMSYNEDD 371

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           S  CL +  +P    S  + + N QQQN ++L+D+
Sbjct: 372 SSFCLNIVGSP----STWSRLGNFQQQNIQMLFDM 402


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 169/375 (45%), Gaps = 44/375 (11%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
           S  Y +  ++G+P +     +DT +D  W+   PC+ C   S  +++ + S+TF    C 
Sbjct: 1   SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCS 60

Query: 151 AAQCKQVPNPTCGGGA--CAFNLTYG-SSTIAANLSQDTISL-----ATDIVPGYTFGCI 202
            + C+ +P   C   A  C +   YG SS+   + + +T++L     ++   P + FGC 
Sbjct: 61  TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG 120

Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-------LSFSGSLRL 255
           +  +G+     G++GLG+G +SL  Q  +   + FSYCL  F         L F  S   
Sbjct: 121 RLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSAST 180

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF------------- 302
           G          TP++ N  RS+ Y+V L  I VG + + +   A+ F             
Sbjct: 181 G-----SGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRA 235

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IV 358
                 GTI DSGT  T L    Y+ V+  F   V       S  GFD CY V       
Sbjct: 236 LEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFK 295

Query: 359 APTITLMFSGMNVTLPQDN-LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
            P +TL F G   + PQ N  +I  TA ++ CLAM  +    +  L +I N+ QQN+ ++
Sbjct: 296 FPALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGS---GSLGLGIIGNLMQQNYHVV 352

Query: 418 YDVPNSRLGVARELC 432
           YD   S + ++   C
Sbjct: 353 YDRGTSTISMSPAQC 367


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 103/349 (29%), Positives = 164/349 (46%), Gaps = 22/349 (6%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTV---FNSAQSTTFKNLGCQ 150
           Y  R  +G P Q+     DT +D +W+   PC G  GC   +   F+   S+++  L C 
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISL-ATDIVPGYTFGCIQKATGN 208
           + QC  +    C   +C + + YG  +     L+ +T S   ++ +P    GC     G 
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGL 303

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
            V   GL+GLG G++SL +Q   L  ++FSYCL    + S S +L      QP     +P
Sbjct: 304 FVGAAGLIGLGGGAISLSSQ---LEATSFSYCLVDLDSES-SSTLDFN-ADQPSDSLTSP 358

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L+KN R  +  YV ++ + VG + + I   + + + +   G I+DSGT  T + +  Y  
Sbjct: 359 LVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDV 418

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMN-VTLPQDNLLIHST 383
           +RD F     +      +  FDTCY +     +  PTI  +  G N + LP  N L    
Sbjct: 419 LRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVD 478

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +    CLA   +       L++I N+QQQ  R+ YD+ NS +G + + C
Sbjct: 479 SAGTFCLAFLPS----TFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 108/382 (28%), Positives = 165/382 (43%), Gaps = 41/382 (10%)

Query: 80  KSVVPIASGRQITQSPTYIVRAKIGTPA-QTLLMAMDTSNDAAWVPCTGCVGCSST---V 135
           +   P+ASG  +     Y++   IGTP  Q + + +DT +D  W  C  C  C +     
Sbjct: 75  RVTAPVASGSHVVGYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPR 134

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD-- 192
           F+++ S T   + C    C+ +    C  G C + + YG +++    L++D+ +      
Sbjct: 135 FDTSASDTVHGVLCTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGG 194

Query: 193 ---IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCL------- 241
               VP   FGC Q  TGN    + G+ G GRG LSL  Q   L  S+FSYC        
Sbjct: 195 GKVTVPDLVFGCGQYNTGNFHSNETGIAGFGRGPLSLPRQ---LGVSSFSYCFTTIFESK 251

Query: 242 --PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
             P F   + +  LR    G    I  TP L  P     YY++L  I VG+  + +P  A
Sbjct: 252 STPVFLGGAPADGLRAHATGP---ILSTPFL--PNHPEYYYLSLKGITVGKTRLAVPESA 306

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPI 357
                    GTIIDSGT  T      + ++ + F  +V    T  +  G  T  C+S   
Sbjct: 307 FVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTES 366

Query: 358 V-------APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
           V        P +TL   G +  LP++N +         C+ + A  D+      +I N Q
Sbjct: 367 VPDASKVPVPKMTLHLEGADWELPRENYMAEYPDSDQLCVVVLAGDDD----RTMIGNFQ 422

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQN  I++D+  ++L +    C
Sbjct: 423 QQNMHIVHDLAGNKLVIEPAQC 444


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 109/382 (28%), Positives = 167/382 (43%), Gaps = 53/382 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST----VFNSAQSTTFKNLGCQAA 152
           Y+V   +GTP + + + +DT +D  W  C  C+ C       V + A S+T   + C A 
Sbjct: 94  YLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRCDAP 153

Query: 153 QCKQVPNPTCGGG-------ACAFNLTYGSSTI-AANLSQDTISLA-TDIVPG------- 196
            C+ +P  +CG G       +C +   YG  +I    L+ D  +    D   G       
Sbjct: 154 VCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVSERR 213

Query: 197 YTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLR 254
            TFGC     G       G+ G GRG  SL +Q   L  ++FSYC  S F++ S   +L 
Sbjct: 214 LTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQ---LGVTSFSYCFTSMFESTSSLVTLG 270

Query: 255 LGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           + P  +    +++ TPLL++P + SLY+++L AI VG   + IP    +      A  II
Sbjct: 271 VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLRE---ASAII 327

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA------------- 359
           DSG   T L    Y AV+  F  +VG  ++       D C+++P  A             
Sbjct: 328 DSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRWRGR 387

Query: 360 --------PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
                   P +      G +  LP++N +       + CL + AA    +  + VI N Q
Sbjct: 388 GRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTV-VIGNYQ 446

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQN  ++YD+ N  L  A   C
Sbjct: 447 QQNTHVVYDLENDVLSFAPARC 468


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 125/434 (28%), Positives = 189/434 (43%), Gaps = 89/434 (20%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-----LAVARKSVVPI 85
           S  L +   + PCS    S+P S +E    +  +D++R+ F++S          K   P 
Sbjct: 63  SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKFNQYAPENLKDHTP- 117

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFK 145
            + +   +   ++V    GTP Q   + +DT +   W  C  C     TV N+       
Sbjct: 118 -NNKLFDEDGNFLVDVAFGTPPQNFTLILDTGSSITWTQCKAC-----TVENN------- 164

Query: 146 NLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCIQ 203
                                  +N+TYG  ST   N   DT++L  +D+   + FG  +
Sbjct: 165 -----------------------YNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGRGR 201

Query: 204 KATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQ 260
              G+      G+LGLG+G LS ++QT + +   FSYCLP   ++   GSL  G     Q
Sbjct: 202 NNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSI---GSLLFGEKATSQ 258

Query: 261 PKRIKYTPLLKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
              +K+T L+  P   + S  Y+VNL  I VG   ++IP            GTIIDS TV
Sbjct: 259 SSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTV 313

Query: 318 FTRLVAPAYTAVRDVF------------RRRVGSNLTVTSLGGFDTCYSV----PIVAPT 361
            TRL   AY+A++  F            RR+ G  L        DTCY++     ++ P 
Sbjct: 314 ITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL--------DTCYNLSGRKDVLLPE 365

Query: 362 ITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAA-APDNVNSVLNVIANMQQQNHRILYD 419
           I L F  G +V L   N++  S    + CLA A  +   +N  L +I N QQ +  +LYD
Sbjct: 366 IVLHFGGGADVRLNGTNIVWGSDESRL-CLAFAGNSKSTMNPELTIIGNRQQLSLTVLYD 424

Query: 420 VPNSRLGVARELCT 433
           +   R+G     C+
Sbjct: 425 IQGGRIGFRSNGCS 438


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 176/366 (48%), Gaps = 41/366 (11%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPC--TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
           I+   IGTP Q   M +DT +  +W+ C          T F+ + S++F  L C    CK
Sbjct: 73  IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132

Query: 156 -QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATG 207
            ++P+   PT C     C ++  Y   T A  NL ++ I+ + T+I P    GC  +++ 
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESSD 192

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP---SFKALSFSGSLRLGPIGQPKRI 264
           +    +G+LG+ RG LS ++Q +    S FSYC+P   +    + +GS  LG        
Sbjct: 193 D----RGILGMNRGRLSFVSQAK---ISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245

Query: 265 KYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAG--TIIDSG 315
           KY  LL  P    +       Y V ++ IR G + ++I      F P  G    T++DSG
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNI--SGSVFRPDAGGSGQTMVDSG 303

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGFDTCYS-----VPIVAPTITLMFS- 367
           + FT LV  AY  VR     RVG  L    +  G  D C+      +P +   +  +F+ 
Sbjct: 304 SEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTR 363

Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           G+ + +P++ +L++   G I C+ +  +   + +  N+I N+ QQN  + +DV N R+G 
Sbjct: 364 GVEILVPKERVLVN-VGGGIHCVGIGRS-SMLGAASNIIGNVHQQNLWVEFDVTNRRVGF 421

Query: 428 ARELCT 433
           A+  C+
Sbjct: 422 AKADCS 427


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 181/375 (48%), Gaps = 41/375 (10%)

Query: 90  QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTF 144
            I  S   I+   IGTP+Q+  + +DT +  +W+ C        +   +T F+ + S++F
Sbjct: 73  NIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSF 132

Query: 145 KNLGCQAAQCK-QVPNPT----CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPG 196
            +L C    CK ++P+ T    C     C ++  Y   T A  NL ++  + + +   P 
Sbjct: 133 SDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPP 192

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSL 253
              GC +++T      +G+LG+  G LS ++Q +    S FSYC+P+      L+ +GS 
Sbjct: 193 LILGCAKESTDE----KGILGMNLGRLSFISQAK---ISKFSYCIPTRSNRPGLASTGSF 245

Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTT 306
            LG     +  KY  LL  P+   +       Y V L  IR+G++ ++IP    + +   
Sbjct: 246 YLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGG 305

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG------FDTCYSVPIVAP 360
              T++DSG+ FT LV  AY  V++   R VGS L    + G      FD  +S+ I   
Sbjct: 306 SGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRL 365

Query: 361 TITLMFS---GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
              L+F    G+ + + + +LL++   G I C+ +  +   + +  N+I N+ QQN  + 
Sbjct: 366 IGDLVFEFGRGVEILVEKQSLLVN-VGGGIHCVGIGRS-SMLGAASNIIGNVHQQNLWVE 423

Query: 418 YDVPNSRLGVARELC 432
           +DV N R+G ++  C
Sbjct: 424 FDVTNRRVGFSKAEC 438


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 140/432 (32%), Positives = 198/432 (45%), Gaps = 72/432 (16%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
           S+ L++ H   PC+P + S   +   SV + L  DQ R +++             S A A
Sbjct: 65  SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST- 134
             + VP + G  I  +  Y+V A +GTP     M +DT +D +WV   PC+    C S  
Sbjct: 123 AAATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQK 181

Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----------GACAFNLTYG-SSTIAA 180
             +F+ AQS+++  + C          P C G             C + ++YG  S    
Sbjct: 182 DPLFDPAQSSSYAAVPCG--------GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTG 233

Query: 181 NLSQDTISL-ATDIVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
             S DT++L A+  V G+ FGC    +G  N V   GLLGLGR   SL+ QT   Y   F
Sbjct: 234 VYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVF 291

Query: 238 SYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
           SYCLP+    S +G L L   GP G       T LL +P   + Y V L  I VG + + 
Sbjct: 292 SYCLPTKP--STAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 349

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTC 352
           +P  A          T++D+GTV TRL   AY A+R  FR  + S    T  S G  DTC
Sbjct: 350 VPASAFAGG------TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTC 403

Query: 353 YSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIA 407
           Y+      +  P + L F SG  VTL  D +L      S  CLA   AP   +  + ++ 
Sbjct: 404 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILG 455

Query: 408 NMQQQNHRILYD 419
           N+QQ++  +  D
Sbjct: 456 NVQQRSFEVRID 467


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 176/366 (48%), Gaps = 41/366 (11%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPC--TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
           I+   IGTP Q   M +DT +  +W+ C          T F+ + S++F  L C    CK
Sbjct: 73  IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132

Query: 156 -QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATG 207
            ++P+   PT C     C ++  Y   T A  NL ++ I+ + T+I P    GC  +++ 
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESSD 192

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP---SFKALSFSGSLRLGPIGQPKRI 264
           +    +G+LG+ RG LS ++Q +    S FSYC+P   +    + +GS  LG        
Sbjct: 193 D----RGILGMNRGRLSFVSQAK---ISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245

Query: 265 KYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAG--TIIDSG 315
           KY  LL  P    +       Y V ++ IR G + ++I      F P  G    T++DSG
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNI--SGSVFRPDAGGSGQTMVDSG 303

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGFDTCYS-----VPIVAPTITLMFS- 367
           + FT LV  AY  VR     RVG  L    +  G  D C+      +P +   +  +F+ 
Sbjct: 304 SEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTR 363

Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           G+ + +P++ +L++   G I C+ +  +   + +  N+I N+ QQN  + +DV N R+G 
Sbjct: 364 GVEIFVPKERVLVN-VGGGIHCVGIGRS-SMLGAASNIIGNVHQQNLWVEFDVTNRRVGF 421

Query: 428 ARELCT 433
           A+  C+
Sbjct: 422 AKADCS 427


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 136/454 (29%), Positives = 196/454 (43%), Gaps = 70/454 (15%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEE---SVLEMLAKDQARLQFLSS-LAVARKSVVPI 85
           +S+   + H++ PCSP   S   +  +   S+ +M+  DQ R  ++   L  A     P+
Sbjct: 61  NSTWAPLHHLYGPCSPAPSSANSTAADVAASMADMVDDDQRRADYIQKRLTGATDDKQPM 120

Query: 86  A-SGR--QITQSPTYIVRAKI--------------------GTPAQTLLMAMDTSNDAAW 122
           A S R  Q  ++  Y     +                    GT A T  + +D+ +D +W
Sbjct: 121 AFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTTATTNSAPDGTSAVTQTVIIDSGSDVSW 180

Query: 123 VPCTGC--VGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPT---CGGGA-CAFNLTY 173
           V C  C    C      +F+ A STT+  + C +A C Q+  P    C   A C F + Y
Sbjct: 181 VQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPYRRGCSANAQCQFGINY 239

Query: 174 GS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQT 229
           G  ST     S D ++L   D++ G+ FGC     G++      G L LG GS SL+ QT
Sbjct: 240 GDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQT 299

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY------TPLLKNPRRSSLYYVNL 283
              Y   FSYCLP     S  G L LG    P+R +       TPLL +    + Y V L
Sbjct: 300 ATRYGRVFSYCLP--PTASSLGFLVLG--VPPERAQLIPSFVSTPLLSSSMAPTFYRVLL 355

Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
            AI V  R + +PP          A ++IDS T+ +RL   AY A+R  FR  +      
Sbjct: 356 RAIIVAGRPLAVPPAVFS------ASSVIDSSTIISRLPPTAYQALRAAFRSAMTMYRAA 409

Query: 344 TSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
             +   DTCY       I  P+I L+F  G  V L    +L+ S      CLA   AP  
Sbjct: 410 PPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLGS------CLAF--APTA 461

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            + +   I N+QQ+   ++YDVP   +      C
Sbjct: 462 SDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 523

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 169/352 (48%), Gaps = 32/352 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS--STVFNSAQSTTFKNLGCQAAQC 154
           Y V A  G PAQ   +A DT+   + + C  CVG +     F  ++S++F  + C + +C
Sbjct: 176 YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCDPAFEPSRSSSFAAIPCGSPEC 235

Query: 155 KQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSV-- 210
                  C G +C F + +G+ T+A   L +DT++L  +    G+TFGCI+         
Sbjct: 236 AV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTFD 291

Query: 211 PPQGLLGLGRGSLSL----LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKR 263
              GL+ L R S SL    ++       + FSYCLPS  A S  G L +G   P      
Sbjct: 292 GAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGD 351

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           IKY P+  NP   + Y+V+L+ I VG   + +PP     +     GT++++ T FT L  
Sbjct: 352 IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAH-----GTLLEAATEFTFLAP 406

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNL 378
            AY A+RD FR+ +            DTCY++     +  P + L F+ G  + L    +
Sbjct: 407 AAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQM 466

Query: 379 LI----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           +      S   S+ CLA AAAP     V +VI  + Q++  ++YD+   R+G
Sbjct: 467 MYFADPSSVFSSVACLAFAAAPLPAFPV-SVIGTLAQRSTEVVYDLRGGRVG 517


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 109/362 (30%), Positives = 163/362 (45%), Gaps = 42/362 (11%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPC-------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
           IGTP Q   + +DT +D  W  C             S  V++  +S+TF  L C    C+
Sbjct: 97  IGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRLCQ 156

Query: 156 --QVPNPTC-GGGACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSV 210
             Q     C     C +   YGS+     L+ +T +      +     FGC   + G+ +
Sbjct: 157 EGQFSFKNCTSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLI 216

Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-----KALSFSGSLRLGPIGQPKRIK 265
              G+LGL   SLSL+ Q   L    FSYCL  F       L F     L      + I+
Sbjct: 217 GATGILGLSPESLSLITQ---LKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQ 273

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            T ++ NP ++  YYV L+ I +G + + +P  +L   P  G GTI+DSG+    LV  A
Sbjct: 274 TTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAA 333

Query: 326 YTAVR----DVFRRRVGSNLTVTSLGGFDTCYSVP----------IVAPTITLMFS-GMN 370
           + AV+    DV R  V +N TV     ++ C+ +P          +  P + L F  G  
Sbjct: 334 FEAVKEAVMDVVRLPV-ANRTVED---YELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAA 389

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           + LP+DN      AG + CLA+    D   S +++I N+QQQN  +L+DV + +   A  
Sbjct: 390 MVLPRDNYFQEPRAG-LMCLAVGKTTD--GSGVSIIGNVQQQNMHVLFDVQHHKFSFAPT 446

Query: 431 LC 432
            C
Sbjct: 447 QC 448


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 115/419 (27%), Positives = 170/419 (40%), Gaps = 61/419 (14%)

Query: 69  LQFLSSLAVARKSVVPIASGRQITQSP-------TYIVRAKIGTPAQTLLMAMDTSNDAA 121
           L FL+S +  R   +       + +SP        Y      GTP QTL +  DT +   
Sbjct: 46  LTFLASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLV 105

Query: 122 WVPCTGCVGCSSTVFNSAQ-----------STTFKNLGCQAAQCKQVPNP---------- 160
           W PCT    CS   F               S++ K +GCQ  +C  +  P          
Sbjct: 106 WFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCN 165

Query: 161 ----TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLL 216
                C     A+ + YGS + A  L  +T+      +P +  GC   +      P G+ 
Sbjct: 166 PKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKXIPNFVVGCSFLSIHQ---PSGIA 222

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIG-QPKRIKYTPLLKNP 273
           G GRGS SL +Q   +    F+YCL S  F     SG L L   G +   + YTP  +NP
Sbjct: 223 GFGRGSESLPSQ---MGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNP 279

Query: 274 RRSS-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
             S+      YY+N+  I VG + V +P   L   P    G+IIDSG+ FT +  P    
Sbjct: 280 SVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEV 339

Query: 329 VRDVFRRRVGSNLT----VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLL 379
           V   F +++ +N T    V +L G   C+ +     +  P +   F  G    LP +N  
Sbjct: 340 VAREFEKQL-ANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYF 398

Query: 380 IHSTAGSITCLA-----MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              ++  + CL      M            ++   QQQN  + YD+ N RLG  ++ C+
Sbjct: 399 ALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 103/368 (27%), Positives = 179/368 (48%), Gaps = 41/368 (11%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFKNLGCQAA 152
           I+   IGTP+Q+  + +DT +  +W+ C        +   +T F+ + S++F +L C   
Sbjct: 82  ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141

Query: 153 QCK-QVPNPT----CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQK 204
            CK ++P+ T    C     C ++  Y   T A  NL ++  + + +   P    GC ++
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKE 201

Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQP 261
           +T      +G+LG+  G LS ++Q +    S FSYC+P+      L+ +GS  LG     
Sbjct: 202 STD----VKGILGMNLGRLSFISQAK---ISKFSYCIPTRSNRPGLASTGSFYLGENPNS 254

Query: 262 KRIKYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           +  KY  LL  P+   +       Y V LL IR+G++ ++IP    + +      T++DS
Sbjct: 255 RGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDS 314

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG------FDTCYSVPIVAPTITLMFS- 367
           G+ FT LV  AY  V++   R VGS L    + G      FD  + + I      L+F  
Sbjct: 315 GSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEF 374

Query: 368 --GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
             G+ + + +  LL++   G I C+ +  +   + +  N+I N+ QQN  + +DV N R+
Sbjct: 375 GRGVEILVEKQRLLVN-VGGGIHCVGIGRS-SMLGAASNIIGNVHQQNLWVEFDVANRRV 432

Query: 426 GVARELCT 433
           G ++  C+
Sbjct: 433 GFSKAECS 440


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 169/369 (45%), Gaps = 43/369 (11%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC---- 154
           V   +GTP Q + M +DT ++ +W+ C        T F+  +S+++  + C +  C    
Sbjct: 87  VSLTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTTFDPNRSSSYSPVPCSSLTCTDRT 145

Query: 155 KQVPNP-TCGGGA-CAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA----TG 207
           +  P P +C     C   L+Y  +S+   NL+ DT  +    +PG  FGC+  +    T 
Sbjct: 146 RDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSDMPGTIFGCMDSSFSTNTE 205

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIK 265
                 GL+G+ RGSLS ++Q        FSYC+       FSG L LG         + 
Sbjct: 206 EDSKNTGLMGMNRGSLSFVSQMDF---PKFSYCI---SDSDFSGVLLLGDANFSWLMPLN 259

Query: 266 YTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           YTPL++     P    + Y V L  I+V  +++ +P      + T    T++DSGT FT 
Sbjct: 260 YTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTF 319

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSG 368
           L+ P Y+A+R+ F  +    L V         GG D CY VP+        PT++LMF G
Sbjct: 320 LLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFRG 379

Query: 369 MNVTLPQDNLLIH-----STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
             + +  D LL         + S+ C     + D +     VI +  QQN  + +D+  S
Sbjct: 380 AEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNS-DLLAVEAYVIGHHHQQNVWMEFDLEKS 438

Query: 424 RLGVARELC 432
           R+G A+  C
Sbjct: 439 RIGFAQVQC 447


>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
          Length = 492

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 171/371 (46%), Gaps = 31/371 (8%)

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFN 137
           +++PI  G     +  Y V    GTP Q   M +DT    + V C  C   S++    F+
Sbjct: 134 TIIPI-DGSPDAGALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFD 192

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDI-VPG 196
           ++QSTTF ++ C +  C    N    G  C FNL +    +    SQD +++A  + V  
Sbjct: 193 TSQSTTFTHVPCDSPDCPSTAN-CSAGSVCPFNLFF----VEGTFSQDVLTVAPSVAVQD 247

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
           +TF C+     + +P  G L L R   SL ++      + FSYC+P +      G L LG
Sbjct: 248 FTFVCLDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYP--DSPGFLSLG 305

Query: 257 PIGQPKR---IKYTPLL--KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
                +      + PLL   +P  +++Y+++++ + +G   + IP G    N    A TI
Sbjct: 306 DDATVRGDNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPIPSGTFGNN----ASTI 361

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGGFDTCYSV----PIVAPTITLMF 366
           +++GT FT L   AYT +RD FR+ +   N +V     FDTCY+      +  P +   F
Sbjct: 362 VEAGTTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNFTGLQELTVPLVEFKF 421

Query: 367 -SGMNVTLPQDNLLIHSTAG----SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
            +G ++ +  D +L +        ++TCLA +    + + V  VI         ++YDV 
Sbjct: 422 GNGDSLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVA 481

Query: 422 NSRLGVARELC 432
              +G   E C
Sbjct: 482 GGTVGFIPESC 492


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 115/419 (27%), Positives = 170/419 (40%), Gaps = 61/419 (14%)

Query: 69  LQFLSSLAVARKSVVPIASGRQITQSP-------TYIVRAKIGTPAQTLLMAMDTSNDAA 121
           L FL+S +  R   +       + +SP        Y      GTP QTL +  DT +   
Sbjct: 46  LTFLASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLV 105

Query: 122 WVPCTGCVGCSSTVFNSAQ-----------STTFKNLGCQAAQCKQVPNP---------- 160
           W PCT    CS   F               S++ K +GCQ  +C  +  P          
Sbjct: 106 WFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCN 165

Query: 161 ----TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLL 216
                C     A+ + YGS + A  L  +T+      +P +  GC   +      P G+ 
Sbjct: 166 PKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQ---PSGIA 222

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIG-QPKRIKYTPLLKNP 273
           G GRGS SL +Q   +    F+YCL S  F     SG L L   G +   + YTP  +NP
Sbjct: 223 GFGRGSESLPSQ---MGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNP 279

Query: 274 RRSS-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
             S+      YY+N+  I VG + V +P   L   P    G+IIDSG+ FT +  P    
Sbjct: 280 SVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEV 339

Query: 329 VRDVFRRRVGSNLT----VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLL 379
           V   F +++ +N T    V +L G   C+ +     +  P +   F  G    LP +N  
Sbjct: 340 VAREFEKQL-ANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYF 398

Query: 380 IHSTAGSITCLA-----MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              ++  + CL      M            ++   QQQN  + YD+ N RLG  ++ C+
Sbjct: 399 ALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 114/345 (33%), Positives = 166/345 (48%), Gaps = 42/345 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   IGTP Q L    DT +D  W  C  C  C    S  +   +S++F  L C  + 
Sbjct: 82  YDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSL 141

Query: 154 CKQVPNPTC--GGGACAFNLTYGSSTIAANLSQ-----DTISLATDIVPGYTFGCIQKAT 206
           C  +P+  C  GG  C +  +YG ++   + +Q     +T +L +D VPG  FGC   + 
Sbjct: 142 CSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDAVPGIGFGCTTMSE 201

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS---FSGSLRLGPIGQPKR 263
           G      GL+GLGRG LSL++Q   L    FSYCL S  A +     GS  L   G    
Sbjct: 202 GGYGSGSGLVGLGRGPLSLVSQ---LNVGAFSYCLTSDAAKTSPLLFGSGALTGAG---- 254

Query: 264 IKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
           ++ TPLL   R S+ YY VNL +I +         GA     T  +G I DSGT    L 
Sbjct: 255 VQSTPLL---RTSTYYYTVNLESISI---------GAATTAGTGSSGIIFDSGTTVAFLA 302

Query: 323 APAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSVP-IVAPTITLMFSGMNVTLPQDNLLI 380
            PAYT  ++    +  +NLT+ S   G++ C+     V P++ L F G ++ LP +N   
Sbjct: 303 EPAYTLAKEAVLSQT-TNLTMASGRDGYEVCFQTSGAVFPSMVLHFDGGDMDLPTENYF- 360

Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
            +   S++C  +  +P      L+++ N+ Q N+ I YDV  S L
Sbjct: 361 GAVDDSVSCWIVQKSPS-----LSIVGNIMQMNYHIRYDVEKSML 400


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 158/323 (48%), Gaps = 41/323 (12%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQIT--QSPTYIVRAKIGTPAQTLLMAMDTSND 119
           +A+ +AR+  L S AV    V PI + R +    S  Y+V   IGTP       MDT +D
Sbjct: 52  IARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSD 111

Query: 120 AAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-S 175
             W  C  C+ C+   +  F+  +S T++ L C++++C  + +P+C    C +   YG +
Sbjct: 112 LIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQYYYGDT 171

Query: 176 STIAANLSQDTISL---------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
           ++ A  L+ +T +          AT+I     FGC     G+     G++G GRG LSL+
Sbjct: 172 ASTAGVLANETFTFGAANSTKVRATNIA----FGCGSLNAGDLANSSGMVGFGRGPLSLV 227

Query: 227 AQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPI----GQPKRIKYTPLLKNPRRSS 277
           +Q   L  S FSYCL S+ +     L F     L       G P  ++ TP + NP   +
Sbjct: 228 SQ---LGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSP--VQSTPFVINPALPN 282

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
           +Y+++L AI +G +++ I P     N     G IIDSGT  T L   AY AV    RR +
Sbjct: 283 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV----RRGL 338

Query: 338 GSNLTVTSLG----GFDTCYSVP 356
            S + +T++     G DTC+  P
Sbjct: 339 VSAIPLTAMNDTDIGLDTCFQWP 361


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 114/403 (28%), Positives = 185/403 (45%), Gaps = 51/403 (12%)

Query: 66  QARL-QFLSSLAVARKSVVPIAS-GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
            ARL + L +L+ A   V P++  G  +T          IGTP Q   + +DT +D  W 
Sbjct: 59  NARLARVLGNLSAADVPVAPLSDQGHSLT--------VGIGTPPQPRTLIVDTGSDLIWT 110

Query: 124 PCTGCVGCSST----------VFNSAQSTTFKNLGCQAAQCK--QVPNPTCG-GGACAFN 170
            C+     + T          ++   +S++F  L C    C+  Q     C     C ++
Sbjct: 111 QCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCSDRLCQEGQFSYKNCARNNRCMYD 170

Query: 171 LTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
             YGS+     L+ +T +   +  +     FGC   + G+ V   GL+GL  G +SL++Q
Sbjct: 171 ELYGSAEAGGVLASETFTFGVNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQ 230

Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR------IKYTPLLKNP-RRSSLYYV 281
              L    FSYCL  F     S  L  G +   +R      ++ T +L+NP   ++ YYV
Sbjct: 231 ---LSVPRFSYCLTPFAERKTS-PLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYV 286

Query: 282 NLLAIRVGRRVVDIPPGAL-QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-- 338
            L+ + +G + +D+P  +L    P    GTI+DSG+  + L   A+ AV+      V   
Sbjct: 287 PLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLP 346

Query: 339 -SNLTVTSLGGFDTCYSVP-------IVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITC 389
            +N T      ++ C+++P       +  P + L F G   +TLP+DN      AG + C
Sbjct: 347 VANGTDEDYDDYELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAG-LMC 405

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LA+  +PD     +++I N+QQQN  +L+DV N +   A   C
Sbjct: 406 LAVGTSPDGFG--VSIIGNVQQQNMHVLFDVRNQKFSFAPTKC 446


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 107/352 (30%), Positives = 167/352 (47%), Gaps = 30/352 (8%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTP    +   DT +D  W  C  C  C    + ++++A S++F  + C +A 
Sbjct: 93  YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASAT 152

Query: 154 CKQV---PNPTCGGGACAFNLTYGSSTIAAN-LSQDTISL--ATDI-VPGYTFGCIQKAT 206
           C  +    N T     C +   YG    +A  L  +T++   A  + V G  FGC     
Sbjct: 153 CLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGGIAFGCGVDNG 212

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ------ 260
           G S    G +GLGRGSLSL+AQ   L    FSYCL  F   S    +  G + +      
Sbjct: 213 GLSYNSTGTVGLGRGSLSLVAQ---LGVGKFSYCLTDFFNTSLGSPVLFGALAELAAPST 269

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
              ++ TPL+++P   + YYV+L  I +G   + IP G          G I+DSGT FT 
Sbjct: 270 GAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFTF 329

Query: 321 LVAPAYTAVRD----VFRRRV--GSNLTVTSLGGFDTCYSVPIVAPTITLMFS-GMNVTL 373
           LV  A+  V D    V R+ V   S+L             +P + P + L F+ G ++ L
Sbjct: 330 LVESAFRVVVDHVAGVLRQPVVNASSLDSPCFPAATGEQQLPAM-PDMVLHFAGGADMRL 388

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
            +DN +  +   S  CL +A +P   ++ ++++ N QQQN ++L+D+   +L
Sbjct: 389 HRDNYMSFNQEESSFCLNIAGSP---SADVSILGNFQQQNIQMLFDITVGQL 437


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 120/398 (30%), Positives = 179/398 (44%), Gaps = 44/398 (11%)

Query: 64  KDQARLQFLSSLAVARKSVVPIASGRQITQ--SPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           + ++RL  L++ AV+     P  S +   +  S  Y +   IGTPA  L    DT +D  
Sbjct: 57  RSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGSDLI 116

Query: 122 WVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA--------CAFN 170
           W  C  C  CS   S  +    S++   + C    C ++P P C   A        C+++
Sbjct: 117 WTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYH 176

Query: 171 LTYGSSTIAANLSQ-----DTISLATDIV--PGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
             YG++    + ++     +T +   D    PG  FGC  ++ G      GL+GLGRG L
Sbjct: 177 YAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKL 236

Query: 224 SLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL- 278
           SL+ Q   L    F Y L S       +SF GSL     G       TPLL NP    L 
Sbjct: 237 SLVTQ---LNVEAFGYRLSSDLSAPSPISF-GSLADVTGGNGDSFMSTPLLTNPVVQDLP 292

Query: 279 -YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII-DSGTVFTRLVAPAYTAVRDVFRRR 336
            YYV L  I VG ++V IP G   F+ +TGAG +I DSGT  T L  PAYT VRD    +
Sbjct: 293 FYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQ 352

Query: 337 VGSNLTVTSLGGFD-TCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIH---STAGSIT 388
           +G      +    D  C+   S     P++ L F  G ++ L  +N L         +  
Sbjct: 353 MGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETAR 412

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP-NSRL 425
           C ++  +    +  L +I N+ Q +  +++D+  N+R+
Sbjct: 413 CWSVVKS----SQALTIIGNIMQMDFHVVFDLSGNARM 446


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 114/383 (29%), Positives = 162/383 (42%), Gaps = 53/383 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGC-------SSTVFNSAQSTTFKN 146
           Y +    GTP QTL + MDT +D  W PCT    C  C       SS +F    S++ K 
Sbjct: 90  YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149

Query: 147 LGCQAAQCKQVP--------------NPTCGGGACAFNLTYGSSTIAANLSQDTISLATD 192
           LGC   +C  +               +P C      + + YGS      +  +T+ L   
Sbjct: 150 LGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGK 209

Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
            VP +  GC   +T     P G+ G GRG  SL +Q   L    FSYCL S +    + S
Sbjct: 210 GVPNFIVGCSVLSTSQ---PAGISGFGRGPPSLPSQ---LGLKKFSYCLLSRRYDDTTES 263

Query: 253 LRLGPIGQPKR------IKYTPLLKNPRR------SSLYYVNLLAIRVGRRVVDIPPGAL 300
             L   G+         + YTP ++NP+       S  YY+ L  I VG + V IP   L
Sbjct: 264 SSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYL 323

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSVPIV 358
                   GTIIDSGT FT +    +  V   F ++V S     V  + G   C+++  +
Sbjct: 324 IPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGL 383

Query: 359 A----PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLNVIANM 409
                P +TL F  G  + LP  N +       + CL +    AA  +       ++ N 
Sbjct: 384 NTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNF 443

Query: 410 QQQNHRILYDVPNSRLGVARELC 432
           QQQN  + YD+ N RLG  ++ C
Sbjct: 444 QQQNFYVEYDLRNERLGFRQQSC 466


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 121/394 (30%), Positives = 174/394 (44%), Gaps = 62/394 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST------VFNSAQSTTFKNL 147
           Y   A +GTP Q L + +DT +   WVPCT    C  CSS       VF+   S++ + +
Sbjct: 67  YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 126

Query: 148 GCQ-------------AAQCKQVPN-------PTCGGGACA-FNLTYGSSTIAANLSQDT 186
           GC+             A +C++ P        P      C  + + YGS + A  L  DT
Sbjct: 127 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 186

Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK- 245
           +      VPG+  GC   +     PP GL G GRG+ S+ AQ   L    FSYCL S + 
Sbjct: 187 LRAPGRAVPGFVLGCSLVSVHQ--PPSGLAGFGRGAPSVPAQ---LGLPKFSYCLLSRRF 241

Query: 246 --ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPG 298
               + SGSL LG  G  + ++Y PL+K+     L     YY+ L  + VG + V +P  
Sbjct: 242 DDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPAR 301

Query: 299 ALQFNPTTGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS 354
           A   N     GTI+DSGT FT L      P   AV      R   +       G   C++
Sbjct: 302 AFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFA 361

Query: 355 VP-----IVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPD--------NVN 400
           +P     +  P ++  F G  V  LP +N  + +  G++  + +A   D        N  
Sbjct: 362 LPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEG 421

Query: 401 SVLNVI-ANMQQQNHRILYDVPNSRLGVARELCT 433
           S   +I  + QQQN+ + YD+   RLG  R+ CT
Sbjct: 422 SGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCT 455


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 120/398 (30%), Positives = 179/398 (44%), Gaps = 44/398 (11%)

Query: 64  KDQARLQFLSSLAVARKSVVPIASGRQITQ--SPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           + ++RL  L++ AV+     P  S +   +  S  Y +   IGTPA  L    DT +D  
Sbjct: 57  RSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGSDLI 116

Query: 122 WVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA--------CAFN 170
           W  C  C  CS   S  +    S++   + C    C ++P P C   A        C+++
Sbjct: 117 WTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYH 176

Query: 171 LTYGSSTIAANLSQ-----DTISLATDIV--PGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
             YG++    + ++     +T +   D    PG  FGC  ++ G      GL+GLGRG L
Sbjct: 177 YAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKL 236

Query: 224 SLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL- 278
           SL+ Q   L    F Y L S       +SF GSL     G       TPLL NP    L 
Sbjct: 237 SLVTQ---LNVEAFGYRLSSDLSAPSPISF-GSLADVTGGNGDSFMSTPLLTNPVVQDLP 292

Query: 279 -YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII-DSGTVFTRLVAPAYTAVRDVFRRR 336
            YYV L  I VG ++V IP G   F+ +TGAG +I DSGT  T L  PAYT VRD    +
Sbjct: 293 FYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQ 352

Query: 337 VGSNLTVTSLGGFD-TCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIH---STAGSIT 388
           +G      +    D  C+   S     P++ L F  G ++ L  +N L         +  
Sbjct: 353 MGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETAR 412

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP-NSRL 425
           C ++  +    +  L +I N+ Q +  +++D+  N+R+
Sbjct: 413 CWSVVKS----SQALTIIGNIMQMDFHVVFDLSGNARM 446


>gi|217070596|gb|ACJ83658.1| unknown [Medicago truncatula]
          Length = 65

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 60/65 (92%), Positives = 63/65 (96%)

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
           MNVTLPQDN+LIHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+LYDVPNSR+GVA
Sbjct: 1   MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVA 60

Query: 429 RELCT 433
           RELCT
Sbjct: 61  RELCT 65


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 174/376 (46%), Gaps = 42/376 (11%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSST----VFNSAQSTTFKNL 147
           S  Y V  ++GTPA+   + +DT +D  W+ C        SS+     ++ + S++++ +
Sbjct: 56  SGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREI 115

Query: 148 GCQAAQCKQVPNPTCGGGACAF------NLTYG---SSTIAANLSQDTISLATDIVPG-- 196
            C   +C+ +P P   G +C+       + TYG    S     L+ +TIS+ +    G  
Sbjct: 116 PCTDDECQFLPAPI--GSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKR 173

Query: 197 -------------YTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNL-YQSTFSYCL 241
                           GC +++ G S +   G+LGLG+G +SL  QT++      FSYCL
Sbjct: 174 AGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCL 233

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD-IPPGAL 300
             +   S + S  +      +++ +TP+++NP   S YYVN+  + V  + VD I     
Sbjct: 234 VDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDW 293

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA- 359
             +     GTI DSGT  + L  PAY+ V       +          GF+ CY+V  +  
Sbjct: 294 GIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNVTRMEK 353

Query: 360 --PTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
             P + + F G  V  LP +N ++   A ++ C+A+        S  N++ N+ QQ+H I
Sbjct: 354 GMPKLGVEFQGGAVMELPWNNYMVL-VAENVQCVALQKVTTTNGS--NILGNLLQQDHHI 410

Query: 417 LYDVPNSRLGVARELC 432
            YD+  +R+G     C
Sbjct: 411 EYDLAKARIGFKWSPC 426


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 109/397 (27%), Positives = 168/397 (42%), Gaps = 65/397 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CV----------------------- 129
           Y++   IGTP Q + + MDT +D  WVPC      C+                       
Sbjct: 12  YLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFSPSHSSSSY 71

Query: 130 --GCSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACAFNLTYGS-STIAANLSQ 184
              C+S       S+      C  A C    +   TC     +F  TYG+   +   L++
Sbjct: 72  RDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTR 131

Query: 185 DTISL------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
           DT+ +       T  +P + FGC+         P G+ G  RG+LS  +Q   L +  FS
Sbjct: 132 DTLRVHEGPARVTKDIPKFCFGCVGSTYHE---PIGIAGFVRGTLSFPSQL-GLLKKGFS 187

Query: 239 YCLPSFKAL---SFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-RV 292
           +C  +FK     + S  L +G   +     +++TP+LK+P   + YY+ L AI VG    
Sbjct: 188 HCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITVGNVSA 247

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFD 350
             +P    +F+     G +IDSGT +T L  P Y+ +  +F+  +       V    GFD
Sbjct: 248 TTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPRATEVEMRAGFD 307

Query: 351 TCYSVPI----------VAPTITLMF-SGMNVTLPQDNLLIHSTAGS----ITCLAMAAA 395
            CY VP           + P+IT  F + ++  LPQ N     +A S    + CL   + 
Sbjct: 308 LCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTVVKCLLFQSM 367

Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            D+      V  + QQQN +I+YD+   R+G     C
Sbjct: 368 ADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDC 404


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 147/296 (49%), Gaps = 27/296 (9%)

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGN 208
           AA         C GG C + + YG  +      + DT++L++ D + G+ FGC ++  G 
Sbjct: 5   AAAWSDXTTRGCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGL 64

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP---KRIK 265
                GLLGLGRG  SL  QT + Y   F++C P+    S +G L  GP   P    ++ 
Sbjct: 65  FGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCFPARS--SGTGYLEFGPGSSPAVSAKLS 122

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            TP+L +    + YYV +  IRVG +++ IP           AGTI+DSGTV TRL   A
Sbjct: 123 TTPMLID-TGPTFYYVGMTGIRVGGKLLPIPQSVF-----AAAGTIVDSGTVITRLPPAA 176

Query: 326 YTAVRDVFRRRVGSN--LTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQD-NL 378
           Y+++R  F   + +       +L   DTCY +     +  PT++L+F G  V+L  D + 
Sbjct: 177 YSSLRSAFAASMAARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQG-GVSLDVDASG 235

Query: 379 LIHSTAGSITCLAMAA--APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +I++ + S  CL  A   A D+V     ++ N Q +   ++YD+ +  +G     C
Sbjct: 236 IIYAASVSQACLGFAGNEAADDV----AIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 131/437 (29%), Positives = 192/437 (43%), Gaps = 52/437 (11%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS---SLAVARKSVVPIASGR 89
           +LQ+ H  +      PS+      +VL + ++D AR+ +L    S + +  S   + SG 
Sbjct: 58  SLQLLHRDTVSGTKHPSR----RHAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGG 113

Query: 90  QITQ--SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTF 144
            I    S  Y+VR  IG+P     +  DT +D  WV   PC+ C      +F+ A S +F
Sbjct: 114 TIVSHGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASF 173

Query: 145 KNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGY 197
             + C +  C+            GGG C + ++YG  +     L+ +T++L     V G 
Sbjct: 174 SPVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTEVQGV 233

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS--LRL 255
             GC  +  G      GLLGLG G +SL+ Q        FSYCL  + +   SGS  L L
Sbjct: 234 AMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSLVL 293

Query: 256 G-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           G     P    + PL++NP   S YYV +  + V    + +  G        G G ++D+
Sbjct: 294 GREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVVMDT 353

Query: 315 GTVFTRLVAPAYTAVRDVFR--------RRVGSNLTVTSLGGFDTCYSV----PIVAPTI 362
           GT  TRL A AY A+R  F         R  G +L       FDTCY +     +  PT+
Sbjct: 354 GTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSL-------FDTCYDLSGYASVRVPTV 406

Query: 363 TLMF-------SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
            L F          ++TLP  NLL+    G   CLA AA    V S  +++ N+QQQ   
Sbjct: 407 ALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAA----VASGPSILGNIQQQGIE 462

Query: 416 ILYDVPNSRLGVARELC 432
           I  D  +  +G     C
Sbjct: 463 ITVDSASGYVGFGPATC 479


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 123/440 (27%), Positives = 187/440 (42%), Gaps = 70/440 (15%)

Query: 40  FSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTY 97
           + PCSP + + P     S++EML  DQAR  ++   A      V  P      + Q   +
Sbjct: 75  YGPCSPSEGTPP-----SLVEMLRWDQARTDYVRRKATGEVDDVLEPDRPHVDMMQM-DF 128

Query: 98  IVRAKIGTPAQT------------------LLMAMDTSNDAAWVPCTGCV--GC---SST 134
           ++R   G  + +                    MA+DT+ D  W+ C  C+   C    + 
Sbjct: 129 MLRGTFGIGSGSGYGAVIDGDDDDDPMILSQTMAIDTTEDVPWIQCLPCLIPQCYPQRNA 188

Query: 135 VFNSAQSTTFKNLGCQAAQCKQV---------PNPTCGGGACAFNLTYGSSTIA-ANLSQ 184
            F+  +S+T   + C +  C+ +         PN T   G C + + Y    +       
Sbjct: 189 FFDPRRSSTGAPVRCGSRACRTLGGYANGCSKPNST---GDCLYRIEYSDHRLTLGTYMT 245

Query: 185 DTISLA-TDIVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
           DT++++ +     + FGC     G  S    G + LG G  SLL+QT   Y + FSYC+P
Sbjct: 246 DTLTISPSTTFLNFRFGCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYGNAFSYCVP 305

Query: 243 SFKA---LSFSGSLRLGPIGQPKRIKYTPLLK--NPRRSSLYYVNLLAIRVGRRVVDIPP 297
              A   LS  G +     G       TPL++  N    ++Y V L  I V  R +++PP
Sbjct: 306 GPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRSANVINPTIYVVRLQGIEVAGRRLNVPP 365

Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-- 355
                      GT++DS  V T+L   AY A+R  FR  + +  T    G  DTC+    
Sbjct: 366 VVFS------GGTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGNLDTCFDFVG 419

Query: 356 --PIVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
              +  PT++L+F G  V  L   ++L+ S      CLA   AP   +  L  I N+QQQ
Sbjct: 420 VSKVTVPTVSLVFDGGAVIELGLLSVLLDS------CLAF--APMAADFALGFIGNVQQQ 471

Query: 413 NHRILYDVPNSRLGVARELC 432
            H +LYDV    +G     C
Sbjct: 472 THEVLYDVAGGAVGFRHGAC 491


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 117/420 (27%), Positives = 183/420 (43%), Gaps = 34/420 (8%)

Query: 33  TLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
           ++ + H  SP SPF KPS  L+  + ++    +   +L   S   +  K  +      +I
Sbjct: 30  SIDLIHRDSPLSPFYKPS--LTPSDRIINTALRSIYQLNRASHSDLNEKKTLERV---RI 84

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLG 148
                Y++R  IGTP    L   DT++D  WV C+ C  C    + +F   +S+TF NL 
Sbjct: 85  PNHGEYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLS 144

Query: 149 CQAAQCKQVPNPTCG--GGACAFNLTYGS-STIAANLSQDTISLATDIV--PGYTFGC-- 201
           C +  C       C   G  C +  TYG  S+    L  ++I   +  V  P   FGC  
Sbjct: 145 CDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQTVTFPKTIFGCGS 204

Query: 202 ---IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-- 256
                    N V   G++GLG G LSL++Q  +     FSYCL  F + S +  L+ G  
Sbjct: 205 NNDFMHQISNKV--TGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTS-TIKLKFGND 261

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
                  +  TPL+ +P   S Y+++L+ I +G++++ +     +    T    IID GT
Sbjct: 262 TTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQV-----RTTDHTNGNIIIDLGT 316

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSL-GGFDTCY--SVPIVAPTITLMFSGMNVTL 373
           V T L    Y     + R  +G + T   +   FD C+     I  P I   F+G  V L
Sbjct: 317 VLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCFPNQANITFPKIVFQFTGAKVFL 376

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              NL       ++ CLA+   PD      +V  N+ Q + ++ YD    ++  A   C+
Sbjct: 377 SPKNLFFRFDDLNMICLAV--LPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADCS 434


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 175/376 (46%), Gaps = 42/376 (11%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSST----VFNSAQSTTFKNL 147
           S  Y V  ++GTPA+   + +DT +D  W+ C        SS+     ++ + S++++ +
Sbjct: 24  SGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREI 83

Query: 148 GCQAAQCKQVPNPTCGGGACAF------NLTYGSSTIAAN---LSQDTISLATDIVPG-- 196
            C   +C  +P P   G +C+       + TYG S  +     L+ +TIS+ +    G  
Sbjct: 84  PCTDDECLFLPAPI--GSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKR 141

Query: 197 -------------YTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNL-YQSTFSYCL 241
                           GC +++ G S +   G+LGLG+G +SL  QT++      FSYCL
Sbjct: 142 AGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCL 201

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD-IPPGAL 300
             +   S + S  +    + +++ +TP+++NP   S YYVN+  + V  + VD I     
Sbjct: 202 VDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDW 261

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA- 359
             +     GTI DSGT  + L  PAY+ V       +          GF+ CY+V  +  
Sbjct: 262 GIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNVTRMEK 321

Query: 360 --PTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
             P + + F G  V  LP +N ++   A ++ C+A+        S  N++ N+ QQ+H I
Sbjct: 322 GMPKLGVEFQGGAVMELPWNNYMVL-VAENVQCVALQKVTTTNGS--NILGNLLQQDHHI 378

Query: 417 LYDVPNSRLGVARELC 432
            YD+  +R+G     C
Sbjct: 379 EYDLAKARIGFKWSPC 394


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 134/453 (29%), Positives = 202/453 (44%), Gaps = 66/453 (14%)

Query: 22  NPICD-----TQD-HSSTLQVFHVFSPCSPFKPSKPLSWEESV---------LEMLAKDQ 66
           NP C      T D + +++ + H   PC+P   S   S  E +         +   AK  
Sbjct: 44  NPACSPAPQVTSDPNRASMPLAHRHGPCAPATTSSWPSLAERLRRDRARRDHITRKAKAS 103

Query: 67  ARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
            R   LS +++      P + G  +  S  Y+V   IGTPA    + +DT +D +WV C 
Sbjct: 104 GRTTTLSDVSI------PTSLGAAV-DSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCK 156

Query: 127 GCVGCS-----STVFNSAQSTTFKNLGCQAAQCKQ-VPNP-------TCGGGACAFNLTY 173
            C   S       +++   S+T+  + C +  CK  VP+        + G   C + + Y
Sbjct: 157 PCNSSSCYPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEY 216

Query: 174 GS-STIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
           G+  T     S +T++L+  + V  + FGC     G      GLLGLG    SL++QT  
Sbjct: 217 GNRDTTVGVYSTETLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAE 276

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKY--TPLLKNPRRSSLYYVNLLAIRV 288
            Y   FSYCLP     S +G L LG P        +  TPL   P +++ Y VNL  + V
Sbjct: 277 TYGGAFSYCLPPGN--STTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSV 334

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSL 346
           G + +DIPP  L        G IIDSGT+ T L   AY+A+R  FR  + +   L   + 
Sbjct: 335 GGKPLDIPPTVLS------GGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNND 388

Query: 347 GGFDTCYSVPIVA----PTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNV 399
              DTCY+   +A    PT+ L F G   +++ +P   +LI        CLA A    + 
Sbjct: 389 DVLDTCYNFTGIANVTVPTVALTFDGGATIDLDVPS-GVLIQD------CLAFAGGASDG 441

Query: 400 NSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +  + +I N+ Q+   +LYD     +G     C
Sbjct: 442 D--VGIIGNVNQRTFEVLYDSGRGHVGFRPGAC 472


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 129/449 (28%), Positives = 190/449 (42%), Gaps = 48/449 (10%)

Query: 6   VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
           +FF   LFL S S+         D+  T  +FH  S  SP + S  LS  + +     + 
Sbjct: 7   LFFHLILFLISFSQ---TTIINGDNGFTTSLFHRDSLLSPLEFSS-LSHYDRLANAFRRS 62

Query: 66  QARLQFLSSLAVARKSVVPIASGRQITQSP---TYIVRAKIGTPAQTLLMAMDTSNDAAW 122
            +R     S A+  ++    A G Q +  P    Y++   IGTP    L   DT +D  W
Sbjct: 63  LSR-----SAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTW 117

Query: 123 VPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTI 178
             C  C+ C      +FN  +ST+F ++ C    C  V +  CG  G C ++ TYG  T 
Sbjct: 118 AQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTY 177

Query: 179 A-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL--YQS 235
           +  +L  + I++ +  V     GC   ++G      G++GLG G LSL++Q         
Sbjct: 178 SKGDLGFEKITIGSSSVKS-VIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISR 236

Query: 236 TFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG--R 290
            FSYCLP+  + + +G +  G    +  P  +  TPL+ +    + YY+ L AI +G  R
Sbjct: 237 RFSYCLPTLLSHA-NGKINFGENAVVSGPGVVS-TPLI-SKNTVTYYYITLEAISIGNER 293

Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
            +     G +          IIDSGT  T L    Y  V     + V +       G  D
Sbjct: 294 HMAFAKQGNV----------IIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLD 343

Query: 351 TCYSVPIVA------PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            C+   I A      P IT  FS G NV L   N      A ++ CL + AA        
Sbjct: 344 LCFDDGINAAASLGIPVITAHFSGGANVNLLPINTF-RKVADNVNCLTLKAASPTTE--F 400

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
            +I N+ Q N  I YD+   RL     +C
Sbjct: 401 GIIGNLAQANFLIGYDLEAKRLSFKPTVC 429


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 117/426 (27%), Positives = 183/426 (42%), Gaps = 55/426 (12%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGR---QITQSPT---------- 96
           K LS  E +   + + +AR   LS  AV  ++     SG+   Q T  PT          
Sbjct: 43  KQLSRSELIRRAMQRSKARAAALS--AVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDL 100

Query: 97  -YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAA 152
            Y+V   IGTP Q +   +DT +D  W    PC  C+     +F   +S +++ + C   
Sbjct: 101 EYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQ 160

Query: 153 QCKQVPNPTCGG-GACAFNLTYGSSTIAANL-SQDTISLATD------IVPGYTFGCIQK 204
            C  + +  C     C +   YG  T+   + + +  +  +        VP   FGC   
Sbjct: 161 LCSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVP-LGFGCGSM 219

Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPIGQ 260
             G+     G++G GR  LSL++Q   L    FSYCL S+    K+    GSL  G  G 
Sbjct: 220 NVGSLNNGSGIVGFGRNPLSLVSQ---LSIRRFSYCLTSYGSGRKSTLLFGSLSGGVYGD 276

Query: 261 PKR-IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
               ++ TPLL++ +  + YYV+L  + VG R + IP  A    P    G I+DSGT  T
Sbjct: 277 ATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALT 336

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSVP-----------IVAPTITLMF 366
            L       V   FR+++   L   + G  +   C+ VP           +  P +   F
Sbjct: 337 LLPGAVLAEVVRAFRQQL--RLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVPRMVFHF 394

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
              ++ LP+ N ++        CL +A + D+ ++    I N+ QQ+ R+LYD+    L 
Sbjct: 395 QDADLDLPRRNYVLDDHRKGRLCLLLADSGDDGST----IGNLVQQDMRVLYDLEAETLS 450

Query: 427 VARELC 432
            A   C
Sbjct: 451 FAPAQC 456


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 107/337 (31%), Positives = 163/337 (48%), Gaps = 42/337 (12%)

Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC 162
           P+   ++A    +   W  C  CV C   S   F+ + S T+    C       +P+ T 
Sbjct: 84  PSPQEILAEMNPDSITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSC-------IPS-TV 135

Query: 163 GGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCIQKATGN-SVPPQGLLGLG 219
           G     +N+TYG  ST   N   DT++L  +D+ P + FGC +   G+      G+LGLG
Sbjct: 136 GN---TYNMTYGDKSTSVGNYGCDTMTLEPSDVFPKFQFGCGRNNEGDFGSGADGMLGLG 192

Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK-RIKYTPLLKNP----- 273
           +G LS ++QT + ++  FSYCLP   ++   GSL  G     +  +K+T L+  P     
Sbjct: 193 QGQLSTVSQTASKFKKVFSYCLPEEDSI---GSLLFGEKATSQSSLKFTSLVNGPGTSGL 249

Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
             S  Y+V LL I VG + +++P            GTIIDSGTV T L   AY+A+   F
Sbjct: 250 EESGYYFVKLLDISVGNKRLNVPSSVF-----ASPGTIIDSGTVITCLPQRAYSALTAAF 304

Query: 334 RRRVG----SNLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLPQDNLLIHSTA 384
           ++ +     SN         DTCY++     ++ P I L F  G +V L    ++  + A
Sbjct: 305 KKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDA 364

Query: 385 GSITCLAMAA-APDNVNSVLNVIANMQQQNHRILYDV 420
             + CLA A  +   +NS L +I N QQ +  +LYD+
Sbjct: 365 SRL-CLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDI 400


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 112/354 (31%), Positives = 163/354 (46%), Gaps = 37/354 (10%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG------CVGCSSTVFNSAQSTTFKNLGCQ 150
           Y  +  +GTPA T LM +DT +D  W P          V   S+   +   T   N  C 
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN--CV 179

Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
           A  C+++ +  C     +C + + YG  ++ A + + +T++ A    V     GC     
Sbjct: 180 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNE 239

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY 266
           G  +   GLLGLGRG LS  +Q    +  +FSYCL    +               +    
Sbjct: 240 GLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSR-------------RARPS 286

Query: 267 TPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAP 324
                 PR ++ YYV+LL   V G RV  +    L+ NPTTG  G I+DSGT  TRL  P
Sbjct: 287 RRWGGTPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARP 346

Query: 325 AYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNL 378
            Y AVRD FR   VG  ++      FDTCY++     +  PT+++  + G +V LP +N 
Sbjct: 347 VYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENY 406

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LI        C AMA     V    ++I N+QQQ  R+++D    R+G   + C
Sbjct: 407 LIPVDTSGTFCFAMAGTDGGV----SIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 158/361 (43%), Gaps = 39/361 (10%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTP    +   DT +D  W    PC  C G  + ++++  S++F  L C +A 
Sbjct: 83  YLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSAT 142

Query: 154 CKQVPNPTCG--GGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
           C  + +  C      C +   Y     +   +  ++        G  FGC     G S  
Sbjct: 143 CLPIWSSRCSTPSATCRYRYAYDDGAYSPECAGISVG-------GIAFGCGVDNGGLSYN 195

Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-------- 263
             G +GLGRGSLSL+AQ   L    FSYCL  F   S S  +  G + +           
Sbjct: 196 STGTVGLGRGSLSLVAQ---LGVGKFSYCLTDFFNTSLSSPVFFGSLAELAASSASADAA 252

Query: 264 -IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA-GTIIDSGTVFTRL 321
            ++ TPL+++P   S YYV+L  I +G   + IP G    N   G+ G I+DSGT+FT L
Sbjct: 253 VVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIFTIL 312

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDT-CYSVPIVA-------PTITLMFS-GMNVT 372
           V   +  V D     +G    V +    D  C+  P          P + L F+ G ++ 
Sbjct: 313 VETGFRVVVDHVAGVLGQ--PVVNASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGADMR 370

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           L +DN +  +   S  CL +        SVL    N QQQN ++L+D+   +L      C
Sbjct: 371 LHRDNYMSFNEEESSFCLNIVGTESASGSVL---GNFQQQNIQMLFDITVGQLSFMPTDC 427

Query: 433 T 433
           +
Sbjct: 428 S 428


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 123/428 (28%), Positives = 190/428 (44%), Gaps = 54/428 (12%)

Query: 55  EESVLEMLAKDQARLQFLS-------------------SLAVARKSVVPIASGRQITQSP 95
           +ESVL++  KD  R++ +                      A++ + V  + SG  +  S 
Sbjct: 91  KESVLDLADKDAVRIETMHRRAARSGGDRTPASPSSSPRRALSERMVATVESGVAVG-SG 149

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAA 152
            Y++   +GTP +   M MDT +D  W+ C  C+ C      VF+ A S++++N+ C   
Sbjct: 150 EYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQ 209

Query: 153 QCKQV--PNP--TC---GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVPGYT 198
           +C  V  P P   C   G  +C +   YG  S    +L+ ++ ++      A+  V    
Sbjct: 210 RCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVV 269

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSL 253
           FGC     G      GLLGLGRG LS  +Q + +Y  TFSYCL    +     + F    
Sbjct: 270 FGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVASKVVFGEDD 329

Query: 254 RLGPIGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGAL--QFNPTTGAGT 310
            L       ++ YT     +    + YYV L  + VG  +++I               GT
Sbjct: 330 ALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGEGGSGGT 389

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCYSVPIV----APTITLM 365
           IIDSGT  +  V PAY  +R  F  R+G +   +        CY+V  V     P ++L+
Sbjct: 390 IIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGVDRPEVPELSLL 449

Query: 366 FS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           F+ G     P +N  I      I CLA+   P    + +++I N QQQN  ++YD+ N+R
Sbjct: 450 FADGAVWDFPAENYFIRLDPDGIMCLAVLGTP---RTGMSIIGNFQQQNFHVVYDLKNNR 506

Query: 425 LGVARELC 432
           LG A   C
Sbjct: 507 LGFAPRRC 514


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 112/398 (28%), Positives = 172/398 (43%), Gaps = 67/398 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSSTVFNS--------AQSTTF 144
           Y++   +GTP + + + MDT +D  WVPC      C+ C+    N           S++ 
Sbjct: 12  YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 71

Query: 145 KNLGCQAAQCKQVPNPTCGGGACA--------------------FNLTYGS-STIAANLS 183
           ++L C +  C  V +       CA                    F  TYG+   +   L+
Sbjct: 72  RDL-CVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLT 130

Query: 184 QDTISLA------TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
           +DT++        T  VP + FGC+         P G+ G GRG LSL +Q   L Q  F
Sbjct: 131 RDTLTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQLGFL-QKGF 186

Query: 238 SYCLPSFKAL---SFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-R 291
           S+C   FK     + S  L +G   I     +++T LLKNP   + YY+ L AI VG   
Sbjct: 187 SHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNAT 246

Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGF 349
            + +P    +F+     G IIDSGT +T L  P YT +  + +  +         +  GF
Sbjct: 247 AIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGF 306

Query: 350 DTCYSVPI----------VAPTITLMFS-GMNVTLPQDNLLIHSTAGS----ITCLAMAA 394
           D CY +P           + P+I+  FS  +++ LPQ N      A S    + CL +  
Sbjct: 307 DLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQN 366

Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             D+ +    V  + QQQN +++YD+   R+G     C
Sbjct: 367 MDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 139/432 (32%), Positives = 196/432 (45%), Gaps = 72/432 (16%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
           S+ L++ H   PC+P + S   +   SV + L  DQ R +++             S A A
Sbjct: 65  SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---- 134
             + VP + G  I  +  Y+V A +GTP     M +DT +D +WV C  C    S     
Sbjct: 123 AVATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQK 181

Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----------GACAFNLTYG-SSTIAA 180
             +F+ AQS+++  + C          P C G             C + ++YG  S    
Sbjct: 182 DPLFDPAQSSSYAAVPCG--------GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTG 233

Query: 181 NLSQDTISL-ATDIVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
             S DT++L A+  V G+ FGC    +G  N V   GLLGLGR   SL+ QT   Y   F
Sbjct: 234 VYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVF 291

Query: 238 SYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
           SYCLP+    S +G L L   GP G       T LL +P   + Y V L  I VG + + 
Sbjct: 292 SYCLPTKP--STAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 349

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGFDTC 352
           +P  A          T++D+GTV TRL   AY A+R  FR  + S    T  S G  DTC
Sbjct: 350 VPASAFAGG------TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTC 403

Query: 353 YSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIA 407
           Y+      +  P + L F SG  VTL  D +L      S  CLA   AP   +  + ++ 
Sbjct: 404 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILG 455

Query: 408 NMQQQNHRILYD 419
           N+QQ++  +  D
Sbjct: 456 NVQQRSFEVRID 467


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/415 (27%), Positives = 168/415 (40%), Gaps = 77/415 (18%)

Query: 83  VPIASGRQITQSPTYIVRAKIGT-PAQTLLMAMDTSNDAAWVPC---------------- 125
           +P+A G        Y +   +G+ P Q + + MDT +D  W PC                
Sbjct: 67  LPLAPGSD------YTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTK 120

Query: 126 -------TGCVGCSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYGS 175
                  T  V C S   ++A ++   +  C  ++C    +    C   +C  F   YG 
Sbjct: 121 PANITKQTHSVSCQSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGD 180

Query: 176 STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL--- 232
            +  ANL Q T+SL++  +  +TFGC   A      P G+ G GRG LSL AQ   L   
Sbjct: 181 GSFVANLYQQTLSLSSLHLQNFTFGCAHTALAE---PTGVAGFGRGILSLPAQLSTLSPH 237

Query: 233 YQSTFSYCLPSFKALSFSGSL--RLGPI--------------GQPKRIKYTPLLKNPRRS 276
             + FSYCL S    SF G    R  P+              G+     YT +L NP+  
Sbjct: 238 LGNRFSYCLVSH---SFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHP 294

Query: 277 SLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRR 336
             Y V L  I VG+R V  P    + +     G ++DSGT FT L    Y AV + F +R
Sbjct: 295 YYYCVGLAGISVGKRTVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKR 354

Query: 337 VG----SNLTVTSLGGFDTCYSVPIVA--PTITLMFSGMN--VTLPQDNLLIH------- 381
           V         + +  G   CY +  ++  P + L F G N  V LP+ N           
Sbjct: 355 VNRFHKRASEIETKTGLGPCYYLNGLSQIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDG 414

Query: 382 -STAGSITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
               G + C+ +    D           + N QQQ   ++YD+   R+G A++ C
Sbjct: 415 IRRKGKVGCMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 114/401 (28%), Positives = 172/401 (42%), Gaps = 73/401 (18%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSSTVFNS--------AQSTTF 144
           Y++   +GTP + + + MDT +D  WVPC      C+ C+    N           S++ 
Sbjct: 29  YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 88

Query: 145 KNLGCQAAQCKQVPNPTCGGGACA--------------------FNLTYGS-STIAANLS 183
           ++L C +  C  V +       CA                    F  TYG+   +   L+
Sbjct: 89  RDL-CVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLT 147

Query: 184 QDTISLA------TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
           +DT++        T  VP + FGC+         P G+ G GRG LSL +Q   L Q  F
Sbjct: 148 RDTLTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQLGFL-QKGF 203

Query: 238 SYCLPSFKAL---SFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-R 291
           S+C   FK     + S  L +G   I     +++T LLKNP   + YY+ L AI VG   
Sbjct: 204 SHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNAT 263

Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR-----RVGSNLTVTSL 346
            + +P    +F+     G IIDSGT +T L  P YT +  + +      R       T  
Sbjct: 264 AIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEART-- 321

Query: 347 GGFDTCYSVPI----------VAPTITLMFS-GMNVTLPQDNLLIHSTAGS----ITCLA 391
            GFD CY +P           + P+I+  FS  +++ LPQ N      A S    + CL 
Sbjct: 322 -GFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLL 380

Query: 392 MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +    D+ +    V  + QQQN +++YD+   R+G     C
Sbjct: 381 LQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421


>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
 gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
          Length = 484

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 121/437 (27%), Positives = 189/437 (43%), Gaps = 51/437 (11%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEE-SVLEMLAKDQARLQFL----SSLAVARKSVVPIA 86
            TL V H  SPCSP   ++    E+ SV ++L +D  R + L    +  + A     P A
Sbjct: 63  DTLPVVHRLSPCSPLGAARIQQLEKPSVADILHRDALRFRSLFRDHNHGSAAPAPTSPGA 122

Query: 87  SG------------RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA----WVPCTGCVG 130
            G            +++  +  Y V A  GTP Q   +  DT+   A      PC     
Sbjct: 123 DGGGLSIPSRGDPIQELPGAFEYHVTAGFGTPVQQFTVGFDTTTTGATQLQCKPCAADEP 182

Query: 131 CSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISL 189
           C    F+ + S++  ++ C +  C    N  C G +C  +++  ++ +  A    D ++L
Sbjct: 183 CHH-AFDPSASSSIAHVPCGSPDCPF--NKGCSGHSCTLSVSINNTLLGNATFFTDKLTL 239

Query: 190 AT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT--QNLYQSTFSYCLPSFKA 246
              +IV  + F C++          G+L L R S SL ++    +     FSYCLPS+  
Sbjct: 240 TPWNIVDDFRFVCLEAGFRPDDDSTGILDLSRNSHSLASRAAPSSPDAVAFSYCLPSYP- 298

Query: 247 LSFSGSLRLG---PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
            S  G L LG   P    +++ YTPL  N    +LY V L+ + +G   + +P  A+   
Sbjct: 299 -SDVGFLSLGATKPELLGRKVSYTPLRSNRHNGNLYVVELVGLGLGGVDLPVPRAAI--- 354

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA---- 359
              G GTI++  T FT L    Y A+RD FR+ +         G  DTCY+   ++    
Sbjct: 355 --AGGGTILELHTTFTYLKPKVYAALRDEFRKSMSQYPVAPPQGSLDTCYNFTALSSYSV 412

Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAG---SITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
           P +TL F  G    L  D ++     G   S+ CLA  A          VI +M Q +  
Sbjct: 413 PAVTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGCLAFVAQDGGA-----VIGSMAQMSTE 467

Query: 416 ILYDVPNSRLGVARELC 432
           ++YDV   ++G     C
Sbjct: 468 VVYDVRGGKVGFVPYRC 484


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 100/354 (28%), Positives = 155/354 (43%), Gaps = 29/354 (8%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--------FNSAQSTT 143
           T +  Y++   +GTP Q +   +D ++D  W+ C+ C  C +          F +  S+T
Sbjct: 92  TNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSST 151

Query: 144 FKNLGCQAAQCKQVPNPTCGGGA--CAFNLTYG---SSTIAANLSQDTISLATDIVPGYT 198
            + + C    C+++   TC      C ++  YG   ++T A  L+ D  + AT    G  
Sbjct: 152 IREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVI 211

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           FGC     G+     G++GLGRG LSL++Q Q      FSY L    A+     +     
Sbjct: 212 FGCAVATEGDI---GGVIGLGRGELSLVSQLQ---IGRFSYYLAPDDAVDVGSFILFLDD 265

Query: 259 GQPK--RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            +P+  R   TPL+ N    SLYYV L  IRV    + IP G          G ++    
Sbjct: 266 AKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITI 325

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV- 371
             T L A AY  VR     ++G      S  G D CY+   +A    P++ L+F+G  V 
Sbjct: 326 PVTFLDAGAYKVVRQAMASKIGLRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVM 385

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
            L   N     +   + CL +  +P    S+L    ++ Q    ++YD+  SRL
Sbjct: 386 ELEMGNYFYMDSTTGLECLTILPSPAGDGSLLG---SLIQVGTHMIYDISGSRL 436


>gi|217073832|gb|ACJ85276.1| unknown [Medicago truncatula]
          Length = 122

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 74/117 (63%), Positives = 90/117 (76%)

Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
           A L QD++ LATD++P Y+FG I   +G S+P QGLLGLGRG LSLL+QT +LY   FSY
Sbjct: 1   ATLVQDSLRLATDVIPSYSFGSINAISGFSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSY 60

Query: 240 CLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           CLPSFK+  FSGSL+LGP+GQPK I+ TPLL+NPRR SLY+VNL  I VG+  V  P
Sbjct: 61  CLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFP 117


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 114/372 (30%), Positives = 171/372 (45%), Gaps = 46/372 (12%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG---CSSTVFNSAQSTTFKNLGCQAAQCK 155
           V   +GTP Q + M +DT ++ +W+ C         S+  F    S+TF  + C +AQC+
Sbjct: 87  VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCR 146

Query: 156 --QVPNPTCGGGA---CAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
              +P+P    GA   C+ +L+Y  GSS+  A L+ D  ++ +       FGC+  A  +
Sbjct: 147 SRDLPSPPACDGASSRCSVSLSYADGSSSDGA-LATDVFAVGSGPPLRAAFGCMSSAFDS 205

Query: 209 S---VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
           S   V   GLLG+ RG+LS ++Q        FSYC+        +G L LG    P  + 
Sbjct: 206 SPDGVASAGLLGMNRGALSFVSQAST---RRFSYCISDRDD---AGVLLLGHSDLPTFLP 259

Query: 266 ------YTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
                 Y P L  P    + Y V LL IRVG + + IP   L  + T    T++DSGT F
Sbjct: 260 LNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQF 319

Query: 319 TRLVAPAYTAVRDVFRRRVG------SNLTVTSLGGFDTCYSVP-------IVAPTITLM 365
           T L+  AY+A++  F R+         + +      FDTC+ VP          P +TL+
Sbjct: 320 TFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLL 379

Query: 366 FSGMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           F+G  + +  D LL            + CL    A D V  +  VI +  Q N  + YD+
Sbjct: 380 FNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNA-DMVPIMAYVIGHHHQMNVWVEYDL 438

Query: 421 PNSRLGVARELC 432
              R+G+A   C
Sbjct: 439 ERGRVGLAPVRC 450


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 107/355 (30%), Positives = 150/355 (42%), Gaps = 42/355 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   +GTP  T  +  DT +D  W    PCT C    +  F  A S+TF  L C ++ 
Sbjct: 86  YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145

Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
           C+ +PN   TC    C +N  YGS   A  L+ +T+ +     P   FGC   +T N + 
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTAGYLATETLKVGDASFPSVAFGC---STENGLG 202

Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTPL 269
            Q  LG+GR                FSYCL S  A   S  +  G +       ++ TP 
Sbjct: 203 -QLDLGVGR----------------FSYCLRSGSAAGAS-PILFGSLANLTDGNVQSTPF 244

Query: 270 LKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAPA 325
           + NP    S YYVNL  I VG    D+P     F  T    G GTI+DSGT  T L    
Sbjct: 245 VNNPAVHPSYYYVNLTGITVGE--TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDG 302

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYS------VPIVAPTITLMFS-GMNVTLPQDNL 378
           Y  V+  F  +     TV    G D C+         I  P++ L F  G    +P    
Sbjct: 303 YEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFA 362

Query: 379 LIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            + + + GS+T   +   P   +  ++VI N+ Q +  +LYD+       A   C
Sbjct: 363 GVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 417


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 168/371 (45%), Gaps = 42/371 (11%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
           T  V   +G+P Q + M +DT ++ +W+ C      +S VFN   S+++  + C +  C+
Sbjct: 39  TLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS-VFNPLSSSSYSPIPCSSPVCR 97

Query: 156 ----QVPNP-TCG-GGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA--- 205
                +PNP TC     C   ++Y  +S++  NL+ D   + +  +PG  FGC+      
Sbjct: 98  TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSS 157

Query: 206 -TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPK 262
            +       GL+G+ RGSLS + Q   L    FSYC+    +   SG L  G   +    
Sbjct: 158 NSEEDAKTTGLMGMNRGSLSFVTQ---LGLPKFSYCISGRDS---SGVLLFGDSHLSWLG 211

Query: 263 RIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
            + YTPL++     P    + Y V L  IRVG +++ +P      + T    T++DSGT 
Sbjct: 212 NLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQ 271

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNL------TVTSLGGFDTCYSVPIVA-----PTITLMF 366
           FT L+ P YTA+R+ F  +    L           G  D CY VP        P ++LMF
Sbjct: 272 FTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMF 331

Query: 367 SGMNVTLPQDNLL-----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
            G  + +  + LL     +      + CL    + D +     VI +  QQN  + +D+ 
Sbjct: 332 RGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNS-DLLGIEAFVIGHHHQQNVWMEFDLV 390

Query: 422 NSRLGVARELC 432
            SR+G     C
Sbjct: 391 KSRVGFVETRC 401


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 120/375 (32%), Positives = 183/375 (48%), Gaps = 30/375 (8%)

Query: 76  AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS--- 132
           A A    +P  SG  +  +  ++V   +GTPAQ   +  DT +D +WV C  C G S   
Sbjct: 124 APAPAVTIPDRSGTYL-DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPC-GSSGHC 181

Query: 133 ----STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG--ACAFNLTYGS-STIAANLSQD 185
                 +F+ ++S+T+  + C   QC    +  C      C + + YG  S+    LS+D
Sbjct: 182 HPQQDPLFDPSKSSTYAAVHCGEPQCAAAGD-LCSEDNTTCLYLVRYGDGSSTTGVLSRD 240

Query: 186 TISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           T++L ++  + G+ FGC  +  G+     GLLGLGRG LSL +Q    + + FSYCLPS 
Sbjct: 241 TLALTSSRALTGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSS 300

Query: 245 KALSFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
              S +G L +G  P       +YT +L+ P+  S Y+V L++I +G  V+ +PP     
Sbjct: 301 N--STTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVF-- 356

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIV 358
              T  GT++DSGTV T L A AY  +RD FR  +            D CY       +V
Sbjct: 357 ---TRGGTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVV 413

Query: 359 APTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
            P ++  F  G    L    ++I     ++ CLA AA  D     L++I N QQ++  ++
Sbjct: 414 VPAVSFRFGDGAVFELDFFGVMIFLDE-NVGCLAFAAM-DTGGLPLSIIGNTQQRSAEVI 471

Query: 418 YDVPNSRLGVARELC 432
           YDV   ++G     C
Sbjct: 472 YDVAAEKIGFVPASC 486


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 127/437 (29%), Positives = 180/437 (41%), Gaps = 68/437 (15%)

Query: 40  FSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA-RKSVVPIASGRQITQSPTYI 98
           F PCSP     P     S+LEML  DQ R +++   A    + V+  A  R +     + 
Sbjct: 63  FGPCSPSAGRAP---APSLLEMLRWDQVRTEYVRRKASGGAEDVLNPAKPRVLMSQTDFA 119

Query: 99  VRAKI---------------GTP--AQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVF 136
           VR+                 G P       MA+DT+ D  W+ C       C      +F
Sbjct: 120 VRSPFGVGSGSGSSAWIDADGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLF 179

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCGG-------GACAFNLTYGSS-TIAANLSQDTIS 188
           +   S+T   + C++  C+ +  P   G         C + + Y      A     DT++
Sbjct: 180 DPTTSSTAAAVRCRSPACRSL-GPYGNGCSNRSANAECRYLIEYSDDRATAGTYMTDTLT 238

Query: 189 LA-TDIVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           ++ T  V  + FGC     G  S    G + LG G+ SLLAQT     + FSYC+P   A
Sbjct: 239 ISGTTAVRNFRFGCSHAVRGRFSDLTAGTMSLGGGAQSLLAQTARSLGNAFSYCVPQASA 298

Query: 247 LSFSGSLRLGPIGQPKR------IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
             F        IG P           TPL+++    SLY V L  I V  R + IPP A 
Sbjct: 299 SGFLS------IGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAF 352

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VP 356
                  AG ++DS  V T+L   AY A+R  FR  + +     + G  DTCY       
Sbjct: 353 S------AGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGATGTLDTCYDFLGLTN 406

Query: 357 IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
           +  P ++L+F  G  V L    ++I        CLA  A   ++   L  I N+QQQ H 
Sbjct: 407 VRVPAVSLVFGGGAVVVLDPPAVMIGG------CLAFTATSSDL--ALGFIGNVQQQTHE 458

Query: 416 ILYDVPNSRLGVARELC 432
           +LYDV    +G  R  C
Sbjct: 459 VLYDVAAGGVGFRRGAC 475


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 177/391 (45%), Gaps = 50/391 (12%)

Query: 62  LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           +A+ +ARL    S+ +AR S               Y V   IGTP Q   +  DT++D  
Sbjct: 68  VARLEARLTGDMSVPLARIS------------DEGYTVTIGIGTPPQLHTLIADTASDLT 115

Query: 122 WVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNP---TCGGGACAFNLTYGS 175
           W  C      +  V   F+ A+S++F  + C +  C +  NP    C    C +   Y S
Sbjct: 116 WTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCTE-DNPGTKRCSNKTCRYVYPYVS 174

Query: 176 STIAANLSQDTISLATD---IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
              A  L+ ++ +L+ +   I   + FGC     GN +   G+LG+    LS+++Q   L
Sbjct: 175 VEAAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGILGMSPAILSMVSQ---L 231

Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL---YYVNLLAIRVG 289
               FSYCL  +     S  L  G      R K T     P + SL   YYV L+ + +G
Sbjct: 232 AIPKFSYCLTPYTDRK-SSPLFFGAWADLGRYKTT----GPIQKSLTFYYYVPLVGLSLG 286

Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF 349
            R +D+P            GT++D G    +L  PA+TA+++     +   LT  ++  +
Sbjct: 287 TRRLDVPAATFALKQ---GGTVVDLGCTVGQLAEPAFTALKEAVLHTLNLPLTNRTVKDY 343

Query: 350 DTCYSVP-------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
             C+++P       +  P + L F  G ++ LP+DN     TAG + CLA+         
Sbjct: 344 KVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYFQEPTAG-LMCLALVPG-----G 397

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            +++I N+QQQN  +L+DV +S+   A  +C
Sbjct: 398 GMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 105/366 (28%), Positives = 167/366 (45%), Gaps = 45/366 (12%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNP-- 160
           IGTP Q + M +DT ++ +W+ C      +S +FN   S T+  + C +  CK   +   
Sbjct: 73  IGTPPQNITMVLDTGSELSWLRCKKEPNFTS-IFNPLASKTYTKIPCSSQTCKTRTSDLT 131

Query: 161 ---TCG-GGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ-- 213
              TC     C F ++Y  +S++  +L+ +T    +   P   FGC+   + ++      
Sbjct: 132 LPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATVFGCMDSGSSSNTEEDAK 191

Query: 214 --GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIKYTPL 269
             GL+G+ RGSLS + Q   +    FSYC+     L  +G L LG       K + YTPL
Sbjct: 192 TTGLMGMNRGSLSFVNQ---MGFRKFSYCI---SGLDSTGFLLLGEARYSWLKPLNYTPL 245

Query: 270 LKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
           ++     P    + Y V L  I+V  +V+ +P      + T    T++DSGT FT L+ P
Sbjct: 246 VQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGP 305

Query: 325 AYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSGMNVT 372
            Y+A+R  F  +    L V +       G  D CY +   +      P + LMF G  ++
Sbjct: 306 VYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKLMFRGAEMS 365

Query: 373 LPQDNLLIH-----STAGSITCLAMAAAPD-NVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           +    LL           S+ C     + +  ++S L  I + QQQN  + YD+ NSR+G
Sbjct: 366 VSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFL--IGHHQQQNVWMEYDLENSRIG 423

Query: 427 VARELC 432
            A   C
Sbjct: 424 FAELRC 429


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 111/386 (28%), Positives = 167/386 (43%), Gaps = 56/386 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST--------VFNSAQSTTFK 145
           Y V    GTP+QT+    DT +   W+PCT    C GC  +         F    S++ K
Sbjct: 90  YSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149

Query: 146 NLGCQAAQCKQV--PNPTCGG----------GACAFNLTYGSSTIAANLSQDTISLATDI 193
            +GCQ+ +C+ +  PN  C G          G   + L YG  + A  L  + +      
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLT 209

Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSG 251
           VP +  GC   +T     P G+ G GRG +SL +Q  NL +  FS+CL S  F   + + 
Sbjct: 210 VPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQ-MNLKR--FSHCLVSRRFDDTNVTT 263

Query: 252 SLRL------GPIGQPKRIKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGAL 300
            L L          +   + YTP  KNP  S+      YY+NL  I VGR+ V IP   L
Sbjct: 264 DLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYL 323

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV- 355
                   G+I+DSG+ FT +  P +  V + F  ++ SN T    +    G   C+++ 
Sbjct: 324 APGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM-SNYTREKDLEKETGLGPCFNIS 382

Query: 356 ---PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAA----PDNVNSVLNVIA 407
               +  P +   F  G  + LP  N           CL + +     P        ++ 
Sbjct: 383 GKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILG 442

Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
           + QQQN+ + YD+ N R G A++ C+
Sbjct: 443 SFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 164/386 (42%), Gaps = 38/386 (9%)

Query: 80  KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
           K +  + SG  +  S  Y +   +GTP +   + +DT +D  W+ C  C  C   +   +
Sbjct: 146 KLIATLESGMTLG-SGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFY 204

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT------CGGGACAFNLTYGS----------STIAA 180
           +   S +FKN+ C   +C  + +P           +C +   YG            T   
Sbjct: 205 DPKTSASFKNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTV 264

Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
           NL+      +   V    FGC     G      GLLGLGRG LS  +Q Q+LY  +FSYC
Sbjct: 265 NLTTTEGRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYC 324

Query: 241 LPSFKA-LSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS--SLYYVNLLAIRVGRRVVD 294
           L    +  + S  L  G    +     + +T  +     S  + YY+ + +I VG   +D
Sbjct: 325 LVDRNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALD 384

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCY 353
           IP      +P    GTIIDSGT  +    PAY  +++ F  ++  N L        D C+
Sbjct: 385 IPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCF 444

Query: 354 SVP------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           +V       I  P + + F+ G     P +N  I   +  + CLA+   P    S  ++I
Sbjct: 445 NVSGIEENNIHLPELGIAFADGAVWNFPAENSFIW-LSEDLVCLAILGTP---KSTFSII 500

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N QQQN  ILYD   SRLG     C
Sbjct: 501 GNYQQQNFHILYDTKMSRLGFTPTKC 526


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 133/451 (29%), Positives = 194/451 (43%), Gaps = 74/451 (16%)

Query: 38  HVFSPCSPFK-------PSKPLS----WEESVLEMLAK---------DQARLQFLSSLAV 77
           H+ SPCSP         P K LS    W+E     + +         D A  +   S  V
Sbjct: 74  HLHSPCSPAAGGRDSAPPPKTLSATLQWDEHRAGHIQRKLSGNAAPMDDAGEETPQSTQV 133

Query: 78  ARKSVVPIASGRQITQS--PTYIVRAKIGTPAQTLL------MAMDTSNDAAWVPCT--- 126
                  +  G+  T S     IV A  G   Q  L      M +DT++D  WV C    
Sbjct: 134 TSSPAANVNVGKSSTDSAFEQGIVPAATGPGGQKKLPGVAQSMVVDTASDVPWVQCAPCP 193

Query: 127 --GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGG----GACAFNLTY--GSS 176
              C   S  +++  +S       C + QC+ +      C G    G C + + Y  GS 
Sbjct: 194 QPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGAGNTGTCQYRVLYPDGSG 253

Query: 177 TIAANLSQDTISLATD---IVPGYTFGC----IQKATGNSVPPQGLLGLGRGSLSLLAQT 229
           T    +S D ++L  D    V  + FGC    ++  + N+    G + LGRG+ SL +QT
Sbjct: 254 TSGTYVS-DLLTLNADPKGAVSKFQFGCSHALLRPGSFNNKT-AGFMALGRGAQSLSSQT 311

Query: 230 QNLYQ--STFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
           +  +   + FSYCLP     S  G L LG P     R   TP+LK+     +Y V L+ I
Sbjct: 312 KGTFSKGNVFSYCLPPTG--SHKGFLSLGVPQHAASRYAVTPMLKSKMAPMIYMVRLIGI 369

Query: 287 RVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL 346
            V  + + +PP     N        +DS T+ TRL   AY A+R  FR ++ +   V   
Sbjct: 370 DVAGQRLPVPPAVFAAN------AAMDSRTIITRLPPTAYMALRAAFRAQMRAYRAVAPK 423

Query: 347 GGFDTCYS---VPIVA-PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
           G  DTCY    VP+V  P +TL+F     V L    +++ S      CLA   AP+  + 
Sbjct: 424 GQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVMLDS------CLAF--APNANDF 475

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +  +I N+QQQ   +LY+V  + +G  R  C
Sbjct: 476 MPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 115/423 (27%), Positives = 197/423 (46%), Gaps = 49/423 (11%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQ 90
           S  L + + + PCS     K  S ++  L+    D++R++ +++    + S      G  
Sbjct: 61  SQGLPITYSYGPCSQLGQKKSPSRQQIFLQ----DRSRVRSINAKIFGQYSTQESKDGWS 116

Query: 91  ------ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GC-SSTVFNSAQS 141
                 + +   ++V    GTP Q   + +DT +D  W+ C  C    C +   FN + S
Sbjct: 117 PESMDTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKKTFNPSLS 176

Query: 142 TTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQ-DTISLATDIVPGYTFG 200
           +++ N  C       +P+         + + Y  ++ +  +   D ++L  D+ P + FG
Sbjct: 177 SSYSNRSC-------IPSTDTN-----YTMKYEDNSYSKGVFVCDEVTLKPDVFPKFQFG 224

Query: 201 CIQKATGNSVPPQGLLGLGRGS-LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-- 257
           C     G      G+LGL +G   SL++QT + ++  FSYC P  +     GSL  G   
Sbjct: 225 CGDSGGGEFGTASGVLGLAKGEQYSLISQTASKFKKKFSYCFPPKEHTL--GSLLFGEKA 282

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           I     +K+T LL NP     Y+V L+ I V ++ +++   +L  +P    GTIIDSGTV
Sbjct: 283 ISASPSLKFTQLL-NPPSGLGYFVELIGISVAKKRLNVS-SSLFASP----GTIIDSGTV 336

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVT---SLGGFDTCYSVP------IVAPTITLMFSG 368
            TRL   AY A+R  F++ +    +++        DTCY++       I  P I L F G
Sbjct: 337 ITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVG 396

Query: 369 -MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
            ++V+L    +L  +   +  CLA A   +   S + +I N QQ + +++YD+   RLG 
Sbjct: 397 EVDVSLHPSGILWANGDLTQACLAFARKSN--PSHVTIIGNRQQVSLKVVYDIEGGRLGF 454

Query: 428 ARE 430
             +
Sbjct: 455 GND 457


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 105/367 (28%), Positives = 164/367 (44%), Gaps = 38/367 (10%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC- 154
           T  +   IG+P Q + M +DT ++ +W+ C      +ST FN   S+++    C ++ C 
Sbjct: 58  TLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNST-FNPLLSSSYTPTPCNSSVCM 116

Query: 155 ---KQVPNP-TC--GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA-- 205
              + +  P +C      C   ++Y  +S+    L+ +T SLA    PG  FGC+  A  
Sbjct: 117 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGY 176

Query: 206 ---TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-IGQP 261
                      GL+G+ RGSLSL+ Q   +    FSYC+    A    G L LG     P
Sbjct: 177 TSDINEDAKTTGLMGMNRGSLSLVTQ---MVLPKFSYCISGEDAF---GVLLLGDGPSAP 230

Query: 262 KRIKYTPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             ++YTPL+     S       Y V L  I+V  +++ +P      + T    T++DSGT
Sbjct: 231 SPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGT 290

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVP---IVAPTITLMFS 367
            FT L+ P Y +++D F  +    LT          G  D CY  P      P +TL+FS
Sbjct: 291 QFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASLAAVPAVTLVFS 350

Query: 368 GMNVTLPQDNLLIHSTAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           G  + +  + LL   + G   + C     + D +     VI +  QQN  + +D+  SR+
Sbjct: 351 GAEMRVSGERLLYRVSKGRDWVYCFTFGNS-DLLGIEAYVIGHHHQQNVWMEFDLVKSRV 409

Query: 426 GVARELC 432
           G     C
Sbjct: 410 GFTETTC 416


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 102/347 (29%), Positives = 151/347 (43%), Gaps = 31/347 (8%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV 157
           I  P     M++DTS D  W+ C       C    + +F+  +S T   + C +A C ++
Sbjct: 155 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 214

Query: 158 PN--PTCGGGACAFNLTYGSS-TIAANLSQDTISL-ATDIVPGYTFGCIQKATGN-SVPP 212
                 C    C + + YG     +     D ++L  + +V  + FGC     GN S   
Sbjct: 215 GRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSAST 274

Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKN 272
            G + LG G  SLL+QT   + + FSYC+P   +  F         G   R   TPL++N
Sbjct: 275 SGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRN 334

Query: 273 PRR-SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           P    +LY V L  I VG R +++PP           G ++DS  + T+L   AY A+R 
Sbjct: 335 PSIIPTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRL 388

Query: 332 VFRRRVGSNLTVTS-LGGFDTCYS----VPIVAPTITLMFSGMNVT-LPQDNLLIHSTAG 385
            FR  + +   V     G DTCY       +  P ++L+F G  V  L    +++     
Sbjct: 389 AFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVEG--- 445

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              CLA    P +    L  I N+QQQ H +LYDV    +G  R  C
Sbjct: 446 ---CLAFVPTPGDF--ALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 167/367 (45%), Gaps = 38/367 (10%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC- 154
           T  V   +G+P Q + M +DT ++ +W+ C      +ST FN   S+++    C ++ C 
Sbjct: 59  TLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNST-FNPLLSSSYTPTPCNSSICT 117

Query: 155 ---KQVPNP-TC--GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA-- 205
              + +  P +C      C   ++Y  +S+    L+ +T SLA    PG  FGC+  A  
Sbjct: 118 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGY 177

Query: 206 ---TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-IGQP 261
                      GL+G+ RGSLSL+ Q   +    FSYC+    AL   G L LG     P
Sbjct: 178 TSDINEDSKTTGLMGMNRGSLSLVTQ---MSLPKFSYCISGEDAL---GVLLLGDGTDAP 231

Query: 262 KRIKYTPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             ++YTPL+     S       Y V L  I+V  +++ +P      + T    T++DSGT
Sbjct: 232 SPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGT 291

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVP---IVAPTITLMFS 367
            FT L+   Y++++D F  +    LT          G  D CY  P      P +TL+FS
Sbjct: 292 QFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFAAVPAVTLVFS 351

Query: 368 GMNVTLPQDNLLIHSTAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           G  + +  + LL   + GS  + C     + D +     VI +  QQN  + +D+  SR+
Sbjct: 352 GAEMRVSGERLLYRVSKGSDWVYCFTFGNS-DLLGIEAYVIGHHHQQNVWMEFDLLKSRV 410

Query: 426 GVARELC 432
           G  +  C
Sbjct: 411 GFTQTTC 417


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 102/347 (29%), Positives = 151/347 (43%), Gaps = 31/347 (8%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV 157
           I  P     M++DTS D  W+ C       C    + +F+  +S T   + C +A C ++
Sbjct: 139 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 198

Query: 158 P--NPTCGGGACAFNLTYGSS-TIAANLSQDTISL-ATDIVPGYTFGCIQKATGN-SVPP 212
                 C    C + + YG     +     D ++L  + +V  + FGC     GN S   
Sbjct: 199 GRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSAST 258

Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKN 272
            G + LG G  SLL+QT   + + FSYC+P   +  F         G   R   TPL++N
Sbjct: 259 SGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRN 318

Query: 273 PRR-SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           P    +LY V L  I VG R +++PP           G ++DS  + T+L   AY A+R 
Sbjct: 319 PSIIPTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRL 372

Query: 332 VFRRRVGSNLTVTS-LGGFDTCYS----VPIVAPTITLMFSGMNVT-LPQDNLLIHSTAG 385
            FR  + +   V     G DTCY       +  P ++L+F G  V  L    +++     
Sbjct: 373 AFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVEG--- 429

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              CLA    P +    L  I N+QQQ H +LYDV    +G  R  C
Sbjct: 430 ---CLAFVPTPGDF--ALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 118/404 (29%), Positives = 182/404 (45%), Gaps = 50/404 (12%)

Query: 63  AKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
           + DQ  L  L +  + R S   ++    +T +    V   +G+P Q + M +DT ++ +W
Sbjct: 31  SSDQTLLFSLKTQKLPRSSSDKLSFRHNVTLT----VTLAVGSPPQNISMVLDTGSELSW 86

Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK----QVPNP-TCGGGA--CAFNLTYGS 175
           + C       S VFN   S+T+  + C +  C+     +P P +C      C   ++Y  
Sbjct: 87  LHCKKSPNLGS-VFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHFCHVAISYAD 145

Query: 176 ST-IAANLSQDTISLATDIVPGYTFGCIQKA----TGNSVPPQGLLGLGRGSLSLLAQTQ 230
           +T I  NL+ DT  + +   PG  FGC+       +       GL+G+ RGSLS + Q  
Sbjct: 146 ATSIEGNLAHDTFVIGSVTRPGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQ-- 203

Query: 231 NLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR--IKYTPLLKN----PRRSSLYY-VNL 283
            L  S FSYC+    +   SG L LG         I+YTPL+      P    + Y V L
Sbjct: 204 -LGFSKFSYCISGSDS---SGILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQL 259

Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
             IRVG +++ +P      + T    T++DSGT FT L+ P YTA+++ F  +  S L +
Sbjct: 260 EGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRI 319

Query: 344 TS------LGGFDTCYSVPIVA-------PTITLMFSGMNVTLPQDNLLIH-STAGS--- 386
                    G  D CY V           P I+LMF G  +++    LL   + AGS   
Sbjct: 320 VDDPNFVFQGTMDLCYRVGSSTRPNFTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGK 379

Query: 387 --ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
             + C     + D +     VI +  QQN  + +D+  SR+G A
Sbjct: 380 EEVYCFTFGNS-DLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFA 422


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 115/372 (30%), Positives = 159/372 (42%), Gaps = 56/372 (15%)

Query: 84  PIASGRQITQSPT--YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNS 138
           P++ G      PT  Y+V   IGTP Q + + +DT +D  W  C  C  C       F+ 
Sbjct: 74  PVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDP 133

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISL--ATDIVPG 196
           + S+T     C +  C+ +P                   +A+    D  +   A   VPG
Sbjct: 134 STSSTLSLTSCDSTLCQGLP-------------------VASLPRSDKFTFVGAGASVPG 174

Query: 197 YTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYC-------LPSFKALS 248
             FGC     G     + G+ G GRG LSL +Q   L    FS+C       +PS   L 
Sbjct: 175 VAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTTITGAIPSTVLLD 231

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
               L     G    ++ TPL++NP   + YY++L  I VG   + +P         TG 
Sbjct: 232 LPADLFSNGQGA---VQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTG- 287

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPIVA----PTI 362
           GTIIDSGT  T L    Y  VRD F  +V   L V S    D   C S P+ A    P +
Sbjct: 288 GTIIDSGTAMTSLPTRVYRLVRDAFAAQV--KLPVVSGNTTDPYFCLSAPLRAKPYVPKL 345

Query: 363 TLMFSGMNVTLPQDNLL--IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            L F G  + LP++N +  +     SI CLA+    +     +  I N QQQN  +LYD+
Sbjct: 346 VLHFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGE-----VTTIGNFQQQNMHVLYDL 400

Query: 421 PNSRLGVARELC 432
            NS+L      C
Sbjct: 401 QNSKLSFVPAQC 412


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 118/373 (31%), Positives = 182/373 (48%), Gaps = 40/373 (10%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-------V 135
           +P  SG  +  +  ++V   +GTPAQ   +  DT +D +WV C  C G S         +
Sbjct: 136 IPDRSGTYL-DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPC-GSSGHCHPQQDPL 193

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT-------YGS-STIAANLSQDTI 187
           F+ ++S+T+  + C   QC         GG C+ + T       YG  S+    LS+DT+
Sbjct: 194 FDPSKSSTYAAVHCGEPQCAAA------GGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTL 247

Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           +L ++  + G+ FGC  +  G+     GLLGLGRG LSL +Q    + + FSYCLPS   
Sbjct: 248 ALTSSRALAGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSN- 306

Query: 247 LSFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
            S +G L +G  P       +YT +L+ P+  S Y+V L++I +G  ++ +PP       
Sbjct: 307 -STTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVF---- 361

Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAP 360
            T  GT++DSGTV T L A AY  +RD FR  +            D CY       ++ P
Sbjct: 362 -TRGGTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVP 420

Query: 361 TITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
            ++  F  G    L    ++I     ++ CLA AA  D     L++I N QQ++  ++YD
Sbjct: 421 AVSFRFGDGAVFELDFFGVMIFLDE-NVGCLAFAAM-DAGGLPLSIIGNTQQRSAEVIYD 478

Query: 420 VPNSRLGVARELC 432
           V   ++G     C
Sbjct: 479 VAAEKIGFVPASC 491


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 157/363 (43%), Gaps = 36/363 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQAA 152
           Y +   +GTP       +DT +D  W  C    T C    + +++ A+S+TF  L C + 
Sbjct: 96  YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASP 155

Query: 153 QCKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISL--------ATDIVPGYTFGCI 202
            C+ +P+    C    C ++  Y     A  L+ DT+++        A+    G  FGC 
Sbjct: 156 LCQALPSAFRACNATGCVYDYRYAVGFTAGYLAADTLAIGDGDGDGDASSSFAGVAFGCS 215

Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-- 260
               G+     G++GLGR +LSLL+Q   +    FSYCL S  A + +  +  G +    
Sbjct: 216 TANGGDMDGASGIVGLGRSALSLLSQ---IGVGRFSYCLRS-DADAGASPILFGALANVT 271

Query: 261 PKRIKYTPLLKNP----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             +++ T LL+NP    RR+  YYVNL  I VG   + +      F      G I+DSGT
Sbjct: 272 GDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGT 331

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG--GFDTCYSVPIV---APTITLMFS-GMN 370
            FT L    YT +R  F  +    LT  S     FD C+         P +   F+ G  
Sbjct: 332 TFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEAGAADTPVPRLVFRFAGGAE 391

Query: 371 VTLPQDNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
             +P+ +        G + CL +          ++VI N+ Q +  +LYD+  +    A 
Sbjct: 392 YAVPRQSYFDAVDEGGRVACLLVLP-----TRGVSVIGNVMQMDLHVLYDLDGATFSFAP 446

Query: 430 ELC 432
             C
Sbjct: 447 ADC 449


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 100/349 (28%), Positives = 161/349 (46%), Gaps = 34/349 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTP    +   DT +D  W  C  C  C    + V++ + S+TF  + C +A 
Sbjct: 77  YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSAT 136

Query: 154 CKQV---PNPTCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI------VPGYTFGCIQ 203
           C  V    N +     C +  +Y     +A  L  +T++L + +      V    FGC  
Sbjct: 137 CLPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGCGT 196

Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--- 260
              G+S+   G +GLGRG+LSLLAQ   L    FSYCL  F   +      LG + +   
Sbjct: 197 DNGGDSLNSTGTVGLGRGTLSLLAQ---LGVGKFSYCLTDFFNSTLDSPFLLGTLAELAP 253

Query: 261 -PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
            P  ++ TPLL++P   S Y V+L  I +G   + IP      +  +  G ++DSGT F+
Sbjct: 254 GPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFS 313

Query: 320 RLVAPAYTAVRDVFRRRVGS-NLTVTSLGGFDTCYSVPI------VAPTITLMFS-GMNV 371
            L    +  V D   + +G   +  +SL     C+  P         P + L F+ G ++
Sbjct: 314 ILPESGFRVVVDHVAQVLGQPPVNASSLD--SPCFPAPAGERQLPFMPDLVLHFAGGADM 371

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            L +DN + ++   S  CL +        S  +++ N QQQN ++L+D+
Sbjct: 372 RLHRDNYMSYNQEDSSFCLNIVG----TTSTWSMLGNFQQQNIQMLFDM 416


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 108/372 (29%), Positives = 167/372 (44%), Gaps = 43/372 (11%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC- 154
           T  V    GTP Q + M +DT ++ +W+ C      +S +FN   S T+  + C +  C 
Sbjct: 66  TLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEPNFNS-IFNPLASKTYTKIPCSSPTCE 124

Query: 155 ---KQVPNPTCGGGA--CAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA--- 205
              + +P P     A  C F ++Y  +S++  NL+ +T  + +   P   FGC+      
Sbjct: 125 TRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATVFGCMDSGFSS 184

Query: 206 -TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPK 262
            +       GL+G+ RGSLS + Q   +    FSYC+    +   SG L LG       K
Sbjct: 185 NSEEDAKTTGLMGMNRGSLSFVNQ---MGFRKFSYCISDRDS---SGVLLLGEASFSWLK 238

Query: 263 RIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
            + YTPL++     P    + Y V L  IRV  +V+ +P      + T    T++DSGT 
Sbjct: 239 PLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQ 298

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSV-PIVA-----PTITLM 365
           FT L+ P Y+A++  F  +    L V +       G  D CY + P  A     P + LM
Sbjct: 299 FTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLM 358

Query: 366 FSGMNVTLPQDNLLIH-----STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           F G  +++    LL           S+ C     + D++     VI + QQQN  + YD+
Sbjct: 359 FRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNS-DSLGIESFVIGHHQQQNVWMEYDL 417

Query: 421 PNSRLGVARELC 432
             SR+G A   C
Sbjct: 418 EKSRIGFAEVRC 429


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 118/385 (30%), Positives = 177/385 (45%), Gaps = 46/385 (11%)

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-----SSTVFNSAQ 140
           AS  +   + +  V   +GTP Q + M +DT ++ +W+ C    G      S+  F    
Sbjct: 55  ASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRA 114

Query: 141 STTFKNLGCQAAQCK--QVPNPTCGGGA---CAFNLTY--GSSTIAANLSQDTISLATDI 193
           S TF ++ C +AQC+   +P+P    GA   C  +L+Y  GSS+  A L+ +  ++    
Sbjct: 115 SLTFASVPCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGA-LATEVFTVGQGP 173

Query: 194 VPGYTFGCIQKA---TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
                FGC+  A   + + V   GLLG+ RG+LS ++Q        FSYC+        +
Sbjct: 174 PLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCISDRDD---A 227

Query: 251 GSLRLGPIGQP-KRIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNP 304
           G L LG    P   + YTPL +     P    + Y V LL IRVG + + IP   L  + 
Sbjct: 228 GVLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDH 287

Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSV--- 355
           T    T++DSGT FT L+  AY+A++  F R+    L   +         FDTC+ V   
Sbjct: 288 TGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQG 347

Query: 356 ---PIVAPTITLMFSGMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIA 407
              P   P +TL+F+G  +T+  D LL            + CL    A D V     VI 
Sbjct: 348 RAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNA-DMVPITAYVIG 406

Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
           +  Q N  + YD+   R+G+A   C
Sbjct: 407 HHHQMNVWVEYDLERGRVGLAPIRC 431


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 130/447 (29%), Positives = 196/447 (43%), Gaps = 79/447 (17%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
           +++ + +   PC+P   S   +   S  EML +D+AR          R  ++  ASGR+I
Sbjct: 56  ASMPLMYRHGPCAP--ASAAATNRPSPAEMLRRDRAR----------RNHILRKASGRRI 103

Query: 92  T-------------QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---- 134
           T              S  Y+V    GTPA   ++ +DT +D +WV C  C   SST    
Sbjct: 104 TLGVSIPTSLGAFVDSLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCN--SSTCYPQ 161

Query: 135 ---VFNSAQSTTFKNLGCQAAQCKQVP---------NPTCGGGACAFNLTYGS-STIAAN 181
              VF+ + S+T+  + C +  C+ +          N + G   C + + YG+  T    
Sbjct: 162 KDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGV 221

Query: 182 LSQDTISL---ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
            S +T++L   A  +V  ++FGC     G      GLLGLG    SL++QT   Y   FS
Sbjct: 222 YSTETLTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFS 281

Query: 239 YCLPSFKALSFSGSLRLGPIG----QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
           YCLP+    S +G L LG            ++TPL      ++ Y V L  I VG + +D
Sbjct: 282 YCLPAGN--STAGFLALGAPATGGNNTAGFQFTPL--QVVETTFYLVKLTGISVGGKQLD 337

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTC 352
           I P           G IIDSGT+ T L   AY+A+R  FR  + +   L        DTC
Sbjct: 338 IEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTC 391

Query: 353 Y----SVPIVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           Y    +  +  PT+ L F G   +++ +P   LL         CLA  A   + ++   +
Sbjct: 392 YDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVLLDG-------CLAFVAGASDGDT--GI 442

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           I N+ Q+   +LYD     +G     C
Sbjct: 443 IGNVNQRTFEVLYDSARGHVGFRAGAC 469


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 98/349 (28%), Positives = 154/349 (44%), Gaps = 34/349 (9%)

Query: 112 MAMDTSNDAAWVPC-------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK--QVPNPTC 162
           + +DT +D  W  C             S  V++  +S+TF  L C    C+  Q     C
Sbjct: 28  LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNC 87

Query: 163 -GGGACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSVPPQGLLGLG 219
                C +   YGS+     L+ +T +      +     FGC   + G+ +   G+LGL 
Sbjct: 88  TSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLS 147

Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
             SLSL+ Q   L    FSYCL  F       L F     L      + I+ T ++ NP 
Sbjct: 148 PESLSLITQ---LKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPV 204

Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
            +  YYV L+ I +G + + +P  +L   P  G GTI+DSG+    LV  A+ AV++   
Sbjct: 205 ETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVM 264

Query: 335 RRVGSNLTVTSLGGFDTCYSVP----------IVAPTITLMFS-GMNVTLPQDNLLIHST 383
             V   +   ++  ++ C+ +P          +  P + L F  G  + LP+DN      
Sbjct: 265 DVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPR 324

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           AG + CLA+    D   S +++I N+QQQN  +L+DV + +   A   C
Sbjct: 325 AG-LMCLAVGKTTD--GSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 124/372 (33%), Positives = 172/372 (46%), Gaps = 38/372 (10%)

Query: 69  LQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC 128
           L FL     A  + VP + G  I  +  Y+V A +GTP     M +DT +D +WV C  C
Sbjct: 21  LGFLPCSHAAAVATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPC 79

Query: 129 VGCSST------VFNSAQSTTFKNLGCQAAQCKQV---PNPTCGGGACAFNLTYG-SSTI 178
               S       +F+ AQS+++  + C    C  +       C    C + ++YG  S  
Sbjct: 80  AAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNT 139

Query: 179 AANLSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
               S DT++L A+  V G+ FGC    +G      GLLGLGR   SL+ QT   Y   F
Sbjct: 140 TGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVF 199

Query: 238 SYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
           SYCLP+    S +G L L   GP G       T LL +P   + Y V L  I VG + + 
Sbjct: 200 SYCLPTKP--STAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 257

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGFDTC 352
           +P  A          T++D+GTV TRL   AY A+R  FR  + S    T  S G  DTC
Sbjct: 258 VPASAFAGG------TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTC 311

Query: 353 YSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIA 407
           Y+      +  P + L F SG  VTL  D +L      S  CLA   AP   +  + ++ 
Sbjct: 312 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILG 363

Query: 408 NMQQQNHRILYD 419
           N+QQ++  +  D
Sbjct: 364 NVQQRSFEVRID 375


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 118/427 (27%), Positives = 179/427 (41%), Gaps = 56/427 (13%)

Query: 38  HVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--------VPIASGR 89
           H   PCSP +    L       EML +D+ R +++   A   + +        VP   G 
Sbjct: 67  HRNGPCSPVRGKGELP----RAEMLRRDRERTEYIIRRASRSRRLQDNNDAVSVPTQLGS 122

Query: 90  QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-----VFNSAQSTTF 144
               S  Y+    +GTPA    + +DT +   WV C  C           +F+   S+++
Sbjct: 123 SY-DSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSY 181

Query: 145 KNLGCQAAQCKQVPNPTCGGG-------ACAFNLTYGS-STIAANLSQDTISLATD-IVP 195
             + C + +C+ +     G G        CA+ + YGS +T A   S D ++L    IV 
Sbjct: 182 SPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGPGAIVK 241

Query: 196 GYTFGC-IQKATGNSVPPQGLLGLGRGSLSLLAQ-TQNLYQSTFSYCLPSFKALSFSGSL 253
            + FGC   +  G      G+LGLGR   SL  Q +       FS+CLP       +G L
Sbjct: 242 RFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV--STGFL 299

Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
            LG         +TPLL    +   Y +   AI V  +++DIPP   +       G I D
Sbjct: 300 ALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFR------EGVITD 353

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGM 369
           SGTV + L   AYTA+R  FR  +        +G  DTC++      +  PT++L F G 
Sbjct: 354 SGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRG- 412

Query: 370 NVTLPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
                     +H  A S      CLA  ++ D       +I ++ Q+   +LYD+P  ++
Sbjct: 413 -------GATVHLDASSGVLMDGCLAFWSSGDEYT---GLIGSVSQRTIEVLYDMPGRKV 462

Query: 426 GVARELC 432
           G     C
Sbjct: 463 GFRTGAC 469


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 130/482 (26%), Positives = 196/482 (40%), Gaps = 84/482 (17%)

Query: 10  AFLFLFSLSEGLNPICDTQDHSSTLQ------VFHVFSPCSPFKPSKPLSWEESVLEMLA 63
           A  + F  +   NP+C     S  L       +     PCS    + P     SV E L 
Sbjct: 35  ANYYYFVAASSPNPVCQGHRVSPPLSGGGWVPLSRPHGPCSSSMDAPP----SSVAETLR 90

Query: 64  KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQ--------------- 108
            DQ R  ++       +  VPI        S   +V+ K+GT  Q               
Sbjct: 91  WDQHRAGYIQR---KLEDQVPITRSVITQVSHQGVVQPKVGTQGQGTGVQPAGEPVGDAP 147

Query: 109 -------TLLMAMDTSNDAAWVPCTGCVG--C---SSTVFNSAQSTTFKNLGCQAAQCKQ 156
                     M +DT++D  WV C  C    C   +  +++ ++S++     C +  C+ 
Sbjct: 148 TGGSGGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRN 207

Query: 157 VPNP-----TCGGGACAFNLTY--GSSTIAANLSQDTISL----ATDIVPGYTFGC---I 202
           +  P     T  G  C + + Y  GS++    +S D ++L        +  + FGC   +
Sbjct: 208 L-GPYANGCTPAGDQCQYRVQYPDGSASAGTYIS-DVLTLNPAKPASAISEFRFGCSHAL 265

Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQP 261
            +    S    G++ LGRG+ SL  QT+  Y   FSYCLP       SG   LG P    
Sbjct: 266 LQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPV--HSGFFILGVPRVAA 323

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
            R   TP+L++     LY V L+AI V  + + +PP          AG ++DS T+ TRL
Sbjct: 324 SRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVF------AAGAVMDSRTIVTRL 377

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS---------VPIVAPTITLMFSGMN-- 370
              AY A+R  F   + +          DTCY            +  P ITL+F G N  
Sbjct: 378 PPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGA 437

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           V L    +L+        CLA   AP+  + +  +I N+QQQ   +LY+V  + +G  R 
Sbjct: 438 VELDPSGVLLDG------CLAF--APNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRG 489

Query: 431 LC 432
            C
Sbjct: 490 AC 491


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 113/374 (30%), Positives = 164/374 (43%), Gaps = 48/374 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTP    +   DT +D  W  C  C  C    + ++++A S +F  + C +A 
Sbjct: 95  YLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASAT 154

Query: 154 CKQVPNPTCGGGA-----CAFNLTYGSSTIAAN-LSQDTISLATDI---------VPGYT 198
           C  +   +    A     C +   Y     +A  L  +T++ A            V G  
Sbjct: 155 CLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVA 214

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           FGC     G S    G +GLGRGSLSL+AQ   L    FSYCL  F   S    +  G +
Sbjct: 215 FGCGVDNGGLSYNSTGTVGLGRGSLSLVAQ---LGVGKFSYCLTDFFNTSLGSPVLFGSL 271

Query: 259 GQ---PKRI-----KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
            +   P  I     + TPL++ P   S YYV+L  I +G   + IP G          G 
Sbjct: 272 AELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGGM 331

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGGFDT-CYSVPIVA--------P 360
           I+DSGT+FT LV  A+   R V     G  N  V +    D+ C+  P  A        P
Sbjct: 332 IVDSGTIFTVLVESAF---RVVVNHVAGVLNQPVVNASSLDSPCF--PATAGEQQLPDMP 386

Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
            + L F+ G ++ L +DN +  +   S  CL +A AP    S+L    N QQQN ++L+D
Sbjct: 387 DMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSIL---GNFQQQNIQMLFD 443

Query: 420 VPNSRLGVARELCT 433
           +   +L      C+
Sbjct: 444 ITVGQLSFVPTDCS 457


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 177/387 (45%), Gaps = 49/387 (12%)

Query: 86  ASGRQIT--QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT 143
           +S R+++   + T  V   +GTP Q++ M +DT ++ +W+ C      +S VFN   S++
Sbjct: 57  SSTRKVSFYHNVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKKQQNINS-VFNPHLSSS 115

Query: 144 FKNLGCQAAQCKQ------VPNPTCGGGACAFNLTYGSST-IAANLSQDTISLATDIVPG 196
           +  + C +  CK       +P        C   ++Y   T +  NL+ DT +++    PG
Sbjct: 116 YTPIPCMSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPG 175

Query: 197 YTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
             FG +     ++        GL+G+ RGSLS + Q   +    FSYC+    A   SG 
Sbjct: 176 IIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQ---MGFPKFSYCISGKDA---SGV 229

Query: 253 LRLGP--IGQPKRIKYTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPT 305
           L  G         +KYTPL+K     P    + Y V L+ IRVG + + +P      + T
Sbjct: 230 LLFGDATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHT 289

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYS----- 354
               T++DSGT FT L+   YTA+R+ F  +    LT+         G  D C+      
Sbjct: 290 GAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGG 349

Query: 355 -VPIVAPTITLMFSGMNVTLPQDNLL--------IHSTAGSITCLAMAAAPDNVNSVLNV 405
            VP V P +T++F G  +++  + LL        +    G + CL    + D +     V
Sbjct: 350 VVPAV-PAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNS-DLLGIEAYV 407

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           I +  QQN  + +D+ NSR+G A   C
Sbjct: 408 IGHHHQQNVWMEFDLVNSRVGFADTKC 434


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 110/370 (29%), Positives = 168/370 (45%), Gaps = 44/370 (11%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
           T  V   +G P Q + M +DT ++ +W+ C       S VFN   S+T+  + C +  C+
Sbjct: 64  TLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS-VFNPVSSSTYSPVPCSSPICR 122

Query: 156 ----QVPNP-TCGGGA--CAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKA-- 205
                +P P +C      C   ++Y  +T I  NL+ +T  + +   PG  FGC+     
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLS 182

Query: 206 --TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-----FKALSFSGSLRLGPI 258
             +       GL+G+ RGSLS + Q   L  S FSYC+       F  L  +    LGPI
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQ---LGFSKFSYCISGSDSSVFLLLGDASYSWLGPI 239

Query: 259 G-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
              P  ++ TPL    R +  Y V L  IRVG +++ +P      + T    T++DSGT 
Sbjct: 240 QYTPLVLQSTPLPYFDRVA--YTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQ 297

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA-------PTITL 364
           FT L+ P YTA+++ F  +  S L +         G  D CY V           P ++L
Sbjct: 298 FTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSL 357

Query: 365 MFSGMNVTLPQDNLLIH-STAGS-----ITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           MF G  +++    LL   + AGS     + C     + D +     VI +  QQN  + +
Sbjct: 358 MFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNS-DLLGIEAFVIGHHHQQNVWMEF 416

Query: 419 DVPNSRLGVA 428
           D+  SR+G A
Sbjct: 417 DLAKSRVGFA 426


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 110/370 (29%), Positives = 168/370 (45%), Gaps = 44/370 (11%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
           T  V   +G P Q + M +DT ++ +W+ C       S VFN   S+T+  + C +  C+
Sbjct: 64  TLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS-VFNPVSSSTYSPVPCSSPICR 122

Query: 156 ----QVPNP-TCGGGA--CAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKA-- 205
                +P P +C      C   ++Y  +T I  NL+ +T  + +   PG  FGC+     
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLS 182

Query: 206 --TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-----FKALSFSGSLRLGPI 258
             +       GL+G+ RGSLS + Q   L  S FSYC+       F  L  +    LGPI
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQ---LGFSKFSYCISGSDSSGFLLLGDASYSWLGPI 239

Query: 259 G-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
              P  ++ TPL    R +  Y V L  IRVG +++ +P      + T    T++DSGT 
Sbjct: 240 QYTPLVLQSTPLPYFDRVA--YTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQ 297

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA-------PTITL 364
           FT L+ P YTA+++ F  +  S L +         G  D CY V           P ++L
Sbjct: 298 FTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSL 357

Query: 365 MFSGMNVTLPQDNLLIH-STAGS-----ITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           MF G  +++    LL   + AGS     + C     + D +     VI +  QQN  + +
Sbjct: 358 MFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNS-DLLGIEAFVIGHHHQQNVWMEF 416

Query: 419 DVPNSRLGVA 428
           D+  SR+G A
Sbjct: 417 DLAKSRVGFA 426


>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 500

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 121/449 (26%), Positives = 194/449 (43%), Gaps = 54/449 (12%)

Query: 22  NPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL--------- 72
           +PI     +   L V H  +PCSP       S   SV ++  +   RL+ L         
Sbjct: 56  SPIPSGASNGKKLPVLHRLNPCSPLNAGGKQSTTSSV-DVSHRAGRRLRSLFAAVQSGDD 114

Query: 73  ----SSLAVARKSVVPIASGRQITQSP---TYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
                + A A   V    +G     +P    Y V    GTPAQ L MA DT    + V C
Sbjct: 115 AAPAPAPAAASGGVTIPTTGTPEPGAPGFHDYTVVVGYGTPAQQLAMAFDTGLGISLVRC 174

Query: 126 TGC---VGCSSTV-FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN 181
             C     C     F+ ++S+TF  + C +  C+      C  G+           ++  
Sbjct: 175 AACRPGAPCDGLASFDPSRSSTFAPVPCGSPDCRS----GCSSGSTPSCPLTSFPFLSGA 230

Query: 182 LSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
           ++QD ++L     V  +TFGC++ ++G  +   GLL L R S S+ ++       TFSYC
Sbjct: 231 VAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYC 290

Query: 241 LPSFKALSFSGSLRLG----PIGQPKRI-KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           LP     S  G L +G    P  +  R+    PL+ +P   + Y ++L  + +G R + I
Sbjct: 291 LP-LSTTSSHGFLAIGEADVPHNRTARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIPI 349

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
           PP A     T  A  ++D+   +T +    Y  +RD FRR +       ++G  DTCY+ 
Sbjct: 350 PPHAA----TASAAMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYNF 405

Query: 356 -----PIVAPTITLMF-------SGMNVTLPQDNLLIHSTAG---SITCLAMAAAPDNVN 400
                 ++ P + L F        G  + L  D +   S  G   S+TCLA AA P + +
Sbjct: 406 TGVRHEVLIPLVHLTFRGIGGGGGGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDGD 465

Query: 401 S---VLNVIANMQQQNHRILYDVPNSRLG 426
           +   +  V+  + Q +  +++DVP  ++G
Sbjct: 466 AEAPLAMVMGTLAQSSMEVVHDVPGGKIG 494


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 168/369 (45%), Gaps = 43/369 (11%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ-- 156
           V   +G+P Q + M +DT ++ +W+ C      +S VFN   S T+  + C +  CK   
Sbjct: 71  VSLTVGSPPQNVTMVLDTGSELSWLHCKKTQFLNS-VFNPLSSKTYSKVPCLSPTCKTRT 129

Query: 157 ----VPNPTCGGGACAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
               +P        C   ++Y  +T I  NL+ +T  L +   P   FGC+     ++  
Sbjct: 130 RDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGFSSNSE 189

Query: 212 PQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP--KRIK 265
                 GL+G+ RGSLS + Q   +    FSYC+  F +   +G L LG    P  K + 
Sbjct: 190 EDSKTTGLIGMNRGSLSFVNQ---MGYPKFSYCISGFDS---AGVLLLGNASFPWLKPLS 243

Query: 266 YTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           YTPL++     P    + Y V L  I+V  +V+ +P      + T    T++DSGT FT 
Sbjct: 244 YTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTF 303

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCY----SVPIVA--PTITLMFSG 368
           L+ P YTA+++ F  +    L V +       G  D CY    S P +   P ++LMF G
Sbjct: 304 LLGPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQG 363

Query: 369 MNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
             +++  + LL           S+ C     + D +     VI +  QQN  + +D+  S
Sbjct: 364 AEMSVSGERLLYRVPGEVRGRDSVWCFTFGNS-DLLGVEAFVIGHHHQQNVWMEFDLEKS 422

Query: 424 RLGVARELC 432
           R+G+A   C
Sbjct: 423 RIGLADVRC 431


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 118/385 (30%), Positives = 177/385 (45%), Gaps = 46/385 (11%)

Query: 86  ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-----SSTVFNSAQ 140
           AS  +   + +  V   +GTP Q + M +DT ++ +W+ C    G      S+  F    
Sbjct: 54  ASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRA 113

Query: 141 STTFKNLGCQAAQCK--QVPNPTCGGGA---CAFNLTY--GSSTIAANLSQDTISLATDI 193
           S TF ++ C +AQC+   +P+P    GA   C  +L+Y  GSS+  A L+ +  ++    
Sbjct: 114 SLTFASVPCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGA-LATEVFTVGQGP 172

Query: 194 VPGYTFGCIQKA---TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
                FGC+  A   + + V   GLLG+ RG+LS ++Q        FSYC+        +
Sbjct: 173 PLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCISDRDD---A 226

Query: 251 GSLRLGPIGQP-KRIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNP 304
           G L LG    P   + YTPL +     P    + Y V LL IRVG + + IP   L  + 
Sbjct: 227 GVLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDH 286

Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSV--- 355
           T    T++DSGT FT L+  AY+A++  F R+    L   +         FDTC+ V   
Sbjct: 287 TGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQG 346

Query: 356 ---PIVAPTITLMFSGMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIA 407
              P   P +TL+F+G  +T+  D LL            + CL    A D V     VI 
Sbjct: 347 RAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNA-DMVPITAYVIG 405

Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
           +  Q N  + YD+   R+G+A   C
Sbjct: 406 HHHQMNVWVEYDLERGRVGLAPIRC 430


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 90/288 (31%), Positives = 138/288 (47%), Gaps = 29/288 (10%)

Query: 162 CGGGA--CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
           CG  A  C + + YG  +     L  + +   T +V  + FGC +   G      GL+GL
Sbjct: 126 CGSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGL 185

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKNPR 274
           GR  LSL++QT  ++   FSYCLPS +    SGSL LG      R    I Y  +++NP+
Sbjct: 186 GRSDLSLISQTSGIFGGVFSYCLPSTERKG-SGSLILGGNSSVYRNSSPISYAKMIENPQ 244

Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI-IDSGTVFTRLVAPAYTAVRDVF 333
             + Y++NL  I +G        G     P+ G   I +DSGTV TRL    Y A++  F
Sbjct: 245 LYNFYFINLTGISIG--------GVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEF 296

Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQD----NLLIHSTAG 385
            ++        +    DTC+++     +  PTI + F G N  L  D       + S A 
Sbjct: 297 LKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEG-NAELTVDVTGVFYFVKSDAS 355

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            + CLA+A+        + ++ N QQ+N R++YD   +++G A E C+
Sbjct: 356 QV-CLALASL--EYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400


>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
          Length = 484

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 132/454 (29%), Positives = 195/454 (42%), Gaps = 58/454 (12%)

Query: 22  NPICDTQDHS--STLQVFHVFSPCSPFKPSKPLSWEE--SVLEMLAKDQARLQFL----- 72
            P C +  HS  S + V H  SPCSP   +      E  SV ++L +D  RL+ L     
Sbjct: 46  KPTCSSA-HSAHSAVPVVHRLSPCSPLAGAARNQQPERRSVADVLHRDALRLRSLLHREE 104

Query: 73  -----SSLAVARKSVVPIAS-GRQITQSP---TYIVRAKIGTPAQTLLMAMDTSNDAA-W 122
                 + A      V I S G  I + P    Y V A  GTP Q L +  DT+   A  
Sbjct: 105 DNHRTPAPAAPPGGGVSIPSRGEPIEELPGAFEYHVVAGFGTPMQKLPVGFDTTTTGATL 164

Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG-----ACAFNLTYGSST 177
           + CT C   +   F+ + S++   + C +  C   P   C G      + +FN T   + 
Sbjct: 165 LQCTPCGSGADHAFDPSASSSVSQVPCGSPDC---PFHGCSGRPSCTLSVSFNNTLLGNA 221

Query: 178 IAANLSQDTISLATDIVPGYTFGCIQ-----KATGNSVPPQGLLGLGRGSLSL---LAQT 229
                +      ++  V  + F C++      A   S    G+L L R S SL   L  +
Sbjct: 222 TFFTDTLTLTPSSSATVDKFRFACLEGIAPGPAEDGSA---GILDLSRNSHSLPSRLVAS 278

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
              +   FSYCLP+  A    G L LG   P    +++ YTPL  +P   +LY V+L+ +
Sbjct: 279 SPPHAVAFSYCLPASTA--DVGFLSLGATKPELLGRKVSYTPLRGSPSNGNLYVVDLVGL 336

Query: 287 RVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL 346
            +G   + IPP A+      G  TI++  T FT L    Y  +RD FR+ +        L
Sbjct: 337 GLGGPDLPIPPAAI-----AGDDTILELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPL 391

Query: 347 GGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAG---SITCLAMAAAPDN 398
           G  DTCY+         P +TL F+ G +V L  D ++  +      SI CLA  A  D+
Sbjct: 392 GSLDTCYNFTGLDAFSVPAVTLKFAGGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDD 451

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            +    VI +M Q +  ++YDV   ++G     C
Sbjct: 452 CDGG-TVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484


>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 116/387 (29%), Positives = 167/387 (43%), Gaps = 59/387 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTVFNSAQSTTF--------K 145
           Y V    GTP+QTL   MDT +   W PCT    C  CS    + A+  TF        K
Sbjct: 90  YSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAK 149

Query: 146 NLGCQAAQCKQVPN-------PTCGGG------AC-AFNLTYGSSTIAANLSQDTISLAT 191
            +GC   +C  V +       P C         AC  + + YG  T    L  +++  A 
Sbjct: 150 IVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAE 209

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK----AL 247
              P +  GC   +  +S  P G+ G GRG  SL  Q   +    FSYCL S +      
Sbjct: 210 RTEPDFVVGC---SILSSRQPSGIAGFGRGPSSLPKQ---MGLKKFSYCLLSHRFDDSPK 263

Query: 248 SFSGSLRLGPIGQPKR---IKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGA 299
           S   +L +GP  +  +   + YTP  KNP  S+      YYV L  I VG + V +P   
Sbjct: 264 SSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSF 323

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV 355
           +        GTI+DSG+ FT +  P + AV   F R++ +N T    V +L G   C+++
Sbjct: 324 MVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQM-ANYTRAADVEALSGLKPCFNL 382

Query: 356 ----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN-----V 405
                +  P++   F  G  + LP  N        S+ CL + +  + V S L+     +
Sbjct: 383 SGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSN-EAVGSTLSSGPSII 441

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           + N Q QN    YD+ N R G  R+ C
Sbjct: 442 LGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 135/419 (32%), Positives = 188/419 (44%), Gaps = 72/419 (17%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
           S+ L++ H   PC+P + S   +   SV + L  DQ R +++             S A A
Sbjct: 65  SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST- 134
             + VP + G  I  +  Y+V A +GTP     M +DT +D +WV   PC+    C S  
Sbjct: 123 AAATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQK 181

Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD 192
             +F+ AQS+++            VP   CGG  CA    Y         +    +    
Sbjct: 182 DPLFDPAQSSSYA----------AVP---CGGPVCAGLGIY--------AASACSAAQCG 220

Query: 193 IVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
            V G+ FGC    +G  N V   GLLGLGR   SL+ QT   Y   FSYCLP+    S +
Sbjct: 221 AVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKP--STA 276

Query: 251 GSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
           G L L   GP G       T LL +P   + Y V L  I VG + + +P  A        
Sbjct: 277 GYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGG---- 332

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP----IVAPT 361
             T++D+GTV TRL   AY A+R  FR  + S    T  S G  DTCY+      +  P 
Sbjct: 333 --TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 390

Query: 362 ITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           + L F SG  VTL  D +L      S  CLA   AP   +  + ++ N+QQ++  +  D
Sbjct: 391 VALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILGNVQQRSFEVRID 441


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 103/365 (28%), Positives = 181/365 (49%), Gaps = 38/365 (10%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQC 154
           +V   IGTP QT  M +DT +  +W+ C   V      S+VF+ + S++F  L C    C
Sbjct: 83  LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLC 142

Query: 155 K-QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKAT 206
           K ++P+   PT C     C ++  Y   T+A  NL ++ I+ + +   P    GC ++++
Sbjct: 143 KPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESS 202

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQPKR 263
                 +G+LG+  G LS  +Q +    + FSYC+P+ +     + +GS  LG       
Sbjct: 203 D----AKGILGMNLGRLSFASQAK---LTKFSYCVPTRQVRPGFTPTGSFYLGENPNSGG 255

Query: 264 IKYTPLL---KNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            +Y  LL   ++ R  +L    Y V +  IR+G + ++IP  A + +P+    T+IDSG+
Sbjct: 256 FRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGS 315

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCY---SVPIVAPTITLMFS---G 368
            FT LV  AY  VR+   R VG+ L    + G   D C+   ++ I      ++F    G
Sbjct: 316 EFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKG 375

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
           + + + ++ +L     G + C+ +  + + + +  N+I N  QQN  + +D+ N R+G  
Sbjct: 376 VEIVVEKERVLA-DVGGGVHCVGIGRS-EMLGAASNIIGNFHQQNIWVEFDLANRRVGFG 433

Query: 429 RELCT 433
           +  C+
Sbjct: 434 KADCS 438


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 173/362 (47%), Gaps = 36/362 (9%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-Q 156
           I+   IGTP QT  M +DT +  +W+ C      +++ F+ + S+TF  L C    CK +
Sbjct: 76  IINLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTAS-FDPSLSSTFSILPCTHPLCKPR 134

Query: 157 VPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI-VPGYTFGCIQKATGNS 209
           +P+   PT C     C ++  Y   T A  NL ++  + +  +  P    GC  ++T   
Sbjct: 135 IPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPLILGCATESTD-- 192

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQPKRIKY 266
             P+G+LG+  G LS   Q++    + FSYC+P  +     + +GS  LG     K  KY
Sbjct: 193 --PRGILGMNLGRLSFAKQSK---ITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSKGFKY 247

Query: 267 TPLLKNPRRSS------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
             ++ + R+         Y + ++ IR+  + ++I P   + +      T+IDSG+ FT 
Sbjct: 248 VGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGSEFTY 307

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGG------FDTCYSVPIVAPTITLMFS---GMNV 371
           LV+ AY  VR    R VG  L    + G      FD+  +V I      ++F    G+ V
Sbjct: 308 LVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMCFDSVKAVEIGRLIGEMVFEFERGVEV 367

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            +P++ +L     G + C+ + ++ D + +  N+I N  QQN  + +D+   R+G  +  
Sbjct: 368 VIPKERVLA-DVGGGVHCVGIGSS-DKLGAASNIIGNFHQQNLWVEFDLVRRRVGFGKAD 425

Query: 432 CT 433
           C+
Sbjct: 426 CS 427


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/365 (27%), Positives = 166/365 (45%), Gaps = 38/365 (10%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQC 154
           IV   IGTP Q   M +DT +  +W+ C          +  F+ + S+TF  L C    C
Sbjct: 98  IVDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVC 157

Query: 155 K-QVPNPT----CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI-VPGYTFGCIQKAT 206
           K ++P+ T    C     C ++  Y   T A  NL ++  + +  +  P    GC  ++T
Sbjct: 158 KPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATEST 217

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF---KALSFSGSLRLGPIGQPKR 263
                P+G+LG+ RG LS  +Q++    + FSYC+P+       + +GS  LG       
Sbjct: 218 D----PRGILGMNRGRLSFASQSK---ITKFSYCVPTRVTRPGYTPTGSFYLGHNPNSNT 270

Query: 264 IKYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            +Y  +L   R   +       Y V L  IR+G R ++I P   + +      T++DSG+
Sbjct: 271 FRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGS 330

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYS------VPIVAPTITLMFSG 368
            FT LV  AY  VR    R VG  +    + G   D C+         ++   +     G
Sbjct: 331 EFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFEFEKG 390

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
           + + +P++ +L  +  G + C+ +A + D + +  N+I N  QQN  + +D+ N R+G  
Sbjct: 391 VQIVVPKERVLA-TVEGGVHCIGIANS-DKLGAASNIIGNFHQQNLWVEFDLVNRRMGFG 448

Query: 429 RELCT 433
              C+
Sbjct: 449 TADCS 453


>gi|3123349|emb|CAA06698.1| hypothetical protein [Cicer arietinum]
          Length = 99

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 57/91 (62%), Positives = 74/91 (81%), Gaps = 2/91 (2%)

Query: 344 TSLGGFDTCY--SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
           +SLG FDTC+  +   +AP ITL F+ +N+TLP +N LIHS++GS+ CLAMAAAP NVNS
Sbjct: 8   SSLGAFDTCFVKTYETLAPAITLRFTDLNLTLPMENSLIHSSSGSLACLAMAAAPSNVNS 67

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           VLNVIAN QQQN R+L+D  N+++G+ARELC
Sbjct: 68  VLNVIANFQQQNLRVLFDTVNNKVGIARELC 98


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 118/377 (31%), Positives = 173/377 (45%), Gaps = 51/377 (13%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPC-TGCVGCSSTV--------FNSAQSTTFKNLGC 149
           V   +GTP Q + M +DT ++ +W+ C TG  G ++          F    S TF  + C
Sbjct: 65  VSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPC 124

Query: 150 QAAQC--KQVPNP-TCGGGA--CAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCI 202
            + QC  + +P P +C G +  C  +L+Y  GS++  A L+ D  ++         FGC+
Sbjct: 125 GSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGA-LATDVFAVGEAPPLRSAFGCM 183

Query: 203 QKATGNS---VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
             A  +S   V   GLLG+ RG+LS + Q        FSYC+        +G L LG   
Sbjct: 184 STAYDSSPDGVATAGLLGMNRGTLSFVTQAST---RRFSYCI---SDRDDAGVLLLGHSD 237

Query: 260 QP-KRIKYTPL----LKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
            P   + YTPL    L  P    + Y V LL IRVG + + IP   L  + T    T++D
Sbjct: 238 LPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVD 297

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF------DTCYSVP-------IVAP 360
           SGT FT L+  AY+A++  F ++    L       F      DTC+ VP          P
Sbjct: 298 SGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLP 357

Query: 361 TITLMFSGMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
            +TL+F+G  +++  D LL      H  A  + CL    A D V     VI +  Q N  
Sbjct: 358 PVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNA-DMVPLTAYVIGHHHQMNLW 416

Query: 416 ILYDVPNSRLGVARELC 432
           + YD+   R+G+A   C
Sbjct: 417 VEYDLERGRVGLAPVKC 433


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 106/392 (27%), Positives = 174/392 (44%), Gaps = 68/392 (17%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSSTVFNSAQS-------TTFK 145
           Y++   IGTP Q + + MDT +D  W PC      C+ C +   N   +       ++  
Sbjct: 80  YLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSSSH 139

Query: 146 NLGCQAAQCKQV---PNP-----------------TCGGGACAFNLTYGS-STIAANLSQ 184
              C +  C  V    NP                 TC      F  TYG+   +   L++
Sbjct: 140 RDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLTR 199

Query: 185 DTISL------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
           DT+ +       T  +P + FGC+  +      P G+ G GRG+LSL +Q   L +  FS
Sbjct: 200 DTLRVHGRNLGVTQEIPRFCFGCVASSYRE---PIGIAGFGRGALSLPSQLGFL-RKGFS 255

Query: 239 YCLPSFKAL---SFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVNLLAIRVGR-RV 292
           +C  +FK     + S  L +G I    +  +++TP+LK+P   + YYV L AI VG    
Sbjct: 256 HCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITVGNVSA 315

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV----GSNLTVTSLGG 348
            ++P    +F+     G ++DSGT +T L  P Y+ V  V +  +     +++ + +  G
Sbjct: 316 TEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDMEMRT--G 373

Query: 349 FDTCYSVPI---------VAPTITLMF-SGMNVTLPQDNLLIHSTAGS----ITCLAMAA 394
           FD CY VP          + P+IT  F +  ++ L + +     +A S    + CL   +
Sbjct: 374 FDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCLLFQS 433

Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
             D       V+ + QQQ+  ++YD+   R+G
Sbjct: 434 MDDGDYGPAGVLGSFQQQDVEVVYDMEKERIG 465


>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 452

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 106/396 (26%), Positives = 179/396 (45%), Gaps = 54/396 (13%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT 143
           P A+  +   + +  V   +GTP Q + M +DT ++ +W+ C G        F+++ S++
Sbjct: 50  PPANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSR--HDAPFDASASSS 107

Query: 144 FKNLGCQAAQC----KQVP-NPTCGGGACAFNLTYGSSTIAANL-SQDTISLATDIVPGY 197
           +  + C +  C    + +P  P C   AC  +L+Y  ++ A  L + DT  L +  +P  
Sbjct: 108 YAPVPCSSPACTWLGRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSSPMPAL 167

Query: 198 TFGCIQKATGNS----VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA---LSFS 250
            FGCI   + ++     PP GLLG+ RG LS + QT       F+YC+ + +    L   
Sbjct: 168 -FGCITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTAT---RRFAYCIAAGQGPGILLLG 223

Query: 251 GSLRLGPIGQP--KRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
           G+    P+  P  +++ YTPL++  +       + Y V L  IRVG  ++ IP   L  +
Sbjct: 224 GNDTETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPD 283

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----------VTSLGGFDTCY 353
            T    T++DSGT FT L+  AY A++  F  ++  +L               G FD C+
Sbjct: 284 HTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACF 343

Query: 354 ----------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGS-------ITCLAMAAAP 396
                     +   + P + L+  G  V +     L++   G        + CL   ++ 
Sbjct: 344 RGTEARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSS- 402

Query: 397 DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           D       VI +  QQ+  + YD+ N+RLG A   C
Sbjct: 403 DMAGVSAYVIGHHHQQDVWVEYDLRNARLGFAAARC 438


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 96/339 (28%), Positives = 150/339 (44%), Gaps = 46/339 (13%)

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAW----VPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
           TY+V   IGTP   L   +DT +D  W     PC  C    + ++  A+S T+ N+ C++
Sbjct: 91  TYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRS 150

Query: 152 AQCK--QVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKA 205
             C+  Q P   C      CA+  +YG  T     L+ +T +L +D  V G  FGC  + 
Sbjct: 151 PMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTEN 210

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            G++    GL+G+GRG LSL++Q                          LG + +P+R  
Sbjct: 211 LGSTDNSSGLVGMGRGPLSLVSQ--------------------------LG-VTRPRRSC 243

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
                     +      L  I VG  ++ I P   +  P    G IIDSGT FT L   A
Sbjct: 244 RARAAARGGGAPTTTSPLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERA 303

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIH 381
           + A+      RV   L   +  G   C++      +  P + L F G ++ L +++ ++ 
Sbjct: 304 FVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVE 363

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
             +  + CL M +A       ++V+ +MQQQN  ILYD+
Sbjct: 364 DRSAGVACLGMVSARG-----MSVLGSMQQQNTHILYDL 397


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 115/370 (31%), Positives = 175/370 (47%), Gaps = 42/370 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   +G P +  L+ +DT +D  W+   PC  C   S  VF+ +QST+FK + C AA 
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 230

Query: 154 CKQVPNPTCGGGA-------CAFNLTYG-SSTIAANLSQDTISLATDIVPG------YTF 199
           C  V +  C   +       C +   YG SS  + +L+ +++S++    P          
Sbjct: 231 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 290

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN--LYQSTFSYCL-PSFKALSFSGSLRLG 256
           GC     G      GLLGLG+G+LS  +Q ++  + QS FSYCL      LS S ++  G
Sbjct: 291 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQS-FSYCLVDRTNNLSVSSAISFG 349

Query: 257 PIGQPKR----IKYTPLLK-NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
                 R    +++TP ++ N    + YY+ +  I++ + ++ IP       P    GTI
Sbjct: 350 AGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTI 409

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------SVPIVAPTITLM 365
           IDSGT  T L   AY AV   F  R+ S            CY      +VP   PT++++
Sbjct: 410 IDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRTAVPF--PTLSIV 466

Query: 366 F-SGMNVTLPQDNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           F +G  + LPQ+N  I      +  CLA+          +++I N QQQN   LYDV ++
Sbjct: 467 FQNGAELDLPQENYFIQPDPQEAKHCLAILPT-----DGMSIIGNFQQQNIHFLYDVQHA 521

Query: 424 RLGVARELCT 433
           RLG A   C+
Sbjct: 522 RLGFANTDCS 531


>gi|413916846|gb|AFW56778.1| hypothetical protein ZEAMMB73_865423 [Zea mays]
          Length = 130

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 66/131 (50%), Positives = 92/131 (70%), Gaps = 3/131 (2%)

Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
           ++ IRVG R V +P  AL F P +G  TI+++GT+FTRL AP Y  VRDVF+ RV + + 
Sbjct: 1   MVRIRVGGRPVPVPASALAFEPASGRDTIVEAGTMFTRLSAPVYAVVRDVFQSRVRAPVA 60

Query: 343 VTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDN-VN 400
              LGGF+T Y+V I  P +T  F G ++VTLP+ N++I S++  I CLAMAA P N V+
Sbjct: 61  -GPLGGFNTFYNVTISVPIVTFSFDGRVSVTLPERNVVIRSSSDGIACLAMAAGPSNGVD 119

Query: 401 SVLNVIANMQQ 411
           +VLN++A+MQQ
Sbjct: 120 AVLNMLASMQQ 130


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 163/386 (42%), Gaps = 38/386 (9%)

Query: 80  KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
           K +  + SG  +  S  Y +   +GTP +   + +DT +D  W+ C  C  C   +   +
Sbjct: 144 KLIATLESGMTLG-SGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFY 202

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGS----------STIAA 180
           +   S +FKN+ C   +C  + +P           +C +   YG            T   
Sbjct: 203 DPKTSASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTV 262

Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
           NL+      +   V    FGC     G      GLLGLGRG LS  +Q Q+LY  +FSYC
Sbjct: 263 NLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYC 322

Query: 241 LPSFKA-LSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS--SLYYVNLLAIRVGRRVVD 294
           L    +  + S  L  G    +     + +T  +     S  + YY+ + +I VG + +D
Sbjct: 323 LVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALD 382

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV-TSLGGFDTCY 353
           IP      +     GTIIDSGT  +    PAY  +++ F  ++  N  +       D C+
Sbjct: 383 IPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCF 442

Query: 354 SVP------IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           +V       I  P + + F  G     P +N  I   +  + CLA+   P    S  ++I
Sbjct: 443 NVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIW-LSEDLVCLAILGTP---KSTFSII 498

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N QQQN  ILYD   SRLG     C
Sbjct: 499 GNYQQQNFHILYDTKRSRLGFTPTKC 524


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 105/394 (26%), Positives = 175/394 (44%), Gaps = 34/394 (8%)

Query: 57  SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
           SV    A  +   + L+  A    +VVPI      TQ+  Y+    IGTP Q     +D 
Sbjct: 15  SVTARAAAFRVHGRLLADAATEGGAVVPI----HWTQAMNYVANFTIGTPPQPASAVIDL 70

Query: 117 SNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGACAFNL 171
           + +  W  C  C  C    + +F+   S T++   C    C+ +P+ +  C G  CA+  
Sbjct: 71  AGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPSDSRNCSGNVCAYQA 130

Query: 172 TYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP-PQGLLGLGRGSLSLLAQTQ 230
           +  +      +  DT ++ T       FGC+  +  +++  P G++GLGR   SL+ QT 
Sbjct: 131 STNAGDTGGKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTG 189

Query: 231 NLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY-VNLL 284
               + FSYCL    A     L    S +L   G+     +  +  N    S YY V L 
Sbjct: 190 ---VAAFSYCLAPHDAGRNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLE 246

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            ++ G  ++ +PP        +G+  ++D+ +  + LV  AY AV+      VG+    T
Sbjct: 247 GLKAGDAMIPLPP--------SGSTVLLDTFSPISFLVDGAYQAVKKAVTAAVGAPPMAT 298

Query: 345 SLGGFDTCY---SVPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM-AAAPDNV 399
            +  FD C+        AP +   F  G  +T+P  N L+    G++ CLAM ++A  N 
Sbjct: 299 PVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGTV-CLAMLSSARLNS 357

Query: 400 NSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            + L+++ ++QQ+N   L+D+    L      CT
Sbjct: 358 TTELSLLGSLQQENIHFLFDLDKETLSFEPADCT 391


>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 116/387 (29%), Positives = 166/387 (42%), Gaps = 59/387 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTVFNSAQSTTF--------K 145
           Y V    GTP+QTL   MDT +   W PCT    C  CS    + A+  TF        K
Sbjct: 90  YSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAK 149

Query: 146 NLGCQAAQCKQVPN-------PTCGGG------AC-AFNLTYGSSTIAANLSQDTISLAT 191
            +GC   +C  V +       P C         AC  + + YG  T    L  +++  A 
Sbjct: 150 IVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAE 209

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK----AL 247
              P +  GC   +  +S  P G+ G GRG  SL  Q   +    FSYCL S +      
Sbjct: 210 RTEPDFVVGC---SILSSRQPSGIAGFGRGPSSLPKQ---MGLKKFSYCLLSHRFDDSPK 263

Query: 248 SFSGSLRLGPIGQPKR---IKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGA 299
           S   +L +GP  +  +   + YTP  KNP  S+      YYV L  I VG + V  P   
Sbjct: 264 SSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPYSF 323

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV 355
           +        GTI+DSG+ FT +  P + AV   F R++ +N T    V +L G   C+++
Sbjct: 324 MVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQM-ANYTRAADVEALSGLKPCFNL 382

Query: 356 ----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN-----V 405
                +  P++   F  G  + LP  N        S+ CL + +  + V S L+     +
Sbjct: 383 SGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSN-EAVGSTLSSGPSII 441

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           + N Q QN    YD+ N R G  R+ C
Sbjct: 442 LGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 124/427 (29%), Positives = 187/427 (43%), Gaps = 39/427 (9%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEE-SVLEMLAKDQARLQFLSSLAVARKS-VVPIASGRQ 90
           +L++ H +S  SPF P     +E  + L  L+K +A      +LA+   S   P A   +
Sbjct: 29  SLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAH-----NLAITTSSGFSPEAFRLR 83

Query: 91  ITQSPT-YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKN 146
           I+Q  T Y+V+  IG+P   L +  DT +   W  C  C         +FNS  S T+++
Sbjct: 84  ISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRD 143

Query: 147 LGCQAAQCKQVPNP-TCGGGACAFNLTY-GSSTIAANLSQDTI-SLATDIVPGYTFGCIQ 203
           L CQ   C    N   C    C + + Y G S  A   +QD + S   D +P Y FGC +
Sbjct: 144 LPCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDILQSAENDRIPFY-FGCSR 202

Query: 204 KATGNSV-----PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS---LRL 255
                S         G++GL    +SLL Q  ++ ++ FSYCL  F   S S +   LR 
Sbjct: 203 DNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLRF 262

Query: 256 GPIGQPKRIKY--TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
           G   +  R KY  TP + +PR    Y++NL+ + V    + IPPG     P    GTIID
Sbjct: 263 GNDIRKSRRKYLSTPFV-SPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTIID 321

Query: 314 SGTVFTRLVAPAY----TAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLM 365
           SGT  T +   AY    TA ++ F +  G       L G+  CY          P++   
Sbjct: 322 SGTAVTYISQTAYFPVITAFKNYFDQH-GFQRVNIQLSGY-ICYKQQGHTFHNYPSMAFH 379

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           F G +  +  + + +        C+A+        +   +I  + Q N + +YD  N +L
Sbjct: 380 FQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQRT---IIGALNQANTQFIYDAANRQL 436

Query: 426 GVARELC 432
               E C
Sbjct: 437 LFTPENC 443


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 118/399 (29%), Positives = 173/399 (43%), Gaps = 68/399 (17%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFK 145
           Y     +GTP Q L + +DT +  +WVPCT    C +            VF+   S++ +
Sbjct: 91  YAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSR 150

Query: 146 NLGCQAAQCKQVPNP---TCG-------GGAC-AFNLTYGSSTIAANLSQDTISLATDIV 194
            +GC+   C+ + +    TCG       G  C  + + YGS + +  L  DT+ L+    
Sbjct: 151 LVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGSTSGLLISDTLRLSPSSS 210

Query: 195 PGYTFGCIQKATGNSV-----PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---A 246
                     A G S+     PP GL G GRG+ S+ +Q   L    FSYCL S +    
Sbjct: 211 SSAPAPFRNFAIGCSIVSVHQPPSGLAGFGRGAPSVPSQ---LKVPKFSYCLLSRRFDDN 267

Query: 247 LSFSGSLRLG----PIGQPK-RIKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVDIPP 297
            + SG L LG    P G+ K  ++Y PLL N    P  S  YY+ L  I VG + V++P 
Sbjct: 268 SAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKPVNLPS 327

Query: 298 GALQFNPTTGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
            A  F P++G G IIDSGT FT L      P   A+      R   +  V    G   C+
Sbjct: 328 RA--FVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGLRPCF 385

Query: 354 SVP------IVAPTITLMFSGMNVT-LPQDNLLIHSTAGSIT-------CLAMAA----- 394
           ++P      +  P + L F G  V  LP +N  + +             CLA+ +     
Sbjct: 386 ALPPGPGGAMELPDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVVSDLPAS 445

Query: 395 -APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                      ++ + QQQN+ I YD+   RLG  ++ C
Sbjct: 446 GGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPC 484


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 103/348 (29%), Positives = 154/348 (44%), Gaps = 27/348 (7%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
           T +  Y+    IGTP Q +  A+D S+D  W  C      ++  FN  +STT  ++ C  
Sbjct: 95  TNAGMYVFSYGIGTPPQQVSGALDISSDLVWTACG-----ATAPFNPVRSTTVADVPCTD 149

Query: 152 AQCKQVPNPTCGGGA--CAFNLTYGSSTIAAN----LSQDTISLATDIVPGYTFGCIQKA 205
             C+Q    TCG GA  CA+   YG    AAN    L  +  +     + G  FGC  K 
Sbjct: 150 DACQQFAPQTCGAGASECAYTYMYGGG--AANTTGLLGTEAFTFGDTRIDGVVFGCGLKN 207

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK--R 263
            G+     G++GLGRG+LSL++Q Q      FSY      ++     +  G    P+   
Sbjct: 208 VGDFSGVSGVIGLGRGNLSLVSQLQ---VDRFSYHFAPDDSVDTQSFILFGDDATPQTSH 264

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT-VFTRLV 322
              T LL +    SLYYV L  I+V  + + IP G        G+G +  S T + T L 
Sbjct: 265 TLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLE 324

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV-TLPQDN 377
             AY  +R     ++G      S  G D CY+   +A    P++ L+F+G  V  L   N
Sbjct: 325 EAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELELGN 384

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
                +   + CL +  +     SVL    ++ Q    ++YD+  S+L
Sbjct: 385 YFYMDSTTGLACLTILPSSAGDGSVL---GSLIQVGTHMMYDINGSKL 429


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 115/370 (31%), Positives = 173/370 (46%), Gaps = 44/370 (11%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPC-TG-CVGCSSTVFNSAQSTTFKNLGCQAAQC-- 154
           V   +GTP Q + M +DT ++ +W+ C TG     ++  F    S TF  + C +A+C  
Sbjct: 63  VSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAADSFRPRASATFAAVPCGSARCSS 122

Query: 155 KQVPNP-TCGGGA--CAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKA---T 206
           + +P P +C   +  C  +L+Y  GS++  A L+ D  ++         FGC+  A   +
Sbjct: 123 RDLPAPPSCDAASRRCRVSLSYADGSASDGA-LATDVFAVGDAPPLRSAFGCMSAAYDSS 181

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP-KRIK 265
            ++V   GLLG+ RG+LS + Q        FSYC+        +G L LG    P   + 
Sbjct: 182 PDAVATAGLLGMNRGALSFVTQAST---RRFSYCI---SDRDDAGVLLLGHSDLPFLPLN 235

Query: 266 YTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           YTPL +     P    + Y V LL IRVG + + IPP  L  + T    T++DSGT FT 
Sbjct: 236 YTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTF 295

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVP-------IVAPTITLMFS 367
           L+  AY+AV+  F ++    L             FDTC+ VP          P +TL+F+
Sbjct: 296 LLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFN 355

Query: 368 GMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
           G  +++  D LL         A  + CL    A D V     VI +  Q N  + YD+  
Sbjct: 356 GAQMSVAGDRLLYKVPGERRGADGVWCLTFGNA-DMVPLTAYVIGHHHQMNLWVEYDLER 414

Query: 423 SRLGVARELC 432
            R+G+A   C
Sbjct: 415 GRVGLAPVKC 424


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 151/363 (41%), Gaps = 34/363 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y+V   +GTP Q +   +DT +D  W  C  C  C      +F+   S++++ + C    
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGEL 163

Query: 154 CKQVPNPTCGG-GACAFNLTYGSSTIAANL---------SQDTISLATDIVPGYTFGCIQ 203
           C  + + +C     C +  +YG  T    +         S  +    T +     FGC  
Sbjct: 164 CNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGT 223

Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPI- 258
              G+     G++G GR  LSL++Q   L    FSYCL  +    K+    GSLR G   
Sbjct: 224 MNKGSLNNGSGIVGFGRAPLSLVSQ---LAIRRFSYCLTPYASGRKSTLLFGSLRGGVYD 280

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
                ++ T LL++ +  + YYV    + VG R + IP  A    P    G I+DSGT  
Sbjct: 281 AATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTAL 340

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSV-------PIVAPTITLMFSGM 369
           T   AP    V   FR ++          G D   C++        P V P +     G 
Sbjct: 341 TLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQGA 400

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           ++ LP+ N ++        CL +A + D+  +    I N  QQ+ R+LYD+    L  A 
Sbjct: 401 DLDLPRRNYVLDDQRKGNLCLLLADSGDSGTT----IGNFVQQDMRVLYDLEADTLSFAP 456

Query: 430 ELC 432
             C
Sbjct: 457 AQC 459


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 131/435 (30%), Positives = 188/435 (43%), Gaps = 70/435 (16%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEE---SVLEMLAKDQARLQFLSS-LAVARKSVVPI 85
           +S+   + H++ PCSP   S   +  +   S+ +M+  DQ R  ++   L  A     P+
Sbjct: 61  NSTWAPLHHLYGPCSPAPSSANSTAADVAASMADMVDDDQRRADYIQKRLTGATDDKQPM 120

Query: 86  A-SGR--QITQSPTYIVRAKI--------------------GTPAQTLLMAMDTSNDAAW 122
           A S R  Q  ++  Y     +                    GT A T  + +D+ +D +W
Sbjct: 121 AFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTTATTNSAPDGTSAVTQTVIIDSGSDVSW 180

Query: 123 VPCTGC--VGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPT---CGGGA-CAFNLTY 173
           V C  C    C      +F+ A STT+  + C +A C Q+  P    C   A C F + Y
Sbjct: 181 VQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPYRRGCSANAQCQFGINY 239

Query: 174 GS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQT 229
           G  ST     S D ++L   D++ G+ FGC     G++      G L LG GS SL+ QT
Sbjct: 240 GDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQT 299

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY------TPLLKNPRRSSLYYVNL 283
              Y   FSYCLP     S  G L LG    P+R +       TPLL +    + Y V L
Sbjct: 300 ATRYGRVFSYCLP--PTASSLGFLVLG--VPPERAQLIPSFVSTPLLSSSMAPTFYRVLL 355

Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
            AI V  R + +PP          A ++IDS T+ +RL   AY A+R  FR  +      
Sbjct: 356 RAIIVAGRPLAVPPAVFS------ASSVIDSSTIISRLPPTAYQALRAAFRSAMTMYRAA 409

Query: 344 TSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
             +   DTCY       I  P+I L+F  G  V L    +L+ S      CLA   AP  
Sbjct: 410 PPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLGS------CLAF--APTA 461

Query: 399 VNSVLNVIANMQQQN 413
            + +   I N+QQ+ 
Sbjct: 462 SDRMPGFIGNVQQKT 476



 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 72/231 (31%), Positives = 98/231 (42%), Gaps = 32/231 (13%)

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP-LLKN 272
           G   + R  L L   TQ  Y   FSYC+P   + S  G + LG    P+R    P  +  
Sbjct: 510 GPYDVDRQGLPLRTATQ--YGRVFSYCIP--PSPSSLGFITLG--VPPQRAALVPTFVST 563

Query: 273 PRRSS------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
           P  SS       Y V L AI V  R + +PP     +      ++I S TV +RL   AY
Sbjct: 564 PLLSSSSMPPTFYRVLLRAIIVAGRPLPVPPTVFSTS------SVIASTTVISRLPPTAY 617

Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIH 381
            A+R  FRR +    T   +   DTCY       I  P+I L+F  G  V L    +L+ 
Sbjct: 618 QALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLQ 677

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                  CLA   AP   + +   I N+QQ+   ++YDVP   +      C
Sbjct: 678 G------CLAF--APTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 124/455 (27%), Positives = 200/455 (43%), Gaps = 42/455 (9%)

Query: 1   MKPQLVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSWEESVL 59
           MKP + F LAF   +S+S   +   +      T+ + H  SP SPF  PS  L+  + ++
Sbjct: 1   MKPFVFFCLAF---YSVSSLFSTEANESPSGFTVDLIHRDSPLSPFYNPS--LTPSQRII 55

Query: 60  EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
               +  +RL  +S+L + + + +P      I  +  Y++R  IGTP    L   DT +D
Sbjct: 56  NAALRSISRLNRVSNL-LDQNNKLP--QSVLILHNGEYLMRFYIGTPPVERLATADTGSD 112

Query: 120 AAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQV--PNPTCG-GGACAFNLTY 173
             WV C+ C  C   S+ +F   +S+TF    C++  C  +      CG  G C +   Y
Sbjct: 113 LIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKGCGKSGECIYTYKY 172

Query: 174 GS--STIAANLSQDTI------SLATDIVPGYTFGCIQKATGNSVPPQ---GLLGLGRGS 222
           G   S     LS +T+       + T   P   FGC         P     G++GLG G 
Sbjct: 173 GDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGP 232

Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYY 280
           LSL++Q  +     FSYCL    + S +  L+ G   I   + +  TP++  P   + Y+
Sbjct: 233 LSLVSQIGDQIGHKFSYCLLPLGSTS-TSKLKFGNESIITGEGVVSTPMIIKPWLPTYYF 291

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           +NL A+ V ++ V  P G+      T    IIDSGT+ T L    Y       +  +   
Sbjct: 292 LNLEAVTVAQKTV--PTGS------TDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVE 343

Query: 341 LTVTSLGGFDTC--YSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
           L    L     C  Y    V P I   F+G  V+L   NL + +   +  CL +  AP +
Sbjct: 344 LVQDVLSPLPFCFPYRDNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMI--APSS 401

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           V+ + ++  +  Q + ++ YD+   ++      C+
Sbjct: 402 VSGI-SIFGSFSQIDFQVEYDLEGKKVSFQPTDCS 435


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 127/460 (27%), Positives = 191/460 (41%), Gaps = 56/460 (12%)

Query: 1   MKPQLVFFLAFLFLFSLS-----EGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSW 54
           M P +   LA   L +LS     EGL           ++ + H  SP SPF  PS  L+ 
Sbjct: 1   MHPWVFMILALFSLSTLSSREAREGL--------RGFSVDLIHRDSPSSPFYNPS--LTP 50

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
            E ++    +  +RLQ +S      K    +     I     Y++R  IG+P    L  +
Sbjct: 51  SERIINAALRSMSRLQRVSHFLDENK----LPESLLIPDKGEYLMRFYIGSPPVERLAMV 106

Query: 115 DTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCK--QVPNPTCGG-GACA 168
           DT +   W+ C+ C  C    + +F   +S+T+K   C +  C   Q     CG  G C 
Sbjct: 107 DTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCI 166

Query: 169 FNLTYGSSTIAAN-LSQDTISLA------TDIVPGYTFGC-----IQKATGNSVPPQGLL 216
           + + YG  + +   L  +T+S        T   P   FGC         T N V   G+ 
Sbjct: 167 YGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKV--MGIA 224

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG--PIGQPKRIKYTPLLKNPR 274
           GLG G LSL++Q        FSYCL  + + S S  L+ G   I     +  TPL+  P 
Sbjct: 225 GLGAGPLSLVSQLGAQIGHKFSYCLLPYDSTSTS-KLKFGSEAIITTNGVVSTPLIIKPS 283

Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
             + Y++NL A+ +G++VV            T    +IDSGT  T L    Y       +
Sbjct: 284 LPTYYFLNLEAVTIGQKVVS--------TGQTDGNIVIDSGTPLTYLENTFYNNFVASLQ 335

Query: 335 RRVGSNLTVTSLGGFDTCY--SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM 392
             +G  L         TC+     +  P I   F+G +V L   N+LI  T  +I CLA+
Sbjct: 336 ETLGVKLLQDLPSPLKTCFPNRANLAIPDIAFQFTGASVALRPKNVLIPLTDSNILCLAV 395

Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             +     S+   IA   Q + ++ YD+   ++  A   C
Sbjct: 396 VPSSGIGISLFGSIA---QYDFQVEYDLEGKKVSFAPTDC 432


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 110/428 (25%), Positives = 177/428 (41%), Gaps = 56/428 (13%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--VPIASGRQITQSP------------ 95
           K +S  E +   + + +AR    ++L+VAR     VP  S +Q  Q              
Sbjct: 45  KQMSRRELIRRAMQRSKARA---AALSVARSGSGRVPGKSAQQGEQHQQPGVPVRPSGDL 101

Query: 96  TYIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAA 152
            Y++   IGTP Q +   +DT +D  W    PC  C+     +F  A S+++  + C   
Sbjct: 102 EYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQ 161

Query: 153 QCKQVPNPTCGG-GACAFNLTYGSSTIAANL-SQDTISLATDI-----VPGYTFGCIQKA 205
            C  + + +C     C +   YG  T    + + +  + A+       VP   FGC    
Sbjct: 162 LCNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVP-LGFGCGTMN 220

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL--------SFSGSLRLGP 257
            G+     G++G GR  LSL++Q   L    FSYCL  + +         S S  +  G 
Sbjct: 221 VGSLNNGSGIVGFGRDPLSLVSQ---LSIRRFSYCLTPYTSTRKSTLMFGSLSDGVFEGD 277

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
                +++ T LL++ +  + YYV    + VG R + IP  A    P    G I+DSGT 
Sbjct: 278 DAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTA 337

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------------IVAPTITL 364
            T   A   T V   FR ++    T +S      C++ P             +  P +  
Sbjct: 338 LTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAF 397

Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
            F G ++ LP+ N ++        C+ +A + D+  +    I N  QQ+ R+LYD+    
Sbjct: 398 HFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGAT----IGNFVQQDMRVLYDLEAET 453

Query: 425 LGVARELC 432
           L  A   C
Sbjct: 454 LSFAPAQC 461


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 108/378 (28%), Positives = 163/378 (43%), Gaps = 46/378 (12%)

Query: 91  ITQSPTYIVRAKIGTP-AQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKN 146
           +  S  Y++   IGTP  Q + + MDT +D  W  CT C  C      +F+ + S+TF+ 
Sbjct: 81  VPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRA 140

Query: 147 LGCQAAQCKQVPNPTCGGGACAFNL-------TYGSSTIAAN-LSQDTISLATD------ 192
           + C    C+  P+      ACA          +YG  +I A  + +DT +  +       
Sbjct: 141 VACPDPICR--PSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAP 198

Query: 193 --IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
              V G  FGC    TG     + G+ G GRG LSL +Q   L    FSYCL S      
Sbjct: 199 PVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQ---LRVGRFSYCLTSHDETE- 254

Query: 250 SGSLRLGPIGQPKR---------IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
           S       +G P            + TP++ +P   + YY++L  I VG+  + +     
Sbjct: 255 SNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVF 314

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP--- 356
                   GT+IDSGT  T   A  +  +++ F  ++       TS  G   C+  P   
Sbjct: 315 ALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGNLLCFQRPKGG 374

Query: 357 --IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNH 414
             +  P +    +  ++ LP++N +   T   + CL +  A   V+ VL  I N QQQN 
Sbjct: 375 KQVPVPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGA--EVDMVL--IGNFQQQNM 430

Query: 415 RILYDVPNSRLGVARELC 432
            I+YDV NS+L  A   C
Sbjct: 431 HIVYDVENSKLLFASAQC 448


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 105/394 (26%), Positives = 174/394 (44%), Gaps = 34/394 (8%)

Query: 57  SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
           SV    A  +   + L+  A    +VVPI      TQ+  Y+    IGTP Q     +D 
Sbjct: 15  SVTARAAAFRVHGRLLADAATEGGAVVPI----HWTQAMNYVANFTIGTPPQPASAVIDL 70

Query: 117 SNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP--TCGGGACAFNL 171
           + +  W  C  C  C    + +F+   S T++   C    C+ +P+    C G  CA+  
Sbjct: 71  AGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTPLCESIPSDVRNCSGNVCAYEA 130

Query: 172 TYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP-PQGLLGLGRGSLSLLAQTQ 230
           +  +      +  DT ++ T       FGC+  +  +++  P G++GLGR   SL+ QT 
Sbjct: 131 STNAGDTGGKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTG 189

Query: 231 NLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY-VNLL 284
               + FSYCL    A     L    S +L   G+     +  +  N    S YY V L 
Sbjct: 190 ---VAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLE 246

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            ++ G  ++ +PP        +G+  ++D+ +  + LV  AY AV+      VG+    T
Sbjct: 247 GLKAGDAMIPLPP--------SGSTVLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMAT 298

Query: 345 SLGGFDTCY---SVPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM-AAAPDNV 399
            +  FD C+        AP +   F  G  +T+P  N L+    G++ CLAM ++A  N 
Sbjct: 299 PVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGTV-CLAMLSSARLNS 357

Query: 400 NSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            + L+++ ++QQ+N   L+D+    L      CT
Sbjct: 358 TTELSLLGSLQQENIHFLFDLDKETLSFEPADCT 391


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 117/429 (27%), Positives = 177/429 (41%), Gaps = 97/429 (22%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS---VVPIAS 87
           S  L +   + PCS    S+P S +E    +  +D++R+ F++S      S        +
Sbjct: 63  SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKCNQYTSGNLKNHAHN 118

Query: 88  GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTF 144
                +   ++V    GTP Q  ++ +DT +   W  C  CV C   S   FN + S+T+
Sbjct: 119 NNLFDEDGNFLVDVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASSTY 178

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCI 202
            +  C     +             +N+TYG  ST   N   DT++L  +D+   + FGC 
Sbjct: 179 SSGSCIPGTVEN-----------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCG 227

Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IG 259
           +   G+      G+LGLG+G LS ++QT + +   FSYCLP   ++   GSL  G     
Sbjct: 228 RNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSI---GSLLFGEKATS 284

Query: 260 QPKRIKYTPLLKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
           Q   +K+T L+  P   + S  Y+VNL  I VG   ++IP            GTIIDS T
Sbjct: 285 QSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRT 339

Query: 317 VFTRLVAPAYTAVRDVF------------RRRVGSNLTVTSLGGFDTCYSVPIVAPTITL 364
           V TRL   AY+A++  F            RR+ G  L        DTCY+          
Sbjct: 340 VITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL--------DTCYNX--------- 382

Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
                                          P+     L +I N QQ +  +LYD+   R
Sbjct: 383 --------------------------XXXXXPE-----LTIIGNRQQLSLTVLYDIQGGR 411

Query: 425 LGVARELCT 433
           +G     C+
Sbjct: 412 IGFRSNGCS 420


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 101/365 (27%), Positives = 175/365 (47%), Gaps = 38/365 (10%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQC 154
           +V   IGTP Q+  M +DT +  +W+ C   V      STVF+ + S++F  L C    C
Sbjct: 78  LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLC 137

Query: 155 K-QVPN---PT-CG-GGACAFNLTYGSSTIA-ANLSQDTISLAT-DIVPGYTFGCIQKAT 206
           K ++P+   PT C     C ++  Y   T+A  NL ++ I+ +T    P    GC + A+
Sbjct: 138 KPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLILGCAEDAS 197

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQPKR 263
            +    +G+LG+  G LS  +Q +    + FSYC+P+ +     + +GS  LG       
Sbjct: 198 DD----KGILGMNLGRLSFASQAK---ITKFSYCVPTRQVRPGFTPTGSFYLGENPNSAG 250

Query: 264 IKYTPLL---KNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            +Y  LL   ++ R  +L    + V L  IR+G + ++IP  A + +P+    ++IDSG+
Sbjct: 251 FQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGS 310

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGFDTCYS------VPIVAPTITLMFSG 368
            FT LV  AY  VR+   R  G  L    +  G  D C+         ++   +     G
Sbjct: 311 EFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSDMCFDGNAMEIGRLIGNMVFEFDKG 370

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
           + + + +  +L     G + C+ +  + + + +  N+I N  QQN  + +D+ N R+G  
Sbjct: 371 VEIVIEKGRVLA-DVGGGVHCVGIGRS-EMLGAASNIIGNFHQQNLWVEFDIANRRVGFG 428

Query: 429 RELCT 433
           +  C+
Sbjct: 429 KADCS 433


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 107/391 (27%), Positives = 157/391 (40%), Gaps = 68/391 (17%)

Query: 107 AQTLLMAMDTSNDAAWVPCT------------------------GCVGCSSTVFNSAQST 142
           +QTL + MDT +D  W PC+                          + C S   ++A ++
Sbjct: 102 SQTLSVYMDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISCKSRACSTAHNS 161

Query: 143 TFKNLGCQAAQC--KQVPNPTCGGGAC-AFNLTYGSSTIAANLSQDTISL-ATDIVP--- 195
              +  C  A+C   ++    C    C +F   YG  ++ A L +  + + +T   P   
Sbjct: 162 PSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNLIMPSTSNKPFSL 221

Query: 196 -GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSYCLPSFK----AL 247
             +TFGC   A G    P G+ G G GSLSL AQ  NL     + FSYCL S       L
Sbjct: 222 KDFTFGCAHSALGE---PIGVAGFGFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDSTKL 278

Query: 248 SFSGSLRLGPIGQPK-----RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
                L LG + +       +  YTP+L NP+    Y V++ AI VG   V  P   ++ 
Sbjct: 279 HHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSSRVRAPNALIRI 338

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL----TVTSLGGFDTCYSVP-- 356
           +     G ++DSGT +T L    Y +V     RRVG          S  G   CY +   
Sbjct: 339 DRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLEGN 398

Query: 357 ------IVAPTITLMFSG-MNVTLPQDNLLIHSTAGS-------ITCLAMAAAPDNVNSV 402
                 +V P +   F G  +V LP+ N       G        + CL +    D     
Sbjct: 399 GVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLMLMDGGDESEGG 458

Query: 403 LNV-IANMQQQNHRILYDVPNSRLGVARELC 432
               + N QQQ  +++YD+   R+G A   C
Sbjct: 459 PGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 121/450 (26%), Positives = 188/450 (41%), Gaps = 40/450 (8%)

Query: 7   FFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQ 66
           F+ + L LF        +  TQ++  ++++ H  S  SPF  +    ++     M     
Sbjct: 3   FYSSLLLLFCFCRV--SVSKTQNNGFSVELIHPISSKSPFYNTAESHFQRMSNNM-KHST 59

Query: 67  ARLQFLSSLAVARKSVVPIASGRQITQSP----TYIVRAKIGTPAQTLLMAMDTSNDAAW 122
            R+ +L+ +     + VP      I  SP     YI+   IGTP   L   MDT+ND  W
Sbjct: 60  NRVHYLNHVFSFPPNKVP-----NIVVSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIW 114

Query: 123 V---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTYGSS 176
               PC  C   +S +F+ ++S+T+K + C + +CK V N  C       C ++ TYG  
Sbjct: 115 FQCNPCKPCFNTTSPMFDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGE 174

Query: 177 TIA-ANLSQDTISLATDIVPGYTFGCIQKATG--NSVPPQGL----LGLGRGSLSLLAQT 229
             +  +LS DT++L ++     +F  I    G  N  P +G     +GLGRG LS ++Q 
Sbjct: 175 AYSQGDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQL 234

Query: 230 QNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL--YYVNLLAI 286
            +     FSYCL P F     SG L     G    +     +  P  +    Y   L A+
Sbjct: 235 NSSIGGKFSYCLVPLFSNEGISGKLHF---GDKSVVSGVGTVSTPITAGEIGYSTTLNAL 291

Query: 287 RVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL 346
            VG  ++       + N   G  TIIDSGT  T L    Y+ +  +    V      +  
Sbjct: 292 SVGDHIIKFENSTSK-NDNLG-NTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPN 349

Query: 347 GGFDTCYSVPIV---APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
             F  CY   +     P IT  F+G +V L   N   +     + C A  +     N   
Sbjct: 350 QQFKLCYKATLKNLDVPIITAHFNGADVHLNSLNTF-YPIDHEVVCFAFVSVG---NFPG 405

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
            +I N+ QQN  + +D+  + +      CT
Sbjct: 406 TIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 123/476 (25%), Positives = 198/476 (41%), Gaps = 85/476 (17%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAK 64
           L+  L+ +FL S +     I      S T ++ H+ SP SPF      +  E+    LAK
Sbjct: 16  LIIILSTVFLSSFA-----IIQADKFSFTAELIHIDSPNSPF-----FNASETTTHRLAK 65

Query: 65  DQARLQFLSSLAVARKSVVPIASGRQ------ITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
              R    S+  VAR  + P+++  +       +    Y+++  IGTP   +  A+DT +
Sbjct: 66  ALQR----SANRVAR--LNPLSNSDEGVHASIFSGDGNYLMKLLIGTPPTEIHAAIDTGS 119

Query: 119 DAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS 175
           +  W+PC  C  C   SS++FN   S+T+++  C + QC+   +       C ++     
Sbjct: 120 NVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKH 179

Query: 176 STIAAN--LSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLS 224
                N  ++ DT++L +       +P   F C     GNS+       G++GLGRG+LS
Sbjct: 180 QLNCPNGRIAVDTMTLTSSDGRPFPLPYSDFVC-----GNSIYKTFAGVGVIGLGRGALS 234

Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY--------------TPLL 270
           L ++  +L    FSYCL  + +             QP +I +              +  L
Sbjct: 235 LTSKLYHLSDGKFSYCLADYYS------------KQPSKINFGLQSFISDDDLEVVSTTL 282

Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
            + R S  YYV L  I VG +  D+      F P  G   +IDSGT+FT L    Y  + 
Sbjct: 283 GHHRHSGNYYVTLEGISVGEKRQDLYYVDDPFAPPVG-NMLIDSGTMFTLLPKDFYDYLW 341

Query: 331 DVFRRRVGSN-----------LTVTSLGGFDTC--YSVPIVAPTITLMFSGMNVTLPQDN 377
                 +  N            ++ +      C  Y   +  P IT+ F+  +V L  DN
Sbjct: 342 STVSYAIPENPQNHPHNSRFPFSMDNTLKLSPCFWYYPELKFPKITIHFTDADVELSDDN 401

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             I   A  + C A AA     ++V     + QQ N  + YD+    +   R  C+
Sbjct: 402 SFIR-VAEDVVCFAFAATQPGQSTVY---GSWQQMNFILGYDLKRGTVSFKRTDCS 453


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 108/372 (29%), Positives = 171/372 (45%), Gaps = 48/372 (12%)

Query: 96   TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
            T  V   +G+P Q + M +DT ++ +W+ C      +S VFN   S+++  + C +  C+
Sbjct: 999  TLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS-VFNPLSSSSYSPIPCSSPICR 1057

Query: 156  ----QVPNP-TCG-GGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA--- 205
                 +PNP TC     C   ++Y  +S++  NL+ D   + +  +PG  FGC+      
Sbjct: 1058 TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSS 1117

Query: 206  -TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL---PSFKALSFSGSLRLGPIGQP 261
             +       GL+G+ RGSLS + Q   L    FSYC+    S   L F G L L  +G  
Sbjct: 1118 NSEEDAKTTGLMGMNRGSLSFVTQ---LGLPKFSYCISGRDSSGVLLF-GDLHLSWLGN- 1172

Query: 262  KRIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
              + YTPL++     P    + Y V L  IRVG +++ +P      + T    T++DSGT
Sbjct: 1173 --LTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGT 1230

Query: 317  VFTRLVAPAYTAVRDVFRRRVGSNL------TVTSLGGFDTCYSVPI-----VAPTITLM 365
             FT L+ P YTA+R+ F  +    L           G  D CYSV         P+++LM
Sbjct: 1231 QFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLM 1290

Query: 366  FSGMNVTLPQDNLL-----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            F G  + +  + LL     +      + CL    + D +     VI +  QQN  + +D+
Sbjct: 1291 FRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNS-DLLGIEAFVIGHHHQQNVWMEFDL 1349

Query: 421  PNSRLGVARELC 432
                +  A +LC
Sbjct: 1350 ----VAFAADLC 1357


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 106/386 (27%), Positives = 151/386 (39%), Gaps = 55/386 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ-----------STTFK 145
           Y V    GTP Q L    DT +   W PCT    CS   F               S++ K
Sbjct: 132 YSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVK 191

Query: 146 NLGCQAAQCKQVPNPT--------------CGGGACAFNLTYGSSTIAANLSQDTISLAT 191
            +GC+  +C  +  P               C      + L YGS   A  L  +T+ L  
Sbjct: 192 VVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATAGILLSETLDLEN 251

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSF 249
             VP +  GC   +      P G+ G GRG  SL +Q +      FS+CL S  F     
Sbjct: 252 KRVPDFLVGCSVMSVHQ---PAGIAGFGRGPESLPSQMR---LKRFSHCLVSRGFDDSPV 305

Query: 250 SGSLRLGPIGQPKRIK-----YTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGA 299
           S  L L    +    K     Y P  +NP  S+      YY++L  I +G + V  P   
Sbjct: 306 SSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKY 365

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR---RVGSNLTVTSLGGFDTCYSVP 356
           L  + T   G IIDSG+ FT L  P + A+ D   +   +      V +  G   C+++P
Sbjct: 366 LVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNIP 425

Query: 357 IVA-----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM---AAAPDNVNSVLNVIA 407
                   P + L F  G  ++L  +N L   T   + CL M    A          ++ 
Sbjct: 426 KEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGGGGPAIILG 485

Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
             QQQN  + YD+   R+G  ++ CT
Sbjct: 486 AFQQQNVLVEYDLAKQRIGFRKQKCT 511


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 153/356 (42%), Gaps = 34/356 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y++R  +GTP+   L   DT +D +W+ CT C  C    + +F+  QS+T+ ++ C++  
Sbjct: 88  YLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQP 147

Query: 154 CKQVPNP--TCGGGA-CAFNLTYGSSTIA-ANLSQDTISLATD-------IVPGYTFGCI 202
           C   P     CG    C +   YG+ +     L  DTIS ++          P   FGC 
Sbjct: 148 CTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGCA 207

Query: 203 QKATGN---SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
             +      S    G +GLG G LSL +Q  +     FSYC+  F + S +G L+ G + 
Sbjct: 208 FYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTS-TGKLKFGSMA 266

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
               +  TP + NP   S Y +NL  I VG++ V             G   IIDS  + T
Sbjct: 267 PTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV--------LTGQIGGNIIIDSVPILT 318

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP--IVAPTITLMFSGMNVTLPQDN 377
            L    YT      +  +   +   +   F+ C   P  +  P     F+G +V L   N
Sbjct: 319 HLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLNFPEFVFHFTGADVVLGPKN 378

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           + I +   ++ C+ +  +       +++  N  Q N ++ YD+   ++  A   C+
Sbjct: 379 MFI-ALDNNLVCMTVVPSKG-----ISIFGNWAQVNFQVEYDLGEKKVSFAPTNCS 428


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 123/439 (28%), Positives = 179/439 (40%), Gaps = 69/439 (15%)

Query: 57  SVLEMLAKDQARLQFLSSLAVARKS----------------VVPIASG--RQITQ----- 93
           SVLE+  +D  R+Q L    +A+K+                  P+AS    Q  Q     
Sbjct: 85  SVLELQIRDLTRIQTLHKRVLAKKNQNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVATL 144

Query: 94  -------SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTT 143
                  S  Y +   +G+P +   + +DT +D  W+   PC  C   +   ++   S +
Sbjct: 145 ESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASAS 204

Query: 144 FKNLGCQAAQCKQV--PNP----TCGGGACAFNLTYGSS----------TIAANLSQDTI 187
           +KN+ C   +C  V  P+P         +C +   YG S          T   NL+    
Sbjct: 205 YKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGG 264

Query: 188 SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
           S     V    FGC     G      GLLGLGRG LS  +Q Q+LY  +FSYCL    + 
Sbjct: 265 SSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSD 324

Query: 248 SFSGSLRLGPIGQPKRIKYTPLL--------KNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
           +   S  +   G+ K +   P L        K     + YYV + +I V   V++IP   
Sbjct: 325 TNVSSKLI--FGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEET 382

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD-VFRRRVGSNLTVTSLGGFDTCYSV--- 355
              +     GTIIDSGT  +    PAY  +++ +  +  G           D C++V   
Sbjct: 383 WNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGI 442

Query: 356 -PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
             I  P + + F+ G     P +N  I      + CLA+   P    S  ++I N QQQN
Sbjct: 443 DSIQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAILGTP---KSAFSIIGNYQQQN 498

Query: 414 HRILYDVPNSRLGVARELC 432
             ILYD   SRLG A   C
Sbjct: 499 FHILYDTKRSRLGYAPTKC 517


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 166/386 (43%), Gaps = 56/386 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST--------VFNSAQSTTFK 145
           Y V    GTP+QT+    DT +    +PCT    C GC  +         F    S++ K
Sbjct: 90  YSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149

Query: 146 NLGCQAAQCKQV--PNPTCGG----------GACAFNLTYGSSTIAANLSQDTISLATDI 193
            +GCQ+ +C+ +  PN  C G          G   + L YG  + A  L  + +      
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLT 209

Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSG 251
           VP +  GC   +T     P G+ G GRG +SL +Q  NL +  FS+CL S  F   + + 
Sbjct: 210 VPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQ-MNLKR--FSHCLVSRRFDDTNVTT 263

Query: 252 SLRL------GPIGQPKRIKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGAL 300
            L L          +   + YTP  KNP  S+      YY+NL  I VGR+ V IP   L
Sbjct: 264 DLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYL 323

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV- 355
                   G+I+DSG+ FT +  P +  V + F  ++ SN T    +    G   C+++ 
Sbjct: 324 APGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM-SNYTREKDLEKETGLGPCFNIS 382

Query: 356 ---PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAA----PDNVNSVLNVIA 407
               +  P +   F  G  + LP  N           CL + +     P        ++ 
Sbjct: 383 GKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILG 442

Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
           + QQQN+ + YD+ N R G A++ C+
Sbjct: 443 SFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 134/448 (29%), Positives = 200/448 (44%), Gaps = 60/448 (13%)

Query: 10  AFLFLFSLSEGLNPI--CDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
           A L LFS + G  PI   D+ D+ +   V  + +       +   +  + +   LA+D A
Sbjct: 50  AALPLFSAASGEAPILELDSDDNGNASTVRFLLAH-REAFAAPNATAAQLLAHRLARDAA 108

Query: 68  RLQFLSSLA--VARKS---VVPIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           R + +S  A  V R       P+ SG  + Q S  Y     +GTP    L+ +DT +D  
Sbjct: 109 RAEAISVSARNVTRAGGGFSAPVVSG--LAQGSGEYFASVGVGTPPTPALLVLDTGSDVV 166

Query: 122 WV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTY 173
           W+   PC  C   S  VF+  +S ++  + C A  C+ +     GG       C + + Y
Sbjct: 167 WLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAY 226

Query: 174 GSSTI-AANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
           G  ++ A +L+ +T+  A    VP    GC     G  V   GLLGLGRG LSL  QT  
Sbjct: 227 GDGSVTAGDLATETLWFARGARVPRVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTAR 286

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
            Y   FSYC        F GS           + +  ++    R+   +V       G R
Sbjct: 287 RYGRRFSYC--------FQGS----------DLDHRTII----RTVHQHVG------GAR 318

Query: 292 VVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGGF 349
           V  +   +L+ +P+TG  G I+DSGT  TRL  P Y AVR+ FR   G   L       F
Sbjct: 319 VRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGGFSLF 378

Query: 350 DTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
           DTCY +     +  PT+++  + G  V LP +N LI        CLA+A     V    +
Sbjct: 379 DTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDGGV----S 434

Query: 405 VIANMQQQNHRILYDVPNSRLGVARELC 432
           ++ N+QQQ  R+++D    R+ +  + C
Sbjct: 435 IVGNIQQQGFRVVFDGDRQRVALVPKSC 462


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 170/364 (46%), Gaps = 39/364 (10%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-Q 156
           IV   IGTP QT  M +DT +  +W+ C        T F+   S++F  L C  + CK +
Sbjct: 79  IVSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSFSVLPCNHSLCKPR 138

Query: 157 VPNPT----CGGGA-CAFNLTYGSSTIA-ANLSQDTISL-ATDIVPGYTFGCIQKATGNS 209
           VP+ T    C     C ++  Y   T A  NL ++  +  ++   P    GC   ++   
Sbjct: 139 VPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPLILGCATDSSDT- 197

Query: 210 VPPQGLLG--LGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS---GSLRLGPIGQPKRI 264
              QG+LG  LGR S S LA+      S FSYC+P  ++ S S   GS  LGP       
Sbjct: 198 ---QGILGMNLGRLSFSSLAKI-----SKFSYCVPPRRSQSGSSPTGSFYLGPNPSSAGF 249

Query: 265 KYTPLL---KNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           KY  L+   ++ R  +L    Y + +L IR+  + ++I   A + +P+    T+IDSGT 
Sbjct: 250 KYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLIDSGTW 309

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGFDTCY--SVPIVAPTITLMF----SGM 369
           FT LV  AY+ V++   +  G  L    +  G  D C+     ++   I  M     +G+
Sbjct: 310 FTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAFEFENGV 369

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
            + + ++ +L     G + CL +  + D +    N+I N  QQ+  + +D+   R+G  R
Sbjct: 370 EIVVEREKMLA-DVGGGVQCLGIGRS-DLLGVASNIIGNFHQQDLWVEFDLVGRRVGFGR 427

Query: 430 ELCT 433
             C+
Sbjct: 428 TDCS 431


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 112/406 (27%), Positives = 161/406 (39%), Gaps = 73/406 (17%)

Query: 97  YIVRAKIGT-PAQTLLMAMDTSNDAAWVPCT--GCVGCSSTV------------FNSAQS 141
           Y +   +G+ P Q + + MDT +D  W PC    C+ C                  S+ S
Sbjct: 73  YTLSFNLGSHPPQPISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSAS 132

Query: 142 TTFKNLGCQAAQCKQVPNPTCGGGACAFNL----------------TYGSSTIAANLSQD 185
            + K+  C AA      +  C    C   L                 YG  ++ A L +D
Sbjct: 133 VSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLYRD 192

Query: 186 TISLATD---IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSY 239
           ++S+      ++  +TFGC   A G    P G+ G GRG LSL AQ  +      + FSY
Sbjct: 193 SLSMPASSPLVLHNFTFGCAHTALGE---PVGVAGFGRGVLSLPAQLASFSPHLGNQFSY 249

Query: 240 CL--PSFKA--LSFSGSLRLGPIG----QPKRIK-------YTPLLKNPRRSSLYYVNLL 284
           CL   SF A  +     L LG       + KR+        YT +L NP+    Y V L 
Sbjct: 250 CLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLE 309

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-- 342
            I VG R + +P    + +     G ++DSGT FT L A  Y ++   F  R+G      
Sbjct: 310 GITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRA 369

Query: 343 --VTSLGGFDTCYSVPIVA---PTITLMFSGMN-VTLPQDNLLIHSTAG--------SIT 388
             +    G   CY     A   P + L F G + V LP++N       G         + 
Sbjct: 370 TQIEERTGLGPCYYSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVG 429

Query: 389 CLAMAAAPDNVNS--VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           CL +    D   S      + N QQQ   ++YD+   R+G AR  C
Sbjct: 430 CLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 97/354 (27%), Positives = 153/354 (43%), Gaps = 29/354 (8%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--------FNSAQSTT 143
           T +  Y++   +GTP Q +   +D ++D  W+ C+ C  C +          F +  S+T
Sbjct: 92  TNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSST 151

Query: 144 FKNLGCQAAQCKQVPNPTCGGGA--CAFNLTYG---SSTIAANLSQDTISLATDIVPGYT 198
            + + C    C+++   TC      C ++  YG   ++T A  L+ D  + AT    G  
Sbjct: 152 IREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVI 211

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           FGC     G+     G++GLGRG LS ++Q Q      FSY L    A+     +     
Sbjct: 212 FGCAVATEGDI---GGVIGLGRGELSPVSQLQ---IGRFSYYLAPDDAVDVGSFILFLDD 265

Query: 259 GQPK--RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            +P+  R   TPL+ +    SLYYV L  IRV    + IP G          G ++    
Sbjct: 266 AKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITI 325

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV- 371
             T L A AY  VR     ++       S  G D CY+   +A    P++ L+F+G  V 
Sbjct: 326 PVTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVM 385

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
            L   N     +   + CL +  +P    S+L    ++ Q    ++YD+  SRL
Sbjct: 386 ELEMGNYFYMDSTTGLECLTILPSPAGDGSLLG---SLIQVGTHMIYDISGSRL 436


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 105/422 (24%), Positives = 169/422 (40%), Gaps = 48/422 (11%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAK------- 102
           K L   E +   + + +AR   LS +         IA  R+  + P   VRA        
Sbjct: 41  KELPKRELIRRAMQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVL 100

Query: 103 ---IGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQ 156
              +GTP Q +   +DT +D  W  C  C  C      +F+   S++++ + C    C  
Sbjct: 101 DLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGD 160

Query: 157 VPNPTC-GGGACAFNLTYGSSTIA------ANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           + + +C     C +  +YG  T           +  + S  T  VP   FGC     G+ 
Sbjct: 161 ILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSL 219

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-------PK 262
               G++G GR  LSL++Q   L    FSYCL  + A S   +L+ G +           
Sbjct: 220 NNASGIVGFGRDPLSLVSQ---LSIRRFSYCLTPY-ASSRKSTLQFGSLADVGLYDDATG 275

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
            ++ TP+L++ +  + YYV    + VG R + IP  A    P    G IIDSGT  T   
Sbjct: 276 PVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFP 335

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------------SVPIVAPTITLMFSGMN 370
           A     V   FR ++       S      C+            +  +  P +   F G +
Sbjct: 336 AAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGAD 395

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           + LP++N ++        C+ +  + D+  +    I N  QQ+ R++YD+    L  A  
Sbjct: 396 LDLPRENYVLEDHRRGHLCVLLGDSGDDGAT----IGNFVQQDMRVVYDLERETLSFAPV 451

Query: 431 LC 432
            C
Sbjct: 452 EC 453


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 115/442 (26%), Positives = 177/442 (40%), Gaps = 72/442 (16%)

Query: 57  SVLEMLAKDQARLQFLSSLAVAR----------------KSVVPIASGRQITQSPTYIVR 100
           S+ ++   D+ R+ F++S    R                   +P+ SG   T    Y VR
Sbjct: 42  SLADLARSDRQRMAFIASHGRRRTRETAAGSSSASSAAAAFAMPLTSG-AYTGIGQYFVR 100

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV------------FNSAQSTTFKNLG 148
            ++GTPAQ  L+  DT +D  WV C      +S++            F    S T+  + 
Sbjct: 101 FRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTWAPIS 160

Query: 149 CQAAQC-KQVPN--PTC--GGGACAFNLTYGSSTIA---ANLSQDTISLA-----TDIVP 195
           C +  C K +P    TC   G  CA++  Y   + A         TI+L+        + 
Sbjct: 161 CASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERKAKLK 220

Query: 196 GYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
           G   GC    TG S     G+L LG   +S  +   + +   FSYCL       + +  L
Sbjct: 221 GLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRNATSYL 280

Query: 254 RLGP---IGQPK-----------RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
             GP   +  P+           R + TPLL + R    Y V+L AI V    + IP   
Sbjct: 281 TFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLKIPRAV 340

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----- 354
             ++   G G I+DSGT  T L  PAY AV     + + + L   ++  F+ CY+     
Sbjct: 341 --WDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGL-AGLPRVTMDPFEYCYNWTSPS 397

Query: 355 ---VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
                +  P + + F+G     P     +   A  + C+ +   P      ++VI N+ Q
Sbjct: 398 GKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGP---WPGISVIGNILQ 454

Query: 412 QNHRILYDVPNSRLGVARELCT 433
           Q H   +D+ N RL   R  CT
Sbjct: 455 QEHLWEFDIKNRRLKFQRSRCT 476


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 110/384 (28%), Positives = 162/384 (42%), Gaps = 40/384 (10%)

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
           V  + SG  +  S  Y +   +G+P +   + +DT +D  W+ C  C  C   +   ++ 
Sbjct: 156 VATLESGMTLG-SGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDP 214

Query: 139 AQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGSS----------TIAANL 182
             S ++KN+ C   +C  V +P           +C +   YG S          T   NL
Sbjct: 215 KASASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNL 274

Query: 183 SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
           + +  S     V    FGC     G      GLLGLGRG LS  +Q Q+LY  +FSYCL 
Sbjct: 275 TTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 334

Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLL--------KNPRRSSLYYVNLLAIRVGRRVVD 294
              + +   S  +   G+ K +   P L        K     + YYV + +I V   V++
Sbjct: 335 DRNSDTNVSSKLI--FGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLN 392

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD-VFRRRVGSNLTVTSLGGFDTCY 353
           IP      +     GTIIDSGT  +    PAY  +++ +  +  G           D C+
Sbjct: 393 IPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCF 452

Query: 354 SVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
           +V     +  P + + F+ G     P +N  I      + CLAM   P    S  ++I N
Sbjct: 453 NVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTP---KSAFSIIGN 508

Query: 409 MQQQNHRILYDVPNSRLGVARELC 432
            QQQN  ILYD   SRLG A   C
Sbjct: 509 YQQQNFHILYDTKRSRLGYAPTKC 532


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 170/366 (46%), Gaps = 39/366 (10%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           IV   IGTP QT  M +DT +  +W+ C           +T F+ + S++F  L C    
Sbjct: 81  IVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPL 140

Query: 154 CK-QVPN----PTCGGGA-CAFNLTYGSSTIA-ANLSQDTISLAT-DIVPGYTFGCIQKA 205
           CK ++P+     TC     C ++  Y   T A  +L ++ I+ ++    P    GC + +
Sbjct: 141 CKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLILGCAEAS 200

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA---LSFSGSLRLGPIGQPK 262
           T      +G+LG+  G  S  +Q +    S FSYC+P+ +A   LS +GS  LG      
Sbjct: 201 TDE----KGILGMNLGRRSFASQAK---ISKFSYCVPTRQARAGLSSTGSFYLGNNPNSG 253

Query: 263 RIKY------TPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           R +Y      TP  ++P    L Y + +  IR+G   ++I     + +P+    TIIDSG
Sbjct: 254 RFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDSG 313

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYS------VPIVAPTITLMFS 367
           + FT LV  AY  VR+   R VG  L    + G   D C+         ++   +     
Sbjct: 314 SEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDMCFDGNPMEIGRLIGNMVFEFEK 373

Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           G+ + + +  +L     G + C+ +  + + + +  N+I N  QQN  + YD+ N R+G+
Sbjct: 374 GVEIVIDKWRVLA-DVGGGVHCIGIGRS-EMLGAASNIIGNFHQQNLWVEYDLANRRIGL 431

Query: 428 ARELCT 433
            +  C+
Sbjct: 432 GKADCS 437


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 114/370 (30%), Positives = 173/370 (46%), Gaps = 42/370 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   +G P +  L+ +DT +D  W+   PC  C   S  VF+ +QST+FK + C AA 
Sbjct: 87  YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 146

Query: 154 CKQVPNPTCGGGA-------CAFNLTYG-SSTIAANLSQDTISLATDIVPG------YTF 199
           C  V +  C   +       C +   YG SS  + +L+ +++S++    P          
Sbjct: 147 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 206

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN--LYQSTFSYCL-PSFKALSFSGSLRLG 256
           GC     G      GLLGLG+G+LS  +Q ++  + QS FSYCL      LS S ++  G
Sbjct: 207 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQS-FSYCLVDRTNNLSVSSAISFG 265

Query: 257 PIGQPKR----IKYTPLLK-NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
                 R    +K+TP ++ N    + YY+ +  I++ + ++ IP            GTI
Sbjct: 266 AGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGTI 325

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------SVPIVAPTITLM 365
           IDSGT  T L   AY AV   F  R+ S            CY      +VP   P ++++
Sbjct: 326 IDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRAAVPF--PALSIV 382

Query: 366 F-SGMNVTLPQDNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           F +G  + LPQ+N  I      +  CLA+          +++I N QQQN   LYDV ++
Sbjct: 383 FQNGAELDLPQENYFIQPDPQEAKHCLAILPT-----DGMSIIGNFQQQNIHFLYDVQHA 437

Query: 424 RLGVARELCT 433
           RLG A   C+
Sbjct: 438 RLGFANTDCS 447


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 106/385 (27%), Positives = 155/385 (40%), Gaps = 55/385 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----------AQSTTFK 145
           Y +   +GTP QT    +DT +   W PCT    CS   F +             S+T K
Sbjct: 92  YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAK 151

Query: 146 NLGCQAAQCKQV--------------PNPTCGGGACAFNLTYGSSTIAANLSQDTISLAT 191
            LGC+  +C  +               +  C     A+ + YG  + A  L  D ++   
Sbjct: 152 LLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGSTAGFLLLDNLNFPG 211

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
             VP +  GC   +      P G+ G GRG  SL +Q  NL +  FSYCL S +      
Sbjct: 212 KTVPQFLVGCSILSIRQ---PSGIAGFGRGQESLPSQ-MNLKR--FSYCLVSHRFDDTPQ 265

Query: 252 S----LRLGPIGQPKR--IKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGAL 300
           S    L++   G  K   + YTP   NP  ++      YY+ L  + VG + V IP   L
Sbjct: 266 SSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFL 325

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSVP 356
           +       GTI+DSG+ FT +  P Y  V   F +++  N +      +  G   C+++ 
Sbjct: 326 EPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNIS 385

Query: 357 ----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLNVIA 407
               +  P +T  F  G  +T P  N         + CL +     A P        ++ 
Sbjct: 386 GVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILG 445

Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
           N QQQN  I YD+ N R G     C
Sbjct: 446 NYQQQNFYIEYDLENERFGFGPRSC 470


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 109/438 (24%), Positives = 194/438 (44%), Gaps = 45/438 (10%)

Query: 9   LAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQAR 68
            + LFL S +   + +   +D+  T+++ H  SP SP   S    ++  ++  L +   R
Sbjct: 5   FSLLFLISTASVFSAVT-ARDYGFTVELIHRDSPKSPMYNSSETHFDR-IVNALRRSSHR 62

Query: 69  LQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PC 125
              +     A     PI           Y+V   +GTP  +++   DT +D  W    PC
Sbjct: 63  NTVVLESDTAE---API-----FNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPC 114

Query: 126 TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN-PTCGGGA-CAFNLTYGSSTIA-ANL 182
           + C   ++ +F+ ++STT+KN+ C +  C    +  +C   + C +++ YG  + +  NL
Sbjct: 115 SNCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNL 174

Query: 183 SQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQST 236
           + DT+++ +        P    GC     G  +    G++GLGRG  SL+ Q        
Sbjct: 175 AVDTVTMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGK 234

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIK-----YTPLLKNPRRSSLYYVNLLAIRVGRR 291
           FSYCL      S + S +L   G    +       TP+  + +  + Y + L A+ VG  
Sbjct: 235 FSYCLIPIGTGSTNDSTKLN-FGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDT 293

Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--- 348
             + P GA +    +    IIDSGT  T L     +A+ + F   +  ++++        
Sbjct: 294 KFNFPEGASKLGGES--NIIIDSGTTLTYLP----SALLNSFGSAISQSMSLPHAQDPSE 347

Query: 349 -FDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
             D C++        P +T+ F G +V L ++NL +  +  +I CLA  + PD+    + 
Sbjct: 348 FLDYCFATTTDDYEMPPVTMHFEGADVPLQRENLFVRLSDDTI-CLAFGSFPDD---NIF 403

Query: 405 VIANMQQQNHRILYDVPN 422
           +  N+ Q N  + YD+ N
Sbjct: 404 IYGNIAQSNFLVGYDIKN 421


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 104/394 (26%), Positives = 174/394 (44%), Gaps = 34/394 (8%)

Query: 57  SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
           SV    A  +   + L+  A    +VVPI      TQ+  Y+    IGTP Q     +D 
Sbjct: 15  SVTARAAAFRVHGRLLADAATEGGAVVPI----HWTQAMNYVANFTIGTPPQPASAVIDL 70

Query: 117 SNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGACAFNL 171
           + +  W  C  C  C    + +F+   S T++   C    C+ +P+ +  C G  CA+  
Sbjct: 71  AGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPSDSRNCSGNVCAYQA 130

Query: 172 TYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQ 230
           +  +      +  DT ++ T       FGC+  +  +++  P G++GLGR   SL+ QT 
Sbjct: 131 STNAGDTGGKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTG 189

Query: 231 NLYQSTFSYCLPSFK-----ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY-VNLL 284
               + FSYCL         AL    S +L   G+     +  +  N    S YY V L 
Sbjct: 190 ---VAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLE 246

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            ++ G  ++ +PP        +G+  ++D+ +  + LV  AY AV+      VG+    T
Sbjct: 247 GLKAGDAMIPLPP--------SGSTVLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMAT 298

Query: 345 SLGGFDTCY---SVPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM-AAAPDNV 399
            +  FD C+        AP +   F  G  +T+   N L+    G++ CLAM ++A  N 
Sbjct: 299 PVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVAASNYLLDYKNGTV-CLAMLSSARLNS 357

Query: 400 NSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            + L+++ ++QQ+N   L+D+    L      CT
Sbjct: 358 TTELSLLGSLQQENIHFLFDLDKETLSFEPADCT 391


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 155/386 (40%), Gaps = 57/386 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----------AQSTTFK 145
           Y +   +GTP QT    +DT +   W PCT    CS   F +             S+T K
Sbjct: 88  YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147

Query: 146 NLGCQ---------------AAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLA 190
            LGC+                 QCK+  +  C     ++ + YG    A  L  D ++  
Sbjct: 148 LLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNLNFP 207

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
              VP +  GC   +      P G+ G GRG  SL +Q  NL +  FSYCL S +     
Sbjct: 208 GKTVPQFLVGCSILSIRQ---PSGIAGFGRGQESLPSQ-MNLKR--FSYCLVSHRFDDTP 261

Query: 251 GS----LRLGPIGQPKR--IKYTPLLKNPRRSSL----YYVNLLAIRVGRRVVDIPPGAL 300
            S    L++   G  K   + YTP   NP  +S+    YYV L  + VG   V IP   L
Sbjct: 262 QSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFL 321

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSVP 356
           +       GTI+DSG+ FT +  P Y  V   F R++G   +    V +  G   C+++ 
Sbjct: 322 EPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFNIS 381

Query: 357 ----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM-----AAAPDNVNSVLNVI 406
               I  P  T  F  G  ++ P  N         + C  +     A  P      + ++
Sbjct: 382 GVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGPAI-IL 440

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N QQQN  + YD+ N R G     C
Sbjct: 441 GNYQQQNFYVEYDLENERFGFGPRNC 466


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 102/352 (28%), Positives = 154/352 (43%), Gaps = 31/352 (8%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
           T +  Y+    IGTP Q +  A+D S+D  W  C      ++  FN  +STT  ++ C  
Sbjct: 95  TNAGMYVFSYGIGTPPQQVSGALDISSDLVWTACG-----ATAPFNPVRSTTVADVPCTD 149

Query: 152 AQCKQVPNPTCGGGA------CAFNLTYGSSTIAAN----LSQDTISLATDIVPGYTFGC 201
             C+Q    TCG GA      CA+   YG    AAN    L  +  +     + G  FGC
Sbjct: 150 DACQQFAPQTCGAGAGAGSSECAYTYMYGGG--AANTTGLLGTEAFTFGDTRIDGVVFGC 207

Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
             +  G+     G++GLGRG+LSL++Q Q      FSY      ++     +  G    P
Sbjct: 208 GLQNVGDFSGVSGVIGLGRGNLSLVSQLQ---VDRFSYHFAPDDSVDTQSFILFGDDATP 264

Query: 262 K--RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT-VF 318
           +      T LL +    SLYYV L  I+V  + + IP G        G+G +  S T + 
Sbjct: 265 QTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLV 324

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV-TL 373
           T L   AY  +R     ++G      S  G D CY+   +A    P++ L+F+G  V  L
Sbjct: 325 TVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMEL 384

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
              N     +   + CL +  +     SVL    ++ Q    ++YD+  S+L
Sbjct: 385 ELGNYFYMDSTTGLACLTILPSSAGDGSVL---GSLIQVGTHMMYDINGSKL 433


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 134/419 (31%), Positives = 186/419 (44%), Gaps = 72/419 (17%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
           S+ L++ H   PC+P + S   +   SV + L  DQ R +++             S A A
Sbjct: 65  SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---- 134
             + VP + G  I  +  Y+V A +GTP     M +DT +D +WV C  C    S     
Sbjct: 123 AVATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQK 181

Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD 192
             +F+ AQS+++            VP   CGG  CA    Y         +    +    
Sbjct: 182 DPLFDPAQSSSYA----------AVP---CGGPVCAGLGIY--------AASACSAAQCG 220

Query: 193 IVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
            V G+ FGC    +G  N V   GLLGLGR   SL+ QT   Y   FSYCLP+    S +
Sbjct: 221 AVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKP--STA 276

Query: 251 GSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
           G L L   GP G       T LL +P   + Y V L  I VG + + +P  A        
Sbjct: 277 GYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGG---- 332

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP----IVAPT 361
             T++D+GTV TRL   AY A+R  FR  + S    T  S G  DTCY+      +  P 
Sbjct: 333 --TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 390

Query: 362 ITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           + L F SG  VTL  D +L      S  CLA   AP   +  + ++ N+QQ++  +  D
Sbjct: 391 VALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILGNVQQRSFEVRID 441


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 111/349 (31%), Positives = 167/349 (47%), Gaps = 41/349 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQ 153
           Y +   +GTP QTL    DT +D  W  C  C  C+   S  +   +S++F  L C +A 
Sbjct: 81  YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSAL 140

Query: 154 CKQVPN---PTCGG-----GACAFNLTYGSSTIAANLSQ-----DTISLATDIVPGYTFG 200
           C+ + +    TCGG       C++  +YG S+   + +Q     +T +L +D V G  FG
Sbjct: 141 CRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGFG 200

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
           C   + G      GL+GLGRG LSL+ Q   L    FSYCL S  + S       G +  
Sbjct: 201 CTTMSEGGYGSGSGLVGLGRGKLSLVRQ---LKVGAFSYCLTSDPSTSSPLLFGAGALTG 257

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFT 319
           P  ++ TPL+ N + S+ Y VNL +I +G              P TG  G I DSGT  T
Sbjct: 258 PG-VQSTPLV-NLKTSTFYTVNLDSISIGAAKT----------PGTGRHGIIFDSGTTLT 305

Query: 320 RLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY--SVPIVAPTITLMFSGMNVTLPQD 376
            L  PAYT        +  +NLT V    G++ C+  S   V P++ L F G ++ L  +
Sbjct: 306 FLAEPAYTLAEAGLLSQT-TNLTRVPGTDGYEVCFQTSGGAVFPSMVLHFDGGDMALKTE 364

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           N    +   S++C  +  +P    S ++++ N+ Q ++ I YD+  S L
Sbjct: 365 NYF-GAVNDSVSCWLVQKSP----SEMSIVGNIMQMDYHIRYDLDKSVL 408


>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
 gi|238008190|gb|ACR35130.1| unknown [Zea mays]
 gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
          Length = 269

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 76/248 (30%), Positives = 116/248 (46%), Gaps = 19/248 (7%)

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-----KALSFSGS 252
           TFGC +   G      G++G+  G LS+L Q   L  + FSYCL  F       + F   
Sbjct: 25  TFGCGKLTNGTIAGASGIMGVSPGPLSVLKQ---LSITKFSYCLTPFTDHKTSPVMFGAM 81

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
             LG      +++  PLLKNP     YYV ++ I +G + +D+P   L   P    GT++
Sbjct: 82  ADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGSKRLDVPEAILALRPDGTGGTVL 141

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITLM 365
           DS T    LV PA+  ++      +       S+  +  C+ +P       +  P + L 
Sbjct: 142 DSATTLAYLVEPAFKELKKAVMEGMKLPAANRSIDDYPVCFELPRGMSMEGVQVPPLVLH 201

Query: 366 FSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           F+G   ++LP+D+     + G + CLA+  AP       NVI N+QQQN  +LYD+ N +
Sbjct: 202 FAGDAEMSLPRDSYFQEPSPG-MMCLAVMQAP--FEGAPNVIGNVQQQNMHVLYDLGNRK 258

Query: 425 LGVARELC 432
              A   C
Sbjct: 259 FSYAPTKC 266


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 109/398 (27%), Positives = 159/398 (39%), Gaps = 77/398 (19%)

Query: 106 PAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVFNSAQST-------TFKNLGCQAAQC-- 154
           P Q + + +DT +D  W PC    C+ C     N+  ST       T +++ C+++ C  
Sbjct: 92  PPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACSA 151

Query: 155 ------------------KQVPNPTCGGGAC-AFNLTYGSSTIAANLSQDTISLATDI-- 193
                             + +    C   +C +F   YG  ++ A L  D+I L      
Sbjct: 152 AHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATPS 211

Query: 194 --VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSYCLPSFKALS 248
             +  +TFGC   A      P G+ G GRG LSL AQ  +      + FSYCL S    S
Sbjct: 212 LSLHNFTFGCAHTALAE---PVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSH---S 265

Query: 249 F-SGSLRL-GPI------GQPKRIK-------YTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           F S  LRL  P+       + KR+        YT +L NP+    Y V L  I +G++ +
Sbjct: 266 FNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKI 325

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL----TVTSLGGF 349
             P    + +     G ++DSGT FT L A  Y +V   F  RVG        V    G 
Sbjct: 326 PAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTGL 385

Query: 350 DTCYSVPIVA--PTITLMFSG--MNVTLPQDNLLIHSTAGS--------ITCLAMAAAPD 397
             CY    V   P++ L F G   +V LP+ N       G         + CL +    +
Sbjct: 386 GPCYYYDTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGE 445

Query: 398 NVNSVLN---VIANMQQQNHRILYDVPNSRLGVARELC 432
                      + N QQ    ++YD+   R+G AR  C
Sbjct: 446 EAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 483


>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 462

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 116/425 (27%), Positives = 201/425 (47%), Gaps = 51/425 (12%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG-- 88
           S  L + + + PCS     K  S ++  L+    D++R++ +++  + + S      G  
Sbjct: 61  SQGLPITYSYGPCSQLGQKKSPSRQQIFLQ----DRSRVRSINARILGQYSTEESKDGGS 116

Query: 89  ----RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GCSST---VFNSA 139
                 + +   ++V    G P Q L + +DT +D  W+ C  C    C +     FN +
Sbjct: 117 PESMHSLNEDGFFLVNVGFGKPQQNLNLIIDTGSDTTWIRCNSCSLGNCHNKKIPTFNPS 176

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQ-DTISLATDIVPGYT 198
            S+++ N  C       +P+         + + Y  ++ +  +   D ++L  D+ P + 
Sbjct: 177 LSSSYSNRSC-------IPSTKTN-----YTMNYEDNSYSKGVFVCDEVTLKPDVFPKFQ 224

Query: 199 FGCIQKATGNSVPPQGLLGLGRGS-LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
           FGC     G+     G+LGL +G   SL++QT + ++  FSYC P  +  +  GSL  G 
Sbjct: 225 FGCGDSGGGDFGSASGVLGLAQGEQYSLISQTASKFKKKFSYCFPHNE--NTRGSLLFGE 282

Query: 258 --IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
             I     +K+T LL NP   S+Y+V L+ I V ++ +++   +L  +P    GTIIDSG
Sbjct: 283 KAISASPSLKFTRLL-NPSSGSVYFVELIGISVAKKRLNVS-SSLFASP----GTIIDSG 336

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVT---SLGGFDTCYSVP------IVAPTITLMF 366
           TV T L   AY A+R  F++ +    +V+        DTCY++       I  P I L F
Sbjct: 337 TVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHF 396

Query: 367 SG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
            G ++V+L    +L     G +T   +A A  +  S + +I N QQ + +++YD+   RL
Sbjct: 397 VGEVDVSLHPSGILW--ANGDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRL 454

Query: 426 GVARE 430
           G   +
Sbjct: 455 GFGND 459


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 118/399 (29%), Positives = 166/399 (41%), Gaps = 65/399 (16%)

Query: 97  YIVRAKIGTP--AQTLLMAMDTSNDAAWVPCTG-----CVG------------------- 130
           Y +   +G P  A ++ + +DT +D  W PC       C G                   
Sbjct: 88  YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDSR 147

Query: 131 ---CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYGSSTIAANLSQ 184
              C+S + ++A S+   +  C AA+C    +   +C   AC      YG  ++ ANL +
Sbjct: 148 RISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANLRR 207

Query: 185 DTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-- 241
             + LA  + V  +TF C   A      P G+ G GRG LSL AQ        FSYCL  
Sbjct: 208 GRVGLAASMAVENFTFACAHTALAE---PVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVA 264

Query: 242 PSFKA--LSFSGSLRLG------PIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
            SF+A  L  S  L LG       IG  +    YTPLL NP+    Y V L A+ VG + 
Sbjct: 265 HSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKR 324

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG----- 347
           +   P     +     G ++DSGT FT L +  +  V D F R + +     + G     
Sbjct: 325 IQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQT 384

Query: 348 GFDTCYSV---PIVAPTITLMFSG-MNVTLPQDNLLI--HSTAG-SITCLAMAAAPDNVN 400
           G   CY         P + L F G   V LP+ N  +   S  G S+ CL +     N +
Sbjct: 385 GLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNND 444

Query: 401 SVLN------VIANMQQQNHRILYDVPNSRLGVARELCT 433
              +       + N QQQ   ++YDV   R+G AR  CT
Sbjct: 445 DGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 104/422 (24%), Positives = 168/422 (39%), Gaps = 48/422 (11%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAK------- 102
           K L   E +   + + +AR   LS +         IA  R+  + P   VRA        
Sbjct: 41  KELPKRELIRRAMQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVL 100

Query: 103 ---IGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQ 156
              +GTP Q +   +DT +D  W  C  C  C      +F+   S++++ + C    C  
Sbjct: 101 DLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGD 160

Query: 157 VPNPTC-GGGACAFNLTYGSSTIA------ANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           + + +C     C +  +YG  T           +  + S  T  VP   FGC     G+ 
Sbjct: 161 ILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSL 219

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-------PK 262
               G++G GR  LSL++Q   L    FSYCL  + A S   +L+ G +           
Sbjct: 220 NNASGIVGFGRDPLSLVSQ---LSIRRFSYCLTPY-ASSRKSTLQFGSLADVGLYDDATG 275

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
            ++ TP+L++ +  + YYV    + VG R + IP  A    P    G IIDSGT  T   
Sbjct: 276 PVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFP 335

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------------SVPIVAPTITLMFSGMN 370
                 V   FR ++       S      C+            +  +  P +   F G +
Sbjct: 336 VAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGAD 395

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           + LP++N ++        C+ +  + D+  +    I N  QQ+ R++YD+    L  A  
Sbjct: 396 LDLPRENYVLEDHRRGHLCVLLGDSGDDGAT----IGNFVQQDMRVVYDLERETLSFAPV 451

Query: 431 LC 432
            C
Sbjct: 452 EC 453


>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 342

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 78/249 (31%), Positives = 128/249 (51%), Gaps = 20/249 (8%)

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           FGC   + G+ V   GL+GL  G++SL++Q   L    FSYCL  F     S  L  G +
Sbjct: 96  FGCGALSAGSLVGASGLMGLSPGTMSLISQ---LSVPRFSYCLTPFAERKTSPML-FGAM 151

Query: 259 GQPKR------IKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
              ++      I+ T +L+NP   + YY V L+ + +G + + +P  +L  NP    GTI
Sbjct: 152 ADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINPDGTGGTI 211

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITL 364
           +DSG+    L   A+ AV+      V   +   ++  ++ C++VP       +  P + L
Sbjct: 212 VDSGSTMAHLAGKAFDAVKKAVLEAVKLPVFNGTVEDYELCFAVPSGVAMAAVKTPPLVL 271

Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            F  G  + LP+DN      AG + CLA+A +P+++ + +++I N+QQQN  +L+DV N 
Sbjct: 272 HFDGGAAMALPRDNYFQEPRAG-LMCLAVARSPEDLGAPISIIGNVQQQNMHVLFDVHNQ 330

Query: 424 RLGVARELC 432
           +   A   C
Sbjct: 331 KFSFAPTKC 339


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 162/381 (42%), Gaps = 62/381 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y+V+   GTP      A+DT++D  W+ C  CV C      VFN   S+++  + C +  
Sbjct: 92  YLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDT 151

Query: 154 CKQVPNPTC---GGGACAFNLTY-GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN- 208
           C Q+    C     GAC +   Y G       L+ D +++  D+     FGC   + G  
Sbjct: 152 CAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGP 211

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP-----KR 263
           +    GL+GLGRG LSL++Q   L    F YCLP   + + SG L LG           R
Sbjct: 212 AAQASGLVGLGRGPLSLVSQ---LSVHRFMYCLPPPMSRT-SGKLVLGAGADAVRNMSDR 267

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT----------------- 306
           +  T +  + R  S YY+NL  + VG    D  PG  + N T+                 
Sbjct: 268 VTVT-MSSSTRYPSYYYLNLDGLAVG----DQTPGTTR-NATSPPSGGAGGGGGGGGGGI 321

Query: 307 -------GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG-GFDTCYSVP-- 356
                    G I+D  +  + L    Y  + D     +       SL  G D C+ +P  
Sbjct: 322 VGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEIRLPRATPSLRLGLDLCFILPEG 381

Query: 357 -----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
                +  PT++L F G  + L +D L +  T G + CL +        S ++++ N Q 
Sbjct: 382 VGMDRVYVPTVSLSFDGRWLELDRDRLFV--TDGRMMCLMIGR-----TSGVSILGNFQL 434

Query: 412 QNHRILYDVPNSRLGVARELC 432
           QN R+L+++   ++  A+  C
Sbjct: 435 QNMRVLFNLRRGKITFAKASC 455


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 106/400 (26%), Positives = 173/400 (43%), Gaps = 62/400 (15%)

Query: 76  AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST- 134
           A  RK+VV  A    + +   Y+V+  IGTP      A+DT++D  W+ C  CV C    
Sbjct: 69  ARNRKAVVGEAP--LVPRGGEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQL 126

Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTIS 188
             +FN   S+++  + C +  C Q+    C      AC +N  Y G++     L+ D ++
Sbjct: 127 DPIFNPRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLA 186

Query: 189 LATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
           +  ++      GC   + G   PPQ  GL+GL RG LSLL+Q   L    F YCLP   +
Sbjct: 187 VGGNVFHAVVLGCSDSSVGGP-PPQASGLVGLARGPLSLLSQ---LSVRRFMYCLPPPMS 242

Query: 247 LSFSGSLRLGPIGQPKRIKYTP------LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
            +  G L LG       ++         +  + R  S YY+N   + VG    D  PG +
Sbjct: 243 RT-PGKLVLGAGAGADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVG----DQTPGTI 297

Query: 301 QFNPTT--------------------GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-S 339
           +  PT+                      G I+D  +  + L A  Y  + D     +   
Sbjct: 298 R-RPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEIRLP 356

Query: 340 NLTVTSLGGFDTCYSVP-------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM 392
             T ++  G D C+ +P       +  PT+++ F G  + L +D L +    G + CL +
Sbjct: 357 RATPSTRLGLDLCFILPEGVGIDRVYVPTVSMSFDGRWLELERDRLFLED--GRMMCLMI 414

Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                   S ++++ N QQQN  +LY++   ++  A+  C
Sbjct: 415 GR-----TSGVSILGNYQQQNMHVLYNLRRGKITFAKASC 449


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 119/432 (27%), Positives = 182/432 (42%), Gaps = 63/432 (14%)

Query: 51  PLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT------QSPTYIVRAKIG 104
           PLS   S L+    +   L  LSSL+ AR    P     ++T          Y V   +G
Sbjct: 24  PLSISPSALDKW--ESINLAALSSLSRARHLKRPPTLTGKVTLPAYPRSYGGYSVIFSLG 81

Query: 105 TPAQTLLMAMDTSNDAAWVPCT------GCVGCSST--------VFNSAQSTTFKNLGCQ 150
           TP Q + + +DT +   W PCT       C  C+ +        ++   +S+T ++L C+
Sbjct: 82  TPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCR 141

Query: 151 AAQCKQV----PNPTCGGGACAFNLTYGSSTIAANLSQDTISLAT-DIVPGYTFGCIQKA 205
           + +C  V     N +       + L YG  +    L  D + L+  + +P + FGC   +
Sbjct: 142 SPKCNWVFGSDLNCSTTKRCPYYGLEYGLGSTTGQLVSDVLGLSKLNRIPDFLFGC---S 198

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSL-----RLGPI 258
             ++  P+G+ G GRG  S+ AQ   L  + FSYCL S  F     SG L     R    
Sbjct: 199 LVSNRQPEGIAGFGRGLASIPAQ---LGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHAD 255

Query: 259 GQPKRIKYTPLLKNPR---RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
                + Y P  K+P     S  YY++L  I VG + V IPP  L  +     G I+DSG
Sbjct: 256 AAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSG 315

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLT-------VTSLGGFDTCYSV----PIVAPTITL 364
           + FT +       + D   R +  ++T       +    G   CY++     +  P +T 
Sbjct: 316 STFTFMER----IIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVPKLTF 371

Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDV 420
            F  G N+ LP  +     T G + C+ +   PD   S      ++ N QQQN  I YD+
Sbjct: 372 SFKGGANMDLPLTDYFSLVTDG-VVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEYDL 430

Query: 421 PNSRLGVARELC 432
              R G   + C
Sbjct: 431 KKQRFGFKPQQC 442


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 109/354 (30%), Positives = 149/354 (42%), Gaps = 42/354 (11%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC---KQVPN 159
           +GTP   + + ++  N+  W        C    F   +  TF   G   A C   K  PN
Sbjct: 1   MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSR-GLPFASCGSPKFWPN 59

Query: 160 PTCGGGACAFNLTYGSSTIAAN-LSQDTISL--ATDIVPGYTFGCIQKATGNSVPPQ-GL 215
            TC      +  +YG  ++    L  D  +   A   VPG  FGC     G     + G+
Sbjct: 60  QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNETGI 114

Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYC-------LPSFKALSFSGSLRLGPIGQPKRIKYTP 268
            G GRG LSL +Q   L    FS+C       +PS   L     L     G    ++ TP
Sbjct: 115 AGFGRGPLSLPSQ---LKVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQG---AVQTTP 168

Query: 269 LL---KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
           L+   KN    +LYY++L  I VG   + +P  A      TG GTIIDSGT  T L    
Sbjct: 169 LIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTG-GTIIDSGTSITSLPPQV 227

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIH 381
           Y  VRD F  ++   +   +  G  TC+S P  A    P + L F G  + LP++N +  
Sbjct: 228 YQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFE 287

Query: 382 ---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                  SI CLA+     N      +I N QQQN  +LYD+ N+ L      C
Sbjct: 288 VPDDAGNSIICLAI-----NKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 114/370 (30%), Positives = 157/370 (42%), Gaps = 47/370 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-----CVGCSSTVFNSAQSTTFKNLGCQA 151
           YI    +G P Q     +DT +   W  CT      CV      FN++ S +F  + CQ 
Sbjct: 86  YIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQD 145

Query: 152 AQCKQVPNPTCG-GGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV 210
             C       C   G C F +TYG+  I   L  D  +  +       FGC+   T  + 
Sbjct: 146 KACAGNYLHFCALDGTCTFRVTYGAGGIIGFLGTDAFTFQSGGAT-LAFGCV-SFTRFAA 203

Query: 211 P-----PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPI----GQ 260
           P       GL+GLGRG LSL +QT       FSYCL P F     S  L +G      G 
Sbjct: 204 PDVLHGASGLIGLGRGRLSLASQTG---AKRFSYCLTPYFHNNGASSHLFVGAAASLSGG 260

Query: 261 PKRIKYTPLLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTT----GAGTIID 313
              +     +++P+    S+ YY+ L+ I VG   + IP  A             G IID
Sbjct: 261 GGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIID 320

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLT---VTSLGGFDTCYS---VPIVAPTITLMFS 367
           SG+ FT LV  AY  +     R++  +L        GG   C +   +  V PT+ L FS
Sbjct: 321 SGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPTLVLHFS 380

Query: 368 -GMNVTLPQDNL---LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            G ++ LP +N    L  STA    C+A+         + ++I N QQQN  IL+DV   
Sbjct: 381 GGADMALPPENYWAPLEKSTA----CMAIVRG-----YLQSIIGNFQQQNMHILFDVGGG 431

Query: 424 RLGVARELCT 433
           RL      C+
Sbjct: 432 RLSFQNADCS 441


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 179/382 (46%), Gaps = 53/382 (13%)

Query: 88  GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTF 144
           GR+  +   Y    K+G+P Q  ++ +DT ++  W+ C  C  C+    T++++A+S ++
Sbjct: 94  GRKFGE---YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSY 150

Query: 145 KNLGCQAAQ-CKQVPNPT---CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI----- 193
           K + C  +Q C      T   C  G+ C F   YG  + +  +LS DT+ + T +     
Sbjct: 151 KPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210

Query: 194 -VPGYTFGCIQKA-----TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA- 246
            V  + FGC Q       TG S    G+LGL  G ++L  Q    +   FS+C P   + 
Sbjct: 211 TVQDFAFGCAQGDLELVPTGAS----GILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSH 266

Query: 247 LSFSGSLRLGPIGQP-KRIKYT--PLLKNPRRSSLYYVNLLAIRVG-RRVVDIPPGALQF 302
           L+ +G +  G    P ++++YT   L  +  +   Y+V L  + +    +V +P G++  
Sbjct: 267 LNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSV-- 324

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF-RRRVGS--NLTVTSLGGFDTCYSVP--- 356
                   I+DSG+ F+  V P ++ +R+ F + R  S  +L   S G   TC+ V    
Sbjct: 325 -------VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDD 377

Query: 357 -----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
                   P+++L+F  G+ + +P   +L+             A  D   + +NVI N Q
Sbjct: 378 IDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQ 437

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQN  + YD+  SR+G AR  C
Sbjct: 438 QQNLWVEYDIQRSRVGFARASC 459


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 118/399 (29%), Positives = 166/399 (41%), Gaps = 65/399 (16%)

Query: 97  YIVRAKIGTP--AQTLLMAMDTSNDAAWVPCTG-----CVG------------------- 130
           Y +   +G P  A ++ + +DT +D  W PC       C G                   
Sbjct: 88  YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDSR 147

Query: 131 ---CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYGSSTIAANLSQ 184
              C+S + ++A S+   +  C AA+C    +   +C   AC      YG  ++ ANL +
Sbjct: 148 RISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANLRR 207

Query: 185 DTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-- 241
             + LA  + V  +TF C   A      P G+ G GRG LSL AQ        FSYCL  
Sbjct: 208 GRVGLAASMAVENFTFACAHTALAE---PVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVA 264

Query: 242 PSFKA--LSFSGSLRLG------PIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
            SF+A  L  S  L LG       IG  +    YTPLL NP+    Y V L A+ VG + 
Sbjct: 265 HSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKR 324

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG----- 347
           +   P     +     G ++DSGT FT L +  +  V D F R + +     + G     
Sbjct: 325 IQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQT 384

Query: 348 GFDTCYSV---PIVAPTITLMFSG-MNVTLPQDNLLI--HSTAG-SITCLAMAAAPDNVN 400
           G   CY         P + L F G   V LP+ N  +   S  G S+ CL +     N +
Sbjct: 385 GLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNND 444

Query: 401 SVLN------VIANMQQQNHRILYDVPNSRLGVARELCT 433
              +       + N QQQ   ++YDV   R+G AR  CT
Sbjct: 445 DGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 167/374 (44%), Gaps = 50/374 (13%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV----FNSAQSTTFKNLGCQAAQC 154
           +   +GTP Q + M +DT ++ +W+ C      ++T+    FN   S+++  + C +  C
Sbjct: 68  ISITVGTPPQNMSMVIDTGSELSWLHCN--TNTTATIPYPFFNPNISSSYTPISCSSPTC 125

Query: 155 ----KQVPNP-TC-GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATG 207
               +  P P +C     C   L+Y  +S+   NL+ DT    +   PG  FGC+  +  
Sbjct: 126 TTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNPGIVFGCMNSSYS 185

Query: 208 NSVPPQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQP 261
            +        GL+G+  GSLSL++Q   L    FSYC+       FSG L LG       
Sbjct: 186 TNSESDSNTTGLMGMNLGSLSLVSQ---LKIPKFSYCI---SGSDFSGILLLGESNFSWG 239

Query: 262 KRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-TIIDSG 315
             + YTPL++          S Y V L  I++  ++++I  G L     TGAG T+ D G
Sbjct: 240 GSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNIS-GNLFVPDHTGAGQTMFDLG 298

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF------DTCYSVPI------VAPTIT 363
           T F+ L+ P Y A+RD F  +    L       F      D CY VP+        P+++
Sbjct: 299 TQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVNQSELPELPSVS 358

Query: 364 LMFSGMNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           L+F G  + +  D LL           S+ C     + D +     +I +  QQ+  + +
Sbjct: 359 LVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNS-DLLGVEAFIIGHHHQQSMWMEF 417

Query: 419 DVPNSRLGVARELC 432
           D+   R+G+A   C
Sbjct: 418 DLVEHRVGLAHARC 431


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 111/336 (33%), Positives = 152/336 (45%), Gaps = 41/336 (12%)

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GCS---STVFNSAQSTTFKNLGCQAAQCK 155
           A  GT A T  + +D+ +D +WV C  C    C      +F+ A STT+  + C +A C 
Sbjct: 68  APDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACA 127

Query: 156 QVPNPTCGGGA---CAFNLTYGS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSV 210
           Q+     G  A   C F + YG  ST     S D ++L   D++ G+ FGC     G++ 
Sbjct: 128 QLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAF 187

Query: 211 PPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY-- 266
                G L LG GS SL+ QT   Y   FSYCLP     S  G L LG    P+R +   
Sbjct: 188 DYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLP--PTASSLGFLVLGV--PPERAQLIP 243

Query: 267 ----TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
               TPLL +    + Y V L AI V  R + +PP          A ++IDS T+ +RL 
Sbjct: 244 SFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFS------ASSVIDSSTIISRLP 297

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDN 377
             AY A+R  FR  +        +   DTCY       I  P+I L+F  G  V L    
Sbjct: 298 PTAYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAG 357

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           +L+ S      CLA   AP   + +   I N+QQ+ 
Sbjct: 358 ILLGS------CLAF--APTASDRMPGFIGNVQQKT 385



 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 72/231 (31%), Positives = 97/231 (41%), Gaps = 32/231 (13%)

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP-LLKN 272
           G   + R  L L   TQ  Y   FSYC+P   + S  G + LG    P+R    P  +  
Sbjct: 419 GPYDVDRQGLPLRTATQ--YGRVFSYCIP--PSPSSLGFITLGV--PPQRAALVPTFVST 472

Query: 273 PRRSS------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
           P  SS       Y V L AI V  R + +PP            ++I S TV +RL   AY
Sbjct: 473 PLLSSSSMPPTFYRVLLRAIIVAGRPLPVPPTVFS------TSSVIASTTVISRLPPTAY 526

Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIH 381
            A+R  FRR +    T   +   DTCY       I  P+I L+F  G  V L    +L+ 
Sbjct: 527 QALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLQ 586

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                  CLA   AP   + +   I N+QQ+   ++YDVP   +      C
Sbjct: 587 G------CLAF--APTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 107/349 (30%), Positives = 151/349 (43%), Gaps = 42/349 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPC---TGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   IGTP Q L    DT +D  W  C    G     S+ ++   S+TF  L C    
Sbjct: 100 YDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRL 159

Query: 154 CKQVPNPT-----CGGGACAFNLTYGSST----IAANLSQDTISLATDIVPGYTFGCIQK 204
           C  + + +      GG  C +   YG           L  +T +L  D VPG  FGC   
Sbjct: 160 CAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVGFGCTTA 219

Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI----GQ 260
             G+     GL+GLGRG LSL++Q   L   TF YCL +    S +  L  G +    G 
Sbjct: 220 LEGDYGEGAGLVGLGRGPLSLVSQ---LDAGTFMYCLTA--DASKASPLLFGALATMTGA 274

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
              ++ T LL +   ++ Y VNL +I +G         A         G + DSGT  T 
Sbjct: 275 GAGVQSTGLLAS---TTFYAVNLRSITIGS--------ATTAGVGGPGGVVFDSGTTLTY 323

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA---PTITLMFS-GMNVTLPQD 376
           L  PAYT  +  F  +  S   V    GF+ CY  P  A   P + L F  G ++ LP  
Sbjct: 324 LAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYEKPDSARLIPAMVLHFDGGADMALPVA 383

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           N ++    G + C  +  +P      L++I N+ Q N+ +L+DV  S L
Sbjct: 384 NYVVEVDDG-VVCWVVQRSPS-----LSIIGNIMQMNYLVLHDVRKSVL 426


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 115/415 (27%), Positives = 171/415 (41%), Gaps = 62/415 (14%)

Query: 70  QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
           + LSS A A  +VV   S         Y V    GTP+QT+    DT +   W PCT   
Sbjct: 65  EALSSTATASATVV--KSHLSPKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRY 122

Query: 130 GCSSTVFNS-----------AQSTTFKNLGCQAAQCKQV--PNPTCGGGACAFN------ 170
            CS   F+              S++ + +GCQ  +C+ +   N  C G  C  N      
Sbjct: 123 LCSDCNFSGLDPTQIPRFIPKNSSSSRVIGCQNPKCQFLFGANVQCRG--CDPNTRNCTV 180

Query: 171 ------LTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
                 L YG  + A  L  + +      VP +  GC   +T     P G+ G GRG  S
Sbjct: 181 PCPPYILQYGLGSTAGILISEKLDFPDLTVPDFVVGCSVISTRT---PAGIAGFGRGPES 237

Query: 225 LLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIGQPKR------IKYTPLLKNPRRS 276
           L +Q +     +FS+CL S  F   + +  L L      K       + YTP  KNP  S
Sbjct: 238 LPSQMK---LKSFSHCLVSRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVS 294

Query: 277 S-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           +      YY+NL  I VG + V IP   L        G+I+DSG+ FT +  P +  V +
Sbjct: 295 NTAFLEYYYLNLRRIYVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAE 354

Query: 332 VFRRRVGSNLT----VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHS 382
            F  ++ SN T    +  + G   C+++     +  P +   F  G  + LP  N     
Sbjct: 355 EFATQM-SNYTREKDLEKVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFV 413

Query: 383 TAGSITCLAMAAA----PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
                 CL + +     P        ++ + QQQN+ + YD+ N R G A++ C+
Sbjct: 414 GNADTVCLTVVSDNTVNPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 104/382 (27%), Positives = 179/382 (46%), Gaps = 53/382 (13%)

Query: 88  GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTF 144
           GR+  +   Y    K+G+P Q  ++ +DT ++  W+ C  C  C+    T++++A+S ++
Sbjct: 94  GRKFGE---YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASY 150

Query: 145 KNLGCQAAQ-CKQVPNPT---CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI----- 193
           + + C  +Q C      T   C  G+ C F   YG  + +  +LS DT+ + T +     
Sbjct: 151 RPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210

Query: 194 -VPGYTFGCIQKA-----TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA- 246
            V  + FGC Q       TG S    G+LGL  G ++L  Q    +   FS+C P   + 
Sbjct: 211 TVQDFAFGCAQGDLELVPTGAS----GILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSH 266

Query: 247 LSFSGSLRLGPIGQP-KRIKYT--PLLKNPRRSSLYYVNLLAIRVG-RRVVDIPPGALQF 302
           L+ +G +  G    P ++++YT   L  +  +   Y+V L  + +    +V +P G++  
Sbjct: 267 LNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSV-- 324

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF-RRRVGS--NLTVTSLGGFDTCYSVP--- 356
                   I+DSG+ F+  V P ++ +R+ F + R  S  +L   S G   TC+ V    
Sbjct: 325 -------VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDD 377

Query: 357 -----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
                   P+++L+F  G+ + +P   +L+             A  D   + +NVI N Q
Sbjct: 378 IDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPVNVIGNYQ 437

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQN  + YD+  SR+G AR  C
Sbjct: 438 QQNLWVEYDIQRSRVGFARASC 459


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 103/328 (31%), Positives = 152/328 (46%), Gaps = 43/328 (13%)

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDI 193
           F  A S+TF  L C ++ C+ + +P  TC    C +   YG    A  L+ +T+ +    
Sbjct: 96  FQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMGFTAGYLATETLHVGGAS 155

Query: 194 VPGYTFGC-IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
            PG  FGC  +   GNS    G++GLGR  LSL++Q        FSYCL S  A +    
Sbjct: 156 FPGVAFGCSTENGVGNS--SSGIVGLGRSPLSLVSQVG---VGRFSYCLRS-DADAGDSP 209

Query: 253 LRLGPIGQPKRIKYTP-LLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA- 308
           +  G + +    K +P +L+NP    SS YYVNL  I VG    D+P  +  F  T GA 
Sbjct: 210 ILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVG--ATDLPVTSTTFGFTRGAG 267

Query: 309 -----GTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLG---GFDTCYS----- 354
                GTI+DSGT  T LV   Y  V+  F  ++  +NLT T  G   GFD C+      
Sbjct: 268 AGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDANAAG 327

Query: 355 ----VPIVAPTITLMFSGMNVTLPQDNLLIHSTA------GSITCLAMAAAPDNVNSVLN 404
               VP+  PT+ L F+G      +    +           ++ CL +  A + ++  ++
Sbjct: 328 GGSGVPV--PTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEKLS--IS 383

Query: 405 VIANMQQQNHRILYDVPNSRLGVARELC 432
           +I N+ Q +  +LYD+       A   C
Sbjct: 384 IIGNVMQMDLHVLYDLDGGMFSFAPADC 411


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 107/369 (28%), Positives = 164/369 (44%), Gaps = 53/369 (14%)

Query: 114 MDTSNDAAWVPCT---GCVGC-----SSTVFNSAQSTTFKNLGCQAAQCK-------QVP 158
           MDT +D  WVPCT    C+ C     S+ VF    S++   + C  + CK       ++ 
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60

Query: 159 NPTCGGG--ACA-----FNLTYGSSTIAANLSQDTISLATDIVPG------YTFGCIQKA 205
             +C G    C+     + + YG  + A  L  +T++L  +   G      +  GC   +
Sbjct: 61  CQSCAGSLKNCSETCPPYGIQYGRGSTAGLLLTETLNLPLENGEGARAITHFAVGC---S 117

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQ-TQNLYQSTFSYCLPS--FKALSFSGSLRLGPIGQPK 262
             +S  P G+ G GRG+LS+ +Q  +++ +  F+YCL S  F   +    + LG    P 
Sbjct: 118 IVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDKALPN 177

Query: 263 RI--KYTPLLKNPR------RSSLYYVNLLAIRVG-RRVVDIPPGALQFNPTTGAGTIID 313
            I   YTP L N R          YY+ L  + +G +R+  +P   L+F+     GTIID
Sbjct: 178 NIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGGTIID 237

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGFDTCYSVP----IVAPTITLMFS 367
           SGT FT      +  +   F  ++G      V    G   CY V     IV P     F 
Sbjct: 238 SGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDVTGLENIVLPEFAFHFK 297

Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAMAAAPD--NVNSVLNVI-ANMQQQNHRILYDVPNS 423
            G ++ LP  N   + ++    CL M ++     V+S   VI  N QQQ+  +LYD   +
Sbjct: 298 GGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFYLLYDREKN 357

Query: 424 RLGVARELC 432
           RLG  ++ C
Sbjct: 358 RLGFTQQTC 366


>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 450

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 162/380 (42%), Gaps = 53/380 (13%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC---- 154
           V   +GTP Q + M +DT ++ + + C G        FN++ S T+  + C +  C    
Sbjct: 67  VSVVVGTPPQNVTMVLDTGSELSGLLCNGSSLSPPAPFNASASLTYSAVDCSSPACVWRG 126

Query: 155 KQVP-NPTCGG---GACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCI------- 202
           + +P  P C      +C  +++Y  ++ A  +L  DT  L T  VP   FGCI       
Sbjct: 127 RDLPVRPFCDAPPSTSCRVSISYADASSADGHLVADTFILGTQAVPAL-FGCITSYSSST 185

Query: 203 ---QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
                AT  S    GLLG+ RGSLS + QT  L    F+YC+   +          G   
Sbjct: 186 AINSSATDPSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPGQGPGILLLGGDGGAA 242

Query: 260 QPKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
            P  + YTPL++  +         Y V L  IRVG  ++ IP   L  + T    T++DS
Sbjct: 243 PP--LNYTPLIEISQPLPYFDRVAYSVQLEGIRVGSALLQIPKSVLTPDHTGAGQTMVDS 300

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLT------VTSLGGFDTCYSVPI--------VAP 360
           GT FT L+A AY A++  F  +  S L           G FD C+  P         + P
Sbjct: 301 GTQFTFLLADAYAALKAEFLNQARSLLAPLGEPGFVFQGAFDACFRGPEERVSAASRLLP 360

Query: 361 TITLMFSGMNVTLPQDNLLI--------HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
            + L+  G  V +  + LL            A ++ CL    + D       VI +  QQ
Sbjct: 361 EVGLVLRGAEVAVAGEKLLYSVPGERRGEEGAEAVWCLTFGNS-DMAGMSAYVIGHHHQQ 419

Query: 413 NHRILYDVPNSRLGVARELC 432
           +  + YD+ N R+G A   C
Sbjct: 420 DVWVEYDLQNGRVGFAPARC 439


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 110/423 (26%), Positives = 184/423 (43%), Gaps = 42/423 (9%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
           T+ + H  SP SPF  S   S +     +    ++ LQF +  A        I S R   
Sbjct: 27  TIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNRG-- 84

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGC 149
               Y++   IGTP   +L   DT +D  W  C  C  C   +S +F+  +S+T++ + C
Sbjct: 85  ---EYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSC 141

Query: 150 QAAQCKQVPNPTCG--GGACAFNLTYG-SSTIAANLSQDTISLATD-----IVPGYTFGC 201
            ++QC+ + + +C      C++ +TYG +S    +++ DT+++ +       +     GC
Sbjct: 142 SSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGC 201

Query: 202 IQKATGNSVPPQGLLGLGRGSL-SLLAQTQNLYQSTFSYCLPSFKALS-FSGSLRLGPIG 259
             + TG   P    +    G   SL++Q +      FSYCL  F + +  +  +  G  G
Sbjct: 202 GHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNG 261

Query: 260 ---QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT---TGAGTI-I 312
                  +  + + K+P  ++ Y++NL AI VG +        +QF  T   TG G I I
Sbjct: 262 IVSGDGVVSTSMVKKDP--ATYYFLNLEAISVGSK-------KIQFTSTIFGTGEGNIVI 312

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPTITLMFSGMN 370
           DSGT  T L +  Y  +  V    + +       G    CY  S     P IT+ F G +
Sbjct: 313 DSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSSFKVPDITVHFKGGD 372

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           V L   N  + + +  ++C A AA     N  L +  N+ Q N  + YD  +  +   + 
Sbjct: 373 VKLGNLNTFV-AVSEDVSCFAFAA-----NEQLTIFGNLAQMNFLVGYDTVSGTVSFKKT 426

Query: 431 LCT 433
            C+
Sbjct: 427 DCS 429


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 126/457 (27%), Positives = 187/457 (40%), Gaps = 54/457 (11%)

Query: 7   FFLAFLFLFSLSEGLNPICDTQDHSSTLQVF-----HVFSPCSPF-----KPSKPLSWEE 56
           F   FL L S S     I    + S TL  F     H  SP SPF      PS+ +  + 
Sbjct: 4   FVFCFLLLCSHS-----IASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERI--KN 56

Query: 57  SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
           +VL   A+ + RL+ LS         + I     IT+   Y++R  IGTP        DT
Sbjct: 57  TVLRSFARSKRRLR-LSQNDDRSPGTITIPD-EPITE---YLMRFYIGTPPVERFAIADT 111

Query: 117 SNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP--NPTCGG--GACAF 169
            +D  WV   PC  CV  ++ +F+  +S+TFK + C +  C  +P     C G  G C +
Sbjct: 112 GSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYY 171

Query: 170 NLTYGSSTIAAN-LSQDTISLATD----IVPGYTFGCI---QKATGNSVPPQGLLGLGRG 221
              YG  T+ +  L  ++I+  +       P  TFGC          S    GL+GLG G
Sbjct: 172 QYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVG 231

Query: 222 SLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK---YTPLLKNPRRSSL 278
            LSL++Q        FSYC P   + S S  +R G     K+IK    TPL+      S 
Sbjct: 232 PLSLISQLGYQIGRKFSYCFPPLSSNSTS-KMRFGNDAIVKQIKGVVSTPLIIKSIGPSY 290

Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG 338
           YY+NL  + +G + V            T    +IDSGT FT L    Y     + +   G
Sbjct: 291 YYLNLEGVSIGNKKVKTSESQ------TDGNILIDSGTSFTILKQSFYNKFVALVKEVYG 344

Query: 339 SNLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
                     ++ C+         P +  +F+G  V +   NL   +   ++ C+     
Sbjct: 345 VEAVKIPPLVYNFCFENKGKRKRFPDVVFLFTGAKVRVDASNLF-EAEDNNLLCMVALPT 403

Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            D  +S+     N  Q  +++ YD+    +  A   C
Sbjct: 404 SDEDDSIF---GNHAQIGYQVEYDLQGGMVSFAPADC 437


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 120/465 (25%), Positives = 189/465 (40%), Gaps = 77/465 (16%)

Query: 26  DTQDHSSTLQVFHVFSPCSPFKPSK------PLSWEESVLEMLAKDQARLQFLSSL---- 75
           D+++++++   F +F   SP   S+      P S  +   ++L  D AR Q +SSL    
Sbjct: 34  DSKNNNNSGVWFEMFHMHSPKLKSQSKFLGPPKSRLDGTRQLLQSDNARRQMISSLRHGT 93

Query: 76  -----AVARKSVVPIASGRQITQSPTYIVRAKIGTPA-QTLLMAMDTSNDAAWVPCT-GC 128
                 V+  + +PI SG    QS  Y V  +IGTP  Q  ++  DT +D  W+ C   C
Sbjct: 94  RRKAFEVSHTAQIPIHSGADSGQS-QYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWC 152

Query: 129 VGCSS------TVFNSAQSTTFKNLGCQAAQCK----------QVPNPTCGGGACAFNLT 172
             C         VF +  S++F+ + C +  CK          + PNP      C F+  
Sbjct: 153 KSCPKPNPHPGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPN---APCLFDYR 209

Query: 173 Y----------GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
           Y           + T+   L+        D++     GC +     +  P G++GLG   
Sbjct: 210 YLNGPRAIGVFANETVTVGLNDHKKIRLFDVL----IGCTESFNETNGFPDGVMGLGYRK 265

Query: 223 LSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPK--RIKYTPLLKNPRRSSLY 279
            SL  +   ++ + FSYCL     + +    L  G I + K  ++++T LL     ++ Y
Sbjct: 266 HSLALRLAEIFGNKFSYCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLG-YINAFY 324

Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
            VN+  I VG  ++ I      +N T   G I+DSGT  T L   AY  V D  +     
Sbjct: 325 PVNVSGISVGGSMLSISSDI--WNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDK 382

Query: 340 NLTVTSL------------GGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
           +  V  +             GFD         P + + F+   +  P     I   A  I
Sbjct: 383 HKKVVPIELPELNNFCFEDKGFDRA-----AVPRLLIHFADGAIFKPPVKSYIIDVAEGI 437

Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            CL +  A    +S+L    N+ QQNH   YD+   +LG     C
Sbjct: 438 KCLGIIKADFPGSSIL---GNVMQQNHLWEYDLGRGKLGFGPSSC 479


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 104/420 (24%), Positives = 175/420 (41%), Gaps = 32/420 (7%)

Query: 33  TLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
           ++ + H  SP SPF  PSK  +  E + +   +  +R+      A+    +      R +
Sbjct: 33  SVDLIHRDSPHSPFFDPSK--TRTERLTDAFHRSASRVGRFRQSAMTSDGI----QSRLV 86

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLG 148
             +  YI+   IGTP   ++  +DT +D  W  C  C  C   V   F+   S+T+++  
Sbjct: 87  PSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSS 146

Query: 149 CQAAQCKQVPNP-TC-GGGACAFNLTYGSSTI-AANLSQDTISLATDI-----VPGYTFG 200
           C  + C  + N  +C  G  C F  +Y   +    NL+ +T+++A+        PG+ FG
Sbjct: 147 CGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFG 206

Query: 201 CIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC-LPSFKALSFSGSLRLGPI 258
           C+ ++ G       G++GLG   LS+++Q ++     FSYC LP F   S S  +  G  
Sbjct: 207 CVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRS 266

Query: 259 G--QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
           G         TPL+     +  Y + L    VG++ +     + +     G   I+DSGT
Sbjct: 267 GIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEG-NIIVDSGT 325

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---PIVAPTITLMFSGMNVTL 373
            +T L    Y  + +     +         G    CY+     I AP IT  F   NV L
Sbjct: 326 TYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQIDAPIITAHFKDANVEL 385

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              N  +      + C  +    D     + ++ N+ Q N  + +D+   R+      CT
Sbjct: 386 QPWNTFLRMQE-DLVCFTVLPTSD-----IGILGNLAQVNFLVGFDLRKKRVSFKAADCT 439


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 113/440 (25%), Positives = 185/440 (42%), Gaps = 38/440 (8%)

Query: 13  FLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQF 71
           FLF L E    +   +    ++ + H  SP SPF  PSK  +  E + +   +  +R+  
Sbjct: 17  FLFQLLE----VALARGGGFSVDLIHRDSPHSPFFDPSK--TQAERLTDAFRRSVSRVGR 70

Query: 72  LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
               A+    +      R +  +  Y++   IGTP   ++  +DT +D  W  C  C  C
Sbjct: 71  FRPTAMTSDGI----QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHC 126

Query: 132 SSTV---FNSAQSTTFKNLGCQAAQCKQV-PNPTCGG-GACAFNLTYGSSTI-AANLSQD 185
              V   F+   S+T+++  C  + C  +  + +C     C F  +Y   +    NL+ +
Sbjct: 127 YKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASE 186

Query: 186 TISLATDI-----VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
           T+++ +        PG+ FGC   + G       G++GLG G LSL++Q ++     FSY
Sbjct: 187 TLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSY 246

Query: 240 C-LPSFKALSFSGSLRLGPIGQPKRIK--YTPLL-KNPRRSSLYYVNLLAIRVGRRVVDI 295
           C LP     S S  +  G  G+        TPL+ K+P   + YY+ L  I VG++ +  
Sbjct: 247 CLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSP--DTFYYLTLEGISVGKKRLPY 304

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-- 353
             G  +         I+DSGT +T L    Y+ +       +         G F  CY  
Sbjct: 305 -KGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNT 363

Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           +  I AP IT  F   NV L   N  +      + C  +A   D     + V+ N+ Q N
Sbjct: 364 TAEINAPIITAHFKDANVELQPLNTFMRMQE-DLVCFTVAPTSD-----IGVLGNLAQVN 417

Query: 414 HRILYDVPNSRLGVARELCT 433
             + +D+   R+      CT
Sbjct: 418 FLVGFDLRKKRVSFKAADCT 437


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 110/357 (30%), Positives = 160/357 (44%), Gaps = 44/357 (12%)

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCS---STVFNSAQSTTFKNLGCQAAQCK 155
           A  GT A +  + +D+ +D  WV C  C  + C      +F+ A STT+  + C +A C 
Sbjct: 72  APDGTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACA 131

Query: 156 QVPNPTCGG----GACAFNLTYGS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGN- 208
           ++  P   G      C F +TY + +T     S D ++L   D+V G+ FGC     G+ 
Sbjct: 132 RL-GPYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFLFGCAHADQGST 190

Query: 209 -SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY- 266
            S    G L LG GS S + QT + Y   FSYC+P   + S  G +  G    P+R    
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQYSRVFSYCVP--PSTSSFGFIMFGV--PPQRAALV 246

Query: 267 -----TPLLKNPRRSSLYYVNLL-AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
                TPLL +   S  +Y  LL +I V  R + +PP          A ++IDS TV +R
Sbjct: 247 PTFVSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFS------ASSVIDSATVISR 300

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQ 375
           +   AY A+R  FR  +        +   DTCY       I  P+I L+F  G  V L  
Sbjct: 301 IPPTAYQALRAAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDA 360

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             +L+        CLA   AP   + +   I N+QQ+   ++YDVP   +      C
Sbjct: 361 AGILLQG------CLAF--APTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 128/451 (28%), Positives = 193/451 (42%), Gaps = 50/451 (11%)

Query: 6   VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
           +FF   L L S S+         D+  T  +FH  S  SP + S  LS  + +     + 
Sbjct: 7   IFFHLILLLISFSQ---TTIINGDNGFTTSLFHRDSLLSPLEFSS-LSHYDRLTNAFRRS 62

Query: 66  QARLQFLSSLAVARKSV---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
            +R   L + A    ++    P+  G     S  Y++   IGTP    +   DT +D  W
Sbjct: 63  LSRSATLLNRAATNGALDLQAPLTPG-----SGEYLMSVSIGTPPVDYIGMADTGSDLMW 117

Query: 123 VPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYGSSTI 178
             C  C+ C   S  +F+  +ST+F ++ C +  CK + +  CG  G C ++ TYG  T 
Sbjct: 118 AQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTY 177

Query: 179 A-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL--YQS 235
              +L  + I++ +  V     GC  ++ G      G++GLG G LSL++Q         
Sbjct: 178 TKGDLGFEKITIGSSSVKS-VIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISR 236

Query: 236 TFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVG-- 289
            FSYCLP+  + + +G +  G    +  P  +  TPL+ KNP   + YYV L AI +G  
Sbjct: 237 RFSYCLPTLLSHA-NGKINFGQNAVVSGPGVVS-TPLISKNP--VTYYYVTLEAISIGNE 292

Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF 349
           R +     G +          IIDSGT  + L    Y  V     + V +         +
Sbjct: 293 RHMASAKQGNV----------IIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFW 342

Query: 350 DTCYSVPI-VA-----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV 402
           D C+   I VA     P IT  FS G NV L   N      A ++ CL +   P +    
Sbjct: 343 DLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTF-QKVANNVNCLTL--TPASPTDE 399

Query: 403 LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             +I N+   N  I YD+   RL     +CT
Sbjct: 400 FGIIGNLALANFLIGYDLEAKRLSFKPTVCT 430


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 105/384 (27%), Positives = 161/384 (41%), Gaps = 55/384 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSS----TVFNSAQSTTFKNLGC 149
           Y +  + GTP+QT    +DT +   W+PC+    C  C+S      F    S++ K +GC
Sbjct: 86  YSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGC 145

Query: 150 QAAQCKQVPNPTCGGGAC---------------AFNLTYGSSTIAANLSQDTISLATDIV 194
              +C  V  P      C               A+ + YG  + A  L  + ++  T   
Sbjct: 146 TNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGSTAGFLLSENLNFPTKKY 205

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSG 251
             +  GC   +  +   P G+ G GRG  SL +Q  NL  + FSYCL S +   + + + 
Sbjct: 206 SDFLLGC---SVVSVYQPAGIAGFGRGEESLPSQ-MNL--TRFSYCLLSHQFDDSATITS 259

Query: 252 SLRLGPI----GQPKRIKYTPLLKNPRRS------SLYYVNLLAIRVGRRVVDIPPGALQ 301
           +L L       G+   + YTP LKNP         + YY+ L  I VG + V +P   L+
Sbjct: 260 NLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLE 319

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG--GFDTCYSVPIVA 359
            N     G I+DSG+ FT +  P +  V   F ++V       +    G   C+ +   A
Sbjct: 320 PNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFGLSPCFVLAGGA 379

Query: 360 PTIT---LMFS---GMNVTLPQDNLLIHSTAGSITCLAM-----AAAPDNVNSVLNVIAN 408
            T +   L F    G  + LP  N       G + CL +     A +   V   + ++ N
Sbjct: 380 ETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAV-ILGN 438

Query: 409 MQQQNHRILYDVPNSRLGVARELC 432
            QQQN  + YD+ N R G   + C
Sbjct: 439 YQQQNFYVEYDLENERFGFRSQSC 462


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 163/351 (46%), Gaps = 38/351 (10%)

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GCSSTV---FNSAQSTTFKNLGCQAAQC 154
           R+K+    QT+++  D+++D  WV C  C    C   V   ++ ++S T     C +  C
Sbjct: 21  RSKLPGVIQTVVL--DSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTC 78

Query: 155 KQVPNP---TCGGGACAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN- 208
             +  P    C    C + + Y  GSST  A ++      A + V G+ FGC     G+ 
Sbjct: 79  TAL-GPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSF 137

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYT 267
                G++ LG G  SLL+QT + Y + FSYC+P+    S SG   LG P     R   T
Sbjct: 138 DARAAGIMALGGGPESLLSQTASRYGNAFSYCIPA--TASDSGFFTLGVPRRASSRYVVT 195

Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
           P+++  + ++ Y V L  I VG + + + P          AG+++DS T  TRL   AY 
Sbjct: 196 PMVRFRQAATFYGVLLRTITVGGQRLGVAPAVF------AAGSVLDSRTAITRLPPTAYQ 249

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNVTLPQD--NLLIH 381
           A+R  FR  +    +    G  DTCY     V I  P I+L+F   N  LP D   +L +
Sbjct: 250 ALRAAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD-RNAVLPLDPSGILFN 308

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                  CLA  +  D  + +  V+ ++QQQ   +LYDV    +G  +  C
Sbjct: 309 D------CLAFTSNAD--DRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 107/367 (29%), Positives = 158/367 (43%), Gaps = 68/367 (18%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ 156
           Y     +G+P +   + MDT +D  WV C  C    S+ F+   S T+K L C       
Sbjct: 3   YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASNTYKALTC------- 55

Query: 157 VPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLAT------DIVPGYTFGCIQKATGNS 209
                    A  ++  YG  +    +LS DT+ +A       +  PG+ FGC     G  
Sbjct: 56  ---------ADDYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSLLKGLI 106

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI----------- 258
               G+L L  GSLS  +Q    Y + FSYCL    A     SL+  P+           
Sbjct: 107 SGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTA---QNSLKKSPMVFGEAAVELKE 163

Query: 259 ---GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGAL---QFNPTTGAGTI 311
              G+ + ++YTP+      SS+YY V L  I VG + +D+ P A    Q  P     TI
Sbjct: 164 PGSGKLQELQYTPI----GESSIYYTVRLDGISVGNQRLDLSPSAFLNGQDKP-----TI 214

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS 367
            DSGT  T L      +++      V S     ++ G D C+ VP  +    P IT  F+
Sbjct: 215 FDSGTTLTMLPPGVCDSIKQSLASMV-SGAEFVAIKGLDACFRVPPSSGQGLPDITFHFN 273

Query: 368 GMN--VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           G    VT P + ++     GS+ CL     P N    +++  N+QQQ+  +L+D+ N R+
Sbjct: 274 GGADFVTRPSNYVI---DLGSLQCLIF--VPTN---EVSIFGNLQQQDFFVLHDMDNRRI 325

Query: 426 GVARELC 432
           G     C
Sbjct: 326 GFKETDC 332


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 118/415 (28%), Positives = 184/415 (44%), Gaps = 43/415 (10%)

Query: 51  PLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTL 110
           P + E  + E+ A D AR   L    V      P+           Y  + K+GTP +  
Sbjct: 38  PPNHELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREF 97

Query: 111 LMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVPN 159
            + +DT +D  WV CT C GC  T         F+   S++   + C   +C    Q  +
Sbjct: 98  NVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES 157

Query: 160 PTCGGGACAFNLTYGSST------IAANLSQDTI---SLATDIVPGYTFGCIQKATGNSV 210
                  C+++  YG  +      I+  +S DT+   +LA +    + FGC    TG+  
Sbjct: 158 GCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSAPFVFGCSNLQTGDLQ 217

Query: 211 PPQ----GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
            P+    G+ GLG+GSLS+++Q   Q L    FS+CL   K  S  G + LG I +P  +
Sbjct: 218 RPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK--SGGGIMVLGQIKRPDTV 275

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
            YTPL+ +      Y VNL +I V  +++ I P    F   TG GTIID+GT    L   
Sbjct: 276 -YTPLVPSQPH---YNVNLQSIAVNGQILPIDPSV--FTIATGDGTIIDTGTTLAYLPDE 329

Query: 325 AYTAVRDVFRRRV---GSNLTVTSLGGFDTCYSVPIVAPTITLMFSG--MNVTLPQDNLL 379
           AY+         V   G  +T  S   F+       V P ++L F+G    V  P   L 
Sbjct: 330 AYSPFIQAIANAVSQYGRPITYESYQCFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQ 389

Query: 380 IHSTAG-SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           I S++G SI C+         +  + ++ ++  ++  ++YD+   R+G A   C+
Sbjct: 390 IFSSSGSSIWCIGFQRMS---HRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 85/261 (32%), Positives = 125/261 (47%), Gaps = 23/261 (8%)

Query: 52  LSWEESVLEMLAKDQARLQFLS----SLAVARKSVVPIASGRQITQSPTYIVRAKIGTPA 107
           L+  E +   + + + RL  +       A ARK+VV  A    +     Y+V+  IGTP 
Sbjct: 42  LTEHELLRRAIQRSRYRLAGIGMARGEAASARKAVV--AETPIMPAGGEYLVKLGIGTPP 99

Query: 108 QTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
                A+DT++D  W  C  C GC   V   FN   S+T+  L C +  C ++    CG 
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159

Query: 165 G---ACAFNLTY-GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ--GLLGL 218
               +C +  TY G++T    L+ D + +  D   G  FGC   +TG + PPQ  G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYT----PLLKNPR 274
           GRG LSL++Q   L    F+YCLP   A    G L LG      R        P+ ++PR
Sbjct: 220 GRGPLSLVSQ---LSVRRFAYCLPP-PASRIPGKLVLGADADAARNATNRIAVPMRRDPR 275

Query: 275 RSSLYYVNLLAIRVGRRVVDI 295
             S YY+NL  + +G R + +
Sbjct: 276 YPSYYYLNLDGLLIGDRTMSL 296


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 106/346 (30%), Positives = 150/346 (43%), Gaps = 41/346 (11%)

Query: 109 TLLMAMDTSNDAAWVPCTGC-----VGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG 163
           T  M +DT++D  WV C+ C           +++  +S++     C +  C Q+  P   
Sbjct: 143 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GPYAN 201

Query: 164 G----GACAFNLTY--GSSTIAANLSQD-TISLATDIVPGYTFGCIQKATGN---SVPPQ 213
           G      C + + Y  G+ST    +S   TI+ AT  V  + FGC     G+        
Sbjct: 202 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTITPAT-AVRSFQFGCSHGVQGSFSFGSSAA 260

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKN 272
           G++ LG G  SL++QT   Y   FS+C P      F     LG P     R   TP+LKN
Sbjct: 261 GIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGF---FTLGVPRVAAWRYVLTPMLKN 317

Query: 273 PR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           P    + Y V L AI V  + + +PP          AG  +DS T  TRL   AY A+R 
Sbjct: 318 PAIPPTFYMVRLEAIAVAGQRIAVPPTVF------AAGAALDSRTAITRLPPTAYQALRQ 371

Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGS 386
            FR R+         G  DTCY +  V     P ITL+F     V L    +L       
Sbjct: 372 AFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQG---- 427

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             CLA  A P+  + V  +I N+Q Q   +LY++P + +G     C
Sbjct: 428 --CLAFTAGPN--DQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 106/346 (30%), Positives = 150/346 (43%), Gaps = 41/346 (11%)

Query: 109 TLLMAMDTSNDAAWVPCTGC-----VGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG 163
           T  M +DT++D  WV C+ C           +++  +S++     C +  C Q+  P   
Sbjct: 168 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GPYAN 226

Query: 164 G----GACAFNLTY--GSSTIAANLSQD-TISLATDIVPGYTFGCIQKATGN---SVPPQ 213
           G      C + + Y  G+ST    +S   TI+ AT  V  + FGC     G+        
Sbjct: 227 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTITPAT-AVRSFQFGCSHGVQGSFSFGSSAA 285

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKN 272
           G++ LG G  SL++QT   Y   FS+C P      F     LG P     R   TP+LKN
Sbjct: 286 GIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGF---FTLGVPRVAAWRYVLTPMLKN 342

Query: 273 PR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           P    + Y V L AI V  + + +PP          AG  +DS T  TRL   AY A+R 
Sbjct: 343 PAIPPTFYMVRLEAIAVAGQRIAVPPTVF------AAGAALDSRTAITRLPPTAYQALRQ 396

Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGS 386
            FR R+         G  DTCY +  V     P ITL+F     V L    +L       
Sbjct: 397 AFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQG---- 452

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
             CLA  A P+  + V  +I N+Q Q   +LY++P + +G     C
Sbjct: 453 --CLAFTAGPN--DQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 111/427 (25%), Positives = 170/427 (39%), Gaps = 94/427 (22%)

Query: 21  LNPICDTQDHSSTLQVFHVFSPCSPFKPS----KPLSWEESVLEMLAKDQARLQFLSSLA 76
           L  I  + D +S++ + H + PCSP  P+    +P   E    + L  D  R +F  S  
Sbjct: 20  LATIPSSSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNG 79

Query: 77  VA-------RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC---- 125
            A        K  VP   G  +  +  Y++   +G+PA T  + +DT +D +WV C    
Sbjct: 80  TAAGEDGQSSKVSVPTTLGSSL-DTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCP 138

Query: 126 --TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYGSSTI 178
             + C   +  +F+ A S+T+    C AA C Q+ +     G      C + + YG  + 
Sbjct: 139 APSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSN 198

Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQST 236
                            G+ FGC     G  +  +  GL+GLG  + SL++QT       
Sbjct: 199 TTGT-------------GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTA------ 239

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
                                     R K  P        + Y+  L  I VG + + + 
Sbjct: 240 -------------------------ARSKKVP--------TYYFAALEDIAVGGKKLGLS 266

Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV- 355
           P          AG+++DSGTV TRL   AY A+   FR  +        LG  DTC++  
Sbjct: 267 PSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFT 320

Query: 356 ---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
               +  PT+ L+F+G  V     +L  H    S  CLA   AP   +     I N+QQ+
Sbjct: 321 GLDKVSIPTVALVFAGGAVV----DLDAHGIV-SGGCLAF--APTRDDKAFGTIGNVQQR 373

Query: 413 NHRILYD 419
              +LYD
Sbjct: 374 TFEVLYD 380


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 111/392 (28%), Positives = 163/392 (41%), Gaps = 64/392 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ-----------STTFK 145
           Y +   +GTP+QT+ + MDT +   W PCT    C+S  F +             S++ K
Sbjct: 84  YSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSK 143

Query: 146 NLGCQAAQCKQV-----------PNP---TCGGGACAFNLTYGSSTIAANLSQDTISLAT 191
            +GC+  +C  V            NP    C      + + YG  + A  L  +TI+   
Sbjct: 144 LIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGSTAGLLLSETINFPN 203

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK----AL 247
             +  +  GC   +T     P+G+ G GR   SL  Q   L    FSYCL S +     +
Sbjct: 204 KTISDFLAGCSLLSTRQ---PEGIAGFGRSQESLPLQ---LGLKKFSYCLVSRRFDDSPV 257

Query: 248 SFSGSLRLGPIGQPKR---IKYTPLLK------NPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           S    L +GP     +   + YTP  K      NP     YYV L  I VG+  V +P  
Sbjct: 258 SSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYS 317

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS----LGGFDTCYS 354
            L        GTI+DSG+ FT +    +  +   F +++ +N TV +    L G   C+ 
Sbjct: 318 FLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQM-ANYTVATNVQKLTGLRPCFD 376

Query: 355 V----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLN- 404
           +     +V P +T  F  G  + LP  N       G + CL +    AAA      V + 
Sbjct: 377 ISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMG-VVCLTIVSDNAAALGGDGGVRSS 435

Query: 405 ----VIANMQQQNHRILYDVPNSRLGVARELC 432
               ++ N QQQN  I YD+ N R G   + C
Sbjct: 436 GPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 89/283 (31%), Positives = 136/283 (48%), Gaps = 29/283 (10%)

Query: 162 CGGGA--CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
           CG  A  C + + YG  +     L  + +   T +V  + FGC +   G      GL+GL
Sbjct: 69  CGSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGL 128

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKNPR 274
           GR  LSL++QT  ++   FSYCLPS +    SGSL LG      R    I Y  +++NP+
Sbjct: 129 GRSDLSLISQTSGIFGGVFSYCLPSTERKG-SGSLILGGNSSVYRNSSPISYAKMIENPQ 187

Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI-IDSGTVFTRLVAPAYTAVRDVF 333
             + Y++NL  I +G         ALQ  P+ G   I +DSGTV TRL    Y A++  F
Sbjct: 188 LYNFYFINLTGISIGGV-------ALQ-APSVGPSRILVDSGTVITRLPPTIYKALKAEF 239

Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQD----NLLIHSTAG 385
            ++        +    DTC+++     +  PTI + F G N  L  D       + S A 
Sbjct: 240 LKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEG-NAELTVDVTGVFYFVKSDAS 298

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            + CLA+A+        + ++ N QQ+N R++YD   ++  V+
Sbjct: 299 QV-CLALASL--EYQDEVAILGNYQQKNLRVIYDTKETKADVS 338


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 106/351 (30%), Positives = 163/351 (46%), Gaps = 38/351 (10%)

Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GCSSTV---FNSAQSTTFKNLGCQAAQC 154
           R+K+    QT+++  D+++D  WV C  C    C   V   ++ ++S +     C +  C
Sbjct: 151 RSKLPGVIQTVVL--DSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTC 208

Query: 155 KQVPNP---TCGGGACAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN- 208
             +  P    C    C + + Y  GSST  A ++      A + V G+ FGC     G+ 
Sbjct: 209 TAL-GPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSF 267

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYT 267
                G++ LG G  SLL+QT + Y + FSYC+P+    S SG   LG P     R   T
Sbjct: 268 DARAAGIMALGGGPESLLSQTASRYGNAFSYCIPA--TASDSGFFTLGVPRRASSRYVVT 325

Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
           P+++  + ++ Y V L  I VG + + + P          AG+++DS T  TRL   AY 
Sbjct: 326 PMVRFRQAATFYGVLLRTITVGGQRLGVAPAVF------AAGSVLDSRTAITRLPPTAYQ 379

Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNVTLPQD--NLLIH 381
           A+R  FR  +    +    G  DTCY     V I  P I+L+F   N  LP D   +L +
Sbjct: 380 ALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD-RNAVLPLDPSGILFN 438

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                  CLA  +  D  + +  V+ ++QQQ   +LYDV    +G  +  C
Sbjct: 439 D------CLAFTSNAD--DRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 116/448 (25%), Positives = 185/448 (41%), Gaps = 46/448 (10%)

Query: 6   VFFLAFLFLFS--LSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
           +F L F+  FS  LS G            ++++ H  SP SP+       ++  V     
Sbjct: 11  LFSLCFIASFSHALSNGF-----------SVELIHRDSPKSPYYKPTENKYQHFV----- 54

Query: 64  KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
            D AR     +    + S         I     Y++   +GTP   +    DT +D  W+
Sbjct: 55  -DAARRSINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWL 113

Query: 124 ---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYGSSTIA 179
              PC  C   ++ +FN ++S+++KN+ C +  C  V + +C    +C + ++YG S+ +
Sbjct: 114 QCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHS 173

Query: 180 -ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNL 232
             +LS DT+SL +        P    GC     G       G++GLG G +SL+ Q  + 
Sbjct: 174 QGDLSVDTLSLESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSS 233

Query: 233 YQSTFSYCLPSF--KALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
               FSYCL     K  + S  L  G   +     +  TPL+K  +    Y++ L A  V
Sbjct: 234 IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSV 291

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
           G + V+   G            IIDSGT  T + +  YT +       V  +        
Sbjct: 292 GNKRVEF--GGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQ 349

Query: 349 FDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           F  CYS+       P IT+ F G +V L   +  +  T G I C A   +P     + ++
Sbjct: 350 FSLCYSLKSNEYDFPIITVHFKGADVELHSISTFVPITDG-IVCFAFQPSP----QLGSI 404

Query: 406 IANMQQQNHRILYDVPNSRLGVARELCT 433
             N+ QQN  + YD+    +      CT
Sbjct: 405 FGNLAQQNLLVGYDLQQKTVSFKPTDCT 432


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 98/306 (32%), Positives = 136/306 (44%), Gaps = 29/306 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y+V   IGTP Q + + +DT +D  W  C  C  C       F+ + S+T     C +  
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141

Query: 154 CKQVPNPTCG------GGACAFNLTYGSSTIAAN-LSQDTISL--ATDIVPGYTFGCIQK 204
           C+ +P  +CG         C +  +YG  ++    L  D  +   A   VPG  FGC   
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201

Query: 205 ATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
             G     + G+ G GRG LSL +Q   L    FS+C  +   L  S  L   P    K 
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKPSTVLLDLPADLYKS 258

Query: 264 ----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
               ++ TPL++NP   + YY++L  I VG   + +P         TG GTIIDSGT  T
Sbjct: 259 GRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTG-GTIIDSGTAMT 317

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPIVA----PTITLMFSGMNVTL 373
            L    Y  VRD F  +V   L V S    D   C S P+ A    P + L F G  + L
Sbjct: 318 SLPTRVYRLVRDAFAAQV--KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDL 375

Query: 374 PQDNLL 379
           P++N +
Sbjct: 376 PRENYV 381


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 100/411 (24%), Positives = 170/411 (41%), Gaps = 51/411 (12%)

Query: 69  LQFLSSLAVARKSVVPIASGRQITQSP-------TYIVRAKIGTPAQTLLMAMDTSNDAA 121
           LQ L++ +++R   +       + Q+         + +    GTP Q L   MDT +   
Sbjct: 52  LQHLATASMSRSHHLKHGKASPLIQTSLFPHSYGAHTIPLSFGTPPQKLSFLMDTGSHVV 111

Query: 122 WVPCT---GCVGCSST------VFNSAQSTTFKNLGCQAAQCKQV--PN-----PTCGGG 165
           W PCT    C  CS +      +FN   S++ K LGC+  +C     PB     P C G 
Sbjct: 112 WAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGXPRCNGN 171

Query: 166 ------AC-AFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
                 AC  + L YG+   +     + +      +  +  GC   A         L G 
Sbjct: 172 SKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSAD-REPSSDALAGF 230

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRR 275
           GR   SL  Q   +    F+YCL S  +     SG L L    G+ + + Y P  KNP  
Sbjct: 231 GRTMFSLPMQ---MGVKKFAYCLNSHDYDDTRNSGKLILDYSDGETQGLSYAPFXKNPPD 287

Query: 276 SSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
             + YY+ +  +++G +V+ IP   L     +  G +IDSG  ++ +  P +  V +  +
Sbjct: 288 YPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELK 347

Query: 335 RRVGS---NLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGS 386
           +++     +L + +  G   CY+      I  P +   F+ G N+ +P  N  +  +  S
Sbjct: 348 KQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEAS 407

Query: 387 ITCLAMAAAPDNVNSVLN-----VIANMQQQNHRILYDVPNSRLGVARELC 432
           + C  +       N         ++ N QQ +H + +D+ N RLG  ++ C
Sbjct: 408 LGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 112/437 (25%), Positives = 186/437 (42%), Gaps = 49/437 (11%)

Query: 26  DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEE--SVLEMLAKDQARLQFLSSLA---VARK 80
           ++Q+   ++++ H  S  SPF   +    +   +V+    K    L  + SL+   + + 
Sbjct: 21  ESQNRGFSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKP 80

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFN 137
           +++P A          Y++   IGTP   L   +DT +D  W    PC  C+  +S +FN
Sbjct: 81  TIIPYAGSY-------YVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFN 133

Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTISLATDI 193
            ++S+T+KN+ C +  CK+     C       C + +TY   S    ++S+DT++L ++ 
Sbjct: 134 PSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSND 193

Query: 194 -----VPGYTFGCIQKATGNSVPPQGL----LGLGRGSLSLLAQTQNLYQSTFSYCLPS- 243
                 P    GC  K   NS+  +GL    +G GRG+ S+++Q  +     FSYCL S 
Sbjct: 194 GSPISFPKIVIGCGHK---NSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASL 250

Query: 244 FKALSFSGSLRLGPIG--QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
           F   + S  L  G +       +  TPL+++    + Y+ NL A  VG  ++ +   +L 
Sbjct: 251 FSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGN-YFTNLEAFSVGDHIIKLKDSSLI 309

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI---V 358
             P      +IDSG+  T+L    Y+ +       V              CY   +    
Sbjct: 310 --PDNEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYE 367

Query: 359 APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNVIANMQQQNHRI 416
            P IT  F G +V L   N  I      + C A   +A P        V  N+ QQN  +
Sbjct: 368 VPIITAHFRGADVKLNAFNTFIQMNH-EVMCFAFNSSAFP------WVVYGNIAQQNFLV 420

Query: 417 LYDVPNSRLGVARELCT 433
            YD   + +      CT
Sbjct: 421 GYDTLKNIISFKPTNCT 437


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 114/443 (25%), Positives = 189/443 (42%), Gaps = 55/443 (12%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
           T+++ H  SP SP  P   L   E +L+  A   A L   +S+    K+V+        +
Sbjct: 15  TMELIHKDSPQSPLYPGN-LPPGEQILQPAACPFAGLHHQTSMMSTNKAVMNRMMSPLTS 73

Query: 93  QSPTYIVRAKIG----------TPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVF------ 136
               ++  A++G          T  +T    +DT N+ +W+ C GC    +  F      
Sbjct: 74  YGDPFLFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPP 133

Query: 137 -NSAQSTTFKNLGC-QAAQCKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATD- 192
             S+QS ++K + C Q + C+  PN  C  G CA+N+TYG  S  + NL+ +T +  ++ 
Sbjct: 134 YTSSQSKSYKPVSCNQHSFCE--PN-QCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNH 190

Query: 193 ----IVPGYTFGCIQKATG-------NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
                +   +FGC   +         +  P  G+LG+G G  S LAQ  ++    FSYC+
Sbjct: 191 GKHTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCI 250

Query: 242 PSFKALSFSGSLRLGP-IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
            +    + +  LR G  + + K ++ T +++  + S+ Y+VNLL I V    ++I    L
Sbjct: 251 TANN--THNTYLRFGKHVVKSKNLQTTKIMQ-VKPSAAYHVNLLGISVNGVKLNITKTDL 307

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSVP 356
                   G IID+GT+ T LV P +  +       + SN      V      D CY   
Sbjct: 308 AVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQL 367

Query: 357 IVA-----PTITLMFSGMNVTL-PQDNLLIHSTAG-SITCLAMAAAPDNVNSVLNVIANM 409
             A     P +T      ++ + P+   L     G ++ CL+M +          +I   
Sbjct: 368 SDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDDSKT-----IIGAY 422

Query: 410 QQQNHRILYDVPNSRLGVARELC 432
           QQ   + +YD     L    E C
Sbjct: 423 QQMKQKFVYDTKARVLSFGPEDC 445


>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
          Length = 452

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 112/369 (30%), Positives = 161/369 (43%), Gaps = 62/369 (16%)

Query: 122 WVPCTG---CVGCSST------VFNSAQSTTFKNLGCQ-------------AAQCKQVPN 159
           WVPCT    C  CSS       VF+   S++ + +GC+             A +C++ P 
Sbjct: 85  WVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPC 144

Query: 160 -------PTCGGGACA-FNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
                  P      C  + + YGS + A  L  DT+      VPG+  GC   +     P
Sbjct: 145 SPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVSVHQ--P 202

Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQPKRIKYTP 268
           P GL G GRG+ S+ AQ   L    FSYCL S +     + SGSL LG  G  + ++Y P
Sbjct: 203 PSGLAGFGRGAPSVPAQ---LGLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVP 259

Query: 269 LLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL-- 321
           L+K+     L     YY+ L  + VG + V +P  A   N     GTI+DSGT FT L  
Sbjct: 260 LVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDP 319

Query: 322 --VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNV-TL 373
               P   AV      R   +       G   C+++P     +  P ++  F G  V  L
Sbjct: 320 TVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGAVMQL 379

Query: 374 PQDNLLIHSTAGSITCLAMAAAPD--------NVNSVLNVI-ANMQQQNHRILYDVPNSR 424
           P +N  + +  G++  + +A   D        N  S   +I  + QQQN+ + YD+   R
Sbjct: 380 PVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKER 439

Query: 425 LGVARELCT 433
           LG  R+ CT
Sbjct: 440 LGFRRQSCT 448


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 157/385 (40%), Gaps = 58/385 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST---TF--------K 145
           Y +    GTP QT    MDT +   W PCT    CS   F + + T   TF        K
Sbjct: 83  YSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSK 142

Query: 146 NLGCQAAQCKQVPNP--------------TCGGGACAFNLTYGSSTIAANLSQDTISLAT 191
            +GC+  +C  +  P               C      + + YGS + A  L  +T+    
Sbjct: 143 LIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGSTAGLLLSETLDFPN 202

Query: 192 D-IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALS 248
              +P +  GC   +  +   P+G+ G GR   SL +Q   L    FSYCL S  F    
Sbjct: 203 KKTIPDFLVGC---SIFSIKQPEGIAGFGRSPESLPSQ---LGLKKFSYCLVSHAFDDTP 256

Query: 249 FSGSLRLGP-----IGQPKRIKYTPLLKNPRRS--SLYYVNLLAIRVGRRVVDIPPGALQ 301
            S  L L       + +   + +TP LKNP  +    YYV L  I +G   V +P   L 
Sbjct: 257 TSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLV 316

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT---SLGGFDTCYSV--- 355
                  GTI+DSGT FT +  P Y  V   F +++      T   +L G   CY++   
Sbjct: 317 PGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLRPCYNISGE 376

Query: 356 -PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN------VIA 407
             +  P +   F  G  + LP  N      +G I CL + +  DNV           ++ 
Sbjct: 377 KSLSVPDLIFQFKGGAKMALPLSNYFSIVDSGVI-CLTIVS--DNVAGPGLGGGPAIILG 433

Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
           N QQ+N  + +D+ N + G  ++ C
Sbjct: 434 NYQQRNFYVEFDLENEKFGFKQQSC 458


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 121/426 (28%), Positives = 180/426 (42%), Gaps = 64/426 (15%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPA-QTLLMA 113
            E +  M+A+ +ARL  L S A       P+  G     S  Y++   IGTP  Q +++ 
Sbjct: 52  HELLRRMVARSKARLASLRSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLH 111

Query: 114 MDTSNDAAWV--PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ-VPNPTCGGGA---- 166
           +DT +D  W    CT C      VF ++ S TF  + C    C   V  P  G  A    
Sbjct: 112 LDTGSDLVWTQCACTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRS 171

Query: 167 CAFNLTYGSSTI-AANLSQDTISL-ATD------IVPGYTFGCIQKATGNSVPPQ-GLLG 217
           C +   Y   +I    +++DT +  A D       VP   FGC     G   P Q G+ G
Sbjct: 172 CFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAG 231

Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI---GQPKRIKY-------- 266
            G G LSL +Q   L    FSYC   F A+  S   R+ P+   G+P+ I+         
Sbjct: 232 FGTGPLSLPSQ---LKVRRFSYC---FTAMEES---RVSPVILGGEPENIEAHATGPIQS 282

Query: 267 TPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
           TP    P  + +     Y+++L  + VG   +               GT IDSGT  T  
Sbjct: 283 TPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFF 342

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDT-----CYSVPI-----VAPTITLMFSGMNV 371
               + ++R+ F  +V   L V    G+       C+SVP        P + L   G + 
Sbjct: 343 PQAVFRSLREAFVAQV--PLPVAK--GYTDPDNLLCFSVPAKKKAPAVPKLILHLEGADW 398

Query: 372 TLPQDNLLIH-----STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
            LP++N ++      S AG   C+ + +A    NS   +I N QQQN  I+YD+ ++++ 
Sbjct: 399 ELPRENYVLDNDDDGSGAGRKLCVVILSAG---NSNGTIIGNFQQQNMHIVYDLESNKMV 455

Query: 427 VARELC 432
            A   C
Sbjct: 456 FAPARC 461


>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
          Length = 289

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 96/280 (34%), Positives = 138/280 (49%), Gaps = 35/280 (12%)

Query: 164 GGACAFNLTY--GSSTIAANLSQDTISLATD-IVPGYTFGCIQKATGNSVPPQGLLGLGR 220
           G  C F ++Y  G+ST+ A  SQD ++LA   IV  + FGC            G+LGLGR
Sbjct: 34  GKQCGFAISYADGTSTVGA-YSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGR 92

Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY 280
              SL A+    Y   FSYCLPS    S  G L LG    P    +TP+   P + +   
Sbjct: 93  LRESLGAR----YGGVFSYCLPSVS--SKPGFLALGAGKNPSGFVFTPMGTVPGQPTFST 146

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           V L  I VG + +D+ P A         G I+DSGTV T L + AY A+R  FR+ + + 
Sbjct: 147 VTLAGINVGGKKLDLRPSAFS------GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAY 200

Query: 341 LTVTSLGGFDTCYSVP----IVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMA 393
             + + G  DTCY++     +V P I L F+G   +N+ +P + +L++       CLA A
Sbjct: 201 RLLPN-GDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVP-NGILVNG------CLAFA 252

Query: 394 -AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            + PD    VL    N+ Q+   +L+D   S+ G   + C
Sbjct: 253 ESGPDGSAGVL---GNVNQRAFEVLFDTSTSKFGFRAKAC 289


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 109/393 (27%), Positives = 163/393 (41%), Gaps = 33/393 (8%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
           EE V   +A  + RL FL +          + +  +   +  Y+    IG P Q     +
Sbjct: 49  EELVRRAVAAGKQRLAFLDAAMAGGGDGGGVGAPVRWA-TLQYVAEYLIGDPPQRAEALI 107

Query: 115 DTSNDAAWVPCTGCVG--CSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPT--CG-GGA 166
           DT +D  W  C+ C+   C+      +NS+ S+TF  + C A  C    +    C     
Sbjct: 108 DTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAARICAANDDIIHFCDLAAG 167

Query: 167 CAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGNSVPPQGLLGLGRGSL 223
           C+    YG+  +A  L  +  +  +       FGC+   +   G      GL+GLGRG L
Sbjct: 168 CSVIAGYGAGVVAGTLGTEAFAFQSGTAE-LAFGCVTFTRIVQGALHGASGLIGLGRGRL 226

Query: 224 SLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLY 279
           SL++QT     + FSYCL P F     +G L +G    +G    +  T  +K P+ S  Y
Sbjct: 227 SLVSQTG---ATKFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMTTQFVKGPKGSPFY 283

Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTT----GAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
           Y+ L+ + VG   + IP                G IIDSG+ FT LV  AY A+      
Sbjct: 284 YLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAA 343

Query: 336 RVGSNLTVTSLGGFDTCY-----SVPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITC 389
           R+  +L        D         V  V P +   F  G ++ +P ++        +   
Sbjct: 344 RLNGSLVAPPPDADDGALCVARRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDKAAACM 403

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
              +A P    S   VI N QQQN R+LYD+ N
Sbjct: 404 AIASAGPYRRQS---VIGNYQQQNMRVLYDLAN 433


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 122/457 (26%), Positives = 196/457 (42%), Gaps = 60/457 (13%)

Query: 6   VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAK 64
           + F +  F+ S S         Q +  ++++ H  S  SP +KP++   ++  V      
Sbjct: 9   LLFFSICFIVSFSHA-------QKNGFSVELIHRDSLKSPLYKPTQN-KYQYFV------ 54

Query: 65  DQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV- 123
           D AR     +    + S+  I     I     Y++   +GTP   L   +DT +D  W+ 
Sbjct: 55  DAARRSINRANHFYKYSLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQ 114

Query: 124 --PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYG-SSTIA 179
             PC  C   ++ +FN ++S+++KN+ C +  C+ + + +C     C ++  YG +S   
Sbjct: 115 CEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSG 174

Query: 180 ANLSQDTISLA-----TDIVPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQTQ 230
            +LS DT++L      T   P    GC    T N +  +    G++G G G  S + Q  
Sbjct: 175 GDLSVDTLTLESTNGLTVSFPNIVIGC---GTNNILSYEGASSGIVGFGSGPASFITQLG 231

Query: 231 NLYQSTFSYCL-PSFKALSF----SGSLRLGPIG--QPKRIKYTPLLKNPRRSSLYYVNL 283
           +     FSYCL P F   +     +  L  G         +  TP+LK     + YY+ L
Sbjct: 232 SSTGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPE-TFYYLTL 290

Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY----TAVRDVFRRRVGS 339
            A  VG R V+I  G +  N       IIDSGT  T L    Y    +AV D+ +     
Sbjct: 291 EAFSVGNRRVEI--GGVP-NGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVD 347

Query: 340 NLTVTSLGGFDTCYSVPIVA---PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAP 396
           + T T     + CYSV       P IT+ F G +V L   +  + S A  + CLA  ++ 
Sbjct: 348 DPTQT----LNLCYSVKAEGYDFPIITMHFKGADVDLHPISTFV-SVADGVFCLAFESSQ 402

Query: 397 DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           D+      +  N+ QQN  + YD+    +      CT
Sbjct: 403 DHA-----IFGNLAQQNLMVGYDLQQKIVSFKPSDCT 434


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 119/398 (29%), Positives = 169/398 (42%), Gaps = 69/398 (17%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST------VFNSAQSTTFKNL 147
           Y   A +GTP Q L + +DT +   WVPCT    C  CSS       VF+   S++ + +
Sbjct: 103 YAFTASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAAVPVFHPKNSSSSRLV 162

Query: 148 GCQAAQCKQVPN---------PTCGGGACA--------FNLTYGSSTIAANLSQDTISLA 190
           GC+   C  V +         P   G  C         + + YGS + A  L  DT+   
Sbjct: 163 GCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVYGSGSTAGLLIADTLRAP 222

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---AL 247
              V G+  GC   +     PP GL G GRG+ S+ AQ   L  S FSYCL S +     
Sbjct: 223 GRAVSGFVLGCSLVSVHQ--PPSGLAGFGRGAPSVPAQ---LGLSKFSYCLLSRRFDDNA 277

Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGALQF 302
           + SGSL LG  G    ++Y PL+K+           YY+ L  + VG + V +P  A   
Sbjct: 278 AVSGSLVLG--GDNDGMQYVPLVKSAAGDKQPYAVYYYLALSGVTVGGKAVRLPARAFAA 335

Query: 303 NPTTGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-- 356
           N     G I+DSGT FT L      P   AV      R   +  V    G   C+++P  
Sbjct: 336 NAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDVEEGLGLHPCFALPQG 395

Query: 357 ---IVAPTITLMFSGMNVT-LPQDNLLIHSTAGSI------------TCLAM------AA 394
              +  P ++L F G  V  LP +N  + +    +             CLA+      + 
Sbjct: 396 AKSMALPELSLHFKGGAVMQLPLENYFVVAGRAPVPGAGAGAGAAEAICLAVVTDFGGSG 455

Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           A D       ++ + QQQN+ + YD+   RLG  R+ C
Sbjct: 456 AGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQPC 493


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 168/363 (46%), Gaps = 37/363 (10%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-Q 156
           +V   IGTP Q   M +DT +  +W+ C      +++ F+ + S++F  L C    CK +
Sbjct: 89  VVTLPIGTPPQPQQMVLDTGSQLSWIQCHNKTPPTAS-FDPSLSSSFYVLPCTHPLCKPR 147

Query: 157 VPN----PTCGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNS 209
           VP+     TC     C ++  Y   T A  NL ++ ++ + +   P    GC    +  S
Sbjct: 148 VPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGC----SSES 203

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF----SGSLRLGPIGQPKRIK 265
              +G+LG+  G LS   Q +    + FSYC+P+ +  +     +GS  LG      R +
Sbjct: 204 RDARGILGMNLGRLSFPFQAK---VTKFSYCVPTRQPANNNNFPTGSFYLGNNPNSARFR 260

Query: 266 YTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
           Y  +L  P+   +       Y V +  IR+G R ++IPP   + N      T++DSG+ F
Sbjct: 261 YVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMVDSGSEF 320

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCY-----SVPIVAPTITLMFS-GMN 370
           T LV  AY  VR+   R +G  +    + G   D C+      +  +   +   F  G+ 
Sbjct: 321 TFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFEFEKGVE 380

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           + +P++ +L     G + C+ +  + + + +  N+I N  QQN  + +D+ N R+G    
Sbjct: 381 IVVPKERVLA-DVGGGVHCVGIGRS-ERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVA 438

Query: 431 LCT 433
            C+
Sbjct: 439 DCS 441


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 107/400 (26%), Positives = 162/400 (40%), Gaps = 68/400 (17%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGC---------------SSTVFNSA 139
           Y +   +G+ +  + + MDT +D  W PC+   C+ C               + +V  SA
Sbjct: 76  YTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSCSA 135

Query: 140 QSTTFKNLG-------CQAAQC--KQVPNPTCGGGACA-FNLTYGSSTIAANLSQDTISL 189
            + +  + G       C  ++C  + +    C   +C  F   YG  ++ A L +D++SL
Sbjct: 136 AACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSL 195

Query: 190 ATDI------VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSYC 240
            T        V  +TFGC     G    P G+ G GRG LS+ +Q         + FSYC
Sbjct: 196 PTPAPSPPINVRNFTFGCAHTTLGE---PVGVAGFGRGVLSMPSQLATFSPQLGNRFSYC 252

Query: 241 LPSFKALSFSGSLRLGPI-------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           L S  + +     R  P+       G+ + I YT LL+NP+    Y V L  I VG   +
Sbjct: 253 LVS-HSFAADRVRRPSPLILGRYYTGETEFI-YTSLLENPKHPYFYSVGLAGISVGNIRI 310

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS----NLTVTSLGGF 349
             P    + +     G ++DSGT FT L A  Y +V   F  R G        +    G 
Sbjct: 311 PAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGL 370

Query: 350 DTCYSVP--IVAPTITLMFSGM--NVTLPQDNLLIHSTAG---------SITCLAMAAAP 396
             CY     +  P + L F G   NV LP+ N       G          + CL +    
Sbjct: 371 SPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMNGG 430

Query: 397 DNVNSVLN---VIANMQQQNHRILYDVPNSRLGVARELCT 433
           D           + N QQQ   ++YD+  +R+G AR  C+
Sbjct: 431 DEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 470


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 110/373 (29%), Positives = 158/373 (42%), Gaps = 60/373 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVFNSAQSTTFKNLGCQA 151
           YI    IG P Q     +DT ++  W  C+     GC G   T ++ ++S T K + C  
Sbjct: 84  YIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACND 143

Query: 152 AQCKQVPNPTCG--GGACAFNLTYGSSTIAANL------------SQDTISLATDIVPGY 197
             C       C   G ACA    YG+  I   L            S++ +SLA       
Sbjct: 144 TACLLGSETRCARDGKACAVLTAYGAGAIGGFLGTEVFTFGHGQSSENNVSLA------- 196

Query: 198 TFGCIQKAT---GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
            FGCI  +    G+     G++GLGRG LSL +Q   L  + FSYCL P F   + + +L
Sbjct: 197 -FGCITASRLTPGSLDGASGIIGLGRGKLSLPSQ---LGDNKFSYCLTPYFSDAANTSTL 252

Query: 254 RLGPIGQPKRIKY----TPLLKNPRRS---SLYYVNLLAIRVGRRVVDIPPGALQFN--- 303
            +G               P LKNP      S YY+ L  I VG   +D+P  A       
Sbjct: 253 FVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVA 312

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG--GFDTCY------SV 355
           P    GT+IDSG+ FT L+  AY A+RD   R++G+++     G  G D C         
Sbjct: 313 PAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDA 372

Query: 356 PIVAPTITLMF-----SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN---VIA 407
             + P + L F      G +V +P +N        +   +  ++   N    LN   +I 
Sbjct: 373 GKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIG 432

Query: 408 NMQQQNHRILYDV 420
           N  QQ+  +LYD+
Sbjct: 433 NYMQQDMHLLYDL 445


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 117/415 (28%), Positives = 185/415 (44%), Gaps = 43/415 (10%)

Query: 51  PLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTL 110
           P + E  + E+ A D AR   L    V      P+           Y  + K+GTP +  
Sbjct: 38  PPNHELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREF 97

Query: 111 LMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVPN 159
            + +DT +D  WV CT C GC  T         F+   S++   + C   +C    Q  +
Sbjct: 98  NVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES 157

Query: 160 PTCGGGACAFNLTYGSST------IAANLSQDTI---SLATDIVPGYTFGCIQKATGNSV 210
                  C+++  YG  +      I+  +S DT+   +LA +    + FGC    +G+  
Sbjct: 158 GCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQ 217

Query: 211 PPQ----GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
            P+    G+ GLG+GSLS+++Q   Q L    FS+CL   K  S  G + LG I +P  +
Sbjct: 218 RPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK--SGGGIMVLGQIKRPDTV 275

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
            YTPL+ +      Y VNL +I V  +++ I P    F   TG GTIID+GT    L   
Sbjct: 276 -YTPLVPSQPH---YNVNLQSIAVNGQILPIDPSV--FTIATGDGTIIDTGTTLAYLPDE 329

Query: 325 AYTAVRDVFRRRV---GSNLTVTSLGGFDTCYSVPIVAPTITLMFSG--MNVTLPQDNLL 379
           AY+         V   G  +T  S   F+       V P ++L F+G    V  P+  L 
Sbjct: 330 AYSPFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQ 389

Query: 380 IHSTAG-SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           I S++G SI C+         +  + ++ ++  ++  ++YD+   R+G A   C+
Sbjct: 390 IFSSSGSSIWCIGFQRMS---HRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 117/409 (28%), Positives = 176/409 (43%), Gaps = 52/409 (12%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMD 115
           E    +   DQ RL+ +    VA     PI+       +  Y  R  +GTP Q   + +D
Sbjct: 11  EYYRTLREHDQRRLRRILPEVVA----FPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVD 66

Query: 116 TSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLGCQAAQCKQVPNPTC--GGG 165
           T +D AWV C  C  C          ++F+  +ST+  ++ C   +C    N  C     
Sbjct: 67  TGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNSKCSFNSM 126

Query: 166 ACAFNLTYGS-STIAANLSQDTISL---------ATDIVPGYTFGCIQKATGNSVPPQGL 215
           +C ++  YG  S+ A  L  D +S          AT      TFGC    TG  +   GL
Sbjct: 127 SCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTWL-TDGL 185

Query: 216 LGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNP 273
           +G G+  +SL +Q   QN+  + F++CL        SG+L +G I +P  + YTP++  P
Sbjct: 186 VGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDN--KGSGTLVIGHIREPGLV-YTPIV--P 240

Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY----TAV 329
           ++S  Y V LL I V    V  P     F+ +   G I+DSGT  T LV PAY      V
Sbjct: 241 KQSH-YNVELLNIGVSGTNVTTPTA---FDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKV 296

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTL--PQDNLL--IHSTAG 385
           RD  R  V   L V     F    ++    P +TL F+G    L  P   L   + +T  
Sbjct: 297 RDCMRSGV---LPV----AFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGL 349

Query: 386 SITCLAMAAAPDNVNSV-LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           S  C +   +      +   +  +   ++  ++YD  N+R+G     CT
Sbjct: 350 SAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDCT 398


>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
 gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
          Length = 452

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 109/389 (28%), Positives = 165/389 (42%), Gaps = 66/389 (16%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------TVFNSAQSTTFKNLGCQAA 152
           V   +G P Q + M +DT ++ +W+ C G    S+        FN + S+T+    C + 
Sbjct: 62  VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 121

Query: 153 QCKQ------VPNPTCGG---GACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCI 202
           +C+       VP P C G    +C  +L+Y  ++ A   L+ DT  L         FGC+
Sbjct: 122 ECQWRGRDLPVP-PFCAGPPSXSCRVSLSYADASSADGILAADTFLLGGAPPVXALFGCV 180

Query: 203 QKATG-------NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSFKALSFS 250
              +        +S    GLLG+ RGSLS + QT  L    F+YC+     P    L   
Sbjct: 181 TSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPGDGPGLLVLGGD 237

Query: 251 GSLRLGPIGQPKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           G+  L P     ++ YTPL++  R         Y V L  IRVG  ++ IP   L  + T
Sbjct: 238 GAA-LAP-----QLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHT 291

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT------VTSLGGFDTCY------ 353
               T++DSGT FT L+A AY  ++  F  +  + L           G FD C+      
Sbjct: 292 GAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEAR 351

Query: 354 --SVPIVAPTITLMFSGMNVTLPQDNLLI--------HSTAGSITCLAMAAAPDNVNSVL 403
             +   + P + L+  G  V +  + LL            A ++ CL    + D      
Sbjct: 352 VAAASXMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNS-DMAGMSA 410

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
            VI +  QQN  + YD+ N R+G A   C
Sbjct: 411 YVIGHHHQQNVWVEYDLQNGRVGFAPARC 439


>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
          Length = 454

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 109/389 (28%), Positives = 165/389 (42%), Gaps = 66/389 (16%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------TVFNSAQSTTFKNLGCQAA 152
           V   +G P Q + M +DT ++ +W+ C G    S+        FN + S+T+    C + 
Sbjct: 64  VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 123

Query: 153 QCKQ------VPNPTCGG---GACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCI 202
           +C+       VP P C G    +C  +L+Y  ++ A   L+ DT  L         FGC+
Sbjct: 124 ECQWRGRDLPVP-PFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVRALFGCV 182

Query: 203 QKATG-------NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSFKALSFS 250
              +        +S    GLLG+ RGSLS + QT  L    F+YC+     P    L   
Sbjct: 183 TSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPGDGPGLLVLGGD 239

Query: 251 GSLRLGPIGQPKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           G+  L P     ++ YTPL++  R         Y V L  IRVG  ++ IP   L  + T
Sbjct: 240 GAA-LAP-----QLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHT 293

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT------VTSLGGFDTCY------ 353
               T++DSGT FT L+A AY  ++  F  +  + L           G FD C+      
Sbjct: 294 GAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEAR 353

Query: 354 --SVPIVAPTITLMFSGMNVTLPQDNLLI--------HSTAGSITCLAMAAAPDNVNSVL 403
             +   + P + L+  G  V +  + LL            A ++ CL    + D      
Sbjct: 354 VAAASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNS-DMAGMSA 412

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
            VI +  QQN  + YD+ N R+G A   C
Sbjct: 413 YVIGHHHQQNVWVEYDLQNGRVGFAPARC 441


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 115/448 (25%), Positives = 184/448 (41%), Gaps = 46/448 (10%)

Query: 6   VFFLAFLFLFS--LSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
           +F L F+  FS  LS G            ++++ H  SP SP+       ++  V     
Sbjct: 11  LFSLCFIASFSHALSNGF-----------SVELIHRDSPKSPYYKPTENKYQHFV----- 54

Query: 64  KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
            D AR     +    + S         I     Y++   +GTP   +    DT +D  W+
Sbjct: 55  -DAARRSINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWL 113

Query: 124 ---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYGSSTIA 179
              PC  C   ++ +FN ++S+++KN+ C +  C  V + +C    +C + ++YG S+ +
Sbjct: 114 QCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHS 173

Query: 180 -ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNL 232
             +LS DT+SL +        P    GC     G       G++GLG G +SL+ Q  + 
Sbjct: 174 QGDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSS 233

Query: 233 YQSTFSYCLPSF--KALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
               FSYCL     K  + S  L  G   +     +  TPL+K  +    Y++ L A  V
Sbjct: 234 IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSV 291

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
           G + V+   G            IIDSGT  T + +  YT +       V  +        
Sbjct: 292 GNKRVEF--GGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQ 349

Query: 349 FDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           F  CYS+       P IT  F G ++ L   +  +  T G I C A   +P     + ++
Sbjct: 350 FSLCYSLKSNEYDFPIITAHFKGADIELHSISTFVPITDG-IVCFAFQPSP----QLGSI 404

Query: 406 IANMQQQNHRILYDVPNSRLGVARELCT 433
             N+ QQN  + YD+    +      CT
Sbjct: 405 FGNLAQQNLLVGYDLQQKTVSFKPTDCT 432


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 174/404 (43%), Gaps = 64/404 (15%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-VFNSAQST 142
           P A+  +   + +  V   +GTP Q + M +DT ++ +W+ C G      T  FN++ S+
Sbjct: 42  PAANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSS 101

Query: 143 TFKNLGCQAAQC----KQVPNP----TCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI 193
           ++  + C +  C    + +P P    T    AC  +L+Y  ++ A   L+ DT  L    
Sbjct: 102 SYGAVPCPSTACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGA 161

Query: 194 VP---GYTFGCI----------QKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFS 238
            P   G  FGCI             TG  V     GLLG+ RG+LS + QT       F+
Sbjct: 162 PPVAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGT---RRFA 218

Query: 239 YCLPSFKALSFSGSLRLGPIGQ-PKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRV 292
           YC+   +     G L LG  G     + YTPL++  +         Y V L  IRVG  +
Sbjct: 219 YCIAPGEG---PGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCAL 275

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL------ 346
           + IP   L  + T    T++DSGT FT L+A AY A++  F  +  + L +  L      
Sbjct: 276 LPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ--ARLLLAPLGEPGFV 333

Query: 347 --GGFDTCYSVPI--------VAPTITLMFSGMNVTLPQDNLLI--------HSTAGSIT 388
             G FD C+  P         + P + L+  G  V +  + LL            A ++ 
Sbjct: 334 FQGAFDACFRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVW 393

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           CL    + D       VI +  QQN  + YD+ N R+G A   C
Sbjct: 394 CLTFGNS-DMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 116/370 (31%), Positives = 162/370 (43%), Gaps = 52/370 (14%)

Query: 94  SPTYIVRAKIGTP-AQTLLMAMDTSNDAAWVPCTGCVGCS------STVFNSAQSTTFKN 146
           +P  ++   +GTP AQT+   +D ++   W  C  C   +      +T F    S TF  
Sbjct: 85  APPLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSP 144

Query: 147 LGCQAAQCKQVPNPTCGGGAC-----------AFNLTYGSSTIAAN----LSQDTISLAT 191
           L C +  C  V   TCG               +++LTYG S  AAN    L+ DT +   
Sbjct: 145 LPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGS--AANTSGYLATDTFTFGA 202

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL---S 248
             VPG  FGC   + G+     G++G+GRG+LSL++Q Q      FSY L + +A    S
Sbjct: 203 TAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGS 259

Query: 249 FSGSLRLGPIGQP--KRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPT 305
               +R G    P  KR + TPLL +      YYVNL  +RV G R+  IP G       
Sbjct: 260 ADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRAN 319

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYSVPIVA- 359
              G I+ S T  T L   AY    DV R  V S + + ++ G      D CY+   +A 
Sbjct: 320 GTGGVILSSTTPVTYLEQAAY----DVVRAAVASRIGLPAVNGSAALELDLCYNASSMAK 375

Query: 360 ---PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
              P +TL+F  G ++ L   N         + CL M   P    SVL     + Q    
Sbjct: 376 VKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTM--LPSQGGSVL---GTLLQTGTN 430

Query: 416 ILYDVPNSRL 425
           ++YDV   RL
Sbjct: 431 MIYDVDAGRL 440


>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
          Length = 447

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 174/404 (43%), Gaps = 64/404 (15%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-VFNSAQST 142
           P A+  +   + +  V   +GTP Q + M +DT ++ +W+ C G      T  FN++ S+
Sbjct: 42  PAANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSS 101

Query: 143 TFKNLGCQAAQC----KQVPNP----TCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI 193
           ++  + C +  C    + +P P    T    AC  +L+Y  ++ A   L+ DT  L    
Sbjct: 102 SYGAVPCPSTACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGA 161

Query: 194 VP---GYTFGCI----------QKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFS 238
            P   G  FGCI             TG  V     GLLG+ RG+LS + QT       F+
Sbjct: 162 PPVAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGT---RRFA 218

Query: 239 YCLPSFKALSFSGSLRLGPIGQ-PKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRV 292
           YC+   +     G L LG  G     + YTPL++  +         Y V L  IRVG  +
Sbjct: 219 YCIAPGEG---PGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCAL 275

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL------ 346
           + IP   L  + T    T++DSGT FT L+A AY A++  F  +  + L +  L      
Sbjct: 276 LPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ--ARLLLAPLGEPGFV 333

Query: 347 --GGFDTCYSVPI--------VAPTITLMFSGMNVTLPQDNLLI--------HSTAGSIT 388
             G FD C+  P         + P + L+  G  V +  + LL            A ++ 
Sbjct: 334 FQGAFDACFRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVW 393

Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           CL    + D       VI +  QQN  + YD+ N R+G A   C
Sbjct: 394 CLTFGNS-DMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
          Length = 225

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 85/229 (37%), Positives = 111/229 (48%), Gaps = 12/229 (5%)

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           V   GLLGLG G +S + Q       TFSYCL S +    SGSL  G    P    +  L
Sbjct: 3   VGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVS-RGTESSGSLEFGRESVPVGASWVSL 61

Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           + NPR  S YY+ L  + VG   V I     + N     G ++D+GT  TRL A AY A 
Sbjct: 62  IHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAYNAF 121

Query: 330 RDVFRRRVGSNLTVTS-LGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHST 383
           RD F  +  +NL  TS +  FDTCY     V +  PTI+  F G  + TLP  N LI   
Sbjct: 122 RDAFVAQT-TNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIPVD 180

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +    C A A +    +S L++I N+QQ+   I  D  N  +G    +C
Sbjct: 181 SVGTFCFAFAPS----SSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 161/382 (42%), Gaps = 39/382 (10%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--------CSST 134
           +P+ SG   T +  Y V+ ++GTPAQ  ++  DT +D  WV C G            S  
Sbjct: 97  MPLTSG-AYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPR 155

Query: 135 VFNSAQSTTFKNLGCQAAQCKQ-VPN--PTCGGGA-----CAFNLTYGSSTIAANL---S 183
           VF  A S ++  + C +  CK  VP     C  G      C ++  Y   + A  +    
Sbjct: 156 VFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTD 215

Query: 184 QDTISLATD------IVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQST 236
             TI+L+         +     GC     G S     G+L LG  ++S  ++    +   
Sbjct: 216 AATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGR 275

Query: 237 FSYCLPSFKALSFSGS-LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           FSYCL    A   + S L  GP+G       TPLL + + +  Y V + A+ V  + ++I
Sbjct: 276 FSYCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNI 335

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
           P  A  ++     G I+DSGT  T L  PAY AV     +++     VT +  F+ CY+ 
Sbjct: 336 P--AEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVT-MDPFEYCYNW 392

Query: 356 -----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
                P   P + + F+G     P     +   A  + C+ +    + V   ++VI N+ 
Sbjct: 393 TATRRPPAVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQ---EGVWPGVSVIGNIL 449

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQ H   +D+ N  L      C
Sbjct: 450 QQEHLWEFDLANRWLRFQESRC 471


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 114/448 (25%), Positives = 177/448 (39%), Gaps = 79/448 (17%)

Query: 57  SVLEMLAKDQARLQFLSSLAVARKS------VVPIASGRQITQSPTYIVRAKIGTPAQTL 110
           S+ ++   D+ R+ F+SS    R +       +P++SG   T +  Y VR ++GTPAQ  
Sbjct: 42  SLADLARMDRERMAFISSRGRRRAAETASAFAMPLSSG-AYTGTGQYFVRFRVGTPAQPF 100

Query: 111 LMAMDTSNDAAWVPCTGCV------------------GCSSTVFNSAQSTTFKNLGCQAA 152
           L+  DT +D  WV C                            F   +S T+  + C +A
Sbjct: 101 LLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRPDKSRTWAPIPCSSA 160

Query: 153 QCKQ---VPNPTCGGGA--CAFNLTYGSSTIA---ANLSQDTISLATDI-----VPGYTF 199
            C++        C   A  CA++  Y   + A     +   TI+L+        + G   
Sbjct: 161 TCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVL 220

Query: 200 GCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGP 257
           GC     G S +   G+L LG  ++S  ++  + +   FSYCL    A   + S L  GP
Sbjct: 221 GCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGP 280

Query: 258 ------------IGQPKRI-------------KYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
                       I   K               + TPL+ + R    Y V +  + V   +
Sbjct: 281 NPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGEL 340

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
           + IP     ++   G G I+DSGT  T L  PAY AV     +R+ + L   ++  FD C
Sbjct: 341 LKIPRAV--WDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRL-AGLPRVTMDPFDYC 397

Query: 353 YS--------VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
           Y+        V    P + + F+G     P     +   A  + C+ +   P      L+
Sbjct: 398 YNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQEGP---WPGLS 454

Query: 405 VIANMQQQNHRILYDVPNSRLGVARELC 432
           VI N+ QQ H   YD+ N RL   R  C
Sbjct: 455 VIGNILQQEHLWEYDLKNRRLRFKRSRC 482


>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
          Length = 340

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 80/262 (30%), Positives = 125/262 (47%), Gaps = 26/262 (9%)

Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-AN 181
            PC G   C    F+ ++S++F  + C + +C       C G +C F + +G+ T+A   
Sbjct: 21  APCVGGAPCD-VAFDPSRSSSFAAIPCGSPECAV----ECTGASCPFTIQFGNVTVANGT 75

Query: 182 LSQDTISLA-TDIVPGYTFGCIQKATGNSV--PPQGLLGLGRGSLSLLAQT-----QNLY 233
           L +DT++L+ +    G+TFGCI+            GL+ L R S SL ++          
Sbjct: 76  LVRDTLTLSPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTTT 135

Query: 234 QSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
            + FSYCLPS  +    G L +G   P      IKY P+  NP   + Y+V+L+ I VG 
Sbjct: 136 TAAFSYCLPSLSSTRSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGG 195

Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
             + +PP  L  +     GT++++ T FT L   AY A+RD FR  +            D
Sbjct: 196 EDLPVPPAVLAAH-----GTLLEAATEFTFLAPAAYAALRDAFRNDMAQYPAAPPFRVLD 250

Query: 351 TCYSV----PIVAPTITLMFSG 368
           TCY++     +  P + L F+G
Sbjct: 251 TCYNLTGLASLAVPAVALRFAG 272


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 112/374 (29%), Positives = 164/374 (43%), Gaps = 59/374 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS---TVFNSAQSTTFKNLGCQA 151
           YI    IG P Q     +DT ++  W  C+ C   GC S   + ++ ++S T + + C  
Sbjct: 71  YIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACND 130

Query: 152 AQCKQVPNPTCG--GGACAFNLTYGSSTIAANLSQDTISLA--TDIVPGYTFGCIQKAT- 206
             C       C     ACA    YG+  I   L  +  +    ++ V    FGCI     
Sbjct: 131 TACALGSETRCARDNKACAVLTAYGAGVIGGVLGTEAFTFQPQSENV-SLAFGCIAATRL 189

Query: 207 --GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKA------LSFSGSLRLGP 257
             G+     G++GLGRG+LSL++Q   L  + FSYCL P F        L    S  L  
Sbjct: 190 TPGSLDGASGIIGLGRGNLSLVSQ---LGDNKFSYCLTPYFSQSTNTSRLFVGASAGLSS 246

Query: 258 IGQPKRIKYTPLLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNP-TTG--AGTI 311
            G P      P LKNP     S+ YY+ L  I VG   + +P  A       TG  AGT+
Sbjct: 247 GGAPA--TSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLWAGTL 304

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG--GFDTCYSVP-----IVAPTITL 364
           IDSG+ FT LV  AY A+RD   +++G+++     G  G D C +V       + P + L
Sbjct: 305 IDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGKLVPPLVL 364

Query: 365 MF--SGMNVTLPQDN-----------LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
            F   G +V +P +N           +++ S+ G  + L M        +   +I N  Q
Sbjct: 365 HFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPM--------NETTIIGNYMQ 416

Query: 412 QNHRILYDVPNSRL 425
           Q+  +LYD+    L
Sbjct: 417 QDMHLLYDLEKGML 430


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 116/370 (31%), Positives = 162/370 (43%), Gaps = 52/370 (14%)

Query: 94  SPTYIVRAKIGTP-AQTLLMAMDTSNDAAWVPCTGCVGCS------STVFNSAQSTTFKN 146
           +P  ++   +GTP AQT+   +D ++   W  C  C   +      +T F    S TF  
Sbjct: 85  APPLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSP 144

Query: 147 LGCQAAQCKQVPNPTCGGGAC-----------AFNLTYGSSTIAAN----LSQDTISLAT 191
           L C +  C  V   TCG               +++LTYG S  AAN    L+ DT +   
Sbjct: 145 LPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGS--AANTSGYLATDTFTFGA 202

Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL---S 248
             VPG  FGC   + G+     G++G+GRG+LSL++Q Q      FSY L + +A    S
Sbjct: 203 TAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGS 259

Query: 249 FSGSLRLGPIGQP--KRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPT 305
               +R G    P  KR + TPLL +      YYVNL  +RV G R+  IP G       
Sbjct: 260 ADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRAN 319

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYSVPIVA- 359
              G I+ S T  T L   AY    DV R  V S + + ++ G      D CY+   +A 
Sbjct: 320 GTGGVILSSTTPVTYLEQAAY----DVVRAAVASRIGLPAVNGSAALELDLCYNASSMAK 375

Query: 360 ---PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
              P +TL+F  G ++ L   N         + CL M   P    SVL     + Q    
Sbjct: 376 VKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTM--LPSQGGSVL---GTLLQTGTN 430

Query: 416 ILYDVPNSRL 425
           ++YDV   RL
Sbjct: 431 MIYDVDAGRL 440


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 85/280 (30%), Positives = 134/280 (47%), Gaps = 46/280 (16%)

Query: 5   LVFFLAFLFLFSLSE----------GLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSW 54
           LV+FL + +L + +            L P  +       + + HV  P S   P  P+S+
Sbjct: 3   LVWFLGWFYLLATASSFVEKENEAVALGPRVNQSGGVVQMTIHHVHGPGSSLAPQPPVSF 62

Query: 55  EESVLEMLAKDQARLQFLSSLAVAR-----------------KSV-VPIASGRQITQSPT 96
            +    +LA D AR++ L+S    +                 KSV VP+  G  I  S  
Sbjct: 63  SD----VLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIG-SGN 117

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAA 152
           Y V+   G+PA+   M +DT +  +W+ C  CV  C   +  +F+ + S T+K+L C ++
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177

Query: 153 QCKQV-----PNPTC--GGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQ 203
           QC  +      NP C      C +  +YG S+ +   LSQD ++LA +  +PG+ +GC Q
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQ 237

Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
            + G      G+LGLGR  LS+L Q  + +   FSYCLP+
Sbjct: 238 DSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPT 277


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/411 (23%), Positives = 167/411 (40%), Gaps = 51/411 (12%)

Query: 69  LQFLSSLAVARKSVVPIASGRQITQSPTY-------IVRAKIGTPAQTLLMAMDTSNDAA 121
           LQ L++ +++R   +       + Q+  +        +    GTP Q L   +DT +   
Sbjct: 52  LQHLATASMSRSHHLKHGKASPLIQTSLFPHSHGGHTIPLSFGTPPQKLSFLVDTGSHVV 111

Query: 122 WVPCT---GCVGCSST------VFNSAQSTTFKNLGCQAAQCKQVPNPT-------CGGG 165
           W PCT    C  CS +      +FN   S++ K LGC+  +C    +P        C G 
Sbjct: 112 WAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGCPRCNGN 171

Query: 166 ------AC-AFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
                 AC  + L YG+   +     + +      +  +  GC   A         L G 
Sbjct: 172 SKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSAD-REPSSDALAGF 230

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRR 275
           GR   SL  Q   +    F+YCL S  +     SG L L    G+ + + Y P LKNP  
Sbjct: 231 GRTMFSLPMQ---MGVKKFAYCLNSHDYDDTRNSGKLILDYSDGETQGLSYAPFLKNPPD 287

Query: 276 SSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
               YY+ +  +++G +++ IP   L     +  G +IDSG  +  +  P +  V +  +
Sbjct: 288 YPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELK 347

Query: 335 RRVGS---NLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGS 386
           +++     +L   +  G   CY+      I  P +   F+ G N+ +P  N  +  +  S
Sbjct: 348 KQMSKYRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEAS 407

Query: 387 ITCLAMAAAPDNVNSVLN-----VIANMQQQNHRILYDVPNSRLGVARELC 432
           + C  +       N         ++ N QQ +H + +D+ N RLG  ++ C
Sbjct: 408 LGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 88/305 (28%), Positives = 138/305 (45%), Gaps = 34/305 (11%)

Query: 76  AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST- 134
           A  R  +V  A G    +   Y+V   +GTP + + + +DT +D  W  C  C  C    
Sbjct: 68  ARVRAGLVAAAGGIATNE---YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQG 124

Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLAT 191
             + + A S+T+  L C A +C+ +P  +CGG +C +   YG  ++    ++ D  +   
Sbjct: 125 IPLLDPAASSTYAALPCGAPRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGD 184

Query: 192 D-------IVPG---YTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYC 240
           +        +P     TFGC     G     + G+ G GRG  SL +Q   L  ++FSYC
Sbjct: 185 NGRRNGDGSLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQ---LNATSFSYC 241

Query: 241 LPS-FKALSFSGSLRLGPI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
             S F + S   +L   P           ++ TPL KNP + SLY+++L  I VG+  + 
Sbjct: 242 FTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLP 301

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS 354
           +P    +        TIIDSG   T L    Y AV+  F  +VG   +       D C++
Sbjct: 302 VPETKFR-------STIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFA 354

Query: 355 VPIVA 359
           +P+ A
Sbjct: 355 LPVSA 359


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 125/447 (27%), Positives = 187/447 (41%), Gaps = 54/447 (12%)

Query: 6   VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
           +FF   LFL S S+         D+  T  +FH  S  SP + S  LS  + +     + 
Sbjct: 7   LFFHLILFLISFSQ---TTIINGDNGFTTSLFHRDSLLSPLEFSS-LSHYDRLANAFRRS 62

Query: 66  QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
            +R     S A+  ++    A G Q          + IGTP    L   DT +D  W  C
Sbjct: 63  LSR-----SAALLNRAATSGAVGLQ---------SSIIGTPPVDYLGIADTGSDLTWAQC 108

Query: 126 TGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIA-A 180
             C+ C      +FN  +ST+F ++ C    C  V +  CG  G C ++ TYG  T +  
Sbjct: 109 LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKG 168

Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL--YQSTFS 238
           +L  + I++ +  V     GC   ++G      G++GLG G LSL++Q          FS
Sbjct: 169 DLGFEKITIGSSSVKS-VIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFS 227

Query: 239 YCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG--RRVV 293
           YCLP+  + + +G +  G    +  P  +  TPL+ +    + YY+ L AI +G  R + 
Sbjct: 228 YCLPTLLSHA-NGKINFGQNAVVSGPGVVS-TPLI-SKNTVTYYYITLEAISIGNERHMA 284

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
               G +          IIDSGT  + L    Y  V     + V +         +D C+
Sbjct: 285 FAKQGNV----------IIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCF 334

Query: 354 SVPI-VA-----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
              I VA     P IT  FS G NV L   N      A ++ CL +   P +      +I
Sbjct: 335 DDGINVATSSGIPIITAQFSGGANVNLLPVNTF-QKVANNVNCLTL--TPASPTDEFGII 391

Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
            N+   N  I YD+   RL     +CT
Sbjct: 392 GNLALANFLIGYDLEAKRLSFKPTVCT 418


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 111/421 (26%), Positives = 183/421 (43%), Gaps = 53/421 (12%)

Query: 51  PLSWEESVLEMLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           P + E  + ++ A+D+AR  + L SL        P+           Y  + ++GTP + 
Sbjct: 36  PANHEMELSQLKARDEARHGRLLQSLGGVID--FPVDGTFDPFVVGLYYTKLRLGTPPRD 93

Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVP 158
             + +DT +D  WV C  C GC  T         F+   S T   + C   +C    Q  
Sbjct: 94  FYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSS 153

Query: 159 NPTCG--GGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYT----FGCIQKATG 207
           +  C      CA+   YG     S    +++ Q  + + + +VP  T    FGC    TG
Sbjct: 154 DSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213

Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           + V       G+ G G+  +S+++Q  +Q +    FS+CL         G L LG I +P
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENG--GGGILVLGEIVEP 271

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
             + +TPL+ +      Y VNLL+I V  + + I P    F+ + G GTIID+GT    L
Sbjct: 272 NMV-FTPLVPSQPH---YNVNLLSISVNGQALPINPSV--FSTSNGQGTIIDTGTTLAYL 325

Query: 322 VAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTITLMFSGMNVTL--P 374
              AY    +     V  ++  V S G  + CY    SV  + P ++L F+G       P
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKG--NQCYVITTSVGDIFPPVSLNFAGGASMFLNP 383

Query: 375 QDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           QD L+  +  G  ++ C+         N  + ++ ++  ++   +YD+   R+G A   C
Sbjct: 384 QDYLIQQNNVGGTAVWCIGFQRIQ---NQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440

Query: 433 T 433
           +
Sbjct: 441 S 441


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 113/403 (28%), Positives = 162/403 (40%), Gaps = 77/403 (19%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTP--AQTLLMAMDTSNDAAWVPCTG-----CVG----- 130
           +P+A G        Y +   +G P  A ++ + +DT +D  W PC       C G     
Sbjct: 80  LPLAPGSD------YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPG 133

Query: 131 -----------------CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FN 170
                            C+S + ++A S+   +  C AA+C    +   +C   AC    
Sbjct: 134 GNHSSPLPPPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLY 193

Query: 171 LTYGSSTIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
             YG  ++ ANL +  + LA  + V  +TF C   A      P G+ G GRG LSL AQ 
Sbjct: 194 YAYGDGSLVANLRRGRVGLAASMAVENFTFACAHTALAE---PVGVAGFGRGPLSLPAQL 250

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRV 288
                           A S SGS     IG  +    YTPLL NP+    Y V L A+ V
Sbjct: 251 ----------------APSLSGSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSV 294

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG- 347
           G + +   P     +     G ++DSGT FT L +  +  V D F R + +     + G 
Sbjct: 295 GGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGA 354

Query: 348 ----GFDTCYSV---PIVAPTITLMFSG-MNVTLPQDNLLI--HSTAG-SITCLAMAAAP 396
               G   CY         P + L F G   V LP+ N  +   S  G S+ CL +    
Sbjct: 355 EAQTGLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVG 414

Query: 397 DNVNSVLN------VIANMQQQNHRILYDVPNSRLGVARELCT 433
            N +   +       + N QQQ   ++YDV   R+G AR  CT
Sbjct: 415 GNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 457


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 164/361 (45%), Gaps = 40/361 (11%)

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQ 150
           Q+  Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C 
Sbjct: 78  QTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCG 136

Query: 151 AAQC-KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQ 203
            + C     +P C        C F ++Y   + +   L QDT++ +    +PG++FGC  
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNM 196

Query: 204 KATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLG 256
            + G +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG
Sbjct: 197 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLG 255

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            +     ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+
Sbjct: 256 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRKGVVFDSGS 310

Query: 317 VFT----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-S 367
             +    R ++     +R++  +R  +            CY +  V     P I+L F  
Sbjct: 311 ELSYIPDRALSVLSQRIRELLLKRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDD 365

Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           G    L    + +  +        +A AP      +++I ++ Q +  ++YD+    +G+
Sbjct: 366 GARFDLGSHGVFVERSVQEQDVWCLAFAP---TESVSIIGSLMQTSKEVVYDLKRQLIGI 422

Query: 428 A 428
            
Sbjct: 423 G 423


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 119/427 (27%), Positives = 198/427 (46%), Gaps = 43/427 (10%)

Query: 30  HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQAR----LQFLSSLAVA--RKSVV 83
           H  T  +F   SP SP   +  LS  +S+++   +  +R    L  L+S++ A  R  ++
Sbjct: 26  HGFTTSLFRRDSPLSPLH-NPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPII 84

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQ 140
           P         S  +++   IGTP   ++   DT +D  W   +PC  C   S  +FN  +
Sbjct: 85  P--------DSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRR 136

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGG--ACAFNLTYGSSTIA-ANLSQDTISLATDIVPGY 197
           S++++ + C +  C+ + +  CG    +C++  +YG  +    +L+ D I++ +  +P  
Sbjct: 137 SSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKT 196

Query: 198 TFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNL--YQSTFSYCLPS-FKALSFSGSL 253
             GC  +  G       G++GLG GSLSL++Q + +   +  FSYCLP+ F   + +G++
Sbjct: 197 VIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTI 256

Query: 254 RLG--PIGQPKRIKYTPLLKNPRR-SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
             G   +   +++  TPL+  PR   + Y++ L AI VG++      G      T     
Sbjct: 257 SFGRKAVVSGRQVVSTPLV--PRSPDTFYFLTLEAISVGKKRFKAANGISAM--TNHGNI 312

Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMF 366
           IIDSGT  T L    Y  V     R + +       G  + CYS   V     P IT  F
Sbjct: 313 IIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHF 372

Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           + G +V L   N      A ++TCL  A A     + + +  N+ Q N  + YD+ N RL
Sbjct: 373 AGGADVKLLPVNTFA-PVADNVTCLTFAPA-----TQVAIFGNLAQINFEVGYDLGNKRL 426

Query: 426 GVARELC 432
               +LC
Sbjct: 427 SFEPKLC 433


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 115/411 (27%), Positives = 172/411 (41%), Gaps = 59/411 (14%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
           EE V     +   RL  +  +        PI  G Q      YI    IG P Q     +
Sbjct: 39  EERVRRATERTHRRLASMGGV------TAPIHWGGQ----SQYIAEYLIGDPPQRAEAII 88

Query: 115 DTSNDAAWVPCTGCVGCSSTVF-------NSAQSTTFKNLGCQAAQCKQVPNPTC--GGG 165
           DT ++  W   T C  C  T F       + ++S   + +GC  A C       C     
Sbjct: 89  DTGSNLIW---TQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAACALGSETQCLSDNK 145

Query: 166 ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGNSVPPQGLLGLGRGS 222
            CA    YG+  IA  L+ + ++  ++ V    FGCI   + + G+     G++GLGRG 
Sbjct: 146 TCAVVTGYGAGNIAGTLATENLTFQSETV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGK 204

Query: 223 LSLLAQTQNLYQSTFSYCL---------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNP 273
           LSL +Q   L  + FSYCL         PS   +  S  L  G       +   P +++P
Sbjct: 205 LSLPSQ---LGDTRFSYCLTPYFEDTIEPSHMVVGASAGLINGS-ASSTPVTTVPFVRSP 260

Query: 274 RR---SSLYYVNLLAIRVGRRVVDIPPGAL---QFNPTTGAGTIIDSGTVFTRLVAPAYT 327
                S+ YY+ L  I  G+  + +P  A    Q  P    GT IDSG   T LV  AY 
Sbjct: 261 SDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQ 320

Query: 328 AVRDVFRRRVGSNLTVTSLG--GFDTCYSV---PIVAPTITLMF-----SGMNVTLPQDN 377
           A+R    R++G+ L     G  GFD C ++     + P + L F     +G ++ +P  N
Sbjct: 321 ALRAELARQLGAALVQPLAGTTGFDLCVALKDAERLVPPLVLHFGGGSGTGTDLVVPPAN 380

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNSRL 425
                 + +  C+ + ++ D  +  +N   VI N  QQN  +LYD+    L
Sbjct: 381 YWAPVDSAT-ACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVL 430


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 165/362 (45%), Gaps = 43/362 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP QT  + +DT +   +VPC+ C  C       F    S+T++ L C + +
Sbjct: 92  YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SME 150

Query: 154 CKQVPNPTCGGG--ACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATG 207
           C      TC      C ++  Y   S+ +  L +D +S    +++ P  T FGC    TG
Sbjct: 151 C------TCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVETG 204

Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
           +  S    G++GLGRG LS++ Q   + +  ++FS C          G++ LG I  P  
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDV--GGGAMVLGGISPPAG 262

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           + +T    +P RS+ Y ++L  I +  + + I P           GTI+DSGT +  L  
Sbjct: 263 MVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYLPE 316

Query: 324 PAYTAVRDVFRRRVGSNLTVT--SLGGFDTCYS--------VPIVAPTITLMFS-GMNVT 372
           PA+ A +D   + + S   +        D C+S        +    P + L+FS G  ++
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376

Query: 373 L-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
           L P++ L  HS A    CL +     N N    ++  +  +N  ++YD  + ++G  +  
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIF---QNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTN 433

Query: 432 CT 433
           C+
Sbjct: 434 CS 435


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 106/418 (25%), Positives = 171/418 (40%), Gaps = 60/418 (14%)

Query: 69  LQFLSSLAVARKS-VVPIASGR-----QITQSPT----YIVRAKIGTPAQTLLMAMDTSN 118
           L+FL  LA A  S    +  G+     QI+ SP     + +    GTP Q L   +DT +
Sbjct: 49  LRFLQHLATASLSRAHHLKHGKTSPLTQISLSPHSYGGHSIPLSFGTPPQKLSFLVDTGS 108

Query: 119 DAAWVPCT---GCVGCSST--------VFNSAQSTTFKNLGCQAAQCKQVPNPT------ 161
              W PCT    C  CS +        +FN   S++ K LGC+  +C    +P       
Sbjct: 109 HVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGCP 168

Query: 162 -CGGG------AC-AFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ 213
            C G       AC  ++L YG+   + +   + ++     +  +  GC   A G  V   
Sbjct: 169 PCNGNSKNCSHACPPYSLQYGTGASSGDFLLENLNFPGKTIHEFLVGCTTSAVGE-VTSA 227

Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLL 270
            L G GR   SL  Q   +    F+YCL S        S +L      G+ K + Y P L
Sbjct: 228 ALAGFGRSMFSLPMQ---MGVKKFAYCLNSHDYDDTRNSSKLILDYSDGETKGLSYAPFL 284

Query: 271 KNPRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
           KNP    + YY+ +  I++G +++ IP   L        G +IDSG  +  +  P +  V
Sbjct: 285 KNPPDFPIYYYLGVKDIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKV 344

Query: 330 RDVFRRRVGS---NLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLPQDNLLIH 381
            +  ++R+     +L   +  G   CY+      I  P +   F  G  + +P  N  + 
Sbjct: 345 TNELKKRMSKYRRSLEAEAEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFVL 404

Query: 382 STAGSITCLAMAAAPDNVNSVLN-------VIANMQQQNHRILYDVPNSRLGVARELC 432
               S+ C  +    D   + L        ++ N Q  ++ + +D+ N RLG  ++ C
Sbjct: 405 IPEISLACFPLTT--DAGTNTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQTC 460


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 100/369 (27%), Positives = 164/369 (44%), Gaps = 46/369 (12%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKN----------L 147
           +V   IGTP Q   M +DT +  +W+ C        T       TT             L
Sbjct: 83  VVTLPIGTPPQLQQMVLDTGSQLSWIQCHN----KKTPQKKQPPTTSSFDPSLSSSFFVL 138

Query: 148 GCQAAQCK-QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTF 199
            C    CK +VP+   PT C   + C ++  Y   T A  NL ++ I+ + +   P    
Sbjct: 139 PCNHPLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPIIL 198

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           GC  +    S   +G+LG+  G L   +Q +    + FSYC+P+ +A   SGS  LG   
Sbjct: 199 GCATQ----SDDARGILGMNLGRLGFPSQAK---ITKFSYCVPTKQAQPASGSFYLGNNP 251

Query: 260 QPKRIKYTPLL---KNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
                +Y  LL   ++ R  +L    Y + L  I +G + ++IPP   + N      T+I
Sbjct: 252 ASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMI 311

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVP------IVAPTITL 364
           DSG+ FT LV  AY  +R+   ++VG  +    + G   D C+         +V   +  
Sbjct: 312 DSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDGDAIEIGRLVGDMVFE 371

Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
              G+ + +P++ +L  +  G + CL M  + + + +  N+I N  QQN  + +D+ N R
Sbjct: 372 FEKGVQIVIPKERVLA-TVDGGVHCLGMGRS-ERLGAGGNIIGNFHQQNLWVEFDLANRR 429

Query: 425 LGVARELCT 433
           +G     C+
Sbjct: 430 VGFGEADCS 438


>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
          Length = 429

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 109/414 (26%), Positives = 174/414 (42%), Gaps = 81/414 (19%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNS-------- 138
           T +  Y++   +G P Q   + +DT +D  WVPC       C+ C +    S        
Sbjct: 20  TYTDGYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSP 79

Query: 139 -AQSTTFKNLGCQAAQCKQV-----PNPTCGGGACA---------------FNLTYGSST 177
              S+  K L C +  C  +      +  C    CA               F+ TYG   
Sbjct: 80  SQSSSNMKEL-CGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSGLCTRPCPPFSYTYGGGA 138

Query: 178 IA-ANLSQDTISLATDI--------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLA 227
           +   +L++D ++L   I        VPG+ FGC+    G+S+  P G+ G G+G LSL +
Sbjct: 139 LVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGCV----GSSIREPIGIAGFGKGILSLPS 194

Query: 228 QTQNLYQSTFSYCLPSFKAL---SFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVN 282
           Q   L    FS+C   F+     +F+ SL +G +    +    +TP+LK+    + YY+ 
Sbjct: 195 QLGFL-DKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPNFYYIG 253

Query: 283 LLAIRVGR-RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV----RDVFRRRV 337
           L  + +G    +  PP     +     G I+D+GT +T L  P YTA+      V     
Sbjct: 254 LEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLASVILYER 313

Query: 338 GSNLTVTSLGGFDTCYSVPIVA--------PTITLMFSG-MNVTLPQDNLLIHSTAGS-- 386
             +L + +  GFD C+ +P           P I   F G + +TLP+D+     TA    
Sbjct: 314 SYDLEMRT--GFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAPKNS 371

Query: 387 --ITCLAM--AAAPDNVNSVLN----VIANMQQQNHRILYDVPNSRLGVARELC 432
             + CL        D+V    N    V+ + Q QN  ++YD+   R+G   + C
Sbjct: 372 VVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDC 425


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 165/362 (45%), Gaps = 43/362 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP QT  + +DT +   +VPC+ C  C       F    S+T++ L C + +
Sbjct: 92  YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SME 150

Query: 154 CKQVPNPTCGGG--ACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATG 207
           C      TC      C ++  Y   S+ +  L +D +S    +++ P  T FGC    TG
Sbjct: 151 C------TCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVETG 204

Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
           +  S    G++GLGRG LS++ Q   + +  ++FS C          G++ LG I  P  
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDV--GGGAMVLGGISPPAG 262

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           + +T    +P RS+ Y ++L  I +  + + I P           GTI+DSGT +  L  
Sbjct: 263 MVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYLPE 316

Query: 324 PAYTAVRDVFRRRVGSNLTVT--SLGGFDTCYS--------VPIVAPTITLMFS-GMNVT 372
           PA+ A +D   + + S   +        D C+S        +    P + L+FS G  ++
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376

Query: 373 L-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
           L P++ L  HS A    CL +     N N    ++  +  +N  ++YD  + ++G  +  
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIF---QNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTN 433

Query: 432 CT 433
           C+
Sbjct: 434 CS 435


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 106/370 (28%), Positives = 158/370 (42%), Gaps = 58/370 (15%)

Query: 91  ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
            T    Y     +G+P +   + MDT +D  WV C  C    S+ F+   S T+K L C 
Sbjct: 118 FTNGGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASNTYKALTC- 176

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTIAANLS-QDTISLA------TDIVPGYTFGCIQ 203
            A   ++P            L        +  S +DT+ +A       +  PG+ FGC  
Sbjct: 177 -ADDLRLP----------VLLRLWRRLFHSGRSLRDTLKMAGAASDELEEFPGFVFGCGS 225

Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI----- 258
              G      G+L L  GSLS  +Q    Y + FSYCL    A     SL+  P+     
Sbjct: 226 LLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTA---QNSLKKSPMVFGEA 282

Query: 259 ---------GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGA 308
                    G+P+ ++YTP+      SS+YY V L  I VG + +D+ P    F      
Sbjct: 283 AVELKEPGSGKPQELQYTPI----GESSIYYTVRLDGISVGNQRLDLSPST--FLNGQDK 336

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITL 364
            TI DSGT  T L +    +++      V S     ++ G D C+ VP  +    P IT 
Sbjct: 337 PTIFDSGTTLTMLPSGVCDSIKQSLASMV-SGAEFVAIKGLDACFRVPPSSGQGLPDITF 395

Query: 365 MFSGMN--VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
            F+G    VT P + ++     GS+ CL     P N    +++  N+QQQ+  +L+D+ N
Sbjct: 396 HFNGGADFVTRPSNYVI---DLGSLQCLIF--VPTN---EVSIFGNLQQQDFFVLHDMDN 447

Query: 423 SRLGVARELC 432
            R+G     C
Sbjct: 448 RRIGFKETDC 457


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 111/421 (26%), Positives = 183/421 (43%), Gaps = 53/421 (12%)

Query: 51  PLSWEESVLEMLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           P + E  + ++ A+D+AR  + L SL        P+           Y  + ++GTP + 
Sbjct: 36  PANHEMELSQLKARDEARHGRLLQSLGGVID--FPVDGTFDPFVVGLYYTKLRLGTPPRD 93

Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVP 158
             + +DT +D  WV C  C GC  T         F+   S T   + C   +C    Q  
Sbjct: 94  FYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSS 153

Query: 159 NPTCG--GGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYT----FGCIQKATG 207
           +  C      CA+   YG     S    +++ Q  + + + +VP  T    FGC    TG
Sbjct: 154 DSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213

Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           + V       G+ G G+  +S+++Q  +Q +    FS+CL         G L LG I +P
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENG--GGGILVLGEIVEP 271

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
             + +TPL+ +      Y VNLL+I V  + + I P    F+ + G GTIID+GT    L
Sbjct: 272 NMV-FTPLVPSQPH---YNVNLLSISVNGQALPINPSV--FSTSNGQGTIIDTGTTLAYL 325

Query: 322 VAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTITLMFSGMNVTL--P 374
              AY    +     V  ++  V S G  + CY    SV  + P ++L F+G       P
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKG--NQCYVITTSVGDIFPPVSLNFAGGASMFLNP 383

Query: 375 QDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           QD L+  +  G  ++ C+         N  + ++ ++  ++   +YD+   R+G A   C
Sbjct: 384 QDYLIQQNNVGGTAVWCIGFQRIQ---NQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440

Query: 433 T 433
           +
Sbjct: 441 S 441


>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
 gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 111/429 (25%), Positives = 163/429 (37%), Gaps = 87/429 (20%)

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVF 136
           R+  +P++ G   T S T          +Q + + +DT +D  W PC    C+ C     
Sbjct: 70  RQVSLPLSPGSDYTLSFT--------LDSQPIFLYLDTGSDLVWFPCQPFECILCEGKAE 121

Query: 137 NSAQSTT-------------FKNLGCQAAQC---------------KQVPNPTCGGGAC- 167
           N++ ++T              K+  C AA                 + +    C   +C 
Sbjct: 122 NTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIETSDCQKHSCP 181

Query: 168 AFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
            F   YG  ++ A L +D+ISL        IV  +TFGC   A      P G+ G GRG 
Sbjct: 182 QFYYAYGDGSLIARLYRDSISLPLSNPTNLIVNNFTFGCAHTALAE---PIGVAGFGRGV 238

Query: 223 LSLLAQTQNL---YQSTFSYCLPSFKALSF----------------SGSLRLGPIGQPKR 263
           LSL AQ   L     + FSYCL S    S                     R+  + +P R
Sbjct: 239 LSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKP-R 297

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
             YT +L N      Y V L  I +GR+ +  P    + +     G ++DSGT FT L A
Sbjct: 298 FVYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPA 357

Query: 324 PAYTAVRDVFRRRVG----SNLTVTSLGGFDTCY-----SVPIVAPTITLMFSGMNVTLP 374
             Y +V   F  RVG        +    G   CY      V + +  +  + +G +V LP
Sbjct: 358 SLYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLP 417

Query: 375 QDNLLIH--------STAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNS 423
           + N                 + CL +    D           + N QQQ   ++YD+ N 
Sbjct: 418 RRNYFYEFLDGGDGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENK 477

Query: 424 RLGVARELC 432
           R+G AR  C
Sbjct: 478 RVGFARRQC 486


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 114/415 (27%), Positives = 171/415 (41%), Gaps = 46/415 (11%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIAS--GRQITQ-SPTYIVRAKIGTP 106
           KPLS  E V+     DQ R   +S     R S V +    G  I   +  Y    ++GTP
Sbjct: 62  KPLSRIEDVI---GADQKRHSLISR---KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTP 115

Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCK--------- 155
           A+   + +DT ++  WV C           VF + +S +FK +GC    CK         
Sbjct: 116 AKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSL 175

Query: 156 -QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLA-----TDIVPGYTFGCIQKATGN 208
              P P+     C+++  Y   + A  + +++TI++         +PG+  GC    TG 
Sbjct: 176 TTCPTPST---PCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQ 232

Query: 209 SVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKR-IK 265
           S     G+LGL     S  +   +LY + FSYCL       + S  L  G     K   +
Sbjct: 233 SFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFR 292

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            T  L   R    Y +N++ I +G  ++DIP     ++ T+G GTI+DSGT  T L   A
Sbjct: 293 RTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQV--WDATSGGGTILDSGTSLTLLADAA 350

Query: 326 YTAVRDVFRRRVGSNLTVTSLG-------GFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
           Y  V     R +     V   G        F + ++V  + P +T    G     P    
Sbjct: 351 YKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKL-PQLTFHLKGGARFEPHRKS 409

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            +   A  + CL   +A        NVI N+ QQN+   +D+  S L  A   CT
Sbjct: 410 YLVDAAPGVKCLGFVSAG---TPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 104/384 (27%), Positives = 161/384 (41%), Gaps = 63/384 (16%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
           +P+ SGR       Y    K+G+P Q   + +DT ++  W+ C               S 
Sbjct: 100 MPMHSGRDDALGE-YFAEVKVGSPGQRFWLVVDTGSEFTWLNC---------------SK 143

Query: 143 TFKNLGCQAAQCKQ----------VPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLA- 190
           +F+ + C + +CK            P P+     C ++++Y   + A      D+I++  
Sbjct: 144 SFEAVTCASRKCKVDLSELFSLSVCPKPS---DPCLYDISYADGSSAKGFFGTDSITVGL 200

Query: 191 TDIVPG----YTFGCIQKATGNSV----PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL- 241
           T+   G     T GC  K+  N V       G+LGLG    S + +  N Y + FSYCL 
Sbjct: 201 TNGKQGKLNNLTIGCT-KSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLV 259

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL------YYVNLLAIRVGRRVVDI 295
                 S S +L +G     K      LL   RR+ L      Y VN++ I +G +++ I
Sbjct: 260 DHLSHRSVSSNLTIGGHHNAK------LLGEIRRTELILFPPFYGVNVVGISIGGQMLKI 313

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT--SLGGFDTCY 353
           PP    FN     GT+IDSGT  T L+ PAY AV +   + +     VT       + C+
Sbjct: 314 PPQVWDFNAE--GGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCF 371

Query: 354 SVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
                   V P +   F+G     P     I   A  + C+ +   P +     +VI N+
Sbjct: 372 DAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGI--VPIDGIGGASVIGNI 429

Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
            QQNH   +D+  + +G A   CT
Sbjct: 430 MQQNHLWEFDLSTNTVGFAPSTCT 453


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 114/415 (27%), Positives = 171/415 (41%), Gaps = 46/415 (11%)

Query: 50  KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIAS--GRQITQ-SPTYIVRAKIGTP 106
           KPLS  E V+     DQ R   +S     R S V +    G  I   +  Y    ++GTP
Sbjct: 40  KPLSRIEDVI---GADQKRHSLISR---KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTP 93

Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCK--------- 155
           A+   + +DT ++  WV C           VF + +S +FK +GC    CK         
Sbjct: 94  AKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSL 153

Query: 156 -QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLA-----TDIVPGYTFGCIQKATGN 208
              P P+     C+++  Y   + A  + +++TI++         +PG+  GC    TG 
Sbjct: 154 TTCPTPST---PCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQ 210

Query: 209 SVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKR-IK 265
           S     G+LGL     S  +   +LY + FSYCL       + S  L  G     K   +
Sbjct: 211 SFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFR 270

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            T  L   R    Y +N++ I +G  ++DIP     ++ T+G GTI+DSGT  T L   A
Sbjct: 271 RTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQV--WDATSGGGTILDSGTSLTLLADAA 328

Query: 326 YTAVRDVFRRRVGSNLTVTSLG-------GFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
           Y  V     R +     V   G        F + ++V  + P +T    G     P    
Sbjct: 329 YKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKL-PQLTFHLKGGARFEPHRKS 387

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            +   A  + CL   +A        NVI N+ QQN+   +D+  S L  A   CT
Sbjct: 388 YLVDAAPGVKCLGFVSAG---TPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 106/368 (28%), Positives = 161/368 (43%), Gaps = 48/368 (13%)

Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGCSS-TVFNSAQSTTFKNLGCQAAQCKQ------VP 158
           P Q + M +DT ++ +W+ C      +    F+  +S+++  + C +  C+       +P
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141

Query: 159 NPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQ--- 213
                   C   L+Y  +S+   NL+ +              FGC+   +G S P +   
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSG-SDPEEDTK 200

Query: 214 --GLLGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGSLRLGPIGQPKRIK 265
             GLLG+ RGSLS ++Q   +    FSYC+      P F  L  S    L P+     I+
Sbjct: 201 TTGLLGMNRGSLSFISQ---MGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIR 257

Query: 266 Y-TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
             TPL    R +  Y V L  I+V  +++ IP   L  + T    T++DSGT FT L+ P
Sbjct: 258 ISTPLPYFDRVA--YTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGP 315

Query: 325 AYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA---------PTITLMFSGM 369
            YTA+R  F  R    LTV         G  D CY +  V          PT++L+F G 
Sbjct: 316 VYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGA 375

Query: 370 NVTLPQDNLLI---HSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
            + +    LL    H T G  S+ C     + D +     VI +  QQN  I +D+  SR
Sbjct: 376 EIAVSGQPLLYRVPHLTVGNDSVYCFTFGNS-DLMGMEAYVIGHHHQQNMWIEFDLQRSR 434

Query: 425 LGVARELC 432
           +G+A   C
Sbjct: 435 IGLAPVEC 442


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 83/264 (31%), Positives = 131/264 (49%), Gaps = 28/264 (10%)

Query: 50  KPLSWEESVLEMLAKD-------QARL-QFLSSLAVARKSV-VPIASGRQITQSPTYIVR 100
           K ++W   +   L  D       Q RL + +SS +V    + +P+ASG    Q+  YIV 
Sbjct: 90  KKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQIPLASGVNF-QTLNYIVT 148

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQV 157
            ++G   Q + + +DT +D  WV C  C+ C +    VF  + S++++++ C ++ C+ +
Sbjct: 149 MELG--GQDMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSL 206

Query: 158 PNPTCGGGACAFN-------LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
              T   GAC  N       + YG  S     L  + +S     V  + FGC +   G  
Sbjct: 207 QLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLF 266

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IK 265
               GL+GLGR +LSL++QT + +   FSYCLP   A + SGSL +G      +    I 
Sbjct: 267 GGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGA-SGSLAMGNESSVFKNLTPIA 325

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVG 289
           YT ++ NP+ S+ Y +NL  I VG
Sbjct: 326 YTRMVPNPQLSNFYMLNLTGIDVG 349


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 102/399 (25%), Positives = 178/399 (44%), Gaps = 46/399 (11%)

Query: 61  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
           +L +D  RL+ L +L     S   +     +  +  Y  R  IG+P Q   + +DT +  
Sbjct: 54  VLDRDH-RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTV 112

Query: 121 AWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-S 176
            +VPC+ CV C +     F    S+T++ + C  A C    N    G  C +   Y   S
Sbjct: 113 TYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN-ADCNCDEN----GVQCTYERRYAEMS 167

Query: 177 TIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN--SVPPQGLLGLGRGSLSLLAQT-- 229
           T +  L++D +S    +++VP    FGC    +G+  +    G++GLGRG+LS++ Q   
Sbjct: 168 TSSGVLAEDVMSFGKESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVG 227

Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG 289
           + +  ++FS C          G++ LG I  P  + ++    +P RS  Y + L  I V 
Sbjct: 228 KGVVSNSFSLCYGGMDV--GGGAMVLGGISSPPGMVFSH--SDPSRSPYYNIELKEIHVA 283

Query: 290 RRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVT 344
            +        L+ NP T     G I+DSGT +      AY A +D   +++     ++  
Sbjct: 284 GK-------PLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGP 336

Query: 345 SLGGFDTCYS--------VPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAA 394
                D C+S        +P V P + ++F+ G  ++L P++ L  H+      CL +  
Sbjct: 337 DPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFK 396

Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              N N    ++  +  +N  + Y+  NS +G  +  C+
Sbjct: 397 ---NGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 113/444 (25%), Positives = 183/444 (41%), Gaps = 46/444 (10%)

Query: 6   VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
           +F L  + +F +S  +       D+  T+++ H  SP SP     PL   E+    +A  
Sbjct: 4   IFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMY--NPL---ENHYHRVADT 58

Query: 66  QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW--- 122
             R    ++  V      PI + R       Y+++  +GTP   ++   DT +D  W   
Sbjct: 59  LRRSISHNTGLVTNTVEAPIYNNRG-----EYLMKLSVGTPPFPIIAVADTGSDIIWTQC 113

Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV--PNPTCGGGACAFNLTYGSSTIA- 179
           VPCT C      +FN ++STT++ + C +  C      N       C ++++YG ++ + 
Sbjct: 114 VPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQ 173

Query: 180 ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY 233
            + + DT+++ +        P    GC     G+      G++GLG G  SL+ Q  +  
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAV 233

Query: 234 QSTFSYCLPSF-------KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
              FSYCL            L+F  +  +   G       TP+  + +  S Y + L A+
Sbjct: 234 GGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVS----TPIYISDKFKSFYSLKLKAV 289

Query: 287 RVGRRVVDIPPGALQFNPTTG--AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            VGR            N   G  A  IIDSGT  T L    Y          +    T  
Sbjct: 290 SVGRNNTFYSTA----NSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDD 345

Query: 345 SLGGFDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
                + C+         P I + F G N+ L ++N+LI   + ++ CLA A A DN   
Sbjct: 346 PNQFLEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIR-VSDNVICLAFAGAQDN--- 401

Query: 402 VLNVIANMQQQNHRILYDVPNSRL 425
            +++  N+ Q N  + YDV N  L
Sbjct: 402 DISIYGNIAQINFLVGYDVTNMSL 425


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 157/363 (43%), Gaps = 42/363 (11%)

Query: 95  PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQA 151
           P Y+    IGTP Q     +  + +  W  C+ C  C      +FN + S+T++   C  
Sbjct: 26  PLYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGT 85

Query: 152 AQCKQVPNPTCGG-GACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-----IQKA 205
           A C+ VP  TC G G C++ +       +     DT ++ T       FGC     I++ 
Sbjct: 86  ALCESVPASTCSGDGVCSYEVETMFGDTSGIGGTDTFAIGTATA-SLAFGCAMDSNIKQL 144

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPK 262
            G S    G++GLGR   SL+ Q   +  + FSYCL    A     +L LG    +   K
Sbjct: 145 LGAS----GVVGLGRTPWSLVGQ---MNATAFSYCLAPHGAAGKKSALLLGASAKLAGGK 197

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
               TPL+     SS Y ++L  I+ G  ++  PP         G+  ++D+    + LV
Sbjct: 198 SAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPP--------NGSVVLVDTIFGVSFLV 249

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-----------SVPIVAPTITLMFSG-MN 370
             A+ A++      VG+    T    FD C+           S+P+  P + L F G   
Sbjct: 250 DAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPL--PDVVLTFQGAAA 307

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           +T+P    +  +  G++    M++A  N+ + L+++  + Q+N   L+D+    L     
Sbjct: 308 LTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPA 367

Query: 431 LCT 433
            C+
Sbjct: 368 DCS 370


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 101/400 (25%), Positives = 178/400 (44%), Gaps = 48/400 (12%)

Query: 61  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
           +L +D  RL+ L +L     S   +     +  +  Y  R  IG+P Q   + +DT +  
Sbjct: 54  VLDRDH-RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTV 112

Query: 121 AWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQA-AQCKQVPNPTCGGGACAFNLTYGS- 175
            +VPC+ CV C +     F    S+T++ + C A   C +       G  C +   Y   
Sbjct: 113 TYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNADCNCDE------NGVQCTYERRYAEM 166

Query: 176 STIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN--SVPPQGLLGLGRGSLSLLAQT- 229
           ST +  L++D +S    +++VP    FGC    +G+  +    G++GLGRG+LS++ Q  
Sbjct: 167 STSSGVLAEDVMSFGKESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLV 226

Query: 230 -QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
            + +  ++FS C          G++ LG I  P  + ++    +P RS  Y + L  I V
Sbjct: 227 GKGVVSNSFSLCYGGMDV--GGGAMVLGGISSPPGMVFSH--SDPSRSPYYNIELKEIHV 282

Query: 289 GRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTV 343
             +        L+ NP T     G I+DSGT +      AY A +D   +++     ++ 
Sbjct: 283 AGK-------PLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISG 335

Query: 344 TSLGGFDTCYS--------VPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMA 393
                 D C+S        +P V P + ++F+ G  ++L P++ L  H+      CL + 
Sbjct: 336 PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIF 395

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
               N N    ++  +  +N  + Y+  NS +G  +  C+
Sbjct: 396 K---NGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 83/254 (32%), Positives = 122/254 (48%), Gaps = 30/254 (11%)

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-----KALSFSGSL 253
           FGC   + G+ +   G+LGL   SLSL+ Q   L    FSYCL  F       L F    
Sbjct: 129 FGCGALSAGSLIGATGILGLSPESLSLITQ---LKIQRFSYCLTPFADKKTSPLLFGAMA 185

Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
            L      + I+ T ++ NP  +  YYV L+ I +G + + +P  +L   P  G GTI+D
Sbjct: 186 DLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVD 245

Query: 314 SGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFDTCYSVP----------IVA 359
           SG+    LV  A+ AV+    DV R  V +N TV     ++ C+ +P          +  
Sbjct: 246 SGSTVAYLVEAAFEAVKEAVMDVVRLPV-ANRTVED---YELCFVLPRRTAAAAMEAVQV 301

Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
           P + L F  G  + LP+DN      AG + CLA+    D   S +++I N+QQQN  +L+
Sbjct: 302 PPLVLHFDGGAAMVLPRDNYFQEPRAG-LMCLAVGKTTD--GSGVSIIGNVQQQNMHVLF 358

Query: 419 DVPNSRLGVARELC 432
           DV + +   A   C
Sbjct: 359 DVQHHKFSFAPTQC 372


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 105/385 (27%), Positives = 151/385 (39%), Gaps = 58/385 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----------AQSTTFK 145
           Y +    GTP QT    MDT +   W PCT    CS   F +            QS++  
Sbjct: 92  YSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSN 151

Query: 146 NLGCQAAQCKQVPNP--------------TCGGGACAFNLTYGSSTIAANLSQDTISL-A 190
            +GC+  +C  +  P               C      + + YG  + A  L  +T+    
Sbjct: 152 LIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDFPH 211

Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALS 248
              +PG+  GC   +      P+G+ G GR   SL +Q   L    FSYCL S  F    
Sbjct: 212 KKTIPGFLVGCSLFSIRQ---PEGIAGFGRSPESLPSQ---LGLKKFSYCLVSHAFDDTP 265

Query: 249 FSGSLRLGPIGQPKRIK-----YTPLLKNPRRS--SLYYVNLLAIRVGRRVVDIPPGALQ 301
            S  L L         K     YTP  KNP  +    YYV L  I +G   V +P   L 
Sbjct: 266 ASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLV 325

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG---GFDTCYSV--- 355
                  GTI+DSGT FT +  P Y  V   F ++V      T +    G   C+++   
Sbjct: 326 PGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGE 385

Query: 356 -PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV------LNVIA 407
             +  P     F  G  + LP  N      +G I CL + +  DN++          ++ 
Sbjct: 386 KSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVI-CLTIVS--DNMSGSGIGGGPAIILG 442

Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
           N QQ+N  + +D+ N R G  ++ C
Sbjct: 443 NYQQRNFHVEFDLKNERFGFKQQNC 467


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 111/421 (26%), Positives = 183/421 (43%), Gaps = 53/421 (12%)

Query: 51  PLSWEESVLEMLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           P + E  + ++ A+D+AR  + L SL        P+           Y  + ++G+P + 
Sbjct: 36  PANHEMELSQLKARDKARHGRLLQSLGGVID--FPVDGTFDPFVVGLYYTKIRLGSPPRD 93

Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVP 158
             + +DT +D  WV C  C GC  T         F+   S T   + C   +C    Q  
Sbjct: 94  FYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVSCSDQRCSWGIQSS 153

Query: 159 NPTCG--GGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYT----FGCIQKATG 207
           +  C      CA+   YG     S    +++ Q  + + + +VP  T    FGC    TG
Sbjct: 154 DSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213

Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           + V       G+ G G+  +S+++Q  +Q L    FS+CL         G L LG I +P
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGENG--GGGILVLGEIVEP 271

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
             + +TPL+ +      Y VNLL+I V  + + I P    F+ + G GTIID+GT    L
Sbjct: 272 NMV-FTPLVPSQPH---YNVNLLSISVNGQALPINPSV--FSTSNGQGTIIDTGTTLAYL 325

Query: 322 VAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTITLMFSGMNVTL--P 374
              AY    +     V  ++  V S G  + CY    SV  + P ++L F+G       P
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKG--NQCYVIATSVADIFPPVSLNFAGGASMFLNP 383

Query: 375 QDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           QD L+  +  G  ++ C+         N  + ++ ++  ++   +YD+   R+G A   C
Sbjct: 384 QDYLIQQNNVGGTAVWCIGFQRIQ---NQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440

Query: 433 T 433
           +
Sbjct: 441 S 441


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 121/439 (27%), Positives = 183/439 (41%), Gaps = 89/439 (20%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL----AVARKSVVPIASGR 89
           LQ+ HV          + L+  E +  M  + +AR   L S        R +  P+  G 
Sbjct: 26  LQLSHV-------DAGRGLTHWELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGA 78

Query: 90  QITQSP--TYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQST 142
                P   Y+V    GTP Q + + +DT +D  W  C     + C   +  +F+ + S+
Sbjct: 79  YDDGFPFTEYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASS 138

Query: 143 TFKNLGCQAAQCKQVPNPTCGGG------ACAFNLTYGSSTIA-ANLSQDTISLATDI-- 193
           +F +L C +  C+    P CGGG       C ++++YG  +++   + ++  + A+    
Sbjct: 139 SFASLPCSSPACETT--PPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGE 196

Query: 194 -----VPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
                VPG  FGC     G     + G+ G GRGSLSL +Q   L    FS+C  +    
Sbjct: 197 GSSAAVPGLVFGCGHANRGVFTSNETGIAGFGRGSLSLPSQ---LKVGNFSHCFTTITGS 253

Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
             S  L LG           P +  P  S L          GRR      G+ +   T  
Sbjct: 254 KTSAVL-LG----------LPGVAPPSASPL----------GRRR-----GSYRCRSTPR 287

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSVPIVA-----P 360
           +    +SGT  T L    Y AVR+ F  +V   L V      D  TC+S P+       P
Sbjct: 288 SS---NSGTSITSLPPRTYRAVREEFAAQV--KLPVVPGNATDPFTCFSAPLRGPKPDVP 342

Query: 361 TITLMFSGMNVTLPQDNLLIH----STAGS---ITCLAMAAAPDNVNSVLNVIANMQQQN 413
           T+ L F G  + LPQ+N +        AG+   I CLA+      +     ++ N+QQQN
Sbjct: 343 TMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAV------IEGGEIILGNIQQQN 396

Query: 414 HRILYDVPNSRLGVARELC 432
             +LYD+ NS+L      C
Sbjct: 397 MHVLYDLQNSKLSFVPAQC 415


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 91/361 (25%), Positives = 163/361 (45%), Gaps = 40/361 (11%)

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQ 150
           Q+  Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C 
Sbjct: 78  QTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCG 136

Query: 151 AAQC-KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQ 203
            + C     +P C        C F ++Y   + +   L QDT++ +    +P +TFGC  
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNL 196

Query: 204 KATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLG 256
            + G +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG
Sbjct: 197 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLG 255

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            +     ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+
Sbjct: 256 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGS 310

Query: 317 VFT----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-S 367
             +    R ++     +R++  RR  +            CY +  V     P I+L F  
Sbjct: 311 ELSYIPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDD 365

Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           G    L    + +  +        +A AP      +++I ++ Q +  ++YD+    +G+
Sbjct: 366 GARFDLGSHGVFVERSVQEQDVWCLAFAP---TESVSIIGSLMQTSKEVVYDLKRQLIGI 422

Query: 428 A 428
            
Sbjct: 423 G 423


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 130/298 (43%), Gaps = 29/298 (9%)

Query: 149 CQAAQCKQVPNPTCGG------GACAFNLTYGSSTIAANLSQ-DTISL-ATDIVPGYTFG 200
           C +  C+ +   +CG         C +   Y   ++   L + D  +  A   VPG  FG
Sbjct: 38  CDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDKFTFGAGASVPGVAFG 97

Query: 201 CIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           C     G     + G+ G GRG LSL +Q   L    FS+C  +   L  S  L   P  
Sbjct: 98  CGLFNNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKQSTVLLDLPAD 154

Query: 260 QPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
             K     ++ TPL++N    + YY++L  I VG   + +P  A      TG GTIIDSG
Sbjct: 155 LYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFALTNGTG-GTIIDSG 213

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV 371
           T  T L    Y  VRD F  ++   +   +  G  TC+S P  A    P + L F G  +
Sbjct: 214 TSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATM 273

Query: 372 TLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
            LP++N +         SI CLA+     N      +I N QQQN  +LYD+ N   G
Sbjct: 274 DLPRENYVFEVPDDAGNSIICLAI-----NKGDETTIIGNFQQQNMHVLYDLQNMHRG 326


>gi|242086416|ref|XP_002443633.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
 gi|241944326|gb|EES17471.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
          Length = 503

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 116/441 (26%), Positives = 196/441 (44%), Gaps = 58/441 (13%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF--------LSSLAVARKSVVPI 85
           L + H  SPCSP      L+  + +    +  + R            SSLAV   +++P 
Sbjct: 79  LPIVHQQSPCSPLHGLPSLTAADGLHHDASLIRRRFSSKSSPVAPPASSLAV---TIIPT 135

Query: 86  ASGRQITQSPT---YIVRAKIGTPAQTLLMAMDTSN-DAAWVPCTGCVGCSST---VFNS 138
                 T+ P    Y V    GTP Q   + +DTS+   + + C  C   S      F++
Sbjct: 136 NGSSDPTRKPVTLQYSVLVSYGTPEQQFPVLLDTSSIGMSLLRCKPCASGSDDCHLAFDT 195

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYGSSTIAANLSQDTISLA--TD 192
           ++S+TF ++ C +  C   P    G G     C  + TY  S I    ++D ++LA  + 
Sbjct: 196 SRSSTFAHVLCGSPDC---PTNCSGDGDGDSFCPLDSTY--SIIDGAFAEDVLTLAPSSK 250

Query: 193 IVPGYTFGCIQ-KATGNSVPPQGLLGLGRG-SLSLLAQTQNLYQST--FSYCLPSFKALS 248
            +  + F C+      + +P  G L L R  +      + +  Q+T  FSYCLP  K+ S
Sbjct: 251 AIENFRFVCLDVDEPDDDLPVAGTLDLSRDRNSLPSQLSSSPGQATAAFSYCLP--KSPS 308

Query: 249 FSGSLRLG---PIGQPKRIKYTPLLKN---PRRSSLYYVNLLAIRVGRRVVDIPP-GALQ 301
             G L L     +   K   + PL+ N   P  +S+Y+++L+ + +G   + IPP G+  
Sbjct: 309 SQGYLSLAVDATVRHDKVTAHAPLVSNGGDPELASMYFIDLVGMSLGVDDIPIPPAGSFG 368

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSV----P 356
            N     G  +D GT FT+L    Y  +RD FR+++  +N ++    GFDTC+++     
Sbjct: 369 NN-----GVNLDLGTTFTKLTPEVYMTLRDSFRKQMSQNNHSLLGFDGFDTCFNLTGVRD 423

Query: 357 IVAPTITLMFS-GMNVTLPQDNLLIHSTAG----SITCLAMAAAPDNVNSVLNVIANMQQ 411
           +  P +   FS G  + +  D +L +        ++ CLA ++  D  +S   VI     
Sbjct: 424 LAMPLLWFKFSNGERLLIDLDQMLYYDDPAAAPFTMACLAFSSL-DAGDSFSAVIGTHTL 482

Query: 412 QNHRILYDVPNSRLGVARELC 432
            +  ++YDV   ++G     C
Sbjct: 483 ASTEVIYDVAGGKVGFIPRSC 503


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 110/329 (33%), Positives = 153/329 (46%), Gaps = 37/329 (11%)

Query: 112 MAMDTSNDAAWVPCTGCVGCSST------VFNSAQSTTFKNLGCQAAQCKQV---PNPTC 162
           M +DT +D +WV C  C    S       +F+ AQS+++  + C    C  +       C
Sbjct: 1   MEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASAC 60

Query: 163 GGGACAFNLTYG-SSTIAANLSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGR 220
               C + ++YG  S      S DT++L A+  V G+ FGC    +G      GLLGLGR
Sbjct: 61  SAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGR 120

Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSS 277
              SL+ QT   Y   FSYCLP+    S +G L L   GP G       T LL +P   +
Sbjct: 121 EQPSLVEQTAGTYGGVFSYCLPTKP--STAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPT 178

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
            Y V L  I VG + + +P  A          T++D+GTV TRL   AY A+R  FR  +
Sbjct: 179 YYVVMLTGISVGGQQLSVPASAFAGG------TVVDTGTVVTRLPPTAYAALRSAFRSGM 232

Query: 338 GSNL--TVTSLGGFDTCYSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCL 390
            S    T  S G  DTCY+      +  P + L F SG  VTL  D +L      S  CL
Sbjct: 233 ASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL------SFGCL 286

Query: 391 AMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           A   AP   +  + ++ N+QQ++  +  D
Sbjct: 287 AF--APSGSDGGMAILGNVQQRSFEVRID 313


>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
          Length = 485

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 106/372 (28%), Positives = 165/372 (44%), Gaps = 61/372 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
           Y     +GTPA + L+A+DT +D  WVPC  C+ C+              ++  A+STT 
Sbjct: 66  YYAWVDVGTPATSFLVALDTGSDLFWVPCD-CIQCAPLSGYRGNLDRDLRIYRPAESTTS 124

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISL--ATDIVP---GY 197
           ++L C    C+ VP  T     C +N+ Y S    ++  L +DT+ L    D VP     
Sbjct: 125 RHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 184

Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
             GC QK +G   + + P GLLGLG   +S+   LA+   L Q++FS C   FK  S SG
Sbjct: 185 IIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARA-GLVQNSFSMC---FKEDS-SG 239

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            +  G  G P + + TP +    +   Y VN+    +G + ++           T    +
Sbjct: 240 RIFFGDQGVPSQ-QSTPFVPLYGKLQTYAVNVDKSCIGHKCLE----------GTSFKAL 288

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
           +DSGT FT L    Y A    F +++ +         +  CYS     +P V PTITL F
Sbjct: 289 VDSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDV-PTITLTF 347

Query: 367 S------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           +       +N  LP      +   G++    +A  P      + +IA      + +++D 
Sbjct: 348 AADKSLQAVNPILP-----FNDKQGALAGFCLAVLPS--TEPIGIIAQNFLVGYHVVFDR 400

Query: 421 PNSRLGVARELC 432
            + +LG  R  C
Sbjct: 401 ESMKLGWYRSEC 412


>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
 gi|194704920|gb|ACF86544.1| unknown [Zea mays]
 gi|223949445|gb|ACN28806.1| unknown [Zea mays]
 gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
          Length = 515

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 106/372 (28%), Positives = 165/372 (44%), Gaps = 61/372 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
           Y     +GTPA + L+A+DT +D  WVPC  C+ C+              ++  A+STT 
Sbjct: 96  YYAWVDVGTPATSFLVALDTGSDLFWVPCD-CIQCAPLSGYRGNLDRDLRIYRPAESTTS 154

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISL--ATDIVP---GY 197
           ++L C    C+ VP  T     C +N+ Y S    ++  L +DT+ L    D VP     
Sbjct: 155 RHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214

Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
             GC QK +G   + + P GLLGLG   +S+   LA+   L Q++FS C   FK  S SG
Sbjct: 215 IIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARA-GLVQNSFSMC---FKEDS-SG 269

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            +  G  G P + + TP +    +   Y VN+    +G + ++           T    +
Sbjct: 270 RIFFGDQGVPSQ-QSTPFVPLYGKLQTYAVNVDKSCIGHKCLE----------GTSFKAL 318

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
           +DSGT FT L    Y A    F +++ +         +  CYS     +P V PTITL F
Sbjct: 319 VDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDV-PTITLTF 377

Query: 367 S------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           +       +N  LP      +   G++    +A  P      + +IA      + +++D 
Sbjct: 378 AADKSLQAVNPILP-----FNDKQGALAGFCLAVLPS--TEPIGIIAQNFLVGYHVVFDR 430

Query: 421 PNSRLGVARELC 432
            + +LG  R  C
Sbjct: 431 ESMKLGWYRSEC 442


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 158/375 (42%), Gaps = 36/375 (9%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
           +P++SG   + +  Y V+ ++GTP Q   +  DT +D  WV C G       VF    S 
Sbjct: 103 LPMSSG-AYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAG-ASPPGRVFRPKTSR 160

Query: 143 TFKNLGCQAAQCK-QVP----NPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPG- 196
           ++  + C +  CK  VP    N +     C ++  Y   +  A     T S AT  +PG 
Sbjct: 161 SWAPIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTES-ATIALPGG 219

Query: 197 -------YTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-L 247
                     GC     G S     G+L LG   +S   Q    +  +FSYCL    A  
Sbjct: 220 KVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPR 279

Query: 248 SFSGSLRLGPIGQPKRI--KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           + +G L  GP GQ  R     T L  +P     Y V + AI V  + +DIP  A  ++  
Sbjct: 280 NATGYLAFGP-GQVPRTPATQTKLFLDPEM-PFYGVKVDAIHVAGKALDIP--AEVWDAK 335

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------VPIV 358
           +G G I+DSG   T L APAY AV     + +   +   S   F+ CY+        P +
Sbjct: 336 SG-GVILDSGNTLTVLAAPAYKAVVAALSKHL-DGVPKVSFPPFEHCYNWTARRPGAPEI 393

Query: 359 APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
            P + + F+G     P     +      + C+ +    +     L+VI N+ QQ H   +
Sbjct: 394 IPKLAVQFAGSARLEPPAKSYVIDVKPGVKCIGVQ---EGEWPGLSVIGNIMQQEHLWEF 450

Query: 419 DVPNSRLGVARELCT 433
           D+ N ++   +  CT
Sbjct: 451 DLKNMQVRFKQSNCT 465


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 95/371 (25%), Positives = 151/371 (40%), Gaps = 45/371 (12%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCT---GCVGCSST--------VFNSAQSTTFKNLGCQA 151
            GTP Q L   +DT +D  W PCT    C  CS +        +F+   S++ K L C+ 
Sbjct: 84  FGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRN 143

Query: 152 AQC-------KQVPNPTCGGG------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
            +C         +  P C G       AC ++  YG+   +     + +      +  + 
Sbjct: 144 PKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYFLLENLKFPRKTIRNFL 203

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLG 256
            GC   A    +    L G GR   SL  Q   +    F+YCL S  +     SG L L 
Sbjct: 204 LGCTTSA-ARELSSDALAGFGRSMFSLPIQ---MGVKKFAYCLNSHDYDDTRNSGKLILD 259

Query: 257 -PIGQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
              G+ K + YTP LK+P  S+ YY + +  I++G +++ IP   L       +G IIDS
Sbjct: 260 YRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDS 319

Query: 315 GTVFT-RLVAPAYTAVRDVFRRRVGS---NLTVTSLGGFDTCYSVP-----IVAPTITLM 365
           G      +  P +  V +  ++++     +L   +  G   CY+        + P I   
Sbjct: 320 GYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQF 379

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN----VIANMQQQNHRILYDVP 421
             G N+ +P  N    S   S+ C  M     N   +      ++ N Q  ++ + YD+ 
Sbjct: 380 RGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLK 439

Query: 422 NSRLGVARELC 432
           N R G  R+ C
Sbjct: 440 NDRFGFRRQTC 450


>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
 gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 110/429 (25%), Positives = 163/429 (37%), Gaps = 87/429 (20%)

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVF 136
           R+  +P++ G   T S T          +Q + + +DT +D  W PC    C+ C     
Sbjct: 70  RQVSLPLSPGSDYTLSFT--------LDSQPIFLYLDTGSDLVWFPCQPFECILCEGKAE 121

Query: 137 NSAQSTT-------------FKNLGCQAAQC---------------KQVPNPTCGGGAC- 167
           N++ ++T              K+  C AA                 + +    C   +C 
Sbjct: 122 NTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIETSDCQKHSCP 181

Query: 168 AFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
            F   YG  ++ A L +D+ISL        IV  +TFGC   A      P G+ G GRG 
Sbjct: 182 QFYYAYGDGSLIARLYRDSISLPLSNPTNLIVNNFTFGCAHTALAE---PIGVAGFGRGV 238

Query: 223 LSLLAQTQNL---YQSTFSYCLPSFKALSF----------------SGSLRLGPIGQPKR 263
           LSL AQ   L     + FSYCL S    S                     R+  + +P R
Sbjct: 239 LSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKP-R 297

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
             YT +L N      Y V L  I +GR+ +  P    + +     G ++DSGT FT L A
Sbjct: 298 FVYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPA 357

Query: 324 PAYTAVRDVFRRRVG----SNLTVTSLGGFDTCY-----SVPIVAPTITLMFSGMNVTLP 374
             Y +V   F  RVG        +    G   CY      V + +  +  + +G +V LP
Sbjct: 358 SLYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLP 417

Query: 375 QDNLLIH--------STAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNS 423
           + N                 + CL +    +           + N QQQ   ++YD+ N 
Sbjct: 418 RRNYFYEFLDGGDGKGKKRKVGCLMLMNGGEEAELSGGPGATLGNYQQQGFEVVYDLENK 477

Query: 424 RLGVARELC 432
           R+G AR  C
Sbjct: 478 RVGFARRQC 486


>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
          Length = 469

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/372 (28%), Positives = 166/372 (44%), Gaps = 61/372 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
           Y     +GTPA + L+A+DT +D  WVPC  C+ C+              ++  A+STT 
Sbjct: 96  YYAWVDVGTPATSFLVALDTGSDLFWVPCD-CIQCAPLSGYRGNLDRDLRIYRPAESTTS 154

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS--STIAANLSQDTISL--ATDIVP---GY 197
           ++L C    C+ VP  T     C +N+ Y S  +T +  L +DT+ L    D VP     
Sbjct: 155 RHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214

Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
             GC QK +G   + + P GLLGLG   +S+   LA+   L Q++FS C   FK  S SG
Sbjct: 215 IIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARA-GLVQNSFSMC---FKEDS-SG 269

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            +  G  G P + + TP +    +   Y VN+    +G + ++           T    +
Sbjct: 270 RIFFGDQGVPSQ-QSTPFVPLYGKLQTYAVNVDKSCIGHKCLE----------GTSFKAL 318

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
           +DSGT FT L    Y A    F +++ +         +  CYS     +P V PTITL F
Sbjct: 319 VDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDV-PTITLTF 377

Query: 367 S------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           +       +N  LP      +   G++    +A  P      + +IA      + +++D 
Sbjct: 378 AADKSLQAVNPILP-----FNDKQGALAGFCLAVLPS--TEPIGIIAQNFLVGYHVVFDR 430

Query: 421 PNSRLGVARELC 432
            + +LG  R  C
Sbjct: 431 ESMKLGWYRSEC 442


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 94/304 (30%), Positives = 135/304 (44%), Gaps = 28/304 (9%)

Query: 149 CQAAQCKQVPNPTCGG------GACAFNLTYGSSTIAANLSQ-DTISL-ATDIVPGYTFG 200
           C +  C+ +   +CG         C +   Y   ++   L + D  +  A   VPG  FG
Sbjct: 190 CDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDKFTFGAGASVPGVAFG 249

Query: 201 CIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLR--LGP 257
           C     G     + G+ G GRG LSL +Q   L    FS+C  +   L  S  L   L  
Sbjct: 250 CGLFNNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKQSTVLLDLLAD 306

Query: 258 IGQPKR--IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           + +  R  ++ TPL++N    +LYY++L  I VG   + +P  A      TG GTIIDSG
Sbjct: 307 LYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALTNGTG-GTIIDSG 365

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV 371
           T  T L    Y  VRD F  ++   +   +  G  TC+S P  A    P + L F G  +
Sbjct: 366 TSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATM 425

Query: 372 TLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            LP++N +         S+ CLA+    D   +    I N QQQN  +LYD+ N+ L   
Sbjct: 426 DLPRENYVFEVPDDAGNSMICLAINELGDERAT----IGNFQQQNMHVLYDLQNNMLSFV 481

Query: 429 RELC 432
              C
Sbjct: 482 AAQC 485



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 48/140 (34%), Positives = 62/140 (44%), Gaps = 13/140 (9%)

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            I VG   + +P  A      TG GTIIDSGT  T L    Y  VRD F  ++   +   
Sbjct: 41  GITVGSTRLPVPESAFALTNGTG-GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPG 99

Query: 345 SLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPD 397
           +  G  TC+S P  A    P + L F G  + LP++N +         SI CLA+     
Sbjct: 100 NATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAI----- 154

Query: 398 NVNSVLNVIANMQQQNHRIL 417
           N      +I N QQQN   L
Sbjct: 155 NKGDETTIIGNFQQQNMHAL 174


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/404 (25%), Positives = 162/404 (40%), Gaps = 60/404 (14%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-------- 134
           +P++SG   T +  Y VR ++GTPAQ  ++  DT +D  WV C G    S          
Sbjct: 97  MPLSSG-AYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAA 155

Query: 135 ----------VFNSAQSTTFKNLGCQAAQCK-----QVPNPTCGGGACAFNLTYGSSTIA 179
                     VF    S T+  + C +  CK      + N +    AC+++  Y  ++ A
Sbjct: 156 APSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAA 215

Query: 180 AN-LSQDTISLATDI-------------VPGYTFGCIQKATGNSVPP-QGLLGLGRGSLS 224
              +  D+ ++A                + G   GC     G       G+L LG  ++S
Sbjct: 216 RGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNIS 275

Query: 225 LLAQTQNLYQSTFSYCLPSFKA-------LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSS 277
             ++  + +   FSYCL    A       L+F           P     TPLL + R   
Sbjct: 276 FASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRP 335

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
            Y V + ++ V    +DIP  A  ++  +  GTIIDSGT  T L  PAY AV      ++
Sbjct: 336 FYAVAVDSVSVDGVALDIP--AEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQL 393

Query: 338 GSNLTVTSLGGFDTCYSV--------PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITC 389
            + L   ++  FD CY+          +  P + + F+G     P     +   A  + C
Sbjct: 394 -AGLPRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKC 452

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           + +    +     ++VI N+ QQ H   +D+ N  L   +  CT
Sbjct: 453 IGVQ---EGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCT 493


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/367 (29%), Positives = 162/367 (44%), Gaps = 50/367 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           ++V   IG+P  T L+ MDT++D  W+ C  C+ C   S  +F+ ++S T +N  C+ +Q
Sbjct: 85  FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144

Query: 154 CKQVPNPTCGGG--ACAFNLTY----GSSTIAA------NLSQDTISLAT--DIVPGYTF 199
              +P+        +C +++ Y    GS  I A      N   D  S A   D+V    F
Sbjct: 145 -YSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVV----F 199

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF-SGSLRLGPI 258
           GC     G  +   G+LGLG G  SL+ +    + + FSYC  S    S+    L LG  
Sbjct: 200 GCGHDNYGEPLVGTGILGLGYGEFSLVHR----FGTKFSYCFGSLDDPSYPHNVLVLGDD 255

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTV 317
           G       TPL      +  YYV + AI V   ++ I P     N  TG  GTIID+G  
Sbjct: 256 GANILGDTTPL---EIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNS 312

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD----TCYS-------VPIVAPTITLMF 366
            T LV  AY  +++          T   +   D     CY+       V    P +T  F
Sbjct: 313 LTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHF 372

Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           S G  ++L   ++ +   + ++ CLA+   P N+NS    I    QQ++ I YD+   ++
Sbjct: 373 SDGAELSLDVKSVFM-KLSPNVFCLAV--TPGNMNS----IGATAQQSYNIGYDLEAKKI 425

Query: 426 GVARELC 432
              R  C
Sbjct: 426 SFERIDC 432


>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
 gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
          Length = 389

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 93/357 (26%), Positives = 168/357 (47%), Gaps = 30/357 (8%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTG--CVGCSS-TVFNSAQSTTFKNLGCQAAQCKQVP- 158
           +GTP Q L   +   +  +WV C+    + C++ ++F    ST+   L C +  C     
Sbjct: 5   LGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCSAFSA 64

Query: 159 -NPTCG-GGACAFNLTYGSS-TIAANLSQDTISLAT----DIVPGYTFGCIQKATG--NS 209
            + +CG   +C++N +YG++ + A +L  D  ++ +     +    + GC + + G    
Sbjct: 65  VSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCGRDSGGLLEL 124

Query: 210 VPPQGLLGLGRGSLSLLAQTQNL-YQSTFSYCLPS--FKALSFSGSLRLGPIGQPKRIKY 266
           +   G +G  +G++S + Q   L Y+S F YCLPS  F+     G+ +L        + Y
Sbjct: 125 LDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDTFRGKLVIGNYKLRNASISSSMAY 184

Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
           TP++ NP+ + LY++NL  I + +    +P      N T   GT+ID+ T  + L +  Y
Sbjct: 185 TPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGT--GGTVIDTTTFLSYLTSDFY 242

Query: 327 T----AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAP-----TITLMF-SGMNVTLPQD 376
           T    A+++     V  + +V    G + CY++   +      T+T  F  G  V +   
Sbjct: 243 TQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPATLTYHFLGGAGVEVSTW 302

Query: 377 NLLIHS-TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            LL  S +  +  C+A+  + ++V   LNVI   QQ +  + YD+   R G   + C
Sbjct: 303 FLLDDSDSVNNTICMAIGRS-ESVGPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGC 358


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 102/383 (26%), Positives = 163/383 (42%), Gaps = 40/383 (10%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-------- 134
           +P+ SG   T +  Y VR ++GTPAQ  ++  DT +D  WV C+     SS+        
Sbjct: 91  MPLTSG-AYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQR 149

Query: 135 VFNSAQSTTFKNLGCQAAQCKQ-VP----NPTCGGGACAFNLTYGSSTIA---ANLSQDT 186
           VF  A S ++  L C +  CK  VP    N +     C+++  Y  ++ A     L   T
Sbjct: 150 VFRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSAT 209

Query: 187 ISLATD------IVPGYTFGCIQKATGNSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
           +SL+ +       +     GC     G S     G+L LG  ++S  ++  + +   FSY
Sbjct: 210 VSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSY 269

Query: 240 CLPSFKA-------LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
           CL    A       L+F             R     LL++ R    Y+V++ A+ V    
Sbjct: 270 CLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGER 329

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
           ++I P    F    GA  I+DSGT  T L  PAY AV     ++  + +   ++  F+ C
Sbjct: 330 LEILPDVWDFRKNGGA--ILDSGTSLTILATPAYDAVVKAISKQF-AGVPRVNMDPFEYC 386

Query: 353 YS---VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
           Y+   V    P + L F+G     P     +  TA  + C+ +    +     ++VI N+
Sbjct: 387 YNWTGVSAEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVV---EGAWPGVSVIGNI 443

Query: 410 QQQNHRILYDVPNSRLGVARELC 432
            QQ H   +D+ N  L   +  C
Sbjct: 444 LQQEHLWEFDLANRWLRFKQSRC 466


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 112/444 (25%), Positives = 182/444 (40%), Gaps = 46/444 (10%)

Query: 6   VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
           +F L  + +F +S  +       D+  T+++ H  SP SP     PL   E+    +A  
Sbjct: 4   IFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMY--NPL---ENHYHRVADT 58

Query: 66  QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV-- 123
             R    ++  V      PI + R       Y+++  +GTP   ++   DT +D  W   
Sbjct: 59  LRRSISHNTGLVTNTVEAPIYNNRG-----EYLMKLSVGTPPFPIIAVADTGSDIIWTQC 113

Query: 124 -PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK--QVPNPTCGGGACAFNLTYGSSTIA- 179
            PCT C      +FN ++STT++ + C +  C      N       C ++++YG ++ + 
Sbjct: 114 EPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQ 173

Query: 180 ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY 233
            + + DT+++ +        P    GC     G+      G++GLG G  SL+ Q  +  
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAV 233

Query: 234 QSTFSYCLPSF-------KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
              FSYCL            L+F  +  +   G       TP+  + +  S Y + L A+
Sbjct: 234 GGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVS----TPIYISDKFKSFYSLKLKAV 289

Query: 287 RVGRRVVDIPPGALQFNPTTG--AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            VGR            N   G  A  IIDSGT  T L    Y          +    T  
Sbjct: 290 SVGRNNTFYSTA----NSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDD 345

Query: 345 SLGGFDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
                + C+         P I + F G N+ L ++N+LI   + ++ CLA A A DN   
Sbjct: 346 PNQFLEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIR-VSDNVICLAFAGAQDN--- 401

Query: 402 VLNVIANMQQQNHRILYDVPNSRL 425
            +++  N+ Q N  + YDV N  L
Sbjct: 402 DISIYGNIAQINFLVGYDVTNMSL 425


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 139/312 (44%), Gaps = 38/312 (12%)

Query: 149 CQAAQCKQVPNPTCGG-GACAFNLTYGSSTIAANL-SQDTISLATDI--------VPGYT 198
           C    C  + + +C     C +   YG  T+   + + +  + A+          VP   
Sbjct: 3   CAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP-LG 61

Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS----GSLR 254
           FGC     G+     G++G GR  LSL++Q   L    FSYCL S+ +   S    GSL 
Sbjct: 62  FGCGSVNVGSLNNGSGIVGFGRNPLSLVSQ---LSIRRFSYCLTSYASRRQSTLLFGSLS 118

Query: 255 LGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
            G  G    R++ TPLL++P+  + YYV+   + VG R + IP  A    P    G I+D
Sbjct: 119 DGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVD 178

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSVPIV-----------AP 360
           SGT  T L A     V   FR+++   L   + G  +   C+ VP              P
Sbjct: 179 SGTALTLLPAAVLAEVVRAFRQQL--RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVP 236

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            + L F G ++ LP+ N ++        CL +A + D+ ++    I N+ QQ+ R+LYD+
Sbjct: 237 RMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGST----IGNLVQQDMRVLYDL 292

Query: 421 PNSRLGVARELC 432
               L +A   C
Sbjct: 293 EAETLSIAPARC 304


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 166/356 (46%), Gaps = 34/356 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--FNSAQSTTFKNLGCQAAQC 154
           ++    IG P    L+ +DT +D  W+ C  C     T+  F+ ++S+T++N  C++A  
Sbjct: 88  FLANISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYPQTIPFFHPSRSSTYRNASCESAP- 146

Query: 155 KQVPN--PTCGGGACAFNLTYGS-STIAANLSQDTISLATDIV-----PGYTFGCIQKAT 206
             +P        G C ++L Y   S     L+++ ++  T        P   FGC Q  +
Sbjct: 147 HAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNS 206

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY 266
           G +    G+LGLG G+ S++ +    + S FSYC  S    ++  +  +  +G   RI+ 
Sbjct: 207 GFT-QYSGVLGLGPGTFSIVTRN---FGSKFSYCFGSLIDPTYPHNFLI--LGNGARIEG 260

Query: 267 --TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
             TPL     R   YY++L AI +G +++DI PG  Q   + G GT+ID+G   T L   
Sbjct: 261 DPTPLQIFQDR---YYLDLQAISLGEKLLDIEPGIFQRYRSKG-GTVIDTGCSPTILARE 316

Query: 325 AYTAVRDVFRRRVGSNL-TVTSLGGF-DTCYSVPIVA-----PTITLMFS-GMNVTLPQD 376
           AY  + +     +G  L  V     + + CY   +       P +T  F+ G  + L  +
Sbjct: 317 AYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAELALDVE 376

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +L + S +G   CLAM     N    ++VI  M QQN+ + Y++   ++   R  C
Sbjct: 377 SLFVSSESGDSFCLAMTM---NTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429


>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
          Length = 445

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/334 (29%), Positives = 144/334 (43%), Gaps = 43/334 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTVFNSAQSTTF--------K 145
           Y V    GTP+QTL   MDT +   W PCT    C  CS    + A+  TF        K
Sbjct: 106 YSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAK 165

Query: 146 NLGCQAAQCKQVPN----PTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC 201
            +GC   +C  V +      C      + + YG  T    L  +++  A    P +  GC
Sbjct: 166 IVGCLNPKCGFVMDSENSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGC 225

Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK----ALSFSGSLRLGP 257
              +  +S  P G+ G GRG  SL  Q   +    FSYCL S +      S   +L +GP
Sbjct: 226 ---SILSSRQPSGIAGFGRGPSSLPKQ---MGLKKFSYCLLSHRFDDSPKSSKMTLYVGP 279

Query: 258 IGQPKR---IKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
             +  +   + YTP  KNP  S+      YYV L  I VG + V +P   +        G
Sbjct: 280 DSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGG 339

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV----PIVAPT 361
           TI+DSG+ FT +  P + AV   F R++ +N T    V +L G   C+++     +  P+
Sbjct: 340 TIVDSGSTFTFMEKPVFEAVATEFDRQM-ANYTRAADVEALSGLKPCFNLSGVGSVALPS 398

Query: 362 ITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAA 394
           +   F  G  + LP  N        S+ CL + +
Sbjct: 399 LVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVS 432


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 111/438 (25%), Positives = 167/438 (38%), Gaps = 67/438 (15%)

Query: 57  SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP----------TYIVRAKIGTP 106
           S+ ++   D+ R+ F++S    R       S     + P           Y VR ++GTP
Sbjct: 44  SLADLARSDRQRMAFIASHGRRRARETAAGSSAAAFEMPLTSGAYTGIGQYFVRFRVGTP 103

Query: 107 AQTLLMAMDTSNDAAWVPC-------TGCVGCSSTVFNSAQSTTFKNLGCQAAQC-KQVP 158
           AQ  L+  DT +D  WV C       +     S   F    S T+  + C +  C K +P
Sbjct: 104 AQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTCTKSLP 163

Query: 159 N--PTC--GGGACAFNLTYGSSTIA---ANLSQDTISLA-------TDIVPGYTFGCIQK 204
               TC   G  CA++  Y   + A         TI+L+          + G   GC   
Sbjct: 164 FSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLKGLVLGCTSS 223

Query: 205 ATGNSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP----- 257
            TG S     G+L LG   +S  +   + +   FSYCL       + +  L  GP     
Sbjct: 224 YTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSYLTFGPNPAVA 283

Query: 258 -----------------IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
                                 R + TPLL + R    Y V + A+ V  + + IP    
Sbjct: 284 SSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVAGQFLKIPRAVW 343

Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-----SV 355
             +   G G I+DSGT  T L  PAY AV       + + L   ++  F+ CY     S 
Sbjct: 344 DVD--AGGGVILDSGTSLTVLAKPAYRAVVAALSEGL-AGLPRVTMDPFEYCYNWTSPSG 400

Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
            +  P + + F+G     P     +   A  + C+ +   P      ++VI N+ QQ H 
Sbjct: 401 DVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGP---WPGISVIGNILQQEHL 457

Query: 416 ILYDVPNSRLGVARELCT 433
             +D+ N RL   R  CT
Sbjct: 458 WEFDIKNRRLKFQRSRCT 475


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/353 (29%), Positives = 154/353 (43%), Gaps = 47/353 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-----CVGCSSTVFNSAQSTTFKNLGCQA 151
           Y +   +GTP Q L    DT +D  W  C G     C    S  +    S+TF  L C  
Sbjct: 91  YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150

Query: 152 AQCKQVPNPT-----CGGGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYTFGC 201
             C  + + +       G  C +  +YG            L+++T +L  D VP   FGC
Sbjct: 151 RLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGADAVPSVRFGC 210

Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ- 260
              + G      GL+GLGRG LSL++Q   L  STF YCL S    S +  L  G +   
Sbjct: 211 TTASEGGYGSGSGLVGLGRGPLSLVSQ---LNASTFMYCLTS--DASKASPLLFGSLASL 265

Query: 261 -PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
              +++ T LL +   ++ Y VNL +I +G       PG  +       G + DSGT  T
Sbjct: 266 TGAQVQSTGLLAS---TTFYAVNLRSISIGSATT---PGVGE-----PEGVVFDSGTTLT 314

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITLMFSGMNVT 372
            L  PAY+  +  F  +   +  V    GF+ C+  P          PT+ L F G ++ 
Sbjct: 315 YLAEPAYSEAKAAFLSQTSLD-QVEDTDGFEACFQKPANGRLSNAAVPTMVLHFDGADMA 373

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           LP  N ++    G + C  +  +P      L++I N+ Q N+ +L+DV  S L
Sbjct: 374 LPVANYVVEVEDG-VVCWIVQRSPS-----LSIIGNIMQVNYLVLHDVHRSVL 420


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 119/438 (27%), Positives = 184/438 (42%), Gaps = 69/438 (15%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLA 63
           L+FF  F F+ SLS  LN       +  TL++ H  S  SPF +P++  +  E +   + 
Sbjct: 9   LLFFTIFCFIISLSHALN-------NGFTLELIHRDSSKSPFYQPTQ--NKYERIANAVR 59

Query: 64  KDQARLQ--FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
           +   R+   +  SL    +S V    G        Y++   IGTP   +   +DT +D  
Sbjct: 60  RSINRVNHFYKYSLTSTPQSTVNSDKGE-------YLMSYSIGTPPFKVFGFVDTGSDLV 112

Query: 122 WVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI 178
           W+ C  C  C    + +F+ + S++++N+ C +  C  +   +C               +
Sbjct: 113 WLQCEPCKQCYPQITPIFDPSLSSSYQNIPCLSDTCHSMRTTSC--------------DV 158

Query: 179 AANLSQDTISLATDIVPGYT-------FGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQ 230
              LS +T++L  D   GY+        GC  + TG    P  G++GLG G +SL +Q  
Sbjct: 159 RGYLSVETLTL--DSTTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLG 216

Query: 231 NLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
                 FSYCL  +   S S  L  G   I        TP++K   +S  YY+ L A  V
Sbjct: 217 TSIGGKFSYCLGPWLPNSTS-KLNFGDAAIVYGDGAMTTPIVKKDAQSG-YYLTLEAFSV 274

Query: 289 GRRVVDIPPGALQFNPTTGAGT---IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
           G ++++         PT G      +IDSGT FT L    Y          +        
Sbjct: 275 GNKLIEFG------GPTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDP 328

Query: 346 LGGFDTCYSVP---IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV 402
            G F  CY+V      AP IT  F G ++ L   +  I  + G I CLA       + S 
Sbjct: 329 NGTFKLCYNVAYHGFEAPLITAHFKGADIKLYYISTFIKVSDG-IACLAF------IPSQ 381

Query: 403 LNVIANMQQQNHRILYDV 420
             +  N+ QQN  + Y++
Sbjct: 382 TAIFGNVAQQNLLVGYNL 399


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 105/368 (28%), Positives = 161/368 (43%), Gaps = 48/368 (13%)

Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGCSS-TVFNSAQSTTFKNLGCQAAQCKQ------VP 158
           P Q + M +DT ++ +W+ C      +    F+  +S+++  + C +  C+       +P
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141

Query: 159 NPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVP-GYTFGCIQKATGNSVPPQ--- 213
                   C   L+Y  +S+   NL+ +              FGC+   +G S P +   
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSG-SDPEEDTK 200

Query: 214 --GLLGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGSLRLGPIGQPKRIK 265
             GLLG+ RGSLS ++Q   +    FSYC+      P F  L  S    L P+     I+
Sbjct: 201 TTGLLGMNRGSLSFISQ---MGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIR 257

Query: 266 Y-TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
             TPL    R +  Y V L  I+V  +++ IP   L  + T    T++DSGT FT L+ P
Sbjct: 258 ISTPLPYFDRVA--YTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFTFLLGP 315

Query: 325 AYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA---------PTITLMFSGM 369
            YTA+R  F  +    LTV         G  D CY +             PT++L+F G 
Sbjct: 316 VYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVSLVFEGA 375

Query: 370 NVTLPQDNLLI---HSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
            + +    LL    H TAG  S+ C     + D +     VI +  QQN  I +D+  SR
Sbjct: 376 EIAVSGQPLLYRVPHLTAGNDSVYCFTFGNS-DLMGMEAYVIGHHHQQNMWIEFDLQRSR 434

Query: 425 LGVARELC 432
           +G+A   C
Sbjct: 435 IGLAPVQC 442


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 97/289 (33%), Positives = 129/289 (44%), Gaps = 28/289 (9%)

Query: 167 CAFNLTYGSS----------TIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLL 216
           C +   YG S          T   NL+  +       V    FGC     G      GLL
Sbjct: 74  CPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLL 133

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL- 270
           GLGRG LS  +Q Q+LY  +FSYCL    +  + S  L  G     +  P+ + +T L+ 
Sbjct: 134 GLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPE-LNFTTLVA 192

Query: 271 --KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
             +NP   + YYV + +I VG  VV+IP    Q       GTIIDSGT  +    PAY  
Sbjct: 193 GKENPV-DTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQV 251

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHST 383
           +++ F  +V     V      + CY+V  V     P   ++FS G     P +N  I   
Sbjct: 252 IKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIE 311

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              + CLA+   P    S L++I N QQQN  ILYD   SRLG A   C
Sbjct: 312 PREVVCLAILGTPP---SALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357


>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
 gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
          Length = 467

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 113/425 (26%), Positives = 174/425 (40%), Gaps = 72/425 (16%)

Query: 75  LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS- 133
           L VA  +  P A+  +     +  V   +G P Q + M +DT ++ +W+ C G    S+ 
Sbjct: 37  LVVAPPTRSPAANRLRFRHDVSLTVPVAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTP 96

Query: 134 ------TVFNSAQSTTFKNLGCQAA-QCKQ------VPNPTCGG---GACAFNLTYGSST 177
                   FN + S+T+    C ++ +C+       VP P C G    +C  +L+Y  ++
Sbjct: 97  PQPQAPAAFNGSASSTYAAAHCSSSPECQWRGRDLPVP-PFCAGPPSNSCRVSLSYADAS 155

Query: 178 IAAN-LSQDTISLATDIVPGYTFGCI-----------------QKATGNSVPPQGLLGLG 219
            A   L+ DT  L         FGCI                   AT +S    GLLG+ 
Sbjct: 156 SADGVLAADTFLLGGAPPVRALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMN 215

Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKA---LSFSGSLRLGPIGQPKRIKYTPLLKNPR-- 274
           RGSLS + QT  L    F+YC+        L   G      +    ++ YTPL++  +  
Sbjct: 216 RGSLSFVTQTGTL---RFAYCIAPGDGPGLLVLGGDGDGAALSAAPQLNYTPLIEMSQPL 272

Query: 275 ---RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
                  Y V L  IRVG  ++ IP   L  + T    T++DSGT FT L+A AY  ++ 
Sbjct: 273 PYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKG 332

Query: 332 VFRRRVGSNLT------VTSLGGFDTCY----------SVPIVAPTITLMFSGMNVTLPQ 375
            F  +  + L           G FD C+          +   + P + L+  G  V +  
Sbjct: 333 EFLNQTSALLAPLGEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLRGAEVAVGG 392

Query: 376 DNLLI--------HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
           + LL            + ++ CL    + D       VI +  QQN  + YD+ NSR+G 
Sbjct: 393 EKLLYMVPGERRGEGGSEAVWCLTFGNS-DMAGMSAYVIGHHHQQNVWVEYDLQNSRVGF 451

Query: 428 ARELC 432
           A   C
Sbjct: 452 APARC 456


>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 627

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 164/368 (44%), Gaps = 54/368 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTF 144
           Y     +GTP  + ++A+DT +D  W+PC  C+ C+              ++  A+STT 
Sbjct: 208 YYTWVDVGTPNTSFMVALDTGSDLFWIPCD-CIECAPLSGYHGSLDRDLGIYKPAESTTS 266

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTY--GSSTIAANLSQDTISLATD-----IVPGY 197
           ++L C    C    + T     C +N  Y   ++T +  L +D + L +      +    
Sbjct: 267 RHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAPVKASV 326

Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
             GC +K +G   + + P GLLGLG   +S+   LA+   L +++FS C         SG
Sbjct: 327 IIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARA-GLVRNSFSMCF-----TKDSG 380

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            +  G  G   + + TP +    +   Y VN+    VG +  +          +T    I
Sbjct: 381 RIFFGDQGVSTQ-QSTPFVPLYGKLQTYTVNVDKSCVGHKCFE----------STSFQAI 429

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-PIV---APTITLMFS 367
           +DSGT FT L    Y AV   F ++V ++        FD CYS  P+V    PT+TL F+
Sbjct: 430 VDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATSFDYCYSASPLVMPDVPTVTLTFA 489

Query: 368 GMNVTLPQD-NLLIHSTAGSIT--CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           G     P +   L+H   G++   CLA+  +P+ +     +IA      + +++D  N +
Sbjct: 490 GNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPI----GIIAQNFLLGYHVVFDRENMK 545

Query: 425 LGVARELC 432
           LG  R  C
Sbjct: 546 LGWYRSEC 553


>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 527

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 103/372 (27%), Positives = 153/372 (41%), Gaps = 59/372 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC------------SSTVFNSAQSTTF 144
           +     +GTPA + L+A+DT +D  W+PC  C  C            +  ++++ +S+T 
Sbjct: 113 HFANVSVGTPASSYLVALDTGSDLFWLPCN-CTKCVHGIQLSTGQKIAFNIYDNKESSTS 171

Query: 145 KNLGCQAAQCKQVPN-PTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD-------IV 194
           KN+ C ++ C+Q     +  GG C + + Y S   +    L +D + L TD         
Sbjct: 172 KNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQTQHAN 231

Query: 195 PGYTFGCIQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSF 249
           P  TFGC Q  TG   +   P GL GLG   +S+  +   Q L  ++FS C     A   
Sbjct: 232 PLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMCF----AADG 287

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
            G +  G          TP    P  S+ Y + +  I VG    D     L+FN      
Sbjct: 288 LGRITFGDNNSSLDQGKTPFNIRPSHST-YNITVTQIIVGGNSAD-----LEFN------ 335

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYSV----PIVAP 360
            I D+GT FT L  PAY  +   F  ++   L   S        F+ CY +     I  P
Sbjct: 336 AIFDTGTSFTYLNNPAYKQITQSFDSKI--KLQRHSFSNSDDLPFEYCYDLRTNQTIEVP 393

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            I L   G +     D  +I S  G+   L +A    N    +N+I       +RI++D 
Sbjct: 394 NINLTMKGGDNYFVMD-PIITSGGGNNGVLCLAVLKSN---NVNIIGQNFMTGYRIVFDR 449

Query: 421 PNSRLGVARELC 432
            N  LG     C
Sbjct: 450 ENMTLGWKESNC 461


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 165/368 (44%), Gaps = 55/368 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F+   S+T+K + C    
Sbjct: 83  YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC---- 138

Query: 154 CKQVPNPTC----GGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKA 205
                N  C     G  C +   Y   ST +  L +D IS    ++++P    FGC    
Sbjct: 139 -----NIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193

Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQ 260
           TG+  S    G++GLG G LSL+ Q   +     +FS C   +  +    G++ LG I  
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLC---YGGMDIGGGAMVLGGISP 250

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           P  + +T    +P RS  Y V+L  I V  + + +  G   F+   GA  ++DSGT +  
Sbjct: 251 PSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGI--FDGRYGA--VLDSGTTYAY 304

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMF- 366
           L A A++A +D     + S   +  + G      D C+S        +    PT+ ++F 
Sbjct: 305 LPAEAFSAFKDAIMDEIHS---LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFE 361

Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           +G  ++L P++    HS      CL +    +N N    ++  +  +N  ++YD  NS++
Sbjct: 362 NGQKLSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGIVVRNTLVMYDRANSKI 418

Query: 426 GVARELCT 433
           G  +  C+
Sbjct: 419 GFWKTNCS 426


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 165/368 (44%), Gaps = 55/368 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F+   S+T+K + C    
Sbjct: 83  YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC---- 138

Query: 154 CKQVPNPTC----GGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKA 205
                N  C     G  C +   Y   ST +  L +D IS    ++++P    FGC    
Sbjct: 139 -----NIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193

Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQ 260
           TG+  S    G++GLG G LSL+ Q   +     +FS C   +  +    G++ LG I  
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLC---YGGMDIGGGAMVLGGISP 250

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           P  + +T    +P RS  Y V+L  I V  + + +  G   F+   GA  ++DSGT +  
Sbjct: 251 PSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGI--FDGRYGA--VLDSGTTYAY 304

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMF- 366
           L A A++A +D     + S   +  + G      D C+S        +    PT+ ++F 
Sbjct: 305 LPAEAFSAFKDAIMDEIHS---LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFE 361

Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           +G  ++L P++    HS      CL +    +N N    ++  +  +N  ++YD  NS++
Sbjct: 362 NGQKLSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGIVVRNTLVMYDRANSKI 418

Query: 426 GVARELCT 433
           G  +  C+
Sbjct: 419 GFWKTNCS 426


>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
          Length = 454

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 111/382 (29%), Positives = 154/382 (40%), Gaps = 64/382 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCS-------STVFNSAQSTTFKN 146
           Y +    GTP QTL + MDT +D  W PCT    C  CS       S +F    S++ K 
Sbjct: 90  YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149

Query: 147 LGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGY-------- 197
           LGC         NP CG         +GS   +     + T    T I P Y        
Sbjct: 150 LGCV--------NPKCG-------WIHGSKVQSRCRDCEPTSPNCTQICPPYLNFLRFWD 194

Query: 198 ----TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
                F        +    + + G GRG  SL +Q   L    FSYCL S +    + S 
Sbjct: 195 HRRSQFHRRMLCPLHQSTRREISGFGRGPPSLPSQ---LGLKKFSYCLLSRRYDDTTESS 251

Query: 254 RLGPIGQPKR------IKYTPLLKNPRR------SSLYYVNLLAIRVGRRVVDIPPGALQ 301
            L   G+         + YTP ++NP+       S  YY+ L  I VG + V IP   L 
Sbjct: 252 SLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLI 311

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSVPIVA 359
                  GTIIDSGT FT +    +  V   F ++V S     V  + G   C+++  + 
Sbjct: 312 PGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGLN 371

Query: 360 ----PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLNVIANMQ 410
               P +TL F  G  + LP  N +       + CL +    AA  +       ++ N Q
Sbjct: 372 TPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQ 431

Query: 411 QQNHRILYDVPNSRLGVARELC 432
           QQN  + YD+ N RLG  ++ C
Sbjct: 432 QQNFYVEYDLRNERLGFRQQSC 453


>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
 gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
          Length = 414

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 108/424 (25%), Positives = 169/424 (39%), Gaps = 73/424 (17%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
           LQ+ H  SP SPF P K L+  E +  ++   + R     S   +     P+        
Sbjct: 34  LQLIHRDSPESPFYPGK-LTNSERISRLVEFSKIRAHNFDSGFSSEAFRPPV-----FQD 87

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
              Y+V+ +IG P   L +  DT +   W           TV N      F+        
Sbjct: 88  FTCYLVKVRIGNPGIPLYLVPDTGSALIW-----------TVNNQ---NIFQ-------- 125

Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTI--SLATDIVPGYTFGCIQKATGNSVP 211
                   C    C++   Y   +I   ++   I  S  ++ +P Y FGC +     SV 
Sbjct: 126 --------CRNNKCSYTRRYDDGSITTGVAAQDILQSEGSERIPFY-FGCSRDNQNFSVF 176

Query: 212 PQ-----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA---------LSFSGSLRLGP 257
                  G++GL    +SLL Q  ++ Q  FSYCL  ++          L F   +R G 
Sbjct: 177 EHTGKSGGVMGLNTSPVSLLQQLSHITQRRFSYCLNPYQHGSEPPPSSLLRFGNDIRKGR 236

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
               +R + TPL+ +P R + Y++NLL + V  + + +PPG          GTIIDSGT 
Sbjct: 237 ----RRFQSTPLMSSPDRPN-YFLNLLDMTVAGQRLHLPPGTFALRQDGTGGTIIDSGTG 291

Query: 318 FTRLVAPAY----TAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSG 368
            T +   AY    +A ++ F  R    + +     FD CYS           ++T  F  
Sbjct: 292 LTFITQTAYPRLISAFQNYFDHRGFQRVHIPE---FDLCYSFRGNHTFHDHASMTFHFER 348

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            + T+  D + +     +  C+A+   P    +V+  I    Q N R +YD    +L   
Sbjct: 349 ADFTVQADYVYLPMEDDNAFCVALQPTPPQQRTVIGAI---NQGNTRFIYDAAAHQLLFI 405

Query: 429 RELC 432
            E C
Sbjct: 406 AENC 409


>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
          Length = 515

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 164/372 (44%), Gaps = 61/372 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
           Y     +GTPA + L+A+DT +D  WVPC  C+ C+              ++  A+STT 
Sbjct: 96  YYAWVDVGTPATSFLVALDTGSDLFWVPCD-CIQCAPLSGYRGNLDRDLRIYRPAESTTS 154

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISL--ATDIVP---GY 197
           ++L C    C+ VP  T     C +N+ Y S    ++  L +DT+ L    D VP     
Sbjct: 155 RHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214

Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
             GC QK +G   + + P GLL LG   +S+   LA+   L Q++FS C   FK  S SG
Sbjct: 215 IIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARA-GLVQNSFSMC---FKEDS-SG 269

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            +  G  G P + + TP +    +   Y VN+    +G + ++           T    +
Sbjct: 270 RIFFGDQGVPSQ-QSTPFVPLYGKLQTYAVNVDKSCIGHKCLE----------GTSFKAL 318

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
           +DSGT FT L    Y A    F +++ +         +  CYS     +P V PTITL F
Sbjct: 319 VDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDV-PTITLTF 377

Query: 367 S------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           +       +N  LP      +   G++    +A  P      + +IA      + +++D 
Sbjct: 378 AADKSLQAVNPILP-----FNDKQGALAGFCLAVLPS--TEPIGIIAQNFLVGYHVVFDR 430

Query: 421 PNSRLGVARELC 432
            + +LG  R  C
Sbjct: 431 ESMKLGWYRSEC 442


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 95/308 (30%), Positives = 141/308 (45%), Gaps = 39/308 (12%)

Query: 149 CQAAQCKQVPNPTCGGGA--------CAFNLTYGSSTIAANLSQ-----DTISLATDIV- 194
           C    C ++P P C   A        C+++  YG++    + ++     +T +   D   
Sbjct: 28  CGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAA 87

Query: 195 -PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----LSF 249
            PG  FGC  ++ G      GL+GLGRG LSL+ Q   L    F Y L S  +    +SF
Sbjct: 88  FPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQ---LNVEAFGYRLSSDLSAPSPISF 144

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
            GSL     G       TPLL NP    L  YYV L  I VG ++V IP G   F+ +TG
Sbjct: 145 -GSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTG 203

Query: 308 AGTII-DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD-TCY---SVPIVAPTI 362
           AG +I DSGT  T L  PAYT VRD    ++G      +    D  C+   S     P++
Sbjct: 204 AGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSM 263

Query: 363 TLMFS-GMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
            L F  G ++ L  +N L         +  C ++  +    +  L +I N+ Q +  +++
Sbjct: 264 VLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKS----SQALTIIGNIMQMDFHVVF 319

Query: 419 DVP-NSRL 425
           D+  N+R+
Sbjct: 320 DLSGNARM 327


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 158/364 (43%), Gaps = 47/364 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    S+T++ + C    
Sbjct: 77  YTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKC---- 132

Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKA 205
                NP+C     G  C +   Y   S+ +  +++D +S    +++ P    FGC    
Sbjct: 133 -----NPSCNCDDEGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVE 187

Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           TG+  S    G++GLGRG LS++ Q   + +   +FS C          G++ LG I  P
Sbjct: 188 TGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDV--GGGAMVLGQISPP 245

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
             + ++    NP RS  Y + L  + V  + + + P           GT++DSGT +   
Sbjct: 246 PNMVFS--HSNPYRSPYYNIELKELHVAGKPLKLKPKVFD----EKHGTVLDSGTTYAYF 299

Query: 322 VAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMN 370
              A+ A++D   + +     +        D C+S        +  V P + ++F SG  
Sbjct: 300 PEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQK 359

Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           ++L P++ L  H+      CL +     N N +  ++  +  +N  + YD  N ++G  +
Sbjct: 360 LSLSPENYLFRHTKVSGAYCLGIF---QNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWK 416

Query: 430 ELCT 433
             C+
Sbjct: 417 TNCS 420


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 108/420 (25%), Positives = 175/420 (41%), Gaps = 37/420 (8%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
           ++++ H  S  SPF  +    ++  V   + +   R    + ++V   +V    S   + 
Sbjct: 28  SVEIIHRDSSRSPFYRATETQFQR-VTNAVRRSMNRANHFNQISVYSNAV---ESPVTLL 83

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGC 149
               Y++   +GTP   +   +DT++D  WV C  C  C   +S +F+ + S T+KNL C
Sbjct: 84  DDGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPC 143

Query: 150 QAAQCKQVPNPTCGGGA---CAFNLTYGS-STIAANLSQDTISLATDIVPGYTF-----G 200
            +  CK V   +C       C   + Y   S    +L  +T++L +   P   F     G
Sbjct: 144 SSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIG 203

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--- 257
           CI+  T  S    G++GLG G +SL+ Q  +     FSYCL        S  L+ G    
Sbjct: 204 CIRN-TNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDR--SSKLKFGDAAM 260

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT-IIDSGT 316
           +     +    + K+ ++   YY+ L A  VG   ++    + +   ++G G  IIDSGT
Sbjct: 261 VSGDGTVSTRIVFKDWKK--FYYLTLEAFSVGNNRIEFRSSSSR---SSGKGNIIIDSGT 315

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---PIVAPTITLMFSGMNVTL 373
            FT L    Y+ +       V        L  F  CY      +  P IT  FSG +V L
Sbjct: 316 TFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKVDVPVITAHFSGADVKL 375

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              N  I   +  + CLA  ++         +  N+ QQN  + YD+    +      CT
Sbjct: 376 NALNTFI-VASHRVVCLAFLSSQSGA-----IFGNLAQQNFLVGYDLQRKIVSFKPTDCT 429


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 95/363 (26%), Positives = 158/363 (43%), Gaps = 46/363 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F   +S+T+  + C    
Sbjct: 88  YTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN-MD 146

Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN- 208
           C    N    G  C +   Y   S+ +  L +D IS    +++VP    FGC    TG+ 
Sbjct: 147 C----NCDHDGVNCVYERRYAEMSSSSGVLGEDIISFGNQSEVVPQRAVFGCENVETGDL 202

Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            S    G++GLGRG LS++ Q   +N+   +FS C          G++ LG I  P  + 
Sbjct: 203 YSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHV--GGGAMVLGGIPPPPDMV 260

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
           ++    +P RS  Y + L  I V  + + + P           GT++DSGT +  L   A
Sbjct: 261 FS--RSDPYRSPYYNIELKEIHVAGKPLKLSPSTFD----RKHGTVLDSGTTYAYLPEEA 314

Query: 326 YTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMFS-GMNV 371
           + A RD   ++   +  +  + G      D C+S        +    P + ++FS G  +
Sbjct: 315 FVAFRDAIIKK---SHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSNGQKL 371

Query: 372 TL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           +L P++ L  H+      CL +    D+   +  +I     +N  + YD  N ++G  + 
Sbjct: 372 SLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIV----RNTLVTYDRENEKIGFWKT 427

Query: 431 LCT 433
            C+
Sbjct: 428 NCS 430


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 106/387 (27%), Positives = 164/387 (42%), Gaps = 44/387 (11%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST----VFNS 138
           +P++SG   T +  Y VR ++GTPAQ  ++  DT +D  WV C+G    +      VF +
Sbjct: 99  MPLSSG-AYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRA 157

Query: 139 AQSTTFKNLGCQAAQCKQ-VP----NPTCGGGACAFNLTYGSSTIAANL---SQDTISLA 190
           A S ++  + C +  C   VP    N +     CA++  Y   + A  +      TI+L+
Sbjct: 158 AASRSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALS 217

Query: 191 TD----------IVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
                        + G   GC     G S     G+L LG  ++S  ++    +   FSY
Sbjct: 218 GSESRDGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSY 277

Query: 240 CLPSFKALSFSGS-LRLGPIG----------QPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
           CL    A   + S L  GP G                 TPLL + R S  Y V + A+ V
Sbjct: 278 CLVDHLAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHV 337

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
               +DIP  A  ++   G G I+DSGT  T L  PAY AV      R+ + L   S+  
Sbjct: 338 AGEALDIP--ADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERL-AGLPRVSMDP 394

Query: 349 FDTCYSVPIVA---PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           F+ CY+    A   P + + F+G     P     +   A  + C+ +    +     ++V
Sbjct: 395 FEYCYNWTAAALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQ---EGAWPGVSV 451

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           I N+ QQ+H   +D+ +  L      C
Sbjct: 452 IGNILQQDHLWEFDLRDRWLRFKHTRC 478


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 119/418 (28%), Positives = 184/418 (44%), Gaps = 63/418 (15%)

Query: 34  LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
           +  +H++S     KP      +E+ +E L   +A+      +       VPI     I Q
Sbjct: 35  VHSYHIYSR----KPPHVYHIKEASVERLEYLKAKTT--GDIIAHLSPNVPI-----IPQ 83

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
           +  ++V   IG+P  T L+ MDT++D  W+ C  C+ C   S  +F+ ++S T +N  C+
Sbjct: 84  A--FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCR 141

Query: 151 AAQCKQVPNPTCGGG--ACAFNLTY----GSSTIAA------NLSQDTISLAT--DIVPG 196
            +Q   +P+        +C +++ Y    GS  I A      N   D  S A   D+V  
Sbjct: 142 TSQ-YSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVV-- 198

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF-SGSLRL 255
             FGC     G  +   G+LGLG G  SL+ +    +   FSYC  S    S+    L L
Sbjct: 199 --FGCGHDNYGEPLVGTGILGLGYGEFSLVHR----FGKKFSYCFGSLDDPSYPHNVLVL 252

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDS 314
           G  G       TPL      +  YYV + AI V   ++ I P     N  TG  GTIID+
Sbjct: 253 GDDGANILGDTTPL---EIHNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDT 309

Query: 315 GTVFTRLVAPAY----TAVRDVFRRRVG----SNLTVTSLGGFDTCYSVPIVA---PTIT 363
           G   T LV  AY      + D+F  R      S   +  +  ++  +   +V    P +T
Sbjct: 310 GNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVT 369

Query: 364 LMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
             FS G  ++L   +L +   + ++ CLA+   P N+NS    I    QQ++ I YD+
Sbjct: 370 FHFSEGAELSLDVKSLFM-KLSPNVFCLAV--TPGNLNS----IGATAQQSYNIGYDL 420


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 104/373 (27%), Positives = 158/373 (42%), Gaps = 48/373 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R ++G P +   + +DT +D  WV C  C GC +T         F+   STT   + 
Sbjct: 83  YYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVS 142

Query: 149 CQAAQCK---QVPNPTCGGGA--CAFNLTYGS-STIAANLSQDTISLATDI--------V 194
           C    C    Q  +  C G +  CA+   YG  S  +     D I L   I         
Sbjct: 143 CSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSS 202

Query: 195 PGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
               FGC    TG+         G+ G G+  LS+++Q  ++ +    FS+CL      S
Sbjct: 203 ASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDD--S 260

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
             G L LG I +P  + YTPL+ +      Y +NL +I V  +V+ I P    F  ++  
Sbjct: 261 GGGILVLGEIVEPN-VVYTPLVPSQPH---YNLNLQSISVNGQVLPISPAV--FATSSSQ 314

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITL 364
           GTIIDSGT    L   AY A        V  +     L G + CY    SV  + P ++L
Sbjct: 315 GTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKG-NRCYVTSSSVSDIFPQVSL 373

Query: 365 MFSGMN--VTLPQDNLLIHSTAGSIT--CLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            F+G    V   QD L+  ++ G  T  C+     P      + ++ ++  ++   +YD+
Sbjct: 374 NFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIP---GQGITILGDLVLKDKIFIYDL 430

Query: 421 PNSRLGVARELCT 433
            N R+G     C+
Sbjct: 431 ANQRIGWTNYDCS 443


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 110/407 (27%), Positives = 164/407 (40%), Gaps = 39/407 (9%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
           EE V   +A  + RL +       R S    A     T+   YI    IG P Q     +
Sbjct: 44  EERVRRAVAVSRERLAYTQQQQQLRASGDVSAPVHLATRQ--YIAEYLIGDPPQRAAALI 101

Query: 115 DTSNDAAWVPCT---GCVGCSST---VFNSAQSTTFKNLGC--QAAQCKQVPNPTCG-GG 165
           DT ++  W  C    G   C+      +N ++S+TF  + C   A  C       CG  G
Sbjct: 102 DTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLCGLDG 161

Query: 166 ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGNSVPPQGLLGLGRGS 222
           +C F  +YG+ ++  +L  +  +  +       FGC+   +   G      GL+GLGRG 
Sbjct: 162 SCTFAASYGAGSVFGSLGTEAFTFQSGAAK-LGFGCVSLTRITKGALNGASGLIGLGRGR 220

Query: 223 LSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP----IGQPKRIKYTPLLKNPRR-- 275
           LSL++QT     + FSYCL P  +    S  L +G      G    +   P +K+P    
Sbjct: 221 LSLVSQTG---ATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYP 277

Query: 276 -SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG----AGTIIDSGTVFTRLVAPAYTAVR 330
            S+ YY+ L+ I VG   + IP  A +           G IID+G+  T L   AY+A+ 
Sbjct: 278 YSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALS 337

Query: 331 DVFRRRVGSNLTV-TSLGGFDTCYS---VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
           D   R++  +L    +  G D C +   V  V P +   F G                 S
Sbjct: 338 DEVARQLNRSLVQPPADTGLDLCVARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKS 397

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             C+ +            VI N QQQ+  +LYD+    L      C+
Sbjct: 398 TACMLI-----EEGGYETVIGNFQQQDVHLLYDIGKGELSFQTADCS 439


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 98/368 (26%), Positives = 160/368 (43%), Gaps = 45/368 (12%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLG 148
           T  P ++V   +G PA   L  MDT ++  WV C  C  C+     + + ++S+T+ +L 
Sbjct: 94  TYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLP 153

Query: 149 CQAAQCKQVPNPTCGG-GACAFNLTYGSSTIAANL--SQDTISLATD----IVPGYTFGC 201
           C    C   P+  C     C +NL+Y +   +A +  ++  I  ++D     VP   FGC
Sbjct: 154 CTNTMCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGC 213

Query: 202 IQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-------FKALSFSGS 252
             +  G+    +  G+ GLG+G  S + +      S FSYCL +       +  L F   
Sbjct: 214 SHE-NGDYKDRRFTGVFGLGKGITSFVTRM----GSKFSYCLGNIADPHYGYNQLVFGEK 268

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
                   P ++           +  YYV L  I VG + +DI   A        +  +I
Sbjct: 269 ANFEGYSTPLKVV----------NGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSA-LI 317

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-----PTITLMFS 367
           DSGT  T L   A+ A+ +  R+ +   L     G F  CY   +       P +T  FS
Sbjct: 318 DSGTALTWLAESAFRALDNEVRQLLDGVLMPFWRGSF-ACYKGTVSQDLIGFPVVTFHFS 376

Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
            G ++ L  +++   +T   I C+A+  A+A  N     +VI  M QQ + + YD+ +++
Sbjct: 377 GGADLDLDTESMFYQATP-DILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNK 435

Query: 425 LGVARELC 432
           L   R  C
Sbjct: 436 LFFQRIDC 443


>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
 gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
          Length = 491

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 115/405 (28%), Positives = 171/405 (42%), Gaps = 75/405 (18%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSS-------TVFNSAQSTTFKN 146
           Y     +GTP Q L + +DT +  +WVPCT    C  CSS        VF+   S++ + 
Sbjct: 89  YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148

Query: 147 LGCQAAQCKQVPNP----------TCGGGACA------------FNLTYGSSTIAANLSQ 184
           +GC+   C  + +P          +C G  C             + + YGS + A  L  
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLIS 208

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           DT+      V  +  GC   +     PP GL G GRG+ S+ +Q   L  + FSYCL S 
Sbjct: 209 DTLRTPGRAVRNFVIGCSLASVHQ--PPSGLAGFGRGAPSVPSQ---LGLTKFSYCLLSR 263

Query: 245 K---ALSFSGSLRL---GPIGQPKRIKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVD 294
           +     + SG L L   G       ++Y PL ++    P  S  YY+ L AI VG + V 
Sbjct: 264 RFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKSVQ 323

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTR----LVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
           +P  A       G G I+DSGT F+     +  P   AV      R   +  V    G  
Sbjct: 324 LPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLS 382

Query: 351 TCYSVP-----IVAPTITLMFSGMNV-TLPQDNLLI---HSTAGSITCLAMAAAPDNVNS 401
            C+++P     +  P ++L F G +V  LP +N  +    + +G    +A A     V+ 
Sbjct: 383 PCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSD 442

Query: 402 VLN--------------VIANMQQQNHRILYDVPNSRLGVARELC 432
           V                ++ + QQQN+ I YD+   RLG  R+ C
Sbjct: 443 VPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 162/367 (44%), Gaps = 54/367 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    ST+++ L C    
Sbjct: 76  YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC---- 131

Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLATD--IVPGY-TFGCIQKA 205
                NP C     G  C +   Y   S+ +  LS+D IS   +  + P    FGC  + 
Sbjct: 132 -----NPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEE 186

Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           TG+  S    G++GLGRG LS++ Q   + + +  FS C    +     G++ LG I  P
Sbjct: 187 TGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV--GGGAMVLGKISPP 244

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
             + ++    +P RS  Y ++L  + V  + + + P    FN     GT++DSGT +   
Sbjct: 245 PGMVFS--HSDPFRSPYYNIDLKQMHVAGKSLKLNPKV--FN--GKHGTVLDSGTTYAYF 298

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMF-S 367
              A+ A++D   + + S   +  + G      D C+S        +    P I + F +
Sbjct: 299 PKEAFIAIKDAVIKEIPS---LKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGN 355

Query: 368 GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           G  + L P++ L  H+      CL +   PD  ++ L  +  +  +N  + YD  N +LG
Sbjct: 356 GQKLILSPENYLFRHTKVRGAYCLGI--FPDRDSTTL--LGGIVVRNTLVTYDRENDKLG 411

Query: 427 VARELCT 433
             +  C+
Sbjct: 412 FLKTNCS 418


>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
 gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
 gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
          Length = 432

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 109/417 (26%), Positives = 174/417 (41%), Gaps = 84/417 (20%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNS-------- 138
           T +  Y++   +G P Q   + +DT +D  WVPC       C+ C +    S        
Sbjct: 20  TYTDGYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSP 79

Query: 139 -AQSTTFKNLGCQAAQCKQV-----PNPTCGGGACA---------------FNLTYGSST 177
              S+  K L C +  C  +      +  C    CA               F+ TYG   
Sbjct: 80  SQSSSNMKEL-CGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSDLCTRPCPPFSYTYGGGA 138

Query: 178 IA-ANLSQDTISLATDI--------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLA 227
           +   +L++D ++L   I        VPG+ FGC+    G+S+  P G+ G G+G LSL +
Sbjct: 139 LVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGCV----GSSIREPIGIAGFGKGILSLPS 194

Query: 228 QTQNLYQSTFSYCLPSFKAL---SFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVN 282
           Q   L    FS+C   F+     +F+ SL +G +    +    +TP+LK+    + YY+ 
Sbjct: 195 QLGFL-DKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPNFYYIG 253

Query: 283 LLAIRVGR-RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV----RDVFRRRV 337
           L  + +G    +  PP     +     G I+D+GT +T L  P YTA+      V     
Sbjct: 254 LEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLASVILYER 313

Query: 338 GSNLTVTSLGGFDTCYSVPIVA--------PTITLMFSG-MNVTLPQDNLLIHSTAGS-- 386
             +L + +  GFD C+ +P           P I   F G + +TLP+D+     TA    
Sbjct: 314 SYDLEMRT--GFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAPKNS 371

Query: 387 --ITCLAM-----AAAPDNVNSVLN----VIANMQQQNHRILYDVPNSRLGVARELC 432
             + CL           D+V    N    V+ + Q QN  ++YD+   R+G   + C
Sbjct: 372 VVVKCLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDC 428


>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 544

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 124/431 (28%), Positives = 185/431 (42%), Gaps = 81/431 (18%)

Query: 61  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTY----IVRAKIGTPAQTLLMAMDT 116
           M+ +D  R+     LA  R + +  A+G +  Q   +         +GTP    L+A+DT
Sbjct: 75  MVHRD--RVFHGRRLADDRDTPITFAAGNETHQIAAFGFLHFANVSVGTPPLWFLVALDT 132

Query: 117 SNDAAWVP--CTGCVGCSST---------VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
            +D  W+P  CT CV    T         ++   +S+T KN+ C +  CKQ    +  G 
Sbjct: 133 GSDLFWLPCNCTSCVRGLKTQNGKVIDLNIYELDKSSTRKNVPCNSNMCKQTQCHS-SGS 191

Query: 166 ACAFNLTYGSSTIAAN--LSQDTISLAT------DIVPGYTFGCIQKATG---NSVPPQG 214
           +C + + Y S+  +++  L +D + L T      DI    T GC Q  TG   N   P G
Sbjct: 192 SCRYEVEYLSNDTSSSGFLVEDVLHLITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNG 251

Query: 215 LLGLGRGSL---SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLK 271
           L GLG  ++   S+LAQ + L   +FS C  S      SG +  G  G   + K TP   
Sbjct: 252 LFGLGMENVSVPSILAQ-KGLISDSFSMCFGS----DGSGRITFGDTGSSDQGK-TPF-- 303

Query: 272 NPRRSS-LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
           N R S   Y V +  I VG    D      +F+       I DSGT FT L  PAYT + 
Sbjct: 304 NLRESHPTYNVTITQIIVGGYAAD-----HEFH------AIFDSGTSFTYLNDPAYTLIS 352

Query: 331 DVFRRRVGSN----LTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHS 382
           + F   V +N    L+  S   F+ CY +     I  P + L   G +     D ++  S
Sbjct: 353 EKFNSLVKANRHSPLSPDSDLPFEYCYDMSPDQTIEVPFLNLTMKGGDDYYVTDPIVPVS 412

Query: 383 T--AGSITCLAMAAAPDNVNSVLN--------------VIANMQQQN----HRILYDVPN 422
           +   G++ CL +  + DN+N +                +I    Q+N    +RI++D  N
Sbjct: 413 SEVEGNLLCLGIQKS-DNLNIIGREYTTEEEFLHLKHMIIKFFIQKNFMTGYRIVFDREN 471

Query: 423 SRLGVARELCT 433
             LG     CT
Sbjct: 472 MNLGWKESNCT 482


>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
 gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
          Length = 507

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 96/348 (27%), Positives = 151/348 (43%), Gaps = 62/348 (17%)

Query: 112 MAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN---PTCG 163
           + +DT++D  WV C     +     SS+ ++ A+S+T+  L C +A C ++       C 
Sbjct: 126 VVLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYYALACNSAACTELGRLYRGACV 185

Query: 164 GGACAFNL-------------TYGSSTIAANLSQDTISLATDIVPG----YTFGCIQ--- 203
              C + +             TYGS         D + L  D   G    + FGC     
Sbjct: 186 NNQCQYRVPIPSSPASSSSSGTYGS---------DLLKLTADPADGASMSFKFGCSHGEA 236

Query: 204 KATGNSV---PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP---SFKALSFSGSLRLGP 257
           K  G         G++ LG G  SL++Q   +Y S FSYC+P   S +   F     +G 
Sbjct: 237 KQGGEGSIDNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFVLGGGVGD 296

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           +        TP+L+  R  +LY V LLAI V  + +++ P          +G+++DS T 
Sbjct: 297 LSGAGGYAVTPMLRYARVPTLYRVRLLAIAVDGQQLNVTPSVF------ASGSVLDSRTA 350

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVT- 372
            TRL   AY A+R+ FR R+         G  DTCY       ++ P + L+  G  V  
Sbjct: 351 ITRLPPTAYQALREAFRSRMAMYREAPPQGNLDTCYDFAGAFLVMVPRVALLLDGNAVVA 410

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           L +  +L H       CL   +  D  + +  ++ N+QQQ   +LY+V
Sbjct: 411 LDRQGILFHD------CLVFTSNTD--DRMPGILGNVQQQTMEVLYNV 450


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 108/424 (25%), Positives = 180/424 (42%), Gaps = 40/424 (9%)

Query: 33  TLQVFHVFSPCSPFKPSKPLS---WEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGR 89
           T ++ H  SP SP   S+      W +++   +++     +  ++++        IA+G 
Sbjct: 32  TTELVHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANGG 91

Query: 90  QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKN 146
           +      Y++   +GTP   +L   DT +D  W  CT C  C   +   F+   S T+++
Sbjct: 92  E------YLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRD 145

Query: 147 LGCQAAQCKQV-PNPTCGGGA-CAFNLTYGSSTIA-ANLSQDTISL-ATDIVPGY----T 198
           L C   QC+ +  + +C     C ++  YG  +    NL+ DT++L +T+  P Y     
Sbjct: 146 LSCDTRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTV 205

Query: 199 FGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL-- 255
            GC ++  G       G++GLG G +SL++Q  +     FSYCL  F + S   S +L  
Sbjct: 206 IGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHF 265

Query: 256 --GPIGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
               +     ++ TPL+ KNP   + YY+ L A+ VG + ++           +    II
Sbjct: 266 GRNAVVSGSGVQSTPLISKNP--DTFYYLTLEAMSVGDKKIEF---GGSSFGGSEGNIII 320

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCY--SVPIVAPTITLMFSGM 369
           DSGT  T      +T         V     T  + G    CY  +  +  P IT  F+G 
Sbjct: 321 DSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPDLKVPVITAHFNGA 380

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           +V L   N  I   +  + CLA  +          +  N+ Q N  I YD+    +    
Sbjct: 381 DVVLQTLNTFIL-ISDDVLCLAFNSTQSGA-----IFGNVAQMNFLIGYDIQGKSVSFKP 434

Query: 430 ELCT 433
             CT
Sbjct: 435 TDCT 438


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 113/420 (26%), Positives = 171/420 (40%), Gaps = 64/420 (15%)

Query: 59  LEMLAKDQARLQFLSSLAVARKSVV-----PIASGRQITQS---------------PTYI 98
           +EM+ +D +R  F S      + V       I     + QS                 Y+
Sbjct: 31  VEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRANHLNQSFVSPNSPETTVISALGEYL 90

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCK 155
           +   +GTP+  +   +DT +D  W+ C  C  C   ++ +F+S++S T+K L C +  C+
Sbjct: 91  ISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPSNTCQ 150

Query: 156 QVPNPTCGG-GACAFNLTY--GSSTIAANLSQDTISLATD-----IVPGYTFGCIQ-KAT 206
            V    C     C +++ Y  GS ++  +LS +T++L +        PG   GC +  A 
Sbjct: 151 SVQGTFCSSRKHCLYSIHYVDGSQSL-GDLSVETLTLGSTNGSPVQFPGTVIGCGRYNAI 209

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKRIK 265
           G      G++GLGRG +SL+ Q        FSYCL P     S   +     +   +   
Sbjct: 210 GIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTASSKLNFGNAAVVSGRGTV 269

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDI-PPGALQFNPTTGAGT-IIDSGTVFTRLVA 323
            TPL         Y++ L A  VGR  ++   PG+       G G  IIDSGT  T L  
Sbjct: 270 STPLFSK-NGLVFYFLTLEAFSVGRNRIEFGSPGS------GGKGNIIIDSGTTLTALPN 322

Query: 324 PAYTAV-----RDVFRRRVGSNLTVTSLGGFDTCYSV-----PIVAPTITLMFSGMNVTL 373
             Y+ +     + V  +RV     V  L     CY V         P IT  FSG +VTL
Sbjct: 323 GVYSKLEAAVAKTVILQRVRDPNQVLGL-----CYKVTPDKLDASVPVITAHFSGADVTL 377

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              N  +   A  + C A             V  N+ QQN  + YD+  + +      CT
Sbjct: 378 NAINTFVQ-VADDVVCFAFQPTETGA-----VFGNLAQQNLLVGYDLQMNTVSFKHTDCT 431


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 113/452 (25%), Positives = 185/452 (40%), Gaps = 59/452 (13%)

Query: 1   MKPQLVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
           M P L FFLA LF + ++            S+TL+  H+    S     +  +  E +  
Sbjct: 13  MLPYL-FFLAILFAWPVT------------SATLRA-HL----SHVDDGRGFTKRELLRR 54

Query: 61  MLAKDQARLQFLS--SLAVARKSVVPIASGRQITQSPTYIVRAKIGTP-AQTLLMAMDTS 117
           M+ + +AR   L   S A AR +  P+        S  Y++   IG P +Q +++ +DT 
Sbjct: 55  MVVRSRARAANLCPYSGATARPATAPVGRANTDVNS-EYLIHLSIGAPRSQPVVLTLDTG 113

Query: 118 NDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG 174
           +D  W  C  C  C +     F++A S T +++ C    C       C    C +   YG
Sbjct: 114 SDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYG 173

Query: 175 SSTIA-ANLSQDTISLATD------IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLL 226
             +++  +  +D+ +           VP   FGC     G  +  + G+ G GRG LSL 
Sbjct: 174 DGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLP 233

Query: 227 AQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL------- 278
           +Q   L    FSYC  + F+A   S  + LG  G  K     P+L  P   SL       
Sbjct: 234 SQ---LKVRQFSYCFTTRFEAK--SSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNS 288

Query: 279 -YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
            Y ++   + VG+  + +P    +        T IDSGT  T      +  ++  F  + 
Sbjct: 289 HYVLSFKGVTVGKTRLPVP----EIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQA 344

Query: 338 GSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
              +  T+    D C+S         P +     G +  LP++N +         C+A++
Sbjct: 345 ALPVNKTADED-DICFSWDGKKTAAMPKLVFHLEGADWDLPRENYVTEDRESGQVCVAVS 403

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
            +     +   +I N QQQN  I+YD+   +L
Sbjct: 404 TSGQMDRT---LIGNFQQQNTHIVYDLAAGKL 432


>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
 gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 112/431 (25%), Positives = 161/431 (37%), Gaps = 91/431 (21%)

Query: 79  RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVF 136
           R+  +P++ G   T S T          +Q + + +DT +D  W PC    C+ C     
Sbjct: 70  RQVSLPLSPGSDYTLSFT--------INSQPISLYLDTGSDLVWFPCQPFECILCEGKAE 121

Query: 137 NSAQ--------STTFKNLGCQAAQCKQVPNPTCGGGACA-------------------- 168
           N++         S T   + C+++ C  V +       CA                    
Sbjct: 122 NASLASTPPPKLSKTATPVSCKSSACSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCP 181

Query: 169 -FNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
            F   YG  ++ A L +D+I L        I   +TFGC          P G+ G GRG 
Sbjct: 182 QFYYAYGDGSLIARLYRDSIRLPLSNQTNLIFNNFTFGCAHTTLAE---PIGVAGFGRGV 238

Query: 223 LSLLAQTQNL---YQSTFSYCLPSFKALSFSGS-------LRLGPIGQPKRIK------- 265
           LSL AQ   L     + FSYCL S    SF          L LG     ++ +       
Sbjct: 239 LSLPAQLATLSPQLGNQFSYCLVSH---SFDSDRVRRPSPLILGRYDHDEKERRVNGVKK 295

Query: 266 ----YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
               YT +L NPR    Y V L  I +GR+ +  P    + +     G ++DSGT FT L
Sbjct: 296 PSFVYTSMLDNPRHPYFYCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTML 355

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSL----GGFDTCY-----SVPIVAPTITLMFSGMNVT 372
            A  Y  V   F  RVG      S+     G   CY      V +    +  + +G +V 
Sbjct: 356 PASLYDFVVAEFENRVGRVNERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVV 415

Query: 373 LPQDNLLIH--------STAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDVP 421
           LP+ N                 + CL +    D           + N QQQ   ++YD+ 
Sbjct: 416 LPRRNYFYEFLDGGHGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLE 475

Query: 422 NSRLGVARELC 432
           N R+G AR  C
Sbjct: 476 NRRVGFARRQC 486


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 175/387 (45%), Gaps = 54/387 (13%)

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
           V P+ SG     S  Y  +  +GTP    LM +DT +D  W+ C  C  C   S  +F+ 
Sbjct: 133 VAPVVSG-LAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP 191

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-V 194
             S ++  + C A  C+++ +  C     AC + + YG  ++ A + + +T++ A+   V
Sbjct: 192 RASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARV 251

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS---- 250
           P    GC     G  V   GLLGLGRGSLS  +Q    +  +FSYCL    + S S    
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSR 311

Query: 251 ------GSLRLGPIGQPKRIKYTPLLKNPR------RSSLYYVNLLAIRVGRRVVDIPPG 298
                 GS   G +G  +R+ + P  + P+      R++  +      R GR  V  PP 
Sbjct: 312 SSTVTFGSGARGALG--RRVLH-PDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPP- 367

Query: 299 ALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRD----VFRRRVGSNLTVTSLGG---FD 350
               +P+TG  G I+DSG       +PA+           R R  +     S GG   FD
Sbjct: 368 ----DPSTGRGGVIVDSGR-----PSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSLFD 418

Query: 351 TCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
           TCY +     +  PT+++ F+ G    LP +N LI   +    C A A      +  +++
Sbjct: 419 TCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFA----GTDGGVSI 474

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
           I N+QQQ  R+++D    RLG   + C
Sbjct: 475 IGNIQQQGFRVVFDGDGQRLGFVPKGC 501


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 160/364 (43%), Gaps = 48/364 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    ST+++ L C    
Sbjct: 76  YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC---- 131

Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLATD--IVPGY-TFGCIQKA 205
                NP C     G  C +   Y   S+ +  LS+D IS   +  + P    FGC  + 
Sbjct: 132 -----NPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEE 186

Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           TG+  S    G++GLGRG LS++ Q   + + +  FS C    +     G++ LG I  P
Sbjct: 187 TGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV--GGGAMVLGKISPP 244

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
             + ++    +P RS  Y ++L  + V  + + + P    FN     GT++DSGT +   
Sbjct: 245 PGMVFS--HSDPFRSPYYNIDLKQMHVAGKSLKLNPKV--FN--GKHGTVLDSGTTYAYF 298

Query: 322 VAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMN 370
              A+ A++D   + + S   +        D C+S        +    P I + F +G  
Sbjct: 299 PKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQK 358

Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           + L P++ L  H+      CL +   PD  ++ L  +  +  +N  + YD  N +LG  +
Sbjct: 359 LILSPENYLFRHTKVRGAYCLGI--FPDRDSTTL--LGGIVVRNTLVTYDRENDKLGFLK 414

Query: 430 ELCT 433
             C+
Sbjct: 415 TNCS 418


>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
          Length = 648

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 115/405 (28%), Positives = 171/405 (42%), Gaps = 75/405 (18%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSS-------TVFNSAQSTTFKN 146
           Y     +GTP Q L + +DT +  +WVPCT    C  CSS        VF+   S++ + 
Sbjct: 89  YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148

Query: 147 LGCQAAQCKQVPNP----------TCGGGACA------------FNLTYGSSTIAANLSQ 184
           +GC+   C  + +P          +C G  C             + + YGS + A  L  
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLIS 208

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
           DT+      V  +  GC   +     PP GL G GRG+ S+ +Q   L  + FSYCL S 
Sbjct: 209 DTLRTPGRAVRNFVIGCSLASVHQ--PPSGLAGFGRGAPSVPSQ---LGLTKFSYCLLSR 263

Query: 245 K---ALSFSGSLRL---GPIGQPKRIKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVD 294
           +     + SG L L   G       ++Y PL ++    P  S  YY+ L AI VG + V 
Sbjct: 264 RFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKSVQ 323

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFT----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
           +P  A       G G I+DSGT F+     +  P   AV      R   +  V    G  
Sbjct: 324 LPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLS 382

Query: 351 TCYSVP-----IVAPTITLMFSGMNV-TLPQDNLLI---HSTAGSITCLAMAAAPDNVNS 401
            C+++P     +  P ++L F G +V  LP +N  +    + +G    +A A     V+ 
Sbjct: 383 PCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSD 442

Query: 402 VLN--------------VIANMQQQNHRILYDVPNSRLGVARELC 432
           V                ++ + QQQN+ I YD+   RLG  R+ C
Sbjct: 443 VPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 161/367 (43%), Gaps = 54/367 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    S+++K L C    
Sbjct: 80  YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKC---- 135

Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLATD--IVPGY-TFGCIQKA 205
                NP C     G  C +   Y   S+ +  LS+D IS   +  + P    FGC    
Sbjct: 136 -----NPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCENVE 190

Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           TG+  S    G++GLGRG LS++ Q   + + +  FS C    +     G++ LG I  P
Sbjct: 191 TGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV--GGGAMVLGKISPP 248

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
             + ++    +P RS  Y ++L  + V  + + + P    FN     GT++DSGT +   
Sbjct: 249 AGMVFSH--SDPFRSPYYNIDLKQMHVAGKSLKLNPKV--FNGK--HGTVLDSGTTYAYF 302

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMF-S 367
              A+ A++D   + + S   +  + G      D C+S        +    P I + F +
Sbjct: 303 PKEAFIAIKDAIIKEIPS---LKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFGN 359

Query: 368 GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           G  + L P++ L  H+      CL +   PD  ++ L  +  +  +N  + YD  N +LG
Sbjct: 360 GQKLILSPENYLFRHTKVRGAYCLGI--FPDRDSTTL--LGGIVVRNTLVTYDRENDKLG 415

Query: 427 VARELCT 433
             +  C+
Sbjct: 416 FLKTNCS 422


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 116/438 (26%), Positives = 165/438 (37%), Gaps = 82/438 (18%)

Query: 66  QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIG--TPAQTLLMAMDTSNDAAWV 123
           + R   L S    R+  +P+A G        Y +   +G  + A  + + +DT +D  W 
Sbjct: 58  RHRTHHLPSSRRHRQLSLPLAPGSD------YTLSLSVGPLSTANPVSLFLDTGSDLVWF 111

Query: 124 PCTG-----CVG------------------------CSSTVFNSAQSTTFKNLGCQAAQC 154
           PC       C G                        C+S   ++A S+      C AA+C
Sbjct: 112 PCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHSSAPPADLCAAARC 171

Query: 155 --KQVPNPTCGGG-ACA-FNLTYGSSTIAANLSQDTISLATDI-VPGYTFGCIQKATGNS 209
               +   +C    AC      YG  ++ A L +  + +A  + V  +TF C   A G  
Sbjct: 172 PLDDIETGSCAASHACPPLYYAYGDGSLVARLRRGRVGIAASVAVENFTFACAHTALGE- 230

Query: 210 VPPQGLLGLGRGSLSLLAQ-TQNLYQSTFSYCL--PSFKALSFSGSLRLGPI-------- 258
             P G+ G GRG LSL AQ         FSYCL   SF+A      +R  P+        
Sbjct: 231 --PVGVAGFGRGPLSLPAQLAPAALSGRFSYCLVAHSFRA---DRPIRPSPLILGRSPGE 285

Query: 259 --GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
                  I YTPLL NP+    Y V L A+ VG   +   P   +       G ++DSGT
Sbjct: 286 DPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRVGRAGDGGMVVDSGT 345

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG-----GFDTCYSVPIVA-----------P 360
            FT L    Y  V + F R + +     +       G   CY     A           P
Sbjct: 346 TFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDHDASAAEEGSARAVP 405

Query: 361 TITLMFSG-MNVTLPQDNLLI---HSTAGSITCLA-MAAAPDNVNSVLNVIANMQQQNHR 415
            + + F G   V LP+ N  +         + CL  M    D+       + N QQQ   
Sbjct: 406 PLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGGGPAGTLGNFQQQGFE 465

Query: 416 ILYDVPNSRLGVARELCT 433
           ++YDV   R+G AR  CT
Sbjct: 466 VVYDVDAGRVGFARRRCT 483


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 70/199 (35%), Positives = 101/199 (50%), Gaps = 5/199 (2%)

Query: 94  SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
           S  Y  R  +GTP +   M +DT +D AW+ C  C  C S    +FN + S +F  +GC 
Sbjct: 154 SGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCD 213

Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNS 209
           +A C Q+    C  G C +  +YG  + +  + + +T++  T  V     GC  K  G  
Sbjct: 214 SAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTFGTTSVANVAIGCGHKNVGLF 273

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
           +   GLLGLG G+LS   Q       TFSYCL   ++ S SG L+ GP   P    +TPL
Sbjct: 274 IGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRESDS-SGPLQFGPKSVPVGSIFTPL 332

Query: 270 LKNPRRSSLYYVNLLAIRV 288
            KNP   + YY+++ AI +
Sbjct: 333 EKNPHLPTFYYLSVTAISI 351


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 119/427 (27%), Positives = 185/427 (43%), Gaps = 50/427 (11%)

Query: 33  TLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
           ++ + H  SP SPF  PS  L+  E +     +  +RL  +S           +     I
Sbjct: 33  SIDLIHRDSPLSPFYDPS--LTPSERITNAAFRSSSRLNRVSHFLDENN----LPESLLI 86

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLG 148
            ++  Y++   IGTP    L   DT +D  WV C+ C  C    + +F   +S+TFK   
Sbjct: 87  PENGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAAT 146

Query: 149 CQAAQCKQVP--NPTCGG-GACAFNLTYGSSTIAAN-LSQDTISLA------TDIVPGYT 198
           C +  C  VP     CG  G C ++ +YG  +     +  +T+S        T   P   
Sbjct: 147 CDSQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSI 206

Query: 199 FGC-----IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
           FGC         T + V     LG G  SL      Q  Y+  FSYCL  F + S +  L
Sbjct: 207 FGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYK--FSYCLLPFSSNS-TSKL 263

Query: 254 RLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
           + G   I     +  TPL+  P   S Y++NL A+ +G++VV  P G       T    I
Sbjct: 264 KFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVV--PTGR------TDGNII 315

Query: 312 IDSGTVFTRLVAPAYT----AVRDVFRRRVGSNLTVTSLGGFDTCYSV-PIVAPTITLMF 366
           IDSGTV T L    Y     ++++V       +L       F  C+    +  P I   F
Sbjct: 316 IDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFP----FKFCFPYRDMTIPVIAFQF 371

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           +G +V L   NLLI     ++ CLA+   P +++ + ++  N+ Q + +++YD+   ++ 
Sbjct: 372 TGASVALQPKNLLIKLQDRNMLCLAV--VPSSLSGI-SIFGNVAQFDFQVVYDLEGKKVS 428

Query: 427 VARELCT 433
            A   CT
Sbjct: 429 FAPTDCT 435


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 168/369 (45%), Gaps = 41/369 (11%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---------CVGCSSTVFNSAQSTTFKNLG 148
           +V   IGTP Q   + +DT +  +W+ C                +T F+ + S++F  L 
Sbjct: 67  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPPLPKPKTTSFDPSLSSSFSLLP 126

Query: 149 CQAAQCK-QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI-VPGYTFG 200
           C    CK ++P+   PT C     C ++  Y   T+A  NL ++  + +  +  P    G
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILG 186

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
           C Q +T N    +G+LG+ RG LS ++Q +    S FSYC+PS    + +G   LG    
Sbjct: 187 CAQASTEN----RGILGMNRGRLSFISQAK---ISKFSYCVPSRTGSNPTGLFYLGDNPN 239

Query: 261 PKRIKYTPLLKNPRRSS-------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
             + KY  +L  P   S        Y + + AI++  + +++PP A + +      T+ID
Sbjct: 240 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGGSGQTMID 299

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVPIVAPT------ITLM 365
           SG+  T LV  AY  V++   R VG+ +    +     D C+   + A        I+  
Sbjct: 300 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 359

Query: 366 F-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           F +G+ + + +   ++      + C+ +  + + +    N+I  + QQN  + YD+ N R
Sbjct: 360 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS-ERLGIGSNIIGTVHQQNMWVEYDLANKR 418

Query: 425 LGVARELCT 433
           +G     C+
Sbjct: 419 VGFGGAECS 427


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 111/417 (26%), Positives = 166/417 (39%), Gaps = 87/417 (20%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSS----------TVFNSAQST 142
           Y++   IGTP Q + + MDT +D  WVPC      C  C              F    S+
Sbjct: 21  YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 80

Query: 143 TFKNLGCQAAQCKQV---PNP-----------------TCGGGACAFNLTYGSS-TIAAN 181
           T     C ++ C  +    NP                 TC     +F  TYG+S  +  +
Sbjct: 81  TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 140

Query: 182 LSQDTI---------SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
           L++D +         +     +P + FGC+         P G+ G GRG LSL  Q    
Sbjct: 141 LTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPFQL-GF 196

Query: 233 YQSTFSYCLPSFK---ALSFSGSLRLGPIG---QPKRIKYTPLLKNPRRSSLYYVNLLAI 286
               FS+C   FK     +FS  L LG +    + + +++TPLLK+P   + YY+ L +I
Sbjct: 197 SHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLESI 256

Query: 287 RVGRRVVDIPPGA----LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SN 340
            +G    +   G      + +     G +IDSGT +T L  P Y+ +       +G    
Sbjct: 257 TIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGYPRA 316

Query: 341 LTVTSLGGFDTCYSVPIVA-----------PTITLMF-SGMNVTLPQDNLL------IHS 382
             V    GFD CY VP              P+IT  F + ++V LPQ N        I+S
Sbjct: 317 KQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINS 376

Query: 383 TAGSITCLAMAAAPDNVNSVL-------NVIANMQQQNHRILYDVPNSRLGVARELC 432
           T   + CL   +     +           +  + QQQN  ++YD+   RLG     C
Sbjct: 377 TV--VKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 431


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 102/400 (25%), Positives = 171/400 (42%), Gaps = 40/400 (10%)

Query: 60  EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
           ++ ++   R +  + +A +    +P++SG     +  Y V+  +GTPAQ   +  DT ++
Sbjct: 55  QLPSRRGGRQRVAAEVASSSAVSLPMSSG-AYAGTGQYFVKVLVGTPAQEFTLVADTGSE 113

Query: 120 AAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-QVP----NPTCGGGACAFNLTY- 173
             WV C G       VF    S ++  + C +  CK  VP    N +     C+++  Y 
Sbjct: 114 LTWVKCAGGASPPGLVFRPEASKSWAPVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYK 173

Query: 174 -GSSTIAANLSQDTISLATDIVPG--------YTFGCIQKATGNSVPP-QGLLGLGRGSL 223
            GS+     +  D+ ++A   +PG           GC     G S     G+L LG   +
Sbjct: 174 EGSAGALGVVGTDSATIA---LPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKI 230

Query: 224 SLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGPIGQPKRI--KYTPLLKNPRRSSLYY 280
           S  ++    +  +FSYCL    A  + +G L  GP GQ  R     T L  +P     Y 
Sbjct: 231 SFASRAAARFGGSFSYCLVDHLAPRNATGYLAFGP-GQVPRTPATQTKLFLDPAM-PFYG 288

Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           V + A+ V  + +DIP  A  ++P +G G I+DSGT  T L  PAY AV     + + + 
Sbjct: 289 VKVDAVHVAGQALDIP--AEVWDPKSG-GVILDSGTTLTVLATPAYKAVVAALTKLL-AG 344

Query: 341 LTVTSLGGFDTCYS-------VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
           +       F+ CY+        P + P + + F+G     P     +      + C+ + 
Sbjct: 345 VPKVDFPPFEHCYNWTAPRPGAPEI-PKLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQ 403

Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              +     ++VI N+ QQ H   +D+ N  +      CT
Sbjct: 404 ---EGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTCT 440


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 122/424 (28%), Positives = 185/424 (43%), Gaps = 55/424 (12%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS---------------SL 75
           S+ L++ H   PC+   PS+  S   S  E+L  D+ R +++                + 
Sbjct: 422 SAVLRLTHRHGPCA--GPSRSAS-APSFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTA 478

Query: 76  AVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--- 131
           A + KSV +P   G  I  +  Y+V   +GTP     + +DT +D +WV C  C      
Sbjct: 479 ASSSKSVTIPANIGHSIG-TLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACY 537

Query: 132 --SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQD 185
                +F+ A+S+++  + C A  C ++     G   G  C + ++YG  S        D
Sbjct: 538 AQKDQLFDPAKSSSYSAVPCAADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSD 597

Query: 186 TISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY-QSTFSYCLPS 243
           T++L   D V G+ FGC     G      GLL LGR  +SL +QT   Y    FSYCLP 
Sbjct: 598 TLTLTDADAVTGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLP- 656

Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQF 302
             + S +G L LG          T LL      + Y V L  I V G+++  +P  A   
Sbjct: 657 -PSPSSTGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFA- 714

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL---TVTSLGGFDTCYSV---- 355
                 GT++D+GTV TRL  P   A      R   +        + G  DTCY+     
Sbjct: 715 -----GGTVVDTGTVITRL-PPTAYAALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYG 768

Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
            +  PT++L FSG   TL  D     S+     CLA A    + +    ++ N+QQ++  
Sbjct: 769 TVTLPTVSLTFSG-GATLKLDAPGFLSSG----CLAFATNSGDGDPA--ILGNVQQRSFA 821

Query: 416 ILYD 419
           + +D
Sbjct: 822 VRFD 825


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 122/452 (26%), Positives = 191/452 (42%), Gaps = 61/452 (13%)

Query: 9   LAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQAR 68
           L F+F FSL+        T   +  +++ H  S  SP+  SK   W+    ++L +  + 
Sbjct: 23  LPFIFHFSLTTATIT---TSTINLVIKLIHHESSLSPYN-SKDTIWDHYSHKILKQTFSN 78

Query: 69  LQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC 128
             ++S+L  + + VV             +++   IG P    L  MDT +   WV C  C
Sbjct: 79  -DYISNLVPSPRYVV-------------FLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPC 124

Query: 129 VGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY---GSST---IA 179
             CS     +F+ ++S+T+ NL C       V N     G C +++ Y   GSS      
Sbjct: 125 SSCSQQSVPIFDPSKSSTYSNLSCSECNKCDVVN-----GECPYSVEYVGSGSSQGIYAR 179

Query: 180 ANLSQDTISLATDIVPGYTFGCIQK--ATGNSVPPQGL---LGLGRGSLSLLAQTQNLYQ 234
             L+ +TI  +   VP   FGC +K   + N  P QG+    GLG G  SLL      + 
Sbjct: 180 EQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLGSGRFSLLPS----FG 235

Query: 235 STFSYCLPSFKALSFS-GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
             FSYC+ + +  ++    L LG     +    T  + N     LYYVNL AI +G R +
Sbjct: 236 KKFSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLNVIN----GLYYVNLEAISIGGRKL 291

Query: 294 DIPPGALQFNPT-TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG---F 349
           DI P   + + T   +G IIDSG   T L    +  +       +   L +        +
Sbjct: 292 DIDPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPY 351

Query: 350 DTCYSVPIVA-----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAP---DNVN 400
             CYS  +       P +T  F+ G  + L   ++ I +T     C+AM       D+  
Sbjct: 352 TLCYSGVVSQDLSGFPLVTFHFAEGAVLDLDVTSMFIQTTENEF-CMAMLPGNYFGDDYE 410

Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           S  + I  + QQN+ + YD+   R+   R  C
Sbjct: 411 S-FSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 155/392 (39%), Gaps = 64/392 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPC--------------TGCVGCSSTVFNSAQST 142
           Y VR ++GTPAQ  L+  DT +D  WV C              +         F   +S 
Sbjct: 95  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154

Query: 143 TFKNLGCQAAQCKQ--------VPNPTCGGGACAFNLTYGSSTIA---ANLSQDTISLAT 191
           T+  + C +  C +         P P   G  CA++  Y   + A         TI+L++
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTP---GSPCAYDYRYKDGSAARGTVGTESATIALSS 211

Query: 192 DI-----------VPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSY 239
                        + G   GC    TG S     G+L LG  ++S  +   + +   FSY
Sbjct: 212 SSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFSY 271

Query: 240 CLPSFKA-------LSFSGSLRLG---PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG 289
           CL    +       L+F  +  L    P       + TPL+ + R    Y V++ AI V 
Sbjct: 272 CLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVD 331

Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF 349
             ++ IP    + +   G G I+DSGT  T L  PAY AV     +++ +     ++  F
Sbjct: 332 GELLKIPRDVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKL-ARFPRVAMDPF 388

Query: 350 DTCYSVPIVA--------PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
           + CY+    +        P + + F+G     P     +   A  + C+ +   P     
Sbjct: 389 EYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGP---WP 445

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            ++VI N+ QQ H   +D+ N RL   R  CT
Sbjct: 446 GISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477


>gi|147794033|emb|CAN68918.1| hypothetical protein VITISV_035156 [Vitis vinifera]
          Length = 398

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 105/416 (25%), Positives = 165/416 (39%), Gaps = 94/416 (22%)

Query: 31  SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS---VVPIAS 87
           S  L +   + PCS    S+P S +E    +  +D++R+ F++S      S        +
Sbjct: 63  SQGLPITQKYGPCSGSGHSQPPSPQE----IXGRDESRVSFINSKCNQYTSGNLKNHAHN 118

Query: 88  GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTF 144
                +   ++V    GTP Q   + +DT +   W  C  CV C   S   FB + S+T+
Sbjct: 119 NNLFDEDGNFLVDVAFGTPPQXFXLILDTGSSITWTQCKACVNCLQDSXRYFBXSASSTY 178

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCI 202
               C     +             +N+TYG  ST   N    T++L  +D+   + FG  
Sbjct: 179 SXGSCIPXTVEN-----------NYNMTYGDDSTSVGNYGCXTMTLEPSDVFQKFQFGXG 227

Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IG 259
           +   G+      G+LGLG+G LS ++QT + +   FSYCLP   ++   GSL  G     
Sbjct: 228 RNNKGDFGSGADGMLGLGQGQLSTVSQTASKFXKVFSYCLPEEDSI---GSLLFGEKATS 284

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
           Q   +K+T L+  P  S L                                  +SG  F 
Sbjct: 285 QSSSLKFTSLVNGPGTSGL---------------------------------XESGYYFV 311

Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNL 378
           +L+        D+                     SV ++ P I L F G  +V L   N+
Sbjct: 312 KLL--------DI---------------------SVDVLLPEIVLHFGGGADVRLNGTNI 342

Query: 379 LIHSTAGSITCLAMAA-APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           +  S A S  CLA A  +   +N  L +I N QQ +  +LYD+   R+G     C+
Sbjct: 343 VWGSDA-SRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 397


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 154/361 (42%), Gaps = 34/361 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTP       +DT +D  W   +PCT C    + +F+   S+T+ N+   +  
Sbjct: 59  YLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSES 118

Query: 154 CKQVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATDI-----VPGYTFGCIQKA 205
           C ++ + +C      C +  +Y   +I    L+Q+T++L +       + G  FGC    
Sbjct: 119 CSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNN 178

Query: 206 TG-NSVPPQGLLGLGRGSLSLLAQTQNLY-QSTFSYCLPSFKA-------LSF-SGSLRL 255
            G  +    G++GLGRG LSL++Q  + +    FS CL  F         +SF  GS  L
Sbjct: 179 NGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVL 238

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           G       +  TPL+      + Y+V LL I V    +    G+    P T    +IDSG
Sbjct: 239 G-----NGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFNDGS-SLEPITKGNMVIDSG 292

Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVP--IVAPTITLMFSGMNVT 372
           T  T L    Y  + +  R +V  + + +    G+  CY  P  +   T+T  F G +V 
Sbjct: 293 TPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTPTNLKGTTLTAHFEGADVL 352

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           L    + I    G I C A  +   N      +  N  Q N+ I +D+    +      C
Sbjct: 353 LTPTQIFIPVQDG-IFCFAFTSTFSN---EYGIYGNHAQSNYLIGFDLEKQLVSFKATDC 408

Query: 433 T 433
           T
Sbjct: 409 T 409


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 115/421 (27%), Positives = 164/421 (38%), Gaps = 68/421 (16%)

Query: 57  SVLEMLAKDQARLQFLSSLAVAR--------------KSVV---PIASG--RQITQ---- 93
           SVLE+  +D  R+Q L    + +              K VV   P+AS    Q  Q    
Sbjct: 99  SVLELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVAT 158

Query: 94  --------SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFK 145
                   S  Y +   +G+P +   + +DT +D  W+ C  C  C              
Sbjct: 159 LESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDC-------------- 204

Query: 146 NLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKA 205
               Q    +  P     G +      +   T   NL+ +  S     V    FGC    
Sbjct: 205 ---FQQNDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWN 261

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            G      GLLGLGRG LS  +Q Q+LY  +FSYCL    + +   S  +   G+ K + 
Sbjct: 262 RGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLI--FGEDKDLL 319

Query: 266 YTPLL--------KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
             P L        K     + YYV + +I V   V++IP      +     GTIIDSGT 
Sbjct: 320 SHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTT 379

Query: 318 FTRLVAPAYTAVRD-VFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNV 371
            +    PAY  +++ +  +  G           D C++V     +  P + + F+ G   
Sbjct: 380 LSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADGAVW 439

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
             P +N  I      + CLAM   P    S  ++I N QQQN  ILYD   SRLG A   
Sbjct: 440 NFPTENSFIWLNE-DLVCLAMLGTP---KSAFSIIGNYQQQNFHILYDTKRSRLGYAPTK 495

Query: 432 C 432
           C
Sbjct: 496 C 496


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 152/363 (41%), Gaps = 34/363 (9%)

Query: 91  ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFK 145
           I  +  Y++R  IGTP+   L   DT +D  WV C     T C   ++ +++   S+TF 
Sbjct: 90  IPNNGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFT 149

Query: 146 NLGCQAAQCKQVP--NPTCGG-GACAFNLTYGSSTIA-ANLSQDTISLA---TDIVPGYT 198
            L C +  C Q+P     C   G C +  TYG ++ +   LS D+I L            
Sbjct: 150 LLPCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKIC 209

Query: 199 FGC--IQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
           FGC    K T + S    G++GLG G LSL++Q  +     FSYCL  F + S S  L+ 
Sbjct: 210 FGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNS-KLKF 268

Query: 256 GP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
           G   I Q   +  TPL+  P     YY+NL  I VG + V            T    IID
Sbjct: 269 GEAAIVQGNGVVSTPLIIKPDL-PFYYLNLEGITVGAKTVK--------TGQTDGNIIID 319

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI---VAPTITLMFSGMN 370
           SG+  T L    Y     + +  V           FD C++        P +   F+G +
Sbjct: 320 SGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGGD 379

Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           V L   N L+      I    + +  D +     +  N+ Q +  + YD+   ++  A  
Sbjct: 380 VVLKPMNTLVLIEDNLICSTVVPSHFDGI----AIFGNLGQIDFHVGYDIQGGKVSFAPT 435

Query: 431 LCT 433
            C+
Sbjct: 436 DCS 438


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/380 (26%), Positives = 158/380 (41%), Gaps = 42/380 (11%)

Query: 75  LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST 134
           LA    +VVP     ++      +    IGTP Q     +D + +  W  C+ C+ C   
Sbjct: 6   LADGGGAVVPFHWSPELYN----VANFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQ 61

Query: 135 ---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLA 190
              VF    S+TFK   C    CK +P P C    CAF+   G        ++ DT ++ 
Sbjct: 62  DLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIG 121

Query: 191 TDIVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA--- 246
           T       FGC+  +  +++  P G +GLGR   SL+AQ   +  + FSYCL        
Sbjct: 122 TAAPASLGFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQ---MKLTRFSYCLAPHDTGKN 178

Query: 247 --LSFSGSLRLGPIGQPKRIKYTPLLK---NPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
             L    S +L   G      +TP +K   N   S  Y + L  I+ G   + +P     
Sbjct: 179 SRLFLGASAKLAGGG-----AWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMP----- 228

Query: 302 FNPTTGAGTIIDSGTV--FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG-FDTCYSVPIV 358
                G  T++    V   + LV   Y   +      VG+  T T +G  F+ C+    V
Sbjct: 229 ----RGRNTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGEPFEVCFPKAGV 284

Query: 359 --APTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV--LNVIANMQQQN 413
             AP +   F +G  +T+P  N L      ++    M+ A  N+ ++  LN++ + QQ+N
Sbjct: 285 SGAPDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQEN 344

Query: 414 HRILYDVPNSRLGVARELCT 433
             +L+D+    L      C+
Sbjct: 345 VHLLFDLDKDMLSFEPADCS 364


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 98/358 (27%), Positives = 159/358 (44%), Gaps = 38/358 (10%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--FNSAQSTTFKNLGCQAAQC 154
           ++    IG P    L+ +DT +D  W+ C  C     T+  F+ ++S+T++N  C +A  
Sbjct: 78  FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQTIPFFHPSRSSTYRNASCVSAP- 136

Query: 155 KQVPN--PTCGGGACAFNLTY----------GSSTIAANLSQDTISLATDIVPGYTFGCI 202
             +P        G C ++L Y              +    S D +    +IV    FGC 
Sbjct: 137 HAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIV----FGCG 192

Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK 262
           Q  +G +    G+LGLG G+ S++ +    + S FSYC  S    ++  ++ +   G   
Sbjct: 193 QDNSGFT-KYSGVLGLGPGTFSIVTRN---FGSKFSYCFGSLTNPTYPHNILILGNGAKI 248

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
               TPL     R   YY++L AI  G +++DI PG  Q   + G GT+ID+G   T L 
Sbjct: 249 EGDPTPLQIFQDR---YYLDLQAISFGEKLLDIEPGTFQRYRSQG-GTVIDTGCSPTILA 304

Query: 323 APAYTAVRDVFRRRVGSNL-TVTSLGGFDT-CYSVPIVA-----PTITLMFS-GMNVTLP 374
             AY  + +     +G  L  V     + T CY   +       P +T  F+ G  + L 
Sbjct: 305 REAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALD 364

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            ++L + S +G   CLAM     N    ++VI  M QQN+ + Y++   ++   R  C
Sbjct: 365 VESLFVSSESGDSFCLAMTM---NTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 88/363 (24%), Positives = 152/363 (41%), Gaps = 36/363 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT---GCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           + +   IGTP Q   + +DT +D  W  C            +++ A+S++F    C    
Sbjct: 89  HTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRL 148

Query: 154 CK--QVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNS 209
           C+        C    C +   YGS+T    L+ +T +      +     FGC +  +G+ 
Sbjct: 149 CETGSFNTKNCSRNKCIYTYNYGSATTKGELASETFTFGEHRRVSVSLDFGCGKLTSGSL 208

Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR------ 263
               G+LG+    LSL++Q Q      FSYCL  F   + +  +  G +    +      
Sbjct: 209 PGASGILGISPDRLSLVSQLQ---IPRFSYCLTPFLDRNTTSHIFFGAMADLSKYRTTGP 265

Query: 264 IKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
           I+ T L+ NP  S+ YY V L+ I VG + +++P  +         GT +DSG     L 
Sbjct: 266 IQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDTTGMLP 325

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPI-----------VAPTITLMFSGM 369
           +    A+++     V   +   +  G++   C+ +P            V P +     G 
Sbjct: 326 SVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGA 385

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
            + L +D+ ++  +AG + CL +++          +I N QQQN  +L+DV N     A 
Sbjct: 386 AMLLRRDSYMVEVSAGRM-CLVISSGARGA-----IIGNYQQQNMHVLFDVENHEFSFAP 439

Query: 430 ELC 432
             C
Sbjct: 440 TQC 442


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 102/398 (25%), Positives = 166/398 (41%), Gaps = 45/398 (11%)

Query: 59  LEMLAKDQAR--------LQFLSSLAVARKSV--------VPIASGRQITQSP-TYIVRA 101
           +EM+ +D +R         QF        +SV           A+   ITQ+   Y++  
Sbjct: 31  VEMIHRDSSRSPFFRPTETQFQRVANAVHRSVNRANHFHKAHKAAKATITQNDGEYLISY 90

Query: 102 KIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
            +G P   L   +DT +D  W+   PC  C   ++ +F+ ++S T+K L   +  C+ V 
Sbjct: 91  SVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTTCQSVE 150

Query: 159 NPTCGGG---ACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTF-----GCIQKATGN- 208
           + +C       C + + YG  + +  +LS +T++L +       F     GC +  T + 
Sbjct: 151 DTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIGCGRNNTVSF 210

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQS---TFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
                G++GLG G +SL+ Q +    S    FSYCL S   +S   +     +       
Sbjct: 211 EGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNFGDAAVVSGDGTV 270

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            TP++ +  +   YY+ L A  VG   ++    + +F        IIDSGT  T L    
Sbjct: 271 STPIVTHDPK-VFYYLTLEAFSVGNNRIEFTSSSFRFGEK--GNIIIDSGTTLTLLPNDI 327

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHS 382
           Y+ +       V  +     L     CY      + AP I   FSG +V L   N  I  
Sbjct: 328 YSKLESAVADLVELDRVKDPLKQLSLCYRSTFDELNAPVIMAHFSGADVKLNAVNTFIEV 387

Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
             G +TCLA  ++      +  +  NM QQN  + YD+
Sbjct: 388 EQG-VTCLAFISSK-----IGPIFGNMAQQNFLVGYDL 419


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 163/378 (43%), Gaps = 53/378 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLG 148
           Y  R ++G+P +   + +DT +D  WV C+ C GC          T F+   STT   + 
Sbjct: 84  YFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVS 143

Query: 149 CQAAQCK---QVPNPTCGG--GACAFNLTYGSST------IAANLSQDTISLA----TDI 193
           C   +C    Q  +  C      C +   YG  +      +A  +  DT+ L+    + I
Sbjct: 144 CSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQI 203

Query: 194 VPGY----TFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPS 243
              Y    +F C    TG+         G+ G G+  +S+++Q  +Q +    FS+CL  
Sbjct: 204 CQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKG 263

Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
               S  G L LG I +P  I YTPL+ +    +LY   L +I V  + + I P    F 
Sbjct: 264 DD--SGGGVLVLGEIVEPN-IVYTPLVPSQPHYNLY---LQSISVAGQTLAIDPSV--FG 315

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVA 359
            ++  GTI+DSGT    L   AY          V  N   T L   + CY    SV  V 
Sbjct: 316 ASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNAR-TYLSKGNQCYLVTSSVNDVF 374

Query: 360 PTITLMFSGMNVTL--PQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
           P ++L F+G    +  PQD LL  ++ G  ++ C+     P      + ++ ++  ++  
Sbjct: 375 PQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTP---GQQITILGDLVLKDKI 431

Query: 416 ILYDVPNSRLGVARELCT 433
            +YD+ N R+G     C+
Sbjct: 432 FVYDIANQRVGWTNYDCS 449


>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
          Length = 337

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 95/347 (27%), Positives = 155/347 (44%), Gaps = 48/347 (13%)

Query: 112 MAMDT------SNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
           MA DT      +  AA  P   C G +S  F+ ++S+TF  + C +  C+      C  G
Sbjct: 1   MAFDTGLGISLARCAACRPGAPCDGLAS--FDPSRSSTFAPVPCGSPDCRS----GCSSG 54

Query: 166 ACAFNLTYGSSTIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
           +           ++  ++QD ++L     V  +TFGC++ ++G  +   GLL L R S S
Sbjct: 55  STPSCPLTSFPFLSGAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRS 114

Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG----PIGQPKRI-KYTPLLKNPRRSSLY 279
           L ++       TFSYCLP     S  G L +G    P  +  R+    PL+ +P   + Y
Sbjct: 115 LASRLAAGAGGTFSYCLP-LSTTSSHGFLVIGEADVPHNRSARVTAVAPLVYDPAFPNHY 173

Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
            ++L  + +G R + IPP          A  ++D+   +T +    Y  +RD FRR +  
Sbjct: 174 VIDLAGVSLGGRDIPIPP---------HAAMVLDTALPYTYMKPSMYAPLRDAFRRAMAR 224

Query: 340 NLTVTSLGGFDTCYSV-----PIVAPTITLMF---------SGMNVTLPQDNLLIHSTAG 385
                ++G  DTCY+       ++ P + L F          G  + L  D +L  S  G
Sbjct: 225 YPRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGADQMLYMSEPG 284

Query: 386 ---SITCLAMAAAP---DNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
              S+TCLA AA P   D    +  V+  + Q +  +++DV   ++G
Sbjct: 285 NFFSVTCLAFAALPSDGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIG 331


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 103/405 (25%), Positives = 167/405 (41%), Gaps = 33/405 (8%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
           T  + H  SP SPF    P+      L           F        K   P       +
Sbjct: 32  TADLIHRDSPKSPFY--NPMETSSQRLRNAIHRSVNRVF----HFTEKDNTPQPQIDLTS 85

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGC 149
            S  Y++   IGTP   ++   DT +D  W  C  C  C + V   F+   S+T+K++ C
Sbjct: 86  NSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSC 145

Query: 150 QAAQCKQVPNP---TCGGGACAFNLTYG-SSTIAANLSQDTISL-ATDIVP----GYTFG 200
            ++QC  + N    +     C+++L+YG +S    N++ DT++L ++D  P        G
Sbjct: 146 SSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIG 205

Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLG-- 256
           C     G  +    G++GLG G +SL+ Q  +     FSYCL P       +  +  G  
Sbjct: 206 CGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTN 265

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            I     +  TPL+    + + YY+ L +I VG + +     +   + ++    IIDSGT
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDSGT 322

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLP 374
             T L    Y+ + D     + +        G   CYS    +  P IT+ F G +V L 
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLD 382

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
             N  +   +  + C A   +P       ++  N+ Q N  + YD
Sbjct: 383 SSNAFVQ-VSEDLVCFAFRGSPS-----FSIYGNVAQMNFLVGYD 421


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 103/405 (25%), Positives = 167/405 (41%), Gaps = 33/405 (8%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
           T  + H  SP SPF    P+      L           F        K   P       +
Sbjct: 32  TADLIHRDSPKSPFY--NPMETSSQRLRNAIHRSVNRVF----HFTEKDNTPQPQIDLTS 85

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGC 149
            S  Y++   IGTP   ++   DT +D  W  C  C  C + V   F+   S+T+K++ C
Sbjct: 86  NSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSC 145

Query: 150 QAAQCKQVPNP---TCGGGACAFNLTYG-SSTIAANLSQDTISL-ATDIVP----GYTFG 200
            ++QC  + N    +     C+++L+YG +S    N++ DT++L ++D  P        G
Sbjct: 146 SSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIG 205

Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLG-- 256
           C     G  +    G++GLG G +SL+ Q  +     FSYCL P       +  +  G  
Sbjct: 206 CGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTN 265

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
            I     +  TPL+    + + YY+ L +I VG + +     +   + ++    IIDSGT
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDSGT 322

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLP 374
             T L    Y+ + D     + +        G   CYS    +  P IT+ F G +V L 
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLD 382

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
             N  +   +  + C A   +P       ++  N+ Q N  + YD
Sbjct: 383 SSNAFVQ-VSEDLVCFAFRGSPS-----FSIYGNVAQMNFLVGYD 421


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 103/384 (26%), Positives = 164/384 (42%), Gaps = 62/384 (16%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSSTVFNSAQSTT 143
           T +  Y    K+GTP +   + +DT +D  WV C  C        +G   T+++   S+T
Sbjct: 81  TDTGLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASST 140

Query: 144 FKNLGCQAAQCKQVPN---PTCGGGA-CAFNLTY--GSSTIAANLSQ----DTISLATDI 193
              + C  A C        P CG    C +++TY  GSSTI + ++     D ++     
Sbjct: 141 GSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQT 200

Query: 194 VPG---YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSF 244
            P      FGC  +  G+    +    G+LG G  + S+L+Q  T    +  F++CL + 
Sbjct: 201 QPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTI 260

Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
           K     G   +G + QPK +K TPL+ +      Y VNL  I VG   + +P  A  F P
Sbjct: 261 KG---GGIFSIGDVVQPK-VKTTPLVADKPH---YNVNLKTIDVGGTTLQLP--AHIFEP 311

Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVR-DVFRRRVGSNLTVTSLGGFDTCYSVPIVA---- 359
               GTIIDSGT  T L    +  V   VF +    ++T   + GF  C+  P       
Sbjct: 312 GEKKGTIIDSGTTLTYLPELVFKEVMLAVFNKH--QDITFHDVQGF-LCFQYPGSVDDGF 368

Query: 360 PTITLMFSGMNVTLPQDNLLIH--------STAGSITCLAM--AAAPDNVNSVLNVIANM 409
           PTIT  F        +D+L +H        +    + C+     A+       + ++ ++
Sbjct: 369 PTITFHF--------EDDLALHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDL 420

Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
              N  ++YD+ N  +G     C+
Sbjct: 421 VLSNKLVIYDLENRVIGWTDYNCS 444


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 162/376 (43%), Gaps = 48/376 (12%)

Query: 51  PLSWEESVLEMLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           P + E  + ++ A+D+AR  + L SL        P+           Y  + ++GTP + 
Sbjct: 36  PANHEMELSQLKARDEARHGRLLQSLGGVID--FPVDGTFDPFVVGLYYTKLRLGTPPRD 93

Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVP 158
             + +DT +D  WV C  C GC  T         F+   S T   + C   +C    Q  
Sbjct: 94  FYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSS 153

Query: 159 NPTCG--GGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYT----FGCIQKATG 207
           +  C      CA+   YG     S    +++ Q  + + + +VP  T    FGC    TG
Sbjct: 154 DSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213

Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           + V       G+ G G+  +S+++Q  +Q +    FS+CL         G L LG I +P
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENG--GGGILVLGEIVEP 271

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
             + +TPL+ +      Y VNLL+I V  + + I P    F+ + G GTIID+GT    L
Sbjct: 272 NMV-FTPLVPSQPH---YNVNLLSISVNGQALPINPSV--FSTSNGQGTIIDTGTTLAYL 325

Query: 322 VAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTITLMFSGMNVTL--P 374
              AY    +     V  ++  V S G  + CY    SV  + P ++L F+G       P
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKG--NQCYVITTSVGDIFPPVSLNFAGGASMFLNP 383

Query: 375 QDNLLIHSTAGSITCL 390
           QD L+  +   S  C 
Sbjct: 384 QDYLIQQNNVASALCF 399


>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 98/349 (28%), Positives = 149/349 (42%), Gaps = 43/349 (12%)

Query: 108 QTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
           Q   +A+D     +W+ C  C  C    S VF+  +S TF N+        + P      
Sbjct: 109 QNYQLALDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLAN 168

Query: 165 GACAFNLTYGSSTIAAN-LSQDTISLAT---DIVP--GYTFGCIQKATG--NSVPPQGLL 216
           GAC F++ Y  +T A+  L++DT S      D VP     FGC  +     N     G+L
Sbjct: 169 GACGFDIAYRDNTHASGYLARDTFSFPAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGIL 228

Query: 217 GLGRGSL-----SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRI--KY 266
           GLG G       +   Q    +   FSYC P    +S    LR G   P   P  +  + 
Sbjct: 229 GLGMGPAGKPPTAFTKQVLPAHGGRFSYC-PFVPGMSMYSYLRFGSDIPSHPPPNVHRQS 287

Query: 267 TPLLKNPRRSSLYYVNLLAIRVG-RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
           TP+L     S  Y+V L  + VG  R+  + P   + N     G ++D GT  T  +  A
Sbjct: 288 TPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSA 347

Query: 326 YT----AVRDVFRRRVGSNLTVTSLGGFDTCYSVPI----VAPTITLMF-SGMNVTLPQD 376
           Y     AVR   +RR G+++ V      +TC   P     V P++TL F +G  + +  +
Sbjct: 348 YVHIDHAVRQHLQRR-GAHIVVVRG---NTCVQQPAPHHDVLPSMTLHFENGAWLRVMPE 403

Query: 377 NLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           ++ +    G     C    ++ D     L VI   QQ NHR ++D+ ++
Sbjct: 404 HVFMPFVVGGHHYQCFGFVSSTD-----LTVIGARQQVNHRFIFDLHDT 447


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 149/352 (42%), Gaps = 38/352 (10%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPN 159
           IGTP Q     +D + +  W  C+ C+ C      VF    S+TFK   C    CK +P 
Sbjct: 60  IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 119

Query: 160 PTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV-PPQGLLG 217
           P C    CA++   G        ++ DT ++ T       FGC+  +  +++  P G +G
Sbjct: 120 PKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDTMGGPSGFIG 179

Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLK- 271
           LGR   SL+AQ +    + FSYCL          L    S +L   G      +TP +K 
Sbjct: 180 LGRTPWSLVAQMK---LTRFSYCLAPHDTGKNSRLFLGASAKLAGGG-----AWTPFVKT 231

Query: 272 --NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV--FTRLVAPAYT 327
             N   S  Y + L  I+ G   + +P          G  T++    V   + LV   Y 
Sbjct: 232 SPNDGMSQYYPIELEEIKAGDATITMP---------RGRNTVLVQTAVVRVSLLVDSVYQ 282

Query: 328 AVRDVFRRRVGSNLTVTSLGG-FDTCYSVPIV--APTITLMF-SGMNVTLPQDNLLIHST 383
             +      VG+  T T +G  F+ C+    V  AP +   F +G  +T+P  N L    
Sbjct: 283 EFKKAVMASVGAAPTATPVGAPFEVCFPKAGVSGAPDLVFTFQAGAALTVPPANYLFDVG 342

Query: 384 AGSITCLAMAAAPDNVNSV--LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
             ++    M+ A  N+ ++  LN++ + QQ+N  +L+D+    L      C+
Sbjct: 343 NDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 394


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 108/430 (25%), Positives = 179/430 (41%), Gaps = 55/430 (12%)

Query: 45  PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGRQITQSPTYI 98
           P + + PL     + E+ A+D+ R   +  L   R+S V      P+           Y 
Sbjct: 43  PLQRAFPLDEPVELSELRARDRVRHARIL-LGGGRQSSVGGVVDFPVQGSSDPYLVGLYF 101

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQ 150
            + K+G+P     + +DT +D  WV C+ C  C  +         F++  S T  ++ C 
Sbjct: 102 TKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVTCS 161

Query: 151 AAQCKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTI--------SLATDIVPGY 197
              C  V   T         C ++  YG  S  +     DT         SL  +     
Sbjct: 162 DPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPI 221

Query: 198 TFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
            FGC    +G+         G+ G G+G LS+++Q  ++ +    FS+CL      S  G
Sbjct: 222 VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG--SGGG 279

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
              LG I  P  + Y+PLL +      Y +NLL+I V  ++  +P  A  F  +   GTI
Sbjct: 280 VFVLGEILVPGMV-YSPLLPSQPH---YNLNLLSIGVNGQI--LPIDAAVFEASNTRGTI 333

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFS 367
           +D+GT  T LV  AY    +     V   +T+    G + CY    S+  + P ++L F+
Sbjct: 334 VDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNG-EQCYLVSTSISDMFPPVSLNFA 392

Query: 368 GMNVTL--PQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           G    +  PQD L  +      S+ C+    AP+       ++ ++  ++   +YD+   
Sbjct: 393 GGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEE----QTILGDLVLKDKVFVYDLARQ 448

Query: 424 RLGVARELCT 433
           R+G A   C+
Sbjct: 449 RIGWANYDCS 458


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 156/389 (40%), Gaps = 61/389 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTVFNS------AQSTTFKNL 147
           Y +  K GTP QT    +DT +   W+PC     C  C+S   N+        S + K +
Sbjct: 216 YSIDLKFGTPPQTFPFVLDTGSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFV 275

Query: 148 GCQAAQCKQV------------------PNPTCGGGACAFNLTYGSSTIAANLSQDTISL 189
           GC+  +C  V                   N  C     A+ + YG  + A  L  + ++ 
Sbjct: 276 GCRNPKCAWVFGSDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGLGSTAGFLLSENLNF 335

Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKAL 247
               V  +  GC   +  +   P G+ G GRG  SL AQ  NL  + FSYCL S  F   
Sbjct: 336 PAKNVSDFLVGC---SVVSVYQPGGIAGFGRGEESLPAQ-MNL--TRFSYCLLSHQFDES 389

Query: 248 SFSGSLRLGPI--GQPKR---IKYTPLLKNPRRS-----SLYYVNLLAIRVGRRVVDIPP 297
             +  L +     G+ K+   + YT  LKNP        + YY+ L  I VG + V +P 
Sbjct: 390 PENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPR 449

Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSV 355
             L+ +     G I+DSG+  T +  P +  V + F ++V       +    G   C+ +
Sbjct: 450 RMLEPDVNGDGGFIVDSGSTLTFMERPIFDLVAEEFVKQVNYTRARELEKQFGLSPCFVL 509

Query: 356 PIVA-----PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN----- 404
              A     P +   F  G  + LP  N       G + CL + +  D+V          
Sbjct: 510 AGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVS--DDVAGQGGAVGPA 567

Query: 405 -VIANMQQQNHRILYDVPNSRLGVARELC 432
            ++ N QQQN  +  D+ N R G   + C
Sbjct: 568 VILGNYQQQNFYVECDLENERFGFRSQSC 596


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 79/277 (28%), Positives = 127/277 (45%), Gaps = 36/277 (12%)

Query: 48  PSKPLSWEESVLEMLAKDQARLQFL---------SSLAVARKSVVPIASGRQITQSPTYI 98
           P  P++ +  +  +LA D++R             S+   +  + VP+ SG ++ Q+  Y+
Sbjct: 35  PEDPVARDRYLRRLLAADESRANSFQPRRNKDRASASTQSASAEVPLTSGIRL-QTLNYV 93

Query: 99  VRAKIG----TPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
               +G    +PA  L + +DT +D  WV   PC+ C      +F+ A S T+  + C A
Sbjct: 94  TTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNA 153

Query: 152 AQCKQ-------VPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTF 199
           + C          P      GA    C + L YG  + +   L+ DT++L    + G+ F
Sbjct: 154 SACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVF 213

Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           GC     G      GL+GLGR  LSL++QT + Y   FSYCLP+  +   SGSL LG   
Sbjct: 214 GCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGD 273

Query: 260 QPKR-------IKYTPLLKNPRRSSLYYVNLLAIRVG 289
                      + YT ++ +P +   Y++N+    VG
Sbjct: 274 DAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVG 310


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 101/386 (26%), Positives = 162/386 (41%), Gaps = 43/386 (11%)

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS-------ST 134
            +P++SG   T +  Y VR ++GTPAQ  ++  DT +D  WV C G    +       + 
Sbjct: 87  AMPLSSG-AYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPAR 145

Query: 135 VFNSAQSTTFKNLGCQAAQCKQ-VP----NPTCGGGACAFNLTYGSSTIAANL---SQDT 186
           VF +A S ++  + C +  C   VP    N +     CA++  Y   + A  +      T
Sbjct: 146 VFRTAASKSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSAT 205

Query: 187 ISLATDI--------------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQN 231
           I+L++                + G   GC     G S     G+L LG  ++S  ++   
Sbjct: 206 IALSSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAA 265

Query: 232 LYQSTFSYCLPSFKALSFSGS-LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
            +   FSYCL    A   + S L  GP G       TPLL + R +  Y V + A+ V  
Sbjct: 266 RFGGRFSYCLVDHLAPRNATSYLTFGP-GATAPAAQTPLLLDRRMTPFYAVTVDAVYVAG 324

Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
             +DIP      +   GA  I+DSGT  T L  PAY AV     + + + L   ++  F+
Sbjct: 325 EALDIPADVWDVDRNGGA--ILDSGTSLTILATPAYRAVVTALSKHL-AGLPRVTMDPFE 381

Query: 351 TCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
            CY+      +  P + + F+G     P     +   A  + C+ +    +     ++VI
Sbjct: 382 YCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQ---EGSWPGVSVI 438

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N+ QQ H   +D+ +  L      C
Sbjct: 439 GNILQQEHLWEFDLRDRWLRFKHTRC 464


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 93/275 (33%), Positives = 125/275 (45%), Gaps = 34/275 (12%)

Query: 171 LTYGSSTIAAN----LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
           LTYG S  AAN    L+ DT +     VPG  FGC   + G+     G++G+GRG+LSL+
Sbjct: 120 LTYGGS--AANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLI 177

Query: 227 AQTQNLYQSTFSYCLPSFKAL---SFSGSLRLGPIGQP--KRIKYTPLLKNPRRSSLYYV 281
           +Q Q      FSY L + +A    S    +R G    P  KR + TPLL +      YYV
Sbjct: 178 SQLQF---GKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYV 234

Query: 282 NLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
           NL  +RV G R+  IP G          G I+ S T  T L   AY    DV R  V S 
Sbjct: 235 NLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAY----DVVRAAVASR 290

Query: 341 LTVTSLGG-----FDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCL 390
           + + ++ G      D CY+   +A    P +TL+F  G ++ L   N         + CL
Sbjct: 291 IGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECL 350

Query: 391 AMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
            M   P    SVL     + Q    ++YDV   RL
Sbjct: 351 TM--LPSQGGSVL---GTLLQTGTNMIYDVDAGRL 380


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 160/368 (43%), Gaps = 55/368 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    S T++ + C    
Sbjct: 89  YTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC---- 144

Query: 154 CKQVPNPTCGGGA--CAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATG 207
               P+  C G    C ++  Y   S+ +  L +D +S    +++ P    FGC    TG
Sbjct: 145 ---TPDCNCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCENDETG 201

Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
           +  S    G++GLGRG LS++ Q   + +   +FS C          G++ LG I  P+ 
Sbjct: 202 DLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV--GGGAMILGGISPPED 259

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTR 320
           + +T    +P RS  Y +NL  + V  +        LQ NP       GT++DSGT +  
Sbjct: 260 MVFTH--SDPDRSPYYNINLKEMHVAGK-------KLQLNPKVFDGKHGTVLDSGTTYAY 310

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----------SVPIVA---PTITLMF- 366
           L   A+ A +    +   S   +  + G D  Y           V  +A   P + ++F 
Sbjct: 311 LPETAFLAFKRAIMKERNS---LKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFE 367

Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           +G  ++L P++ L  HS      CL + +   N      ++  +  +N  ++YD  NS++
Sbjct: 368 NGHKLSLSPENYLFRHSKVRGAYCLGVFS---NGRDPTTLLGGIFVRNTLVMYDRENSKI 424

Query: 426 GVARELCT 433
           G  +  C+
Sbjct: 425 GFWKTNCS 432


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 107/408 (26%), Positives = 180/408 (44%), Gaps = 36/408 (8%)

Query: 33  TLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
           T  + H  SP SPF  P++  S  + +   + +  +R+   +   +++K     A    +
Sbjct: 32  TADLIHRDSPKSPFYNPTETSS--QRLRNAIHRSVSRVFHFTD--ISQKDASDNAPQIDL 87

Query: 92  T-QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
           T  S  Y++   +GTP   ++   DT +D  W  C  C  C + V   F+   S+T+K++
Sbjct: 88  TSNSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDV 147

Query: 148 GCQAAQCKQVPN-PTCG--GGACAFNLTYGS-STIAANLSQDTISL-ATDIVP----GYT 198
            C ++QC  + N  +C      C+++ +YG  S    N++ DT++L +TD  P       
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207

Query: 199 FGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLG 256
            GC     G  +    G++GLG G++SL+ Q  +     FSYCL P       +  +  G
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFG 267

Query: 257 --PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT-IID 313
              +     +  TPL+   +  + YY+ L +I VG + V  P      +  +G G  IID
Sbjct: 268 TNAVVSGTGVVSTPLIAKSQE-TFYYLTLKSISVGSKEVQYPGS----DSGSGEGNIIID 322

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNV 371
           SGT  T L    Y+ + D     + +        G   CYS    +  P IT+ F G +V
Sbjct: 323 SGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKVPAITMHFDGADV 382

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
            L   N  +   +  + C A   +P       ++  N+ Q N  + YD
Sbjct: 383 NLKPSNCFVQ-ISEDLVCFAFRGSPS-----FSIYGNVAQMNFLVGYD 424


>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
          Length = 520

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 102/376 (27%), Positives = 167/376 (44%), Gaps = 69/376 (18%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTF 144
           Y     +GTP  + L+A+DT +D  WVPC  C+ C+              ++  ++STT 
Sbjct: 102 YYTWVDVGTPNTSFLVALDTGSDLFWVPCD-CIQCAPLSSYHGSLDRDLGIYKPSESTTS 160

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATDIVPGYT---- 198
           ++L C    C      T     C +N+ Y S    ++  L +D + L  D   G+     
Sbjct: 161 RHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHL--DSREGHAPVNA 218

Query: 199 ---FGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSF 249
               GC +K +G     + P GLLGLG   +S+   LA+   L +++FS C   FK    
Sbjct: 219 SVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARA-GLVRNSFSMC---FKKDD- 273

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
           SG +  G  G P + + TP +    +   Y VN+    +G +             T GAG
Sbjct: 274 SGRIFFGDQGVPTQ-QSTPFVPMNGKLQTYAVNVDKYCIGHKC------------TEGAG 320

Query: 310 --TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTI 362
              ++D+GT FT L   AY ++   F +++ ++   +    F+ CYS     +P V PTI
Sbjct: 321 FQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDV-PTI 379

Query: 363 TLMFS------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           TL F+       +N  LP ++        ++ CLA+  +P+ V     +I       + +
Sbjct: 380 TLTFAENKSFQAVNPILPFND---RQGEFAVFCLAVLPSPEPV----GIIGQNFMVGYHV 432

Query: 417 LYDVPNSRLGVARELC 432
           ++D  N +LG  R  C
Sbjct: 433 VFDRENMKLGWYRSEC 448


>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
 gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
          Length = 520

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 102/376 (27%), Positives = 167/376 (44%), Gaps = 69/376 (18%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTF 144
           Y     +GTP  + L+A+DT +D  WVPC  C+ C+              ++  ++STT 
Sbjct: 102 YYTWVDVGTPNTSFLVALDTGSDLFWVPCD-CIQCAPLSSYHGSLDRDLGIYKPSESTTS 160

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATDIVPGYT---- 198
           ++L C    C      T     C +N+ Y S    ++  L +D + L  D   G+     
Sbjct: 161 RHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHL--DSREGHAPVNA 218

Query: 199 ---FGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSF 249
               GC +K +G     + P GLLGLG   +S+   LA+   L +++FS C   FK    
Sbjct: 219 SVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARA-GLVRNSFSMC---FKKDD- 273

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
           SG +  G  G P + + TP +    +   Y VN+    +G +             T GAG
Sbjct: 274 SGRIFFGDQGVPTQ-QSTPFVPMNGKLQTYAVNVDKYCIGHKC------------TEGAG 320

Query: 310 --TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTI 362
              ++D+GT FT L   AY ++   F +++ ++   +    F+ CYS     +P V PTI
Sbjct: 321 FQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDV-PTI 379

Query: 363 TLMFS------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           TL F+       +N  LP ++        ++ CLA+  +P+ V     +I       + +
Sbjct: 380 TLTFAENKSFQAVNPILPFND---RQGEFAVFCLAVLPSPEPV----GIIGQNFMVGYHV 432

Query: 417 LYDVPNSRLGVARELC 432
           ++D  N +LG  R  C
Sbjct: 433 VFDRENMKLGWYRSEC 448


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 107/433 (24%), Positives = 180/433 (41%), Gaps = 34/433 (7%)

Query: 21  LNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK 80
           L PI D      T+++ +  SP SPF   +    +  ++  + +  +R+   S    +  
Sbjct: 19  LVPI-DAAKDGFTVELINRDSPKSPFYNPRETPTQR-IVSAVRRSMSRVHHFSPTKNS-D 75

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFN 137
                A    I+    Y+++  +GTPA  +L   DT +D  W  C  C  C    + +F+
Sbjct: 76  IFTDTAQSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFD 135

Query: 138 SAQSTTFKNLGCQAAQCKQVPN-PTCGGGA---CAFNLTYGS-STIAANLSQDTISLATD 192
              S+T++++ C   QC  +    +C G     C ++ +YG  S  + N++ DTI+L + 
Sbjct: 136 PKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGST 195

Query: 193 -----IVPGYTFGCIQKATGNSVPPQGLLGLGRGS-LSLLAQTQNLYQSTFSYCL-PSFK 245
                ++P    GC     G+       +    G  +SL++Q  +     FSYCL P   
Sbjct: 196 SGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSS 255

Query: 246 ALSFSGSLRLGPIG--QPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
             + S  L  G  G      ++ TPL+ K+P   + Y++ L A+ VG   +  P  +   
Sbjct: 256 NATNSSKLNFGSNGIVSGGGVQSTPLISKDP--DTFYFLTLEAVSVGSERIKFPGSSFG- 312

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAP 360
             T+    IIDSGT  T      ++ +    +  V         G    CYS+   +  P
Sbjct: 313 --TSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDADLKFP 370

Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
           +IT  F G +V L   N  +     S T L  A  P N  ++     N+ Q N  + YD+
Sbjct: 371 SITAHFDGADVKLNPLNTFVQV---SDTVLCFAFNPINSGAIF---GNLAQMNFLVGYDL 424

Query: 421 PNSRLGVARELCT 433
               +      CT
Sbjct: 425 EGKTVSFKPTDCT 437


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 160/372 (43%), Gaps = 43/372 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R ++GTP +   + +DT +D  WV C  C  C  T         F+   S+T   L 
Sbjct: 41  YYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLS 100

Query: 149 CQAAQC---KQVPNPTCGGGA-CAFNLTY--GSSTIAANLSQD-------TISLATDIVP 195
           C  ++C    Q+    C     C ++  Y  GS T+   +S +          +  +   
Sbjct: 101 CIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASA 160

Query: 196 GYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF 249
             TFGC    +G+   P     G+ G G+  LS+++Q  +Q L    FS+CL    A   
Sbjct: 161 KITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEG--ADPG 218

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
            G L LG I +P  + YTP++ +      Y +NL  I V  + + I P    F  T   G
Sbjct: 219 GGILVLGEITEPGMV-YTPIVPSQPH---YNLNLQGIAVNGQQLSIDPQV--FATTNTRG 272

Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVPIVAPTITLMF 366
           TIID GT    L   AY    +     V  +     L G   F T +S+  + P++TL F
Sbjct: 273 TIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVHSIDEIFPSVTLYF 332

Query: 367 SGMNVTL-PQDNLLIHSTAGS--ITCLAMAAAPDNV--NSVLNVIANMQQQNHRILYDVP 421
            G  + L P+D L+   +  S  + C+    +      +S + ++ ++  ++   +YD+ 
Sbjct: 333 EGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDKVFVYDLE 392

Query: 422 NSRLGVARELCT 433
           N R+G     C+
Sbjct: 393 NQRIGWTSFDCS 404


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 106/436 (24%), Positives = 169/436 (38%), Gaps = 94/436 (21%)

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC---------------- 125
            +P++SG   T +  Y VR ++GTPA+  L+  DT +D  WV C                
Sbjct: 93  AMPLSSG-AYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAA 151

Query: 126 --------------TGCVGCSSTVFNSAQSTTFKNLGCQAAQC--------KQVPNPTCG 163
                                + VF   +S T+  + C +  C           P P   
Sbjct: 152 PASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTP--- 208

Query: 164 GGACAFNLTYGSSTIA-ANLSQDTISLATD-----------IVPGYTFGCIQKATGNS-V 210
           G  CA++  Y   + A   +  D+ ++A              + G   GC    TG+S +
Sbjct: 209 GSPCAYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFL 268

Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGP-----IGQPKRI 264
              G+L LG  ++S  ++    +   FSYCL    A   + S L  GP        P + 
Sbjct: 269 ASDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKT 328

Query: 265 ------------------KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
                             + TPLL + R    Y V +  I V   ++ IP   L ++   
Sbjct: 329 ACAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIP--RLVWDVAK 386

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS---------VPI 357
           G G I+DSGT  T LV+PAY AV     +++ + L   ++  FD CY+         + +
Sbjct: 387 GGGAILDSGTSLTVLVSPAYRAVVAALNKKL-AGLPRVTMDPFDYCYNWTSPSTGEDLTV 445

Query: 358 VAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
             P + + F+G     P     +   A  + C+ +    +     ++VI N+ QQ H   
Sbjct: 446 AMPELAVHFAGSARLQPPAKSYVIDAAPGVKCIGLQ---EGEWPGVSVIGNILQQEHLWE 502

Query: 418 YDVPNSRLGVARELCT 433
           +D+ N RL   R  CT
Sbjct: 503 FDLKNRRLRFKRSRCT 518


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/369 (25%), Positives = 166/369 (44%), Gaps = 41/369 (11%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---------CVGCSSTVFNSAQSTTFKNLG 148
           +V   IGTP Q   + +DT +  +W+ C                +  F+ + S++F  L 
Sbjct: 67  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLP 126

Query: 149 CQAAQCK-QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI-VPGYTFG 200
           C    CK ++P+   PT C     C ++  Y   T+A  NL ++  + +  +  P    G
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILG 186

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
           C Q +T N    +G+LG+  G LS ++Q +    S FSYC+PS    + +G   LG    
Sbjct: 187 CAQASTEN----RGILGMNHGRLSFISQAK---ISKFSYCVPSRTGSNPTGLFYLGDNPN 239

Query: 261 PKRIKYTPLLKNPRRSS-------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
             + KY  +L  P   S        Y + + AI++  + ++IPP A + +      T+ID
Sbjct: 240 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 299

Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVPIVAPT------ITLM 365
           SG+  T LV  AY  V++   R VG+ +    +     D C+   + A        I+  
Sbjct: 300 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 359

Query: 366 F-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
           F +G+ + + +   ++      + C+ +  + + +    N+I  + QQN  + YD+ N R
Sbjct: 360 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS-ERLGIGSNIIGTVHQQNMWVEYDLANKR 418

Query: 425 LGVARELCT 433
           +G     C+
Sbjct: 419 VGFGGAECS 427


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 96/301 (31%), Positives = 133/301 (44%), Gaps = 43/301 (14%)

Query: 61  MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
           +   DQ RL+ +    V+     PI+    I     Y  R  +GTP Q   + +DT ++ 
Sbjct: 9   LRKHDQRRLRRMLPEVVS----FPISGDNDIFAMGLYYTRISLGTPPQQFYVDVDTGSNV 64

Query: 121 AWVPCTGCVGCSS--------TVFNSAQSTTFKNLGCQAAQCKQVPNP-TCGGG--ACAF 169
           AWV C  C GC          + F+  +STT  ++ C  A+C  +     C     +C +
Sbjct: 65  AWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQCSPERLSCPY 124

Query: 170 NLTYGSSTIAANLSQDTI----------SLATDIVPGYTFGCIQKATGNSVPPQGLLGLG 219
           +L YG  +  A    + +          S A        FGC    TG S    GLLG G
Sbjct: 125 SLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTG-SWSVDGLLGFG 183

Query: 220 RGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRS 276
             ++SL   LAQ QN+  + F++CL     +S  GSL +G I +P  + YTP++      
Sbjct: 184 PTTVSLPNQLAQ-QNISVNIFAHCLQG--DVSGRGSLVIGTIREPDLV-YTPMVFGEDH- 238

Query: 277 SLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRR 336
             Y V LL I +  R V  P     F+     G IIDSGT  T LV PAY    D FRR 
Sbjct: 239 --YNVQLLNIGISGRNVTTPA---SFDLEYTGGVIIDSGTTLTYLVQPAY----DEFRRG 289

Query: 337 V 337
           V
Sbjct: 290 V 290


>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
          Length = 382

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 99/206 (48%), Gaps = 16/206 (7%)

Query: 237 FSYCLPSFKALSFSGSLRLGPIG----QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
           FSYCL S    + + SL  G +      P +I  TPL++NP   S YY+ L  I VG  +
Sbjct: 180 FSYCLTSIHE-NKTSSLLFGSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTL 238

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
           + IP  A Q       G I+DSGT  T L   A+  +++ F  +    +  +S  G D C
Sbjct: 239 LPIPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKNAFISQTELQVANSSTTGLDLC 298

Query: 353 YSVP------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           + +P      +  P +   F G+++ LP +N ++      + CLA+ A        L++ 
Sbjct: 299 FHLPVKNAAEVKVPKLIFHFKGLDLALPVENYMVSDPEMGLICLAIDAT-----GSLSIF 353

Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
            N+QQQN  +L+D+  S L +    C
Sbjct: 354 GNIQQQNMLVLHDLKKSTLSLVPTQC 379


>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
          Length = 761

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 91/358 (25%), Positives = 152/358 (42%), Gaps = 82/358 (22%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
           V   +G+P QT+ M +DT ++ +W+ C       S VF+  +S+++  + C +  C+   
Sbjct: 377 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS-VFDPLRSSSYSPIPCTSPTCR--- 432

Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
                        T+  +T                                    GL+G+
Sbjct: 433 -----------TRTHSKTT------------------------------------GLIGM 445

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKN---- 272
            RGSLS + Q   +    FSYC+    +   SG L  G       K +KYTPL++     
Sbjct: 446 NRGSLSFVTQ---MGLQKFSYCISGQDS---SGILLFGESSFSWLKALKYTPLVQISTPL 499

Query: 273 PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           P    + Y V L  I+V   ++ +P      + T    T++DSGT FT L+ P YTA+++
Sbjct: 500 PYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN 559

Query: 332 VFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLL 379
            F R+  ++L V         G  D CY VP+        PT+TLMF G  +++  + L+
Sbjct: 560 EFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERLM 619

Query: 380 -----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                +   + S+ C     + + +     +I +  QQN  + +D+  SR+G A   C
Sbjct: 620 YRVPGVIRGSDSVYCFTFGNS-ELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 676


>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
          Length = 468

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 137/343 (39%), Gaps = 42/343 (12%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV 157
           I  P     M++DTS D  W+ C       C    + +F+  +S T   + C +A C ++
Sbjct: 155 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 214

Query: 158 PNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN-SVPPQGLL 216
                          YG   +   +                        GN S    G +
Sbjct: 215 GR-------------YGRWLLQQPVPVLRRLRRRQGQ--PRGRTCHAVRGNFSASTSGTM 259

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRR- 275
            LG G  SLL+QT   + + FSYC+P   +  F         G   R   TPL++NP   
Sbjct: 260 SLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSII 319

Query: 276 SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
            +LY V L  I VG R +++PP           G ++DS  + T+L   AY A+R  FR 
Sbjct: 320 PTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRLAFRS 373

Query: 336 RVGSNLTVTS-LGGFDTCYS----VPIVAPTITLMFSGMNVT-LPQDNLLIHSTAGSITC 389
            + +   V     G DTCY       +  P ++L+F G  V  L    +++        C
Sbjct: 374 AMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVEG------C 427

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LA    P +    L  I N+QQQ H +LYDV    +G  R  C
Sbjct: 428 LAFVPTPGDF--ALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 92/357 (25%), Positives = 152/357 (42%), Gaps = 31/357 (8%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTP   +    DT +D  W   +PCT C    + +F+   S+++ N+ C    
Sbjct: 60  YLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTES 119

Query: 154 CKQVPNPTCGGG--ACAFNLTYGSSTIAAN-LSQDTISLATDI-----VPGYTFGCIQKA 205
           C ++ +  C      C +  +Y  ++I    L+Q+T++L +         G  FGC    
Sbjct: 120 CNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNN 179

Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQS---TFSYCLPSFKAL-SFSGSLRLGPIGQ- 260
           +G +    GL+GLGRG LSL++Q  +   +    FS CL  F    S +  +  G   + 
Sbjct: 180 SGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSEV 239

Query: 261 -PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP-PGALQFNPTTGAGTIIDSGTVF 318
                  TPL+   +  + Y+  LL I V    +++P          T    +IDSGT  
Sbjct: 240 LGNGTVSTPLIS--KDGTGYFATLLGISV--EDINLPFSNGSSLGTITKGNILIDSGTTI 295

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP--IVAPTITLMFSGMNVTLPQD 376
           T L    Y  + +  R +V   L    + G++ CY  P  +  PT+T+ F G +V L   
Sbjct: 296 TYLPEEFYHRLIEQVRNKVA--LEPFRIDGYELCYQTPTNLNGPTLTIHFEGGDVLLTPA 353

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            + I     +  C A+     + N       N  Q N+ I +D+    +      CT
Sbjct: 354 QMFIPVQDDNF-CFAVF----DTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATDCT 405


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 108/430 (25%), Positives = 189/430 (43%), Gaps = 58/430 (13%)

Query: 45  PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT----YIVR 100
           P + + PL+ +  +  + A+D+AR   +    V    VV  +   Q T  P     Y  +
Sbjct: 31  PLERAIPLNQQVELEALRARDRARHGRILQGVVG--GVVDFS--VQGTSDPYFVGLYFTK 86

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAA 152
            K+G+PA+   + +DT +D  W+ C  C  C  +         F++A S+T   + C   
Sbjct: 87  VKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCADP 146

Query: 153 QCK---QVPNPTCGGGA--CAFNLTYGSST------IAANLSQDTISLATDIVPGYT--- 198
            C    Q     C   A  C++   YG  +      ++  +  DT+ L   +V   +   
Sbjct: 147 ICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTI 206

Query: 199 -FGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
            FGC    +G+         G+ G G G+LS+++Q  ++ +    FS+CL   +  +  G
Sbjct: 207 VFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE--NGGG 264

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            L LG I +P  I Y+PL+ +      Y +NL +I V  ++  +P  +  F  T   GTI
Sbjct: 265 VLVLGEILEPS-IVYSPLVPSLPH---YNLNLQSIAVNGQL--LPIDSNVFATTNNQGTI 318

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMF- 366
           +DSGT    LV  AY    D     V S  +   +   + CY    SV  + P ++L F 
Sbjct: 319 VDSGTTLAYLVQEAYNPFVDAITAAV-SQFSKPIISKGNQCYLVSNSVGDIFPQVSLNFM 377

Query: 367 SGMNVTLPQDNLLIHS---TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            G ++ L  ++ L+H     + ++ C+        V     ++ ++  ++   +YD+ N 
Sbjct: 378 GGASMVLNPEHYLMHYGFLDSAAMWCIGF----QKVERGFTILGDLVLKDKIFVYDLANQ 433

Query: 424 RLGVARELCT 433
           R+G A   C+
Sbjct: 434 RIGWADYNCS 443


>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 127/452 (28%), Positives = 189/452 (41%), Gaps = 69/452 (15%)

Query: 48  PSKPLSWEESVLEMLAKDQ-ARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTP 106
           P  P + +   L  LA+   AR   L      + +  P+ +         Y     +GTP
Sbjct: 36  PLPPAAAQHHPLSRLARASLARASRLRGHHQGQAASSPVRAALYPHSYGGYAFSLSLGTP 95

Query: 107 AQTLLMAMDTSNDAAWVPCTG---CVGCSST-----VFNSAQSTT------------FKN 146
            Q L + +DT +   WVPCT    C  CS+      VF+   S++            + +
Sbjct: 96  PQPLPVLLDTGSHLTWVPCTSNYQCQNCSAAAGSFPVFHPKSSSSSLLVSCSSPSCLWIH 155

Query: 147 LGCQAAQCKQVPNP----TCGGGACAFN------LTYGSSTIAANLSQDTISLATDIVPG 196
                + C +   P    T    A A N      + YGS + A  L  DT+ L+      
Sbjct: 156 SKSHLSDCARDSAPCRPSTANCSATATNVCPPYLVVYGSGSTAGLLVSDTLRLSPRGAAS 215

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSL 253
             F           PP GL G GRG+ S+ AQ   L  + FSYCL S +     + SG L
Sbjct: 216 RNFAVGCSLASVHQPPSGLAGFGRGAPSVPAQ---LGVNKFSYCLLSRRFDDDAAISGEL 272

Query: 254 RLGP--IGQPK-RIKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVDIPPGALQ-FNPT 305
            LG    G+ K  ++Y PLLKN    P  S  YY++L  I VG + V +P  AL   +  
Sbjct: 273 VLGASSAGKAKAMMQYAPLLKNAGARPPYSVYYYLSLTGIAVGGKSVALPARALAPVSGG 332

Query: 306 TGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-- 359
            G G IIDSGT FT L      P   A+      R   +  V    G   C+++P  A  
Sbjct: 333 GGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSKDVEGALGLRPCFALPAGART 392

Query: 360 ---PTITLMFS-GMNVTLPQDNLLIHS-----TAGSITCLAMAAAPDNVNSVLN------ 404
              P ++L FS G  + LP +N  + +      A    CLA+ +   + +          
Sbjct: 393 MDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVVSDVSSASGGAGVSGGGG 452

Query: 405 ---VIANMQQQNHRILYDVPNSRLGVARELCT 433
              ++ + QQQN+++ YD+  +RLG  ++ C+
Sbjct: 453 PAIILGSFQQQNYQVEYDLEKNRLGFRQQPCS 484


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 101/413 (24%), Positives = 164/413 (39%), Gaps = 44/413 (10%)

Query: 51  PLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPI-ASGRQITQSPTYIVRAKIGTPAQT 109
           P++ E+ +  M     AR ++L +  V             Q  ++  + V   +G P   
Sbjct: 21  PVTPEDHIQHMTDISSARFKYLQNSIVKELGSSDFQVDVHQAIKTSLFFVNFSVGQPPVP 80

Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST-----VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
               MDT +   W+ C  C  CSS      VFN A S+TF    C    C+  PN  C  
Sbjct: 81  QFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCSS 140

Query: 165 GACAFNLTYGSSTIAAN-LSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ--GLL 216
             C +   Y S T +   L+++ ++  T      +     FGC  +  G  +  +  G+L
Sbjct: 141 NKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHE-NGEQLESEFTGIL 199

Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-GSLRLGP----IGQPKRIKYTPLLK 271
           GLG    SL  Q      S FSYC+      ++    L LG     +G P  I++     
Sbjct: 200 GLGAKPTSLAVQL----GSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFET--- 252

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
               + +YY+NL  I VG + ++I P   +    +  G I+D+GT++T L   AY  + +
Sbjct: 253 ---ENGIYYMNLEGISVGDKQLNIEPVVFK-RRGSRTGVILDTGTLYTWLADIAYRELYN 308

Query: 332 VFRRRVGSNLTVTSLGGFDTCY-----SVPIVAPTITLMFSG-----MNVTLPQDNLLIH 381
             +  +   L       F  CY        I  P +T  F+G     M  T     +   
Sbjct: 309 EIKSILDPKLERFWFRDF-LCYHGRVNEELIGFPVVTFHFAGGAELAMEATSMFYPMTES 367

Query: 382 STAGSITCLAMAAAPDNVNSV--LNVIANMQQQNHRILYDVPNSRLGVARELC 432
            T  ++ C+++    ++         I  M QQ + I YD+    + + R  C
Sbjct: 368 DTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERNIYLQRIDC 420


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 94/368 (25%), Positives = 161/368 (43%), Gaps = 55/368 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    S+T+K + C    
Sbjct: 88  YTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQC---- 143

Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKA 205
                NP+C     G  C +   Y   S+ +  L++D +S    +++ P    FGC    
Sbjct: 144 -----NPSCNCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETVE 198

Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           TG   S    G++GLGRG LS++ Q   + +  ++FS C      +   G++ LG I  P
Sbjct: 199 TGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVV--GGAMVLGNIPPP 256

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT---GAGTIIDSGTV 317
             + +     +P RS+ Y + L  + V G+R        L+ NP       GT++DSGT 
Sbjct: 257 PDMVFA--HSDPYRSAYYNIELKELHVAGKR--------LKLNPRVFDGKHGTVLDSGTT 306

Query: 318 FTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS--------VPIVAPTITLMF- 366
           +  L   A+ A +D   + +     +        D C+S        +  + P + ++F 
Sbjct: 307 YAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVFG 366

Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           +G  ++L P++ L  H+      CL +     +  ++L  I     +N  + YD  N ++
Sbjct: 367 NGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIV---VRNTLVTYDRDNDKI 423

Query: 426 GVARELCT 433
           G  +  C+
Sbjct: 424 GFWKTNCS 431


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 160/372 (43%), Gaps = 48/372 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R ++G+P +   + +DT +D  WV C+ C GC  +         F+   S T   + 
Sbjct: 90  YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149

Query: 149 CQAAQCK---QVPNPTCGG--GACAFNLTYGSST------IAANLSQDTI---SLATDIV 194
           C   +C    Q  +  C      C +   YG  +      ++  L  DTI   S+  +  
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209

Query: 195 PGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
               FGC    TG+   P     G+ G G+  +S+++Q  +Q +    FS+CL      S
Sbjct: 210 APIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDD--S 267

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
             G L LG I +P  I YTPL+ +      Y +NL +I V  + + I P    F  ++  
Sbjct: 268 GGGILVLGEIVEPN-IVYTPLVPSQPH---YNLNLQSIYVNGQTLAIDPSV--FATSSNQ 321

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITL 364
           GTIIDSGT    L   AY          V  +++   L   + CY    S+  V P ++L
Sbjct: 322 GTIIDSGTTLAYLTEAAYDPFISAITSTVSPSVS-PYLSKGNQCYLTSSSINDVFPQVSL 380

Query: 365 MFSGMN--VTLPQDNLLIHST--AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            F+G    + +PQD L+  S+    ++ C+            + ++ ++  ++   +YD+
Sbjct: 381 NFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQ---GQEITILGDLVLKDKIFVYDI 437

Query: 421 PNSRLGVARELC 432
              R+G A   C
Sbjct: 438 AGQRIGWANYDC 449


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 151/363 (41%), Gaps = 48/363 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y  R KIGTP     + +DT +   +VPC+ C  C +     F+ A S+++K L C +  
Sbjct: 35  YTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGSE- 93

Query: 154 CKQVPNPTCGGGACAFNLTY-----GSSTIAANLSQDTISLATDIVPG---YTFGCIQKA 205
                   C  G C  +  Y       ST +  L +D I  +     G     FGC    
Sbjct: 94  --------CSTGFCDGSRKYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGCETAE 145

Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           TG+       G++GLGRG LS++ Q   +N  +  FS C          G++ LG    P
Sbjct: 146 TGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMD--EGGGAMILGGFQPP 203

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
           K + +T    +P RS  Y + L  IRVG   + + P           GT++DSGT +   
Sbjct: 204 KDMVFTA--SDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGK----YGTVLDSGTTYAYF 257

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYS--------VPIVAPTITLMF-SGMN 370
              A+ A +   + +VGS   V        D CY+        +    P++  +F  G +
Sbjct: 258 PGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQS 317

Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           VTL P++ L  H+      CL +    D    +  +I     +N  + Y+   + +G  +
Sbjct: 318 VTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIV----RNMLVTYNRGKASIGFLK 373

Query: 430 ELC 432
             C
Sbjct: 374 TKC 376


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 112/445 (25%), Positives = 182/445 (40%), Gaps = 69/445 (15%)

Query: 31  SSTLQVFHVFSPCS----PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR----KSV 82
           S T  + H++SP      PF  S P   +E  L+  A       F+ S  + +    + +
Sbjct: 57  SFTFNIHHLYSPAVRQILPFH-SFP---DEGTLDYYAAMVRTDHFVHSRRLGQVQDHRPL 112

Query: 83  VPIASGRQITQSPT---YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------ 133
             ++    +  SP    Y     +GTP    L+A+DT +D  W+PC  CV C +      
Sbjct: 113 TFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQ 171

Query: 134 -----TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDT 186
                 +++   S+T K + C ++ C  +   +     C + ++Y S   ++   L +D 
Sbjct: 172 GPVNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDI 231

Query: 187 ISLATDIVPG------YTFGCIQKATG---NSVPPQGLLGLGRGSLSLLAQTQN--LYQS 235
           + L T+ V         T GC +  +G   +S  P GL GLG  ++S+ +   N  L  +
Sbjct: 232 LHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISN 291

Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           +FS C    +     G +  G  G P + + TP     RR   Y V++  I VG  + D+
Sbjct: 292 SFSLCFGPARM----GRIEFGDKGSPGQNE-TPFNLG-RRHPTYNVSITQIGVGGHISDL 345

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYS 354
                          I DSGT FT L  PAY+   D F   V     T+ S   F+ CY 
Sbjct: 346 -----------DVAVIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYE 394

Query: 355 VPIVAPTITL------MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
           +     T T       M  G +  +    +LI + +  + CLA+A +       +N+I  
Sbjct: 395 LSPNQTTFTYPLMNLTMKGGGHFVINHPIVLISTESKRLFCLAIARS-----DSINIIGQ 449

Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
                + I++D     LG     CT
Sbjct: 450 NFMTGYHIVFDREKMVLGWKESNCT 474


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 104/428 (24%), Positives = 173/428 (40%), Gaps = 53/428 (12%)

Query: 13  FLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQF 71
           FLF L E    +   +    ++ + H  SP SPF  PSK  +  E + +   +  +R+  
Sbjct: 17  FLFQLLE----VALARGGGFSVDLIHRDSPHSPFFDPSK--TQAERLTDAFRRSVSRVGR 70

Query: 72  LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
               A+    +      R +  +  Y++   IGTP   ++  +DT +D  W  C  C  C
Sbjct: 71  FRPTAMTSDGI----QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHC 126

Query: 132 SSTV---FNSAQSTTFKNLGCQAAQCKQV-PNPTCGG-GACAFNLTYGSSTI-AANLSQD 185
              V   F+   S+T+++  C  + C  +  + +C     C F  +Y   +    NL+ +
Sbjct: 127 YKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASE 186

Query: 186 TISLATDI-----VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
           T+++ +        PG+ FGC   + G       G++GLG G LSL++Q ++     FSY
Sbjct: 187 TLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSY 246

Query: 240 C-LPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
           C LP     S S  +  G  G   R+     +  P R        L  +   +  ++  G
Sbjct: 247 CLLPVSTDSSISSRINFGASG---RVSGYGTVSTPLR--------LPYKGYSKKTEVEEG 295

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVP 356
            +          I+DSGT +T L    Y+ +       +         G F  CY  +  
Sbjct: 296 NI----------IVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAE 345

Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           I AP IT  F   NV L   N  +      + C  +A   D     + V+ N+ Q N  +
Sbjct: 346 INAPIITAHFKDANVELQPLNTFMRMQE-DLVCFTVAPTSD-----IGVLGNLAQVNFLV 399

Query: 417 LYDVPNSR 424
            +D+   R
Sbjct: 400 GFDLRKKR 407


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 100/378 (26%), Positives = 164/378 (43%), Gaps = 47/378 (12%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTT 143
           +Q   Y  + K+GTP + L + +DT +D  WV C  C GC  T         F+   S+T
Sbjct: 72  SQVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSST 131

Query: 144 FKNLGCQAAQCK---QVPNPTCGG--GACAFNLTYGSSTIAANLSQDTI---------SL 189
              + C   +C+   Q  + +C G    C +   YG  +  +      +         +L
Sbjct: 132 SSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTL 191

Query: 190 ATDIVPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPS 243
            T+      FGC    TG+    +    G+ G G+  +S+++Q  +Q +    FS+CL  
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251

Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
               S  G L LG I +P  I Y+PL+ +      Y +NL +I V  ++V I P    F 
Sbjct: 252 DN--SGGGVLVLGEIVEPN-IVYSPLVPSQPH---YNLNLQSISVNGQIVRIAPSV--FA 303

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYT----AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA 359
            +   GTI+DSGT    L   AY     A+  V  + V S L+  +     T  S   + 
Sbjct: 304 TSNNRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQCYLITTSSNVDIF 363

Query: 360 PTITLMFSGMN--VTLPQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
           P ++L F+G    V  PQD L+  +    GS+ C+            + ++ ++  ++  
Sbjct: 364 PQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQKIS---GQSITILGDLVLKDKI 420

Query: 416 ILYDVPNSRLGVARELCT 433
            +YD+   R+G A   C+
Sbjct: 421 FVYDLAGQRIGWANYDCS 438


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 155/368 (42%), Gaps = 37/368 (10%)

Query: 91  ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
           I+    Y++   +GTP   +L   DT +D  W  C  C  C   V   F+  +S T+K L
Sbjct: 88  ISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTL 147

Query: 148 GCQAAQCKQVPNP-TCGG-GACAFNLTYGS-STIAANLSQDTISLATDI-----VPGYTF 199
            C    C+ +    +C     C ++ +YG  S    +LS DT+++ +        PG  F
Sbjct: 148 DCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAF 207

Query: 200 GCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP 257
           GC     G  +    GL+GLG G LSL+ Q  +     FSYCL P     + S  +  G 
Sbjct: 208 GCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGK 267

Query: 258 IG--QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP--------PGALQFNPTTG 307
            G         TPL+K     + YY+ L  + VG   V           P A++      
Sbjct: 268 SGVVSGSGTVSTPLIKG-TPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVE-----E 321

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLM 365
              IIDSGT  T L    YT V       +G   T    G F  CYS    +  PTIT  
Sbjct: 322 GNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYSSVNNLEIPTITAH 381

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           F+G +V LP  N  +      + C +M  +     S L +  N+ Q N  + YD+ N+++
Sbjct: 382 FTGADVQLPPLNTFVQ-VQEDLVCFSMIPS-----SNLAIFGNLAQINFLVGYDLKNNKV 435

Query: 426 GVARELCT 433
              +  CT
Sbjct: 436 SFKQTDCT 443


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 111/469 (23%), Positives = 176/469 (37%), Gaps = 112/469 (23%)

Query: 65  DQARLQFLSSLAVARKS----------------------VVPIASGRQITQSPTYIVRAK 102
           DQ R  F+SS A  R +                       +P++SG   T +  Y VR +
Sbjct: 2   DQERTAFISSHARRRATEAGRAKPKPKAKAKAAPADEAFAMPLSSG-AYTGTGQYFVRFR 60

Query: 103 IGTPAQTLLMAMDTSNDAAWVPC-------------------------------TGCVGC 131
           +GTPA+  L+  DT +D  WV C                               +     
Sbjct: 61  VGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSVSAAASS 120

Query: 132 SSTVFNSAQSTTFKNLGCQAAQC--------KQVPNPTCGGGACAFNLTYGSSTIA-ANL 182
            + VF   +S T+  + C +  C           P P   G  CA+   Y   + A   +
Sbjct: 121 PARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTP---GSPCAYEYRYKDGSAARGTV 177

Query: 183 SQDTISLATD-----------IVPGYTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQ 230
             D+ ++A              + G   GC    TG S +   G+L LG  ++S  ++  
Sbjct: 178 GTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFASRAA 237

Query: 231 NLYQSTFSYCL------------------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKN 272
             +   FSYCL                  P+  + S S +   G    P   + TPLL +
Sbjct: 238 ARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPG-ARQTPLLLD 296

Query: 273 PRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
            R    Y V +  + V   ++ IP   L ++   G G I+DSGT  T LV+PAY AV   
Sbjct: 297 HRMRPFYAVAVNGVSVDGELLRIP--RLVWDVQKGGGAILDSGTSLTVLVSPAYRAVVAA 354

Query: 333 FRRRVGSNLTVTSLGGFDTCYS---------VPIVAPTITLMFSGMNVTLPQDNLLIHST 383
             +++   L   ++  FD CY+         + +  P + + F+G     P     +   
Sbjct: 355 LGKKL-VGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDA 413

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           A  + C+ +    +     ++VI N+ QQ H   +D+ N RL   R  C
Sbjct: 414 APGVKCIGLQ---EGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 165/379 (43%), Gaps = 52/379 (13%)

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTF 144
           Q   Y  + ++GTP     + +DT +D  WV C  C GC  T         F+   S+T 
Sbjct: 74  QVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTS 133

Query: 145 KNLGCQAAQC---KQVPNPTCG--GGACAFNLTYGSST------IAANLSQDTI---SLA 190
             + C   +C   KQ  + TC      C++   YG  +      ++  +  +TI   S+ 
Sbjct: 134 SMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMT 193

Query: 191 TDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSF 244
           T+      FGC  + TG+         G+ G G+  +S+++Q  +Q +    FS+CL   
Sbjct: 194 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGD 253

Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
              S  G L LG I +P  I YT L+   P     Y +NL +I V  + + I      F 
Sbjct: 254 S--SGGGILVLGEIVEPN-IVYTSLVPAQPH----YNLNLQSISVNGQTLQIDSSV--FA 304

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCY----SVPIV 358
            +   GTI+DSGT    L   AY          +  ++ TV S G  + CY    SV  V
Sbjct: 305 TSNSRGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRG--NQCYLITSSVTDV 362

Query: 359 APTITLMFSG--MNVTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNH 414
            P ++L F+G    +  PQD L+  ++ G  ++ C+            + ++ ++  ++ 
Sbjct: 363 FPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQ---GQGITILGDLVLKDK 419

Query: 415 RILYDVPNSRLGVARELCT 433
            ++YD+   R+G A   C+
Sbjct: 420 IVVYDLAGQRIGWANYDCS 438


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 68/225 (30%), Positives = 111/225 (49%), Gaps = 22/225 (9%)

Query: 32  STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---------LAVARKSV 82
           S+L+V H+   CS    +K    +    E+L +D+AR++ + S         ++ A+ + 
Sbjct: 63  SSLRVVHMHGACSHLSSNKDARLDHD--EILRRDEARVESIHSKLSKNIADEVSKAKSTK 120

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-CSSTV---FNS 138
           +P  +G  I  SP YIV   IGTP   + +  DT +D  W  C  C+G C S     FN 
Sbjct: 121 LPAKNG-IILGSPNYIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 179

Query: 139 AQSTTFKNLGCQAAQCKQVPNP-TCGGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVP 195
           + S+++ N+ C +  C    NP +C    C + + YG  ++    L+++  +L  +D++ 
Sbjct: 180 SSSSSYHNVSCSSPMC---GNPESCSASNCLYGIGYGDGSVTVGFLAKEKFTLTNSDVLD 236

Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
              FGC +   G  +   G+LGLG G  S   QT   Y + FSYC
Sbjct: 237 DIYFGCGENNKGVFIGSAGILGLGPGKFSFPLQTTTTYNNIFSYC 281


>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 524

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 110/379 (29%), Positives = 160/379 (42%), Gaps = 71/379 (18%)

Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCS-----STVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
           AQT  MA+DT+ D  W+ C  C         + +F+  +S +   + C +  C+ + N  
Sbjct: 164 AQT--MAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFSAAAVPCGSRACRALGNYG 221

Query: 162 CG-------------------GGACAFNLTYGSSTIAANLSQD---TISLATDIVPGYTF 199
            G                    G C + + Y    +++        TIS  T  +  + F
Sbjct: 222 NGCSNNSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSGTYMTDILTISPGTSFL-NFRF 280

Query: 200 GCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA---LSFSGSLRL 255
           GC     G+ S    G + LG G  SLL+QT   Y + FSYC+P   A   LS  G++  
Sbjct: 281 GCSHGVRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAFSYCVPKPSASGFLSLGGAIND 340

Query: 256 GPIGQPKRIKY--TPLLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
           G         +  TPL++N R    + Y V L  I V  R +++PP           GT+
Sbjct: 341 GDSDSDSPSSFVTTPLMRNARIVNPTYYVVRLQGIDVAGRRLNVPPVVFS------GGTL 394

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRR---------RVGSNLTVTSLGG---FDTCYSVP--- 356
           +DS  V T+L   AY A+R  FR          R GS  + T  GG    DTCY      
Sbjct: 395 MDSSAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGST-SSTPAGGEMILDTCYDFEGLD 453

Query: 357 -IVAPTITLMFSGMNVT--LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
            +  PT++L+F G  V    P   +++        CLA    P + +  L  I N+QQQ 
Sbjct: 454 NVTVPTVSLVFFGGAVVDLDPTTAVMMEG------CLAFVPTPADFD--LGFIGNVQQQT 505

Query: 414 HRILYDVPNSRLGVARELC 432
           H +LYDV    +G  R  C
Sbjct: 506 HEVLYDVGARNVGFRRGAC 524


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 107/430 (24%), Positives = 179/430 (41%), Gaps = 55/430 (12%)

Query: 45  PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGRQITQSPTYI 98
           P + + PL     + E+ A+D+ R   +  L   R+S V      P+           Y 
Sbjct: 43  PLQRAFPLDELVELSELRARDRVRHARIL-LGGGRQSSVGGVVDFPVQGSSDPYLVGLYF 101

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQ 150
            + K+G+P     + +DT +D  WV C+ C  C  +         F++  S T  ++ C 
Sbjct: 102 TKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCS 161

Query: 151 AAQCKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTI--------SLATDIVPGY 197
              C  V   T         C ++  YG  S  +     DT         SL  +     
Sbjct: 162 DPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPI 221

Query: 198 TFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
            FGC    +G+         G+ G G+G LS+++Q  ++ +    FS+CL      S  G
Sbjct: 222 VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG--SGGG 279

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
              LG I  P  + Y+PL+ +      Y +NLL+I V  ++  +P  A  F  +   GTI
Sbjct: 280 VFVLGEILVPGMV-YSPLVPSQPH---YNLNLLSIGVNGQM--LPLDAAVFEASNTRGTI 333

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFS 367
           +D+GT  T LV  AY    +     V S L    +   + CY    S+  + P+++L F+
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFA 392

Query: 368 GMNVTL--PQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           G    +  PQD L  +      S+ C+    AP+       ++ ++  ++   +YD+   
Sbjct: 393 GGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE----QTILGDLVLKDKVFVYDLARQ 448

Query: 424 RLGVARELCT 433
           R+G A   C+
Sbjct: 449 RIGWASYDCS 458


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 107/430 (24%), Positives = 187/430 (43%), Gaps = 58/430 (13%)

Query: 45  PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT----YIVR 100
           P + + PL+ +  +  + A+D+AR   +    V    VV  +   Q T  P     Y  +
Sbjct: 31  PLERAIPLNQQVELEALRARDRARHGRILQGVVG--GVVDFS--VQGTSDPYFVGLYFTK 86

Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAA 152
            K+G+PA+   + +DT +D  W+ C  C  C  +         F++A S+T   + C   
Sbjct: 87  VKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCGDP 146

Query: 153 QCK---QVPNPTCGGGA--CAFNLTYGSST------IAANLSQDTISLATDIVPGYT--- 198
            C    Q     C   A  C++   YG  +      ++  +  DT+ L   +V   +   
Sbjct: 147 ICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANSSSTI 206

Query: 199 -FGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
            FGC    +G+         G+ G G G+LS+++Q  ++ +    FS+CL   +  +  G
Sbjct: 207 IFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE--NGGG 264

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            L LG I +P  I Y+PL+ +      Y +NL +I V  ++  +P  +  F  T   GTI
Sbjct: 265 VLVLGEILEPS-IVYSPLVPSQPH---YNLNLQSIAVNGQL--LPIDSNVFATTNNQGTI 318

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMF- 366
           +DSGT    LV  AY          V S  +   +   + CY    SV  + P ++L F 
Sbjct: 319 VDSGTTLAYLVQEAYNPFVKAITAAV-SQFSKPIISKGNQCYLVSNSVGDIFPQVSLNFM 377

Query: 367 SGMNVTLPQDNLLIHS---TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            G ++ L  ++ L+H       ++ C+        V     ++ ++  ++   +YD+ N 
Sbjct: 378 GGASMVLNPEHYLMHYGFLDGAAMWCIGF----QKVEQGFTILGDLVLKDKIFVYDLANQ 433

Query: 424 RLGVARELCT 433
           R+G A   C+
Sbjct: 434 RIGWADYDCS 443


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 164/376 (43%), Gaps = 54/376 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLG 148
           Y  R ++GTP +   + +DT +D  WV C  C GC            F+   S T   + 
Sbjct: 52  YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLIS 111

Query: 149 CQAAQCK---QVPNPTCGG--GACAFNLTYGSST------IAANLSQDTI---SLATDIV 194
           C   +C    Q  +  C      C +N  YG  +      ++  L  DT+   S+  +  
Sbjct: 112 CSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSS 171

Query: 195 PGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
               FGC    TG+         G+ G G+  +S+++Q  +Q +    FS+CL      S
Sbjct: 172 APIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDD--S 229

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
             G L LG I +P  I YTPL+ +      Y +N+ +I V  + + I P    F  ++  
Sbjct: 230 GGGILVLGEIVEPN-IVYTPLVPSQPH---YNLNMQSISVNGQTLAIDPSV--FGTSSSQ 283

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT---SLGGFDTCY----SVPIVAPT 361
           GTIIDSGT    L   AY    D F   + S ++ +    L   + CY    S+  + P 
Sbjct: 284 GTIIDSGTTLAYLAEAAY----DPFISAITSIVSPSVRPYLSKGNHCYLISSSINDIFPQ 339

Query: 362 ITLMFSG--MNVTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
           ++L F+G    + +PQD L+  S+ G  ++ C+            + ++ ++  ++   +
Sbjct: 340 VSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQ---GQGITILGDLVLKDKIFV 396

Query: 418 YDVPNSRLGVARELCT 433
           YD+ N R+G A   C+
Sbjct: 397 YDIANQRIGWANYDCS 412


>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
          Length = 492

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 87/323 (26%), Positives = 134/323 (41%), Gaps = 22/323 (6%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
           T +  Y++   +GTP Q +   +D ++D  W+ C+ C  C +    +  +  F       
Sbjct: 92  TNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAF-LSF 150

Query: 152 AQCKQVPNPTCGGGACAFNLTYG---SSTIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
              +    P CG     ++  YG   ++T A  L+ D  + AT    G  FGC     G+
Sbjct: 151 HDTRAPTTPPCG-----YSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFGCAVATEGD 205

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK--RIKY 266
                G++GLGRG LS ++Q Q      FSY L    A+     +      +P+  R   
Sbjct: 206 I---GGVIGLGRGELSPVSQLQ---IGRFSYYLAPDDAVDVGSFILFLDDAKPRTSRAVS 259

Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
           TPL+ +    SLYYV L  IRV    + IP G          G ++      T L A AY
Sbjct: 260 TPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIPVTFLDAGAY 319

Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV-TLPQDNLLIH 381
             VR     ++       S  G D CY+   +A    P++ L+F+G  V  L   N    
Sbjct: 320 KVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVMELEMGNYFYM 379

Query: 382 STAGSITCLAMAAAPDNVNSVLN 404
            +   + CL +  +P    S+L 
Sbjct: 380 DSTTGLECLTILPSPAGDGSLLG 402


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 110/407 (27%), Positives = 177/407 (43%), Gaps = 43/407 (10%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
           EE VL  +A   +R Q    L    +  V     R   Q   YI    IG+P Q     +
Sbjct: 49  EERVLRAVAV--SRQQQQQRLMAGAEDDVSAQVHRATRQ---YIASYLIGSPPQRTEALI 103

Query: 115 DTSNDAAWVPC-TGCV--GCSST---VFNSAQSTTFKNLGC--QAAQCKQVPNPTCG-GG 165
           DT +D  W  C T C+   C+      +N +QS+TF  + C  +A  C       CG  G
Sbjct: 104 DTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPCADKAGFCAANGVHLCGLDG 163

Query: 166 ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGNSVPPQGLLGLGRGS 222
           +C F  +YG+  +  +L  ++ +  +       FGC+   +  +G      GL+GLGRG 
Sbjct: 164 SCTFIASYGAGRVIGSLGTESFAFESGTTS-LAFGCVSLTRITSGALNDASGLIGLGRGR 222

Query: 223 LSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKRIKYT-PLLKNPRR---SS 277
           LSL++Q   +  + FSYCL P F +   S  L +G          + P +K+P+    S+
Sbjct: 223 LSLVSQ---IGATRFSYCLTPYFHSSGASSHLFVGASASLGGGGASMPFVKSPKDYPYST 279

Query: 278 LYYVNLLAIRVGR-RVVDIPPGALQ----FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
            YY+ L  I VG+ R+  +     Q    F      G IID+G+  T+L + AY A+++ 
Sbjct: 280 FYYLPLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEE 339

Query: 333 FRRRVGSNLTVTS--LGGFDTCYS---VPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGS 386
              ++G+   V +    G + C +      V P +   F  G ++ +P  +        +
Sbjct: 340 VAAQLGNGSLVPAPEDSGLELCVAREGFQKVVPALVFHFGGGADMAVPAASYWAPVDKAA 399

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              + +    D      ++I N QQQ+  +LYD+   R       CT
Sbjct: 400 ACMMILEGGYD------SIIGNFQQQDMHLLYDLRRGRFSFQTADCT 440


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 109/437 (24%), Positives = 182/437 (41%), Gaps = 64/437 (14%)

Query: 45  PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV-------------PIASGRQI 91
           P + + PL     + E+ A+D+ R   +  L   R+S V             P   G ++
Sbjct: 43  PLQRAFPLDELVELSELRARDRVRHARIL-LGGGRQSSVGGVVDFPVQGSSDPYLVGSKM 101

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTT 143
           T    Y  + K+G+P     + +DT +D  WV C+ C  C  +         F++  S T
Sbjct: 102 TM--LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLT 159

Query: 144 FKNLGCQAAQCKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTI--------SLA 190
             ++ C    C  V   T         C ++  YG  S  +     DT         SL 
Sbjct: 160 AGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLV 219

Query: 191 TDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSF 244
            +      FGC    +G+         G+ G G+G LS+++Q  ++ +    FS+CL   
Sbjct: 220 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 279

Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
              S  G   LG I  P  + Y+PL+ +      Y +NLL+I V  ++  +P  A  F  
Sbjct: 280 G--SGGGVFVLGEILVPGMV-YSPLVPSQPH---YNLNLLSIGVNGQM--LPLDAAVFEA 331

Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAP 360
           +   GTI+D+GT  T LV  AY    +     V S L    +   + CY    S+  + P
Sbjct: 332 SNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFP 390

Query: 361 TITLMFSGMNVTL--PQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           +++L F+G    +  PQD L  +      S+ C+    AP+       ++ ++  ++   
Sbjct: 391 SVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE----QTILGDLVLKDKVF 446

Query: 417 LYDVPNSRLGVARELCT 433
           +YD+   R+G A   C+
Sbjct: 447 VYDLARQRIGWASYDCS 463


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 119/459 (25%), Positives = 189/459 (41%), Gaps = 59/459 (12%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVLEMLA 63
           L F L+ +FL     G + +   +  S T ++ H  SP SP F  S+  + +  +   + 
Sbjct: 11  LSFALSIIFLTVSMSGFS-LVQAEKLSFTTELIHRDSPNSPLFNASE--TTDIRLANAVE 67

Query: 64  KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
           +   R+   + L     + +  A    I  +  ++++  IG P   LL+ + T +D  W+
Sbjct: 68  RSADRVNRFNDLI---SNSITAAEFPSILDNGDFLMKISIGIPPTELLVNVATGSDLVWI 124

Query: 124 PCTG---CV-GCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT--YGSST 177
           PC     C   C    F+  +S+T+KN+ C + +C+     TC    C ++    +  S 
Sbjct: 125 PCLSFKPCTHNCDLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRHQDSC 184

Query: 178 IAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
              +L+ DT++L +      ++P   F C  +  G+  P  G+LGLG GSLSLL +  +L
Sbjct: 185 PDGDLAMDTLTLNSTTGKSFMLPNTGFICGNRIGGD-YPGVGILGLGHGSLSLLNRISHL 243

Query: 233 YQSTFSYCLPSFKA-----LSFSG----------SLRLGPIGQPKRIKYTPLLKNPRRSS 277
               FS+C+  + +     LSF            S RL   G P                
Sbjct: 244 IDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYS-------------- 289

Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
            Y ++   I VG +   I  G +  +     G  +DSGT+FT      Y+ +    R  +
Sbjct: 290 -YTLSFYGISVGNK--SISAGGIGSDYYMN-GLGMDSGTMFTYFPEYFYSQLEYDVRYAI 345

Query: 338 GSN-LTVTSLGGFDTC--YSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAA 394
               L          C  YS     PTIT+ F G +V L   N  I  T   I CLA A 
Sbjct: 346 QQEPLYPDPTRRLRLCYRYSPDFSPPTITMHFEGGSVELSSSNSFIRMTE-DIVCLAFAT 404

Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           +    ++V       QQ N  I YD+    L   +  CT
Sbjct: 405 SSSEQDAVFGY---WQQTNLLIGYDLDAGFLSFLKTDCT 440


>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 150/332 (45%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTP++T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +PG++FGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G++S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGAMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 107/429 (24%), Positives = 178/429 (41%), Gaps = 55/429 (12%)

Query: 45  PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGRQITQSPTYI 98
           P + + PL     + E+ A+D+ R   +  L   R+S V      P+           Y 
Sbjct: 43  PLQRAFPLDELVELSELRARDRVRHARIL-LGGGRQSSVGGVVDFPVQGSSDPYLVGLYF 101

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQ 150
            + K+G+P     + +DT +D  WV C+ C  C  +         F++  S T  ++ C 
Sbjct: 102 TKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCS 161

Query: 151 AAQCKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTI--------SLATDIVPGY 197
              C  V   T         C ++  YG  S  +     DT         SL  +     
Sbjct: 162 DPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPI 221

Query: 198 TFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
            FGC    +G+         G+ G G+G LS+++Q  ++ +    FS+CL      S  G
Sbjct: 222 VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG--SGGG 279

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
              LG I  P  + Y+PL+ +      Y +NLL+I V  ++  +P  A  F  +   GTI
Sbjct: 280 VFVLGEILVPGMV-YSPLVPSQPH---YNLNLLSIGVNGQM--LPLDAAVFEASNTRGTI 333

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFS 367
           +D+GT  T LV  AY    +     V S L    +   + CY    S+  + P+++L F+
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFA 392

Query: 368 GMNVTL--PQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
           G    +  PQD L  +      S+ C+    AP+       ++ ++  ++   +YD+   
Sbjct: 393 GGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE----QTILGDLVLKDKVFVYDLARQ 448

Query: 424 RLGVARELC 432
           R+G A   C
Sbjct: 449 RIGWASYDC 457


>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 151/332 (45%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTP++T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +PG++FGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G++S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGAMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L +  + +  +        +A AP    S++
Sbjct: 289 DLGRGGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 112/445 (25%), Positives = 182/445 (40%), Gaps = 69/445 (15%)

Query: 31  SSTLQVFHVFSPCS----PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR----KSV 82
           S T  + H++SP      PF  S P   +E  L+  A       F+ S  + +    + +
Sbjct: 34  SFTFNIHHLYSPAVRQILPFH-SFP---DEGTLDYYAAMVRTDXFVHSRRLGQVQDHRPL 89

Query: 83  VPIASGRQITQSPT---YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------ 133
             ++    +  SP    Y     +GTP    L+A+DT +D  W+PC  CV C +      
Sbjct: 90  TFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQ 148

Query: 134 -----TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDT 186
                 +++   S+T K + C ++ C  +   +     C + ++Y S   ++   L +D 
Sbjct: 149 GPVNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDI 208

Query: 187 ISLATDIVPG------YTFGCIQKATG---NSVPPQGLLGLGRGSLSLLAQTQN--LYQS 235
           + L T+ V         T GC +  +G   +S  P GL GLG  ++S+ +   N  L  +
Sbjct: 209 LHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISN 268

Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           +FS C    +     G +  G  G P + + TP     RR   Y V++  I VG  + D+
Sbjct: 269 SFSLCFGPARM----GRIEFGDKGSPGQNE-TPFNLG-RRHPTYNVSITQIGVGGHISDL 322

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYS 354
                          I DSGT FT L  PAY+   D F   V     T+ S   F+ CY 
Sbjct: 323 -----------DVAVIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYE 371

Query: 355 VPIVAPTITL------MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
           +     T T       M  G +  +    +LI + +  + CLA+A +       +N+I  
Sbjct: 372 LSPNQTTFTYPLMNLTMKGGGHFVINHPIVLISTESKRLFCLAIARS-----DSINIIGQ 426

Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
                + I++D     LG     CT
Sbjct: 427 NFMTGYHIVFDREKMVLGWKESNCT 451


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 99/387 (25%), Positives = 161/387 (41%), Gaps = 51/387 (13%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
           +P+      +++  Y  +  IGTP++   + +DT +D  WV C GC        +G   T
Sbjct: 141 LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLT 200

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISL- 189
           +++   STT   +GC    C     P  G   G  C +++ YG  S+      QD +   
Sbjct: 201 LYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYN 260

Query: 190 -------ATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQST 236
                   T       FGC  K +G     S    G+LG G+ + S+L+Q  +    +  
Sbjct: 261 RISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKV 320

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           FS+CL +       G   +G + +PK +  TPL++N    + Y V +  I VG   +D+P
Sbjct: 321 FSHCLDNVDG---GGIFAIGEVVEPK-VNITPLVQN---QAHYNVVMKEIEVGGDPLDVP 373

Query: 297 PGALQFNPTTGAGTIIDSGT--------VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
             A  F      GTIIDSGT        V+  L+    +   D+    V    T      
Sbjct: 374 SDA--FESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFTC----- 426

Query: 349 FDTCYSVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           FD   +V    PT+TL F   +++T+ P + L  H     I      A   +    L ++
Sbjct: 427 FDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGAQTKDGKD-LTLL 485

Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
            ++   N  ++YD+    +G     C+
Sbjct: 486 GDLVLSNKLVVYDLEKQGIGWVEYNCS 512


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 99/386 (25%), Positives = 166/386 (43%), Gaps = 51/386 (13%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-------SSTV 135
           +P+    Q      Y  +  +GTP++   + +DT +D  WV C GC+ C         T 
Sbjct: 71  IPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP 130

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPT-CGGGA-CAFNLTYGS-STIAANLSQDTISLATD 192
           ++   S+T K++ C    C  V   + C  G+ C + + YG  S+    L +D + L  D
Sbjct: 131 YDVDASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHL--D 188

Query: 193 IVPG----------YTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQ--TQNLYQST 236
           +V G            FGC  K +G     Q    G++G G+ + S ++Q  +Q   + +
Sbjct: 189 LVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRS 248

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           F++CL +       G   +G +  PK +K TP+L    +S+ Y VNL AI VG  V+++ 
Sbjct: 249 FAHCLDNNNG---GGIFAIGEVVSPK-VKTTPMLS---KSAHYSVNLNAIEVGNSVLELS 301

Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYT-AVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
             A  F+     G IIDSGT    L    Y   + ++        LT+ ++    TC+  
Sbjct: 302 SNA--FDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASH--PELTLHTVQESFTCFHY 357

Query: 356 PIVA---PTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNVIA 407
                  PT+T  F   +V+L   P++ L          C             + L ++ 
Sbjct: 358 TDKLDRFPTVTFQFD-KSVSLAVYPREYLF--QVREDTWCFGWQNGGLQTKGGASLTILG 414

Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
           +M   N  ++YD+ N  +G     C+
Sbjct: 415 DMALSNKLVVYDIENQVIGWTNHNCS 440


>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
 gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
 gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 524

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 153/355 (43%), Gaps = 55/355 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTF 144
           +    K+GTP    ++A+DT +D  WVPC  C  C+ T            ++N   STT 
Sbjct: 107 HYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKVSTTN 165

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATDI-----VPGY 197
           K + C  + C Q          C + ++Y S+  + +  L +D + L T+      V  Y
Sbjct: 166 KKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAY 225

Query: 198 -TFGCIQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSG 251
            TFGC Q  +G   +   P GL GLG   +S+  +   + L   +FS C          G
Sbjct: 226 VTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCF----GHDGVG 281

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            +  G  G   + + TP   NP   + Y + +  +RVG  ++D    AL           
Sbjct: 282 RISFGDKGSSDQ-EETPFNLNPSHPN-YNITVTRVRVGTTLIDDEFTAL----------- 328

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCYSVPIVA-----PTITLM 365
            D+GT FT LV P YT V + F  +      +  S   F+ CY +   A     P+++L 
Sbjct: 329 FDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLIPSLSLT 388

Query: 366 FSGMNVTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
             G +     D +++ ST G  + CLA+  +     S LN+I       +R+++D
Sbjct: 389 MKGNSHFTINDPIIVISTEGELVYCLAIVKS-----SELNIIGQNYMTGYRVVFD 438


>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
          Length = 450

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 115/447 (25%), Positives = 180/447 (40%), Gaps = 97/447 (21%)

Query: 48  PSKPLSWEESVLEMLAKDQAR---LQFLSSLAVARKS--------------VVPIASGRQ 90
           P  P + E  +  +LA D+AR   LQ  +  A  +                 VP+ SG +
Sbjct: 38  PDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAAGAEVPLTSGIR 97

Query: 91  ITQSPTYIVRAKIGTPAQ------TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQS 141
             Q+  Y+    +G           L + +DT +D  WV   PC+ C      +F+ + S
Sbjct: 98  F-QTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGS 156

Query: 142 TTFKNLGCQAAQCKQVPNPTCG-GGACA---------------FNLTYGSSTIAAN-LSQ 184
            ++  + C A+ C+       G  G+CA               ++L YG  + +   L+ 
Sbjct: 157 ASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLAT 216

Query: 185 DTISLATDIVPGYTFGCIQKATGNSVP----------PQGLLGLGRGSLSLLAQTQNLYQ 234
           DT++L    V G+ FGC     G   P          P G  G   GSLSL   T +   
Sbjct: 217 DTVALGGASVDGFVFGCGLSNRGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRN 276

Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
           +T                           + YT ++ +P +   Y++N+    VG   V 
Sbjct: 277 AT--------------------------PVSYTRMIADPAQPPFYFMNVTGASVGGAAV- 309

Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS--LGGFDTC 352
                        A  ++DSGTV TRL    Y AVR  F R+ G+     +      D C
Sbjct: 310 ------AAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDAC 363

Query: 353 YSV----PIVAPTITL-MFSGMNVTLPQDNLLIHSTA-GSITCLAMAAAPDNVNSVLNVI 406
           Y++     +  P +TL + +G ++T+    +L  +   GS  CLAMA+   +      +I
Sbjct: 364 YNLTGHDEVKVPLLTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASL--SFEDQTPII 421

Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
            N QQ+N R++YD   SRLG A E C+
Sbjct: 422 GNYQQKNKRVVYDTVGSRLGFADEDCS 448


>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
 gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
          Length = 504

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 152/390 (38%), Gaps = 73/390 (18%)

Query: 114 MDTSNDAAWVPCTG-----CVG--------------------CSSTVFNSAQSTTFKNLG 148
           +DT +D  W PC       C G                    C+S + ++A ++   +  
Sbjct: 109 LDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRRIPCASPLCSAAHASAPPSDL 168

Query: 149 CQAAQC--KQVPNPTCGGG-ACA-FNLTYGSSTIAANLSQDTISLATDI-------VPGY 197
           C AA+C  + +   +CG   AC      YG  ++ A+L +  ++L           V  +
Sbjct: 169 CAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAVAVDNF 228

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL--PSFKA--------- 246
           TF C   A G    P G+ G GRG LSL  Q        FSYCL   SF+A         
Sbjct: 229 TFACAHTALGE---PVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLIRPSPL 285

Query: 247 -LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
            L  S         +     YTPLL NP+    Y V L A+ VG   +   P   + +  
Sbjct: 286 ILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPELARVDRA 345

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-----VTSLGGFDTCYSVPIV-- 358
              G ++DSGT FT L    Y  V + F R + +             G   CY       
Sbjct: 346 GNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRYAASDR 405

Query: 359 -APTITLMFSG-MNVTLPQDNLLI-----HSTAGS----ITCLAMA----AAPDNVNSVL 403
             P + L F G   V LP+ N  +      + AG+    + CL +     A+ +  +   
Sbjct: 406 GVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPA 465

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
             + N QQQ   ++YDV   R+G AR  CT
Sbjct: 466 GTLGNFQQQGFEVVYDVDAGRVGFARRRCT 495


>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
 gi|194703714|gb|ACF85941.1| unknown [Zea mays]
          Length = 208

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 79/223 (35%), Positives = 109/223 (48%), Gaps = 21/223 (9%)

Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY--TPLLKNP 273
           +GLG G+ SL++QT       FSYCLP     S SG L LG  G      +  TP+L++ 
Sbjct: 1   MGLGGGAQSLVSQTAGTLGRAFSYCLP--PTPSSSGFLTLGAAGGSGTSGFVKTPMLRSS 58

Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
           +  + Y V L AIRVG R + IP           AGT++DSGTV TRL   AY+A+   F
Sbjct: 59  QVPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAF 112

Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITC 389
           +  +         G  DTC+       +  P++ L+FSG  V     + +I S      C
Sbjct: 113 KAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIILS-----NC 167

Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           LA A   D  +S L +I N+QQ+   +LYDV    +G     C
Sbjct: 168 LAFAGNSD--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 96/364 (26%), Positives = 159/364 (43%), Gaps = 47/364 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C S     F    S T++ + C   Q
Sbjct: 93  YTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKC-TWQ 151

Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATGN- 208
           C    N       C +   Y   ST +  L +D +S    T++ P    FGC    TG+ 
Sbjct: 152 C----NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGCENDETGDI 207

Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            +    G++GLGRG LS++ Q   + +   +FS C          G++ LG I  P  + 
Sbjct: 208 YNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCY--GGMGVGGGAMVLGGISPPADMV 265

Query: 266 YTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRL 321
           +T    +P RS  Y ++L  I V G+R        L  NP       GT++DSGT +  L
Sbjct: 266 FT--RSDPVRSPYYNIDLKEIHVAGKR--------LHLNPKVFDGKHGTVLDSGTTYAYL 315

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGF--DTCYS--------VPIVAPTITLMF-SGMN 370
              A+ A +    +   S   ++       D C+S        +    P + ++F +G  
Sbjct: 316 PESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVEMVFGNGHK 375

Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           ++L P++ L  HS      CL + +   N N    ++  +  +N  ++YD  ++++G  +
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFS---NGNDPTTLLGGIVVRNTLVMYDREHTKIGFWK 432

Query: 430 ELCT 433
             C+
Sbjct: 433 TNCS 436


>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
          Length = 321

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 145/327 (44%), Gaps = 27/327 (8%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +PG+TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP       F    +G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRKGVVFDSGSELSY 233

Query: 321 LVAPAYTAVRDVFRR---RVGSNLTVTSLGGFDTCYSVPIVAPTITLMF-SGMNVTLPQD 376
           +   A + +R   R    + G+    +    +D         P I+L F  G    L   
Sbjct: 234 IPDRALSVLRQRIRELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARFDLGSH 293

Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVL 403
            + +  +        +A AP    S++
Sbjct: 294 GVFVERSVQEQDVWCLAFAPTKSVSII 320


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 107/427 (25%), Positives = 166/427 (38%), Gaps = 52/427 (12%)

Query: 24  ICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEE--SVLEMLAKDQARLQFLSSLAVARKS 81
           +  TQ+H   +++ H  S  SPF   K    +   S+L         L  + S +  +  
Sbjct: 19  LTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVRYLNHVFSFSPNKIQ 78

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNS 138
            VP++S         Y++   IGTP   L   +DT ND  W    PC  C+  +S +F+ 
Sbjct: 79  DVPLSS----FMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHP 134

Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
           ++S+T+K + C +  CK       G      N   G+     N+                
Sbjct: 135 SKSSTYKTIPCTSPICKNADGHYLGVDTLTLNSNNGTPISFKNI---------------V 179

Query: 199 FGCIQKATGNSVPPQGL----LGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
            GC  +  G   P +G     +GL RG LS ++Q  +     FSYCL P F   + S  L
Sbjct: 180 IGCGHRNQG---PLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKL 236

Query: 254 RLGPIGQPKRIKY--TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
             G       +    TP+    +  + Y+V+L A  VG  ++ +       N      +I
Sbjct: 237 HFGDKSTVSGLGTVSTPI----KEENGYFVSLEAFSVGDHIIKLE------NSDNRGNSI 286

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SVPIVAPT--ITLMF 366
           IDSGT  T L    Y+ +  V    V           F+ CY   S  ++     IT  F
Sbjct: 287 IDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHF 346

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           SG  V L   N   +     + C A  +  +   S L +  N+ QQN  + +D+    + 
Sbjct: 347 SGSEVHLNALNTF-YPITDEVICFAFVSGGN--FSSLAIFGNVVQQNFLVGFDLNKKTIS 403

Query: 427 VARELCT 433
                CT
Sbjct: 404 FKPTDCT 410


>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 522

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 96/350 (27%), Positives = 152/350 (43%), Gaps = 55/350 (15%)

Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTFKNLGC 149
           K+GTP    ++A+DT +D  WVPC  C  C+ T            ++N   STT K + C
Sbjct: 110 KLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKISTTNKKVTC 168

Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATDI-----VPGY-TFGC 201
             + C Q          C + ++Y S+  + +  L +D + L T+      V  Y TFGC
Sbjct: 169 NNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGC 228

Query: 202 IQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
            Q  +G   +   P GL GLG   +S+  +   + L   +FS C          G +  G
Sbjct: 229 GQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCF----GHDGVGRISFG 284

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             G   + + TP   NP   + Y + +  +RVG  ++D    AL            D+GT
Sbjct: 285 DKGSSDQ-EETPFNLNPSHPN-YNITVTRVRVGTTLIDDEFTAL-----------FDTGT 331

Query: 317 VFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCYSVPIVA-----PTITLMFSGMN 370
            FT LV P YT V + F  +      +  S   F+ CY +   A     P+++L   G +
Sbjct: 332 SFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNS 391

Query: 371 VTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
                D +++ ST G  + CLA+  +     S LN+I       +R+++D
Sbjct: 392 HFTINDPIIVISTEGELVYCLAIVKS-----SELNIIGQNYMTGYRVVFD 436


>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
 gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
          Length = 508

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 104/393 (26%), Positives = 149/393 (37%), Gaps = 79/393 (20%)

Query: 114 MDTSNDAAWVPCTG-----CVG--------------------------CSSTVFNSAQST 142
           +DT +D  W PC       C G                          C+S + ++A ++
Sbjct: 113 LDTGSDLVWFPCAPFTCMLCEGKPTPSGGHSSSAPLPLPPPPDSRRVPCASPLCSAAHAS 172

Query: 143 TFKNLGCQAAQC--KQVPNPTCGGGACA---FNLTYGSSTIAANLSQDTISLATDI-VPG 196
              +  C AA C  + +   +C G + A       YG  ++ A+L +  + L   + V  
Sbjct: 173 APPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGSLVAHLRRGRVGLGASVAVDN 232

Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL--PSFKALSFSGSLR 254
           +TF C   A G    P G+ G GRG LSL  Q        FSYCL   SF+A      +R
Sbjct: 233 FTFACAHTALGE---PVGVAGFGRGPLSLPGQLAPQLSGRFSYCLVSHSFRADRL---IR 286

Query: 255 LGPI---------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
             P+          +     YTPLL NP+    Y V L A+ VG   +   P   + +  
Sbjct: 287 PSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARVDRA 346

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-----VTSLGGFDTCYSVPIV-- 358
              G ++DSGT FT L    Y  V + F R + +             G   CY       
Sbjct: 347 GNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTGLTPCYHYAASDR 406

Query: 359 -APTITLMFSG-MNVTLPQDNLLI----HSTAG------SITCLAMAAAPD------NVN 400
             P + L F G   V LP+ N  +       AG       + CL +    D        +
Sbjct: 407 GVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMNGGDVSGEDGGDD 466

Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
                + N QQQ   ++YDV   R+G AR  CT
Sbjct: 467 GPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 499


>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 87/332 (26%), Positives = 149/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT + A+WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 158/360 (43%), Gaps = 41/360 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    S+T++ + C    
Sbjct: 84  YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC-TID 142

Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN- 208
           C    N       C +   Y   ST +  L +D IS    +++ P    FGC    TG+ 
Sbjct: 143 C----NCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGCENVETGDL 198

Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            S    G++GLGRG LS++ Q   +N+   +FS C          G++ LG I  P  + 
Sbjct: 199 YSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDV--GGGAMVLGGISPPSDMA 256

Query: 266 YTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
           +     +P RS  Y ++L  I V G+R   +P  A  F+     GT++DSGT +  L   
Sbjct: 257 FA--YSDPVRSPYYNIDLKEIHVAGKR---LPLNANVFD--GKHGTVLDSGTTYAYLPEA 309

Query: 325 AYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVPIVA--------PTITLMF-SGMNVTL 373
           A+ A +D   + + S   ++       D C+S   +         P + ++F +G   TL
Sbjct: 310 AFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKYTL 369

Query: 374 -PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            P++ +  HS      CL +     N N    ++  +  +N  ++YD   +++G  +  C
Sbjct: 370 SPENYMFRHSKVRGAYCLGVF---QNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNC 426


>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
 gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
 gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 149/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +PG++FGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  +R  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLKRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 106/398 (26%), Positives = 156/398 (39%), Gaps = 73/398 (18%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----------VGC---SSTVFNSAQSTT 143
           YI    IG P Q     +DT +D  W  C+ C           GC   +   +N + S T
Sbjct: 78  YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137

Query: 144 FKNLGCQ---AAQCKQVPNPT-C--GGG----ACAFNLTYGSSTIAANLSQDTISLATDI 193
            + + C     A C   P    C  GGG    AC    +YG+      L  D  +  +  
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGVALGVLGTDAFTFPSSS 197

Query: 194 VPGYTFGCIQK---ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSF 249
                FGC+ +   + G      G++GLGRG+LSL++Q   L  + FSYCL P F+    
Sbjct: 198 SVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQ---LNATEFSYCLTPYFRDTVS 254

Query: 250 SGSLRLGPIGQPKR--------------IKYTPLLKNPRR---SSLYYVNLLAIRVGRRV 292
              L +G  G+                 +   P  KNP+    S+ YY+ L+ +  G   
Sbjct: 255 PSHLFVGD-GELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNAT 313

Query: 293 VDIPPGALQFNPTT----GAGTIIDSGTVFTRLVAPAYTAVRDVFRR------------- 335
           V +P GA             G +IDSG+ FTRLV PA+ A+     R             
Sbjct: 314 VALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPA 373

Query: 336 RVGSNLTVTSLGGFDTCYSVPIVAPTITLMFS-----GMNVTLPQDNLLIHSTAGSITCL 390
           ++G  L +    G D         P + L F      G  + +P +       A +    
Sbjct: 374 KLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMA 433

Query: 391 AMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNSRL 425
            +++A  N     N   +I N  QQ+ R+LYD+ N  L
Sbjct: 434 VVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLL 471


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 102/433 (23%), Positives = 182/433 (42%), Gaps = 43/433 (9%)

Query: 6   VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAK 64
           + F +  F+ S S  L        +S + ++ H  S  SP +KP++            + 
Sbjct: 9   LLFFSLCFIISFSHSLR-------NSFSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSI 61

Query: 65  DQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVP 124
           ++A   F  SL+   +S V +  G        Y++   +GTP   +   +DT +D  W+ 
Sbjct: 62  NRANRLFKDSLSNTPESTVYVNGGE-------YLMTYSVGTPPFNVYGVVDTGSDIVWLQ 114

Query: 125 CTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYGSSTIA- 179
           C  C  C   ++ +FN ++S+++KN+ C +  C+ V   +C    +C + + +   + + 
Sbjct: 115 CKPCEQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQ 174

Query: 180 ANLSQDTISLATDI-----VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY 233
             LS +T++L +        P    GC     G       G++GLG G +SL  Q ++  
Sbjct: 175 GELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSI 234

Query: 234 QSTFSYC-LPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
              FSYC LP     + +  L  G   +     +  TP +K   + + YY+ L A  VG 
Sbjct: 235 GGKFSYCLLPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQ-AFYYLTLEAFSVGN 293

Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
           + ++        + +     I+DSGT  T L +  YT +     + V  +         +
Sbjct: 294 KRIEFEV----LDDSEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLN 349

Query: 351 TCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIA 407
            CYS+       P IT  F G ++ L   +   H  A  + CLA  ++         +  
Sbjct: 350 LCYSITSDQYDFPIITAHFKGADIKLNPISTFAH-VADGVVCLAFTSSQTGP-----IFG 403

Query: 408 NMQQQNHRILYDV 420
           N+ Q N  + YD+
Sbjct: 404 NLAQLNLLVGYDL 416


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 154/357 (43%), Gaps = 33/357 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           ++V   +G P    L+ +DT +D  WV C  C  C   S+ +F+ ++S+T+ +L   +  
Sbjct: 91  FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 150

Query: 154 CKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATD-----IVPGYTFGCIQKAT 206
           C   P         C +N +Y   ST + NL+ + I   T       V    FGC     
Sbjct: 151 CPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 210

Query: 207 GNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIGQPKRI 264
           G     Q G+LGL  G  S++++      S FSYC+   F        L LG  G     
Sbjct: 211 GRFDGQQSGILGLSAGDQSIVSRLG----SRFSYCIGDLFDPHYTHNQLVLGD-GVKMEG 265

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
             TP       +  YYV L  I VG   +DI P   Q   +   G ++DSGT  T L   
Sbjct: 266 SSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKD 322

Query: 325 AYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSVPIVA-----PTITLMFS-GMNVTLPQ 375
            +  + +  +R V  +   +   ++ G+  CY   +       P +   F+ G ++ L  
Sbjct: 323 GFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGADLVLDA 381

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           ++L +      + CLA+  +  N+ ++ +VI  M QQ++ + YD+   R+   R  C
Sbjct: 382 NSLFVQKNQ-DVFCLAVLES--NLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 154/357 (43%), Gaps = 33/357 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           ++V   +G P    L+ +DT +D  WV C  C  C   S+ +F+ ++S+T+ +L   +  
Sbjct: 59  FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118

Query: 154 CKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATD-----IVPGYTFGCIQKAT 206
           C   P         C +N +Y   ST + NL+ + I   T       V    FGC     
Sbjct: 119 CPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178

Query: 207 GNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIGQPKRI 264
           G     Q G+LGL  G  S++++      S FSYC+   F        L LG  G     
Sbjct: 179 GRFDGQQSGILGLSAGDQSIVSRLG----SRFSYCIGDLFDPHYTHNQLVLGD-GVKMEG 233

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
             TP       +  YYV L  I VG   +DI P   Q   +   G ++DSGT  T L   
Sbjct: 234 SSTPFHT---FNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKD 290

Query: 325 AYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSVPIVA-----PTITLMFS-GMNVTLPQ 375
            +  + +  +R V  +   +   ++ G+  CY   +       P +   F+ G ++ L  
Sbjct: 291 GFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGADLVLDA 349

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           ++L +      + CLA+  +  N+ ++ +VI  M QQ++ + YD+   R+   R  C
Sbjct: 350 NSLFVQKNQ-DVFCLAVLES--NLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 65/222 (29%), Positives = 106/222 (47%), Gaps = 24/222 (10%)

Query: 28  QDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV---------- 77
            D  ++L+V H   PCS     K  S   S  +ML +D++R+  + S             
Sbjct: 62  DDKRASLEVIHKHGPCSKLSQDKGRS--PSRTQMLDQDESRVNSIRSRLAKNPADGGKLK 119

Query: 78  ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C---SS 133
             K  +P  SG  I  +  Y+V   +GTP + L    DT +D  W  C  C   C     
Sbjct: 120 GSKVTLPSKSGSTIG-TGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQE 178

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTI 187
            +FN ++ST++ N+ C +  C ++ +     P+C    C + + YG  + +    +QD +
Sbjct: 179 PIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKL 238

Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
           +L +TD+   + FGC Q   G  V   GL+GLGR +LSL+++
Sbjct: 239 ALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLMSK 280


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 160/372 (43%), Gaps = 43/372 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R K+G PA+   + +DT +D  WV C+ C GC ++         FN   S+T   + 
Sbjct: 5   YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64

Query: 149 CQAAQCK---QVPNPTC-----GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT- 198
           C   +C    Q     C         C +  TYG  S  +     DT+   T +    T 
Sbjct: 65  CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 124

Query: 199 -------FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFK 245
                  FGC    +G+         G+ G G+  LS+++Q  +L  S   FS+CL    
Sbjct: 125 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG-- 182

Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           + +  G L LG I +P  + YTPL+ +      Y +NL +I V  +   +P  +  F  +
Sbjct: 183 SDNGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIAVNGQ--KLPIDSSLFTTS 236

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTI 362
              GTI+DSGT    L   AY          V  ++ ++ S G   F T  SV    PT+
Sbjct: 237 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTV 296

Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           TL F  G+ +++  +N L+   +   + L       N    + ++ ++  ++   +YD+ 
Sbjct: 297 TLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 356

Query: 422 NSRLGVARELCT 433
           N R+G A   C+
Sbjct: 357 NMRMGWADYDCS 368


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 102/374 (27%), Positives = 165/374 (44%), Gaps = 52/374 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  + K+G+P +   + +DT +D  WV C  C  C  T         F+S+ S+T   + 
Sbjct: 66  YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVR 125

Query: 149 CQAAQCKQVPNPT---CGG--GACAFNLTYGSST------IAANLSQDTI---SLATDIV 194
           C    C      T   C      C++   YG  +      ++  L  D I   SL  +  
Sbjct: 126 CSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSS 185

Query: 195 PGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
               FGC    +G+         G+ G G+G LS+++Q  T+ +    FS+CL      S
Sbjct: 186 ALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDG--S 243

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
             G L LG I +P  I Y+PL+ +      Y +NLL+I V  +++ I P A  F  +   
Sbjct: 244 GGGILVLGEILEPG-IVYSPLVPSQPH---YNLNLLSIAVNGQLLPIDPAA--FATSNSQ 297

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTIT 363
           GTI+DSGT    LVA AY          V  ++T +TS G  + CY    SV  + P  +
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSKG--NQCYLVSTSVSQMFPLAS 355

Query: 364 LMFSG--MNVTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
             F+G    V  P+D L+   ++G  ++ C+            + ++ ++  ++   +YD
Sbjct: 356 FNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQG-----VTILGDLVLKDKIFVYD 410

Query: 420 VPNSRLGVARELCT 433
           +   R+G A   C+
Sbjct: 411 LVRQRIGWANYDCS 424


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 154/357 (43%), Gaps = 33/357 (9%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           ++V   +G P    L+ +DT +D  WV C  C  C   S+ +F+ ++S+T+ +L   +  
Sbjct: 59  FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118

Query: 154 CKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATD-----IVPGYTFGCIQKAT 206
           C   P         C +N +Y   ST + NL+ + I   T       V    FGC     
Sbjct: 119 CPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178

Query: 207 GNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIGQPKRI 264
           G     Q G+LGL  G  S++++      S FSYC+   F        L LG  G     
Sbjct: 179 GRFDGQQSGILGLSAGDQSIVSRLG----SRFSYCIGDLFDPHYTHNQLVLGD-GVKMEG 233

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
             TP       +  YYV L  I VG   +DI P   Q   +   G ++DSGT  T L   
Sbjct: 234 SSTPFHT---FNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKD 290

Query: 325 AYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSVPIVA-----PTITLMFS-GMNVTLPQ 375
            +  + +  +R V  +   +   ++ G+  CY   +       P +   F+ G ++ L  
Sbjct: 291 GFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGADLVLDA 349

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           ++L +      + CLA+  +  N+ ++ +VI  M QQ++ + YD+   R+   R  C
Sbjct: 350 NSLFVQKNQ-DVFCLAVLES--NLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 74/210 (35%), Positives = 103/210 (49%), Gaps = 18/210 (8%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y +   IGTP  T  +  DT +   W    PCT C    +  F  A S+TF  L C ++ 
Sbjct: 90  YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149

Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKATGNSV 210
           C+ + +P  TC    C +   YG    A  L+ +T+ +     PG TFGC  +   GNS 
Sbjct: 150 CQFLTSPYRTCNATGCVYYYPYGMGFTAGYLATETLHVGGASFPGVTFGCSTENGVGNS- 208

Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTP 268
              G++GLGR  LSL++Q      + FSYCL S  A +    +  G + +     ++ TP
Sbjct: 209 -SSGIVGLGRSPLSLVSQVG---VARFSYCLRS-NADAGDSPILFGSLAKVTGGNVQSTP 263

Query: 269 LLKNPR--RSSLYYVNLLAIRVGRRVVDIP 296
           LL+NP    SS YYVNL  I VG    D+P
Sbjct: 264 LLENPEMPSSSYYYVNLTGITVG--ATDLP 291


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 164/385 (42%), Gaps = 49/385 (12%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-------SSTV 135
           +P+    Q      Y  +  +GTP++   + +DT +D  WV C GC+ C         T 
Sbjct: 71  LPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP 130

Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPT-CGGGA-CAFNLTYGS-STIAANLSQDTISLATD 192
           +++  S+T K++ C    C  V   + C  G+ C + + YG  S+    L +D + L  D
Sbjct: 131 YDADASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHL--D 188

Query: 193 IVPG----------YTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQ--TQNLYQST 236
           +V G            FGC  K +G     Q    G++G G+ + S ++Q  +Q   + +
Sbjct: 189 LVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRS 248

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           F++CL +       G   +G +  PK +K TP+L    +S+ Y VNL AI VG  V+ + 
Sbjct: 249 FAHCLDNNNG---GGIFAIGEVVSPK-VKTTPMLS---KSAHYSVNLNAIEVGNSVLQLS 301

Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP 356
             A  F+     G IIDSGT    L    Y  + +         L + ++    TC+   
Sbjct: 302 SDA--FDSGDDKGVIIDSGTTLVYLPDAVYNPLMNQILAS-HQELNLHTVQDSFTCFHYI 358

Query: 357 IVA---PTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNVIAN 408
                 PT+T  F   +V+L   PQ+ L          C             + L ++ +
Sbjct: 359 DRLDRFPTVTFQFD-KSVSLAVYPQEYLF--QVREDTWCFGWQNGGLQTKGGASLTILGD 415

Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
           M   N  ++YD+ N  +G     C+
Sbjct: 416 MALSNKLVVYDIENQVIGWTNHNCS 440


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 101/383 (26%), Positives = 167/383 (43%), Gaps = 60/383 (15%)

Query: 93  QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTF 144
           Q   Y  + ++GTP     + +DT +D  WV C  C GC  T         F+   S+T 
Sbjct: 71  QVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTS 130

Query: 145 KNLGCQAAQCK---QVPNPTCG--GGACAFNLTYGSST------IAANLSQDTI---SLA 190
             + C   +C    Q  + TC      C++   YG  +      ++  +  +TI   S+ 
Sbjct: 131 SMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVT 190

Query: 191 TDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSF 244
           T+      FGC  + TG+         G+ G G+  +S+++Q  +Q +    FS+CL   
Sbjct: 191 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGD 250

Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
              S  G L LG I +P  I YT L+   P     Y +NL +I V  + + I      F 
Sbjct: 251 S--SGGGILVLGEIVEPN-IVYTSLVPAQPH----YNLNLQSIAVNGQTLQIDSSV--FA 301

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-----TVTSLGGFDTCY----S 354
            +   GTI+DSGT    L   AY    D F   + +++     TV S G  + CY    S
Sbjct: 302 TSNSRGTIVDSGTTLAYLAEEAY----DPFVSAITASIPQSVHTVVSRG--NQCYLITSS 355

Query: 355 VPIVAPTITLMFSG--MNVTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQ 410
           V  V P ++L F+G    +  PQD L+  ++ G  ++ C+            + ++ ++ 
Sbjct: 356 VTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQ---GQGITILGDLV 412

Query: 411 QQNHRILYDVPNSRLGVARELCT 433
            ++  ++YD+   R+G A   C+
Sbjct: 413 LKDKIVVYDLAGQRIGWANYDCS 435


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 160/372 (43%), Gaps = 43/372 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R K+G PA+   + +DT +D  WV C+ C GC ++         FN   S+T   + 
Sbjct: 89  YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 148

Query: 149 CQAAQCK---QVPNPTC-----GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT- 198
           C   +C    Q     C         C +  TYG  S  +     DT+   T +    T 
Sbjct: 149 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 208

Query: 199 -------FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFK 245
                  FGC    +G+         G+ G G+  LS+++Q  +L  S   FS+CL    
Sbjct: 209 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG-- 266

Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           + +  G L LG I +P  + YTPL+ +      Y +NL +I V  +   +P  +  F  +
Sbjct: 267 SDNGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIAVNGQ--KLPIDSSLFTTS 320

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTI 362
              GTI+DSGT    L   AY          V  ++ ++ S G   F T  SV    PT+
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTV 380

Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           TL F  G+ +++  +N L+   +   + L       N    + ++ ++  ++   +YD+ 
Sbjct: 381 TLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 440

Query: 422 NSRLGVARELCT 433
           N R+G A   C+
Sbjct: 441 NMRMGWADYDCS 452


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 160/372 (43%), Gaps = 43/372 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R K+G PA+   + +DT +D  WV C+ C GC ++         FN   S+T   + 
Sbjct: 91  YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 150

Query: 149 CQAAQCK---QVPNPTC-----GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT- 198
           C   +C    Q     C         C +  TYG  S  +     DT+   T +    T 
Sbjct: 151 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 210

Query: 199 -------FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFK 245
                  FGC    +G+         G+ G G+  LS+++Q  +L  S   FS+CL    
Sbjct: 211 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG-- 268

Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           + +  G L LG I +P  + YTPL+ +      Y +NL +I V  +   +P  +  F  +
Sbjct: 269 SDNGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIAVNGQ--KLPIDSSLFTTS 322

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTI 362
              GTI+DSGT    L   AY          V  ++ ++ S G   F T  SV    PT+
Sbjct: 323 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTV 382

Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           TL F  G+ +++  +N L+   +   + L       N    + ++ ++  ++   +YD+ 
Sbjct: 383 TLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 442

Query: 422 NSRLGVARELCT 433
           N R+G A   C+
Sbjct: 443 NMRMGWADYDCS 454


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 159/349 (45%), Gaps = 39/349 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   +GTP   +   MDT ++  W+   PC  C   +S +FN ++S+++KN+ C ++ 
Sbjct: 89  YLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTSST 148

Query: 154 CKQVPNP--TC--GGGACAFNLTYGSSTIA-ANLSQDTISL-----ATDIVPGYTFGC-- 201
           CK   +   +C  GG  C +++TYG    +  +LS D+++L     ++ + P    GC  
Sbjct: 149 CKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGH 208

Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQT-QNLYQSTFSYCLPSFKALSFSGS-LRLGP-- 257
           I     NS    G++G+GRG +SL+ Q   +   S FSYCL  + + S S S L  G   
Sbjct: 209 INVLQDNS-QSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLIFGEDV 267

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT---IIDS 314
           +   + +  TP++K   + + Y++ L A  VG          +++   + A T   +IDS
Sbjct: 268 VVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNN-------RIEYGERSNASTQNILIDS 320

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---PIVAPTITLMFSGMNV 371
           GT  T L     + +     + V              CY+     +  P IT  F+G +V
Sbjct: 321 GTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTGKQLNVPDITAHFNGADV 380

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            L  +        G I C    ++     + L +  N+ Q N  I YD+
Sbjct: 381 KLNSNGTFFPFEDG-IMCFGFISS-----NGLEIFGNIAQNNLLIDYDL 423


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 103/372 (27%), Positives = 157/372 (42%), Gaps = 38/372 (10%)

Query: 89  RQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSA 139
           + I Q+P       +++   IGTP   +   +DT +D  W+ C  C+GC   +   F+  
Sbjct: 54  QNIVQAPINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPL 113

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAAN-LSQDTISLATDI---- 193
           +S+T+ N+ C +  C ++    C     C +   YG +++    L+QDT +  ++     
Sbjct: 114 KSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPV 173

Query: 194 -VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY-QSTFSYCL-PSFKALSF 249
            +  + FGC    TG  +    GL+GLG G  SL++Q   L+    FS CL P    +  
Sbjct: 174 SLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKI 233

Query: 250 SGSLRLGPIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
           S  +  G   Q     +  TPL+   + +S Y+V LL I V      +       N T G
Sbjct: 234 SSRMSFGKGSQVLGNGVVTTPLVPREKDTS-YFVTLLGISVEDTYFPM-------NSTIG 285

Query: 308 -AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP--IVAPTIT 363
            A  ++DSGT    L    Y  V    R +V    +T     G   CY     +  PT+T
Sbjct: 286 KANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTNLKGPTLT 345

Query: 364 LMFSGMNVTLPQDNLLIHST--AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
             F G NV L      I  T     I CLA+    +  NS   V  N  Q N+ I +D+ 
Sbjct: 346 FHFVGANVLLTPIQTFIPPTPQTKGIFCLAIY---NRTNSDPGVYGNFAQSNYLIGFDLD 402

Query: 422 NSRLGVARELCT 433
              +      CT
Sbjct: 403 RQVVSFKPTDCT 414


>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 89/312 (28%), Positives = 129/312 (41%), Gaps = 52/312 (16%)

Query: 169 FNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
           F   YG  ++ A L  D++SL +  V  +TFGC          P G+ G GRG LSL AQ
Sbjct: 182 FYYAYGDGSLVAKLFSDSLSLPSVSVANFTFGCAHTTLAE---PIGVAGFGRGRLSLPAQ 238

Query: 229 ---TQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------GQPKRIK------------- 265
                    ++FSYCL S  +       R  P+        + KR+              
Sbjct: 239 LSVHSPHLGNSFSYCLVS-HSFDSDRVRRPSPLILGRFVDKKEKRVATTDDDDDGDETKK 297

Query: 266 ------YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
                 +T +L NP+    Y V+L  I +G+R +  P    + +   G G ++DSGT FT
Sbjct: 298 KKNEFVFTEMLVNPKHPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFT 357

Query: 320 RLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV--PIVAPTITLMFS--GMNV 371
            L A  Y +V + F  RVG        V    G   CY +   +  P + L F+  G  V
Sbjct: 358 MLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNGSTV 417

Query: 372 TLPQDNLLIHSTAG--------SITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDV 420
           TLP+ N       G         + CL +    D          ++ N QQQ   ++YD+
Sbjct: 418 TLPRRNYFYEFMDGGDGKEEKRKVGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDL 477

Query: 421 PNSRLGVARELC 432
            N R+G A+  C
Sbjct: 478 LNRRVGFAKRKC 489


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 108/428 (25%), Positives = 165/428 (38%), Gaps = 95/428 (22%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG--------------- 127
           +P+ +GR       Y    K+G+P Q   +A DT ++  W  C                 
Sbjct: 98  MPMRAGRDDALGE-YFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKN 156

Query: 128 ------------------------------CVGCSSTVFNSAQSTTFKNLGCQAAQCK-- 155
                                         C G    VF   +S +F+ + C + +CK  
Sbjct: 157 KTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKG----VFCPHRSKSFQAVTCASQKCKID 212

Query: 156 --------QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLATDIVPG-------YTF 199
                     P P+     C ++++Y   + A      DTI++  D+  G        T 
Sbjct: 213 LSQLFSLSLCPKPS---DPCLYDISYADGSSAKGFFGTDTITV--DLKNGKEGKLNNLTI 267

Query: 200 GCIQKATGNSV----PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLR 254
           GC  K+  N V       G+LGLG    S + +    Y + FSYCL       + S  L 
Sbjct: 268 GCT-KSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHLSHRNVSSYLT 326

Query: 255 LGPIGQPK---RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
           +G     K    IK T L+  P     Y VN++ I +G +++ IPP    FN  +  GT+
Sbjct: 327 IGGHHNAKLLGEIKRTELILFP---PFYGVNVVGISIGGQMLKIPPQVWDFN--SQGGTL 381

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT--SLGGFDTCYSVP----IVAPTITLM 365
           IDSGT  T L+ PAY  V +   + +     VT    G  D C+        V P +   
Sbjct: 382 IDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDFGALDFCFDAEGFDDSVVPRLVFH 441

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           F+G     P     I   A  + C+ +    D +    +VI N+ QQNH   +D+  + +
Sbjct: 442 FAGGARFEPPVKSYIIDVAPLVKCIGIVPI-DGIGGA-SVIGNIMQQNHLWEFDLSTNTI 499

Query: 426 GVARELCT 433
           G A  +CT
Sbjct: 500 GFAPSICT 507


>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
 gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 97/367 (26%), Positives = 158/367 (43%), Gaps = 63/367 (17%)

Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTFKNLGC 149
           ++GTP    ++A+DT +D  WVPC  C  C+ T            ++N  +S+T K + C
Sbjct: 102 ELGTPGVKFMVALDTGSDLFWVPCD-CSRCAPTHGASYASDFELSIYNPRESSTSKKVTC 160

Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT-----DIVPGY-TFGC 201
               C Q         +C + ++Y S+  + +  L +D + L T     + V  Y TFGC
Sbjct: 161 NNDMCAQRNRCLGTFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREFVEAYVTFGC 220

Query: 202 IQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
            Q  +G   +   P GL GLG   +S+  +   + L   +FS C          G +  G
Sbjct: 221 GQVQSGSFLDIAAPNGLFGLGMEKISVPSVLSREGLIADSFSMCF----GHDGIGRISFG 276

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             G P + + TP   NP   + Y V +   RVG  ++D+   AL            DSGT
Sbjct: 277 DKGSPDQ-EETPFNVNPAHPT-YNVTVTQARVGTMLIDVEFTAL-----------FDSGT 323

Query: 317 VFTRLVAPAYTAVRDVFR-----RRVGSNLTVTSLGGFDTCYSV-----PIVAPTITLMF 366
            FT +V PAY+ V + F      +R   +  +     F+ CY +       + P+++L  
Sbjct: 324 SFTYMVDPAYSRVSEKFHSLARDKRRPPDPRIP----FEYCYDMSPDANASLVPSMSLTM 379

Query: 367 SGMNVTLPQDNLLIHSTAGSIT-CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
            G       D +++ ST   I  CLA+  + +     LN+I       +R+++D     L
Sbjct: 380 KGGRHFTVYDPIIVISTQNEIVYCLAVVKSTE-----LNIIGQNFMTGYRVVFDREKLVL 434

Query: 426 GVARELC 432
           G  +  C
Sbjct: 435 GWKKFDC 441


>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
          Length = 431

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 112/398 (28%), Positives = 166/398 (41%), Gaps = 68/398 (17%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT 143
           P A+  +   + +  V   +GTP Q + M +DT ++ +W+ C G         + A   T
Sbjct: 42  PAANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNG---------SYAPPLT 92

Query: 144 FKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGSSTIAAN-LSQDTISLATDIVP---G 196
            ++      +   VP P C      AC  +L+Y  ++ A   L+ DT  L     P   G
Sbjct: 93  RRSTRRWRGRDLPVP-PFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVG 151

Query: 197 YTFGCI----------QKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
             FGCI             TG  V     GLLG+ RG+LS + QT       F+YC+   
Sbjct: 152 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGT---RRFAYCIAPG 208

Query: 245 KALSFSGSLRLGPIGQ-PKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPG 298
           +     G L LG  G     + YTPL++  +         Y V L  IRVG  ++ IP  
Sbjct: 209 EG---PGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKS 265

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--------GGFD 350
            L  + T    T++DSGT FT L+A AY A++  F  +  + L +  L        G FD
Sbjct: 266 VLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ--ARLLLAPLGEPGFVFQGAFD 323

Query: 351 TCYSVPI--------VAPTITLMFSGMNVTLPQDNLLI--------HSTAGSITCLAMAA 394
            C+  P         + P + L+  G  V +  + LL            A ++ CL    
Sbjct: 324 ACFRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGN 383

Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           + D       VI +  QQN  + YD+ N R+G A   C
Sbjct: 384 S-DMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 420


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 95/328 (28%), Positives = 144/328 (43%), Gaps = 42/328 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQ 153
           YI++  IG P   +   +DT +D  WV C+ C GC+   S +++ A+S +   L C +  
Sbjct: 87  YIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQL 146

Query: 154 CK-----QVPNPTCGGGA--CAFNLTYGSS---TIAANLSQDTISLATDIVPG-YTFGCI 202
           C+     ++ +  C      C ++  YG S   +    L  +T +     V    +FG  
Sbjct: 147 CQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANNVSFGRS 206

Query: 203 QKATGNSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL---PSFKALSFSGSLRLGPI 258
               G+      GL+GLGRG LSL++Q   L    F+YCL   P+  +    GSL     
Sbjct: 207 DTIDGSQFGGTAGLVGLGRGHLSLVSQ---LGAGRFAYCLAADPNVYSTILFGSLAALDT 263

Query: 259 GQPKRIKYTPLLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
                +  TPL+ NP+  R + YYVNL  I VG   + I  G    N     G   DSG 
Sbjct: 264 -SAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGA 322

Query: 317 VFTRLVAPAYTAVRDVFR---RRVGSNLTVTSLGGFDTCY------SVPIVAPTITLMFS 367
           + T L   AY  VR       +R+G +       G DTC+      +V  + P +     
Sbjct: 323 IDTSLKDAAYQVVRQAITSEIQRLGYD------AGDDTCFVAANQQAVAQMPPLVLHFDD 376

Query: 368 GMNVTLPQDNLLIHSTAGS---ITCLAM 392
           G +++L   N L  ST G    + C+A+
Sbjct: 377 GADMSLNGRNYLKTSTKGPSEVLVCMAI 404


>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 441

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 111/431 (25%), Positives = 176/431 (40%), Gaps = 90/431 (20%)

Query: 82  VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSST-- 134
           + PIA     T +  Y++   +GTP Q   + +DT +D  WVPC       C+ C +   
Sbjct: 15  IEPIA-----TYTDGYLLSLNLGTPPQVFQVYLDTGSDLTWVPCGTNTSYQCLECGNEHS 69

Query: 135 ------VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA-------------------- 168
                  F+ +QS +     C +  C  V +      ACA                    
Sbjct: 70  ISKPTPAFSLSQSYSSTRDLCGSRFCVDVHSSDNSHDACAAAGCSIPVFMSGLCTRLCPP 129

Query: 169 FNLTYGS-STIAANLSQDTISLATDI--------VPGYTFGCIQKATGNSV-PPQGLLGL 218
           F  TYG  + +  +L++DTI+L   I         PG+ FGC+    G+S+  P G+ G 
Sbjct: 130 FAYTYGGRALVLGSLARDTIALHGSIYGISVPIEFPGFCFGCV----GSSIREPIGIAGF 185

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSF---KALSFSGSLRLGPIGQPKR--IKYTPLLKNP 273
           G+G LSL +Q   L    FS+C   F   +  + +  + +G +    +    +TP+LK+ 
Sbjct: 186 GKGKLSLPSQLGFL-DKGFSHCFLGFWFARNPNITSPMVIGDLALSVKDGFLFTPMLKSL 244

Query: 274 RRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
              + YY+ L  + +G    +  PP     +     G I+D+GT +T L  P Y +V   
Sbjct: 245 TYPNFYYIGLEGVTIGDNAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLSDPFYASVLSS 304

Query: 333 FRRRVGSN--LTVTSLGGFDTCYSVPIVA--------PTITLMFSG-MNVTLPQDNLLIH 381
               V  N    +    GFD C  VP +         P IT+   G + + LP+++    
Sbjct: 305 LSSTVPYNRSYELEIRTGFDLCLKVPCMHAPCNDDELPPITVHLGGDVTLALPKESCYYA 364

Query: 382 STAGS----ITCLAMAAAPDN-VNSVLN---------------VIANMQQQNHRILYDVP 421
            TA      I CL      D+ V S  N               V+ + Q QN  ++YD+ 
Sbjct: 365 VTAPRNSVVIKCLLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEVVYDLE 424

Query: 422 NSRLGVARELC 432
           + R+G     C
Sbjct: 425 SGRVGFQPRDC 435


>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTP++T ++ +DT + A+WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 149/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +PG++FGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L+AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  +R  +            CY +  V     P I+L F      
Sbjct: 234 IPDRALSVLSQRIRELLLKRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDAARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
          Length = 379

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 92/358 (25%), Positives = 146/358 (40%), Gaps = 83/358 (23%)

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
           V   +GTP Q + M +DT ++ +W+ C        T F+  +S+++  + C +       
Sbjct: 70  VSLTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTTFDPNRSSSYSPVPCSS------- 121

Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
                                                     C  + + N+    GL+G+
Sbjct: 122 ----------------------------------------LTCTDQDSKNT----GLMGM 137

Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKN---- 272
            RGSLS ++Q        FSYC+       FSG L LG         + YTPL++     
Sbjct: 138 NRGSLSFVSQMDF---PKFSYCI---SDSDFSGVLLLGDANFSWLMPLNYTPLIQISTPL 191

Query: 273 PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
           P    + Y V L  I+V  +++ +P      + T    T++DSGT FT L+ P Y+A+R+
Sbjct: 192 PYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRN 251

Query: 332 VFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLL 379
            F  +    L V         GG D CY VP+        PT++LMF G  + +  D LL
Sbjct: 252 EFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFRGAEMKVSGDRLL 311

Query: 380 IH-----STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                    + S+ C     + D +     VI +  QQN  + +D+  SR+G A+  C
Sbjct: 312 YRVPGEVRGSDSVYCFTFGNS-DLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 368


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 110/395 (27%), Positives = 171/395 (43%), Gaps = 58/395 (14%)

Query: 81  SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------- 133
           SVV +A G     +   +   KIG   +   + +DT +D  WV C GC  C         
Sbjct: 58  SVVDVALGGNGRPTSNGLYYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMD 117

Query: 134 -TVFNSAQSTTFKNLGCQAAQCK-----QVPNPTCGGGACAFNLTYGS------STIAAN 181
            T+++   S T K + C    C      Q+   T  G +C +++TYG       S I  +
Sbjct: 118 LTLYDPNLSKTSKAVPCDDEFCTSTYDGQISGCT-KGMSCPYSITYGDGSTTSGSYIKDD 176

Query: 182 LSQDTISLATDIVPGYT---FGCIQKATG-----NSVPPQGLLGLGRGSLSLLAQ--TQN 231
           L+ D +      VP  T   FGC  K +G           G++G G+ + S+L+Q     
Sbjct: 177 LTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAG 236

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
             +  FS+CL S   +S  G   +G + QPK +K TPLL+       Y V L  I V   
Sbjct: 237 KVKRIFSHCLDS---ISGGGIFAIGEVVQPK-VKTTPLLQGMAH---YNVVLKDIEVAGD 289

Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA-VRDVFRRRVGSNLTVTSLGGFD 350
            + +P   L  + ++G GTIIDSGT    L    Y   +  +  +R G  L +     F 
Sbjct: 290 PIQLPSDIL--DSSSGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVE-DQF- 345

Query: 351 TCY------SVPIVAPTITLMF-SGMNV-TLPQDNLLIHSTAGSITCL----AMAAAPDN 398
           TC+      SV  + PT+   F  G+ + T P+D L +      + C+    +MA   D 
Sbjct: 346 TCFHYSDEESVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKE--DMWCVGWQKSMAQTKDG 403

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              +L  + ++   N  ++YD+ N  +G A   C+
Sbjct: 404 KELIL--LGDLVLANKLVVYDLDNMAIGWADYNCS 436


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 96/364 (26%), Positives = 158/364 (43%), Gaps = 47/364 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C S     F    S T++ + C   Q
Sbjct: 93  YTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC-TWQ 151

Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATGN- 208
           C    N       C +   Y   ST +  L +D +S    +++ P    FGC    TG+ 
Sbjct: 152 C----NCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCENDETGDI 207

Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            +    G++GLGRG LS++ Q   + +    FS C          G++ LG I  P  + 
Sbjct: 208 YNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCY--GGMGVGGGAMVLGGISPPADMV 265

Query: 266 YTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRL 321
           +T    +P RS  Y ++L  I V G+R        L  NP       GT++DSGT +  L
Sbjct: 266 FTH--SDPVRSPYYNIDLKEIHVAGKR--------LHLNPKVFDGKHGTVLDSGTTYAYL 315

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGF--DTCYSVPIVA--------PTITLMF-SGMN 370
              A+ A +    +   S   ++       D C+S   +         P + ++F +G  
Sbjct: 316 PESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPVVEMVFGNGHK 375

Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           ++L P++ L  HS      CL + +   N N    ++  +  +N  ++YD  +S++G  +
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFS---NGNDPTTLLGGIVVRNTLVMYDREHSKIGFWK 432

Query: 430 ELCT 433
             C+
Sbjct: 433 TNCS 436


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 160/372 (43%), Gaps = 37/372 (9%)

Query: 89  RQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSA 139
           + I Q+P       Y++   IGTP   +   +DT +D  WV C  C+GC + +   F+  
Sbjct: 50  QDIVQAPINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPL 109

Query: 140 QSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAAN-LSQDTISLATDI---- 193
           +S+T+ N+ C +  C +     C     C +   Y  S++    L+Q+T++L ++     
Sbjct: 110 KSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPI 169

Query: 194 -VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY-QSTFSYCL-PSFKALSF 249
            + G  FGC    TGN +    GL+GLG G  SL++Q   L+    FS CL P    ++ 
Sbjct: 170 SLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITI 229

Query: 250 SGSLRLGPIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
           S  +  G   +   + +  TPL++  +  + YYV LL I V           L  N T  
Sbjct: 230 SSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTY-------LPMNSTIE 282

Query: 308 AGT-IIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCY--SVPIVAPTIT 363
            G  ++DSGT    L    Y  V    + +V    +T     G   CY     +  PT+T
Sbjct: 283 KGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLKGPTLT 342

Query: 364 LMFSGMNVTLPQDNLLIHSTAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
             F G N+ L      I  T  +  + CLA+       NS   +  N  Q N+ I +D+ 
Sbjct: 343 YHFEGANLLLTPIQTFIPPTPETKGVFCLAITNC---ANSDPGIYGNFAQTNYLIGFDLD 399

Query: 422 NSRLGVARELCT 433
              +      CT
Sbjct: 400 RQIVSFKPTDCT 411


>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L +  + +  +        +A AP    S++
Sbjct: 289 DLGRRGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/362 (25%), Positives = 158/362 (43%), Gaps = 45/362 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    S+T++ + C    
Sbjct: 112 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC-TID 170

Query: 154 CKQVPNPTCGGG--ACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATG 207
           C       C G    C +   Y   ST +  L +D IS    +++ P    FGC    TG
Sbjct: 171 C------NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETG 224

Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
           +  S    G++GLGRG LS++ Q   + +   +FS C          G++ LG I  P  
Sbjct: 225 DLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV--GGGAMVLGGISPPSD 282

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
           + +     +P RS  Y ++L  + V G+R   +P  A  F+     GT++DSGT +  L 
Sbjct: 283 MTFA--YSDPDRSPYYNIDLKEMHVAGKR---LPLNANVFD--GKHGTVLDSGTTYAYLP 335

Query: 323 APAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMNV 371
             A+ A +D   + + S   ++       D C+S        +    P + ++F +G   
Sbjct: 336 EAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKY 395

Query: 372 TL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           +L P++ +  HS      CL +     N N    ++  +  +N  ++YD   +++G  + 
Sbjct: 396 SLSPENYMFRHSKVRGAYCLGIF---QNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKT 452

Query: 431 LC 432
            C
Sbjct: 453 NC 454


>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 441

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/367 (26%), Positives = 166/367 (45%), Gaps = 40/367 (10%)

Query: 98  IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC------SSTVFNSAQSTTFKNLGCQA 151
           +V   IGTP Q   M +DT +  +W+ C    G       +++ F+ + S++F  L C  
Sbjct: 70  VVTLPIGTPPQLQQMVLDTGSQVSWIHCDNKKGPQKKQPPTTSSFDPSLSSSFFALPCNH 129

Query: 152 AQCK-QVPN---PT-CGGGA-CAFNLTYGSSTI-AANLSQDTISLATDIV-PGYTFGCIQ 203
             CK QVP+   PT C     C ++ +Y   T+   NL ++ I+L+  +  P    GC  
Sbjct: 130 PLCKPQVPDISLPTDCDANRLCHYSFSYTDGTVVEGNLVRENIALSPSLTTPPIILGCAN 189

Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
           ++       +G+LG+  G LS   Q +    + FSY +P  +    SGSL LG       
Sbjct: 190 QSDD----ARGILGMNLGRLSFPNQAK---ITKFSYFVPVKQTQPGSGSLYLGNNPNSSC 242

Query: 264 IKYTPLLKNPRRSSLYYVNL---------LAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
            +Y  LL   +  S    NL           I +G + ++IPP   + + T    TIIDS
Sbjct: 243 FRYVKLLTFSKSQSQRMPNLDPLAFTLPMQGISIGGKKLNIPPSVFKPDTTGFGQTIIDS 302

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVP------IVAPTITLMF 366
           G+ F+ +V  AY  +R+   ++VGS +    + G   D C+         +V   +    
Sbjct: 303 GSEFSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVADICFDGDATEIGRLVGDMVFEFE 362

Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
            G+ + +P++ +LI    G + C  +  A + +    N+I N  QQN  + +D+   R+G
Sbjct: 363 KGVEIVIPKERVLIE-VDGGVHCFGIGRA-EGLGGGGNIIGNFYQQNLWVEFDLAKHRVG 420

Query: 427 VARELCT 433
                C+
Sbjct: 421 FRGANCS 427


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/424 (23%), Positives = 174/424 (41%), Gaps = 59/424 (13%)

Query: 46  FKPSKPLSWEESVLEMLAKDQARLQ--FLSSLAVARKSVVPIASGRQITQSPTYIVRAKI 103
           FK     + +E  LE       R     L+S+ +      P+    ++     Y  + K+
Sbjct: 27  FKVQHKFAGKEKKLEHFKSHDTRRHSRMLASIDL------PLGGDSRVDSVGLYFTKIKL 80

Query: 104 GTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLGCQAAQCK 155
           G+P +   + +DT +D  WV C  C  C S        ++F+   S+T K +GC    C 
Sbjct: 81  GSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCS 140

Query: 156 QVP-----NPTCGGGACAFNLTYG-SSTIAANLSQDTISLAT---DIVPG-----YTFGC 201
            +       P  G   C++++ Y   ST   N  +D ++L     D+  G       FGC
Sbjct: 141 FISQSDSCQPAVG---CSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGC 197

Query: 202 IQKATG----NSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRL 255
               +G    +     G++G G+ + S+L+Q       +  FS+CL + K     G   +
Sbjct: 198 GSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKG---GGIFAV 254

Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
           G +  PK +K TP++ N      Y V L+ + V    +D+PP  ++       GTI+DSG
Sbjct: 255 GVVDSPK-VKTTPMVPNQMH---YNVMLMGMDVDGTALDLPPSIMR-----NGGTIVDSG 305

Query: 316 TVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVT 372
           T         Y ++ +    R+ V  ++   +   F    +V +  P ++  F   + +T
Sbjct: 306 TTLAYFPKVLYDSLIETILARQPVKLHIVEDTFQCFSFSENVDVAFPPVSFEFEDSVKLT 365

Query: 373 L-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI--ANMQQQNHRILYDVPNSRLGVAR 429
           + P D L   +    + C    A          VI   ++   N  ++YD+ N  +G A 
Sbjct: 366 VYPHDYLF--TLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWAD 423

Query: 430 ELCT 433
             C+
Sbjct: 424 HNCS 427


>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
          Length = 503

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 104/392 (26%), Positives = 153/392 (39%), Gaps = 78/392 (19%)

Query: 114 MDTSNDAAWVPCTG-----CVG--------------------CSSTVFNSAQSTTFKNLG 148
           +DT +D  W PC       C G                    C+S + ++A ++   +  
Sbjct: 109 LDTGSDLVWFPCAPFTCMLCEGKPTPGRLGPLPPPPDSRRIPCASPLCSAAHASAPPSDL 168

Query: 149 CQAAQC--KQVPNPTCGGG-ACA-FNLTYGSSTIAANLSQDTISLATDI-------VPGY 197
           C  A+C  + +   +CG   AC      YG  ++ A+L +  ++L           V  +
Sbjct: 169 CAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAVAVDNF 228

Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL--PSFKALSFSGSLRL 255
           TF C   A G    P G+ G GRG LSL  Q        FSYCL   SF+A      +R 
Sbjct: 229 TFACAHTALGE---PVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRL---IRP 282

Query: 256 GPI------------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
            P+             +     YTPLL NP+    Y V L A+ VG   +   P   + +
Sbjct: 283 SPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPELARVD 342

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-----VTSLGGFDTCYSVPIV 358
                G ++DSGT FT L    Y  V + F R + +             G   CY     
Sbjct: 343 RAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRYAAS 402

Query: 359 ---APTITLMFSG-MNVTLPQDNLLI-----HSTAGS----ITCLAMA----AAPDNVNS 401
               P + L F G   V LP+ N  +      + AG+    + CL +     A+ +  + 
Sbjct: 403 DRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDG 462

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
               + N QQQ   ++YDV   R+G AR  CT
Sbjct: 463 PAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 494


>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
          Length = 490

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 113/404 (27%), Positives = 170/404 (42%), Gaps = 74/404 (18%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS-------TVFNSAQSTTFKNL 147
           Y     +GTP Q L + ++T +  +WVP T      CSS        VF+   S++ + +
Sbjct: 89  YAFTVSLGTPPQPLPVLLETGSHLSWVPSTSSYSANCSSLSAASPLHVFHPKNSSSSRLI 148

Query: 148 GCQAAQCKQVPNP----------TCGGGACA------------FNLTYGSSTIAANLSQD 185
           GC+   C  + +P          +C G  C             + + YGS + A  L  D
Sbjct: 149 GCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLISD 208

Query: 186 TISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK 245
           T+      V  +  GC   +     PP GL G GRG+ S+ +Q   L  + FSYCL S +
Sbjct: 209 TLRTPGRAVRNFVIGCSLASVHQ--PPSGLAGFGRGAPSVPSQ---LGLTKFSYCLLSRR 263

Query: 246 ---ALSFSGSLRLGPIGQPKR---IKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVDI 295
                + SG L LG  G       ++Y PL ++    P  S  YY+ L AI VG + V +
Sbjct: 264 FDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKSVQL 323

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTR----LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT 351
           P  A       G G I+DSGT F+     +  P   AV      R   +  V    G   
Sbjct: 324 PERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLSP 382

Query: 352 CYSVP-----IVAPTITLMFSGMNV-TLPQDNLLI---HSTAGSITCLAMAAAPDNVNSV 402
           C+++P     +  P ++L F G +V  LP +N  +    + +G    +A A     V+ V
Sbjct: 383 CFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSDV 442

Query: 403 LN--------------VIANMQQQNHRILYDVPNSRLGVARELC 432
                           ++ + QQQN+ I YD+   RLG  R+ C
Sbjct: 443 PTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 486


>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 93/351 (26%), Positives = 145/351 (41%), Gaps = 59/351 (16%)

Query: 124 PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTY-GSSTIA 179
           PC  C      VFN   S+++  + C +  C Q+    C     GAC +   Y G     
Sbjct: 5   PCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHGVTK 64

Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
             L+ D +++  D+     FGC   + G  +    GL+GLGRG LSL++Q   L    F 
Sbjct: 65  GTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQ---LSVHRFM 121

Query: 239 YCLPSFKALSFSGSLRLGPIGQP-----KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           YCLP   + + SG L LG           R+  T +  + R  S YY+NL  + VG    
Sbjct: 122 YCLPPPMSRT-SGKLVLGAGADAVRNMSDRVTVT-MSSSTRYPSYYYLNLDGLAVG---- 175

Query: 294 DIPPGALQFNPTT------------------------GAGTIIDSGTVFTRLVAPAYTAV 329
           D  PG  + N T+                          G I+D  +  + L    Y  +
Sbjct: 176 DQTPGTTR-NATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDEL 234

Query: 330 RDVFRRRVGSNLTVTSLG-GFDTCYSVP-------IVAPTITLMFSGMNVTLPQDNLLIH 381
            D     +       SL  G D C+ +P       +  PT++L F G  + L +D L + 
Sbjct: 235 ADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLFV- 293

Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            T G + CL +        S ++++ N Q QN R+L+++   ++  A+  C
Sbjct: 294 -TDGRMMCLMIGR-----TSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338


>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSKGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 152/360 (42%), Gaps = 36/360 (10%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y++   IGTP   +    DT +D  W   VPC  C    + +F+  +ST+++N+ C +  
Sbjct: 25  YLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSKL 84

Query: 154 CKQVPNPTCG-GGACAFNLTYGSSTIAAN-LSQDTISLAT---DIVP--GYTFGCIQKAT 206
           C ++    C     C +   Y S+ I    L+Q+TI+L++   + VP  G  FGC    T
Sbjct: 85  CHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHNNT 144

Query: 207 GN-SVPPQGLLGLGRGSLSLLAQTQNLYQST-FSYCLPSFKA-LSFSGSLRLGPIGQ--P 261
           G  +    G++GLG G +S ++Q  + +    FS CL  F   +S S  + LG   +   
Sbjct: 145 GGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVSG 204

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-----AGTIIDSGT 316
           K +  TPL+    ++  Y+V LL I VG          L FN ++          +DSGT
Sbjct: 205 KGVVSTPLVAKQDKTP-YFVTLLGISVGNTY-------LHFNGSSSQSVEKGNVFLDSGT 256

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG-GFDTCYSVP--IVAPTITLMFSGMNVTL 373
             T L    Y  +    R  V        L  G   CY     +  P +T  F G +V L
Sbjct: 257 PPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNLRGPVLTAHFEGGDVKL 316

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
                 +    G + CL       N +S   V  N  Q N+ I +D+    +      CT
Sbjct: 317 LPTQTFVSPKDG-VFCLGFT----NTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDCT 371


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 159/370 (42%), Gaps = 41/370 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R K+G+P +   + +DT +D  WV C+ C GC S+         FN   S+T   + 
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176

Query: 149 CQAAQCK---QVPNPTC---GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT--- 198
           C   +C    Q     C       C +  TYG  S  +     DT+   T +    T   
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236

Query: 199 -----FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKAL 247
                FGC    +G+         G+ G G+  LS+++Q  +L  S   FS+CL    + 
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG--SD 294

Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
           +  G L LG I +P  + YTPL+ +      Y +NL +I V  +   +P  +  F  +  
Sbjct: 295 NGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIVVNGQ--KLPIDSSLFTTSNT 348

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTITL 364
            GTI+DSGT    L   AY    +     V  ++ ++ S G   F T  SV    PT++L
Sbjct: 349 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSL 408

Query: 365 MF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            F  G+ +T+  +N L+   +     L       N    + ++ ++  ++   +YD+ N 
Sbjct: 409 YFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANM 468

Query: 424 RLGVARELCT 433
           R+G     C+
Sbjct: 469 RMGWTDYDCS 478


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 159/370 (42%), Gaps = 41/370 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R K+G+P +   + +DT +D  WV C+ C GC S+         FN   S+T   + 
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 149 CQAAQCK---QVPNPTC---GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT--- 198
           C   +C    Q     C       C +  TYG  S  +     DT+   T +    T   
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210

Query: 199 -----FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKAL 247
                FGC    +G+         G+ G G+  LS+++Q  +L  S   FS+CL    + 
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG--SD 268

Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
           +  G L LG I +P  + YTPL+ +      Y +NL +I V  +   +P  +  F  +  
Sbjct: 269 NGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIVVNGQ--KLPIDSSLFTTSNT 322

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTITL 364
            GTI+DSGT    L   AY    +     V  ++ ++ S G   F T  SV    PT++L
Sbjct: 323 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSL 382

Query: 365 MF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
            F  G+ +T+  +N L+   +     L       N    + ++ ++  ++   +YD+ N 
Sbjct: 383 YFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANM 442

Query: 424 RLGVARELCT 433
           R+G     C+
Sbjct: 443 RMGWTDYDCS 452


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/384 (24%), Positives = 165/384 (42%), Gaps = 46/384 (11%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
           +P+    +      Y  + K+G+P +   + +DT +D  WV C  C        +G   +
Sbjct: 63  LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLS 122

Query: 135 VFNSAQSTTFKNLGCQAAQCKQV-PNPTCGGGA-CAFNLTYGS-STIAANLSQDTISLAT 191
           +++S  S+T KN+GC+ A C  +  + TCG    C++++ YG  ST   +  +D I+L  
Sbjct: 123 LYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITL-- 180

Query: 192 DIVPG----------YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQS 235
           D V G            FGC +  +G          G++G G+ + S+++Q       + 
Sbjct: 181 DQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKR 240

Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
            FS+CL +       G   +G +  P  +K TPL+ N      Y V L  + V    +D+
Sbjct: 241 IFSHCLDNMNG---GGIFAIGEVESP-VVKTTPLVPNQVH---YNVILKGMDVDGEPIDL 293

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDTCY 353
           PP     N     GTIIDSGT    L    Y ++ +    +++V  ++   +   F    
Sbjct: 294 PPSLASTNGD--GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFACFSFTS 351

Query: 354 SVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI--ANM 409
           +     P + L F   + +++ P D L   S    + C    +         +VI   ++
Sbjct: 352 NTDKAFPVVNLHFEDSLKLSVYPHDYLF--SLREDMYCFGWQSGGMTTQDGADVILLGDL 409

Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
              N  ++YD+ N  +G A   C+
Sbjct: 410 VLSNKLVVYDLENEVIGWADHNCS 433


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 159/372 (42%), Gaps = 49/372 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R K+GTP +   + +DT +D  WV C+ C  C  T         F++  S+T + + 
Sbjct: 81  YFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVP 140

Query: 149 CQAAQCK---QVPNPTC--GGGACAFNLTYGS-STIAANLSQDTI--------SLATDIV 194
           C    C    Q     C      C++   YG  S  +     DT         SL  +  
Sbjct: 141 CSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSS 200

Query: 195 PGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
               FGC    +G+         G+ G G+G LS+++Q  +  +    FS+CL      S
Sbjct: 201 AAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGED--S 258

Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
             G L LG I +P  I Y+PL+ +      Y ++L +I V  +++ I P A  F  ++  
Sbjct: 259 GGGILVLGEILEPG-IVYSPLVPSQPH---YNLDLQSIAVSGQLLPIDPAA--FATSSNR 312

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITL 364
           GTIID+GT    LV  AY          V S L   ++   + CY    SV  V P ++ 
Sbjct: 313 GTIIDTGTTLAYLVEEAYDPFVSAITAAV-SQLATPTINKGNQCYLVSNSVSEVFPPVSF 371

Query: 365 MFSGMNVTL--PQDNL--LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
            F+G    L  P++ L  L +    ++ C+        +   + ++ ++  ++   +YD+
Sbjct: 372 NFAGGATMLLKPEEYLMYLTNYAGAALWCIGF----QKIQGGITILGDLVLKDKIFVYDL 427

Query: 421 PNSRLGVARELC 432
            + R+G A   C
Sbjct: 428 AHQRIGWANYDC 439


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 106/414 (25%), Positives = 167/414 (40%), Gaps = 46/414 (11%)

Query: 51  PLSWEESVLEMLAKDQARLQFL-SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
           P++ E+ +  +     AR ++L +S+     S        Q  ++  ++V   +G P   
Sbjct: 49  PITPEDHIKHLTDISSARFKYLQNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVP 108

Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST-----VFNSAQSTTFKNLGCQAAQCKQVPNPTCG- 163
            L  MDT +   W+ C  C  CSS      VFN A S+TF    C    C+  PN  CG 
Sbjct: 109 QLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGS 168

Query: 164 GGACAFNLTYGSSTIAAN-LSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ--GL 215
              C +   Y S T +   L+++ ++  T      +     FGC     G  +     G+
Sbjct: 169 SNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGC-GYENGEQLESHFTGI 227

Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-GSLRLGP----IGQPKRIKYTPLL 270
           LGLG    SL  Q      S FSYC+      ++    L LG     +G P  I++    
Sbjct: 228 LGLGAKPTSLAVQL----GSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFE--- 280

Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN-PTTGAGTIIDSGTVFTRLVAPAYTAV 329
                +S+YY+NL  I VG   ++I P   +   P TG   I+DSGT++T L   AY  +
Sbjct: 281 ---TENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGV--ILDSGTLYTWLADIAYREL 335

Query: 330 RDVFRRRVGSNLTVTSLGGFDTCY----SVPIVA-PTITLMFSGMNVTLPQDNLLIH--S 382
            +  +  +   L       F  CY    S  ++  P +T  F+G      +   + +  S
Sbjct: 336 YNEIKSILDPKLERFWFRDF-LCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYPLS 394

Query: 383 TAGSITCLAMAAAPDNVN----SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
              +     M+  P   +         I  M QQ + I YD+    + + R  C
Sbjct: 395 EPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDC 448


>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
           Group]
          Length = 476

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 166/370 (44%), Gaps = 61/370 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFK 145
           Y V A +GTP  T L+A+DT +D  WVPC  C+ C+             V++ AQSTT +
Sbjct: 63  YAVVA-LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSR 120

Query: 146 NLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGY 197
            + C +  C           +C +++ Y S   +++  L +D + L +D      +    
Sbjct: 121 KVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 180

Query: 198 TFGCIQKATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGS 252
            FGC Q  TG+   S  P GLLGLG  S S+  L  ++ L  ++FS C          G 
Sbjct: 181 MFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGR 236

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
           +  G  G   + K TPL  N  + + YY + +  I VG + +           +T    I
Sbjct: 237 INFGDTGSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAI 282

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMF 366
           +DSGT FT L  P YT +   F  ++ S  N+  +S+  F+ CYSV    IV P ++L  
Sbjct: 283 VDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTA 341

Query: 367 SGMNVTLPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
            G ++  P ++ +I  T  +      CLA+  +       +N+I        ++++D   
Sbjct: 342 KGGSI-FPVNDPIITITDNAFNPVGYCLAIMKSEG-----VNLIGENFMSGLKVVFDRER 395

Query: 423 SRLGVARELC 432
             LG     C
Sbjct: 396 MVLGWKNFNC 405


>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 518

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 153/363 (42%), Gaps = 55/363 (15%)

Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTFKNLGC 149
           ++GTP    ++A+DT +D  WVPC  C  C+ T            +++  QS+T K + C
Sbjct: 106 ELGTPGMKFMVALDTGSDLFWVPC-DCSKCAPTQGVAYASDFELSIYDPKQSSTSKKVTC 164

Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGC 201
               C           +C + ++Y S+  + +  L +D + L ++      I    TFGC
Sbjct: 165 NNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQESIKAYVTFGC 224

Query: 202 IQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
            Q  +G   N+  P GL GLG   +S+  +   + L   +FS C          G +  G
Sbjct: 225 GQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCF----GHDGVGRISFG 280

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             G P + + TP   NP   S Y +++  +RVG  +VD+   AL            DSGT
Sbjct: 281 DKGSPDQ-EETPFNSNPSHPS-YNISVTQVRVGTTLVDVDFTAL-----------FDSGT 327

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG-GFDTCYSVPIVA-----PTITLMFSGMN 370
            FT L+ P Y  V + F  +             F+ CY +   A     P+++L   G  
Sbjct: 328 SFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSPGANSSLIPSMSLTMKGRG 387

Query: 371 VTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
                D +++ +T    + CLA+  + +     LN+I       +R+++D     LG   
Sbjct: 388 HFTVFDPIIVITTQNELVYCLAIVKSTE-----LNIIGQNFMTGYRVVFDREKLVLGWKE 442

Query: 430 ELC 432
             C
Sbjct: 443 TDC 445


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 170/390 (43%), Gaps = 56/390 (14%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
           +P+      T++  Y  R  IGTPA+   + +DT +D  WV C  C GC        + T
Sbjct: 76  LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELT 135

Query: 143 TFKNLGCQAAQ---CKQ---VPN-----PTCGGGA-CAFNLTYGSSTIAAN------LSQ 184
            +   G Q+ +   C Q   V N     P+C   + C ++++YG  +  A       L  
Sbjct: 136 MYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQY 195

Query: 185 DTISLATDIVPG---YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQS 235
           + +S      P     +FGC  K  G+    ++   G+LG G+ + S+L+Q       + 
Sbjct: 196 NQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRK 255

Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
            F++CL +       G   +G + QPK +K TPL+ +      Y V L  I VG   + +
Sbjct: 256 MFAHCLDTVNG---GGIFAIGNVVQPK-VKTTPLVSDMPH---YNVILKGIDVGGTALGL 308

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY- 353
           P     F+     GTIIDSGT    +    Y A+   VF +    +++V +L  F +C+ 
Sbjct: 309 PTNI--FDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKH--QDISVQTLQDF-SCFQ 363

Query: 354 ---SVPIVAPTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM----AAAPDNVNSVL 403
              SV    P +T  F G +V+L   P D L       ++ C+          D  + VL
Sbjct: 364 YSGSVDDGFPEVTFHFEG-DVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVL 420

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
             + ++   N  +LYD+ N  +G A   C+
Sbjct: 421 --LGDLVLSNKLVLYDLENQAIGWADYNCS 448


>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
           Group]
          Length = 260

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/247 (34%), Positives = 119/247 (48%), Gaps = 24/247 (9%)

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----LSFS 250
           PG  FGC  ++ G      GL+GLGRG LSL+ Q   L    F Y L S  +    +SF 
Sbjct: 15  PGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQ---LNVEAFGYRLSSDLSAPSPISF- 70

Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           GSL     G       TPLL NP    L  YYV L  I VG ++V IP G   F+ +TGA
Sbjct: 71  GSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGA 130

Query: 309 GTII-DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD-TCY---SVPIVAPTIT 363
           G +I DSGT  T L  PAYT VRD    ++G      +    D  C+   S     P++ 
Sbjct: 131 GGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMV 190

Query: 364 LMFS-GMNVTLPQDNLL--IHSTAGSIT-CLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
           L F  G ++ L  +N L  +    G    C ++  +    +  L +I N+ Q +  +++D
Sbjct: 191 LHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKS----SQALTIIGNIMQMDFHVVFD 246

Query: 420 VP-NSRL 425
           +  N+R+
Sbjct: 247 LSGNARM 253


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 111/436 (25%), Positives = 180/436 (41%), Gaps = 62/436 (14%)

Query: 5   LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVLEMLA 63
           ++F+ +  F+ SLS  LN       +  ++++ H  S  SP ++P++            +
Sbjct: 8   ILFYFSLCFIISLSHALN-------NGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRS 60

Query: 64  KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
            ++A   + ++L    +S V       I     Y++   +GTP   L    DT +D  W+
Sbjct: 61  INRANHFYKTALTNTPQSTV-------IPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWL 113

Query: 124 ---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAA 180
              PC  C   ++  F  ++S+T+KN+ C +  CK                    S    
Sbjct: 114 QCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDLCK--------------------SGQQG 153

Query: 181 NLSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQTQN 231
           NLS DT++L +        P    GC    T N+V  +    G++GLG G  SL+ Q  +
Sbjct: 154 NLSVDTLTLESSTGHPISFPKTVIGC---GTDNTVSFEGASSGIVGLGGGPASLITQLGS 210

Query: 232 LYQSTFSYC-LPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
              + FSYC LP+    + +  L  G   +     +  TP++K       YY+ L A  V
Sbjct: 211 SIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKK-DPIVFYYLTLEAFSV 269

Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
           G + ++    +   N       IIDSGT  T +    Y  +       V           
Sbjct: 270 GNKRIEFEGSS---NGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRL 326

Query: 349 FDTCYSVPIVA---PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS-VLN 404
           F+ CYSV       P IT  F G +V L   +  +   A  I CLA A     + S V++
Sbjct: 327 FNLCYSVTSDGYDFPIITTHFKGADVKLHPISTFV-DVADGIVCLAFATTSAFIPSDVVS 385

Query: 405 VIANMQQQNHRILYDV 420
           +  N+ QQN  + YD+
Sbjct: 386 IFGNLAQQNLLVGYDL 401


>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y+    +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L +  + +  +        +A AP    S++
Sbjct: 289 DLGRHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
          Length = 513

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 163/364 (44%), Gaps = 60/364 (16%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFKNLGCQA 151
           +GTP  T L+A+DT +D  WVPC  C+ C+             V++ AQSTT + + C +
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPLQSPNYGSLKFDVYSPAQSTTSRKVPCSS 163

Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQ 203
             C           +C +++ Y S   +++  L +D + L +D      +     FGC Q
Sbjct: 164 NLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQ 223

Query: 204 KATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
             TG+   S  P GLLGLG  S S+  L  ++ L  ++FS C          G +  G  
Sbjct: 224 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGRINFGDT 279

Query: 259 GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           G   + K TPL  N  + + YY + +  I VG + +           +T    I+DSGT 
Sbjct: 280 GSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAIVDSGTS 325

Query: 318 FTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVT 372
           FT L  P YT +   F  ++ S  N+  +S+  F+ CYSV    IV P ++L   G ++ 
Sbjct: 326 FTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTAKGGSI- 383

Query: 373 LPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            P ++ +I  T  +      CLA+  +       +N+I        ++++D     LG  
Sbjct: 384 FPVNDPIITITDNAFNPVGYCLAIMKSEG-----VNLIGENFMSGLKVVFDRERMVLGWK 438

Query: 429 RELC 432
              C
Sbjct: 439 NFNC 442


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 91/324 (28%), Positives = 140/324 (43%), Gaps = 45/324 (13%)

Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFN-------LTYGSSTI-AANLSQDTISLATD 192
           S+TFK + C    C+  P+      ACA          +YG  +I A ++ +DT +  + 
Sbjct: 2   SSTFKAVACPDPICR--PSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSP 59

Query: 193 -----IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
                 V    FGC    TG  V  + G+ G GRG  SL +Q   L    FSYCL +   
Sbjct: 60  NGVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQ---LKVGRFSYCL-TLVT 115

Query: 247 LSFSGSLRLGPIGQPKRIKY--------TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
            S S  + LG    P  ++         TP++ NP   + YY++L  I VG+  +     
Sbjct: 116 ESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKS 175

Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV-----TSLGGFDTCY 353
                     GT+IDSGT  T L      AV ++ +  + +   +     T   G   C+
Sbjct: 176 VFALKKDGSGGTVIDSGTSLTTLPE----AVFELLQEELVAQFPLPRYDNTPEVGDRLCF 231

Query: 354 SVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
             P     +  P + L  +G ++ LP+DN  +      + CL +  A D   + + +I N
Sbjct: 232 RRPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAED---TTMVLIGN 288

Query: 409 MQQQNHRILYDVPNSRLGVARELC 432
            QQQN  ++YDV N++L  A   C
Sbjct: 289 FQQQNMHVVYDVENNKLLFAPAQC 312


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 157/369 (42%), Gaps = 39/369 (10%)

Query: 91  ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
           I+   +Y++   +GTP  ++L   DT +D  W  C  C  C   V   F+  +S T+K L
Sbjct: 88  ISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTL 147

Query: 148 GCQAAQCKQVPNP-TCGG-GACAFNLTYGS-STIAANLSQDTISLATDI-----VPGYTF 199
           GC    C+ +    +CG    C  + +YG  S    +LS +T ++ +        PG  F
Sbjct: 148 GCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAF 207

Query: 200 GCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP 257
           GC     G  +    GL+GLG G LSL+ Q  +     FSYCL P     + S  +  G 
Sbjct: 208 GCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGK 267

Query: 258 --IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI---------PPGALQFNPTT 306
             +        TPL+K     + YY+ L  + +G   V           P  A + N   
Sbjct: 268 SAVVSGSGTVSTPLIKG-TPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESN--- 323

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS--VPIVAPTITL 364
               IIDSGT  T L    YT +     + +G   T    G F  CYS    +  PTIT 
Sbjct: 324 ---IIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEIPTITA 380

Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
            F G +V LP  N  + +    + C +M  +     S L +  N+ Q N  + YD+ N++
Sbjct: 381 HFIGADVQLPPLNTFVQAQE-DLVCFSMIPS-----SNLAIFGNLSQMNFLVGYDLKNNK 434

Query: 425 LGVARELCT 433
           +      CT
Sbjct: 435 VSFKPTDCT 443


>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
 gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
          Length = 439

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 105/427 (24%), Positives = 165/427 (38%), Gaps = 107/427 (25%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-----CVGCSSTV-----FNSAQSTTFKN 146
           Y++   +GTP Q   + +DT +D  WVPC       C+ C S+V     F  ++ST+   
Sbjct: 25  YLLSLNLGTPPQVFQVYLDTGSDLTWVPCGSSSSYQCLDCGSSVKPTPTFLPSESTSNTR 84

Query: 147 LGCQAAQCKQVPN---------------PTCGGGAC-----AFNLTYGSSTIA-ANLSQD 185
             C +  C  V +               P   GG C      F+ TYG   +   +LS+D
Sbjct: 85  DLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALVLGSLSRD 144

Query: 186 TISLATDI-------------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQN 231
           +++L                  PG+ FGC+    G+S+  P G+ G GRG+LSL +Q   
Sbjct: 145 SVTLHGSTHGSGAGAGPLPVAFPGFGFGCV----GSSIREPLGIAGFGRGALSLPSQLGF 200

Query: 232 LYQSTFSYCL--------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNL 283
           L +  FS+C         P+F +    G L L          +TP+L +    + YYV L
Sbjct: 201 LGKG-FSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLTSATYPNFYYVGL 259

Query: 284 LAIRVGRR----VVDIPPGALQFNPTTGAGTIIDSGTVFTRL--------------VAPA 325
             + +G       +  PP     +     G ++D+GT +T+L               AP 
Sbjct: 260 EGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPDPFYASVLASLISAAPP 319

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA--------PTITLMFSG--------- 368
           Y   RD+  R            GFD C+ VP           P ITL  +G         
Sbjct: 320 YERSRDLEART-----------GFDLCFKVPCARAPCADDELPPITLHLAGGARLALPKL 368

Query: 369 ---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
                VT  +D++++         +       +      V+ + Q QN  ++YD+   R+
Sbjct: 369 SSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYDLAAGRV 428

Query: 426 GVARELC 432
           G     C
Sbjct: 429 GFRPRDC 435


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 107/415 (25%), Positives = 175/415 (42%), Gaps = 47/415 (11%)

Query: 56  ESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
           E + E      AR + L   A A   VV  P+           Y  R K+G PA+   + 
Sbjct: 46  EHLKERDGAHHARRRGLLGGAPAVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEYFVQ 105

Query: 114 MDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVPNPTC 162
           +DT +D  WV C+ C GC ++         FN   S+T   + C   +C    Q     C
Sbjct: 106 IDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIPCSDDRCTAALQTGEAVC 165

Query: 163 GGGA-----CAFNLTYGSST------IAANLSQDTI---SLATDIVPGYTFGCIQKATGN 208
                    C +  TYG  +      ++  +  DT+       +      FGC    +G+
Sbjct: 166 QSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTANSSASVVFGCSNSQSGD 225

Query: 209 SVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKALSFSGSLRLGPIGQPK 262
            +       G+ G G+  LS+++Q  +L  S  TFS+CL    + +  G L LG I +P 
Sbjct: 226 LMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKG--SDNGGGILVLGEIVEPG 283

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
            + +TPL+ +      Y +NL +I V  +   +P  +  F  +   GTI+DSGT    LV
Sbjct: 284 LV-FTPLVPSQPH---YNLNLESIAVSGQ--KLPIDSSLFATSNTQGTIVDSGTTLVYLV 337

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVPIVAPTITLMFS-GMNVTLPQDNL 378
             AY    +     V  ++      G   F T  SV    PT TL F  G+++T+  +N 
Sbjct: 338 DGAYDPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPTATLYFKGGVSMTVKPENY 397

Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           L+    GS+    +       +  + ++ ++  ++   +YD+ N R+G A   C+
Sbjct: 398 LLQQ--GSVDNNVLWCIGWQRSQGITILGDLVLKDKIFVYDLANMRMGWADYDCS 450


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 170/390 (43%), Gaps = 56/390 (14%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
           +P+      T++  Y  R  IGTPA+   + +DT +D  WV C  C GC        + T
Sbjct: 76  LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELT 135

Query: 143 TFKNLGCQAAQ---CKQ---VPN-----PTCGGGA-CAFNLTYGSSTIAAN------LSQ 184
            +   G Q+ +   C Q   V N     P+C   + C ++++YG  +  A       L  
Sbjct: 136 MYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQY 195

Query: 185 DTISLATDIVPG---YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQS 235
           + +S      P     +FGC  K  G+    ++   G+LG G+ + S+L+Q       + 
Sbjct: 196 NQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRK 255

Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
            F++CL +       G   +G + QPK +K TPL+ +      Y V L  I VG   + +
Sbjct: 256 MFAHCLDTVNG---GGIFAIGNVVQPK-VKTTPLVPDMPH---YNVILKGIDVGGTALGL 308

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY- 353
           P     F+     GTIIDSGT    +    Y A+   VF +    +++V +L  F +C+ 
Sbjct: 309 PTNI--FDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKH--QDISVQTLQDF-SCFQ 363

Query: 354 ---SVPIVAPTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM----AAAPDNVNSVL 403
              SV    P +T  F G +V+L   P D L       ++ C+          D  + VL
Sbjct: 364 YSGSVDDGFPEVTFHFEG-DVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVL 420

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
             + ++   N  +LYD+ N  +G A   C+
Sbjct: 421 --LGDLVLSNKLVLYDLENQAIGWADYNCS 448


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 159/372 (42%), Gaps = 45/372 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  R K+G+P +   + +DT +D  WV C+ C GC S+         FN   S+T   + 
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 149 CQAAQCK---QVPNPTC---GGGACAFNLTYGS-STIAANLSQDTISLATDIVPG----- 196
           C   +C    Q     C       C +  TYG  S  +     DT+    D V G     
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYF--DSVMGNEQTA 208

Query: 197 -----YTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFK 245
                  FGC    +G+         G+ G G+  LS+++Q  +L  S   FS+CL    
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG-- 266

Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
           + +  G L LG I +P  + YTPL+ +      Y +NL +I V  +   +P  +  F  +
Sbjct: 267 SDNGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIVVNGQ--KLPIDSSLFTTS 320

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTI 362
              GTI+DSGT    L   AY    +     V  ++ ++ S G   F T  SV    PT+
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTV 380

Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           +L F  G+ +T+  +N L+   +     L       N    + ++ ++  ++   +YD+ 
Sbjct: 381 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 440

Query: 422 NSRLGVARELCT 433
           N R+G     C+
Sbjct: 441 NMRMGWTDYDCS 452


>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
 gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
          Length = 490

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 163/364 (44%), Gaps = 60/364 (16%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFKNLGCQA 151
           +GTP  T L+A+DT +D  WVPC  C+ C+             V++ AQSTT + + C +
Sbjct: 82  LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCSS 140

Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQ 203
             C           +C +++ Y S   +++  L +D + L +D      +     FGC Q
Sbjct: 141 NLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQ 200

Query: 204 KATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
             TG+   S  P GLLGLG  S S+  L  ++ L  ++FS C          G +  G  
Sbjct: 201 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGRINFGDT 256

Query: 259 GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           G   + K TPL  N  + + YY + +  I VG + +           +T    I+DSGT 
Sbjct: 257 GSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAIVDSGTS 302

Query: 318 FTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVT 372
           FT L  P YT +   F  ++ S  N+  +S+  F+ CYSV    IV P ++L   G ++ 
Sbjct: 303 FTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTAKGGSI- 360

Query: 373 LPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            P ++ +I  T  +      CLA+  +       +N+I        ++++D     LG  
Sbjct: 361 FPVNDPIITITDNAFNPVGYCLAIMKSEG-----VNLIGENFMSGLKVVFDRERMVLGWK 415

Query: 429 RELC 432
              C
Sbjct: 416 NFNC 419


>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTP++T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 147/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT +   WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTTWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSRGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 109/393 (27%), Positives = 164/393 (41%), Gaps = 49/393 (12%)

Query: 69  LQFLSSLAVARKSVVPIASGRQITQSPTY------IVRAKIGTPAQTLLMAMDTSNDAAW 122
           L+ L  L+   K++ P        QSP Y      ++   IGTP   +    DT +D  W
Sbjct: 46  LRRLMELSAMEKTLTP--------QSPIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTW 97

Query: 123 ---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTI 178
              VPC  C    + +F+  +STT++N+ C +  C ++    C     C +   Y S+ I
Sbjct: 98  TSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAI 157

Query: 179 AAN-LSQDTISLAT---DIVP--GYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQN 231
               L+Q+TI+L++     VP  G  FGC    TG  +    G++GLG G +SL++Q  +
Sbjct: 158 TRGVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGS 217

Query: 232 LYQST-FSYCLPSFKA-LSFSGSLRLGPIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIR 287
            +    FS CL  F   +S S  +  G   +   K +  TPL+    ++  Y+V LL I 
Sbjct: 218 SFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTP-YFVTLLGIS 276

Query: 288 VGRRVVDIPPGALQFNPTT----GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LT 342
           V           L FN ++         +DSGT  T L    Y  V    R  V    +T
Sbjct: 277 VENTY-------LHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVT 329

Query: 343 VTSLGGFDTCYSVP--IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
                G   CY     +  P +T  F G +V L      I    G + CL       N +
Sbjct: 330 DDPDLGPQLCYRTKNNLRGPVLTAHFEGADVKLSPTQTFISPKDG-VFCLGFT----NTS 384

Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           S   V  N  Q N+ I +D+    +    + CT
Sbjct: 385 SDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDCT 417


>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
           sativa Japonica Group]
          Length = 732

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 163/364 (44%), Gaps = 60/364 (16%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFKNLGCQA 151
           +GTP  T L+A+DT +D  WVPC  C+ C+             V++ AQSTT + + C +
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCSS 163

Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQ 203
             C           +C +++ Y S   +++  L +D + L +D      +     FGC Q
Sbjct: 164 NLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQ 223

Query: 204 KATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
             TG+   S  P GLLGLG  S S+  L  ++ L  ++FS C          G +  G  
Sbjct: 224 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGRINFGDT 279

Query: 259 GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           G   + K TPL  N  + + YY + +  I VG + +           +T    I+DSGT 
Sbjct: 280 GSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAIVDSGTS 325

Query: 318 FTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVT 372
           FT L  P YT +   F  ++ S  N+  +S+  F+ CYSV    IV P ++L   G ++ 
Sbjct: 326 FTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTAKGGSI- 383

Query: 373 LPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            P ++ +I  T  +      CLA+  +       +N+I        ++++D     LG  
Sbjct: 384 FPVNDPIITITDNAFNPVGYCLAIMKS-----EGVNLIGENFMSGLKVVFDRERMVLGWK 438

Query: 429 RELC 432
              C
Sbjct: 439 NFNC 442


>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
          Length = 632

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 92/362 (25%), Positives = 149/362 (41%), Gaps = 52/362 (14%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------------TVFNSAQSTTFKNLG 148
           IGTP+ + L+A+D+ +D  W+PC  CV C+                 F+ + STT K   
Sbjct: 103 IGTPSVSFLVALDSGSDLLWIPCN-CVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFP 161

Query: 149 CQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLA------TDIVPGYTFG 200
           C    C+  P        C + +TY S   +++  L +D + LA      + +      G
Sbjct: 162 CSHKLCESAPACESPKEQCPYTVTYASENTSSSGLLVEDVLHLAYSANASSSVKARVVVG 221

Query: 201 CIQKATGN---SVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSGSLR 254
           C +K +G     + P G++GLG G +S+   LA+   L +++FS C         SG + 
Sbjct: 222 CGEKQSGEFLKGIAPDGVMGLGPGEISVPSFLAKA-GLMRNSFSMCFDEED----SGRIY 276

Query: 255 LGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
            G +G        P  +   R   Y    +A  VG  V  +    L+ +  T   T+IDS
Sbjct: 277 FGDVG--------PSTQQSTRFLPYKNEFVAYFVGVEVCCVGNSCLKQSSFT---TLIDS 325

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV--APTITLMFSGMNVT 372
           G  FT L    Y  V       + + +     G ++ CY        P I L FS  N  
Sbjct: 326 GQSFTFLPEEIYREVALEIDSHINATVKKIEGGPWEYCYETSFEPKVPAIKLKFSSNNTF 385

Query: 373 LPQDNLLIHSTAGSIT--CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
           +    L +   +  +   CL ++A+ +       VI       +RI++D  N +LG +  
Sbjct: 386 VIHKPLFVLQRSEGLVQFCLPISASEEGTG---GVIGQNYMAGYRIVFDRENMKLGWSAS 442

Query: 431 LC 432
            C
Sbjct: 443 KC 444


>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
          Length = 469

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 113/418 (27%), Positives = 174/418 (41%), Gaps = 51/418 (12%)

Query: 59  LEMLAKDQARLQ--FLSSLAVARKSV---------VPIASGRQITQSPTYIVRAKIGTPA 107
           L   A+D AR      S LA  R+           +P++SG   T +  Y VR ++GTPA
Sbjct: 57  LGERARDDARRHAYIRSQLASRRRRAADVGASAFAMPLSSG-AYTGTGQYFVRFRVGTPA 115

Query: 108 QTLLMAMDTSNDAAWVPCTGCVGCSST-----VFNSAQSTTFKNLGCQAAQCKQ-VP--- 158
           Q  ++  DT +D  WV C G  G  ++      F +++S ++  L C +  C   VP   
Sbjct: 116 QPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDTCTSYVPFSL 175

Query: 159 -NPTCGGGACAFNLTYGSSTIAANL---SQDTISLATDI-------------VPGYTFGC 201
            N +     CA++  Y   + A  +      TI+L+                + G   GC
Sbjct: 176 ANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRAKLQGVVLGC 235

Query: 202 IQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS---LRLGP 257
                G S     G+L LG  ++S  ++    +   FSYCL    A   + S      GP
Sbjct: 236 TATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNASSYLTFGPGP 295

Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
            G       TPL+ + R S  Y V + A+ V    +DIP  A  ++   G G I+DSGT 
Sbjct: 296 EGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIP--ADVWDVGRGGGAILDSGTS 353

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTI---TLMFSGMNVTLP 374
            T L  PAY AV      R+ + L   ++  F+ CY+    AP I    + F+G     P
Sbjct: 354 LTVLATPAYRAVVAALGGRLAA-LPRVAMDPFEYCYNWTAGAPEIPKLEVSFAGSARLEP 412

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
                +   A  + C+ +    +     ++VI N+ QQ H   +D+ +  L      C
Sbjct: 413 PAKSYVIDAAPGVKCIGVQ---EGAWPGVSVIGNILQQEHLWEFDLRDRWLRFKHTRC 467


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 166/385 (43%), Gaps = 50/385 (12%)

Query: 83  VPIASG----RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-----CVGCSS 133
           VP A G    + IT+S  Y++   +GTP   +L   DT +D  WV C+           +
Sbjct: 82  VPEADGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGA 141

Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-CAFNLTY--GSSTIAANLSQDTISLA 190
            VF+ ++STT+  L CQ+A C+ +   +C   + C +   Y  GS TI   LS +T S A
Sbjct: 142 VVFHPSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGV-LSTETFSFA 200

Query: 191 TDI--------VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYC 240
                      VP  +FGC   + G S    GL+GLG G+LSL++Q          FSYC
Sbjct: 201 AAGGGGEGQVRVPRVSFGCSTGSAG-SFRSDGLVGLGAGALSLVSQLGAAARIARRFSYC 259

Query: 241 L-PSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           L P + A + S +L  G    +  P     TPL+ +    S Y V L ++ V  + V   
Sbjct: 260 LVPPYAAANSSSTLSFGARAVVSDPGAAS-TPLVPS-EVDSYYTVALESVAVAGQDV--- 314

Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV- 355
                      +  I+DSGT  T L       +     RR+              CY V 
Sbjct: 315 ------ASANSSRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQ 368

Query: 356 ------PIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
                     P +TL F  G +VTL  +N       G++ CL +   P + +  ++++ N
Sbjct: 369 GKSQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEEGTL-CLVL--VPVSESQPVSILGN 425

Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
           + QQN  + YD+    +  A   CT
Sbjct: 426 IAQQNFHVGYDLDARTVTFAAVDCT 450


>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 516

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 97/375 (25%), Positives = 157/375 (41%), Gaps = 62/375 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-------------TVFNSAQSTT 143
           +     +GTP    L+A+DT +D  W+PC  C+ C                 ++  +S+T
Sbjct: 105 HFANVSVGTPPLWFLVALDTGSDLFWLPCD-CISCVHGGLRTRTGKILKFNTYDLDKSST 163

Query: 144 FKNLGCQAAQ-CKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT------DIV 194
              + C  +  C+Q       G  C + + Y S+  ++   + +D + L T      D  
Sbjct: 164 SNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHLITDDDQTKDAD 223

Query: 195 PGYTFGCIQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSF 249
               FGC Q  TG   N   P GL GLG  ++S+  +   + L  ++FS C  S  A   
Sbjct: 224 TRIAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILAREGLISNSFSMCFGSDSA--- 280

Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRR-SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
            G +  G  G P + K TP   N R+    Y + +  I V   V D     L+F+     
Sbjct: 281 -GRITFGDTGSPDQRK-TPF--NVRKLHPTYNITITKIIVEDSVAD-----LEFH----- 326

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRV----GSNLTVTSLGGFDTCYSVP----IVAP 360
             I DSGT FT +  PAYT + +++  +V     S+ +  S   FD CY +     I  P
Sbjct: 327 -AIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQTIEVP 385

Query: 361 TITLMFSGMNVTLPQDNLLIHST--AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
            + L   G +     D ++  S+   G + CL +  +       +N+I       ++I++
Sbjct: 386 FLNLTMKGGDDYYVMDPIIQVSSEEEGDLLCLGIQKS-----DSVNIIGQNFMTGYKIVF 440

Query: 419 DVPNSRLGVARELCT 433
           D  N  LG     C+
Sbjct: 441 DRDNMNLGWKETNCS 455


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 159/381 (41%), Gaps = 66/381 (17%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  + K+G PA+   + +DT +D  WV C+ C GC  +        +F++ +S++ + L 
Sbjct: 84  YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143

Query: 149 CQAAQCKQVPNPT----CGGGACAFNLTY---------------------GSSTIAANLS 183
           C    C  V   T         C+++  Y                     G STIA   S
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN--S 201

Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCL 241
             TI     I   Y +G + +AT       G+ G G+G  S+++Q  ++ +    FS+CL
Sbjct: 202 SATIVFGCSI---YQYGDLTRATK---ALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
              +  +  G L LG I +P  I Y+PL+ +      Y + L +I +  ++    P    
Sbjct: 256 KGGE--NGGGILVLGEILEPS-IVYSPLIPSQPH---YTLKLQSIALSGQLF---PNPTM 306

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVPIV 358
           F  +    TIIDSGT    LV   Y  +  V    V  + T T   G   F    SV  +
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVADI 366

Query: 359 APTITLMFSGMN--VTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQ 411
            P +   F G+   V  P++ L   S        S+ C+    A D     LN++ ++  
Sbjct: 367 FPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDG----LNILGDLVL 422

Query: 412 QNHRILYDVPNSRLGVARELC 432
           ++  I+YD+   R+G A   C
Sbjct: 423 KDKIIVYDLAQQRIGWANYDC 443


>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 321

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 82/332 (24%), Positives = 146/332 (43%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTP++T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P ++FGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFSFGCNMDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP       F    +G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 50/387 (12%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
           +P+      +++  Y  +  IGTP++   + +DT +D  WV C GC        +G   T
Sbjct: 141 LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLT 200

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISL- 189
           +++   STT   +GC    C     P  G   G  C +++ YG  S+      QD +   
Sbjct: 201 LYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYN 260

Query: 190 -------ATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQST 236
                   T       FGC  K +G     S    G+LG G+ + S+L+Q  +    +  
Sbjct: 261 RISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKV 320

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           FS+CL +       G   +G + +PK +  TPL++N    + Y V +  I VG   +D+P
Sbjct: 321 FSHCLDNVDG---GGIFAIGEVVEPK-VNITPLVQN---QAHYNVVMKEIEVGGDPLDVP 373

Query: 297 PGALQFNPTTGAGTIIDSGT--------VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
             A  F      GTIIDSGT        V+  L+    +   D+    V    T      
Sbjct: 374 SDA--FESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFTC----- 426

Query: 349 FDTCYSVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           FD   +V    PT+TL F   +++T+ P + L              + A       L ++
Sbjct: 427 FDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLL 486

Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
            ++   N  ++YD+    +G     C+
Sbjct: 487 GDLVLSNKLVVYDLEKQGIGWVEYNCS 513


>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
          Length = 671

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 150/324 (46%), Gaps = 55/324 (16%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFKNLGCQA 151
           +GTP  T L+A+DT +D  WVPC  C+ C+             V++ AQSTT + + C +
Sbjct: 41  LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCSS 99

Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQ 203
             C           +C +++ Y S   +++  L +D + L +D      +     FGC Q
Sbjct: 100 NLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQ 159

Query: 204 KATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
             TG+   S  P GLLGLG  S S+  L  ++ L  ++FS C          G +  G  
Sbjct: 160 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGRINFGDT 215

Query: 259 GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
           G   + K TPL  N  + + YY + +  I VG + +           +T    I+DSGT 
Sbjct: 216 GSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAIVDSGTS 261

Query: 318 FTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVT 372
           FT L  P YT +   F  ++ S  N+  +S+  F+ CYSV    IV P ++L   G ++ 
Sbjct: 262 FTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTAKGGSI- 319

Query: 373 LPQDNLLIHSTAGSIT----CLAM 392
            P ++ +I  T  +      CLA+
Sbjct: 320 FPVNDPIITITDNAFNPVGYCLAI 343


>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 147/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y+    +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSISWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSSGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
          Length = 530

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 167/361 (46%), Gaps = 56/361 (15%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---------STVFNSAQSTTFKNLGCQAAQ 153
           +GTP QT ++A+DT +D  W+PC  C GC+         ++ +  + S+T + + C +  
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQF 180

Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT-DIVP-----GYTFGCIQKA 205
           C ++         C + + Y S+  +++  L +D + L+T D +P        FGC Q  
Sbjct: 181 C-ELRKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCGQVQ 239

Query: 206 TG---NSVPPQGLLGLGRGSL---SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           TG   ++  P GL GLG   +   S+LAQ + L  ++F+ C     +    G +  G  G
Sbjct: 240 TGSFLDAAAPNGLFGLGIDMISIPSILAQ-KGLTSNSFAMCF----SRDGIGRISFGDQG 294

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
              + + TPL  NP+  + Y +++  I VG  + D     L+F+      TI D+GT FT
Sbjct: 295 SSDQ-EETPLDVNPQHPT-YTISISEITVGNSLTD-----LEFS------TIFDTGTSFT 341

Query: 320 RLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNV-- 371
            L  PAYT +   F  +V +N     S   F+ CY +      I  P+I+L   G +V  
Sbjct: 342 YLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            + +  ++       + CLA+  +     + LN+I        R+++D     LG  +  
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKS-----AKLNIIGQNFMTGLRVVFDRERKILGWKKFN 456

Query: 432 C 432
           C
Sbjct: 457 C 457


>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
 gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
          Length = 530

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 167/361 (46%), Gaps = 56/361 (15%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---------STVFNSAQSTTFKNLGCQAAQ 153
           +GTP QT ++A+DT +D  W+PC  C GC+         ++ +  + S+T + + C +  
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQF 180

Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT-DIVP-----GYTFGCIQKA 205
           C ++         C + + Y S+  +++  L +D + L+T D +P        FGC Q  
Sbjct: 181 C-ELRKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCGQVQ 239

Query: 206 TG---NSVPPQGLLGLGRGSL---SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           TG   ++  P GL GLG   +   S+LAQ + L  ++F+ C     +    G +  G  G
Sbjct: 240 TGSFLDAAAPNGLFGLGIDMISIPSILAQ-KGLTSNSFAMCF----SRDGIGRISFGDQG 294

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
              + + TPL  NP+  + Y +++  I VG  + D     L+F+      TI D+GT FT
Sbjct: 295 SSDQ-EETPLDVNPQHPT-YTISISEITVGNSLTD-----LEFS------TIFDTGTSFT 341

Query: 320 RLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNV-- 371
            L  PAYT +   F  +V +N     S   F+ CY +      I  P+I+L   G +V  
Sbjct: 342 YLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            + +  ++       + CLA+  +     + LN+I        R+++D     LG  +  
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKS-----AKLNIIGQNFMTGLRVVFDRERKILGWKKFN 456

Query: 432 C 432
           C
Sbjct: 457 C 457


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 94/365 (25%), Positives = 158/365 (43%), Gaps = 49/365 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP+Q   + +D+ +   +VPC  C  C +     F    S+T+  + C    
Sbjct: 91  YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCN-VD 149

Query: 154 CKQVPNPTCGG--GACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATG 207
           C      TC      C +   Y   S+ +  L +D +S    +++ P    FGC    TG
Sbjct: 150 C------TCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETG 203

Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
           +  S    G++GLGRG LS++ Q   + +   +FS C          G++ LG +  P  
Sbjct: 204 DLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV--GGGTMVLGGMPAPPD 261

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           + ++    NP RS  Y + L  I V  + + + P    FN  +  GT++DSGT +  L  
Sbjct: 262 MVFS--HSNPVRSPYYNIELKEIHVAGKALRLDPKI--FN--SKHGTVLDSGTTYAYLPE 315

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------------VPIVAPTITLMF-SGM 369
            A+ A +D    +V S   +  + G D  Y              +  V P + ++F +G 
Sbjct: 316 QAFVAFKDAVTNKVNS---LKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 372

Query: 370 NVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            ++L P++ L  HS      CL +     +  ++L  I     +N  + YD  N ++G  
Sbjct: 373 KLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKIGFW 429

Query: 429 RELCT 433
           +  C+
Sbjct: 430 KTNCS 434


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 100/392 (25%), Positives = 162/392 (41%), Gaps = 78/392 (19%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSSTVFNSAQSTT 143
           T +  Y    ++GTP +   + +DT +D  WV C  C        +G   T+++   S+T
Sbjct: 83  TDTGLYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASST 142

Query: 144 FKNLGCQAAQCKQVPN---PTCGGGA-CAFNLTY--GSSTIAANLSQDTISLATDIVPG- 196
              + C    C        P C     C +++TY  GSST+ + ++    +L  D V G 
Sbjct: 143 GSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVND---ALQFDQVTGD 199

Query: 197 ---------YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCL 241
                      FGC  +  G+    S    G+LG G  + S+L+Q  T    +  F++CL
Sbjct: 200 GQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCL 259

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
            + K     G   +G + QPK +K TPL+ +      Y VNL  I VG   +++P  A  
Sbjct: 260 DTIKG---GGIFAIGDVVQPK-VKTTPLVADKPH---YNVNLKTIDVGGTTLELP--ADI 310

Query: 302 FNPTTGAGTIIDSGT--------VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
           F P    GTIIDSGT        VF +++   +   +D+    V   L     G  D  +
Sbjct: 311 FKPGEKRGTIIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQDFLCFEYSGSVDDGF 370

Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIH--------STAGSITCLAMA----AAPDNVNS 401
                 PT+T  F        +D+L +H             + C+        + D  + 
Sbjct: 371 ------PTLTFHF--------EDDLALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDI 416

Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
           VL  + ++   N  ++YD+ N  +G     C+
Sbjct: 417 VL--MGDLVLSNKLVVYDLENRVIGWTDYNCS 446


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 158/387 (40%), Gaps = 50/387 (12%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
           +P+      +++  Y  +  IGTP++   + +DT +D  WV C GC        +G   T
Sbjct: 60  LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLT 119

Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISL- 189
           +++   STT   +GC    C     P  G   G  C +++ YG  S+      QD +   
Sbjct: 120 LYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYN 179

Query: 190 -------ATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQST 236
                   T       FGC  K +G     S    G+LG G+ + S+L+Q  +    +  
Sbjct: 180 RISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKV 239

Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
           FS+CL +       G   +G + +PK +  TPL++N      Y V +  I VG   +D+P
Sbjct: 240 FSHCLDNVDG---GGIFAIGEVVEPK-VNITPLVQNQAH---YNVVMKEIEVGGDPLDVP 292

Query: 297 PGALQFNPTTGAGTIIDSGT--------VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
             A  F      GTIIDSGT        V+  L+    +   D+    V    T      
Sbjct: 293 SDA--FESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFTC----- 345

Query: 349 FDTCYSVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
           FD   +V    PT+TL F   +++T+ P + L              + A       L ++
Sbjct: 346 FDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLL 405

Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
            ++   N  ++YD+    +G     C+
Sbjct: 406 GDLVLSNKLVVYDLEKQGIGWVEYNCS 432


>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
 gi|223942623|gb|ACN25395.1| unknown [Zea mays]
          Length = 378

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 104/383 (27%), Positives = 163/383 (42%), Gaps = 40/383 (10%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-----VFN 137
           +P++SG   T +  Y VR ++GTPAQ  ++  DT +D  WV C G  G  ++      F 
Sbjct: 1   MPLSSG-AYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFR 59

Query: 138 SAQSTTFKNLGCQAAQCKQ-VP----NPTCGGGACAFNLTYGSSTIAANL---SQDTISL 189
           +++S ++  L C +  C   VP    N +     CA++  Y   + A  +      TI+L
Sbjct: 60  ASESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIAL 119

Query: 190 ATDI-------------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQS 235
           +                + G   GC     G S     G+L LG  ++S  ++    +  
Sbjct: 120 SGSGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGG 179

Query: 236 TFSYCLPSFKALSFSGS---LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
            FSYCL    A   + S      GP G       TPL+ + R S  Y V + A+ V    
Sbjct: 180 RFSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEA 239

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
           +DIP  A  ++   G G I+DSGT  T L  PAY AV      R+ + L   ++  F+ C
Sbjct: 240 LDIP--ADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAA-LPRVAMDPFEYC 296

Query: 353 YSVPIVAPTI---TLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
           Y+    AP I    + F+G     P     +   A  + C+ +    +     ++VI N+
Sbjct: 297 YNWTAGAPEIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQ---EGAWPGVSVIGNI 353

Query: 410 QQQNHRILYDVPNSRLGVARELC 432
            QQ H   +D+ +  L      C
Sbjct: 354 LQQEHLWEFDLRDRWLRFKHTRC 376


>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 147/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y+    +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGIHGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
 gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
          Length = 165

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 56/169 (33%), Positives = 83/169 (49%), Gaps = 9/169 (5%)

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L +NP+  + YYV L+ I VG  ++ IP  + + +     G I+DSGT  TRL +  Y  
Sbjct: 1   LRRNPQLDTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNV 60

Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLPQDNLLIHST 383
           VRD F +     L    +  FDTCY +     +  PT+   F  G  + LP  N L+   
Sbjct: 61  VRDAFVKGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVD 120

Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +    C A A       S L++I N+QQQ  R+ +D+ NS +G +   C
Sbjct: 121 SVGTFCFAFAPTM----SSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 98/378 (25%), Positives = 159/378 (42%), Gaps = 63/378 (16%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
           Y  + K+G PA+   + +DT +D  WV C+ C GC  +        +F++ +S++ + L 
Sbjct: 84  YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143

Query: 149 CQAAQCKQVPNPT----CGGGACAFNLTY---------------------GSSTIAANLS 183
           C    C  V   T         C+++  Y                     G STIA   S
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN--S 201

Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCL 241
             TI     I   Y +G + +AT       G+ G G+G  S+++Q  ++ +    FS+CL
Sbjct: 202 SATIVFGCSI---YQYGDLTRATK---ALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255

Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
              +  +  G L LG I +P  I Y+PL+ +      Y + L +I +  ++    P    
Sbjct: 256 KGGE--NGGGILVLGEILEPS-IVYSPLIPSQPH---YTLKLQSIALSGQLF---PNPTM 306

Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVPIV 358
           F  +    TIIDSGT    LV   Y  +  V    V  + T T   G   F    SV  +
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVADI 366

Query: 359 APTITLMFSGMN--VTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNH 414
            P +   F G+   V  P++ L   S     ++ C+    A D     LN++ ++  ++ 
Sbjct: 367 FPVLRFNFEGIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDG----LNILGDLVLKDK 422

Query: 415 RILYDVPNSRLGVARELC 432
            I+YD+   R+G A   C
Sbjct: 423 IIVYDLARQRIGWANYDC 440


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 108/418 (25%), Positives = 169/418 (40%), Gaps = 48/418 (11%)

Query: 51  PLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQT 109
           PLS  E   +++  DQ R   +S     +  V + + SG     +  Y    ++GTPA+ 
Sbjct: 45  PLSRIE---DIIGADQKRHSLISRKRKFKGGVKMDLGSGIDY-GTAQYFTEVRVGTPAKK 100

Query: 110 LLMAMDTSNDAAWVPC------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-------- 155
             + +DT ++  WV C       G V  +  VF + +S +FK +GC    CK        
Sbjct: 101 FRVVVDTGSELTWVNCRYRGRGKGKVK-NRRVFRAEESKSFKTVGCFTQTCKVDLMNLFS 159

Query: 156 --QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLA-----TDIVPGYTFGCIQKATG 207
               P P+     C+++  Y   + A  + +++TI++         + G   GC    +G
Sbjct: 160 LSTCPTPST---PCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSSFSG 216

Query: 208 NSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKRIK 265
            S     G+LGL     S  +   +L+ +  SYCL       + S  L  G        K
Sbjct: 217 QSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTK 276

Query: 266 YTPLLKNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
             P    P   +L    Y +N++ I +G  ++DIP     ++ TTG GTI+DSGT  T L
Sbjct: 277 TAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQV--WDATTGGGTILDSGTSLTLL 334

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLG-GFDTCYSV-----PIVAPTITLMFSGMNVTLPQ 375
              AY  V     R +     V   G   + C+S          P +T    G     P 
Sbjct: 335 AEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNESKLPQLTFHLKGGARFEPH 394

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
               +   A  + CL   +A        NV+ N+ QQN+   +D+  S L  A   CT
Sbjct: 395 RKSYLVDAAPGVKCLGFMSAG---TPATNVVGNIMQQNYLWEFDLMASTLSFAPSTCT 449


>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 110/430 (25%), Positives = 168/430 (39%), Gaps = 93/430 (21%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVFNSAQ 140
           +PI+SG        Y++   +G+ +  + + +DT +D  W PC    C+ C S     + 
Sbjct: 75  LPISSGSD------YLISLSVGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSP 128

Query: 141 STTFKNLG----------------------CQAAQCKQVPNPTCGGGACA--------FN 170
            ++  +                        C  + C   P      G C         F 
Sbjct: 129 PSSLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNC---PLDFIETGDCNTSSYPCPPFY 185

Query: 171 LTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ-- 228
             YG  ++ A L  D++SL +  V  +TFGC          P G+ G GRG LSL AQ  
Sbjct: 186 YAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAE---PIGVAGFGRGRLSLPAQLA 242

Query: 229 -TQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------GQPKRIK--------------- 265
                  ++FSYCL S  +       R  P+        + KR+                
Sbjct: 243 VHSPHLGNSFSYCLVS-HSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKK 301

Query: 266 ----YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
               +T +L+NP+    Y V+L  I +G+R +  P    + +   G G ++DSGT FT L
Sbjct: 302 NEFVFTEMLENPKHPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTML 361

Query: 322 VAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV--PIVAPTITLMFSG--MNVTL 373
            A  Y +V + F  RVG        V    G   CY +   +  P + L F+G   +VTL
Sbjct: 362 PAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTL 421

Query: 374 PQDNLLIHSTAG--------SITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDVPN 422
           P+ N       G         I CL +    D          ++ N QQQ   ++YD+ N
Sbjct: 422 PRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLN 481

Query: 423 SRLGVARELC 432
            R+G A+  C
Sbjct: 482 RRVGFAKRKC 491


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 94/390 (24%), Positives = 157/390 (40%), Gaps = 41/390 (10%)

Query: 68  RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
           R + L+    A  S VPI   R +      +    IGTP Q     +D + +  W  C+ 
Sbjct: 42  RGRLLADATPAGGSAVPIHWSRHLYN----VANFTIGTPPQPASAIIDVAGELVWTQCSM 97

Query: 128 CVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANL-- 182
           C  C      +F    S+TF+   C    CK +P   C    C +  T  S      L  
Sbjct: 98  CSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPTSNCSSNMCTYEGTINSKLGGHTLGI 157

Query: 183 -SQDTISLATDIVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
            + DT ++ T       FGC+  +  +++  P GL+GLGR   SL++Q   +  + FSYC
Sbjct: 158 VATDTFAIGTATA-SLGFGCVVASGIDTMGGPSGLIGLGRAPSSLVSQ---MNITKFSYC 213

Query: 241 LPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
           L    +     L    S +L   G      +         S  Y + L  I+ G   + +
Sbjct: 214 LTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIAL 273

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-- 353
           PP        +G   ++ +    + LV  AY A++    + VG+  T T L  FD C+  
Sbjct: 274 PP--------SGNTVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPK 325

Query: 354 ------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAP----DNVNSVL 403
                 S P +  T     + + V  P+  + +    G++ C+A+ +        ++  L
Sbjct: 326 AGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTV-CMAILSTSWLNTTALDENL 384

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
           N++ ++QQ+N   L D+    L      C+
Sbjct: 385 NILGSLQQENTHFLLDLEKKTLSFEPADCS 414


>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
          Length = 378

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 96/350 (27%), Positives = 143/350 (40%), Gaps = 53/350 (15%)

Query: 131 CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGG-ACA-FNLTYGSSTIAANLSQDT 186
           C+S + ++A ++   +  C  A+C  + +   +CG   AC      YG  ++ A+L +  
Sbjct: 26  CASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGR 85

Query: 187 ISLATDI-------VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
           ++L           V  +TF C   A G    P G+ G GRG LSL  Q        FSY
Sbjct: 86  VALGAGARASVAVAVDNFTFACAHTALGE---PVGVAGFGRGPLSLPGQLSPQLSGRFSY 142

Query: 240 CL--PSFKALSFSGSLRLGPI------------GQPKRIKYTPLLKNPRRSSLYYVNLLA 285
           CL   SF+A      +R  P+             +     YTPLL NP+    Y V L A
Sbjct: 143 CLVSHSFRADRL---IRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEA 199

Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT--- 342
           + VG   +   P   + +     G ++DSGT FT L    Y  V + F R + +      
Sbjct: 200 VSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARA 259

Query: 343 --VTSLGGFDTCYSVPIV---APTITLMFSG-MNVTLPQDNLLI-----HSTAGS----I 387
                  G   CY         P + L F G   V LP+ N  +      + AG+    +
Sbjct: 260 ERAEEQTGLTPCYRYAASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDV 319

Query: 388 TCLAMA----AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
            CL +     A+ +  +     + N QQQ   ++YDV   R+G AR  CT
Sbjct: 320 GCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 369


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 98/380 (25%), Positives = 164/380 (43%), Gaps = 51/380 (13%)

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTT 143
           +Q   Y  + K+GTP +   + +DT +D  WV C  C GC  T         F+   S+T
Sbjct: 72  SQVGLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSST 131

Query: 144 FKNLGCQAAQCK---QVPNPTCG--GGACAFNLTYGSSTIAANLSQDTI---------SL 189
              + C   +C+   Q  + +C      C +   YG  +  +      +         +L
Sbjct: 132 SSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTL 191

Query: 190 ATDIVPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPS 243
            T+      FGC    TG+    +    G+ G G+  +S+++Q   Q +    FS+CL  
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKG 251

Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
               S  G L LG I +P  I Y+PL+++      Y +NL +I V  ++V I P    F 
Sbjct: 252 DN--SGGGVLVLGEIVEPN-IVYSPLVQSQPH---YNLNLQSISVNGQIVPIAPAV--FA 303

Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYSVPI----- 357
            +   GTI+DSGT    L   AY    +     V  ++ +V S G  + CY +       
Sbjct: 304 TSNNRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRG--NQCYLITTSSNVD 361

Query: 358 VAPTITLMFSGMN--VTLPQDNLLIHST--AGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
           + P ++L F+G    V  PQD L+  +    GS+ C+     P      + ++ ++  ++
Sbjct: 362 IFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIP---GQSITILGDLVLKD 418

Query: 414 HRILYDVPNSRLGVARELCT 433
              +YD+   R+G A   C+
Sbjct: 419 KIFVYDLAGQRIGWANYDCS 438


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 96/367 (26%), Positives = 155/367 (42%), Gaps = 53/367 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFK----NLGC 149
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    S+T++    N+ C
Sbjct: 13  YTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCNIDC 72

Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKA 205
                KQ          C +   Y   ST +  L +D IS    + + P    FGC    
Sbjct: 73  NCDDEKQ---------QCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENME 123

Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           TG+  S    G++G+GRG LS++     + +   +FS C          G++ LG I  P
Sbjct: 124 TGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCY--GGMGIGGGAMVLGGISPP 181

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVF 318
             + ++    +P RS  Y ++L  I V  +        L  NPT      GTI+DSGT +
Sbjct: 182 SNMVFSQ--SDPVRSPYYNIDLKEIHVAGK-------PLPLNPTVFDGKHGTILDSGTTY 232

Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVT--SLGGFDTCYS--------VPIVAPTITLMF-S 367
             L   A+ + +D   + + S   +        D C+S        +    P + ++F +
Sbjct: 233 AYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGN 292

Query: 368 GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
           G  + L P++ L  HS      CL +     +  ++L  I     +N  +LYD  NS++G
Sbjct: 293 GQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIV---VRNTLVLYDRENSKIG 349

Query: 427 VARELCT 433
             +  C+
Sbjct: 350 FWKTNCS 356


>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 147/332 (44%), Gaps = 37/332 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y+    +GTP++T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +P +TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP  K+    FS   G   LG +  
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
              ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  + 
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233

Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
              R ++     +R++  RR  +            CY +  V     P I+L F  G   
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
            L    + +  +        +A AP    S++
Sbjct: 289 DLGSRGVFVERSVQEQDVWCLAFAPTESVSII 320


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 108/429 (25%), Positives = 186/429 (43%), Gaps = 42/429 (9%)

Query: 29  DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG 88
           D   +L + H  SP SP        ++  +    ++  +R+    + AV   S       
Sbjct: 31  DPGFSLNLIHRDSPLSPLYNPNHTDFDR-LRNAFSRSISRVNVFKTKAVDINSF----QN 85

Query: 89  RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFK 145
             +     Y ++  IGTP   +++  DT +D  WV   PC  C    S +F+ ++S++++
Sbjct: 86  DLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYR 145

Query: 146 NLGCQAAQCK--QVPNPTC--GGGACAFNLTYGSSTIA-ANLSQDTISL-ATDIVPGY-- 197
           ++ C +  C    V    C      C ++ +YG  +    NL+ +  ++ +T   P +  
Sbjct: 146 HMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLS 205

Query: 198 --TFGCIQKATGNSVPPQGL----LGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFS 250
              FGC    TGN      L    +GLG G+LSL++Q  ++ +  FSYCL P  +  + +
Sbjct: 206 PIVFGC---GTGNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVT 262

Query: 251 GSLRLGP---IGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
             ++ G    I  P+ +  TPL+ K P   + YYV L AI VG + +    G L  N   
Sbjct: 263 SKIKFGTDSVISGPQVVS-TPLVSKQP--DTYYYVTLEAISVGNKRLPYTNGLLNGNVEK 319

Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPTITL 364
           G   IIDSGT  T L +  +T +  V    V +       G F  C+  +  I  P I +
Sbjct: 320 G-NVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVCFRSAGDIDLPVIAV 378

Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
            F+  +V L   N  + +    + C  M ++     + + +  N+ Q +  + YD+    
Sbjct: 379 HFNDADVKLQPLNTFVKADE-DLLCFTMISS-----NQIGIFGNLAQMDFLVGYDLEKRT 432

Query: 425 LGVARELCT 433
           +      CT
Sbjct: 433 VSFKPTDCT 441


>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 86/334 (25%), Positives = 147/334 (44%), Gaps = 39/334 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTPA+T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +PG+TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCNMDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP       F    +G   LG    
Sbjct: 120 ANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGGKIA 178

Query: 261 PKR--IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
             R  ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  
Sbjct: 179 ATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSEL 233

Query: 319 T----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGM 369
           +    R ++     +R++  RR  +            CY +  V     P I+L F  G 
Sbjct: 234 SYIPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGA 288

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
              L    + +  +        +A AP    S++
Sbjct: 289 RFDLGSHGVFVERSVQEQDVWCLAFAPTESVSII 322


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 103/428 (24%), Positives = 174/428 (40%), Gaps = 53/428 (12%)

Query: 33  TLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
           ++ + H  SP SP + PS      E+  E L +   R    S  +++  +  P  S    
Sbjct: 36  SIDLIHRDSPKSPLYNPS------ETPAERLDRFFRRFMSFSEASISPNTPEPPVS---- 85

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLG 148
           + +  Y+++  IGTP   +    DT +D  W  C  C+ C    + +F+ ++ST+FK + 
Sbjct: 86  SNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVS 145

Query: 149 CQAAQCKQVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATD-----IVPGYTFG 200
           C++ QC+ +   +C      C F+  YG  ++A   ++ +T++L ++      +    FG
Sbjct: 146 CESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFG 205

Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKAL-SFSGSLRLG 256
           C    +G  +    GL G G   LSL +Q  +   S   FS CL  F+   S +  +  G
Sbjct: 206 CGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFG 265

Query: 257 PIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           P  +     +  TPL+      + Y+V L  I VG ++           P + +  +   
Sbjct: 266 PEAEVSGSXVVSTPLVTK-DDPTYYFVTLDGISVGDKLF----------PFSSSSPMATK 314

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT-------CY--SVPIVAPTITLM 365
           G VF     P     RD + R V        +            CY  +  I  P +T  
Sbjct: 315 GNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAH 374

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           F G +V L   N  I    G + C AM      ++    +  N  Q N  I +D+   ++
Sbjct: 375 FDGADVQLKPLNTFISPKEG-VYCFAMQP----IDGDTGIFGNFVQMNFLIGFDLDGKKV 429

Query: 426 GVARELCT 433
                 CT
Sbjct: 430 SFKAVDCT 437


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 94/358 (26%), Positives = 157/358 (43%), Gaps = 41/358 (11%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGC-VGCSST------VFNSAQSTTFKNLGCQAAQCK 155
           +GTPA   L+ +DT +  +WV C  C V C +        FN++ S+T++ +GC A  C 
Sbjct: 29  LGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRVGCSAQVCH 88

Query: 156 QV---PNPTCG----GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGC--IQK 204
            +    N   G      +C ++L Y S   +A  LSQD ++LA    +  + FGC    +
Sbjct: 89  DMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQDRLTLANSYSIQKFIFGCGSDNR 148

Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ-STFSYCLPSFKALSFSGSLRLGP-IGQPK 262
             G+S    G++G G  S S   Q   L   S FSYC PS +     G L +GP +    
Sbjct: 149 YNGHSA---GIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQEN--EGFLSIGPYVRDSN 203

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
           ++  T L        +Y +    + V G R+   PP        T   T++DSGTV T +
Sbjct: 204 KLILTQLFDYGAHLPVYALQQFDMMVNGMRLQVDPP------VYTTRMTVVDSGTVETFV 257

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-----SVPIVA-PTITLMFSGMNVTLPQ 375
           ++P + A+     + + +   V      + C+     SV     P + + FS   + LP 
Sbjct: 258 LSPVFRALDRALTKAMVAEGYVRGSDSKEICFHSNGDSVDWSKLPVVEIKFSRSILKLPA 317

Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNS-VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +N+  + T+    C      PD+     + ++ N   ++ R+++D+     G     C
Sbjct: 318 ENVFYYETSDGSICSTF--QPDDAGVPGVQILGNRATRSFRVVFDIQQRNFGFEAGAC 373


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 98/365 (26%), Positives = 158/365 (43%), Gaps = 39/365 (10%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLG 148
           Y     +G P Q L + +DT +D  WV C+ C  C S        +++N + S+T     
Sbjct: 83  YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSS 142

Query: 149 CQAAQC---KQVPNPTCGGGACAFNLTY--GSSTIAANLSQD---TISLATDIVPGYTFG 200
           C    C   + V + +    ACA+ ++Y   S++I A +  D    +           FG
Sbjct: 143 CSDPLCTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVLQGGNATTSHIFFG 202

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           C    TG S P  G++G G+ S ++  Q  TQ      FS+CL   K     G L  G  
Sbjct: 203 CAINITG-SWPADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEK--HGGGILEFGEE 259

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF--NPTTGAGTIIDSGT 316
                + +TPLL     ++ Y V+LL+I V  +V+ I      +  N T   G IIDSGT
Sbjct: 260 PNTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVIIDSGT 316

Query: 317 VFTRLVAPA----YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMN-V 371
            F  L   A    ++ ++++   ++G  L         +  +V    P +TL FSG + +
Sbjct: 317 SFALLATKANRILFSEIKNLTTAKLGPKLEGLQCFYLKSGLTVETSFPNVTLTFSGGSTM 376

Query: 372 TLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            L  DN L+        +  C A ++A       L +   +  ++  + YDV N R+G  
Sbjct: 377 KLKPDNYLVMVELKKKRNGYCYAWSSADG-----LTIFGEIVLKDKLVFYDVENRRIGWK 431

Query: 429 RELCT 433
            + C+
Sbjct: 432 GQNCS 436


>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
          Length = 530

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 167/361 (46%), Gaps = 56/361 (15%)

Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---------STVFNSAQSTTFKNLGCQAAQ 153
           +GTP QT ++A+DT +D  W+PC  C GC+         ++ +  + S+T + + C +  
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQF 180

Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT-DIVP-----GYTFGCIQKA 205
           C ++         C + + Y S+  +++  L +D + L+T D +P        FGC Q  
Sbjct: 181 C-ELRKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCGQVQ 239

Query: 206 TG---NSVPPQGLLGLGRGSL---SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
           TG   ++  P GL GLG   +   S+LAQ + L  ++F+ C     +    G +  G  G
Sbjct: 240 TGSFLDAAAPNGLFGLGIDMISIPSILAQ-KGLTSNSFAMCF----SRDGIGRISFGDQG 294

Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
              + + TPL  NP+  + Y +++  + VG  + D     L+F+      TI D+GT FT
Sbjct: 295 SSDQ-EETPLDVNPQHPT-YTISISEMTVGNSLTD-----LEFS------TIFDTGTSFT 341

Query: 320 RLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNV-- 371
            L  PAYT +   F  +V +N     S   F+ CY +      I  P+I+L   G +V  
Sbjct: 342 YLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401

Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
            + +  ++       + CLA+  +     + LN+I        R+++D     LG  +  
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKS-----AKLNIIGQNFMTGLRVVFDRERKILGWKKFN 456

Query: 432 C 432
           C
Sbjct: 457 C 457


>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 114/465 (24%), Positives = 187/465 (40%), Gaps = 63/465 (13%)

Query: 9   LAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFS-PCSPFKPSK----PLSWEESVLEMLA 63
           LA +F+      L   C    H  T  + H  S P   +  S     P   EE  +E  A
Sbjct: 2   LASVFIIVSLLSLWECCQCHGHVYTFTMHHRHSEPVRKWSHSAAAGIPAPPEEGTVEYYA 61

Query: 64  KDQARLQFLSSLAVAR-KSVVPIASGRQITQSPT----YIVRAKIGTPAQTLLMAMDTSN 118
           +   R + L    +++  + +  + G    +  +    +    +IGTP    ++A+DT +
Sbjct: 62  ELADRDRLLRGRKLSQIDAGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDTGS 121

Query: 119 DAAWVP--CTGCVGCSST---------VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGAC 167
           D  WVP  CT C    ST         V+N   S+T K + C  + C            C
Sbjct: 122 DLFWVPCDCTRCAASDSTAFASDFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFSNC 181

Query: 168 AFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQKATG---NSVPPQGLL 216
            + ++Y S+  + +  L +D + L  +      +     FGC Q  +G   +   P GL 
Sbjct: 182 PYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLF 241

Query: 217 GLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
           GLG   +S+  +   +     +FS C          G +  G  G   + + TP   NP 
Sbjct: 242 GLGMEKISVPSMLSREGFTADSFSMCF----GRDGIGRISFGDKGSFDQDE-TPFNLNPS 296

Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
             + Y + +  +RVG  V+D+   AL            DSGT FT LV P YT + + F 
Sbjct: 297 HPT-YNITVTQVRVGTTVIDVEFTAL-----------FDSGTSFTYLVDPTYTRLTESFH 344

Query: 335 RRVGSNLTVT-SLGGFDTCYSVPIVA-----PTITLMFSGMNVTLPQDNLLIHSTAGS-I 387
            +V      + S   F+ CY +   A     P+++L   G +     D ++I ST    +
Sbjct: 345 SQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSELV 404

Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            CLA+  + +     LN+I       +R+++D     LG  +  C
Sbjct: 405 YCLAVVKSAE-----LNIIGQNFMTGYRVVFDREKLVLGWKKFDC 444


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 108/430 (25%), Positives = 185/430 (43%), Gaps = 71/430 (16%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
           T ++ H  SP SPF  +  ++    +   + + ++RL +L  +    ++ +       ++
Sbjct: 9   TARLIHHDSPLSPFY-NHTMTDTARIEATVHRSRSRLNYLYYINKLSENALD----NDVS 63

Query: 93  QSPT-------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-CS------STVFNS 138
            SPT       Y++   IG P+  ++  +DTSN   WV C+ C   C       +T F S
Sbjct: 64  LSPTLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLS 123

Query: 139 AQSTTFKNLGCQAAQCKQVPN-PTCGGGA--CAFNLTYGSSTIAAN-LSQDTISLATD-- 192
           ++S T++   C +  C  +    TC      C + L YG +   +  LS D+    T   
Sbjct: 124 SKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDG 183

Query: 193 --IVPGY-TFGCIQK-ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
             +  G+  FGC +   TG+     G +GL +  LSL++Q   L    FSYCL  F  L 
Sbjct: 184 MLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQ---LGIKKFSYCLVPFNNLG 240

Query: 249 FSGSLRLGPI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
            +  +  G +     GQ      TPLL     S  YYV +L I +G    D P     F+
Sbjct: 241 STSKMYFGSLPVTSGGQ------TPLLY--PNSDAYYVKVLGISIGN---DEPHFDGVFD 289

Query: 304 -PTTGAGTIIDSGTVFTRLVAPAYTA-------VRDVFRRRVGSNLTVTSLGGFDTCYSV 355
                 G IID+G  ++ L   A+ +       ++D  +R+            F+ C+ +
Sbjct: 290 VYEVRDGWIIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKER------FELCFEL 343

Query: 356 PIVA-----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
                    P +T+ F G ++ L  ++  +      I CLA+  +     S ++++ N Q
Sbjct: 344 QNANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRS----GSPVSILGNFQ 399

Query: 411 QQNHRILYDV 420
            QN+ + YD+
Sbjct: 400 LQNYHVGYDL 409


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 86/316 (27%), Positives = 139/316 (43%), Gaps = 45/316 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP QT  + +DT +   +VPC+ C  C       F    S+T++ + C    
Sbjct: 90  YTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCNI-- 147

Query: 154 CKQVPNPTCGG--GACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATG 207
                + TC      C +   Y   S+ +  L +D IS    +++VP    FGC  + TG
Sbjct: 148 -----DCTCDNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAIFGCENQETG 202

Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
           +  S    G++GLGRG LS++ Q   + +   +FS C          G++ LG I  P  
Sbjct: 203 DLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDI--GGGAMILGGISPPSG 260

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           + +     +P RS  Y ++L AI V  + + + P           GT++DSGT +  L  
Sbjct: 261 MVFAE--SDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGK----HGTVLDSGTTYAYLPE 314

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMFS-GM 369
            A+TA +D   + + S   +  + G      D C+S        +    P + ++FS G 
Sbjct: 315 AAFTAFKDAMMKELTS---LKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVEMVFSNGQ 371

Query: 370 NVTLPQDNLLIHSTAG 385
            ++L  +N L     G
Sbjct: 372 KLSLSPENYLFQYYLG 387


>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 148/334 (44%), Gaps = 39/334 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTP++T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +PG+TFGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCNMDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP       F    +G   LG    
Sbjct: 120 ANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGGKIA 178

Query: 261 PKR--IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
             R  ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  
Sbjct: 179 ATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSEL 233

Query: 319 T----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGM 369
           +    R ++     +R++  RR  +            CY +  V     P I+L F  G 
Sbjct: 234 SYIPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGA 288

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
              L +  + +  +        +A AP    S++
Sbjct: 289 RFDLGRHGVFVERSVQEQDVWCLAFAPTESVSII 322


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 154/362 (42%), Gaps = 41/362 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y+    IGTP Q +   +D + +  W  CT C  C      +F+  +S+TF+ L C +  
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116

Query: 154 CKQVPNPT--CGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGN 208
           C+ +P  +  C    C +     +         DT ++         FGC+    K    
Sbjct: 117 CESIPESSRNCTSDVCIYEAPTKAGDTGGMAGTDTFAIGA-AKETLGFGCVVMTDKRLKT 175

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-------- 260
              P G++GLGR   SL+ Q   +  + FSYCL    A   SG+L LG   +        
Sbjct: 176 IGGPSGIVGLGRTPWSLVTQ---MNVTAFSYCL----AGKSSGALFLGATAKQLAGGKNS 228

Query: 261 --PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA-LQFNPTTGAGTIIDSGTV 317
             P  IK +    +   +  Y V L  I+ G        GA LQ   ++G+  ++D+ + 
Sbjct: 229 STPFVIKTSAGSSDNGSNPYYMVKLAGIKAG--------GAPLQAASSSGSTVLLDTVSR 280

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFS---GMNVTLP 374
            + L   AY A++      VG     +    +D C+S  +      L+F+   G  +T+P
Sbjct: 281 ASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFSKAVAGDAPELVFTFDGGAALTVP 340

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVL---NVIANMQQQNHRILYDVPNSRLGVAREL 431
             N L+ S  G++     ++A  N+   L   +++ ++QQ+N  +L+D+    L      
Sbjct: 341 PANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPAD 400

Query: 432 CT 433
           C+
Sbjct: 401 CS 402


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/162 (36%), Positives = 84/162 (51%), Gaps = 11/162 (6%)

Query: 76  AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV 135
           A ARK+VV  A    +     Y+V+  IGTP      A+DT++D  W  C  C GC   V
Sbjct: 70  ASARKAVV--AETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQV 127

Query: 136 ---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTIS 188
              FN   S+T+  L C +  C ++    CG     +C +  TY G++T    L+ D + 
Sbjct: 128 DPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLV 187

Query: 189 LATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQ 228
           +  D   G  FGC   +TG + PPQ  G++GLGRG LSL++Q
Sbjct: 188 IGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQ 229



 Score = 44.7 bits (104), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 52/131 (39%), Gaps = 10/131 (7%)

Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPT 361
           G IID  +  T L A  Y  + +     +       S  G D C+ +P       +  P 
Sbjct: 236 GMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPA 295

Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
           + L F G  + L +  L        + CL +  A     S+L    N QQQN ++LY++ 
Sbjct: 296 VALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSIL---GNFQQQNMQVLYNLR 352

Query: 422 NSRLGVARELC 432
             R+   +  C
Sbjct: 353 RGRVTFVQSPC 363


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 86/331 (25%), Positives = 131/331 (39%), Gaps = 42/331 (12%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC------------TGCVG 130
           +P+ S   I     Y+V  + GTPA    + +DT+ND  W+ C            T  VG
Sbjct: 113 LPMRSALNIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVG 172

Query: 131 CSS-----------TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY------ 173
                           +  A+S++++ + C   +C  +P  TC   + A + +Y      
Sbjct: 173 AGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQD 232

Query: 174 GSSTIAANLSQDTISLATD----IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQ 228
           G+ T+     +      +D     +PG   GC     G SV    G+L LG G +S    
Sbjct: 233 GTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVH 292

Query: 229 TQNLYQSTFSYCLPSFKALSFSGS-LRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLL 284
               +   FS+CL S  +   + S L  GP   +  P  ++ T ++ N      Y   + 
Sbjct: 293 AAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME-TDIVYNVDVKPAYGPLVT 351

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            I VG   +DIP          G G I+D+ T  T LV  AY AV     R +     V 
Sbjct: 352 GIFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVY 411

Query: 345 SLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
            L GF+ CY        + L     NVT+P+
Sbjct: 412 ELDGFEYCYRWTFAGDGVDLAH---NVTVPR 439


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 105/402 (26%), Positives = 169/402 (42%), Gaps = 41/402 (10%)

Query: 55  EESVLEMLAKDQARLQFLSSLA-VARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
           + S +E     +++++ L S+   AR S++P   G        ++V   IG+P  T L+ 
Sbjct: 67  QTSSIERFDFLESKIKELKSVGNEARSSLIPFNRG------SGFLVNLSIGSPPVTQLVV 120

Query: 114 MDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAF 169
           +DT +   WV C  C+ C   S++ F+  +S +FK LGC       +    C       +
Sbjct: 121 VDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEY 180

Query: 170 NLTY-GSSTIAANLSQDTISLATDIVPG------YTFGC--IQKATGNSVPPQGLLGLGR 220
            L Y G  +    L+++++   T +  G       TFGC  +   T N     G+ GLG 
Sbjct: 181 KLRYLGGDSSQGILAKESLLFET-LDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGA 239

Query: 221 G-SLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL 278
              +++  Q  N     FSYC+      L     L LG  G       TPL         
Sbjct: 240 YPHITMATQLGN----KFSYCIGDINNPLYTHNHLVLGQ-GSYIEGDSTPL---QIHFGH 291

Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA----YTAVRDVFR 334
           YYV L +I VG + + I P A + +     G +IDSG  +T+L        Y  + D+ +
Sbjct: 292 YYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMK 351

Query: 335 RRVGSNLTVTSLGG--FDTCYSVPIVA-PTITLMFSGMNVTLPQDNLLIHSTAGSITCLA 391
             +    T     G  F    S  +V  P +T  F+G    + +   L     G   CLA
Sbjct: 352 GLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLA 411

Query: 392 MAAAPDNVNSV-LNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +   P N   + L+VI  + QQN+ + +D+   ++   R  C
Sbjct: 412 I--LPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 84/330 (25%), Positives = 127/330 (38%), Gaps = 42/330 (12%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC----------------- 125
           +P+ S   I     Y+V  +IGTPA    + +DT+ D  W+ C                 
Sbjct: 111 LPMRSALNIAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQ 170

Query: 126 ------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY------ 173
                  G    S   +  A+S++++ + C   +C  +P  TC   + A + +Y      
Sbjct: 171 TMSMGGEGAKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQD 230

Query: 174 GSSTIAANLSQDTISLATD----IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQ 228
           G+ TI     +      +D     +PG   GC     G SV    G+L LG G +S    
Sbjct: 231 GTVTIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVH 290

Query: 229 TQNLYQSTFSYCLPSFKALSFSGS-LRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLL 284
               +   FS+CL S  +   + S L  GP   +  P  ++ T +L N      Y   + 
Sbjct: 291 AAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME-TDILYNVDVKPAYGAQVT 349

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            + VG   +DIP          G G I+D+ T  T LV  AY  V     R +     V 
Sbjct: 350 GVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVY 409

Query: 345 SLGGFDTCYSVPIVAPTITLMFSGMNVTLP 374
            L GF+ CY        +       NVT+P
Sbjct: 410 ELEGFEYCYKWTFTGDGVD---PAHNVTIP 436


>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
          Length = 204

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 97/207 (46%), Gaps = 12/207 (5%)

Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQP-KRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
           + ++ FSYCL S    S +  L LG + +  K    TPLL NP + S YY++L  I VG 
Sbjct: 1   MKEAKFSYCLTSMDD-SKASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGG 59

Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
             + I       +     G IIDSGT  T L    +  ++  F  +    L  +S  G D
Sbjct: 60  TQLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGLD 119

Query: 351 TCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
            C+S+P     +  P +   F G ++ LP ++ +I  +   + CLAM A+     + +++
Sbjct: 120 VCFSLPSETTQVEVPKLVFHFKGGDLELPAESYMIADSKLGVACLAMGAS-----NGMSI 174

Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
             N+QQQN  + +D+    +      C
Sbjct: 175 FGNVQQQNILVNHDLEKETISFVPTQC 201


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 115/442 (26%), Positives = 182/442 (41%), Gaps = 56/442 (12%)

Query: 33  TLQVFHVFSPCSPFKPSKPLSWEESV-LEML-AKDQARLQFLSSLAVARKSVVPIASGRQ 90
           T  V H  SP S     +     + V LE+L A+DQAR   L    V       +     
Sbjct: 20  TAAVVHCGSPASLLTLERAFPVNQRVELEVLRARDQARHGRLLRGVVGGVVDFTVYGTSD 79

Query: 91  ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
                 Y  + K+G+P +   + +DT +D  WV C  C  C  T     + + F      
Sbjct: 80  PYLVGLYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSS 139

Query: 151 AAQCKQVPNPTCG-------------GGACAFNLTYGSST------IAANLSQDTI---S 188
                   +P C                 C+++  YG  +      ++  L  DT+   S
Sbjct: 140 TTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDS 199

Query: 189 LATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQTQNL--YQSTFSYCLP 242
           L  +      FGC    +G+         G+ G G+  LS+++Q  +L      FS+CL 
Sbjct: 200 LIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLK 259

Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
                   G L LG I +P  I Y+PL+  P +S  Y +NL +I V  +++ I P    F
Sbjct: 260 GEG--DGGGKLVLGEILEPN-IIYSPLV--PSQSH-YNLNLQSISVNGQLLPIDPAV--F 311

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPI 357
             +   GTI+DSGT  T LV  AY          V S+ T V S G  + CY    SV  
Sbjct: 312 ATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSSSTTPVLSKG--NQCYLVSTSVDE 369

Query: 358 VAPTITLMFS-GMNVTLPQDNLLIH---STAGSITCLAM--AAAPDNVNSVLNVIANMQQ 411
           + P ++L F+ G ++ L     L+H   S   ++ C+     A P      + ++ ++  
Sbjct: 370 IFPPVSLNFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPG-----ITILGDLVL 424

Query: 412 QNHRILYDVPNSRLGVARELCT 433
           ++   +YD+ + R+G A   C+
Sbjct: 425 KDKIFVYDLAHQRIGWANYDCS 446


>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 163/368 (44%), Gaps = 62/368 (16%)

Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ----CKQVPN 159
            QT  + +DT +   +VPC GC  C       ++  +S  F+ L C  A     C++   
Sbjct: 48  GQTYDLIVDTGSARTYVPCKGCARCGEHAHGYYDYDRSMEFERLDCGEASDATLCEETMK 107

Query: 160 PTC-GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGY-TFGCIQKATGNSVPPQ--- 213
            TC   G C++ ++Y   S+    + +D + L    +     FGC ++A  N++  Q   
Sbjct: 108 GTCQSDGRCSYVVSYAEGSSSRGYVVRDRVRLGEGTLSAMLAFGC-EEAETNAIYEQKAD 166

Query: 214 GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPI---GQPKRIKYTP 268
           GL G GRG+ ++ AQ  +  L ++ FS+C+  F A    G L LG          +  TP
Sbjct: 167 GLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGA--NGGVLTLGRFDFGADAPALARTP 224

Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
           L+ +P   + + V   + ++G  +++        N  T   T +DSGT FT +    + +
Sbjct: 225 LVADPANPAFHNVRTSSWKLGDSLIE------HLNSYT---TTLDSGTTFTFVPRSVWVS 275

Query: 329 VRDVFRRRVGSNLTVTSLGGF--------DTCYSVPIVAPTITLMFS------------- 367
               F+ R+ +  T   L           D CY V   A  +TL  S             
Sbjct: 276 ----FKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLTIAY 331

Query: 368 --GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
             G+++TL P++ L  H T  +  C+ + A P+  N +L  +  +  ++  + +DV NSR
Sbjct: 332 EGGVSLTLGPENYLFAHETNSAAFCVGIFANPN--NQIL--LGQITMRDTLMEFDVANSR 387

Query: 425 LGVARELC 432
           +G+A   C
Sbjct: 388 VGMAPANC 395


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 157/376 (41%), Gaps = 55/376 (14%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ 156
           Y  + K+GTP     + +DT +D  WV C  C GC  +     Q   F      ++    
Sbjct: 79  YFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLVS 138

Query: 157 VPNPTCGGG-------------ACAFNLTY--GSSTIAANLSQDTISLATDIVPGYT--- 198
             +P C                 C++   Y  GS T    +S+   S+  D+V G +   
Sbjct: 139 CSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSE---SMYFDMVMGQSMIA 195

Query: 199 -------FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFK 245
                  FGC    +G+         G+ G G G LS+++Q   + +    FS+CL    
Sbjct: 196 NSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGEG 255

Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
             +  G L LG + +P  I Y+PL+ +    +LY   L +I V  + + I P    F  +
Sbjct: 256 --NGGGILVLGEVLEPG-IVYSPLVPSQPHYNLY---LQSISVNGQTLPIDPSV--FATS 307

Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPT 361
              GTIIDSGT    LV  AYT         V  ++T T   G + CY    SV  + P 
Sbjct: 308 INRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKG-NQCYLVSTSVGEIFPL 366

Query: 362 ITLMFSG-MNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
           ++L F+G  ++ L  +  L+H       ++ C+      + V     ++ ++  ++   +
Sbjct: 367 VSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGV----TILGDLVMKDKIFV 422

Query: 418 YDVPNSRLGVARELCT 433
           YD+   R+G A   C+
Sbjct: 423 YDLARQRIGWASYDCS 438


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 103/428 (24%), Positives = 174/428 (40%), Gaps = 53/428 (12%)

Query: 33  TLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
           ++ + H  SP SP + PS      E+  E L +   R    S  +++  +  P  S    
Sbjct: 36  SIDLIHRDSPKSPLYNPS------ETPAERLDRFFRRFMSFSEASISPNTPEPPVS---- 85

Query: 92  TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLG 148
           + +  Y+++  IGTP   +    DT +D  W  C  C+ C    + +F+ ++ST+FK + 
Sbjct: 86  SNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVS 145

Query: 149 CQAAQCKQVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATD-----IVPGYTFG 200
           C++ QC+ +   +C      C F+  YG  ++A   ++ +T++L ++      +    FG
Sbjct: 146 CESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVFG 205

Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKAL-SFSGSLRLG 256
           C    +G  +    GL G G   LSL +Q  +   S   FS CL  F+   S +  +  G
Sbjct: 206 CGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFG 265

Query: 257 PIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
           P  +     +  TPL+      + Y+V L  I VG ++           P + +  +   
Sbjct: 266 PEAEVSGSDVVSTPLVTK-DDPTYYFVTLDGISVGDKLF----------PFSSSSPMATK 314

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT-------CY--SVPIVAPTITLM 365
           G VF     P     RD + R V        +            CY  +  I  P +T  
Sbjct: 315 GNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAH 374

Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           F G +V L   N  I    G + C AM      ++    +  N  Q N  I +D+   ++
Sbjct: 375 FDGADVQLKPLNTFISPKEG-VYCFAMQP----IDGDTGIFGNFVQMNFLIGFDLDGKKV 429

Query: 426 GVARELCT 433
                 CT
Sbjct: 430 SFKAVDCT 437


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 100/382 (26%), Positives = 164/382 (42%), Gaps = 48/382 (12%)

Query: 84  PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGCSST------VF 136
           P+    +I +   + +   +GTP    L+ +DT +  +WV C  C + C +T      VF
Sbjct: 63  PVVGNHEIHEG-KFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVF 121

Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG-------GGACAFNLTYGSS----TIAANLSQD 185
           +  +STT++ +GC +  C  V                C ++L YGS       A  L  D
Sbjct: 122 DPDKSTTYELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTD 181

Query: 186 TISLA--TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT--QNLYQSTFSYCL 241
            ++LA  + I+ G+ FGC    +       G++G G  + S   Q   Q  Y++ FSYC 
Sbjct: 182 KLTLASSSSIIDGFIFGCSGDDSFKGY-ESGVIGFGGANFSFFNQVARQTNYRA-FSYCF 239

Query: 242 PSFKALSFSGSLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGA 299
           P        G L +G    PK  + YT L+ +    S+Y +  + + V G R+       
Sbjct: 240 PGDHTA--EGFLSIG--AYPKDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRL------Q 289

Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------ 353
           +  +  T    ++DSGTV T L+ P + A        + +   ++   G +TC+      
Sbjct: 290 VDQSEYTKRMMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTETCFRPNGGD 349

Query: 354 SVPIVA-PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV--IANMQ 410
           SV     PT+ + F G  + LP +N+  H    S   + +A  PD V  V NV  + N  
Sbjct: 350 SVDSGDLPTVEMRFIGTTLKLPPENVF-HDLLPSHDKICLAFKPD-VAGVRNVQILGNKA 407

Query: 411 QQNHRILYDVPNSRLGVARELC 432
             + R++YD+     G     C
Sbjct: 408 TXSFRVVYDLQAMYFGFQAGAC 429


>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like, partial [Brachypodium distachyon]
          Length = 364

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 95/304 (31%), Positives = 138/304 (45%), Gaps = 47/304 (15%)

Query: 167 CAFNLTY--GSSTIAANLSQDTISLATDIVPGY--TFGCIQKATGNS---VPPQGLLGLG 219
           C  +L+Y  GSS+  A L+ D  ++ +   P     FGC+  A  +S   V   GLLG+ 
Sbjct: 59  CRVSLSYADGSSSDGA-LATDVFAVGS-ATPSLRAAFGCMASAFDSSPDGVASAGLLGMN 116

Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK--RIKYTPL----LKNP 273
           RG+LS ++Q        FSYC+        +G L LG    P    + YTPL    L  P
Sbjct: 117 RGALSFVSQAGT---RRFSYCISDRDD---AGVLLLGHSDLPNFLPLNYTPLYQPSLPLP 170

Query: 274 RRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
               + Y V LL I VG + + IP   L  + T    T++DSGT FT L+  AY A++  
Sbjct: 171 YFDRVAYSVQLLGILVGSKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYAALKAE 230

Query: 333 FRR------RVGSNLTVTSLGGFDTCYSVP--------IVAPTITLMFSGMNVTLPQDNL 378
           F R      R     +    G FDTC+ VP         + P++TL F+G  + +  D L
Sbjct: 231 FYRQSTPFLRALDEPSFAFQGAFDTCFRVPRGMSPPPGRLLPSVTLRFNGAEMVVGGDRL 290

Query: 379 LIH----------STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
           L            +   ++ CL    A D V  +  VI +  Q N  + YD+   R+G+A
Sbjct: 291 LYKVPGERRGGAGADDDAVWCLTFGNA-DMVPIMAYVIGHHHQMNLWVEYDLERGRVGLA 349

Query: 429 RELC 432
           +  C
Sbjct: 350 QVRC 353


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 116/426 (27%), Positives = 172/426 (40%), Gaps = 67/426 (15%)

Query: 40  FSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK---SVVPIASGRQITQSPT 96
           + P    K    L  E S    LA  QAR++   SL        SV P  +GR I     
Sbjct: 50  YKPNETAKDRMELDIEHSAAR-LAYIQARIE--GSLVYNNDYTASVSPSLTGRTI----- 101

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNL-----G 148
            +V   IG P+   L+ MDT +D  W+ C  C  C +    +F+ + S+TF  L     G
Sbjct: 102 -LVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPLCKTPCG 160

Query: 149 CQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
            +  +C  +P         + + T+G   +    + +  S  +D++     GC      N
Sbjct: 161 FKGCKCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVI----IGCGHNIGFN 216

Query: 209 SVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-GSLRLGPIGQPKRIKY 266
           S P   G+LGL  G  SL  Q        FSYC+ +     ++   LRLG  G       
Sbjct: 217 SDPGYNGILGLNNGPNSLATQIGR----KFSYCIGNLADPYYNYNQLRLGE-GADLEGYS 271

Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA- 325
           TP          YYV +  I VG + +DI     +       G I+DSGT  T LV  A 
Sbjct: 272 TPF---EVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSAH 328

Query: 326 ---YTAVRDV----FRRRVGSNLTVTSLGGFDTCY----SVPIVA-PTITLMF-SGMNVT 372
              Y  VR++    FR+ +  N        +  CY    S  +V  P +T  F  G ++ 
Sbjct: 329 KLLYNEVRNLLKWSFRQVIFEN------APWKLCYYGIISRDLVGFPVVTFHFVDGADLA 382

Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN------VIANMQQQNHRILYDVPNSRLG 426
           L   +    S    I C+ ++ A     S+LN      VI  + QQ++ + YD+ N  + 
Sbjct: 383 LDTGSFF--SQRDDIFCMTVSPA-----SILNTTISPSVIGLLAQQSYNVGYDLVNQFVY 435

Query: 427 VARELC 432
             R  C
Sbjct: 436 FQRIDC 441


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 119/473 (25%), Positives = 197/473 (41%), Gaps = 69/473 (14%)

Query: 1   MKPQLVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVL 59
           M  Q++      F  +LS   +P       + ++++ H  SP SP + P           
Sbjct: 1   MATQILLCFFLFFSVTLSSSGHP------KNFSVELIHRDSPLSPIYNP----------- 43

Query: 60  EMLAKDQARLQFLSSLAVARK-----SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
           ++   D+    FL S++ +R+     S   + SG  I     + +   IGTP   +    
Sbjct: 44  QITVTDRLNAAFLRSVSRSRRFNHQLSQTDLQSGL-IGADGEFFMSITIGTPPIKVFAIA 102

Query: 115 DTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG----GGAC 167
           DT +D  WV C  C  C      +F+  +S+T+K+  C +  C+ + +   G       C
Sbjct: 103 DTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNIC 162

Query: 168 AFNLTYGSSTIA-ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGR 220
            +  +YG  + +  +++ +T+S+ +        PG  FGC     G       G++GLG 
Sbjct: 163 KYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGG 222

Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS--LRLGPIGQPKRIKY------TPLL-K 271
           G LSL++Q  +     FSYCL S K+ + +G+  + LG    P  +        TPL+ K
Sbjct: 223 GHLSLISQLGSSISKKFSYCL-SHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDK 281

Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT-------TGAGTIIDSGTVFTRLVAP 324
            P   + YY+ L AI VG++   IP     +NP        T    IIDSGT  T L A 
Sbjct: 282 EPL--TYYYLTLEAISVGKK--KIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAG 337

Query: 325 AYTAVRDVFRRRV-GSNLTVTSLGGFDTCY---SVPIVAPTITLMFSGMNVTLPQDNLLI 380
            +          V G+       G    C+   S  I  P IT+ F+G +V L   N  +
Sbjct: 338 FFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFV 397

Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
              +  + CL+M    +     + +  N  Q +  + YD+    +      C+
Sbjct: 398 K-LSEDMVCLSMVPTTE-----VAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444


>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
 gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
          Length = 538

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 86/331 (25%), Positives = 131/331 (39%), Gaps = 42/331 (12%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC------------TGCVG 130
           +P+ S   I     Y+V  + GTPA    + +DT+ND  W+ C            T  VG
Sbjct: 113 LPMRSALNIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVG 172

Query: 131 CSS-----------TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY------ 173
                           +  A+S++++ + C   +C  +P  TC   + A + +Y      
Sbjct: 173 AGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQD 232

Query: 174 GSSTIAANLSQDTISLATD----IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQ 228
           G+ T+     +      +D     +PG   GC     G SV    G+L LG G +S    
Sbjct: 233 GTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVH 292

Query: 229 TQNLYQSTFSYCLPSFKALSFSGS-LRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLL 284
               +   FS+CL S  +   + S L  GP   +  P  ++ T ++ N      Y   + 
Sbjct: 293 AAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME-TDIVYNVDVKPAYGPLVT 351

Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
            I VG   +DIP          G G I+D+ T  T LV  AY AV     R +     V 
Sbjct: 352 GIFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVY 411

Query: 345 SLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
            L GF+ CY        + L     NVT+P+
Sbjct: 412 ELDGFEYCYRWTFAGDGVDLTH---NVTVPR 439


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 116/441 (26%), Positives = 185/441 (41%), Gaps = 63/441 (14%)

Query: 33  TLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK-----SVVPIA 86
           ++++ H  SP SP + P   ++           D+    FL S++ +R+     S   + 
Sbjct: 27  SVELIHRDSPLSPLYNPKNTVT-----------DRLNAAFLRSISRSRRLNNILSQTDLQ 75

Query: 87  SGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTT 143
           SG  I     + +   IGTP   +    DT +D  WV C  C  C   +  +F+  +S+T
Sbjct: 76  SGL-IGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSST 134

Query: 144 FKNLGCQAAQCKQVPNPTCG----GGACAFNLTYGS-----STIAAN-LSQDTISLATDI 193
           +K+  C +  C  + +   G       C +  +YG        +A   +S D+ S +   
Sbjct: 135 YKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVS 194

Query: 194 VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
            PG  FGC     G       G++GLG G LSL++Q  +     FSYCL S K+ + +G+
Sbjct: 195 FPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL-SHKSATTNGT 253

Query: 253 --LRLGPIGQPKRIKY------TPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
             + LG    P  +        TPL+ K PR  + YY+ L AI VG++   IP     +N
Sbjct: 254 SVINLGTNSIPSSLSKDSGVISTPLVDKEPR--TYYYLTLEAISVGKK--KIPYTGSSYN 309

Query: 304 PTTG-------AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCY-- 353
           P  G          IIDSGT  T L +  +          V G+       G    C+  
Sbjct: 310 PNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFKS 369

Query: 354 -SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
            S  I  P IT+ F+G +V L   N  +   +  + CL+M    +     + +  N  Q 
Sbjct: 370 GSAEIGLPEITVHFTGADVRLSPINAFVK-VSEDMVCLSMVPTTE-----VAIYGNFAQM 423

Query: 413 NHRILYDVPNSRLGVARELCT 433
           +  + YD+    +   R  C+
Sbjct: 424 DFLVGYDLETRTVSFQRMDCS 444


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 92/366 (25%), Positives = 156/366 (42%), Gaps = 51/366 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +D+ +   +VPC  C  C +     F    S+T+  + C +A 
Sbjct: 85  YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC-SAD 143

Query: 154 CKQVPNPTCGGGA--CAFNLTYGS-STIAANLSQDTISLAT--DIVPGY-TFGCIQKATG 207
           C      TC      C +   Y   S+ +  L +D +S  T  ++ P    FGC    TG
Sbjct: 144 C------TCDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSETG 197

Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQPK 262
           +  S    G++GLGRG LS++ Q   + +   +FS C   +  +    G++ LG +  P 
Sbjct: 198 DLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC---YGGMDIGGGAMVLGAMPAPP 254

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
            + ++    +P RS  Y + L  I V  + + + P        +  GT++DSGT +  L 
Sbjct: 255 DMVFS--RSDPVRSPYYNIELKEIHVAGKALRLDPRIFD----SKHGTVLDSGTTYAYLP 308

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-------------PTITLMF-SG 368
             A+ A +D    +V     +  + G D  Y     A             P + ++F  G
Sbjct: 309 EQAFVAFKDAVTSKV---RPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDG 365

Query: 369 MNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
             ++L P++ L  HS      CL +     +  ++L  I     +N  + YD  N ++G 
Sbjct: 366 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKIGF 422

Query: 428 ARELCT 433
            +  C+
Sbjct: 423 WKTNCS 428


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 89/359 (24%), Positives = 153/359 (42%), Gaps = 39/359 (10%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +DT +   +VPC+ C  C       F    S+T+     Q  +
Sbjct: 81  YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTY-----QPVK 135

Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN- 208
           C    N       C +   Y   ST +  L +D +S    +++ P    FGC    TG+ 
Sbjct: 136 CTLDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENVETGDL 195

Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            S    G++GLGRG LS++ Q   +N+   +FS C          G++ LG I  P  + 
Sbjct: 196 YSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDV--GGGAMVLGGISPPSDMV 253

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
           +     +P RS  Y ++L  I V  + + + P           G+++DSGT +  L   A
Sbjct: 254 FAQ--SDPVRSPYYNIDLKEIHVAGKRLPLNPSVFD----GKHGSVLDSGTTYAYLPEEA 307

Query: 326 YTAVRDVFRRRVG--SNLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMNVTL- 373
           + A ++   + +   S ++       D C+S        +    P + ++F +G   +L 
Sbjct: 308 FLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLS 367

Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           P++ +  HS      CL +     +  ++L  I     +N  +LYD   +++G  +  C
Sbjct: 368 PENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIV---VRNTLVLYDREQTKIGFWKTNC 423


>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 404

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 82/244 (33%), Positives = 109/244 (44%), Gaps = 26/244 (10%)

Query: 199 FGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF-SGSLRLG 256
           FGC     G  S    G + LG G  SL +QT + Y   FSYC+P   A  F S    +G
Sbjct: 177 FGCSHSVRGRFSGQTSGTMSLGGGRQSLRSQTASAYGDAFSYCVPQPSASGFLSLGGAIG 236

Query: 257 PIGQPKRIKYTPLLK--NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
             G       TPL+   NP   + Y V L  I V  R +++PP          AGT++DS
Sbjct: 237 SSGSGSGFASTPLVATANP---TFYVVRLQGIDVAGRRLNVPPAVFS------AGTLMDS 287

Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVP----IVAPTITLMFSG 368
             V T+L   AY A+R  FR  +     V + G    DTCY       +  P ++L+FSG
Sbjct: 288 SAVVTQLPPTAYRALRRAFRNAMRRYRRVPAGGKQILDTCYDFEGLGNVTVPAVSLVFSG 347

Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
             V   +   ++        CLA    P   +S L  I N+QQQ H +LYDV    +G  
Sbjct: 348 GAVVRLEPMAVMMEG-----CLAFVPTP--ADSDLGFIGNVQQQTHEVLYDVGARNVGFR 400

Query: 429 RELC 432
           R  C
Sbjct: 401 RGAC 404


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 89/362 (24%), Positives = 153/362 (42%), Gaps = 41/362 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y+    IGTP Q +   +D + +  W  CT C  C      +F+  +S+TF+ L C +  
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116

Query: 154 CKQVPNPT--CGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGN 208
           C+ +P  +  C    C +     +         DT ++         FGC+    K    
Sbjct: 117 CESIPESSRNCTSDVCIYEAPTKAGDTGGKAGTDTFAIGA-AKETLGFGCVVMTDKRLKT 175

Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-------- 260
              P G++GLGR   SL+ Q   +  + FSYCL    A   SG+L LG   +        
Sbjct: 176 IGGPSGIVGLGRTPWSLVTQ---MNVTAFSYCL----AGKSSGALFLGATAKQLAGGKNS 228

Query: 261 --PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA-LQFNPTTGAGTIIDSGTV 317
             P  IK +    +   +  Y V L  I+ G        GA LQ   ++G+  ++D+ + 
Sbjct: 229 STPFVIKTSAGSSDNGSNPYYMVKLAGIKTG--------GAPLQAASSSGSTVLLDTVSR 280

Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFS---GMNVTLP 374
            + L   AY A++      VG     +    +D C+   +      L+F+   G  +T+P
Sbjct: 281 ASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAPELVFTFDGGAALTVP 340

Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVL---NVIANMQQQNHRILYDVPNSRLGVAREL 431
             N L+ S  G++     ++A  N+   L   +++ ++QQ+N  +L+D+    L      
Sbjct: 341 PANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPAD 400

Query: 432 CT 433
           C+
Sbjct: 401 CS 402


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 154/365 (42%), Gaps = 49/365 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAA- 152
           Y  R  IGTP Q   + +D+ +   +VPC  C  C +     F    S+T+  + C    
Sbjct: 88  YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147

Query: 153 QCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT--DIVPGYT-FGCIQKATGN 208
            C    N       C +   Y   S+ +  L +D +S  T  ++ P    FGC    TG+
Sbjct: 148 TCDSDKN------QCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSETGD 201

Query: 209 --SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQPKR 263
             S    G++GLGRG LS++ Q   + +   +FS C   +  +    G++ LG +  P  
Sbjct: 202 LFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC---YGGMDIGGGAMVLGAMPAPPG 258

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           + YT    N  RS  Y + L  + V  + + + P           GT++DSGT +  L  
Sbjct: 259 MIYT--HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGK----HGTVLDSGTTYAYLPE 312

Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------------VPIVAPTITLMF-SGM 369
            A+ A +D    +V     +  + G D+ Y              +  V P + ++F +G 
Sbjct: 313 QAFVAFKDAVSSQV---HPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQ 369

Query: 370 NVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            ++L P++ L  HS      CL +     +  ++L  I     +N  + YD  N ++G  
Sbjct: 370 KLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKIGFW 426

Query: 429 RELCT 433
           +  C+
Sbjct: 427 KTNCS 431


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 103/388 (26%), Positives = 168/388 (43%), Gaps = 52/388 (13%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
           +P+      T++  Y  R  IGTPA+   + +DT +D  WV C  C GC        + T
Sbjct: 76  LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELT 135

Query: 143 TFKNLGCQAAQ---CKQ---VPN-----PTCGGGA-CAFNLTYGSSTIAAN------LSQ 184
            +   G Q+ +   C Q   V N     P+C   + C ++++YG  +  A       L  
Sbjct: 136 MYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQY 195

Query: 185 DTISLATDIVPG---YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQS 235
           + +S      P     +FGC  K  G+    ++   G+LG G+ + S+L+Q       + 
Sbjct: 196 NQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRK 255

Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
            F++CL +       G   +G + QPK +K TPL+ +      Y V L  I VG   + +
Sbjct: 256 MFAHCLDTVNG---GGIFAIGNVVQPK-VKTTPLVPDMPH---YNVILKGIDVGGTALGL 308

Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY- 353
           P     F+     GTIIDSGT    +    Y A+   VF +    +++V +L  F +C+ 
Sbjct: 309 PTNI--FDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKH--QDISVQTLQDF-SCFQ 363

Query: 354 ---SVPIVAPTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNV 405
              SV    P +T  F G +V+L   P D L       ++ C+              L +
Sbjct: 364 YSGSVDDGFPEVTFHFEG-DVSLIVSPHDYLF--QNGKNLYCMGFQNGGGKTKDGKDLGL 420

Query: 406 IANMQQQNHRILYDVPNSRLGVARELCT 433
           + ++   N  +LYD+ N  +G A   C+
Sbjct: 421 LGDLVLSNKLVLYDLENQAIGWADYNCS 448


>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 162/370 (43%), Gaps = 58/370 (15%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
           Y     +GTP  + ++A+DT +D  WVPC  C+ C+              ++  A+STT 
Sbjct: 143 YYTWVDVGTPNTSFMVALDTGSDLFWVPCD-CIECAPLAGYRETLDRDLGIYKPAESTTS 201

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTY--GSSTIAANLSQDTISLATD-----IVPGY 197
           ++L C    C      +     C ++  Y   ++T +  L +D + L +      +    
Sbjct: 202 RHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPVKASV 261

Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
             GC +K +G   + + P GLLGLG   +S+   LA+   L +++FS C   FK    SG
Sbjct: 262 VIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARA-GLVRNSFSMC---FK--EDSG 315

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            +  G  G   + + TP +    +   Y VN+    VG +  +           T    +
Sbjct: 316 RIFFGDQGVSIQ-QSTPFVPLYGKYQTYAVNVDKSCVGHKCFE----------ATSFEAL 364

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
           +DSGT FT L    Y AV   F ++V +         F+ CYS     +P V PT+TL F
Sbjct: 365 VDSGTSFTALPLNVYKAVAVEFDKQVHAPRITQEDASFEYCYSASPLKMPDV-PTVTLTF 423

Query: 367 SGMNVTLPQDN--LLIHSTAGSIT--CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
           +  N +    N  +++    GS+   CLA+  +P+ +     +I       + I++D  N
Sbjct: 424 AA-NKSFQAVNPTIVLKDGEGSVAGFCLALQKSPEPI----GIIGQNFLTGYHIVFDKEN 478

Query: 423 SRLGVARELC 432
            +LG  R  C
Sbjct: 479 MKLGWYRSEC 488


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 142/339 (41%), Gaps = 41/339 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y+++ ++GTP   +   +DT +D  W   +PCT C    + +F+ + S+TFK        
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFK-------- 112

Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD-----IVPGYTFGCIQKATG 207
                   C G +C + + Y  +T +   L+ +T+++ +      ++P  T GC   ++ 
Sbjct: 113 -----EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIGQPKRIK 265
                 G++GL  G  SL+ Q    Y    SYC  S     ++F  +  +   G    + 
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDG----VV 223

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            T +     +  LYY+NL A+ VG   V+       F+   G   IIDSGT  T      
Sbjct: 224 STTMFLTTAKPGLYYLNLDAVSVGDTHVETM--GTTFHALEG-NIIIDSGTTLTYFPVSY 280

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA--PTITLMFS-GMNVTLPQDNLLIHS 382
              VR+     V +  T    G    CY    +   P IT+ FS G ++ L + N+ I +
Sbjct: 281 CNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNMYIET 340

Query: 383 TAGSITCLAMAAA--PDNVNSVLNVIANMQQQNHRILYD 419
                 CLA+     P +      +  N  Q N  + YD
Sbjct: 341 ITRGTFCLAIICNNPPQDA-----IFGNRAQNNFLVGYD 374


>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
           distachyon]
          Length = 473

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 111/397 (27%), Positives = 163/397 (41%), Gaps = 44/397 (11%)

Query: 63  AKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPA--QTLLMAMDTSNDA 120
           AK + R +     A A  +   I +      S  Y V   +GT    +   + MD +   
Sbjct: 67  AKQEVRCRIAHRFAGADITAASIRTYLCPPASMVYAVAVGVGTEHGYENYELEMDMAAGF 126

Query: 121 AWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSST 177
           +W+ C  C  C    + VF+ A+S TF+ +    A   + P      G C F + Y +  
Sbjct: 127 SWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHNAVLCRPPYHPLQDGRCGFGIAYRNGA 186

Query: 178 IAAN-LSQDTISLAT-----DIVPGYTFGCIQK----ATGNSVPPQGLLGLGRGSLS--L 225
            AA  L++DT S  T       +PG  FGC  +     T  ++   G+LG+G G+    L
Sbjct: 187 SAAGYLARDTFSFPTGDNNFQHLPGIVFGCANRIARFDTHGAL--AGVLGMGMGAEGKPL 244

Query: 226 LAQTQNLYQS---TFSYC--LPSFKALSFSGSLRLG---PIGQPKRIKYTPL--LKNPRR 275
               + LY +    FSYC  +P   A SF   LR G   P   P  +    +  L     
Sbjct: 245 TGFMRQLYHNGGGRFSYCPIVPGTTAYSF---LRFGNDIPSQPPAGVHRQSMAVLAPTTT 301

Query: 276 SSLYYVNLLAIRVGR-RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
           S  YYV L  I VG  RV  + P   + +     G  ID GT  T +V  AY  V    R
Sbjct: 302 SEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGGCAIDIGTKMTAIVQTAYAHVEAAVR 361

Query: 335 RRVGSNLT--VTSLGGFDTCYSVPIVA---PTITLMFSG---MNVTLPQDNLLIHSTAGS 386
             +  N    V S G     +  P +    P++TL F G   + V      L++ S  G 
Sbjct: 362 GHLQRNRARFVQSPGHHLCVHRTPAIEERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGG 421

Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
              L +   PD   + + VI  MQQ + R ++D+ N+
Sbjct: 422 GEYLCLGLVPD---AEMTVIGAMQQIDTRFIFDLHNN 455


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 142/339 (41%), Gaps = 41/339 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
           Y+++ ++GTP   +   +DT +D  W   +PCT C    + +F+ + S+TFK        
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFK-------- 112

Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD-----IVPGYTFGCIQKATG 207
                   C G +C + + Y  +T +   L+ +T+++ +      ++P  T GC   ++ 
Sbjct: 113 -----EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167

Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIGQPKRIK 265
                 G++GL  G  SL+ Q    Y    SYC  S     ++F  +  +   G    + 
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDG----VV 223

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
            T +     +  LYY+NL A+ VG   V+       F+   G   IIDSGT  T      
Sbjct: 224 STTMFLTTAKPGLYYLNLDAVSVGDTHVETM--GTTFHALEG-NIIIDSGTTLTYFPVSY 280

Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA--PTITLMFS-GMNVTLPQDNLLIHS 382
              VR+     V +  T    G    CY    +   P IT+ FS G ++ L + N+ I +
Sbjct: 281 CNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNMYIET 340

Query: 383 TAGSITCLAMAAA--PDNVNSVLNVIANMQQQNHRILYD 419
                 CLA+     P +      +  N  Q N  + YD
Sbjct: 341 ITRGTFCLAIICNNPPQDA-----IFGNRAQNNFLVGYD 374


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 163/365 (44%), Gaps = 58/365 (15%)

Query: 91  ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-------CSSTVFNSAQSTT 143
           IT+S  Y++   +GTP   LL   DT +D  WV C+   G         + VF   +S+T
Sbjct: 97  ITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSST 156

Query: 144 FKNLGCQAAQCKQVPNPTCGGGA-CAFNLTY--GSSTIAANLSQDTISLATD------IV 194
           +  L CQ+  C+ +   +C   + C +  +Y  GS TI   LS +T S           V
Sbjct: 157 YSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGV-LSTETFSFVDGGGKGQVRV 215

Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCL-PSFKALSFSG 251
           P   FGC   A+  +    GL+GLG G+ SL++Q           SYCL PS+ A S S 
Sbjct: 216 PRVNFGC-STASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANS-SS 273

Query: 252 SLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
           +L  G    + +P     TPL+ +    S Y V L ++ VG + V           T  +
Sbjct: 274 TLNFGSRAVVSEPGAAS-TPLVPSD-VDSYYTVALESVAVGGQEV----------ATHDS 321

Query: 309 GTIIDSGTVFT----RLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCYSVPIVA---- 359
             I+DSGT  T     L+ P  T + R +  +RV     +  L     CY V   +    
Sbjct: 322 RIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQL-----CYDVQGKSETDN 376

Query: 360 ---PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
              P +TL F  G  VTL  +N       G++ CL +   P + +  ++++ N+ QQN  
Sbjct: 377 FGIPDVTLRFGGGAAVTLRPENTFSLLQEGTL-CLVL--VPVSESQPVSILGNIAQQNFH 433

Query: 416 ILYDV 420
           + YD+
Sbjct: 434 VGYDL 438


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 94/368 (25%), Positives = 159/368 (43%), Gaps = 45/368 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----AQSTTFK-NLGCQ 150
           Y  R  IGTP+Q   + +D+ +   +VPC  C  C +    S     A    F+ +L   
Sbjct: 92  YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 151

Query: 151 AAQCKQVPNPTCGG--GACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQK 204
            +  K   + TC      C +   Y   S+ +  L +D +S    +++ P    FGC   
Sbjct: 152 YSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENT 211

Query: 205 ATGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
            TG+  S    G++GLGRG LS++ Q   + +   +FS C          G++ LG +  
Sbjct: 212 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV--GGGTMVLGGMPA 269

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           P  + ++    NP RS  Y + L  I V  + + + P    FN  +  GT++DSGT +  
Sbjct: 270 PPDMVFS--HSNPVRSPYYNIELKEIHVAGKALRLDPKI--FN--SKHGTVLDSGTTYAY 323

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------------VPIVAPTITLMF- 366
           L   A+ A +D    +V S   +  + G D  Y              +  V P + ++F 
Sbjct: 324 LPEQAFVAFKDAVTNKVNS---LKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFG 380

Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           +G  ++L P++ L  HS      CL +     +  ++L  I     +N  + YD  N ++
Sbjct: 381 NGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKI 437

Query: 426 GVARELCT 433
           G  +  C+
Sbjct: 438 GFWKTNCS 445


>gi|242086418|ref|XP_002443634.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
 gi|241944327|gb|EES17472.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
          Length = 486

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 95/354 (26%), Positives = 160/354 (45%), Gaps = 46/354 (12%)

Query: 74  SLAVARKSVVPIASGRQITQSP---TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
           +L+ A  +++P       +  P    YIV    G+P Q   + + T+   + + C  C  
Sbjct: 125 ALSPAAATIIPANGSSDPSTLPGALDYIVLVSYGSPEQQFPVFLGTNVGTSLLRCKPCAS 184

Query: 131 CSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTI 187
            S      F++ QS+TF ++ C +  C       C    C F   YG  T+    + D +
Sbjct: 185 GSDDCNPAFDTLQSSTFAHVPCSSPDCPV----NCSSSVCPFYDLYG--TVGGTFATDVL 238

Query: 188 SLATD--IVPGYTFGCIQ-KATGNSVPPQGLLGLGRGSLSLLAQTQNLY-----QSTFSY 239
           +LA     V  + F C+  ++    +P  G + L R   SL +Q  +        ++FSY
Sbjct: 239 TLAPSSMAVHDFRFVCMDVESPSPDLPEAGSIDLSRHRNSLPSQLSSSSGIAPTAASFSY 298

Query: 240 CLPSFKALSFSGSLRLGP----IGQPKRIK-YTPLLKN--PRRSSLYYVNLLAIRVGRRV 292
           CLP  ++ +  G L LG     +G    +  + P++ N  P  +S+Y+++L+ + +G   
Sbjct: 299 CLP--QSRNSQGFLSLGGDATVVGDDDNLTVHAPMVWNNDPDLASMYFIDLVGMSLGGED 356

Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS---LGGF 349
           + IP G         A T +D G  FT L   AYT +RD FR+ +      +S     GF
Sbjct: 357 LPIPSGTFG-----NASTNLDVGATFTMLAPEAYTTLRDAFRKEMSQYNNRSSPAGFDGF 411

Query: 350 DTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHS--TAGSIT--CLAMAA 394
           DTC++      +V P + L FS G ++ +  D +L +    AG  T  CLA ++
Sbjct: 412 DTCFNFTGLNELVVPLVQLKFSNGESLMIDGDQMLYYHDPAAGPFTMACLAFSS 465


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 91/386 (23%), Positives = 165/386 (42%), Gaps = 50/386 (12%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
           +P+    +      Y  + K+G+P +   + +DT +D  WV C  C        +G   +
Sbjct: 60  LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLS 119

Query: 135 VFNSAQSTTFKNLGCQAAQCKQV-PNPTCGGGA-CAFNLTYGS-STIAANLSQDTIS--- 188
           +++S  S+T KN+GC+   C  +  + TCG    C++++ YG  ST   +  +D I+   
Sbjct: 120 LYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQ 179

Query: 189 ---------LATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLY 233
                    LA ++V    FGC +  +G          G++G G+ + S+++Q       
Sbjct: 180 VTGNLRTAPLAQEVV----FGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGST 235

Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           +  FS+CL +       G   +G +  P  +K TP++ N      Y V L  + V    +
Sbjct: 236 KRIFSHCLDNMNG---GGIFAVGEVESP-VVKTTPIVPNQVH---YNVILKGMDVDGDPI 288

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDT 351
           D+PP     N     GTIIDSGT    L    Y ++ +    +++V  ++   +   F  
Sbjct: 289 DLPPSLASTNGD--GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFACFSF 346

Query: 352 CYSVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI--A 407
             +     P + L F   + +++ P D L   S    + C    +         +VI   
Sbjct: 347 TSNTDKAFPVVNLHFEDSLKLSVYPHDYLF--SLREDMYCFGWQSGGMTTQDGADVILLG 404

Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
           ++   N  ++YD+ N  +G A   C+
Sbjct: 405 DLVLSNKLVVYDLENEVIGWADHNCS 430


>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
 gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 162/389 (41%), Gaps = 81/389 (20%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-----------VFNSAQSTTFK 145
           +     +GTP+ + L+A+DT ++  W+PC  C  C  +           +++   S+T +
Sbjct: 62  HYANVSVGTPSVSFLVALDTGSNLLWLPCD-CSSCVHSLRSPSGTVDLNIYSPNTSSTSE 120

Query: 146 NLGCQAAQCKQVPNPTC--GGGACAFNLTY---GSSTIAANLSQDTISLATD------IV 194
            + C +  C Q     C      C + + Y   G+ST    + QD + L +D      + 
Sbjct: 121 KVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGY-IVQDLLHLISDDSQSKAVD 179

Query: 195 PGYTFGCIQKATGNSV---PPQGLLGLGRGSLSLLAQ-TQNLYQS-TFSYCLP--SFKAL 247
              TFGC +  TG+ +    P GL GLG  ++S+ +    N Y S +FS C        +
Sbjct: 180 AKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPNGIGRI 239

Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
           SF      G  GQ +    T   +   RSSLY +++    +G +  D+   A        
Sbjct: 240 SFGDK---GSTGQGE----TSFNQGQPRSSLYNISITQTSIGGQASDLVYSA-------- 284

Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV------------ 355
              I DSGT FT L  PAYT + + F + V      ++   FD CY +            
Sbjct: 285 ---IFDSGTSFTYLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCYDIRSFISAQILPFS 341

Query: 356 --------PIVAPTITLMFSG---MNVTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVL 403
                   P + P +TL+ SG    NVT P   +L+    GS + CL M  + D     +
Sbjct: 342 CAYANQTEPTI-PAVTLVMSGGDYFNVTDPI--VLVQLADGSAVYCLGMIKSGD-----V 393

Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
           N+I       HRI++D     LG     C
Sbjct: 394 NIIGQNFMTGHRIVFDRERMILGWKPSNC 422


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 111/424 (26%), Positives = 171/424 (40%), Gaps = 74/424 (17%)

Query: 55  EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
           EE +  +   D  RL     L        P+      T +  Y  +  IGTP++   + +
Sbjct: 55  EEHLAALRKHDGRRLLTAVDL--------PLGGNGIPTDTGLYFTQIGIGTPSKGYYVQV 106

Query: 115 DTSNDAAWVPCTGC--------VGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN----PTC 162
           DT +D  WV C  C        +G   T+++   S + K + C    C    N    P+C
Sbjct: 107 DTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTVTCGQEFCATATNGGVPPSC 166

Query: 163 GGGA-CAFNLTYGSST------IAANLSQDTIS------LATDIVPGYTFGCIQKATG-- 207
              + C +++TYG  +      +A  L  D +S      LA   V   TFGC  K  G  
Sbjct: 167 AANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLANASV---TFGCGAKIGGAL 223

Query: 208 --NSVPPQGLLGLGRGSLSLLAQTQNLYQST--FSYCLPSFKALSFSGSLRLGPIGQPKR 263
             ++V   G+LG G+ + S+L+Q  +  + T  FS+CL +       G   +G + QPK 
Sbjct: 224 GSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNG---GGIFAIGNVVQPK- 279

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           +K TPL+        Y V L  I VG   + +P         +  GTIIDSGT    L  
Sbjct: 280 VKTTPLVPGMPH---YNVVLKTIDVGGSTLQLPTNIFDIGGGS-RGTIIDSGTTLAYLPE 335

Query: 324 PAYTAVR--------DVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGM--NVTL 373
             Y AV         DV  + V   L     G  D  +      P +T  F G    V  
Sbjct: 336 VVYKAVLSAVFSNHPDVTLKNVQDFLCFQYSGSVDNGF------PEVTFHFDGDLPLVVY 389

Query: 374 PQDNLLIHSTAGSITCLAMAA----APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
           P D L  ++    + C+   +    + D  + VL  + ++   N  ++YD+ N  +G   
Sbjct: 390 PHDYLFQNTE--DVYCVGFQSGGVQSKDGKDMVL--LGDLALSNKLVVYDLENQVIGWTN 445

Query: 430 ELCT 433
             C+
Sbjct: 446 YNCS 449


>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 568

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 157/382 (41%), Gaps = 75/382 (19%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV------------FNSAQSTTF 144
           Y     +GTP+   L+A+DT +D  W+PC  C  C + +            ++   STT 
Sbjct: 104 YYANVSVGTPSLDFLVALDTGSDLFWLPCE-CSSCFTYLNTSNGGKFMLNHYSPNDSTTS 162

Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAA--NLSQDTISLATD------IVPG 196
             + C ++ C +    T     C + + Y S+  ++   L +D + LATD      +   
Sbjct: 163 STVPCTSSLCNRC---TSNQNVCPYEMRYLSANTSSIGYLVEDVLHLATDDSLLKPVEAK 219

Query: 197 YTFGCIQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSG 251
            TFGC    TG    +  P GL+GLG   +S+      Q L  ++FS C   F A  + G
Sbjct: 220 ITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMC---FGADGY-G 275

Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
            +  G  G P   K TP        S Y V    I VG    D+P  A           I
Sbjct: 276 RIDFGDTG-PADQKQTPFNTMLEYQS-YNVTFNVINVGGEPNDVPFTA-----------I 322

Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG----FDTCYSVPIVAPT---ITL 364
            DSGT FT L  PAY+ +    +   G  L   SL G    F+ CY +P  A     +TL
Sbjct: 323 FDSGTSFTYLTEPAYSTITK--QMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTL 380

Query: 365 MF----------SGMNVTLPQD----NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
            F          + + V LP D    N++   T   + CLA+A + D     +++I    
Sbjct: 381 NFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETT-HVACLAIAKSTD-----IDLIGQNF 434

Query: 411 QQNHRILYDVPNSRLGVARELC 432
              +RI ++     LG +   C
Sbjct: 435 MTGYRITFNRDQMVLGWSSSDC 456


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 94/368 (25%), Positives = 159/368 (43%), Gaps = 45/368 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----AQSTTFK-NLGCQ 150
           Y  R  IGTP+Q   + +D+ +   +VPC  C  C +    S     A    F+ +L   
Sbjct: 91  YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 150

Query: 151 AAQCKQVPNPTCGG--GACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQK 204
            +  K   + TC      C +   Y   S+ +  L +D +S    +++ P    FGC   
Sbjct: 151 YSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENT 210

Query: 205 ATGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
            TG+  S    G++GLGRG LS++ Q   + +   +FS C          G++ LG +  
Sbjct: 211 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV--GGGTMVLGGMPA 268

Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
           P  + ++    NP RS  Y + L  I V  + + + P    FN  +  GT++DSGT +  
Sbjct: 269 PPDMVFS--HSNPVRSPYYNIELKEIHVAGKALRLDPKI--FN--SKHGTVLDSGTTYAY 322

Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------------VPIVAPTITLMF- 366
           L   A+ A +D    +V S   +  + G D  Y              +  V P + ++F 
Sbjct: 323 LPEQAFVAFKDAVTNKVNS---LKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFG 379

Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
           +G  ++L P++ L  HS      CL +     +  ++L  I     +N  + YD  N ++
Sbjct: 380 NGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKI 436

Query: 426 GVARELCT 433
           G  +  C+
Sbjct: 437 GFWKTNCS 444


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 91/386 (23%), Positives = 165/386 (42%), Gaps = 50/386 (12%)

Query: 83  VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
           +P+    +      Y  + K+G+P +   + +DT +D  WV C  C        +G   +
Sbjct: 64  LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLS 123

Query: 135 VFNSAQSTTFKNLGCQAAQCKQV-PNPTCGGGA-CAFNLTYGS-STIAANLSQDTIS--- 188
           +++S  S+T KN+GC+   C  +  + TCG    C++++ YG  ST   +  +D I+   
Sbjct: 124 LYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQ 183

Query: 189 ---------LATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLY 233
                    LA ++V    FGC +  +G          G++G G+ + S+++Q       
Sbjct: 184 VTGNLRTAPLAQEVV----FGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGST 239

Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
           +  FS+CL +       G   +G +  P  +K TP++ N      Y V L  + V    +
Sbjct: 240 KRIFSHCLDNMNG---GGIFAVGEVESP-VVKTTPIVPNQVH---YNVILKGMDVDGDPI 292

Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDT 351
           D+PP     N     GTIIDSGT    L    Y ++ +    +++V  ++   +   F  
Sbjct: 293 DLPPSLASTNGD--GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFACFSF 350

Query: 352 CYSVPIVAPTITLMFSG-MNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI--A 407
             +     P + L F   + +++ P D L   S    + C    +         +VI   
Sbjct: 351 TSNTDKAFPVVNLHFEDSLKLSVYPHDYLF--SLREDMYCFGWQSGGMTTQDGADVILLG 408

Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
           ++   N  ++YD+ N  +G A   C+
Sbjct: 409 DLVLSNKLVVYDLENEVIGWADHNCS 434


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 90/366 (24%), Positives = 158/366 (43%), Gaps = 51/366 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +D+ +   +VPC  C  C +     F    S+++  + C    
Sbjct: 89  YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCN-VD 147

Query: 154 CKQVPNPTCGGGA--CAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATG 207
           C      TC      C +   Y   S+ +  L +D +S    +++ P    FGC    TG
Sbjct: 148 C------TCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENSETG 201

Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQPK 262
           +  S    G++GLGRG LS++ Q   + +   +FS C   +  +    G++ LG +  P 
Sbjct: 202 DLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLC---YGGMDIGGGAMVLGGVPAPS 258

Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
            + ++    +P RS  Y + L  I V  + + +   +  FN  +  GT++DSGT +  L 
Sbjct: 259 DMVFS--HSDPLRSPYYNIELKEIHVAGKALRV--DSRVFN--SKHGTVLDSGTTYAYLP 312

Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-------------SVPIVAPTITLMF-SG 368
             A+ A +D    +V S   +  + G D  Y              +  V P + ++F +G
Sbjct: 313 EQAFVAFKDAVTSKVHS---LKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNG 369

Query: 369 MNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
             ++L P++ L  HS      CL +     N      ++  +  +N  + YD  N ++G 
Sbjct: 370 QKLSLTPENYLFRHSKVDGAYCLGVFQ---NGKDPTTLLGGIIVRNTLVTYDRHNEKIGF 426

Query: 428 ARELCT 433
            +  C+
Sbjct: 427 WKTNCS 432


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 95/377 (25%), Positives = 151/377 (40%), Gaps = 50/377 (13%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT---GCVGCSS---------TVFNSAQSTTF 144
           Y V  K+GTP+Q  ++  DT +D  W+ C        CS+          VF++  S++F
Sbjct: 83  YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142

Query: 145 KNLGCQAAQCK----------QVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATD- 192
           K + C    CK            P P      C ++  Y   ST     + +T+++    
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLT---PCGYDYRYSDGSTALGFFANETVTVELKE 199

Query: 193 ----IVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKA 246
                +     GC +   G S     G++GLG    S   +    +   FSYCL      
Sbjct: 200 GRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSH 259

Query: 247 LSFSGSLRLGPIGQPK----RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
            + S  L  G     +     + YT L+     +S Y VN++ I +G  ++ IP     +
Sbjct: 260 KNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEV--W 316

Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT-SLGGFDTCYS----VPI 357
           +     GTI+DSG+  T L  PAY  V    R  +     V   +G  + C++       
Sbjct: 317 DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEES 376

Query: 358 VAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
           + P +   F+ G     P  + +I S A  + CL   +      S   V+ N+ QQNH  
Sbjct: 377 LVPRLVFHFADGAEFEPPVKSYVI-SAADGVRCLGFVSVAWPGTS---VVGNIMQQNHLW 432

Query: 417 LYDVPNSRLGVARELCT 433
            +D+   +LG A   CT
Sbjct: 433 EFDLGLKKLGFAPSSCT 449


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 97/365 (26%), Positives = 156/365 (42%), Gaps = 39/365 (10%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLG 148
           Y     +G P Q L + +DT +D  WV C+ C  C S        +++N + S+T     
Sbjct: 83  YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSS 142

Query: 149 CQAAQC---KQVPNPTCGGGACAFNLTY--GSSTIAANLSQD---TISLATDIVPGYTFG 200
           C    C   + V + +    ACA+  +Y   S+++ A +  D    +           FG
Sbjct: 143 CSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATTSRIFFG 202

Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
           C    TG S P  G++G G  S ++  Q  TQ      FS+CL   K     G L  G  
Sbjct: 203 CATNITG-SWPVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEK--HGGGILEFGEA 259

Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF--NPTTGAGTIIDSGT 316
                + +TPLL     ++ Y V+LL+I V  +V+ I P    +  N T   G IIDSGT
Sbjct: 260 PNTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDSGT 316

Query: 317 VFTRLVAPA----YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMN-V 371
            F  L   A    +  ++ +   ++G  L         +  ++    P +TL FSG + +
Sbjct: 317 TFVLLTTKANRMLFQEIKSLTTAKLGPKLEGLECFYLKSGLTMETSFPNVTLTFSGGSTM 376

Query: 372 TLPQDNLLI---HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
            L  DN L+   +    +  C A ++A       L +   +  ++  + YDV N R+G  
Sbjct: 377 KLKPDNYLVMAEYKKKRNGYCYAWSSADG-----LTIFGEIVLKDKLVFYDVENRRIGWK 431

Query: 429 RELCT 433
            + C+
Sbjct: 432 GQNCS 436


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 153/362 (42%), Gaps = 43/362 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAA- 152
           Y  R  IGTP Q   + +D+ +   +VPC  C  C +     F    S+T+  + C    
Sbjct: 88  YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147

Query: 153 QCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT--DIVPGYT-FGCIQKATGN 208
            C    N       C +   Y   S+ +  L +D +S  T  ++ P    FGC    TG+
Sbjct: 148 TCDSDKN------QCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSETGD 201

Query: 209 --SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQPKR 263
             S    G++GLGRG LS++ Q   + +   +FS C   +  +    G++ LG +  P  
Sbjct: 202 LFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC---YGGMDIGGGAMVLGAMPAPPG 258

Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
           + YT    N  RS  Y + L  + V  + + + P           GT++DSGT +  L  
Sbjct: 259 MIYT--HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGK----HGTVLDSGTTYAYLPE 312

Query: 324 PAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMNVT 372
            A+ A +D    +V     +        D C++        +  V P + ++F +G  ++
Sbjct: 313 QAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKLS 372

Query: 373 L-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
           L P++ L  HS      CL +     +  ++L  I     +N  + YD  N ++G  +  
Sbjct: 373 LSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKIGFWKTN 429

Query: 432 CT 433
           C+
Sbjct: 430 CS 431


>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
 gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
          Length = 492

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 95/367 (25%), Positives = 161/367 (43%), Gaps = 58/367 (15%)

Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-------------TVFNSAQSTTFKNLG 148
            IGTP  + ++A+D+ +D  WVPC  CV C+              + ++ +QS+T K L 
Sbjct: 103 DIGTPHVSFMVALDSGSDLFWVPCD-CVQCAPLSASHYSSLDRDLSEYSPSQSSTSKQLS 161

Query: 149 CQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLA--------TDIVPGYT 198
           C    C   PN      +C +++ Y + + +++  L +D I LA        T +     
Sbjct: 162 CSHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNTSVKAPVI 221

Query: 199 FGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSGS 252
            GC  K +G   + V P GLLGLG   +S+   LA+   L Q++FS C         SG 
Sbjct: 222 IGCGMKQSGGYLDGVAPDGLLGLGLQEISVPSFLAKA-GLIQNSFSMCFNEDD----SGR 276

Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
           +  G  G P   +  P LK     + Y V +    VG   +            +    ++
Sbjct: 277 IFFGDQG-PATQQSAPFLKLNGNYTTYIVGVEVCCVGTSCLK----------QSSFSALV 325

Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-----SVPIVAPTITLMFS 367
           DSGT FT L    +  + + F  +V ++ +      +  CY      +P + P++ L+F 
Sbjct: 326 DSGTSFTFLPDDVFEMIAEEFDTQVNASRSSFEGYSWKYCYKTSSQDLPKI-PSLRLIFP 384

Query: 368 GMNVTLPQDNL-LIHSTAGSIT-CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
             N  + Q+ + +I+   G I  CLA+  A  ++ +    I       +R+++D  N +L
Sbjct: 385 QNNSFMVQNPVFMIYGIQGVIGFCLAIQPADGDIGT----IGQNFMMGYRVVFDRENLKL 440

Query: 426 GVARELC 432
           G +R  C
Sbjct: 441 GWSRSNC 447


>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 323

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 84/334 (25%), Positives = 147/334 (44%), Gaps = 39/334 (11%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
           Y++   +GTP++T ++ +DT +  +WV C  C GC +    F  ++STT   + C  + C
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59

Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
                +P C        C F ++Y   + +   L QDT++ +    +PG++FGC   + G
Sbjct: 60  LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119

Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
            +      GLLG+G G +S+L Q+   +   FSYCLP       F    +G   LG    
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGGKIA 178

Query: 261 PKR--IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
             R  ++YT ++   + + L++V+L AI V    + + P        +  G + DSG+  
Sbjct: 179 ATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSEL 233

Query: 319 T----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGM 369
           +    R ++     +R++  RR  +            CY +  V     P I+L F  G 
Sbjct: 234 SYIPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGA 288

Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
              L    + +  +        +A AP    S++
Sbjct: 289 RFDLGSHGVFVERSVQEQDVWCLAFAPTESVSII 322


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 100/394 (25%), Positives = 162/394 (41%), Gaps = 45/394 (11%)

Query: 62  LAKDQARLQ-FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
           LA  QAR++  L S    +  V P  +GR I      +    IG P    L+ MDT +D 
Sbjct: 71  LANIQARIEGSLVSNNDYKARVSPSLTGRTI------MANISIGQPPIPQLVVMDTGSDI 124

Query: 121 AWVPCTGCVGCSSTV---FNSAQSTTFKNLG---CQAAQCKQVPNPTCGGGACAFNLTYG 174
            WV CT C  C + +   F+ ++S+TF  L    C    C+  P P        F +TY 
Sbjct: 125 LWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCKTPCDFEGCRCDPIP--------FTVTYA 176

Query: 175 -SSTIAANLSQDTISL-----ATDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLA 227
            +ST +    +DT+        T  +    FGC      ++ P   G+LGL  G  SL+ 
Sbjct: 177 DNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVT 236

Query: 228 QTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIR 287
           +        FSYC+ +     ++    +   G       TP       +  YYV +  I 
Sbjct: 237 K----LGQKFSYCIGNLADPYYNYHQLILGEGADLEGYSTPF---EVYNGFYYVTMEGIS 289

Query: 288 VGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG 347
           VG + +DI P   +       G IID+G+  T LV   +  +    R  +G +    ++ 
Sbjct: 290 VGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIE 349

Query: 348 G------FDTCYSVPIVA-PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD-N 398
                  F    S  +V  P +T  FS G ++ L   +   +    ++ C+ +      N
Sbjct: 350 KSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDSGSFF-NQLNDNVFCMTVGPVSSLN 408

Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           + S  ++I  + QQ++ + YD+ N  +   R  C
Sbjct: 409 IKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDC 442


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 157/364 (43%), Gaps = 46/364 (12%)

Query: 97  YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
           Y  R  IGTP Q   + +D+ +   +VPC+ C  C       F    S+T+     Q  +
Sbjct: 93  YTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTY-----QPVK 147

Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATD--IVPGY-TFGCIQKATGN- 208
           C    N       C +   Y   S+    L +D IS   +  + P    FGC    TG+ 
Sbjct: 148 CNMDCNCDDDREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDL 207

Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
            S    G++GLG+G LSL+ Q   + L  ++F  C          GS+ LG    P  + 
Sbjct: 208 YSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDV--GGGSMILGGFDYPSDMV 265

Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
           +T    +P RS  Y ++L  IRV  + + +   +  F+   GA  ++DSGT +  L   A
Sbjct: 266 FTD--SDPDRSPYYNIDLTGIRVAGKQLSL--HSRVFDGEHGA--VLDSGTTYAYLPDAA 319

Query: 326 YTAVRDVFRRRVGSNLTVTSLGG-----FDTCYSVPI---------VAPTITLMF-SGMN 370
           + A  +   R V    T+  + G      DTC+ V           + P++ ++F SG +
Sbjct: 320 FAAFEEAVMREVS---TLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQS 376

Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
             L P++ +  HS      CL +     +  ++L  I     +N  ++YD  NS++G  R
Sbjct: 377 WLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIV---VRNTLVVYDRENSKVGFWR 433

Query: 430 ELCT 433
             C+
Sbjct: 434 TNCS 437


>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 515

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 152/363 (41%), Gaps = 55/363 (15%)

Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTFKNLGC 149
           +IGTP    ++A+DT +D  WVPC  C  C++T            V+N   S+T K + C
Sbjct: 101 QIGTPGVKFMVALDTGSDLFWVPC-DCTRCAATDSSAFASDFDLNVYNPNGSSTSKKVTC 159

Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGC 201
             + C            C + ++Y S+  + +  L +D + L  +      +     FGC
Sbjct: 160 NNSLCMHRSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGC 219

Query: 202 IQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
            Q  +G   +   P GL GLG   +S+  +   +     +FS C          G +  G
Sbjct: 220 GQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF----GRDGIGRISFG 275

Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
             G   + + TP   NP   + Y + +  +RVG  ++D+   AL            DSGT
Sbjct: 276 DKGSFDQDE-TPFNLNPSHPT-YNITVTQVRVGTTLIDVEFTAL-----------FDSGT 322

Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVT-SLGGFDTCYSVPIVA-----PTITLMFSGMN 370
            FT LV P YT + + F  +V      + S   F+ CY +   A     P+++L   G +
Sbjct: 323 SFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSVSLTMGGGS 382

Query: 371 VTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
                D ++I ST    + CLA+    +     LN+I       +R+++D     LG  +
Sbjct: 383 HFAVYDPIIIISTQSELVYCLAVVKTAE-----LNIIGQNFMTGYRVVFDREKLVLGWKK 437

Query: 430 ELC 432
             C
Sbjct: 438 FDC 440


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 101/369 (27%), Positives = 151/369 (40%), Gaps = 48/369 (13%)

Query: 95  PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQA 151
           P Y+    IGTP Q     +D + +  W  C+ C  C      VF    S+TFK   C  
Sbjct: 43  PYYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGT 102

Query: 152 AQCKQVPNPTCGGGACAFN-----LTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKAT 206
           A C+ +P  +C G  C++      L   +S  AA    DT ++ T  V    FGC+  + 
Sbjct: 103 AVCESIPTRSCSGDVCSYKGPPTQLRGNTSGFAAT---DTFAIGTATVR-LAFGCVVASD 158

Query: 207 GNSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPK 262
            +++  P G +GLGR   SL+AQ   +  + FSYCL S +    S  L LG    +   +
Sbjct: 159 IDTMDGPSGFIGLGRTPWSLVAQ---MKLTRFSYCL-SPRNTGKSSRLFLGSSAKLAGSE 214

Query: 263 RIKYTPLLK---NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV-- 317
                P +K   +   S+ Y ++L AIR G   +           T  +G I+   TV  
Sbjct: 215 STSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI----------ATAQSGGILVMHTVSP 264

Query: 318 FTRLVAPAYTAVRDVFRRRVGS---NLTVTSLGGFDTCYSVP-----IVAPTITLMFSG- 368
           F+ LV  AY A +      VG        T    FD C+          AP +   F G 
Sbjct: 265 FSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGA 324

Query: 369 MNVTLPQDNLLI----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
             +T+P    LI             L+MA         ++V+ ++QQ++   LYD+    
Sbjct: 325 AALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKET 384

Query: 425 LGVARELCT 433
           L      C+
Sbjct: 385 LSFEPADCS 393


>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 103/418 (24%), Positives = 162/418 (38%), Gaps = 42/418 (10%)

Query: 41  SPCSPFKPSKPLSWEESVLEMLAKDQ--ARLQFLSSLAVARKSVVPIASGRQITQSPTYI 98
           SP SPF  +   +   S       D    R   +S    A +S +  + G        Y+
Sbjct: 46  SPNSPFYNALEAAATRSTNASQHYDAQIGRFNLMSDSYYASQSELNFSKGN-------YL 98

Query: 99  VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCKQ 156
           ++  +GTP   +L   D + D  W+PC  C  C+     F  ++S+T+ +  C++ QC+ 
Sbjct: 99  IKISVGTPPAEILALADITGDLTWLPCKTCQDCTKDGFTFFPSESSTYTSAACESYQCQI 158

Query: 157 VPNPTCGGGACAFNL-----TYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKAT 206
                C    C +          S T    ++ DTIS  +        P   F C     
Sbjct: 159 TNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQALSYPNTNFICGTFID 218

Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRI 264
                  G++GLGRG  S+ +Q ++L   TFS CL  + +   S  +  G  G    + +
Sbjct: 219 NWHYIGAGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSSKQ-SSKINFGLKGVVSGEGV 277

Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
             TP+  +   S  Y++ L A+ VG   V     A  F     +   ID  T FT L   
Sbjct: 278 VSTPIADDG-ESGAYFLFLEAMSVGGNRV-----ANNFYSAPKSNIYIDWRTTFTSLPHD 331

Query: 325 AYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVP----IVAPTITLMFSGMNVTLPQDN 377
            Y  V    R+ +  NLT  +         CY         AP IT+ F+  +V L   N
Sbjct: 332 FYENVEAEVRKAI--NLTPINYNNERKLSLCYKSESDHDFDAPPITMHFTNADVQLSPLN 389

Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLN--VIANMQQQNHRILYDVPNSRLGVARELCT 433
             +     ++ C A      N    +   V  + QQ N  + YD+ +S +   +  CT
Sbjct: 390 TFVR-MDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNFIVGYDLKSSTVSFKQADCT 446


>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
          Length = 191

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 55/167 (32%), Positives = 77/167 (46%), Gaps = 8/167 (4%)

Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
           K     + YYV + ++ VG  V++IP      +     GTIIDSGT  +    PAY  ++
Sbjct: 25  KENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIK 84

Query: 331 DVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMF-SGMNVTLPQDNLLIHSTAG 385
             F  +V     +        CY+V  V     P+  ++F  G   T P +N  I     
Sbjct: 85  QAFVNKVKRYPILDDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIKLEPE 144

Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
            I CLA+   P   +S +++I N QQQN  ILYD   SRLG A   C
Sbjct: 145 DIVCLAILGTP---HSAMSIIGNYQQQNFHILYDTKRSRLGFAPRRC 188


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 106/421 (25%), Positives = 178/421 (42%), Gaps = 54/421 (12%)

Query: 51  PLSWEESVLEMLAKDQARL-QFL-SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQ 108
           PL+    + E+ A+D+ R  +FL SS+ V      P+       +   Y  R  +G+P +
Sbjct: 23  PLNQRVELDELKARDRVRHGRFLQSSVGVVD---FPVEGTYDPYRVGLYFTRVLLGSPPK 79

Query: 109 TLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QV 157
              + +DT +D  WV C  C GC  +         F+   S+T   + C   +C    Q 
Sbjct: 80  EFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQS 139

Query: 158 PNPTCG--GGACAFNLTYGSST------IAANLSQDTI--SLATDIVPGYTFGCIQKATG 207
            +  C   G  C +   YG  +      ++  L+ D I  S  T+      FGC    TG
Sbjct: 140 SDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSASIVFGCSISQTG 199

Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
           +         G+ G G+  +S+++Q  +Q +    FS+CL           L    I + 
Sbjct: 200 DLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGILVLG--EIVE- 256

Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
           + I Y+PL+ +      Y +NL +I V  + + I P    F  +T  GTI+DSGT    L
Sbjct: 257 EDIVYSPLVPSQPH---YNLNLQSISVNGKSLAIDPEV--FATSTNRGTIVDSGTTLAYL 311

Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFSG---MNVTLP 374
              AY          V  ++    L     CY    SV  + PT++L F+G   MN+  P
Sbjct: 312 AEEAYDPFVSAITEAVSQSVRPL-LSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLK-P 369

Query: 375 QDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
           +D LL  ++ G  ++ C+            + ++ ++  ++   +YD+   R+G A   C
Sbjct: 370 EDYLLQQNSIGDAAVWCIGFQKIQ---GQGITILGDLVLKDKIFVYDLAGQRIGWANYDC 426

Query: 433 T 433
           +
Sbjct: 427 S 427


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.134    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,474,907,821
Number of Sequences: 23463169
Number of extensions: 259548537
Number of successful extensions: 638413
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 872
Number of HSP's successfully gapped in prelim test: 3138
Number of HSP's that attempted gapping in prelim test: 630590
Number of HSP's gapped (non-prelim): 4395
length of query: 433
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 288
effective length of database: 8,957,035,862
effective search space: 2579626328256
effective search space used: 2579626328256
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)