BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 039412
(433 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/435 (83%), Positives = 394/435 (90%), Gaps = 6/435 (1%)
Query: 1 MKPQLVFFLAFLFLFSLSEG--LNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESV 58
MK L F LAFLF F+L++G LNP C QD S LQVFHV+SPCSPF PSKPL WEESV
Sbjct: 1 MKTHL-FSLAFLF-FTLAQGMHLNPKCGIQDQGSNLQVFHVYSPCSPFWPSKPLKWEESV 58
Query: 59 LEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
L+M AKDQARLQFLSSL VARKSVVPIASGRQI QSPTYIVRAKIGTPAQT+L+AMDTSN
Sbjct: 59 LQMQAKDQARLQFLSSL-VARKSVVPIASGRQIVQSPTYIVRAKIGTPAQTMLLAMDTSN 117
Query: 119 DAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI 178
DAAW+PC+GCVGCSSTVFN+ +STTFK +GC+A QCKQVPN CGG ACAFN+TYGSS+I
Sbjct: 118 DAAWIPCSGCVGCSSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSI 177
Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
AANLSQD ++LATD +P YTFGC+ +ATG+S+PPQGLLGLGRG +SLL+QTQNLYQSTFS
Sbjct: 178 AANLSQDVVTLATDSIPSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFS 237
Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
YCLPSF++L+FSGSLRLGP+GQPKRIK TPLLKNPRRSSLYYVNL+AIRVGRRVVDIPP
Sbjct: 238 YCLPSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPS 297
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV 358
AL FNPTTGAGTI DSGTVFTRLVAPAYTAVRD FR+RVG N TVTSLGGFDTCY+ PIV
Sbjct: 298 ALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVG-NATVTSLGGFDTCYTSPIV 356
Query: 359 APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
APTIT MFSGMNVTLP DNLLIHSTA SITCLAMAAAPDNVNSVLNVIANMQQQNHRIL+
Sbjct: 357 APTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILF 416
Query: 419 DVPNSRLGVARELCT 433
DVPNSRLGVARE CT
Sbjct: 417 DVPNSRLGVAREPCT 431
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 678 bits (1750), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/419 (80%), Positives = 374/419 (89%), Gaps = 5/419 (1%)
Query: 19 EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
EGL P CDTQDH STL+VFHVFSPCSPF+P KPLSW ESVL++ AKDQARLQFL+S+ VA
Sbjct: 21 EGLTPKCDTQDHGSTLEVFHVFSPCSPFRPPKPLSWAESVLQLQAKDQARLQFLASM-VA 79
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
+SVVPIASGRQI QSPTYIVRAKIG+P QTLL+AMDTSNDAAW+PCT C GC+ST+F
Sbjct: 80 GRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTSTLFAP 139
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
+STTFKN+ C + QC QVPNP+CG AC FNLTYGSS+IAAN+ QDT++LATD +P YT
Sbjct: 140 EKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIAANVVQDTVTLATDPIPDYT 199
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 200 FGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 259
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
QP RIKYTPLLKNPRRSSLYYVNL+AIRVGR+VVDIPP AL FN TGAGT+ DSGTVF
Sbjct: 260 AQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVF 319
Query: 319 TRLVAPAYTAVRDVFRRRVG----SNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLP 374
TRLVAPAYTAVRD F+RRV +NLTVTSLGGFDTCY+VPIVAPTIT MFSGMNVTLP
Sbjct: 320 TRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPIVAPTITFMFSGMNVTLP 379
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+DN+LIHSTAGS TCLAMA+APDNVNSVLNVIANMQQQNHR+LYDVPNSRLGVARELCT
Sbjct: 380 EDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCT 438
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/419 (79%), Positives = 374/419 (89%), Gaps = 5/419 (1%)
Query: 19 EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
+GL P CDTQDH STL+VFHVFSPCSPF+PSKPLSW ESVL++ AKDQARLQFL+S+ VA
Sbjct: 20 QGLTPKCDTQDHGSTLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLASM-VA 78
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
+S+VPIASGRQI QSPTYIVRAKIGTP QTLL+A+DTSNDAAW+PCT C GC+ST+F
Sbjct: 79 GRSIVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTSTLFAP 138
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
+STTFKN+ C + +C +VP+P+CG AC FNLTYGSS+IAAN+ QDT++LATD +PGYT
Sbjct: 139 EKSTTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSSIAANVVQDTVTLATDPIPGYT 198
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 199 FGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 258
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
QP RIKYTPLLKNPRRSSLYYVNL AIRVGR++VDIPP AL FN TGAGT+ DSGTVF
Sbjct: 259 AQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVF 318
Query: 319 TRLVAPAYTAVRDVFRRRVG----SNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLP 374
TRLVAP YTAVRD FRRRV +NLTVTSLGGFDTCY+VPIVAPTIT MFSGMNVTLP
Sbjct: 319 TRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPIVAPTITFMFSGMNVTLP 378
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
QDN+LIHSTAGS +CLAMA+APDNVNSVLNVIANMQQQNHR+LYDVPNSRLGVARELCT
Sbjct: 379 QDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCT 437
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/433 (78%), Positives = 381/433 (87%), Gaps = 9/433 (2%)
Query: 9 LAFLFLFSLSEGL-NPICDT---QDHS-STLQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
L LF++++GL NP CD DH STLQVFHVFSPCSPF+PSKP+SWEESVL++ A
Sbjct: 6 LVLFLLFTIAKGLHNPKCDATHQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLKLQA 65
Query: 64 KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
KDQAR+Q+LSSL VAR+S+VPIASGRQITQSPTYIV+AKIGTPAQTLL+AMDTSNDA+WV
Sbjct: 66 KDQARMQYLSSL-VARRSIVPIASGRQITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWV 124
Query: 124 PCTGCVGCSSTV-FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANL 182
PCT CVGCS+T F A+STTFK +GC A+QCKQV NPTC G ACAFN TYG+S++AA+L
Sbjct: 125 PCTACVGCSTTTPFAPAKSTTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSSVAASL 184
Query: 183 SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
QDT++LATD VP Y FGCIQK TG+SVPPQGLLGLGRG LSLLAQTQ LYQSTFSYCLP
Sbjct: 185 VQDTVTLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLP 244
Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
SFK L+FSGSLRLGP+ QPKRIK+TPLLKNPRRSSLYYVNL+AIRVGRR+VDIPP AL F
Sbjct: 245 SFKTLNFSGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAF 304
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVPIVAP 360
N TGAGT+ DSGTVFTRLV PAY AVR+ FRRR+ LTVTSLGGFDTCY+ PIVAP
Sbjct: 305 NANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYTAPIVAP 364
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
TIT MFSGMNVTLP DN+LIHSTAGS+TCLAMA APDNVNSVLNVIANMQQQNHR+L+DV
Sbjct: 365 TITFMFSGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDV 424
Query: 421 PNSRLGVARELCT 433
PNSRLGVARELCT
Sbjct: 425 PNSRLGVARELCT 437
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 672 bits (1733), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/411 (83%), Positives = 378/411 (91%), Gaps = 1/411 (0%)
Query: 23 PICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV 82
P C+T D STLQV HV+SPCSPF+P +PLSWEESVL+M AKD+ARLQFLSSL VARKSV
Sbjct: 28 PNCETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSL-VARKSV 86
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
VPIASGRQI Q+PTYIVRAKIGTPAQT+LMAMDTS+D AW+PC GC+GCSST+FNS ST
Sbjct: 87 VPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPAST 146
Query: 143 TFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI 202
T+K+LGCQAAQCKQVP PTCGGG C+FNLTYG S++AANLSQDTI+LATD VPGY+FGCI
Sbjct: 147 TYKSLGCQAAQCKQVPKPTCGGGVCSFNLTYGGSSLAANLSQDTITLATDAVPGYSFGCI 206
Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK 262
QKATG S+P QGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+GQPK
Sbjct: 207 QKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPK 266
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
RIKYTPLLKNPRR SLY+VNL+A+RVGRRVVD+PPG+ FNP+TGAGTI DSGTVFTRLV
Sbjct: 267 RIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLV 326
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHS 382
PAY AVRD FR RVG NLTVTSLGGFDTCY+VPI APTIT MF+GMNVTLP DNLLIHS
Sbjct: 327 TPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTVPIAAPTITFMFTGMNVTLPPDNLLIHS 386
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
TAGS TCLAMAAAPDNVNSVLNVIAN+QQQNHR+LYDVPNSRLGVARELCT
Sbjct: 387 TAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 437
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 663 bits (1710), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/406 (82%), Positives = 360/406 (88%), Gaps = 1/406 (0%)
Query: 19 EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
+GLNP CD QD+ STLQV HVFSPCSPF+PSKPLSWEESVL+M AKD RLQFL SL VA
Sbjct: 16 QGLNPKCDVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSL-VA 74
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
RKS+VPIASGRQI QSPTYIVRAKIGTP QTLL+AMDTSNDAAW+PCT C GC+ST+F
Sbjct: 75 RKSIVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAP 134
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
+STTFKN+ C A +CKQVPNP CG + FNLTYGSS+IAANL QDTI+LATD VP YT
Sbjct: 135 EKSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSIAANLVQDTITLATDPVPSYT 194
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 195 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 254
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
QPKRIKYTPLLKNPRRSSLYYVNL AIRVGR+VVDIPP AL FNPTTGAGTI DSGTVF
Sbjct: 255 AQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVF 314
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
TRLVAP Y AVRD FRRRVG LTVTSLGGFDTCY+VPIV PTIT +F+GMNVTLPQDN+
Sbjct: 315 TRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVPIVVPTITFIFTGMNVTLPQDNI 374
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
LIHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+LYDVPNSR
Sbjct: 375 LIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSR 420
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 662 bits (1709), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/425 (80%), Positives = 378/425 (88%), Gaps = 15/425 (3%)
Query: 23 PICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV 82
P C+T D STLQV HV+SPCSPF+P +PLSWEESVL+M AKD+ARLQFLSSL VARKSV
Sbjct: 28 PNCETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSL-VARKSV 86
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
VPIASGRQI Q+PTYIVRAKIGTPAQT+LMAMDTS+D AW+PC GC+GCSST+FNS ST
Sbjct: 87 VPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPAST 146
Query: 143 TFKNLGCQAAQCKQV--------------PNPTCGGGACAFNLTYGSSTIAANLSQDTIS 188
T+K+LGCQAAQCKQV P PTCGGG C+FNLTYG S++AANLSQDTI+
Sbjct: 147 TYKSLGCQAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLAANLSQDTIT 206
Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
LATD VPGY+FGCIQKATG S+P QGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+
Sbjct: 207 LATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN 266
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
FSGSLRLGP+GQPKRIKYTPLLKNPRR SLY+VNL+A+RVGRRVVD+PPG+ FNP+TGA
Sbjct: 267 FSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA 326
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG 368
GTI DSGTVFTRLV PAY AVRD FR RVG NLTVTSLGGFDTCY+VPI APTIT MF+G
Sbjct: 327 GTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTVPIAAPTITFMFTG 386
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
MNVTLP DNLLIHSTAGS TCLAMAAAPDNVNSVLNVIAN+QQQNHR+LYDVPNSRLGVA
Sbjct: 387 MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVA 446
Query: 429 RELCT 433
RELCT
Sbjct: 447 RELCT 451
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 662 bits (1708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/433 (78%), Positives = 378/433 (87%), Gaps = 8/433 (1%)
Query: 1 MKPQLVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
MK L F LAFLFL SL +GLN T+ +T++VFHV+SP SPF+PSKP+SWE+SVL+
Sbjct: 1 MKAYL-FSLAFLFL-SLVQGLN----TRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQ 54
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
MLA+DQARLQFLSSL V RKS VPIASGRQI QSPTYIV+A +GTPAQT LMA+DTSNDA
Sbjct: 55 MLAEDQARLQFLSSL-VGRKSWVPIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDA 113
Query: 121 AWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAA 180
AW+PC GCVGCSSTVFNS STTFK LGC A QCKQVPNPTCGG C +N TYG STI +
Sbjct: 114 AWIPCNGCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILS 173
Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
NL++DTI+L+TDIVPGYTFGCIQK TG+SVPPQGLLGLGRG LS L+QTQ+LY+STFSYC
Sbjct: 174 NLTRDTIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYC 233
Query: 241 LPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
LPSF+ L+FSG+LRLGP GQP RIK TPLLKNPRRSSLYYVNL+ IRVGR++VDIP AL
Sbjct: 234 LPSFRTLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASAL 293
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAP 360
FNPTTGAGTI DSGTVFTRLVAP YTAVRD FR+RVG N V+SLGGFDTCY+ PIVAP
Sbjct: 294 AFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVG-NAIVSSLGGFDTCYTGPIVAP 352
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
T+T MFSGMNVTLP DNLLI STAGS +CLAMAAAPDNVNSVLNVIANMQQQNHRIL+DV
Sbjct: 353 TMTFMFSGMNVTLPTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDV 412
Query: 421 PNSRLGVARELCT 433
PNSR+GVARE C+
Sbjct: 413 PNSRIGVAREPCS 425
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 661 bits (1706), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/428 (78%), Positives = 376/428 (87%), Gaps = 7/428 (1%)
Query: 6 VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
+F LAFLFL SL +GLN T+ +T++VFHV+SP SPF+PSKP+SWE+SVL+MLA+D
Sbjct: 5 LFSLAFLFL-SLVQGLN----TRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAED 59
Query: 66 QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
QARLQFLSSL V RKS VPIASGRQI QSPTYIV+A +GTPAQT LMA+DTSNDAAW+PC
Sbjct: 60 QARLQFLSSL-VGRKSWVPIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPC 118
Query: 126 TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQD 185
GCVGCSSTVFNS STTFK LGC A QCKQVPNPTCGG C +N TYG STI +NL++D
Sbjct: 119 NGCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSNLTRD 178
Query: 186 TISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK 245
TI+L+TDIVPGYTFGCIQK TG+SVPPQGLLGLGRG LS L+QTQ+LY+STFSYCLPSF+
Sbjct: 179 TIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFR 238
Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
L+FSG+LRLGP GQP RIK TPLLKNPRRSSLYYVNL+ IRVGR++VDIP AL FNPT
Sbjct: 239 TLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPT 298
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLM 365
TGAGTI DSGTVFTRLVAP YTAVRD FR+RVG N V+SLGGFDTCY+ PIVAPT+T M
Sbjct: 299 TGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVG-NAIVSSLGGFDTCYTGPIVAPTMTFM 357
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
FSGMNVTLP DNLLI STAGS +CLAMAAAPDNVNSVLNVIANMQQQNHRIL+DVPNSR+
Sbjct: 358 FSGMNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRI 417
Query: 426 GVARELCT 433
GVARE C+
Sbjct: 418 GVAREPCS 425
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/406 (78%), Positives = 370/406 (91%), Gaps = 2/406 (0%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG 88
D SSTLQVFH+FSPCSPF+PSKPLSW ++VL+M AKDQARLQFLSSL VAR+S VPIAS
Sbjct: 36 DRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSL-VARRSFVPIASA 94
Query: 89 RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNL 147
RQ+ QSPT++VRAKIGTPAQTLL+A+DTSNDAAW+PC+GC+GC S+TVF+S +S++F+ L
Sbjct: 95 RQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPL 154
Query: 148 GCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG 207
CQ+ QC QVPNP+C G AC FNLTYGSST+AA+L QD ++LATD VP YTFGCI+KATG
Sbjct: 155 PCQSPQCNQVPNPSCSGSACGFNLTYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATG 214
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYT 267
+SVPPQGLLGLGRG LSLL Q+Q+LYQSTFSYCLPSFK+++FSGSLRLGP+ QP RIKYT
Sbjct: 215 SSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYT 274
Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
PLL+NPRRSSLYYVNL++IRVGR++VDIPP AL FN TGAGT+IDSGT FTRLVAPAYT
Sbjct: 275 PLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYT 334
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
AVRD FRRRVG N+TV+SLGGFDTCY+VPI++PTIT MF+GMNVTLP DN LIHSTAGS
Sbjct: 335 AVRDEFRRRVGRNVTVSSLGGFDTCYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGST 394
Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
TCLAMAAAPDNVNSVLNVIA+MQQQNHRIL+D+PNSR+GVARE C+
Sbjct: 395 TCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCS 440
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/427 (75%), Positives = 372/427 (87%), Gaps = 3/427 (0%)
Query: 8 FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
FL LF+ SL + P CD QD STL+VFH+FS CSPFKPSKP+SWEESVL + AKDQA
Sbjct: 10 FLLCLFI-SLVQAQTPKCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQA 68
Query: 68 RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
R+Q+ SSL VARKSVVPIAS RQI QSPTYIV+AK GTP QTLL+A+DTS+DAAW+PC+G
Sbjct: 69 RMQYFSSL-VARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSG 127
Query: 128 CVGCS-STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDT 186
CVGCS S F +ST+F+N+ C + CKQVPNPTCGG ACAFN TYGSS+IAA++ QDT
Sbjct: 128 CVGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIAASVVQDT 187
Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
++LATD +PGYTFGC+ K TG+S P QGLLGLGRG LSLL+Q+QNLY+STFSYCLPSFK+
Sbjct: 188 LTLATDPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKS 247
Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
++FSGSLRLGP+ QPKRIKYTPLL+NPRRSSLYYVNL+AI+VGR++VDIPP AL FNPTT
Sbjct: 248 INFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTT 307
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMF 366
GAGTI DSGTVFTRL P YTAVR+ FRRRVG L VT+LGGFDTCY+VPIV PTIT +F
Sbjct: 308 GAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVPIVVPTITFLF 367
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
SGMNVTLP DN++IHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+L+DVPNSR+G
Sbjct: 368 SGMNVTLPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIG 427
Query: 427 VARELCT 433
+ARELCT
Sbjct: 428 IARELCT 434
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/424 (78%), Positives = 369/424 (87%), Gaps = 10/424 (2%)
Query: 18 SEGL-NPICDT---QDHS-STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL 72
++GL NP CD DH STLQVFHVFSPCSPF+PSKP+SWEESVL++ AKDQAR+Q+L
Sbjct: 23 AKGLHNPKCDAAYQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLQLQAKDQARMQYL 82
Query: 73 SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS 132
S+L VAR+S+VPIASGRQITQSPTYIVRAK GTPAQTLL+AMDTSNDAAWVPCT CVGCS
Sbjct: 83 SNL-VARRSIVPIASGRQITQSPTYIVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGCS 141
Query: 133 STV-FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLAT 191
+T F +STTFK +GC A+QCKQV NPTC G ACAFN TYG+S++AA+L QDT++LAT
Sbjct: 142 TTTPFAPPKSTTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSSVAASLVQDTVTLAT 201
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
D VP YTFGCIQKATG+S+PPQGLLGLGRG LSLLAQTQ LYQSTFSYCLPSFK L+FSG
Sbjct: 202 DPVPAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSG 261
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
L P+ QP+ Y P KNPRRSSLYYVNL+AIRVGRR+VDIPP AL FNP TGAGT+
Sbjct: 262 HXDLXPVAQPRDQVY-PSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTV 320
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVPIVAPTITLMFSGM 369
DSGTVFTRLV PAYTAVR+ FRRRV LTVTSLGGFDTCY+VPIVAPTIT MFSGM
Sbjct: 321 FDSGTVFTRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTVPIVAPTITFMFSGM 380
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
NVTLP DN+LIHSTAGS+TCLAMA APDNVNSVLNVIANMQQQNHR+L+DVPNSRLGVAR
Sbjct: 381 NVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVAR 440
Query: 430 ELCT 433
ELCT
Sbjct: 441 ELCT 444
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/427 (75%), Positives = 370/427 (86%), Gaps = 3/427 (0%)
Query: 8 FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
FL LF+ SL + P CD QD STL+VFH+FS CSPFKPSKP+SWEESVL + AKDQA
Sbjct: 10 FLLCLFI-SLVQAQTPKCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQA 68
Query: 68 RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
R+Q+ SSL VARKSVVPIAS RQI QSPTYIV+AK GTP QTLL+A+DTS+DAAW+PC+G
Sbjct: 69 RMQYFSSL-VARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSG 127
Query: 128 CVGCS-STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDT 186
CVGCS S F +ST+F+N+ C + CKQVPNPTCGG ACAFN TYGSS+IAA++ QDT
Sbjct: 128 CVGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIAASVVQDT 187
Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
++LA D +PGYTFGC+ K TG+S P QGLLGLGRG LSLL+Q+QNLY+STFSYCLPSFK+
Sbjct: 188 LTLAADPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKS 247
Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
++FSGSLRLGP+ QPKRIKYTPLL+NPRRSSLYYVNL+AI+VGR++VDIPP AL FNPTT
Sbjct: 248 INFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTT 307
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMF 366
GAGTI DSGTVFTRL P YTAVR+ FRRRVG L VT+LGGFDTCY+VPIV PTIT +F
Sbjct: 308 GAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVPIVVPTITFLF 367
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
SGMNV LP DN++IHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+L+DVPNSR+G
Sbjct: 368 SGMNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIG 427
Query: 427 VARELCT 433
+ARELCT
Sbjct: 428 IARELCT 434
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/415 (79%), Positives = 355/415 (85%), Gaps = 16/415 (3%)
Query: 19 EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
+GLNP CD QD+ STLQV HVF +SVL+M AKD RLQFL SL VA
Sbjct: 16 QGLNPKCDVQDNGSTLQVIHVF---------------KSVLQMQAKDTTRLQFLDSL-VA 59
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
RKSVVPIASGRQI QSPTYIVRAKIGTP QTLL+AMDTSNDAAW+PCT C GC+ST+F
Sbjct: 60 RKSVVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAP 119
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
+STTFKN+ C A +CKQVPNP CG +C FNLTYGSS+IAANL QDTI+LATD VP YT
Sbjct: 120 EKSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSSIAANLVQDTITLATDPVPSYT 179
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 180 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 239
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
QPKRIKYTPLLKNPRRSSLYYVNL AIRVGR+VVDIPP AL FNPTTGAGTI DSGTVF
Sbjct: 240 AQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVF 299
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
TRLVAP Y AVRD FRRRVG LTVTSLGGFDTCY+VPIV PTIT +F+GMNVTLPQDN+
Sbjct: 300 TRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVPIVVPTITFIFTGMNVTLPQDNI 359
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
LIHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+LYDVPNSR+GVARELCT
Sbjct: 360 LIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELCT 414
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 317/373 (84%), Positives = 346/373 (92%), Gaps = 1/373 (0%)
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
M AKD+ARLQFLSSL VARKSVVPIASGRQI Q+PTYIVRAKIGTPAQT+LMAMDTS+D
Sbjct: 1 MQAKDKARLQFLSSL-VARKSVVPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDV 59
Query: 121 AWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAA 180
AW+PC GC+GCSST+FNS STT+K+LGCQAAQCKQVP PTCGGG C+FNLTYG S++AA
Sbjct: 60 AWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFNLTYGGSSLAA 119
Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
NLSQDTI+LATD VPGY+FGCIQKATG S+P QGLLGLGRG LSLL+QTQNLYQSTFSYC
Sbjct: 120 NLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYC 179
Query: 241 LPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
LPSFK+L+FSGSLRLGP+GQPKRIKYTPLLKNPRR SLY+VNL+A+RVGRRVVD+PPG+
Sbjct: 180 LPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSF 239
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAP 360
FNP+TGAGTI DSGTVFTRLV PAY AVRD FR RVG NLTVTSLGGFDTCY+VPI AP
Sbjct: 240 TFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTVPIAAP 299
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
TIT MF+GMNVTLP DNLLIHSTAGS TCLAMAAAPDNVNSVLNVIAN+QQQNHR+LYDV
Sbjct: 300 TITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDV 359
Query: 421 PNSRLGVARELCT 433
PNSRLGVARELCT
Sbjct: 360 PNSRLGVARELCT 372
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 602 bits (1553), Expect = e-170, Method: Compositional matrix adjust.
Identities = 295/372 (79%), Positives = 328/372 (88%), Gaps = 2/372 (0%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
+AKDQARLQFLSSL VA+KSVVPIASGR + QSP+YIV+AK+GTP QTLLMA+D S DAA
Sbjct: 1 MAKDQARLQFLSSL-VAKKSVVPIASGRGVIQSPSYIVKAKVGTPPQTLLMALDNSYDAA 59
Query: 122 WVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN 181
W+PC GCVGCSSTVFN+ +STTFK LGC A QCKQVPNP CGG C +N TYGSSTI +N
Sbjct: 60 WIPCKGCVGCSSTVFNTVKSTTFKTLGCGAPQCKQVPNPICGGSTCTWNTTYGSSTILSN 119
Query: 182 LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
L++DTI+L+ D VP Y FGCIQKATG+SVPPQGLLG GRG LS L+QTQNLY+STFSYCL
Sbjct: 120 LTRDTIALSMDPVPYYAFGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCL 179
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
PSF+ L+FSGSLRLGP+GQP RIK TPLLKNPRRSSLYYV L IRVGR++VDIP AL
Sbjct: 180 PSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALA 239
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPT 361
FNPTTGAGTI DSGTVFTRLVAPAY AVR+ FR+RVG N TV+SLGGFDTCYSVPIV PT
Sbjct: 240 FNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVG-NATVSSLGGFDTCYSVPIVPPT 298
Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
IT MFSGMNVT+P +NLLIHSTAG +CLAMAAAPDNVNSVLNVIA+MQQQNHRIL+DVP
Sbjct: 299 ITFMFSGMNVTMPPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVP 358
Query: 422 NSRLGVARELCT 433
NSRLGVARE C+
Sbjct: 359 NSRLGVAREQCS 370
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 590 bits (1520), Expect = e-166, Method: Compositional matrix adjust.
Identities = 301/437 (68%), Positives = 359/437 (82%), Gaps = 9/437 (2%)
Query: 5 LVFFLAFLFLFSLSEGLN-PICD---TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
LV FL + L+ GLN P CD TQD STL++FH+ SPCSPFK S PLSWE VL+
Sbjct: 4 LVLFLQLFSILPLALGLNHPNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQ 63
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
LA+DQARLQ+LSSL VA +SVVPIASGRQ+ QS TYIV+A IGTPAQ LL+AMDTS+D
Sbjct: 64 TLAQDQARLQYLSSL-VAGRSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDV 122
Query: 121 AWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA 179
AW+PC+GCVGC S+T F+ A+ST+FKN+ C A QCKQVPNPTCG AC+FNLTYGSS+IA
Sbjct: 123 AWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSIA 182
Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTF 237
ANLSQDTI LA D + +TFGC+ K G PPQGLLGLGRG LSL++Q Q++Y+STF
Sbjct: 183 ANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTF 242
Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
SYCLPSF++L+FSGSLRLGP QP+R+KYT LL+NPRRSSLYYVNL+AIRVGR+VVD+PP
Sbjct: 243 SYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPP 302
Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP 356
A+ FNP+TGAGTI DSGTV+TRL P Y AVR+ FR+RV + VTSLGGFDTCYS
Sbjct: 303 AAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQ 362
Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
+ PTIT MF G+N+T+P DNL++HSTAGS +CLAMAAAP+NVNSV+NVIA+MQQQNHR+
Sbjct: 363 VKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRV 422
Query: 417 LYDVPNSRLGVARELCT 433
L DVPN RLG+ARE C+
Sbjct: 423 LIDVPNGRLGLARERCS 439
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 589 bits (1519), Expect = e-166, Method: Compositional matrix adjust.
Identities = 301/437 (68%), Positives = 359/437 (82%), Gaps = 9/437 (2%)
Query: 5 LVFFLAFLFLFSLSEGLN-PICD---TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
LV FL + L+ GLN P CD TQD STL++FH+ SPCSPFK S PLSWE VL+
Sbjct: 20 LVLFLQLFSILPLALGLNHPNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQ 79
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
LA+DQARLQ+LSSL VA +SVVPIASGRQ+ QS TYIV+A IGTPAQ LL+AMDTS+D
Sbjct: 80 TLAQDQARLQYLSSL-VAGRSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDV 138
Query: 121 AWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA 179
AW+PC+GCVGC S+T F+ A+ST+FKN+ C A QCKQVPNPTCG AC+FNLTYGSS+IA
Sbjct: 139 AWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSIA 198
Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTF 237
ANLSQDTI LA D + +TFGC+ K G PPQGLLGLGRG LSL++Q Q++Y+STF
Sbjct: 199 ANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTF 258
Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
SYCLPSF++L+FSGSLRLGP QP+R+KYT LL+NPRRSSLYYVNL+AIRVGR+VVD+PP
Sbjct: 259 SYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPP 318
Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP 356
A+ FNP+TGAGTI DSGTV+TRL P Y AVR+ FR+RV + VTSLGGFDTCYS
Sbjct: 319 AAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQ 378
Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
+ PTIT MF G+N+T+P DNL++HSTAGS +CLAMAAAP+NVNSV+NVIA+MQQQNHR+
Sbjct: 379 VKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRV 438
Query: 417 LYDVPNSRLGVARELCT 433
L DVPN RLG+ARE C+
Sbjct: 439 LIDVPNGRLGLARERCS 455
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 579 bits (1492), Expect = e-163, Method: Compositional matrix adjust.
Identities = 296/437 (67%), Positives = 354/437 (81%), Gaps = 9/437 (2%)
Query: 5 LVFFLAFLFLFSLSEGLN-PICD---TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
LV FL + L+ GLN P CD QD STL++FH+ SPCSPFK PLSWE VL+
Sbjct: 4 LVLFLQLFSIVPLALGLNHPNCDLTKNQDQGSTLRIFHIDSPCSPFKSPSPLSWEARVLQ 63
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
LA+DQARLQ+LSSL VA +SVVPIASGRQ+ QS TYIV+ IGTPAQ LL+AMDTS+D
Sbjct: 64 TLAQDQARLQYLSSL-VAGRSVVPIASGRQMLQSTTYIVKVLIGTPAQPLLLAMDTSSDV 122
Query: 121 AWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA 179
AW+PC+GCVGC S+T F+ A+ST+FKN+ C A QCKQVPNP CG AC+FNLTYGSS+IA
Sbjct: 123 AWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPACGARACSFNLTYGSSSIA 182
Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTF 237
ANLSQDTI LA D + +TFGC+ K G PPQGLLGLGRG LSL++Q Q++Y+STF
Sbjct: 183 ANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTF 242
Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
SYCLPSF++L+FSGSLRLGP QP+R+KYT LL+NPRRSSLYYVNL+AIRVGR+VVD+PP
Sbjct: 243 SYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPP 302
Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP 356
A+ FNP+TGAGTI DSGTV+TRL P Y AVR+ FR+RV VTSLGGFDTCYS
Sbjct: 303 AAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGGFDTCYSGQ 362
Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
+ PTIT MF G+N+T+P DNL++HSTAGS +CLAMA+AP+NVNSV+NVIA+MQQQNHR+
Sbjct: 363 VKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRV 422
Query: 417 LYDVPNSRLGVARELCT 433
L DVPN RLG+ARE C+
Sbjct: 423 LIDVPNGRLGLARERCS 439
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 283/418 (67%), Positives = 344/418 (82%), Gaps = 11/418 (2%)
Query: 18 SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
SE +N C+ + HSS L+VFH+ S CSPFK S +SW +++L+ D+AR +LSSLA
Sbjct: 17 SESIN--CNEKSHSSDLRVFHINSQCSPFKTS--VSWADTLLQ----DKARFLYLSSLAG 68
Query: 78 ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-F 136
RKS VPIASGR I QSPTYIVRA IGTPAQ +L+A+DTSNDAAW+PC+GCVGCSS+V F
Sbjct: 69 VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLF 128
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAANLSQDTISLATDIVP 195
+ ++S++ + L C+A QCKQ PNP+C +C FN+TYG STI A L+QDT++LA+D++P
Sbjct: 129 DPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTIEAYLTQDTLTLASDVIP 188
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
YTFGCI KA+G S+P QGL+GLGRG LSL++Q+QNLYQSTFSYCLP+ K+ +FSGSLRL
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
GP QP RIK TPLLKNPRRSSLYYVNL+ IRVG ++VDIP AL F+P TGAGTI DSG
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
TV+TRLV PAY AVR+ FRRRV N TSLGGFDTCYS +V P++T MF+GMNVTLP
Sbjct: 309 TVYTRLVEPAYVAVRNEFRRRV-KNANATSLGGFDTCYSGSVVFPSVTFMFAGMNVTLPP 367
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
DNLLIHS+AG+++CLAMAAAP NVNSVLNVIA+MQQQNHR+L DVPNSRLG++RE CT
Sbjct: 368 DNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 288/421 (68%), Positives = 343/421 (81%), Gaps = 14/421 (3%)
Query: 18 SEGLNPICDT---QDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS 74
SE LN C+ Q H S L+VFHV SPCSPFK +SWE ++L KD+ARLQ+LSS
Sbjct: 17 SESLN--CNENNPQGHPSDLRVFHVNSPCSPFKQPNTVSWESTLL----KDKARLQYLSS 70
Query: 75 LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST 134
LA +K VPIASGR I QSPTYIVRA IGTPAQ +L+A+DTSNDAAWVPC+GCVGC+S+
Sbjct: 71 LA--KKPSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASS 128
Query: 135 V-FNSAQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYGSSTIAANLSQDTISLATD 192
V F+ ++S++ +NL C A QCKQ PNPTC G +C FN+TYG STI A+L+QDT++LA D
Sbjct: 129 VLFDPSKSSSSRNLQCDAPQCKQAPNPTCTAGKSCGFNMTYGGSTIEASLTQDTLTLAND 188
Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
++ YTFGCI KATG S+P QGL+GLGRG LSL++QTQNLY STFSYCLP+ K+ +FSGS
Sbjct: 189 VIKSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGS 248
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
LRLGP QP RIK TPLLKNPRRSSLYYVNL+ IRVG ++VDIP AL F+ +TGAGTI
Sbjct: 249 LRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIF 308
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVT 372
DSGTVFTRLV PAY AVR+ FRRR+ N TSLGGFDTCYS +V P++T MF+GMNVT
Sbjct: 309 DSGTVFTRLVEPAYVAVRNEFRRRI-KNANATSLGGFDTCYSGSVVYPSVTFMFAGMNVT 367
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP DNLLIHS++GS +CLAMAAAP+NVNSVLNVIA+MQQQNHR+L D+PNSRLG++RE C
Sbjct: 368 LPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETC 427
Query: 433 T 433
T
Sbjct: 428 T 428
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 283/418 (67%), Positives = 344/418 (82%), Gaps = 11/418 (2%)
Query: 18 SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
SE +N C+ + HSS L+VFH+ S CSPFK S +SW +++L+ D+AR +LSSLA
Sbjct: 17 SESIN--CNEKSHSSDLRVFHINSLCSPFKTS--VSWADTLLQ----DKARFLYLSSLAG 68
Query: 78 ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-F 136
RKS VPIASGR I QSPTYIVRA IGTPAQ +L+A+DTSNDAAW+PC+GCVGCSS+V F
Sbjct: 69 VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLF 128
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAANLSQDTISLATDIVP 195
+ ++S++ + L C+A QCKQ PNP+C +C FN+TYG STI A L+QDT++LA+D++P
Sbjct: 129 DPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTIEAYLTQDTLTLASDVIP 188
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
YTFGCI KA+G S+P QGL+GLGRG LSL++Q+QNLYQSTFSYCLP+ K+ +FSGSLRL
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
GP QP RIK TPLLKNPRRSSLYYVNL+ IRVG ++VDIP AL F+P TGAGTI DSG
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
TV+TRLV PAY AVR+ FRRRV N TSLGGFDTCYS +V P++T MF+GMNVTLP
Sbjct: 309 TVYTRLVEPAYVAVRNEFRRRV-KNANATSLGGFDTCYSGSVVFPSVTFMFAGMNVTLPP 367
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
DNLLIHS+AG+++CLAMAAAP NVNSVLNVIA+MQQQNHR+L DVPNSRLG++RE CT
Sbjct: 368 DNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 574 bits (1479), Expect = e-161, Method: Compositional matrix adjust.
Identities = 281/418 (67%), Positives = 342/418 (81%), Gaps = 11/418 (2%)
Query: 18 SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
SE +N C+ + HSS L+VFH+ S CSPFK S +SW +++L+ D+AR +LSSLA
Sbjct: 17 SESIN--CNEKSHSSDLRVFHINSQCSPFKTS--VSWADTLLQ----DKARFLYLSSLAG 68
Query: 78 ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-F 136
KS VPIASGR I QSPTYIVRA IGTPAQ +L+A+DTSNDAAW+PC+GCVGCSS+V F
Sbjct: 69 VTKSSVPIASGRGIVQSPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSVLF 128
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAANLSQDTISLATDIVP 195
+ ++S++ + L C+A QCKQ PNP+C +C FN+TYG S I A L+QDT++LATD++P
Sbjct: 129 DPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSAIEAYLTQDTLTLATDVIP 188
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
YTFGCI KA+G S+P QGL+GLGRG LSL++Q+QNLYQSTFSYCLP+ K+ +FSGSLRL
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
GP QP RIK TPLLKNPRRSSLYYVNL+ IRVG ++VDIP AL F+P TGAGTI DSG
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
TV+TRLV PAY A+R+ FRRRV N TSLGGFDTCYS +V P++T MF+GMNVTLP
Sbjct: 309 TVYTRLVEPAYVAMRNEFRRRV-KNANATSLGGFDTCYSGSVVFPSVTFMFAGMNVTLPP 367
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
DNLLIHS+AG+++CLAMAAAP NVNSVLNVIA+MQQQNHR+L DVPNSRLG++RE CT
Sbjct: 368 DNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 284/364 (78%), Positives = 331/364 (90%), Gaps = 2/364 (0%)
Query: 71 FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
FLSSL VAR+S VPIAS RQ+ QSPT++VRAKIGTPAQTLL+A+DTSNDAAW+PC+GC+G
Sbjct: 1 FLSSL-VARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIG 59
Query: 131 C-SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISL 189
C S+TVF+S +S++F+ L CQ+ QC QVPNP+C G AC FNLTYGSST+AA+L QD ++L
Sbjct: 60 CPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNLTYGSSTVAADLVQDNLTL 119
Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
ATD VP YTFGCI+KATG+SVPPQGLLGLGRG LSLL Q+Q+LYQSTFSYCLPSFK+++F
Sbjct: 120 ATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNF 179
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
SGSLRLGP+ QP RIKYTPLL+NPRRSSLYYVNL++IRVGR++VDIPP AL FN TGAG
Sbjct: 180 SGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAG 239
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGM 369
T+IDSGT FTRLVAPAYTAVRD FRRRVG N+TV+SLGGFDTCY+VPI++PTIT MF+GM
Sbjct: 240 TVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIISPTITFMFAGM 299
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
NVTLP DN LIHST+GS TCLAMAAAPDNVNSVLNVIA+MQQQNHRIL+D+PNSR+GVAR
Sbjct: 300 NVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVAR 359
Query: 430 ELCT 433
E C+
Sbjct: 360 ESCS 363
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 270/320 (84%), Positives = 296/320 (92%)
Query: 114 MDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY 173
MDTS+D AW+PC GC+GCSST+FNS STT+K+LGCQAAQCKQVP PTCGGG C+FNLTY
Sbjct: 1 MDTSSDVAWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFNLTY 60
Query: 174 GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
G S++AANLSQDTI+LATD VPGY+FGCIQKATG S+P QGLLGLGRG LSLL+QTQNLY
Sbjct: 61 GGSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLY 120
Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
QSTFSYCLPSFK+L+FSGSLRLGP+GQPKRIKYTPLLKNPRR SLY+VNL+A+RVGRRVV
Sbjct: 121 QSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVV 180
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
D+PPG+ FNP+TGAGTI DSGTVFTRLV PAY AVRD FR RVG NLTVTSLGGFDTCY
Sbjct: 181 DVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCY 240
Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+VPI APTIT MF+GMNVTLP DNLLIHSTAGS TCLAMAAAPDNVNSVLNVIAN+QQQN
Sbjct: 241 TVPIAAPTITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQN 300
Query: 414 HRILYDVPNSRLGVARELCT 433
HR+LYDVPNSRLGVARELCT
Sbjct: 301 HRLLYDVPNSRLGVARELCT 320
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 248/410 (60%), Positives = 319/410 (77%), Gaps = 5/410 (1%)
Query: 29 DHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIAS 87
D +TLQV H F PCSP S SW + + A+D +RL +L SLAV ++ PIAS
Sbjct: 38 DAGATLQVSHAFGPCSPLGAESAAPSWAGFLADQAARDASRLLYLDSLAVKGRAYAPIAS 97
Query: 88 GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKN 146
GRQ+ Q+PTY+VRA++GTPAQ LL+A+DTSNDAAW+PC+GC GC +S+ FN A S +++
Sbjct: 98 GRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNPAASASYRP 157
Query: 147 LGCQAAQCKQVPNPTCGGGA--CAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQK 204
+ C + QC PNP+C A C F+L+Y S++ A LSQDT+++A D+V YTFGC+Q+
Sbjct: 158 VPCGSPQCVLAPNPSCSPNAKSCGFSLSYADSSLQAALSQDTLAVAGDVVKAYTFGCLQR 217
Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
ATG + PPQGLLGLGRG LS L+QT+++Y +TFSYCLPSFK+L+FSG+LRLG GQP+RI
Sbjct: 218 ATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRI 277
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
K TPLL NP RSSLYYVN+ IRVG++VV IP AL F+P TGAGT++DSGT+FTRLVAP
Sbjct: 278 KTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAP 337
Query: 325 AYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHST 383
Y A+RD RRRVG+ V+SLGGFDTCY+ + P +TL+F GM VTLP++N++IH+T
Sbjct: 338 VYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTTVAWPPVTLLFDGMQVTLPEENVVIHTT 397
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
G+ +CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 398 YGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCT 447
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 256/414 (61%), Positives = 320/414 (77%), Gaps = 11/414 (2%)
Query: 29 DHSSTLQVFHVFSPCSPFKP-SKPLSWEESVLEMLAKDQARLQFLSSLAVARKS--VVPI 85
D +TLQV H F PCSP P + SW + + ++D +RL +L SLA K+ PI
Sbjct: 39 DAGNTLQVSHAFGPCSPLGPGTTAPSWAGFLADQASRDASRLLYLDSLAARGKARAYAPI 98
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQST 142
ASGRQ+ Q+PTY+VRA++GTP Q LL+A+DTSNDAAW+PC GC GC S+ F+ A ST
Sbjct: 99 ASGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAAST 158
Query: 143 TFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
+++++ C + C Q PN C GG AC F+LTY S++ A LSQD++++A D V YTFG
Sbjct: 159 SYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGDAVKTYTFG 218
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
C+QKATG + PPQGLLGLGRG LS L+QT+++YQ TFSYCLPSFK+L+FSG+LRLG GQ
Sbjct: 219 CLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQ 278
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
P RIK TPLL NP RSSLYYVN+ IRVGR+VV IPP AL F+P TGAGT++DSGT+FTR
Sbjct: 279 PPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTR 338
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-PTITLMFSGMNVTLPQDNLL 379
LVAPAY AVRD RRRVG+ V+SLGGFDTC++ VA P +TL+F GM VTLP++N++
Sbjct: 339 LVAPAYVAVRDEVRRRVGA--PVSSLGGFDTCFNTTAVAWPPVTLLFDGMQVTLPEENVV 396
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
IHST G+I+CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 397 IHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 450
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 248/409 (60%), Positives = 316/409 (77%), Gaps = 5/409 (1%)
Query: 29 DHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIAS 87
D +TLQV H F PCSP + SW + + ++D +RL +L SLAVA ++ PIAS
Sbjct: 39 DAGATLQVSHAFGPCSPLGNAAAAPSWAGFLADQSSRDASRLLYLDSLAVAGRAYAPIAS 98
Query: 88 GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-FNSAQSTTFKN 146
GRQ+ Q+PTY+VRA++GTP Q LL+A+DTSNDAAW+PC+GC GC +T FN A S +++
Sbjct: 99 GRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTPFNPAASKSYRA 158
Query: 147 LGCQAAQCKQVPNPTC--GGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQK 204
+ C + C + PNP+C +C F+LTY S++ A LSQD++++A D+V YTFGC+QK
Sbjct: 159 VPCGSPACSRAPNPSCSLNTKSCGFSLTYADSSLEAALSQDSLAVANDVVKSYTFGCLQK 218
Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
ATG + PPQGLLGLGRG LS L+QT+++Y+ TFSYCLPSFK+L+FSG+LRLG GQP RI
Sbjct: 219 ATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQPLRI 278
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
K TPLL NP RSSLYYV++ IRVG++VV IPP AL F+P TGAGT++DSGT+FTRLVAP
Sbjct: 279 KTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAP 338
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHSTA 384
AY AVRD RRR+ ++SLGGFDTCY+ + P +T MF+GM VTLP DNL+IHST
Sbjct: 339 AYVAVRDEVRRRI-RGAPLSSLGGFDTCYNTTVKWPPVTFMFTGMQVTLPADNLVIHSTY 397
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
G+ +CLAMAAAPD VN+VLNVIA+MQQQNHRIL+DVPN R+G ARE CT
Sbjct: 398 GTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQCT 446
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 244/431 (56%), Positives = 308/431 (71%), Gaps = 9/431 (2%)
Query: 8 FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
F F LFS ++ ++P C TQ +S L V ++S CSPF P K SW +V+ M +KD
Sbjct: 10 FFLFALLFSTTKAVDP-CATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPE 68
Query: 68 RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
RL++LS+LA + + VPIA G+Q+ + Y+VR K+GTP Q + M +DTSNDAAWVPC+G
Sbjct: 69 RLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSG 128
Query: 128 CVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYG-SSTIAANLS 183
C GCSST F STT +L C AQC QV +C G AC FN +YG S++ A L
Sbjct: 129 CTGCSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLV 188
Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
QD I+LA D++PG+TFGCI +G S+PPQGLLGLGRG +SL++Q +Y FSYCLPS
Sbjct: 189 QDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS 248
Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
FK+ FSGSL+LGP+GQPK I+ TPLL+NP R SLYYVNL + VGR V IP L F+
Sbjct: 249 FKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFD 308
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPT 361
P TGAGTIIDSGTV TR V P Y A+RD FR++V N ++SLG FDTC++ AP
Sbjct: 309 PNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NGPISSLGAFDTCFAATNEAEAPA 366
Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
ITL F G+N+ LP +N LIHS++GS+ CL+MAAAP+NVNSVLNVIAN+QQQN RI++D
Sbjct: 367 ITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTT 426
Query: 422 NSRLGVARELC 432
NSRLG+ARELC
Sbjct: 427 NSRLGIARELC 437
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 245/432 (56%), Positives = 309/432 (71%), Gaps = 10/432 (2%)
Query: 7 FFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQ 66
FFL L LFS ++ ++P C TQ +S L V ++S CSPF P K SW +V+ M +KD
Sbjct: 10 FFLVAL-LFSTTKAVDP-CATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDP 67
Query: 67 ARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
RL++LS+LA + + VPIA G+Q+ + Y+VR K+GTP Q + M +DTSNDAAWVPC+
Sbjct: 68 ERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCS 127
Query: 127 GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYG-SSTIAANL 182
GC G SST F STT +L C AQC QV +C G AC FN +YG S++ A L
Sbjct: 128 GCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATL 187
Query: 183 SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
QD I+LA D++PG+TFGCI +G S+PPQGLLGLGRG +SL++Q +Y FSYCLP
Sbjct: 188 VQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLP 247
Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
SFK+ FSGSL+LGP+GQPK I+ TPLL+NP R SLYYVNL + VGR V IP L F
Sbjct: 248 SFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVF 307
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAP 360
+P TGAGTIIDSGTV TR V P Y A+RD FR++V N ++SLG FDTC++ AP
Sbjct: 308 DPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NGPISSLGAFDTCFAATNEAEAP 365
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
ITL F G+N+ LP +N LIHS++GS+ CL+MAAAP+NVNSVLNVIAN+QQQN RI++D
Sbjct: 366 AITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDT 425
Query: 421 PNSRLGVARELC 432
NSRLG+ARELC
Sbjct: 426 TNSRLGIARELC 437
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 248/414 (59%), Positives = 316/414 (76%), Gaps = 15/414 (3%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSKPL-SWEESVLEMLAKDQARLQFLSSLAVA--RKSVVPI 85
D +TLQV H F PCSP P SW + + ++D +RL +L SLAV ++ PI
Sbjct: 41 DAGNTLQVSHAFGPCSPLGPGTAAPSWAGFLADQASRDASRLLYLDSLAVRGRARAYAPI 100
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQST 142
ASGRQ+ Q+PTY+VRA +GTP Q LL+A+DTSNDA+W+PC GC GC S+ F+ A S
Sbjct: 101 ASGRQLLQTPTYVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSA 160
Query: 143 TFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
+++ + C + C Q PN C GG AC F+LTY S++ A LSQD++++A + V YTFG
Sbjct: 161 SYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGNAVKAYTFG 220
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
C+Q+ATG + PPQGLLGLGRG LS L+QT+++Y++TFSYCLPSFK+L+FSG+LRLG GQ
Sbjct: 221 CLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQ 280
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
P+RIK TPLL NP RSSLYYVN+ IRVGR+VV IP F+P TGAGT++DSGT+FTR
Sbjct: 281 PQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIP----AFDPATGAGTVLDSGTMFTR 336
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-PTITLMFSGMNVTLPQDNLL 379
LVAPAY AVRD RRRVG+ V+SLGGFDTC++ VA P +TL+F GM VTLP++N++
Sbjct: 337 LVAPAYVAVRDEVRRRVGA--PVSSLGGFDTCFNTTAVAWPPVTLLFDGMQVTLPEENVV 394
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
IHST G+I+CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 395 IHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 476 bits (1224), Expect = e-131, Method: Compositional matrix adjust.
Identities = 246/414 (59%), Positives = 315/414 (76%), Gaps = 15/414 (3%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSKPL-SWEESVLEMLAKDQARLQFLSSLAVA--RKSVVPI 85
D +TLQV H F PCSP P SW + + ++D +RL +L SLAV ++ PI
Sbjct: 41 DAGNTLQVSHAFGPCSPLGPGTAAPSWAGFLADQASRDASRLLYLDSLAVRGRARAYAPI 100
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQST 142
ASGRQ+ Q+ TY+VRA +GTP Q LL+A+DTSNDA+W+PC GC GC S+ F+ A S
Sbjct: 101 ASGRQLLQTLTYVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASA 160
Query: 143 TFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
+++ + C + C Q PN C GG AC F+LTY S++ A LSQD++++A + V YTFG
Sbjct: 161 SYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGNAVKAYTFG 220
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
C+Q+ATG + PPQGLLGLGRG LS L+QT+++Y++TFSYCLPSFK+L+FSG+LRLG GQ
Sbjct: 221 CLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQ 280
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
P+RIK TPLL NP RSSLYYVN+ +RVGR+VV IP F+P TGAGT++DSGT+FTR
Sbjct: 281 PQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIP----AFDPATGAGTVLDSGTMFTR 336
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-PTITLMFSGMNVTLPQDNLL 379
LVAPAY AVRD RRRVG+ V+SLGGFDTC++ VA P +TL+F GM VTLP++N++
Sbjct: 337 LVAPAYVAVRDEVRRRVGA--PVSSLGGFDTCFNTTAVAWPPMTLLFDGMQVTLPEENVV 394
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
IHST G+I+CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 395 IHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 236/437 (54%), Positives = 304/437 (69%), Gaps = 14/437 (3%)
Query: 8 FLAFLFL---FSLSEGLNPICD--TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEML 62
F AF+FL S ++ +P ++ S L V HV+ CSPF K SW +V+ M
Sbjct: 4 FTAFVFLTLVVSTTKAFDPCASPSSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINMA 63
Query: 63 AKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
+KD AR+ +LSSL + K+ VPIASG+Q+ Y+VR K+GTP Q + M +DTS DAA
Sbjct: 64 SKDPARVTYLSSLVASPKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAA 123
Query: 122 WVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN---PTCGGGACAFNLTYG-SST 177
WVPC C GCSS F+ S+T+ +L C QC QV PT G AC FN TYG S+
Sbjct: 124 WVPCADCAGCSSPTFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSS 183
Query: 178 IAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
+A LSQD++ LA D +P Y+FGC+ +G+++PPQGLLGLGRG +SLL+Q+ +LY F
Sbjct: 184 FSAMLSQDSLGLAVDTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVF 243
Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
SYC PSFK+ FSGSLRLGP+GQPK I+ TPLL+NP R +LYYVNL + VGR +V + P
Sbjct: 244 SYCFPSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAP 303
Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-- 355
L F+P TGAGTIIDSGTV TR V P Y A+RD FR++V ++G FDTC++
Sbjct: 304 ELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPF--ATIGAFDTCFAATN 361
Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
+AP +T F+GM++ LP +N LIHS+AGS+ CLAMAAAP+NVNSVLNVIAN+QQQN R
Sbjct: 362 EDIAPPVTFHFTGMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLR 421
Query: 416 ILYDVPNSRLGVARELC 432
I++DV NSRLG+ARELC
Sbjct: 422 IMFDVTNSRLGIARELC 438
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 225/358 (62%), Positives = 290/358 (81%), Gaps = 4/358 (1%)
Query: 80 KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNS 138
++ PIASGRQ+ Q+PTY+VRA++GTPAQ LL+A+DTSNDAAW+PC+GC GC +S+ FN
Sbjct: 37 RAYAPIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNP 96
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGA--CAFNLTYGSSTIAANLSQDTISLATDIVPG 196
A S +++ + C + QC PNP+C A C F+L+Y S++ A LSQDT+++A D+V
Sbjct: 97 AASASYRPVPCGSPQCVLAPNPSCSPNAKSCGFSLSYADSSLQAALSQDTLAVAGDVVKA 156
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
YTFGC+Q+ATG + PPQGLLGLGRG LS L+QT+++Y +TFSYCLPSFK+L+FSG+LRLG
Sbjct: 157 YTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLG 216
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
GQP+RIK TPLL NP RSSLYYVN+ IRVG++VV IP AL F+P TGAGT++DSGT
Sbjct: 217 RNGQPRRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGT 276
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
+FTRLVAP Y A+RD RRRVG+ V+SLGGFDTCY+ + P +TL+F GM VTLP+
Sbjct: 277 MFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTTVAWPPVTLLFDGMQVTLPE 336
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+N++IH+T G+ +CLAMAAAPD VN+VLNVIA+MQQQNHR+L+DVPN R+G ARE CT
Sbjct: 337 ENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCT 394
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 246/431 (57%), Positives = 310/431 (71%), Gaps = 9/431 (2%)
Query: 8 FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
FL F L S + L+P C +Q S L + ++S CSPF P K +V++M +KD A
Sbjct: 9 FLLFALLVSSTIALDP-CASQADDSDLSIIPIYSKCSPFIPPKQEPLVNTVIDMASKDPA 67
Query: 68 RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
RL++LSSLA + VPIA G+Q+ Y+VR K+GTP Q + M +DTSNDAAWVPC+G
Sbjct: 68 RLKYLSSLAAQMTTAVPIAPGQQVLNIGNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSG 127
Query: 128 CVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYG-SSTIAANLS 183
C GCSST F++ S+T+ +L C AQC QV +C G +C FN +YG S+ +A L
Sbjct: 128 CTGCSSTTFSTNTSSTYGSLDCSMAQCTQVRGFSCPATGSSSCVFNQSYGGDSSFSATLV 187
Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
+D++ L D++P + FGCI +G SVPPQGLLGLGRG LSL+AQ+ +LY FSYCLPS
Sbjct: 188 EDSLRLVNDVIPNFAFGCINSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPS 247
Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
FK+ FSGSL+LGP GQPK I+YTPLL+NP R SLYYVNL + VGR +V I P L FN
Sbjct: 248 FKSYYFSGSLKLGPAGQPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFN 307
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPT 361
P TGAGTIIDSGTV TR V P YTA+RD FR++V +SLG FDTC++ VAP
Sbjct: 308 PNTGAGTIIDSGTVITRFVQPIYTAIRDEFRKQVAGPF--SSLGAFDTCFAATNEAVAPA 365
Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
+TL F+G+N+ LP +N LIHS+AGS+ CLAMAAAP+NVNSVLNVIAN+QQQN R+L+DVP
Sbjct: 366 VTLHFTGLNLVLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVP 425
Query: 422 NSRLGVARELC 432
NSRLG+ARELC
Sbjct: 426 NSRLGIARELC 436
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 450 bits (1157), Expect = e-124, Method: Compositional matrix adjust.
Identities = 231/406 (56%), Positives = 294/406 (72%), Gaps = 9/406 (2%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
L V ++ CSPF K SW +V++M +KD AR+++LSSL + PIASG+Q+
Sbjct: 32 LSVIPIYGKCSPFTAPKSESWMNTVIDMASKDPARIRYLSSLTAQKTVAAPIASGQQVLN 91
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ-STTFKNLGCQAA 152
Y+VR ++GTP QT+ M +DTSNDAAW PC+GC+GCSST SAQ S+TF L C
Sbjct: 92 VGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTTTFSAQNSSTFATLDCSKP 151
Query: 153 QCKQVPN---PTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
+C Q PT G C FN TYG ST +A L QD++ L +++P ++FGCI A+G+
Sbjct: 152 ECTQARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGPNVIPNFSFGCISSASGS 211
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
S+PPQGL+GLGRG LSL++Q+ +LY FSYCLPSFK+ FSGSL+LGP+GQPK I+ TP
Sbjct: 212 SIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTTP 271
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
LL NP R SLYYVNL I VGR +V I P L F+P TGAGTIIDSGTV TR V YTA
Sbjct: 272 LLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYTA 331
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
VRD FR++VG + + LG FDTC++ + AP ITL SG+++ LP +N LIHS+AGS
Sbjct: 332 VRDEFRKQVGGSF--SPLGAFDTCFATNNEVSAPAITLHLSGLDLKLPMENSLIHSSAGS 389
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ CLAMAAAP+NVNSV+NVIAN+QQQNHRIL+D+ NS+LG+ARELC
Sbjct: 390 LACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARELC 435
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 214/358 (59%), Positives = 275/358 (76%), Gaps = 8/358 (2%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--SSTVFNSAQ 140
VPIA GRQI P YI RA +GTPAQTLL+A+D SNDAAWVPC+ C GC SS F+ Q
Sbjct: 88 VPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQ 147
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTYGSSTIAANLSQDTISLATDIVPGY 197
S+T++ + C + QC QVP+P+C G +C FNLTY +ST A L QD+++L ++V Y
Sbjct: 148 SSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSY 207
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
TFGC++ +GNSVPPQGL+G GRG LS L+QT++ Y S FSYCLP++++ +FSG+L+LGP
Sbjct: 208 TFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGP 267
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
IGQPKRIK TPLL NP R SLYYVN++ IRVG +VV +P AL FNP TG+GTIID+GT+
Sbjct: 268 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 327
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQD 376
FTRL AP Y AVRD FR RV + + LGGFDTCY+V + PT+T MF+G + VTLP++
Sbjct: 328 FTRLAAPVYAAVRDAFRGRVRTPV-APPLGGFDTCYNVTVSVPTVTFMFAGAVAVTLPEE 386
Query: 377 NLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
N++IHS++G + CLAMAA P D VN+ LNV+A+MQQQN R+L+DV N R+G +RELCT
Sbjct: 387 NVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 444
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 214/358 (59%), Positives = 275/358 (76%), Gaps = 8/358 (2%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--SSTVFNSAQ 140
VPIA GRQI P YI RA +GTPAQTLL+A+D SNDAAWVPC+ C GC SS F+ Q
Sbjct: 69 VPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQ 128
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTYGSSTIAANLSQDTISLATDIVPGY 197
S+T++ + C + QC QVP+P+C G +C FNLTY +ST A L QD+++L ++V Y
Sbjct: 129 SSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSY 188
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
TFGC++ +GNSVPPQGL+G GRG LS L+QT++ Y S FSYCLP++++ +FSG+L+LGP
Sbjct: 189 TFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGP 248
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
IGQPKRIK TPLL NP R SLYYVN++ IRVG +VV +P AL FNP TG+GTIID+GT+
Sbjct: 249 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 308
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQD 376
FTRL AP Y AVRD FR RV + + LGGFDTCY+V + PT+T MF+G + VTLP++
Sbjct: 309 FTRLAAPVYAAVRDAFRGRVRTPV-APPLGGFDTCYNVTVSVPTVTFMFAGAVAVTLPEE 367
Query: 377 NLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
N++IHS++G + CLAMAA P D VN+ LNV+A+MQQQN R+L+DV N R+G +RELCT
Sbjct: 368 NVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 425
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 233/435 (53%), Positives = 307/435 (70%), Gaps = 12/435 (2%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAK 64
L+ + ++L ++ ++P C +Q +S L V ++S CSPFKP K +W+ ++ M +K
Sbjct: 9 LIVIFSVMWLMRVN-AIDP-CASQPDNSDLNVIPIYSKCSPFKPPKADTWDNRIINMASK 66
Query: 65 DQARLQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
D R+++LS+L V++K+V PIASG+ Y+VR K+GTP Q L M +DTS D A+
Sbjct: 67 DPVRVKYLSTL-VSQKTVSTAPIASGQAFNIG-NYVVRVKLGTPGQLLFMVLDTSTDEAF 124
Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIA 179
VPC+GC GCS T F+ ST++ L C QC QV +C G GAC+FN +Y S+ +
Sbjct: 125 VPCSGCTGCSDTTFSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS 184
Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
A L QD + LATD++P Y+FGC+ TG SVP QGLLGLGRG LSLL+Q+ + Y FSY
Sbjct: 185 ATLVQDALRLATDVIPYYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSY 244
Query: 240 CLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
CLPSFK+ FSGSL+LGP+GQPK I+ TPLL++P R SLYYVN I VGR +V P
Sbjct: 245 CLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEY 304
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPI 357
L FNP TG+GTIIDSGTV TR V P Y AVR+ FR++VG T TS+G FDTC+ +
Sbjct: 305 LGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGT-TFTSIGAFDTCFVKTYET 363
Query: 358 VAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
+AP ITL F G+++ LP +N LIHS+AGS+ CLAMAAAPDNVNSVLNVIAN QQQN RIL
Sbjct: 364 LAPPITLHFEGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRIL 423
Query: 418 YDVPNSRLGVARELC 432
+D+ N+++G+ARE+C
Sbjct: 424 FDIVNNKVGIAREVC 438
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 235/436 (53%), Positives = 306/436 (70%), Gaps = 13/436 (2%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSK-PLSWEESVLEMLA 63
++ + ++L ++ G++P C +Q +S L V ++S CSPFKP K SW+ ++ M +
Sbjct: 9 IILIFSVIWLMRVN-GIDP-CASQADNSDLNVIPIYSKCSPFKPPKSDSSWDNRIINMAS 66
Query: 64 KDQARLQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
KD R ++LS+L V +K+V PIASG Q Y+VR K+GTP Q L M +DTS D A
Sbjct: 67 KDPLRFKYLSTL-VGQKTVSTAPIASG-QTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEA 124
Query: 122 WVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTI 178
+VPC+GC GCS T F+ ST++ L C QC QV +C G GAC+FN +Y S+
Sbjct: 125 FVPCSGCTGCSDTTFSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSF 184
Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
+A L QD++ LATD++P Y+FGC+ TG SVP QGLLGLGRG LSLL+Q+ + Y FS
Sbjct: 185 SATLVQDSLRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFS 244
Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
YCLPSFK+ FSGSL+LGP+GQPK I+ TPLL++P R SLYYVN I VGR +V P
Sbjct: 245 YCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSE 304
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVP 356
L FNP TG+GTIIDSGTV TR V P Y AVR+ FR++VG T TS+G FDTC+ +
Sbjct: 305 YLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGT-TFTSIGAFDTCFVKTYE 363
Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
+AP ITL F G+++ LP +N LIHS+AGS+ CLAMAAAPDNVNSVLNVIAN QQQN RI
Sbjct: 364 TLAPPITLHFEGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRI 423
Query: 417 LYDVPNSRLGVARELC 432
L+D N+++G+ARE+C
Sbjct: 424 LFDTVNNKVGIAREVC 439
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 234/431 (54%), Positives = 298/431 (69%), Gaps = 12/431 (2%)
Query: 9 LAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQAR 68
+ ++ S ++P C +Q S L V ++ CSPF P K SW+ V+ M +KD AR
Sbjct: 11 ICYVIYISNINAIDP-CASQPDDSDLNVIPMYGKCSPFNPPKADSWDNRVINMASKDPAR 69
Query: 69 LQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
+ +LS+L VA+K+ PIASG+ Y+VR KIGTP Q L M +DTS D A+VP +
Sbjct: 70 MSYLSTL-VAQKTATSAPIASGQTFNIG-NYVVRVKIGTPGQLLFMVLDTSTDEAFVPSS 127
Query: 127 GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIAANLS 183
GC+GCS+T F ST+F L C QC QV +C G GAC+FN +Y ST +A L
Sbjct: 128 GCIGCSATTFYPNVSTSFVPLDCSVPQCGQVRGLSCPATGSGACSFNQSYAGSTFSATLV 187
Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
QD++ LATD++P Y+FG I +G+SVP QGLLGLGRG LSLL+Q+ +Y FSYCLPS
Sbjct: 188 QDSLRLATDVIPSYSFGSINAISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCLPS 247
Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
FK+ FSGSL+LGP+GQPK I+ TPLL NP R SLYYVNL AI VGR V +P L FN
Sbjct: 248 FKSYYFSGSLKLGPVGQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSELLAFN 307
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPT 361
P+TGAGTIIDSGTV TR V P Y AVRD FR++V +SLG FDTC+ + +AP
Sbjct: 308 PSTGAGTIIDSGTVITRFVEPIYNAVRDEFRKQVTGPF--SSLGAFDTCFVKNYETLAPA 365
Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
ITL F+ +++ LP +N LIHS++GS+ CLAMAAAP NVNSVLNVIAN QQQN R+L+D
Sbjct: 366 ITLHFTDLDLKLPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDTV 425
Query: 422 NSRLGVARELC 432
N+++G+ARELC
Sbjct: 426 NNKVGIARELC 436
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 222/409 (54%), Positives = 288/409 (70%), Gaps = 24/409 (5%)
Query: 48 PSKPL-SWEESVLEMLAKDQARLQFL----------SSLAVA----RKSVVPIASGRQIT 92
P P+ +W ++ A D AR L S++ A R+S VPIA GRQ+
Sbjct: 43 PGTPVTAWAATLAAQTASDAARAATLATGPRDPPPASAVDAAKKGPRRSFVPIAPGRQLL 102
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-VFNSAQSTTFKNLGCQA 151
P+Y+ RA++GTPAQ LL+A+D SNDAAWVPC C GC+ F+ +S+T++ + C A
Sbjct: 103 SIPSYVARARLGTPAQALLVAIDPSNDAAWVPCAACAGCARAPSFDPTRSSTYRPVRCGA 162
Query: 152 AQCKQVPNPTCGGG---ACAFNLTYGSSTIAANLSQDTISLATDI--VPGYTFGCIQKAT 206
QC Q P P+C GG +CAFNL+Y +ST A L QD ++L D+ V YTFGC+ T
Sbjct: 163 PQCSQAPAPSCPGGLGSSCAFNLSYAASTFQALLGQDALALHDDVDAVAAYTFGCLHVVT 222
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY 266
G SVPPQGL+G GRG LS +QT+++Y S FSYCLPS+K+ +FSG+LRLGP GQPKRIK
Sbjct: 223 GGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFSGTLRLGPAGQPKRIKT 282
Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
TPLL NP R SLYYVN++ IRVG R V +P AL F+PT+G GTI+D+GT+FTRL AP Y
Sbjct: 283 TPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVY 342
Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAG 385
AVRDVFR RV + + LGGFDTCY+V I PT+T F G ++VTLP++N++I S++G
Sbjct: 343 AAVRDVFRSRVRAPV-AGPLGGFDTCYNVTISVPTVTFSFDGRVSVTLPEENVVIRSSSG 401
Query: 386 SITCLAMAAA-PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
I CLAMAA PD V++ LNV+A+MQQQNHR+L+DV N R+G +RELCT
Sbjct: 402 GIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELCT 450
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 230/427 (53%), Positives = 296/427 (69%), Gaps = 13/427 (3%)
Query: 14 LFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS 73
S+S +P C +Q S L V ++ CSPF P K SW+ VL M +KD AR+ +LS
Sbjct: 16 FMSMSNATDP-CASQPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74
Query: 74 SLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
SL VA+K+V PIASG+ YIVR KIGTP Q L M +DTS D A++P +GC+GC
Sbjct: 75 SL-VAQKTVSSAPIASGQAFNIG-NYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC 132
Query: 132 SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIAANLSQDTIS 188
S+T F+ ST++ L C QC QV +C G GAC+FN +Y ST +A L QD++
Sbjct: 133 SATTFSPNASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLR 192
Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
LATD++P Y+FG I +G+S+P QGLLGLGRG LSLL+QT +LY FSYCLPSFK+
Sbjct: 193 LATDVIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYY 252
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
FSGSL+LGP+GQPK I+ TPLL+NPRR SLY+VNL I VG+ V P L F+ TG+
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGS 312
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPTITLMF 366
GTIIDSGTV TR V P Y AVRD FR++V +SLG FDTC+ + +AP ITL F
Sbjct: 313 GTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPF--SSLGAFDTCFVKNYETLAPAITLHF 370
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN-SVLNVIANMQQQNHRILYDVPNSRL 425
+ +++ LP +N LIHS++GS+ CLAMA+ P NVN +VLNVIAN QQQN R+L+D N+++
Sbjct: 371 TDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNKV 430
Query: 426 GVARELC 432
G+ARELC
Sbjct: 431 GIARELC 437
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 211/417 (50%), Positives = 287/417 (68%), Gaps = 15/417 (3%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSK-PLSWEESVLEMLAKDQARLQFLSSLAVARK--SVVPI 85
D S L + + + CSPF P+ S ++VL M + D RL +LSSL + + VP+
Sbjct: 34 DGSDDLSIIPINAKCSPFAPTHVSASVIDTVLHMASSDSHRLTYLSSLVAGKPKPTSVPV 93
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT-- 143
ASG Q+ Y+VRAK+GTP Q + M +DTSNDA W+PC+GC GCS+ + +++
Sbjct: 94 ASGNQL-HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSST 152
Query: 144 FKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYG-SSTIAANLSQDTISLATDIVPGY 197
+ + C AQC Q TC + C+FN +YG S+ +A+L QDT++LA D++P +
Sbjct: 153 YSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNF 212
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
+FGCI A+GNS+PPQGL+GLGRG +SL++QT +LY FSYCLPSF++ FSGSL+LG
Sbjct: 213 SFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGL 272
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+GQPK I+YTPLL+NPRR SLYYVNL + VG V + P L F+ +GAGTIIDSGTV
Sbjct: 273 LGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTV 332
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLPQ 375
TR P Y A+RD FR++V + + ++LG FDTC+S VAP ITL + +++ LP
Sbjct: 333 ITRFAQPVYEAIRDEFRKQVNVS-SFSTLGAFDTCFSADNENVAPKITLHMTSLDLKLPM 391
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+N LIHS+AG++TCL+MA N N+VLNVIAN+QQQN RIL+DVPNSR+G+A E C
Sbjct: 392 ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 207/417 (49%), Positives = 282/417 (67%), Gaps = 16/417 (3%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSK-PLSWEESVLEMLAKDQARLQFLSSLAVARK--SVVPI 85
D S L + + + CSPF + S ++VL M + D R +LSSL + + VP+
Sbjct: 35 DGSHDLSIIPINAKCSPFAHTHVSASVIDTVLHMASSDSHRFTYLSSLVAGKSKPTSVPV 94
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT-- 143
ASG Q+ Y+VRA++GTP Q + M +DTSNDA W+PC+GC GCS+ + +++
Sbjct: 95 ASGNQL-HIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSST 153
Query: 144 FKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYG-SSTIAANLSQDTISLATDIVPGY 197
+ + C QC Q TC C+FN +YG S+ +ANL QDT++L+ D++P +
Sbjct: 154 YSTVSCSTTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTLSPDVIPNF 213
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
+FGCI A+GNS+PPQGL+GLGRG +SL++QT +LY FSYCLPSF++ FSGSL+LG
Sbjct: 214 SFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGL 273
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+GQPK I+YTPLL+NPRR SLYYVNL + VG V + P L F+ +GAGTIIDSGTV
Sbjct: 274 LGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGTV 333
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLPQ 375
TR P Y A+RD FR++V N + ++LG FDTC+S V P ITL + +++ LP
Sbjct: 334 ITRFAQPVYEAIRDEFRKQV--NGSFSTLGAFDTCFSADNENVTPKITLHMTSLDLKLPM 391
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+N LIHS+AG++TCL+MA N N+VLNVIAN+QQQN RIL+DVPNSR+G+A E C
Sbjct: 392 ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 216/407 (53%), Positives = 286/407 (70%), Gaps = 27/407 (6%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
SW + ++D +R+ +LSSLA P+ASGRQ+ +PTY+VRA +GTP Q LL+
Sbjct: 51 SWTSFIAAQTSRDTSRVLYLSSLASGFGGA-PLASGRQLLHTPTYLVRASLGTPPQRLLL 109
Query: 113 AMDTSNDAAWVPCTGCVGCSSTV--FNSAQSTTFKNLGCQAAQCKQVPNPTC-----GGG 165
A+DTSNDAAWVPC GC GC +T FN A S TF+ + C A C Q PNP+C
Sbjct: 110 AVDTSNDAAWVPCAGCHGCPTTAPSFNPASSATFRPVPCGAPPCSQAPNPSCTSLAKSKN 169
Query: 166 ACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
+C F+L+YG S++ A LSQD +++ + ++ GYTFGC+ K+ G++ P QGLLGLGRG L
Sbjct: 170 SCGFSLSYGDSSLDATLSQDNLAVTANGGVIKGYTFGCLTKSNGSAAPAQGLLGLGRGPL 229
Query: 224 SLLAQTQNLYQSTFSYCLPSF--KALSFSGSLRLGPIGQP--KRIKYTPLLKNPRRSSLY 279
+AQT+ +Y+ TFSYCLPS+ A +FSGSL LG GQP +++K TPLL +P R SLY
Sbjct: 230 GFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGRKGQPAPEKMKTTPLLASPHRPSLY 289
Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
YV + +R+G++ V IPP AL F+ TGAGT++DSGT+F RL PAY AVRD RRRV
Sbjct: 290 YVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAG 349
Query: 340 NL----------TVTSLGGFDTCYSVPIVA-PTITLMF-SGMNVTLPQDNLLIHSTAGSI 387
+L +V+SLGGFDTCY+V VA P +TL+F GM V LP++N++I ST GS
Sbjct: 350 SLRRRGGGGASVSVSSLGGFDTCYNVSTVAWPAVTLVFGGGMEVRLPEENVVIRSTYGST 409
Query: 388 TCLAMAAAP-DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+CLAMAA+P D VN+ LNVI ++QQQNHR+L+DVPN+R+G ARE CT
Sbjct: 410 SCLAMAASPADGVNAALNVIGSLQQQNHRVLFDVPNARVGFARERCT 456
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 224/419 (53%), Positives = 288/419 (68%), Gaps = 13/419 (3%)
Query: 14 LFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS 73
S+S +P C +Q S L V ++ CSPF P K SW+ VL M +KD AR+ +LS
Sbjct: 16 FMSMSNATDP-CASQPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74
Query: 74 SLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
SL VA+K+V PIASG+ YIVR KIGTP Q L M +DTS D A++P +GC+GC
Sbjct: 75 SL-VAQKTVSSAPIASGQAFNIG-NYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC 132
Query: 132 SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIAANLSQDTIS 188
S+T F+ ST++ L C QC QV +C G GAC+FN +Y ST +A L QD++
Sbjct: 133 SATTFSPNASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLR 192
Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
LATD++P Y+FG I +G+S+P QGLLGLGRG LSLL+QT +LY FSYCLPSFK+
Sbjct: 193 LATDVIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYY 252
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
FSGSL+LGP+GQPK I+ TPLL+NPRR SLY+VNL I VG+ V P L F+ TG+
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGS 312
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPTITLMF 366
GTIIDSGTV TR V P Y AVRD FR++V +SLG FDTC+ + +AP ITL F
Sbjct: 313 GTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPF--SSLGAFDTCFVKNYETLAPAITLHF 370
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN-SVLNVIANMQQQNHRILYDVPNSR 424
+ +++ LP +N LIHS++GS+ CLAMA+ P NVN +VLNVIAN QQQN R+L+D N++
Sbjct: 371 TDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNK 429
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 222/442 (50%), Positives = 291/442 (65%), Gaps = 19/442 (4%)
Query: 6 VFFLAFLFLFSLSEGLNPICDTQ--DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
+ F + L S+ N C +Q D S + + ++ CSPFK + SWE +++M +
Sbjct: 13 ILFTSMLLHLSIIAIANDPCASQHDDDDSDITMIPIYGNCSPFK-NYSTSWENIIIDMAS 71
Query: 64 KDQARLQFLSSL--AVARK--SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
KD R+ +LSSL ++ RK S PIASG+ +Y+VR K+G+P Q M +DTS D
Sbjct: 72 KDPERVVYLSSLDASLRRKPISAAPIASGQAFGIG-SYVVRVKLGSPNQLFFMVLDTSTD 130
Query: 120 AAWVPCTGCVGCSS--TVFNSAQSTTFKN-LGCQAAQCKQ----VPNPTCGGGACAFNLT 172
AWVPCTGC GCSS T ++ STT+ + C A +C Q +P P G AC FN +
Sbjct: 131 EAWVPCTGCTGCSSSSTYYSPQASTTYGGAVACYAPRCAQARGALPCPYTGSKACTFNQS 190
Query: 173 YGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
Y ST +A L QD++ L D +P Y FGC+ A+G ++P QGLLGLGRG LSL +Q+ L
Sbjct: 191 YAGSTFSATLVQDSLRLGIDTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKL 250
Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
Y FSYCLPSF++ FSGSL+LGP GQP+RI+ TPLL+NPRR SLYYVNL + VGR
Sbjct: 251 YSGIFSYCLPSFQSSYFSGSLKLGPTGQPRRIRTTPLLQNPRRPSLYYVNLTGVTVGRVK 310
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
V +P L F+P G+GTI+DSGTV TR V P Y+A+RD FR +V S GGFDTC
Sbjct: 311 VPLPIEYLAFDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQVKGPF--FSRGGFDTC 368
Query: 353 Y--SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
+ + + P I L F+G++VTLP +N LIH+ G + CLAMAAAP+NVNSVLNVIAN Q
Sbjct: 369 FVKTYENLTPLIKLRFTGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQ 428
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQN R+L+D N+R+G+ARELC
Sbjct: 429 QQNLRVLFDTVNNRVGIARELC 450
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 207/377 (54%), Positives = 270/377 (71%), Gaps = 22/377 (5%)
Query: 78 ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SS 133
+R + VPIA+GRQI ++P+Y+ RA++GTP QTLL+A+D SNDAAWVPC+ C+GC SS
Sbjct: 81 SRHTFVPIAAGRQILRTPSYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASS 140
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVP--NPTCGGG---ACAFNLTYGSSTIAANLSQDTIS 188
F+ QS+T++ + C A QC QVP P+C G +CAFNL+Y SST+ A L QD +S
Sbjct: 141 PSFDPTQSSTYRPVRCGAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHAVLGQDALS 200
Query: 189 LATD---IVPG--YTFGCIQKATGN--SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
L+ VP YTFGC++ TG+ SVPPQGL+G GRG LS L+QT+ Y S FSYCL
Sbjct: 201 LSDSNGAAVPDDHYTFGCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCL 260
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
PS+K+ +FSG+LRLGP GQP+RIK TPLL NP R SLYYV ++ +RV + V IP AL
Sbjct: 261 PSYKSSNFSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALA 320
Query: 302 FNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIV 358
+ TG GTI+D+GT+FTRL PAY A+R+ FRR V S +LGGFDTCY V
Sbjct: 321 LDAATGRGGTIVDAGTMFTRLSPPAYAALRNAFRRGV-SAPAAPALGGFDTCYYVNGTKS 379
Query: 359 APTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHRI 416
P + +F+ G VTLP++N++I ST+G + CLAMAA P D VN+ LNV+A+MQQQNHR+
Sbjct: 380 VPAVAFVFAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRV 439
Query: 417 LYDVPNSRLGVARELCT 433
++DV N R+G +RELCT
Sbjct: 440 VFDVGNGRVGFSRELCT 456
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 197/376 (52%), Positives = 266/376 (70%), Gaps = 14/376 (3%)
Query: 69 LQFLSSLAVARK--SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
L +LSSL + + VP+ASG Q+ Y+VRAK+GTP Q + M +DTSNDA W+PC+
Sbjct: 1 LTYLSSLVAGKPKPTSVPVASGNQL-HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCS 59
Query: 127 GCVGCSSTVFNSAQSTT--FKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYG-SSTI 178
GC GCS+ + +++ + + C AQC Q TC + C+FN +YG S+
Sbjct: 60 GCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSF 119
Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
+A+L QDT++LA D++P ++FGCI A+GNS+PPQGL+GLGRG +SL++QT +LY FS
Sbjct: 120 SASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFS 179
Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
YCLPSF++ FSGSL+LG +GQPK I+YTPLL+NPRR SLYYVNL + VG V + P
Sbjct: 180 YCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPV 239
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--P 356
L F+ +GAGTIIDSGTV TR P Y A+RD FR++V + + ++LG FDTC+S
Sbjct: 240 YLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVS-SFSTLGAFDTCFSADNE 298
Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
VAP ITL + +++ LP +N LIHS+AG++TCL+MA N N+VLNVIAN+QQQN RI
Sbjct: 299 NVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRI 358
Query: 417 LYDVPNSRLGVARELC 432
L+DVPNSR+G+A E C
Sbjct: 359 LFDVPNSRIGIAPEPC 374
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 190/341 (55%), Positives = 242/341 (70%), Gaps = 8/341 (2%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
SW +V+ M +KD RL++LS+LA + + VPIA G+Q+ + Y+VR K+GTP Q + M
Sbjct: 1 SWVNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFM 60
Query: 113 AMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAF 169
+DTSNDAAWVPC+GC GCSST F STT +L C AQC QV +C G AC F
Sbjct: 61 VLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLF 120
Query: 170 NLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
N +YG S++AA L QD I+LA D++PG+TFGCI +G S+PPQGLLGLGRG +SL++Q
Sbjct: 121 NQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQ 180
Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
+Y FSYCLPSFK+ FSGSL+LGP+GQPK I+ TPLL+NP R SLYYVNL + V
Sbjct: 181 AGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSV 240
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
GR V IP L F+P TGAGTIIDSGTV TR V P Y A+RD FR++V N ++SLG
Sbjct: 241 GRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NGPISSLGA 298
Query: 349 FDTCYSV--PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
FDTC++ AP +TL F G+N+ LP +N LIHS++GS+
Sbjct: 299 FDTCFAATNEAEAPAVTLHFEGLNLVLPMENSLIHSSSGSV 339
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/341 (55%), Positives = 242/341 (70%), Gaps = 8/341 (2%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
SW +V+ M +KD RL++LS+LA + + VPIA G+Q+ + Y+VR K+GTP Q + M
Sbjct: 1 SWVNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFM 60
Query: 113 AMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAF 169
+DTSNDAAWVPC+GC GCSST F STT +L C AQC QV +C G AC F
Sbjct: 61 VLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLF 120
Query: 170 NLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
N +YG S++AA L QD I+LA D++PG+TFGCI +G S+PPQGLLGLGRG +SL++Q
Sbjct: 121 NQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQ 180
Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
+Y FSYCLPSFK+ FSGSL+LGP+GQPK I+ TPLL+NP R SLYYVNL + V
Sbjct: 181 AGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSV 240
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
GR V IP L F+P TGAGTIIDSGTV TR V P Y A+RD FR++V N ++SLG
Sbjct: 241 GRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NGPISSLGA 298
Query: 349 FDTCYSV--PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
FDTC++ AP +TL F G+N+ LP +N LIHS++GS+
Sbjct: 299 FDTCFAETNEAEAPAVTLHFEGLNLVLPMENSLIHSSSGSV 339
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 203/423 (47%), Positives = 276/423 (65%), Gaps = 34/423 (8%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-LAVARKSVVPIASGRQIT 92
L V+H P SP PL ES++ + D ARL FLSS A A S P+ASG+
Sbjct: 27 LSVYHNVHPSSP----SPL---ESIIALARDDDARLLFLSSKAATAGVSSAPVASGQA-- 77
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQA 151
P+Y+VRA +G+P+Q LL+A+DTS DA W C+ C C SS++F A S+++ +L C +
Sbjct: 78 -PPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSS 136
Query: 152 AQC-----KQVPNPTCGGGA---------CAFNLTYGSSTIAANLSQDTISLATDIVPGY 197
+ C + P P GG A CAF+ + ++ A L+ DT+ L D +P Y
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGKDAIPNY 196
Query: 198 TFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
TFGC+ TG ++P QGLLGLGRG ++LL+Q +LY FSYCLPS+++ FSGSLRL
Sbjct: 197 TFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRL 256
Query: 256 GPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
G G QP+ ++YTP+L+NP RSSLYYVN+ + VGR V +P G+ F+ TGAGT++DS
Sbjct: 257 GAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGAGTVVDS 316
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGM 369
GTV TR AP Y A+R+ FRR+V + TSLG FDTC++ V AP +T+ M G+
Sbjct: 317 GTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGV 376
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
++ LP +N LIHS+A + CLAMA AP NVNSV+NVIAN+QQQN R+++DV NSR+G A+
Sbjct: 377 DLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRIGFAK 436
Query: 430 ELC 432
E C
Sbjct: 437 ESC 439
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 201/421 (47%), Positives = 274/421 (65%), Gaps = 34/421 (8%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-LAVARKSVVPIASGRQITQS 94
V+H P SP PL ES++ + D ARL FLSS A A S P+ASG+
Sbjct: 27 VYHNVHPSSP----SPL---ESIIALARDDDARLLFLSSKAATAGVSSAPVASGQA---P 76
Query: 95 PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQAAQ 153
P+Y+VRA +G+P+Q LL+A+DTS DA W C+ C C SS++F A S+++ +L C ++
Sbjct: 77 PSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSW 136
Query: 154 C-----KQVPNPTCGGGA---------CAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
C + P P GG A CAF+ + ++ A L+ DT+ L D +P YTF
Sbjct: 137 CPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGKDAIPNYTF 196
Query: 200 GCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
GC+ TG ++P QGLLGLGRG ++LL+Q +LY FSYCLPS+++ FSGSLRLG
Sbjct: 197 GCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGA 256
Query: 258 IG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
G QP+ ++YTP+L+NP RSSLYYVN+ + VG V +P G+ F+ TGAGT++DSGT
Sbjct: 257 GGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGT 316
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNV 371
V TR AP Y A+R+ FRR+V + TSLG FDTC++ V AP +T+ M G+++
Sbjct: 317 VITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDL 376
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
LP +N LIHS+A + CLAMA AP NVNSV+NVIAN+QQQN R+++DV NSR+G A+E
Sbjct: 377 ALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKES 436
Query: 432 C 432
C
Sbjct: 437 C 437
>gi|217073884|gb|ACJ85302.1| unknown [Medicago truncatula]
Length = 259
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 190/244 (77%), Positives = 211/244 (86%), Gaps = 1/244 (0%)
Query: 19 EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
+GLNP CD QD+ STLQV HVFSPCSPF+PSKPLSWEESVL+M AKD RLQFL SL VA
Sbjct: 16 QGLNPKCDVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSL-VA 74
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS 138
RKS+VPIASGRQI QSPTYIVRAKIGTP QTLL+AMDTSNDAAW+PCT C GC+ST+F
Sbjct: 75 RKSIVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAP 134
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
+STTFKN+ C A +CKQVPNP CG +C FNLTYGSS+IAANL QDTI+LATD VP YT
Sbjct: 135 EKSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSSIAANLVQDTITLATDPVPSYT 194
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
FGC+ K TG S PPQGLLGLGRG LSLL+QTQNLYQSTFSYCLPSFK+L+FSGSLRLGP+
Sbjct: 195 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPV 254
Query: 259 GQPK 262
P+
Sbjct: 255 AHPE 258
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 192/358 (53%), Positives = 242/358 (67%), Gaps = 30/358 (8%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--SSTVFNSAQ 140
VPIA GRQI P YI RA +GTPAQTLL+A+D SNDAAWVPC+ C GC SS F+ Q
Sbjct: 88 VPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQ 147
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTYGSSTIAANLSQDTISLATDIVPGY 197
S+T++ + C + QC QVP+P+C G +C FNLTY +ST A L QD+++L ++V Y
Sbjct: 148 SSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSY 207
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
TFGC++ GNS G L + LL Q LGP
Sbjct: 208 TFGCLRVVNGNSRAAAGAHRLRPRAALLLVADQG----------------------HLGP 245
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
IGQPKRIK TPLL NP R SLYYVN++ IRVG +VV +P AL FNP TG+GTIID+GT+
Sbjct: 246 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 305
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQD 376
FTRL AP Y AVRD FR RV + + LGGFDTCY+V + PT+T MF+G + VTLP++
Sbjct: 306 FTRLAAPVYAAVRDAFRGRVRTPV-APPLGGFDTCYNVTVSVPTVTFMFAGAVAVTLPEE 364
Query: 377 NLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
N++IHS++G + CLAMAA P D VN+ LNV+A+MQQQN R+L+DV N R+G +RELCT
Sbjct: 365 NVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 422
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 367 bits (941), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 195/417 (46%), Positives = 266/417 (63%), Gaps = 30/417 (7%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK-SVVPIASGRQITQS 94
V+H P S S PL ES++ + +D ARL FLSS A + S P+ASG+
Sbjct: 25 VYHNVHPPS----SSPL---ESIIALAREDDARLLFLSSKAASTGVSSAPVASGQS---P 74
Query: 95 PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--SSTVFNSAQSTTFKNLGCQAA 152
P+Y+VRA +G+PAQ +L+A+DTS DA W C+ C C S ++F A ST++ L C +
Sbjct: 75 PSYVVRAGLGSPAQPILLALDTSADATWAHCSPCGTCPSSGSLFAPANSTSYAPLPCSST 134
Query: 153 QCKQVPNPTCGGG----------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI 202
C + C CAF + ++ A+L+ D + L D +P Y FGC+
Sbjct: 135 MCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASDWLHLGKDAIPNYAFGCV 194
Query: 203 QKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
+G ++P QGLLGLGRG ++LL+Q N+Y FSYCLPS+K+ FSGSLRLG GQ
Sbjct: 195 SAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAAGQ 254
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
P+ ++YTP+LKNP RSSLYYVN+ + VGR V +P G+ F+P TGAGT++DSGTV TR
Sbjct: 255 PRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITR 314
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI----VAPTITL-MFSGMNVTLPQ 375
P Y A+R+ FRR V + TSLG FDTC++ VAP +T+ M G+++ LP
Sbjct: 315 WTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPAVTVHMDGGLDLALPM 374
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+N LIHS+A + CLAMA AP NVN+V+NV+AN+QQQN R+++DV NSR+G ARE C
Sbjct: 375 ENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFARESC 431
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 344 bits (882), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 199/420 (47%), Positives = 265/420 (63%), Gaps = 28/420 (6%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--VPIASG 88
++ L V+H P SP PL ES++ + D ARL FLSS A + V P+ASG
Sbjct: 21 AADLSVYHNVHPPSP----SPL---ESIIALARADDARLLFLSSKAASSGGVTSAPVASG 73
Query: 89 RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNL 147
+ P+Y+VRA +GTP Q LL+A+DTS DA W C C C + + F A S+++ +L
Sbjct: 74 QT---PPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASL 130
Query: 148 GCQAAQCKQVPNPTCGGG--------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
C + C C ACAF+ + ++ A+L DT+ L D + GY F
Sbjct: 131 PCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRLGKDAIAGYAF 190
Query: 200 GCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
GC+ G ++P QGLLGLGRG +SLL+QT + Y FSYCLPS+++ FSGSLRLG
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRLGA 250
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
GQP+ ++YTPLL NP R SLYYVN+ + VGR V +P G+ F+P TGAGT+IDSGTV
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNVT 372
TR AP Y A+R+ FRR+V + TSLG FDTC++ V AP +TL M G+++T
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP +N LIHS+A + CLAMA AP NVN+V+NV+AN+QQQN R++ DV SR+G ARE C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 343 bits (881), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 199/420 (47%), Positives = 265/420 (63%), Gaps = 28/420 (6%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--VPIASG 88
++ L V+H P SP PL ES++ + D ARL FLSS A + V P+ASG
Sbjct: 21 AADLSVYHNVHPPSP----SPL---ESIIALARADDARLLFLSSKAASSGGVTSAPVASG 73
Query: 89 RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNL 147
+ P+Y+VRA +GTP Q LL+A+DTS DA W C C C + + F A S+++ +L
Sbjct: 74 QT---PPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASL 130
Query: 148 GCQAAQCKQVPNPTCGGG--------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
C + C C ACAF+ + ++ A+L DT+ L D + GY F
Sbjct: 131 PCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRLGKDAIAGYAF 190
Query: 200 GCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
GC+ G ++P QGLLGLGRG +SLL+QT + Y FSYCLPS+++ FSGSLRLG
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGA 250
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
GQP+ ++YTPLL NP R SLYYVN+ + VGR V +P G+ F+P TGAGT+IDSGTV
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNVT 372
TR AP Y A+R+ FRR+V + TSLG FDTC++ V AP +TL M G+++T
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP +N LIHS+A + CLAMA AP NVN+V+NV+AN+QQQN R++ DV SR+G ARE C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 198/420 (47%), Positives = 265/420 (63%), Gaps = 28/420 (6%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--VPIASG 88
++ L V+H P SP PL ES++ + D ARL FLSS A + + P+ASG
Sbjct: 21 AADLSVYHNVHPPSP----SPL---ESIIALARADDARLLFLSSKAASSGGITSAPVASG 73
Query: 89 RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNL 147
+ P+Y+VRA +GTP Q LL+A+DTS DA W C C C + + F A S+++ +L
Sbjct: 74 QT---PPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASL 130
Query: 148 GCQAAQCKQVPNPTCGGG--------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
C + C C ACAF+ + ++ A+L DT+ L D + GY F
Sbjct: 131 PCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRLGKDAIAGYAF 190
Query: 200 GCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
GC+ G ++P QGLLGLGRG +SLL+QT + Y FSYCLPS+++ FSGSLRLG
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGA 250
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
GQP+ ++YTPLL NP R SLYYVN+ + VGR V +P G+ F+P TGAGT+IDSGTV
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNVT 372
TR AP Y A+R+ FRR+V + TSLG FDTC++ V AP +TL M G+++T
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP +N LIHS+A + CLAMA AP NVN+V+NV+AN+QQQN R++ DV SR+G ARE C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 304 bits (779), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 153/271 (56%), Positives = 197/271 (72%), Gaps = 5/271 (1%)
Query: 167 CAFNLTYGSSTIAANLSQDTISLA--TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
C + Y A L QD ++L D+V YTFGC++ TG SVPPQGL+G G G LS
Sbjct: 328 CIIGMIYAYFHPNALLGQDALALHDDVDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLS 387
Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLL 284
+Q +++Y FSYCLPS+K+ +FS +LRLGP GQPKRIK TPLL NP R SLYYVN++
Sbjct: 388 FPSQNKDVYGFVFSYCLPSYKSSNFSSTLRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMV 447
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
I VG R + +P AL F+P +G GTI+D+GT+FTRL AP Y AVRDVFR RV + +T
Sbjct: 448 GIHVGGRPMLVPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVT-G 506
Query: 345 SLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAP-DNVNSV 402
LGGFDTCY+V I PT+T F G ++VTLP++N++I S++ I CLAMAA P D V++V
Sbjct: 507 PLGGFDTCYNVTISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAV 566
Query: 403 LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
LNV+A+MQQQNHR+L+DV N R+G +RELCT
Sbjct: 567 LNVLASMQQQNHRVLFDVANGRVGFSRELCT 597
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 304 bits (779), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 153/271 (56%), Positives = 197/271 (72%), Gaps = 5/271 (1%)
Query: 167 CAFNLTYGSSTIAANLSQDTISLA--TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
C + Y A L QD ++L D+V YTFGC++ TG SVPPQGL+G G G LS
Sbjct: 267 CIIGMIYAYFHPNALLGQDALALHDDVDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLS 326
Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLL 284
+Q +++Y FSYCLPS+K+ +FS +LRLGP GQPKRIK TPLL NP R SLYYVN++
Sbjct: 327 FPSQNKDVYGFVFSYCLPSYKSSNFSSTLRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMV 386
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
I VG R + +P AL F+P +G GTI+D+GT+FTRL AP Y AVRDVFR RV + +T
Sbjct: 387 GIHVGGRPMLVPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVT-G 445
Query: 345 SLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAP-DNVNSV 402
LGGFDTCY+V I PT+T F G ++VTLP++N++I S++ I CLAMAA P D V++V
Sbjct: 446 PLGGFDTCYNVTISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAV 505
Query: 403 LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
LNV+A+MQQQNHR+L+DV N R+G +RELCT
Sbjct: 506 LNVLASMQQQNHRVLFDVANGRVGFSRELCT 536
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 152/258 (58%), Positives = 194/258 (75%), Gaps = 5/258 (1%)
Query: 180 ANLSQDTISLATDI--VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
A L QD ++L D+ + YTFGC+ TG SVP QGL+G RG LS +Q +N+Y S F
Sbjct: 308 ALLGQDALALHDDVDAIAAYTFGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVF 367
Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
SYCLPS+K+ +FSG+LRLGP GQPKRIK TPLL NP R SLYYVN++ IRVG R V +P
Sbjct: 368 SYCLPSYKSSNFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVAVPA 427
Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI 357
AL F+P +G GTI+D+GT+FTRL AP Y AV DVFR RV + + LGGFDTCY+V I
Sbjct: 428 SALAFDPASGHGTIVDAGTMFTRLSAPVYAAVCDVFRSRVRAPVA-GPLGGFDTCYNVTI 486
Query: 358 VAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAP-DNVNSVLNVIANMQQQNHR 415
PT+T +F G ++VTLP++N++I S+ I CLAMAA P D+V++VLNV+A+MQQQNHR
Sbjct: 487 SVPTVTFLFDGRVSVTLPEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHR 546
Query: 416 ILYDVPNSRLGVARELCT 433
+L+DV N R+G +RELCT
Sbjct: 547 VLFDVANGRVGFSRELCT 564
>gi|255647724|gb|ACU24323.1| unknown [Glycine max]
Length = 334
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 165/326 (50%), Positives = 217/326 (66%), Gaps = 10/326 (3%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSK-PLSWEESVLEMLA 63
++ + ++L + G++P C +Q +S L V ++S CSPFKP K SW+ ++ M +
Sbjct: 9 IILIFSVIWLMRV-NGIDP-CASQADNSDLNVIPIYSKCSPFKPPKSDSSWDNRIINMAS 66
Query: 64 KDQARLQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
KD R ++LS+L V +K+V PIASG+ Y+VR K+GTP Q L M +DTS D A
Sbjct: 67 KDPLRFKYLSTL-VGQKTVSTAPIASGQTFNIG-NYVVRVKLGTPGQLLFMVLDTSTDEA 124
Query: 122 WVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTI 178
+VPC+GC GCS F+ ST++ L C QC QV +C G GAC+FN +Y S+
Sbjct: 125 FVPCSGCTGCSDATFSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSF 184
Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
+A L QD++ LATD++P Y+FGC+ TG SVP QGLLGLGRG LSLL+Q+ + Y FS
Sbjct: 185 SATLVQDSLRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFS 244
Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
YCLPSFK+ FSGSL+L P+GQPK I+ TPLL++P R SLYYVN I VGR +V P
Sbjct: 245 YCLPSFKSYYFSGSLKLRPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSE 304
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAP 324
L FNP TG+GTIIDSGTV TR V P
Sbjct: 305 YLGFNPNTGSGTIIDSGTVITRFVEP 330
>gi|242044812|ref|XP_002460277.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
gi|241923654|gb|EER96798.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
Length = 369
Score = 281 bits (718), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 140/197 (71%), Positives = 166/197 (84%), Gaps = 5/197 (2%)
Query: 240 CLPSFKALSFSGS--LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
CLPSFK+L+FSGS LRLG GQP+RIK TPLL NP RSSLYYVN+ IRVGR+VV IPP
Sbjct: 173 CLPSFKSLNFSGSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPP 232
Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI 357
AL F+P TGAGT++DSGT+FTRLVAPAY AVRD RRRVG+ V+SLGGFDTC++
Sbjct: 233 PALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGA--PVSSLGGFDTCFNTTA 290
Query: 358 VA-PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
VA P +TL+F GM VTLP++N++IHST G+I+CLAMAAAPD VN+VLNVIA+MQQQNHR+
Sbjct: 291 VAWPPVTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRV 350
Query: 417 LYDVPNSRLGVARELCT 433
L+DVPN R+G ARE CT
Sbjct: 351 LFDVPNGRVGFARERCT 367
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 173/424 (40%), Positives = 239/424 (56%), Gaps = 24/424 (5%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---------SSLAVARK 80
+S+L V H+ CSPF+ SW +V E + D AR + + + +
Sbjct: 50 ETSSLSVMHIQGKCSPFRLLNS-SWWTAVSESIKGDTARYRAMVKGGWSAGKTMVNPQED 108
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--FNS 138
+ +P+ASG+ I+ S YI++ GTP Q+ +DT ++ AW+PC C GCSS F
Sbjct: 109 ADIPLASGQAISSS-NYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQPFEP 167
Query: 139 AQSTTFKNLGCQAAQCK--QVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVP 195
++S+T+ L C + QC+ +V + C+ YG S + LS +T+S+ + V
Sbjct: 168 SKSSTYNYLTCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQQVE 227
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
+ FGC A G L+G GR LS ++QT LY STFSYCLPS + +F+GSL L
Sbjct: 228 NFVFGCSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGSLLL 287
Query: 256 GPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
G + +K+TPLL N R S YYV L I VG +V IP G L + +TG GTIIDS
Sbjct: 288 GKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDS 347
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSVP---IVAPTITLMF-SGM 369
GTV TRLV PAY A+RD FR ++ SNLT+ S FDTCY+ P + P ITL F +
Sbjct: 348 GTVITRLVEPAYNAMRDSFRSQL-SNLTMASPTDLFDTCYNRPSGDVEFPLITLHFDDNL 406
Query: 370 NVTLPQDNLLI-HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
++TLP DN+L + GS+ CLA P + VL+ N QQQ RI++DV SRLG+A
Sbjct: 407 DLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIA 466
Query: 429 RELC 432
E C
Sbjct: 467 SENC 470
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 184/419 (43%), Positives = 242/419 (57%), Gaps = 58/419 (13%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV---PIASGRQIT 92
V+H P SP PL ES++ + D ARL FLSS A + V P+ASG+
Sbjct: 25 VYHNVHPPSP----SPL---ESIIALARADDARLLFLSSKAASSSGGVTSAPVASGQT-- 75
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-SSTVFNSAQSTTFKNLGCQA 151
P+Y+VRA +GTP Q LL+A+DTS DA W C C C + + F A S+++ +L C +
Sbjct: 76 -PPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASLPCAS 134
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
C P G + A D+ +Q A+ P
Sbjct: 135 DWCPLFRRPAVPG------------------EPGRVGAAADVR------LLQAAS--RTP 168
Query: 212 PQGLLGLGR-------------GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
G+L R G +SLL+QT + Y FSYCLPS+++ FSGSLRLG
Sbjct: 169 RSGVLAATRCGWARTPSPATRSGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAA 228
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
GQP+ ++YTPLL NP R SLYYVN+ + VGR +V P G+ F+P+TGAGT+IDSGTV
Sbjct: 229 GQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVI 288
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITL-MFSGMNVTL 373
TR AP Y A+RD FRR+V + TSLG FDTC++ V AP +TL M G+++TL
Sbjct: 289 TRWTAPVYAALRDEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTL 348
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
P +N LIHS+A + CLAMA AP NVNSV+NV+AN+QQQN R++ DV SR+G ARE C
Sbjct: 349 PMENTLIHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 170/455 (37%), Positives = 254/455 (55%), Gaps = 46/455 (10%)
Query: 5 LVFFLAFLFLFSLSE---GLNPICDTQD-----------HSSTLQVFHVFSPCSPFKPSK 50
L+ LA F+ ++E GLN C + D HS + + H++S CSPF+P
Sbjct: 13 LILSLAITFMCGVAEIAPGLN--CRSSDKILNRKVGKRSHSVSFPLIHIYSECSPFRPPN 70
Query: 51 PLSWEESVLEMLAKDQARLQFLSSLAVARK----SVVPIASGRQITQSPTYIVRAKIGTP 106
+WE + E + D RL+FL + + K + VP+ SG S YI++ GTP
Sbjct: 71 -RTWESLMSEKIRGDANRLRFLKRTSRSSKQDANANVPVRSG-----SGEYIIQVDFGTP 124
Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
Q++ +DT +D AW+PC C GC ST +F+ A+S+++K C + C+++ G
Sbjct: 125 KQSMYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQEISGNCGGN 184
Query: 165 GACAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
C F ++YG T + L+ D I+L + +P ++FGC + + ++ P GL+GLG GSL
Sbjct: 185 SKCQFEVSYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTSPSPGLMGLGGGSL 244
Query: 224 SLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYV 281
SLL Q T L+ TFSYCLPS S S L +K+T L+K+P + Y+V
Sbjct: 245 SLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPSIPTFYFV 304
Query: 282 NLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL 341
L AI VG + +P N +G GTIIDSGT T LV AYTA+RD FR+++ S+L
Sbjct: 305 TLKAISVGNTRISVP----GTNIASGGGTIIDSGTTITHLVPSAYTALRDAFRQQL-SSL 359
Query: 342 TVTSLGGFDTCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
T + DTCY S + PTITL +++ LP++N+LI +G + CLA ++
Sbjct: 360 QPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILITQESG-LACLAFSSTDS 418
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++I N+QQQN RI++DVPNS++G A+E C
Sbjct: 419 R-----SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 163/455 (35%), Positives = 245/455 (53%), Gaps = 46/455 (10%)
Query: 5 LVFFLAFLFLFSLSE---GLNPICDTQD-----------HSSTLQVFHVFSPCSPFKPSK 50
L+ LA F+ ++E GLN C + D HS + + H++S CSPF+P
Sbjct: 13 LILSLAITFMCGVAEIAPGLN--CRSSDKILNRKVGKRSHSVSFPLIHIYSECSPFRPPN 70
Query: 51 PLSWEESVLEMLAKDQARLQFLSSLAVARK----SVVPIASGRQITQSPTYIVRAKIGTP 106
+WE + E + D RL+FL + + K + VP+ SG S YI++ GTP
Sbjct: 71 -RTWESLMSEKIRGDANRLRFLKRTSRSSKEDANANVPVRSG-----SGEYIIQVDFGTP 124
Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
Q++ +DT +D AW+PC C GC ST +F+ A+S+++K C + C+++ G
Sbjct: 125 KQSMYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQEISGNCGGN 184
Query: 165 GACAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKATGN--SVPPQGLLGLGRG 221
C F + YG T + L+ D I+L + +P ++FGC + + + S P LG G
Sbjct: 185 SKCQFEVLYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTYSSPGLMGLGGGSL 244
Query: 222 SLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYV 281
SL A T L+ TFSYCLPS S S L +K+T L+K+P + Y+V
Sbjct: 245 SLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPSFPTFYFV 304
Query: 282 NLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL 341
L AI VG + +P N +G GTIIDSGT T LV AY +RD FR+++ S+L
Sbjct: 305 TLKAISVGNTRISVPAT----NIASGGGTIIDSGTTITYLVPSAYKDLRDAFRQQL-SSL 359
Query: 342 TVTSLGGFDTCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
T + DTCY S + PTITL +++ LP++N+LI +G ++CLA ++
Sbjct: 360 QPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILITQESG-LSCLAFSSTDS 418
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++I N+QQQN RI++DVPNS++G A+E C
Sbjct: 419 R-----SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|222624328|gb|EEE58460.1| hypothetical protein OsJ_09701 [Oryza sativa Japonica Group]
Length = 360
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 160/405 (39%), Positives = 213/405 (52%), Gaps = 80/405 (19%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-LAVARKSVVPIASGRQITQS 94
V+H P SP PL ES++ + D ARL FLSS A A S P+ASG+
Sbjct: 27 VYHNVHPSSP----SPL---ESIIALARDDDARLLFLSSKAATAGVSSAPVASGQA---P 76
Query: 95 PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC 154
P+Y+VRA +G+P+Q LL+A+DTS DA G A T
Sbjct: 77 PSYVVRAGLGSPSQQLLLALDTSADATARRARRRRGGGDAAPPPATLPT----------- 125
Query: 155 KQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATG--NSVPP 212
CAF+ + ++ A L+ DT+ L D +P YTFGC+ TG ++P
Sbjct: 126 ------------CAFSKPFADASFQAALASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPR 173
Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKN 272
QGLLGLGRG ++LL+Q +LY RL PLL
Sbjct: 174 QGLLGLGRGPMALLSQAGSLYNG------------------RL------------PLLPP 203
Query: 273 PRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
L I + R P G+ F+ TGAGT++DSGTV TR AP Y A+R+
Sbjct: 204 ---------ELQVILLLRACSGFPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREE 254
Query: 333 FRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITL-MFSGMNVTLPQDNLLIHSTAGSI 387
FRR+V + TSLG FDTC++ VA P +T+ M G+++ LP +N LIHS+A +
Sbjct: 255 FRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPL 314
Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLAMA AP NVNSV+NVIAN+QQQN R+++DV NSR+G A+E C
Sbjct: 315 ACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESC 359
>gi|356551755|ref|XP_003544239.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 249
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 128/257 (49%), Positives = 163/257 (63%), Gaps = 16/257 (6%)
Query: 176 STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS 235
ST +A L QD++ L D +P Y F C+ A+G ++P Q L L + S
Sbjct: 8 STFSATLVQDSLRLGIDTLPSYAFRCVNSASGWTLPAQPGLL----GLGRGPLSLPSQSS 63
Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
SYCLPSF++ FSGSL+LGP GQP+RI+ TPLL+NP+R SLYYVNL I VGR V +
Sbjct: 64 XLSYCLPSFQSSYFSGSLKLGPTGQPRRIRTTPLLRNPQRPSLYYVNLTGINVGRVRVSL 123
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
P L F+P G+GTIIDSGTV TR V P Y A+RD FR +V V +
Sbjct: 124 PTDYLAFDPNKGSGTIIDSGTVITRFVXPVYNAIRDEFRYQVKGPCFVKTYEN------- 176
Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
+AP I L F+G++VTLP +N LIH+ G + CLAMAAAP+NVNS L N QQQN R
Sbjct: 177 --LAPLIKLRFTGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSAL---TNFQQQNLR 231
Query: 416 ILYDVPNSRLGVARELC 432
+L+D N+R+G+ARELC
Sbjct: 232 VLFDTVNNRVGIARELC 248
>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
Length = 216
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 121/215 (56%), Positives = 156/215 (72%), Gaps = 5/215 (2%)
Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVN 282
+SLL+QT + Y FSYCLPS+++ FSGSLRLG GQP+ ++YTPLL NP R SLYYVN
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVN 60
Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
+ + VGR V +P G+ F+P TGAGT+IDSGTV TR AP Y A+R+ FRR+V +
Sbjct: 61 VTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSG 120
Query: 343 VTSLGGFDTCYSVPIV----APTITL-MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
TSLG FDTC++ V AP +TL M G+++TLP +N LIHS+A + CLAMA AP
Sbjct: 121 YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQ 180
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
NVN+V+NV+AN+QQQN R++ DV SR+G ARE C
Sbjct: 181 NVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215
>gi|147776519|emb|CAN74010.1| hypothetical protein VITISV_003547 [Vitis vinifera]
Length = 429
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 108/178 (60%), Positives = 136/178 (76%), Gaps = 4/178 (2%)
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
P+GQPK I+ TPLL+NP R +LYYVNL + VGR +V + P L F+P TGAGTIIDSGT
Sbjct: 253 PLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGT 312
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLP 374
V TR V P Y A+RD FR++V ++G FDTC++ +AP +T F+GM++ LP
Sbjct: 313 VITRFVEPVYAAIRDEFRKQVKGPFA--TIGAFDTCFAATNEDIAPPVTFHFTGMDLKLP 370
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+N LIHS+AGS+ CLAMAAAP+NVNSVLNVIAN+QQQN RI++DV NSRLG+ARELC
Sbjct: 371 LENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 428
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 65/109 (59%), Gaps = 6/109 (5%)
Query: 7 FFLAFLFL---FSLSEGLNPICD--TQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEM 61
F AF+FL S ++ +P ++ S L V HV+ CSPF K SW +V+ M
Sbjct: 3 IFTAFVFLTLVVSTTKAFDPCASPSSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINM 62
Query: 62 LAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQT 109
+KD AR+ +LSSL + K+ VPIASG+Q+ Y+VR K+GTPA+T
Sbjct: 63 ASKDPARVTYLSSLVASPKATSVPIASGQQVLNIGNYVVRVKLGTPAET 111
>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
Length = 216
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 120/215 (55%), Positives = 156/215 (72%), Gaps = 5/215 (2%)
Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVN 282
+SLL+QT + Y FSYCLPS+++ FSGSLRLG GQP+ +++TPLL NP R SLYYVN
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRHTPLLTNPHRPSLYYVN 60
Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
+ + VGR V +P G+ F+P TGAGT+IDSGTV TR AP Y A+R+ FRR+V +
Sbjct: 61 VTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSG 120
Query: 343 VTSLGGFDTCYSVPIV----APTITL-MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
TSLG FDTC++ V AP +TL M G+++TLP +N LIHS+A + CLAMA AP
Sbjct: 121 YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQ 180
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
NVN+V+NV+AN+QQQN R++ DV SR+G ARE C
Sbjct: 181 NVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 141/413 (34%), Positives = 211/413 (51%), Gaps = 22/413 (5%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL---AVARKSVVPIASGRQ 90
+++ H+ CSP +P SW + V + +D RL + S + S +P+ G +
Sbjct: 73 IRLDHIHGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSNLPLQPGSK 132
Query: 91 ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
+ + YIV A GTPA+ L+ +DT +D W+ C C C S V F QS+++K+L
Sbjct: 133 V-GTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHL 191
Query: 148 GCQAAQCKQVPNPT-CGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKA 205
C ++ C ++ C G C + + YG S + SQ+T++L +D P + FGC
Sbjct: 192 SCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDSFPSFAFGCGHTN 251
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
TG GLLGLGR +LS +QT++ Y FSYCLP F + + +GS +G P
Sbjct: 252 TGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSVGQGSIPATAT 311
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
+ PL+ N S Y+V L I VG + IPP L GTI+DSGTV TRLV A
Sbjct: 312 FVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLG-----RGGTIVDSGTVITRLVPQA 366
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF-SGMNVTLPQDNLLI 380
Y A++ FR + + + DTCY + + PTIT F + +V + +L
Sbjct: 367 YDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNNADVAVSAVGILF 426
Query: 381 H-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ GS CLA A+A ++++ N+I N QQQ R+ +D R+G A C
Sbjct: 427 TIQSDGSQVCLAFASASQSIST--NIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 144/418 (34%), Positives = 211/418 (50%), Gaps = 28/418 (6%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---SSLAVARKSVVPIASGRQ 90
+++ H+ CSP +P SW + V + +D ARL + +S S +P+ SG
Sbjct: 72 IRLDHIHGACSPLRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMSNLPLQSGTT 131
Query: 91 ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
+ + YIV A GTPA+ L+ +DT +D W+ C C C S V F QS+++K L
Sbjct: 132 VG-TGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTL 190
Query: 148 GCQAAQCKQV----PNPT-CGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGC 201
C +A C ++ NPT C G C + + YG S+ + SQ+T++L +D + FGC
Sbjct: 191 PCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDSFQNFAFGC 250
Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
TG GLLGLG+ SLS +Q+++ Y F+YCLP F + + +GS +G P
Sbjct: 251 GHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSVGKGSIP 310
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-TIIDSGTVFTR 320
+TPL+ N + Y+V L I VG + IPP L G G TI+DSGTV TR
Sbjct: 311 ASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVL------GRGSTIVDSGTVITR 364
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF-SGMNVTLPQ 375
L+ AY A++ FR + + DTCY + + PTIT F + +V +
Sbjct: 365 LLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFHFQNNADVAVSD 424
Query: 376 DNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+L+ GS CLA A+A N+I N QQQ R+ +D R+G A C
Sbjct: 425 VGILVPVQNGGSQVCLAFASASQMDG--FNIIGNFQQQRMRVAFDTGAGRIGFASGSC 480
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 138/430 (32%), Positives = 215/430 (50%), Gaps = 48/430 (11%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV---PIASGRQIT 92
V H PCSP + E S E+L +DQ R+ + LA AR S P ++ + ++
Sbjct: 68 VVHRHGPCSPLQAR---GGEPSHAEILDRDQDRVDSIHRLAAARPSSTADDPSSASKGVS 124
Query: 93 ---------QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
+ YIV +GTP + LL+ DT +D +WV C C GC +F+ +Q
Sbjct: 125 LPARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQ 184
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-------TD 192
STT+ + C A +C+++ + +C G C + + YG S NL++DT++L +D
Sbjct: 185 STTYSAVPCGAQECRRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSD 244
Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
+ + FGC TG GL GLGR +SL +Q Y + FSYCLPS + + G
Sbjct: 245 QLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPS--SSTAEGY 302
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
L LG P ++T ++ S YY+NL+ I+V R V + P + GT+I
Sbjct: 303 LSLGSAAPPN-ARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRT-----PGTVI 356
Query: 313 DSGTVFTRLVAPAYTAVRDVFR--RRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF 366
DSGTV TRL + AY A+R F R S +L DTCY + P++ L+F
Sbjct: 357 DSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLF 416
Query: 367 SG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G +N+ + +++ S CLA A+ D+ + + ++ NMQQ+ ++YDV N
Sbjct: 417 DGGATLNLGFGE---VLYVANKSQACLAFASNGDDTS--IAILGNMQQKTFAVVYDVANQ 471
Query: 424 RLGVARELCT 433
++G + C+
Sbjct: 472 KIGFGAKGCS 481
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 140/429 (32%), Positives = 215/429 (50%), Gaps = 41/429 (9%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL--------SSLAVARKSV 82
SS L V H PCSP + + S E+L +DQ R+ + ++ + ++
Sbjct: 62 SSALTVVHGHGPCSPQESRRG---APSHTEILGRDQDRVDAIRRKVAAVTTAASSSKPKG 118
Query: 83 VPIASGR-QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
VP+ G + + Y ++GTPA LL+ +DT +D +W+ C C C +F+
Sbjct: 119 VPLQVGWGKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDP 178
Query: 139 AQSTTFKNLGCQAAQCKQV----PNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TD 192
++S+T+ ++ C + +C+++ + C + +TY S NL++DT++L+ TD
Sbjct: 179 SKSSTYSDITCSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTD 238
Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----LS 248
VPG+ FGC G+ GLLGLGRG SL +Q Y + FSYCLPS + LS
Sbjct: 239 AVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPSATGYLS 298
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
FSG+ P ++T ++ + S YY+NL I V R + +PP T A
Sbjct: 299 FSGAAAA----APTNAQFTEMVAG-QHPSFYYLNLTGITVAGRAIKVPPSVF----ATAA 349
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITL 364
GTIIDSGT F+ L AY A+R R +G S FDTCY + + P++ L
Sbjct: 350 GTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVAL 409
Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
+F+ G V L +L + S TCLA PD+ + L V+ N QQ+ ++YDV N
Sbjct: 410 VFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTS--LGVLGNTQQRTLAVIYDVDNQ 467
Query: 424 RLGVARELC 432
++G C
Sbjct: 468 KVGFGANGC 476
>gi|217073830|gb|ACJ85275.1| unknown [Medicago truncatula]
Length = 267
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 110/254 (43%), Positives = 149/254 (58%), Gaps = 8/254 (3%)
Query: 14 LFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS 73
S+S +P C +Q S L V ++ CSPF P K SW+ VL M +KD AR+ +LS
Sbjct: 16 FMSMSNATDP-CASQPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74
Query: 74 SLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
SL VA+K+V PIASG+ YIVR KIGTP Q L M +DTS D A++P +GC+GC
Sbjct: 75 SL-VAQKTVSSAPIASGQAFNIG-NYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC 132
Query: 132 SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTYGSSTIAANLSQDTIS 188
S+T F+ ST++ L C QC QV +C G GAC+FN +Y ST +A L QD++
Sbjct: 133 SATTFSPNASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLR 192
Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
LATD++P Y+FG I +G+S+P Q GLG + + + F+ L
Sbjct: 193 LATDVIPSYSFGSINAISGSSIPAQRTFGLGPWPVIFIITNRVTLLGCILLLPSKFQILL 252
Query: 249 FSGSLRLGPIGQPK 262
F SL+LGP+G PK
Sbjct: 253 FFRSLKLGPVGHPK 266
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 133/418 (31%), Positives = 206/418 (49%), Gaps = 35/418 (8%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV---------ARKSV-VPI 85
V H PCSP E S E+L +DQ R+ + + A K V +P
Sbjct: 121 VVHRHGPCSPLLAR---GGEPSHAEILDRDQDRVDSIHRMTAGPWTAGQSSASKGVSLPA 177
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQST 142
G ++ + YIV +GTP + LL+ DT +D +WV PC C +F+ +QST
Sbjct: 178 HRGLRLGTA-NYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQST 236
Query: 143 TFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL--ATDIVPGYTF 199
T+ + C A +C + + TC G C + + YG S NL++DT++L ++D + G+ F
Sbjct: 237 TYSAVPCGAQEC--LDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVF 294
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
GC TG GL GLGR +SL +Q Y + FSYCLPS + G L LG
Sbjct: 295 GCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPS--SWRAEGYLSLGSAA 352
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
P ++T ++ S YY++L+ I+V R V + P + GT+IDSGTV T
Sbjct: 353 APPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFK-----APGTVIDSGTVIT 407
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNVTLPQ 375
RL + AY+A+R F + +L DTCY + P++ L+F G
Sbjct: 408 RLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLG 467
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+++ S CLA A+ D+ + + ++ NMQQ+ ++YD+ N ++G + C+
Sbjct: 468 FGGVLYVANRSQACLAFASNGDDTS--VGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 138/441 (31%), Positives = 211/441 (47%), Gaps = 38/441 (8%)
Query: 18 SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
S +P D ++L+V H PCS +P K S S ++LA+D++R+ + S
Sbjct: 61 SSACSPSPKGHDQRASLEVVHKHGPCSKLRPHKANS--PSHTQILAQDESRVASIQSRLA 118
Query: 78 ----------ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
A K+ +P S + S Y+V +G+P + L DT +D W C
Sbjct: 119 KNLAGGSNLKASKATLPSKSASTLG-SGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEP 177
Query: 128 CVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTI 178
CVG C +F+ + S ++ N+ C + C+++ + P C C + + YG +
Sbjct: 178 CVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSY 237
Query: 179 AANL-SQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST 236
+ +++ +SL +TD+ + FGC Q G GLLGL R LSL++QT Y
Sbjct: 238 SIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKV 297
Query: 237 FSYCLPSFKALSFSGSLRLGP-IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
FSYCLPS + S +G L G G K +K+TP N S Y+++++ I VG R + I
Sbjct: 298 FSYCLPS--SSSSTGYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPI 355
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
P + AGTIIDSGTV +RL Y++V+ VFR + V + DTCY +
Sbjct: 356 PKSVF-----STAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDL 410
Query: 356 P----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
+ P I L FSG +I+ S CLA A D + + +I N+QQ
Sbjct: 411 SKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSD--DDEVAIIGNVQQ 468
Query: 412 QNHRILYDVPNSRLGVARELC 432
+ ++YD R+G A C
Sbjct: 469 KTIHVVYDDAEGRVGFAPSGC 489
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 130/400 (32%), Positives = 196/400 (49%), Gaps = 27/400 (6%)
Query: 55 EESVLEMLAKDQARLQFLSS-LAVARKSVVPIASGRQITQ-----SPTYIVRAKIGTPAQ 108
+VL+++++D AR ++L+S L+ A + S ++ S Y VR IG+P
Sbjct: 77 RHAVLDLVSRDNARAEYLASRLSPAYQPTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPT 136
Query: 109 TLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-G 164
+ +D+ +D WV C C+ C + +F+ A S TF + C +A C+ + CG
Sbjct: 137 EQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAICRTLRTSGCGDS 196
Query: 165 GACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
G C + ++YG S L+ +T++L V G GC + G V GLLGLG G +
Sbjct: 197 GGCEYEVSYGDGSYTKGTLALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPM 256
Query: 224 SLLAQTQNLYQSTFSYCLPSFK-----ALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSS 277
SL+ Q FSYCL S A +GSL LG P+ + PL++NP+ S
Sbjct: 257 SLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPS 316
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
YYV + I VG + + G Q G G ++D+GT TRL AY A+RD F V
Sbjct: 317 FYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAV 376
Query: 338 GSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAM 392
G+ + DTCY + + PT++ F G +TLP NLL+ G I CLA
Sbjct: 377 GALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLE-VDGGIYCLAF 435
Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
A + +S L+++ N+QQ+ +I D N +G C
Sbjct: 436 APS----SSGLSILGNIQQEGIQITVDSANGYIGFGPATC 471
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 137/432 (31%), Positives = 206/432 (47%), Gaps = 40/432 (9%)
Query: 28 QDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV---------- 77
D ++L+V H PCS K S S +ML +D++R+ + S
Sbjct: 62 DDKRASLEVIHKHGPCSKLSQDKGRS--PSRTQMLDQDESRVNSIRSRLAKNPADGGKLK 119
Query: 78 ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C---SS 133
K +P SG I + Y+V +GTP + L DT +D W C C C
Sbjct: 120 GSKVTLPSKSGSTI-GTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQE 178
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTI 187
+FN ++ST++ N+ C + C ++ + P+C C + + YG + + +QD +
Sbjct: 179 PIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKL 238
Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
+L +TD+ + FGC Q G V GL+GLGR +LSL++QT Y FSYCLPS
Sbjct: 239 ALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPS--T 296
Query: 247 LSFSGSLRLGP-IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
S +G L G G K +K+TP L N + S Y++NL+AI VG R +
Sbjct: 297 SSSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVF----- 351
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
+ AGTIIDSGTV +RL AY+ +R F++++ DTCY + P
Sbjct: 352 STAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPK 411
Query: 362 ITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
I L FS G + L + + CLA A D + + ++ N+QQ+ ++YDV
Sbjct: 412 INLYFSDGAEMDLDPSGIFYILNISQV-CLAFAGNSDATD--IAILGNVQQKTFDVVYDV 468
Query: 421 PNSRLGVARELC 432
R+G A C
Sbjct: 469 AGGRIGFAPGGC 480
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 133/427 (31%), Positives = 204/427 (47%), Gaps = 37/427 (8%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
S+L V H CS K S + +E+L DQAR+ + S +++K S Q
Sbjct: 61 SSLHVTHRHGTCSRLNNGKATSPDH--VEILRLDQARVNSIHS-KLSKKLTTNHVSQSQS 117
Query: 92 TQSP----------TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFN 137
T P YIV +GTP L + DT +D W C CV +FN
Sbjct: 118 TDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFN 177
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGGACA-----FNLTYGSSTIAAN-LSQDTISL-A 190
++ST++ N+ C +A C + + T G+C+ + + YG + + L++D +L +
Sbjct: 178 PSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTLTS 237
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
+D+ G FGC + G GLLGLGR LS +QT Y FSYCLPS + S++
Sbjct: 238 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SASYT 295
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
G L G G + +K+TP+ +S Y +N++AI VG + + IP + G
Sbjct: 296 GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF-----STPGA 350
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
+IDSGTV TRL AY A+R F+ ++ T + + DTC+ + + P + F
Sbjct: 351 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 410
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
SG V + ++ S CLA A D+ N+ + N+QQQ ++YD R+G
Sbjct: 411 SGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAA--IFGNVQQQTLEVVYDGAGGRVG 468
Query: 427 VARELCT 433
A C+
Sbjct: 469 FAPNGCS 475
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 132/447 (29%), Positives = 212/447 (47%), Gaps = 60/447 (13%)
Query: 31 SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFLS---SLAVARKSV--- 82
S+ +++ H PCSP + KP + +E +LA DQ R++ + S R +
Sbjct: 68 SARMRIVHQHGPCSPLADAHGKPPAHDE----ILAADQNRVESIQRRVSATTGRDKLTKH 123
Query: 83 --------------------------VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
+P SGR ++ Y+V +GTPA + DT
Sbjct: 124 AAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTG-NYVVTVGLGTPASKYTVVFDT 182
Query: 117 SNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
+D WV C CV C +F+ A+S+T+ N+ C + C + C GG C + +
Sbjct: 183 GSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQ 242
Query: 173 YGSSTIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
YG + +QDT+++A D + G+ FGC +K G GL+GLGRG SL Q N
Sbjct: 243 YGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYN 302
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
Y F+YCLP+ + +G L GP + TP+L + + + YYV + IRVG +
Sbjct: 303 KYGGAFAYCLPAL--TTGTGYLDFGPGSAGNNARLTPMLTD-KGQTFYYVGMTGIRVGGQ 359
Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--F 349
V + + AGT++DSGTV TRL A AYTA+ F + + + + G
Sbjct: 360 QVPVAESVF-----STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSIL 414
Query: 350 DTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
DTCY + PT++L+F G + ++++ + + CLA A+ D+ + + +
Sbjct: 415 DTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNGDDES--VAI 472
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
+ N QQ+ + +LYD+ +G A C
Sbjct: 473 VGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 123/365 (33%), Positives = 184/365 (50%), Gaps = 25/365 (6%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTG-CVGCSSTVFNS 138
+P ++G + + ++V GTPAQT + DT +D +W+ PC+G C +F+
Sbjct: 122 IPDSTGTSL-DTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDP 180
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPG 196
+S T+ + C QC C G C + + YG S+ A LS +T+SL +T +PG
Sbjct: 181 TKSATYSVVPCGHPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTRALPG 240
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
+ FGC Q G+ GL+GLGRG LSL +Q + TFSYCLPS G L +G
Sbjct: 241 FAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTT--HGYLTIG 298
Query: 257 PI--GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
P ++YT +++ S Y+V L++I +G ++ +PP T GT +DS
Sbjct: 299 PTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLF-----TDDGTFLDS 353
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMN 370
GT+ T L AYTA+RD F+ + + FDTCY I P ++ FS +
Sbjct: 354 GTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGS 413
Query: 371 V-TLPQDNLLI--HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
V L +LI TA +I CL A P + ++ NMQQ+N ++YDV ++G
Sbjct: 414 VFDLSFFGILIFPDDTAPAIGCLGFVARPSAMP--FTIVGNMQQRNTEVIYDVAAEKIGF 471
Query: 428 ARELC 432
A C
Sbjct: 472 ASASC 476
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 133/447 (29%), Positives = 213/447 (47%), Gaps = 60/447 (13%)
Query: 31 SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFLSSLAVA---------- 78
S+ +++ H PCSP + KP + +E +LA DQ R++ + A
Sbjct: 68 SARMRIVHQHGPCSPLADAHGKPPAHDE----ILAADQNRVESIQRRVSATTGRDKLTKH 123
Query: 79 --------RKS--------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
+KS +P SGR ++ Y+V +GTPA + DT
Sbjct: 124 AAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTG-NYVVTVGLGTPASKYTVVFDT 182
Query: 117 SNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
+D WV C CV C +F+ A+S+T+ N+ C + C + C GG C + +
Sbjct: 183 GSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQ 242
Query: 173 YGSSTIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
YG + +QDT+++A D + G+ FGC +K G GL+GLGRG SL Q N
Sbjct: 243 YGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYN 302
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
Y F+YCLP+ + +G L GP + TP+L + + + YYV + IRVG +
Sbjct: 303 KYGGAFAYCLPAL--TTGTGYLDFGPGSAGNNARLTPMLTD-KGQTFYYVGMTGIRVGGQ 359
Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--F 349
V + + AGT++DSGTV TRL A AYTA+ F + + + + G
Sbjct: 360 QVPVAESVF-----STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSIL 414
Query: 350 DTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
DTCY + PT++L+F G + ++++ + + CLA A+ D+ + + +
Sbjct: 415 DTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNGDDES--VAI 472
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
+ N QQ+ + +LYD+ +G A C
Sbjct: 473 VGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 135/427 (31%), Positives = 213/427 (49%), Gaps = 36/427 (8%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-SLAVARKSVVPIASG 88
+SS L V H PCSP + E+L DQAR+ + +A A V+ A G
Sbjct: 71 NSSALNVVHRQGPCSPLQARGA---PPPHAELLNDDQARVDSIHRKIAAAASPVLDQARG 127
Query: 89 RQITQSPT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTV 135
++ P Y+V +GTPA+ + + DT +D +WV CT C C +
Sbjct: 128 KKGVTLPAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPL 187
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGS-STIAANLSQDTISLA-TD 192
F+ A+S+T+ + C + +C+ + + +C C + + YG S L++DT++L +D
Sbjct: 188 FDPARSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD 247
Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
++PG+ FGC ++ TG GL+GLGR +SL +Q + Y + FSYCLPS + S +G
Sbjct: 248 VLPGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPS--SPSAAGY 305
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
L LG P ++T + S YYV L+ ++V R V + P + F + AGT+I
Sbjct: 306 LSLGGPA-PANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSP--IVF---SAAGTVI 359
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS----VPIVAPTITLMF 366
DSGTV TRL Y A+R F R +G +L DTCY + P++ L+F
Sbjct: 360 DSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVF 419
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
+G + +++ S CLA A D ++ +I N QQ+ ++YDV ++G
Sbjct: 420 AGGAAVGLDFSGVLYVAKVSQACLAFAPNGDGADA--GIIGNTQQKTLAVVYDVARQKIG 477
Query: 427 VARELCT 433
C+
Sbjct: 478 FGANGCS 484
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 130/427 (30%), Positives = 204/427 (47%), Gaps = 37/427 (8%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS----------LAVARKS 81
S+L V H CS K S + +E+L DQAR+ + S ++ ++ +
Sbjct: 32 SSLHVTHRHGTCSRLNNGKATSPDH--VEILRLDQARVNSIHSKLSKKLATDHVSESKST 89
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFN 137
+P G + S YIV +GTP L + DT +D W C CV +FN
Sbjct: 90 DLPAKDGSTL-GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFN 148
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGGACA-----FNLTYGSSTIAAN-LSQDTISLA- 190
++ST++ N+ C +A C + + T G+C+ + + YG + + L+++ +L
Sbjct: 149 PSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTN 208
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
+D+ G FGC + G GLLGLGR LS +QT Y FSYCLPS + S++
Sbjct: 209 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SASYT 266
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
G L G G + +K+TP+ +S Y +N++AI VG + + IP + G
Sbjct: 267 GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF-----STPGA 321
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
+IDSGTV TRL AY A+R F+ ++ T + + DTC+ + + P + F
Sbjct: 322 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 381
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
SG V + + S CLA A D+ N+ + N+QQQ ++YD R+G
Sbjct: 382 SGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAA--IFGNVQQQTLEVVYDGAGGRVG 439
Query: 427 VARELCT 433
A C+
Sbjct: 440 FAPNGCS 446
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 130/427 (30%), Positives = 204/427 (47%), Gaps = 37/427 (8%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS----------LAVARKS 81
S+L V H CS K S + +E+L DQAR+ + S ++ ++ +
Sbjct: 60 SSLHVTHRHGTCSRLNNGKATSPDH--VEILRLDQARVNSIHSKLSKKLATDHVSESKST 117
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFN 137
+P G + S YIV +GTP L + DT +D W C CV +FN
Sbjct: 118 DLPAKDGSTL-GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFN 176
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGGACA-----FNLTYGSSTIAAN-LSQDTISLA- 190
++ST++ N+ C +A C + + T G+C+ + + YG + + L+++ +L
Sbjct: 177 PSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTN 236
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
+D+ G FGC + G GLLGLGR LS +QT Y FSYCLPS + S++
Sbjct: 237 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SASYT 294
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
G L G G + +K+TP+ +S Y +N++AI VG + + IP + G
Sbjct: 295 GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF-----STPGA 349
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
+IDSGTV TRL AY A+R F+ ++ T + + DTC+ + + P + F
Sbjct: 350 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 409
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
SG V + + S CLA A D+ N+ + N+QQQ ++YD R+G
Sbjct: 410 SGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAA--IFGNVQQQTLEVVYDGAGGRVG 467
Query: 427 VARELCT 433
A C+
Sbjct: 468 FAPNGCS 474
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 138/426 (32%), Positives = 210/426 (49%), Gaps = 40/426 (9%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQ 90
++T+ + H PCSP P+K + E E L +DQ R ++
Sbjct: 57 AATVPLHHRHGPCSPL-PTKKMPTLE---ERLHRDQLRAAYIQRKFSGGGVNGSRGGAGD 112
Query: 91 ITQS----PT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-- 134
+ QS PT Y++ ++G+P ++ M +DT +D +WV C C C S
Sbjct: 113 VQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQAD 172
Query: 135 -VFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGACAFNLTYGS-STIAANLSQDTISLA 190
+F+ + S+T+ C +A C Q+ C C + +TYG S+ S DT++L
Sbjct: 173 PLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLALG 232
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
++ V + FGC +G + GL+GLG G+ SL++QT + + FSYCLP+ S S
Sbjct: 233 SNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPATS--SSS 290
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
G L LG G +K TP+L++ + + Y V + AIRVG R + IP AGT
Sbjct: 291 GFLTLG-AGTSGFVK-TPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFS------AGT 342
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF 366
I+DSGTV TRL AY+A+ F+ + + G DTC+ + PT+ L+F
Sbjct: 343 IMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALVF 402
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
SG V + ++ T+ SI CLA AA D +S L +I N+QQ+ +LYDV +G
Sbjct: 403 SGGAVVDIASDGIMLQTSNSILCLAFAANSD--DSSLGIIGNVQQRTFEVLYDVGGGAVG 460
Query: 427 VARELC 432
C
Sbjct: 461 FKAGAC 466
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 135/446 (30%), Positives = 204/446 (45%), Gaps = 59/446 (13%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-------SSLAVARKSVVPI- 85
+ V H PCSP ++ S E+LA DQ R +++ + A RK P+
Sbjct: 1 MPVVHQHGPCSPLADNRN-GKAPSHAEILAADQRRAEYIHRRVAETTGRARRRKQGAPVE 59
Query: 86 -------------------------ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
AS + Y+V ++GTPA+ + DT +D
Sbjct: 60 LRPGTPPSSIVVPSSSSATSTTDLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDT 119
Query: 121 AWVPCTGCVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
WV C CV C +F+ +S T+ N+ C ++ C + C GG C + + YG
Sbjct: 120 TWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSYCSDLYVSGCSGGHCLYGIQYGDG 179
Query: 177 TIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS 235
+ +QDT++LA D + + FGC +K G GLLGLGRG SL Q + Y
Sbjct: 180 SYTIGFYAQDTLTLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGG 239
Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
F+YCLP+ A +G L LGP + TP+L + R + YYV + I+VG V+ I
Sbjct: 240 VFAYCLPATSA--GTGFLDLGPGAPAANARLTPMLVD-RGPTFYYVGMTGIKVGGHVLPI 296
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF---DTC 352
P + AGT++DSGTV TRL AY +R F + + L ++ F DTC
Sbjct: 297 PGSVF-----STAGTLVDSGTVITRLPPSAYAPLRSAFSKAM-QGLGYSAAPAFSILDTC 350
Query: 353 YSV------PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
Y + I P ++L+F G + +++ S CLA A D+ + + ++
Sbjct: 351 YDLTGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTD--VAIV 408
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N QQ+ H +LYD+ +G A C
Sbjct: 409 GNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 135/446 (30%), Positives = 204/446 (45%), Gaps = 59/446 (13%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-------SSLAVARKSVVPI- 85
+ V H PCSP ++ S E+LA DQ R +++ + A RK P+
Sbjct: 66 MPVVHQHGPCSPLADNRN-GKAPSHAEILAADQRRAEYIHRRVAETTGRARRRKQGAPVE 124
Query: 86 -------------------------ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
AS + Y+V ++GTPA+ + DT +D
Sbjct: 125 LRPGTPPSSIVVPSSSSATSTTDLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDT 184
Query: 121 AWVPCTGCVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
WV C CV C +F+ +S T+ N+ C ++ C + C GG C + + YG
Sbjct: 185 TWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSYCSDLYVSGCSGGHCLYGIQYGDG 244
Query: 177 TIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS 235
+ +QDT++LA D + + FGC +K G GLLGLGRG SL Q + Y
Sbjct: 245 SYTIGFYAQDTLTLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGG 304
Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
F+YCLP+ A +G L LGP + TP+L + R + YYV + I+VG V+ I
Sbjct: 305 VFAYCLPATSA--GTGFLDLGPGAPAANARLTPMLVD-RGPTFYYVGMTGIKVGGHVLPI 361
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF---DTC 352
P + AGT++DSGTV TRL AY +R F + + L ++ F DTC
Sbjct: 362 PGSVF-----STAGTLVDSGTVITRLPPSAYAPLRSAFSKAM-QGLGYSAAPAFSILDTC 415
Query: 353 YSV------PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
Y + I P ++L+F G + +++ S CLA A D+ + + ++
Sbjct: 416 YDLTGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTD--VAIV 473
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N QQ+ H +LYD+ +G A C
Sbjct: 474 GNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 130/391 (33%), Positives = 190/391 (48%), Gaps = 24/391 (6%)
Query: 62 LAKDQARLQFLSSLAVARKSVVP--------IASGRQITQ-SPTYIVRAKIGTPAQTLLM 112
L +D AR++ L+ LA A P + ++Q S Y R +GTP + L M
Sbjct: 86 LERDAARVKTLTHLAAATNKTRPANPGSGFSSSVVSGLSQGSGEYFTRLGVGTPPKYLYM 145
Query: 113 AMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG--GGAC 167
+DT +D W+ PCT C + +F+ ++S +F + C + C+++ +P C C
Sbjct: 146 VLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPLCRRLDSPGCSLKNNLC 205
Query: 168 AFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
+ ++YG + + S +T++ VP GC G V GLLGLGRG LS
Sbjct: 206 QYQVSYGDGSFTFGDFSTETLTFRRAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFP 265
Query: 227 AQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
QT + + FSYCL A + S+ G + ++TPL+KNP+ + YYV LL I
Sbjct: 266 TQTGTRFNNKFSYCLTDRTASAKPSSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGI 325
Query: 287 RVGRR-VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
VG V I + + T G IIDSGT TRL PAY ++RD FR
Sbjct: 326 SVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPE 385
Query: 346 LGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
FDTCY + + PT+ L F G +V+LP N L+ C A A S
Sbjct: 386 FSLFDTCYDLSGLSEVKVPTVVLHFRGADVSLPAANYLVPVDNSGSFCFAFAG----TMS 441
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L++I N+QQQ R+++D+ SR+G A C
Sbjct: 442 GLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 129/350 (36%), Positives = 180/350 (51%), Gaps = 16/350 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
S Y R +GTP + M +DT +D AW+ C C C S +FN + S +F +GC
Sbjct: 154 SGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCD 213
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+A C Q+ C G C + +YG + + + + +T++ T V GC K G
Sbjct: 214 SAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTFGTTSVANVAIGCGHKNVGLF 273
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
+ GLLGLG G+LS Q TFSYCL ++ S SG L+ GP P +TPL
Sbjct: 274 IGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRESDS-SGPLQFGPKSVPVGSIFTPL 332
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVD-IPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYT 327
KNP + YY+++ AI VG ++D IPP + + T+G G IIDSGTV TRLV AY
Sbjct: 333 EKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYD 392
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHS 382
AVRD F G ++ FDTCY + + PT+ FS G ++ LP N LI
Sbjct: 393 AVRDAFVAGTGQLPRTDAVSIFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKNYLIPM 452
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A A +V +++ N QQQ+ R+ +D NS +G A + C
Sbjct: 453 DTVGTFCFAFAPAASSV----SIMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 135/431 (31%), Positives = 209/431 (48%), Gaps = 44/431 (10%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-----------LAVA 78
S+L+V H PC + P + EML KDQ+R+ F+ S L +
Sbjct: 59 EQSSLEVIHRHGPCGDEVSNAP-----TAAEMLVKDQSRVDFIHSKIAGELESVDRLRGS 113
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV----GCSST 134
+ + +P SG I S YIV +GTP + L + DT +D W C C
Sbjct: 114 KATKIPAKSGATIG-SGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDP 172
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGG-ACAFNLTYGSSTIAAN-LSQDTI 187
VF +QSTT+ N+ C + C Q+ + P C AC + + YG + + +++T+
Sbjct: 173 VFVPSQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETL 232
Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
+L +TD++ + FGC Q G GL+GLG+ +S++ QT Y FSYCLP K
Sbjct: 233 TLTSTDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCLP--KT 290
Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
S +G L G G +KYTP+ K ++ Y V+++ ++VG IP + F+ +
Sbjct: 291 SSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGG--TQIPISSSVFSTS- 347
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTI 362
G IIDSGTV TRL AY+A++ F + + L DTCY + I P +
Sbjct: 348 --GAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKV 405
Query: 363 TLMFSGMNVTLPQDNL-LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
+F G L D + +++ + S CLA A D S + +I N+QQ+ +++YDV
Sbjct: 406 GFVFKGGE-ELDLDGIGIMYGASTSQVCLAFAGNQD--PSTVAIIGNVQQKTLQVVYDVG 462
Query: 422 NSRLGVARELC 432
++G C
Sbjct: 463 GGKIGFGYNGC 473
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 138/387 (35%), Positives = 195/387 (50%), Gaps = 37/387 (9%)
Query: 72 LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
++ L+ AR V P+ S + S YI + +GTP L+A+DT++D W+ C C C
Sbjct: 115 VAGLSSARGFVAPVVS--RAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRC 172
Query: 132 ---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG---GACAFNLTYGS-STIAANLSQ 184
S VF+ ST+++ + AA C+ + G G C + + YG ST + +
Sbjct: 173 YPQSGPVFDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIE 232
Query: 185 DTISLATDI-VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
+T++ A + +P + GC G P G+LGLGRG +S Q + TFSYCL
Sbjct: 233 ETLTFAGGVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQID--HNGTFSYCLV 290
Query: 243 SFKALSFSGSLR------LGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-RVVDI 295
F LS GSL G + + +TP + N + YYV L I VG RV +
Sbjct: 291 DF--LSGPGSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGV 348
Query: 296 PPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG----FD 350
LQ +P TG G I+DSGT TRL PAYTA RD F R V +L S+GG FD
Sbjct: 349 TERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAF-RAVAVDLGQVSIGGPSGFFD 407
Query: 351 TCYSVP----IVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
TCY+V PT+++ F+G + V L N LI + C A AA D+ +++
Sbjct: 408 TCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDH---SVSI 464
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
I N+QQQ RI+YD+ R+G A C
Sbjct: 465 IGNIQQQGFRIVYDI-GGRVGFAPNSC 490
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 126/395 (31%), Positives = 191/395 (48%), Gaps = 34/395 (8%)
Query: 60 EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT------YIVRAKIGTPAQTLLMA 113
E + + R+ F + + P A G Q QSP Y++ +G+P Q+ +
Sbjct: 2 EAVQRSHERVAFYT------LKLSPDAFGSQEFQSPVKAGNGEYLMTLTLGSPPQSFDVI 55
Query: 114 MDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCK--QVPNPTCGGGACA 168
+DT +D WV C C C F+ ++S +F+ C C +P C C
Sbjct: 56 VDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANVCQ 115
Query: 169 FNLTYGS-STIAANLSQDTISL----ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
+ TYG S +L+ +TISL T VP + FGC + G GL+GLG+G L
Sbjct: 116 YQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPL 175
Query: 224 SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNL 283
SL +Q + + + FSYCL S +LS S L G I I+YT ++ N R + YYV L
Sbjct: 176 SLNSQLSHTFANKFSYCLVSLNSLSAS-PLTFGSIAAAANIQYTSIVVNARHPTYYYVQL 234
Query: 284 LAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
+I VG + +++ P + +TG GTIIDSGT T L PAY+AV + V
Sbjct: 235 NSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRL 294
Query: 343 VTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIH-STAGSITCLAMAAAPD 397
S G D C+++ V+ P + F G + + +NL + T+ + CLAM +
Sbjct: 295 DGSAYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGGSQG 354
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++I N+QQQNH ++YD+ ++G A C
Sbjct: 355 -----FSIIGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 133/392 (33%), Positives = 189/392 (48%), Gaps = 25/392 (6%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPI-ASGRQITQSPT---------YIVRAKIGTPAQTLL 111
LA+D +R++ L+SLA A S A G + S T Y R +GTPA+ +
Sbjct: 102 LARDASRVKSLTSLAAAVGSTNRTRARGPGFSSSVTSGLAQGSGEYFTRLGVGTPARYVF 161
Query: 112 MAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-- 166
M +DT +D W+ C C C S VFN +S +F N+ C + C+++ +P C
Sbjct: 162 MVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPLCRRLDSPGCSTKKHI 221
Query: 167 CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSL 225
C + ++YG + S +T++ V GC G + GLLGLGRG LS
Sbjct: 222 CLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDNEGLFIGAAGLLGLGRGRLSF 281
Query: 226 LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLA 285
+Q + FSYCL A S + G + ++TPL+ NP+ + YYV LL
Sbjct: 282 PSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLG 341
Query: 286 IRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
+ V G RV I + + T G IIDSGT TRL PAY A+RD FR +
Sbjct: 342 VSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAP 401
Query: 345 SLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
FDTC+ + + PT+ L F G +V+LP N LI C A A
Sbjct: 402 EFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLIPVDNSGSFCFAFAG----TM 457
Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S L+++ N+QQQ R++YD+ SR+G A C
Sbjct: 458 SGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 139/425 (32%), Positives = 210/425 (49%), Gaps = 53/425 (12%)
Query: 32 STLQVFHVFSPCSPF-------KPSKPLSWEESVLEMLAKDQARLQFL-----------S 73
++L+V H PCS K + P S ++L +D+ R++++ S
Sbjct: 70 ASLEVVHKHGPCSQLNDHDGKAKSTTPHS------DILNQDKERVKYINSRLSKNLGQDS 123
Query: 74 SLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT----GCV 129
S+ + +P SG I S Y V +GTP + L + DT +D W C C
Sbjct: 124 SVEELDSATLPAKSGSLI-GSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCY 182
Query: 130 GCSSTVFNSAQSTTFKNLGCQAAQCKQVP-----NPTCGGG--ACAFNLTYGSSTIAAN- 181
+F+ ++ST++ N+ C +A C Q+ +P C AC + + YG S+ +
Sbjct: 183 KQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGY 242
Query: 182 LSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
S++ +++ ATD+V + FGC Q G GL+GLGR +S + QT Y+ FSYC
Sbjct: 243 FSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYC 302
Query: 241 LPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
LPS S +G L GP + +KYTP R SS Y +++ AI VG V +P +
Sbjct: 303 LPS--TSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGG--VKLPVSSS 358
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP---- 356
F +TG G IIDSGTV TRL AY A+R FR+ + + L DTCY +
Sbjct: 359 TF--STG-GAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKV 415
Query: 357 IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
PTI F+ G+ V LP +L ++ + CLA AA D +S + + N+QQ+
Sbjct: 416 FSIPTIEFSFAGGVTVKLPPQGILFVASTKQV-CLAFAANGD--DSDVTIYGNVQQRTIE 472
Query: 416 ILYDV 420
++YDV
Sbjct: 473 VVYDV 477
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 139/427 (32%), Positives = 201/427 (47%), Gaps = 30/427 (7%)
Query: 26 DTQDHSST--LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV 83
DT + S+T +Q+ HV + P E L +D AR++ +S LA +
Sbjct: 52 DTAESSATFSVQLHHVDALSFNSTP------ETLFTTRLQRDAARVEAISYLAETAGTGK 105
Query: 84 PIASGRQIT-------QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSS 133
+ +G + S Y R +GTP + + M +DT +D W+ PC C S
Sbjct: 106 RVGTGFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSD 165
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTIA-ANLSQDTISLA 190
VF+ +S +F ++ C++ C ++ +P C C + ++YG + + S +T++
Sbjct: 166 PVFDPRKSRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFR 225
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
V GC G V GLLGLGRG LS +QT + FSYCL A S
Sbjct: 226 RTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKP 285
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAG 309
S+ G + ++TPL+ NP+ + YYV LL I V G RV I + + T G
Sbjct: 286 SSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGG 345
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLM 365
IIDSGT TRL PAY A RD FR + FDTC+ + + PT+ L
Sbjct: 346 VIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLH 405
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
F G +V+LP N LI CLA A L++I N+QQQ R++YD+ SR+
Sbjct: 406 FRGADVSLPASNYLIPVDTSGNFCLAFAGTMGG----LSIIGNIQQQGFRVVYDLAGSRV 461
Query: 426 GVARELC 432
G A C
Sbjct: 462 GFAPHGC 468
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 131/384 (34%), Positives = 189/384 (49%), Gaps = 20/384 (5%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDA 120
L +D R+ L+S A S V SG ++Q S Y R +GTP + L M +DT +D
Sbjct: 78 LHRDTLRVHALNSRAAGFSSSV--VSG--LSQGSGEYFTRLGVGTPPRYLYMVLDTGSDV 133
Query: 121 AWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG--GACAFNLTYGS 175
W+ C+ C C S +FN +S +F + C + C+++ + C C + ++YG
Sbjct: 134 VWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD 193
Query: 176 STI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
+ + + +T++ + + GC G V GLLGLGRG LS +QT +
Sbjct: 194 GSFTTGDFATETLTFRGNKIAKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFN 253
Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-RVV 293
FSYCL A S S+ G + ++TPL++NP+ + YYV L+ I VG RV
Sbjct: 254 HKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVR 313
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
+ P + + G IIDSGT TRL PAYTA+RD FR FDTCY
Sbjct: 314 GVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCY 373
Query: 354 SV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
+ + PT+ L F G ++ LP N LI C A A S L++I N+
Sbjct: 374 DLSGQSSVKVPTVVLHFRGADMALPATNYLIPVDENGSFCFAFAG----TISGLSIIGNI 429
Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
QQQ R++YD+ SR+G A CT
Sbjct: 430 QQQGFRVVYDLAGSRIGFAPRGCT 453
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 123/408 (30%), Positives = 195/408 (47%), Gaps = 22/408 (5%)
Query: 35 QVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS--LAVARKSVVPIASGRQIT 92
++ H P SP + + + E L + + R LS LA R P+ASG
Sbjct: 21 ELIHREHPSSPLRSNTSKTTTEIFLAAVKRGAERRAQLSKHILAEGRLFSTPVASGNG-- 78
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGC 149
Y++ G+P Q + +DT +D W +PC C +S +F+ +S+T+ + C
Sbjct: 79 ---EYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSC 135
Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
+ C +P +C +C ++ YG S+ + LS +T+++ T +P FGC G+
Sbjct: 136 ASNFCSSLPFQSCTT-SCKYDYMYGDGSSTSGALSTETVTVGTGTIPNVAFGCGHTNLGS 194
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
G++GLG+G LSL++Q ++ FSYCL + S + +G + YT
Sbjct: 195 FAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTS-PMLIGDSAAAGGVAYTA 253
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
LL N + YY +L I V + V P G + + G I+DSGT T L A+ A
Sbjct: 254 LLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGAFNA 313
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIHSTA 384
+ + V SL G D C+S VA PT+T F G + LP +N+ +
Sbjct: 314 LVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELPPENVFVALDT 373
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
G CLAMAA+ + +++ N+QQQNH I++D+ N R+G C
Sbjct: 374 GGSICLAMAAS-----TGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 119/399 (29%), Positives = 199/399 (49%), Gaps = 32/399 (8%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSS--LAVARKSVVPIASGRQITQSPTYIVRAKIGTPA 107
K L+ E + +A+ + RL L++ LA A +V + + ++++ IG+P
Sbjct: 317 KNLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPP 376
Query: 108 QTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
++ MDT +D W PC C S+ +F+ QS++F + C + C +P TC
Sbjct: 377 RSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSS 436
Query: 165 GACAFNLTYG-SSTIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPP-QGLLG 217
C + TYG SS+ L+ +T + +PG FGC G+ GL+G
Sbjct: 437 DGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVG 496
Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PK----RIKYTPLLK 271
LGRG LSL++Q L + F+YCL + S SL LG + PK +K TPL+K
Sbjct: 497 LGRGPLSLVSQ---LKEQKFAYCLTAIDD-SKPSSLLLGSLANITPKTSKDEMKTTPLIK 552
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
NP + S YY++L I VG + IP + + G IIDSGT T + A+T++++
Sbjct: 553 NPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKN 612
Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
F ++ + + GG D C+++P + P +T F G ++ LP +N +I +
Sbjct: 613 EFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAG 672
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+ CLA+ ++ +++ N+QQQN +++D+ L
Sbjct: 673 LLCLAIGSSRG-----MSIFGNLQQQNFMVVHDLQEETL 706
>gi|302142046|emb|CBI19249.3| unnamed protein product [Vitis vinifera]
Length = 191
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 93/172 (54%), Positives = 123/172 (71%), Gaps = 5/172 (2%)
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
+++TP+L P + + + VGR +V + P L F+P TGAGTIIDSGTV TR V
Sbjct: 22 HLRFTPMLCAPVHPGPWPL-VPHHGVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFV 80
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLPQDNLLI 380
P Y A+RD FR++V ++G FDTC++ +AP +T F+GM++ LP +N LI
Sbjct: 81 EPVYAAIRDEFRKQVKGPFA--TIGAFDTCFAATNEDIAPPVTFHFTGMDLKLPLENTLI 138
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
HS+AGS+ CLAMAAAP+NVNSVLNVIAN+QQQN RI++DV NSRLG+ARELC
Sbjct: 139 HSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 190
>gi|242059939|ref|XP_002459115.1| hypothetical protein SORBIDRAFT_03g046190 [Sorghum bicolor]
gi|241931090|gb|EES04235.1| hypothetical protein SORBIDRAFT_03g046190 [Sorghum bicolor]
Length = 153
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/153 (57%), Positives = 119/153 (77%), Gaps = 3/153 (1%)
Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
++ IRVG + V +P AL F+PT+G GTI+D+GT+FTRL AP Y AVRD FRRRV + +
Sbjct: 1 MVGIRVGGKPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDAFRRRVRAPVA 60
Query: 343 VTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAA-PDNVN 400
LGGFDTCY+V + PT+T +F G ++VTLP++N++I S++G I CLAMAA PD V+
Sbjct: 61 -GPLGGFDTCYNVTVSVPTVTFVFDGPVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVD 119
Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ LNV+A+MQQQNHR+L+DV N R+G +RELCT
Sbjct: 120 AALNVLASMQQQNHRVLFDVANGRVGFSRELCT 152
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 119/399 (29%), Positives = 199/399 (49%), Gaps = 32/399 (8%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSS--LAVARKSVVPIASGRQITQSPTYIVRAKIGTPA 107
K L+ E + +A+ + RL L++ LA A +V + + ++++ IG+P
Sbjct: 62 KNLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPP 121
Query: 108 QTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
++ MDT +D W PC C S+ +F+ QS++F + C + C +P TC
Sbjct: 122 RSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSS 181
Query: 165 GACAFNLTYG-SSTIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPP-QGLLG 217
C + TYG SS+ L+ +T + +PG FGC G+ GL+G
Sbjct: 182 DGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVG 241
Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PK----RIKYTPLLK 271
LGRG LSL++Q L + F+YCL + S SL LG + PK +K TPL+K
Sbjct: 242 LGRGPLSLVSQ---LKEQKFAYCLTAIDD-SKPSSLLLGSLANITPKTSKDEMKTTPLIK 297
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
NP + S YY++L I VG + IP + + G IIDSGT T + A+T++++
Sbjct: 298 NPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKN 357
Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
F ++ + + GG D C+++P + P +T F G ++ LP +N +I +
Sbjct: 358 EFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAG 417
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+ CLA+ ++ +++ N+QQQN +++D+ L
Sbjct: 418 LLCLAIGSSRG-----MSIFGNLQQQNFMVVHDLQEETL 451
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 132/399 (33%), Positives = 193/399 (48%), Gaps = 29/399 (7%)
Query: 53 SWEESVLEMLAKDQARLQFLSS-LAVA-------RKSVVPIASGRQITQSPTYIVRAKIG 104
S +VL+++A+D AR ++L+S L+ A S + SG S Y VR IG
Sbjct: 76 SRRHAVLDLVARDNARAEYLASRLSPAAYQPTGFSGSESKVVSGLD-EGSGEYFVRVGIG 134
Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
+P + +D+ +D WV C C+ C + +F+ A S TF + C +A C+ +
Sbjct: 135 SPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVCRTLRTSG 194
Query: 162 CG-GGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLG 219
CG G C + ++YG S L+ +T++L V G GC + G V GLLGLG
Sbjct: 195 CGDSGGCDYEVSYGDGSYTKGALALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLG 254
Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSSL 278
G +SL+ Q FSYCL S A GSL LG P+ + PL++NP+ S
Sbjct: 255 WGPMSLVGQLGGAAGGAFSYCLASRGA----GSLVLGRSEAVPEGAVWVPLVRNPQAPSF 310
Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG 338
YYV L I VG + + Q G ++D+GT TRL AY A+RD F VG
Sbjct: 311 YYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVG 370
Query: 339 SNLTVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMA 393
+ + DTCY + + PT++ F G +TLP NLL+ G I CLA A
Sbjct: 371 ALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLE-VDGGIYCLAFA 429
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ +S +++ N+QQ+ +I D N +G C
Sbjct: 430 PS----SSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 203/433 (46%), Gaps = 46/433 (10%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG 88
+ S TL + H+ + S P +E L +D R++ +++LA +P G
Sbjct: 69 ESSITLNLDHIDALSSNKTP------QELFSSRLQRDSRRVKSIATLAAQ----IP---G 115
Query: 89 RQITQSP------------------TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
R +T +P Y R +GTPA+ + M +DT +D W+ C C
Sbjct: 116 RNVTHAPRTGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR 175
Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG--GACAFNLTYGSSTI-AANLSQ 184
C S +F+ +S T+ + C + C+++ + C C + ++YG + + S
Sbjct: 176 CYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFST 235
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
+T++ + V G GC G V GLLGLG+G LS QT + + FSYCL
Sbjct: 236 ETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDR 295
Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFN 303
A S S+ G + ++TPLL NP+ + YYV LL I V G RV + + +
Sbjct: 296 SASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLD 355
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVA 359
G IIDSGT TRL+ PAY A+RD FR + FDTC+ + +
Sbjct: 356 QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKV 415
Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
PT+ L F G +V+LP N LI C A A L++I N+QQQ R++YD
Sbjct: 416 PTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYD 471
Query: 420 VPNSRLGVARELC 432
+ +SR+G A C
Sbjct: 472 LASSRVGFAPGGC 484
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 181 bits (458), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 139/428 (32%), Positives = 213/428 (49%), Gaps = 40/428 (9%)
Query: 38 HVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---LAVA---------------- 78
H S SP++P+ + V L +D+ RL +SS L VA
Sbjct: 2 HRDSADSPYRPANA-TVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60
Query: 79 ---RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCS 132
+ P+ SG S Y V +GTP +T+ M DT +D W+ PC C G +
Sbjct: 61 FLQQDFETPLRSGLS-DGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQT 119
Query: 133 STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLAT 191
+FN + S+TF+++ C ++ C+Q+ C C + ++YG + S +T+S +
Sbjct: 120 DPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGS 179
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
+ V GC G GLLGLG+G LS +Q LY S FSYCLP+ ++ S
Sbjct: 180 NAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG-SV 238
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGT 310
L G ++T LL NP+ + YYV ++ I+VG V+IP G+L + +TG G
Sbjct: 239 PLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGV 298
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----PIVAPTITLM 365
I+DSGT TRLV AY +RD FR + S+ +TS FDTCY + I+ P ++ +
Sbjct: 299 ILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFV 358
Query: 366 FS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F+ G + LP N+++ CLA A +N ++I N+QQQ+ R+ +D +R
Sbjct: 359 FNGGATMALPAQNIMVPVDNSGTYCLAFAPNSEN----FSIIGNIQQQSFRMSFDSTGNR 414
Query: 425 LGVARELC 432
+G+ C
Sbjct: 415 VGIGANQC 422
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 133/430 (30%), Positives = 207/430 (48%), Gaps = 39/430 (9%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-----------SSLAVARK 80
++L+V H PCS + S +++ D R++++ +S+
Sbjct: 61 ASLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVKELDS 120
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVF 136
+ +P SG I S Y V +GTP + L + DT +D W C C G +F
Sbjct: 121 TTLPAKSGSLI-GSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIF 179
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGS-STIAANLSQDTISL 189
+ ++S+++ N+ C ++ C Q+ + + AC + + YG ST LSQ+ +++
Sbjct: 180 DPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTI 239
Query: 190 -ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
ATDIV + FGC Q G GL+GLGR +S + QT ++Y FSYCLPS S
Sbjct: 240 TATDIVDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLPS--TSS 297
Query: 249 FSGSLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
G L G +KYTPL ++ Y ++++ I VG +P A+ + +
Sbjct: 298 SLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGG--TKLP--AVSSSTFSA 353
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTIT 363
G+IIDSGTV TRL AY A+R FR+ + G FDTCY I P I
Sbjct: 354 GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKID 413
Query: 364 LMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
F+ G+ V LP +LI +A + CLA AA ++ + + + N+QQ+ ++YDV
Sbjct: 414 FEFAGGVTVELPLVGILIGRSAQQV-CLAFAANGNDND--ITIFGNVQQKTLEVVYDVEG 470
Query: 423 SRLGVARELC 432
R+G C
Sbjct: 471 GRIGFGAAGC 480
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 130/377 (34%), Positives = 184/377 (48%), Gaps = 43/377 (11%)
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFN 137
S +P ASG Y +GTP L+ +DT +D W+ C CV C S +++
Sbjct: 90 SGLPFASGE-------YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYD 142
Query: 138 SAQSTTFKNLGCQAAQCKQVPNP-TCGG--GACAFNLTYG-SSTIAANLSQDTISLATDI 193
S+T+ C QC+ NP TC G G C + + YG +S+ + NL+ D + + D
Sbjct: 143 PRGSSTYAQTPCSPPQCR---NPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDT 199
Query: 194 VPG-YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSG 251
G T GC G GLLG+ RG+ S Q + Y F+YCL ++ S S
Sbjct: 200 SVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSS 259
Query: 252 SLRLGPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPTTG-A 308
L G +P +TPL NPRR SLYYV+++ VG V +L +P TG
Sbjct: 260 YLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRG 319
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFR--------RRVGSNLTVTSLGGFDTCYSVPIV-- 358
G ++DSGT TR AY A+RD F R+VG ++V FD CY + V
Sbjct: 320 GVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISV-----FDACYDLRGVAV 374
Query: 359 --APTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
AP + L F+ G +V LP +N L+ +G C A+ AA + L+VI N+ QQ R
Sbjct: 375 ADAPGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAG---HDGLSVIGNVLQQRFR 431
Query: 416 ILYDVPNSRLGVARELC 432
+++DV N R+G C
Sbjct: 432 VVFDVENERVGFEPNGC 448
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 133/423 (31%), Positives = 207/423 (48%), Gaps = 41/423 (9%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-------SSLAVARKSV 82
+ ++L+V H PCS +++E+L +DQ+R+ + S + +
Sbjct: 63 NKASLKVVHKHGPCSQLNQQN--GNAPNLVEILLEDQSRVDSIHAKLSDHSGVKETDAAK 120
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
+P SG + + YIV +G+P + L++ DT +D W C+ ++ F+ +ST
Sbjct: 121 LPTKSGMSL-GTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCS-----AAETFDPTKST 174
Query: 143 TFKNLGCQAAQCKQV----PNPT-CGGGACAFNLTYGSSTIAAN-LSQDTISL-ATDIVP 195
++ N+ C C V NP+ C C + + YG + + L ++ +++ +TDI
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTDIFN 234
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
+ FGC Q G GLLGLGR LS+++QT Y FSYCLPS + F L
Sbjct: 235 NFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGF---LSF 291
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
G Q K K+TPL P SS Y ++L I VG + + IP + AGTIIDSG
Sbjct: 292 GS-SQSKSAKFTPLSSGP--SSFYNLDLTGITVGGQKLAIPLSVF-----STAGTIIDSG 343
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMN 370
TV TRL AY+A+R FR+ + S L DTCY I P I + FS G++
Sbjct: 344 TVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFSGGVD 403
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
V + Q + + + + CLA A ++ + N QQ+N ++YDV ++G A
Sbjct: 404 VDVDQAGIFVANGLKQV-CLAFAGNTGARDTA--IFGNTQQRNFEVVYDVSGGKVGFAPA 460
Query: 431 LCT 433
C+
Sbjct: 461 SCS 463
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 139/428 (32%), Positives = 212/428 (49%), Gaps = 40/428 (9%)
Query: 38 HVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---LAVA---------------- 78
H S SP++P+ + V L +D+ RL +SS L VA
Sbjct: 2 HRDSADSPYRPANA-TVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60
Query: 79 ---RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCS 132
+ P+ SG S Y V +GTP +T+ M DT +D W+ PC C G +
Sbjct: 61 FLQQDFETPLRSGLS-DGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQT 119
Query: 133 STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLAT 191
+FN + S+TF+++ C ++ C+Q+ C C + ++YG + S +T+S +
Sbjct: 120 DPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGS 179
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
+ V GC G GLLGLG+G LS +Q LY S FSYCLP+ ++ S
Sbjct: 180 NAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG-SV 238
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGT 310
L G ++T LL NP+ + YYV ++ I+VG V IP G+L + +TG G
Sbjct: 239 PLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGV 298
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----PIVAPTITLM 365
I+DSGT TRLV AY +RD FR + S+ +TS FDTCY + I+ P ++ +
Sbjct: 299 ILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFV 358
Query: 366 FS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F+ G + LP N+++ CLA A +N ++I N+QQQ+ R+ +D +R
Sbjct: 359 FNGGATMALPAQNIMVPVDNSGTYCLAFAPNSEN----FSIIGNIQQQSFRMSFDSTGNR 414
Query: 425 LGVARELC 432
+G+ C
Sbjct: 415 VGIGANQC 422
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 115/368 (31%), Positives = 174/368 (47%), Gaps = 30/368 (8%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS---TVFNSAQ 140
P+A+ R Y+ ++GTP + + +DT +D WV C+ C C S +F
Sbjct: 5 PVAAARG-----EYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNT 59
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN------LSQDTISLATDIV 194
ST+F L C +A C +P P C C + +YG ++ ++ D I+ V
Sbjct: 60 STSFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQV 119
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-L 253
P + FGC G+ G+LGLG+G LS +Q +++Y FSYCL + A S L
Sbjct: 120 PNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPL 179
Query: 254 RLGPIGQP--KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
G P +KY P+L NP+ + YYV L I VG +++I + GAGTI
Sbjct: 180 LFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTI 239
Query: 312 IDSGTVFTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYS------VPIVAPTITL 364
DSGT T+L AY V + + + + D C S +P V P +T
Sbjct: 240 FDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPTV-PAMTF 298
Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F G ++ LP N I+ + C AM ++PD +N+I ++QQQN ++ YD +
Sbjct: 299 HFEGGDMVLPPSNYFIYLESSQSYCFAMTSSPD-----VNIIGSVQQQNFQVYYDTAGRK 353
Query: 425 LGVARELC 432
LG + C
Sbjct: 354 LGFVPKDC 361
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 137/410 (33%), Positives = 194/410 (47%), Gaps = 36/410 (8%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLAVARK--------------SVVPIASGRQITQ----- 93
S+E + E L ++ AR++ L + RK + V G ++
Sbjct: 92 SYERRLEEKLRREAARVRALEQ-RIERKLKLKKDPAGSYENVAGVTAEFGSEVVSGMEQG 150
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
S Y R IGTP + M +DT +D W+ C C C S +FN + S +F +GC
Sbjct: 151 SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCD 210
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+A C Q+ C GG C + ++YG S + + +T++ T + GC G
Sbjct: 211 SAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGHDNVGLF 270
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
V GLLGLG GSLS AQ FSYCL + S SG+L GP P +TPL
Sbjct: 271 VGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSES-SGTLEFGPESVPIGSIFTPL 329
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVD-IPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYT 327
+ NP + YY++++AI VG ++D +P A + + TTG G IIDSGT TRL AY
Sbjct: 330 VANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYD 389
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHS 382
A+RD F + FDTCY + + P + FS G LP N LI
Sbjct: 390 ALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPM 449
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ C A A A N L+++ N+QQQ R+ +D NS +G A + C
Sbjct: 450 DSMGTFCFAFAPADSN----LSIMGNIQQQGIRVSFDSANSLVGFAIDQC 495
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 126/350 (36%), Positives = 173/350 (49%), Gaps = 16/350 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
S Y R IGTP + M +DT +D W+ C C C S +FN + S +F +GC
Sbjct: 5 SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCD 64
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+A C Q+ C GG C + ++YG S + + +T++ T + GC G
Sbjct: 65 SAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGHDNVGLF 124
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
V GLLGLG GSLS AQ FSYCL + S SG+L GP P +TPL
Sbjct: 125 VGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSES-SGTLEFGPESVPIGSIFTPL 183
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVD-IPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYT 327
+ NP + YY++++AI VG ++D +P A + + TTG G IIDSGT TRL AY
Sbjct: 184 VANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYD 243
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHS 382
A+RD F + FDTCY + + P + FS G LP N LI
Sbjct: 244 ALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPM 303
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ C A A A N L+++ N+QQQ R+ +D NS +G A + C
Sbjct: 304 DSMGTFCFAFAPADSN----LSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 123/359 (34%), Positives = 179/359 (49%), Gaps = 32/359 (8%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTPA+ +DT +D W PC CV + F+ A S+T+++LGC A
Sbjct: 92 YLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAPA 151
Query: 154 CKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATD----IVPGYTFGCIQKATGN 208
C + P C C + YG S++ A L+ +T + T+ +P +FGC G+
Sbjct: 152 CNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLNAGS 211
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPIGQPKRI 264
G++G GRGSLSL++Q L FSYCL SF ++ + G+ +
Sbjct: 212 LANGSGMVGFGRGSLSLVSQ---LGSPRFSYCLTSFLSPVRSRLYFGAYATLNSTNASTV 268
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVA 323
+ TP + NP ++Y++N+ I VG + I P L N T G GTIIDSGT T L
Sbjct: 269 QSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAE 328
Query: 324 PAYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSVP------IVAPTITLMFSGMNVTLP 374
PAY AVR+ F + S L VT DTC+ P + P + L F G + LP
Sbjct: 329 PAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDGADWELP 388
Query: 375 -QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
Q+ +L+ + G + CLAMA + D ++I + Q QN +LYD+ NS L C
Sbjct: 389 LQNYMLVDPSTGGL-CLAMATSSDG-----SIIGSYQHQNFNVLYDLENSLLSFVPAPC 441
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 125/352 (35%), Positives = 179/352 (50%), Gaps = 22/352 (6%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
S Y R +GTPA+++ M DT +D +W+ C+ C C +FN + S++FK L C
Sbjct: 78 SGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACA 137
Query: 151 AAQCKQVPNPTCG-GGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGN 208
++ C ++ C C + ++YG + + S +T+S V GC + G
Sbjct: 138 SSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAMGCGRNNQGL 197
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
GLLGLGRG LS +QT Y S FSYCLP + + + SL GP P++ ++T
Sbjct: 198 FHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPR-RESAIAASLVFGPSAVPEKARFTK 256
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
LL N R + YYV L IRV V+IPP A G I+DSGT +RL PAYTA
Sbjct: 257 LLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTA 316
Query: 329 VRDVFRRRVGSNLTVTSLGG---FDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLI 380
+RD FR S +T S G FDTCY + + P + L F G ++ LP D +L+
Sbjct: 317 LRDAFR----SLVTFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILV 372
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ CLA A + ++I N+QQQ RI D ++G+A + C
Sbjct: 373 NVDDEGTYCLAFAPEEE----AFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 141/432 (32%), Positives = 207/432 (47%), Gaps = 59/432 (13%)
Query: 33 TLQVFHVFSPCSPFKPSK-PLSWEESVLEMLAKDQARLQFLS---------SLAVARKSV 82
T+ + H PCSP +K P S EE L +DQ R ++ + + +
Sbjct: 62 TVPLHHRHGPCSPVPSNKMPASLEE----RLQRDQLRAAYIKRKFSGAKGGDVEQSDAAT 117
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSA 139
VP G ++ + Y++ IG+PA T M+MDT +D +WV C C C S V F+ +
Sbjct: 118 VPTTLGTSLS-TLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPS 176
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTY--GSSTIAANLSQDTISLATDI 193
S+T+ C +A C Q+ G G C + ++Y GSST S DT++L ++
Sbjct: 177 ASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTT-GTYSSDTLTLGSNA 235
Query: 194 VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
+ G+ FGC Q +G S GL+GLG + SL++QT + FSYCLP SG
Sbjct: 236 IKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTPGS--SGF 293
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
L LG + +K TP+L++ + + Y V L AIRVG + ++IP AG+++
Sbjct: 294 LTLGAASRSGFVK-TPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFS------AGSVM 346
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG 368
DSGTV TRL AY+A+ F+ + G DTC+ + P++ L+FSG
Sbjct: 347 DSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 406
Query: 369 MNVT--------LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
V L DN CLA AA D +S L I N+QQ+ +LYDV
Sbjct: 407 GAVVNLDFNGIMLELDNW----------CLAFAANSD--DSSLGFIGNVQQRTFEVLYDV 454
Query: 421 PNSRLGVARELC 432
+G C
Sbjct: 455 GGGAVGFRAGAC 466
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 130/373 (34%), Positives = 189/373 (50%), Gaps = 28/373 (7%)
Query: 82 VVPIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFN 137
V P+ SG + Q S Y + +GTPA LM +DT +D W+ C C C S VF+
Sbjct: 128 VAPVVSG--LAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFD 185
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI- 193
+S ++ +GC A C+++ + C AC + + YG ++ A + + +T++ A
Sbjct: 186 PRRSRSYGAVGCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGAR 245
Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL----PSFKALSF 249
V GC G V GLLGLGRGSLS AQ Y +FSYCL S S
Sbjct: 246 VARIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASH 305
Query: 250 SGSLRL--GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT 306
S ++ G +G +TP++KNPR + YYV L+ I V G RV + L+ +P++
Sbjct: 306 SSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSS 365
Query: 307 G-AGTIIDSGTVFTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAP 360
G G I+DSGT TRL PAY+A+RD FR G L+ FDTCY + + P
Sbjct: 366 GRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVP 425
Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
T+++ F+ G LP +N LI + C A A V ++I N+QQQ R+++D
Sbjct: 426 TVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFD 481
Query: 420 VPNSRLGVARELC 432
R+G + C
Sbjct: 482 GDGQRVGFVPKGC 494
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 132/397 (33%), Positives = 184/397 (46%), Gaps = 23/397 (5%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ---------SPTYIVRAKIGT 105
EE L +D R++ LSSL +++ + S Y R +GT
Sbjct: 78 EELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGT 137
Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTC 162
P + + M +DT +D W+ C C C S VFN +S +F + C+ C+++ +P C
Sbjct: 138 PPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGC 197
Query: 163 GG-GACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGR 220
C + ++YG S +T++ V GC G V GLLGLGR
Sbjct: 198 NQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGR 257
Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY 280
G LS +Q + FSYCL A S S+ G + ++TPLL NPR + YY
Sbjct: 258 GGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYY 317
Query: 281 VNLLAIRVGRR-VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
V LL I VG V I + + T G IID GT TRL PAY A+RD FR S
Sbjct: 318 VELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASS 377
Query: 340 NLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
+ FDTCY + + PT+ L F G +V+LP N LI C A A
Sbjct: 378 LKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAG- 436
Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S L++I N+QQQ R++YD+ +SR+G + C
Sbjct: 437 ---TTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 125/352 (35%), Positives = 179/352 (50%), Gaps = 22/352 (6%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
S Y R +GTPA+++ M DT +D +W+ C+ C C +FN + S++FK L C
Sbjct: 11 SGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACA 70
Query: 151 AAQCKQVPNPTCG-GGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGN 208
++ C ++ C C + ++YG + + S +T+S V GC + G
Sbjct: 71 SSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAMGCGRNNQGL 130
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
GLLGLGRG LS +QT Y S FSYCLP + + + SL GP P++ ++T
Sbjct: 131 FHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPR-RESAIAASLVFGPSAVPEKARFTK 189
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
LL N R + YYV L IRV V+IPP A G I+DSGT +RL PAYTA
Sbjct: 190 LLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTA 249
Query: 329 VRDVFRRRVGSNLTVTSLGG---FDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLI 380
+RD FR S +T S G FDTCY + + P + L F G ++ LP D +L+
Sbjct: 250 LRDAFR----SLVTFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILV 305
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ CLA A + ++I N+QQQ RI D ++G+A + C
Sbjct: 306 NVDDEGTYCLAFAPEEE----AFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 134/392 (34%), Positives = 186/392 (47%), Gaps = 25/392 (6%)
Query: 62 LAKDQARLQFLSSLAVA------RKSVVPIASGRQIT----QSPTYIVRAKIGTPAQTLL 111
L +D AR++ L SLA ++ P S I+ S Y R +GTPA+ +
Sbjct: 100 LVRDAARVKSLISLAATVGGTNLTRARGPGFSSSVISGLAQGSGEYFTRLGVGTPARYVY 159
Query: 112 MAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-- 166
M +DT +D W+ C C+ C S VF+ +S +F N+ C + C+++ P C
Sbjct: 160 MVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPLCRRLDYPGCSTKKQI 219
Query: 167 CAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSL 225
C + ++YG + S +T++ V GC G V GLLGLGRG LS
Sbjct: 220 CLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSF 279
Query: 226 LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLA 285
+Q + S FSYCL A S S+ G + ++TPLL NP+ + YYV LL
Sbjct: 280 PSQIGRRFNSKFSYCLGDRSASSRPSSIVFGDSAISRTTRFTPLLSNPKLDTFYYVELLG 339
Query: 286 IRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
I V G RV I + + T G IIDSGT TRL AY A+RD F +
Sbjct: 340 ISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAP 399
Query: 345 SLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
FDTC+ + + PT+ L F G +V LP N LI C A A
Sbjct: 400 EFSLFDTCFDLSGKTEVKVPTVVLHFRGADVPLPASNYLIPVDNSGSFCFAFAGTA---- 455
Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S L++I N+QQQ R++YD+ SR+G A C
Sbjct: 456 SGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 139/427 (32%), Positives = 206/427 (48%), Gaps = 39/427 (9%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESV--LEMLAKDQARLQFLSSL-------------AVA 78
L V H PCSP + ++P +V E+L +DQAR+ + A A
Sbjct: 71 LGVVHRHGPCSPVQ-ARPRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARA 129
Query: 79 RKSVVPIASGRQIT-QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SST 134
+ V + + R I+ + Y+V +GTPA+ + DT +D +WV C C C
Sbjct: 130 SEQGVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDP 189
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-CAFNLTYGS-STIAANLSQDTISL-AT 191
+F+ + S+T+ + C A +C+++ C + C + + YG S NL +DT++L A+
Sbjct: 190 LFDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSAS 249
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
D +PG+ FGC + G GL GLGR +SL +Q Y F+YCLPS + S G
Sbjct: 250 DTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS--SSSGRG 307
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
L LG G P L + S YY++L+ I+VG R + IP GT+
Sbjct: 308 YLSLG--GAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIP----ATAFAAAGGTV 361
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS 367
IDSGTV TRL AY +R F R + +L DTCY PT+ L F+
Sbjct: 362 IDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFA 421
Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G V+L +L S S CLA AP+ +S + ++ N QQ+ + YDV N R+G
Sbjct: 422 GGATVSLDFTGVLYVSKV-SQACLAF--APNADDSSIAILGNTQQKTFAVAYDVANQRIG 478
Query: 427 VARELCT 433
+ C+
Sbjct: 479 FGAKGCS 485
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 116/348 (33%), Positives = 173/348 (49%), Gaps = 14/348 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y +R +G+P + + +D+ +D WV PCT C + VF+ A S +F + C
Sbjct: 139 SGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCS 198
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
++ C+++ N C G C + + YG S L+ +T++ +V GC + G
Sbjct: 199 SSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMF 258
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
V GLLGLG GS+SL+ Q FSYCL S + +GSL G P + PL
Sbjct: 259 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTDSAGSLEFGRGAMPVGAAWIPL 317
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
++NPR S YY+ L + VG V I Q N G ++D+GT TR+ AY A
Sbjct: 318 IRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAF 377
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
RD F + G+ + + FDTCY+ V + PT++ F+G + TLP N LI
Sbjct: 378 RDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVDD 437
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A AA+P S L++I N+QQ+ +I +D N +G +C
Sbjct: 438 VGTFCFAFAASP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 136/416 (32%), Positives = 195/416 (46%), Gaps = 54/416 (12%)
Query: 56 ESVLEMLAKDQARLQ----FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLL 111
E V + L +D R Q F LA + + V + + + Y++ IGTP +
Sbjct: 47 EFVRDALRRDMHRQQSRSLFGRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPPLSYP 106
Query: 112 MAMDTSNDAAW---VPCTG--CVGCSSTVFNSAQSTTFKNLGCQAA--QCKQV-----PN 159
DT +D W PC+G C + ++N A STTF L C ++ C V P
Sbjct: 107 AIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAGKAPP 166
Query: 160 PTCGGGACAFNLTYGSSTIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQG 214
P C AC +N TYG+ A +T + + VPG FGC ++ + G
Sbjct: 167 PGC---ACMYNQTYGTGWTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNGSAG 223
Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLLKN 272
L+GLGRGSLSL++Q L FSYCL F+ + + +L LGP ++ TP + +
Sbjct: 224 LVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVAS 280
Query: 273 PRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
P + S+ YY+NL I +G + + I P A G IIDSGT T LV AY V
Sbjct: 281 PAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTITSLVNAAYQQV 340
Query: 330 RDVFRRRV------GSNLTVTSLGGFDTCYSVPI------VAPTITLMFSGMNVTLPQDN 377
R + V GS+ T G D CY++P P++TL F G ++ LP D+
Sbjct: 341 RAAVQSLVTLPAIDGSDST-----GLDLCYALPTPTSAPPAMPSMTLHFDGADMVLPADS 395
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+I + + CLAM + + ++ N QQQN ILYDV N L A C+
Sbjct: 396 YMISGSG--VWCLAMR---NQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAPAKCS 446
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 140/452 (30%), Positives = 210/452 (46%), Gaps = 43/452 (9%)
Query: 16 SLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF---- 71
S E + + +S LQV H S S +E + E L +D AR+
Sbjct: 52 SAQEWSETVQGEEKNSIVLQVVH---RDSLSSSSNTSLVKEILQERLKRDAARVDSINAR 108
Query: 72 --LSSLAVARKSVVPIASGRQITQ------------------SPTYIVRAKIGTPAQTLL 111
L+++ V++ + P+ +G I S Y R +GTP +
Sbjct: 109 VQLAAMGVSKAEMKPL-NGSSIDARFDAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTY 167
Query: 112 MAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-C 167
M +DT +D W+ PC C G + +FN A S+T++ + C CK++ C C
Sbjct: 168 MVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCATPLCKKLDISGCRNKRYC 227
Query: 168 AFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
+ ++YG + + S +T++ ++ GC G + GLLGLGRGSLS
Sbjct: 228 EYQVSYGDGSFTVGDFSTETLTFRGQVIRRVALGCGHDNEGLFIGAAGLLGLGRGSLSFP 287
Query: 227 AQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
+QT + FSYCL A + SL G PK +TPLL NP+ + YYV L+ I
Sbjct: 288 SQTGAQFSKRFSYCLVDRSASGTASSLIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGI 347
Query: 287 RV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
V GRR+ IP + + T G IIDSGT TRLV AY+ +RD FR G+ +
Sbjct: 348 SVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGG 407
Query: 346 LGGFDTCYSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
FDTCY + + PT+ F G +++LP N LI + + C A A
Sbjct: 408 FSLFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGG-- 465
Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L++I N+QQQ +R+++D +R+G C
Sbjct: 466 --LSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 119/370 (32%), Positives = 174/370 (47%), Gaps = 27/370 (7%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
PI SG + Y +GTP + + + +DT +D W+ PCT C +FN +
Sbjct: 4 PIFSGLAF-GTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSS 62
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYT- 198
S++FK L C ++ C + C C + YG + L D + L PG
Sbjct: 63 SSSFKVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVV 122
Query: 199 -----FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL-SFSGS 252
GC G G+LGLGRG LS ++ FSYCLP ++ + +
Sbjct: 123 LTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHKST 182
Query: 253 LRLGPIGQPK----RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV-DIPPGALQFNPTTG 307
L G P +K+ P L+NPR ++ YYV + I VG ++ +IP Q +
Sbjct: 183 LVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGN 242
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTIT 363
GTI DSGT TRL A AYTAVRD FR + FDTCY I PT+T
Sbjct: 243 GGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGMNSISVPTVT 302
Query: 364 LMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
F G +++ LP N ++ + +I C A AA+ +VI N+QQQ+ R++YD +
Sbjct: 303 FHFQGDVDMRLPPSNYIVPVSNNNIFCFAFAASMGP-----SVIGNVQQQSFRVIYDNVH 357
Query: 423 SRLGVARELC 432
++G+ + C
Sbjct: 358 KQIGLLPDQC 367
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 134/408 (32%), Positives = 192/408 (47%), Gaps = 48/408 (11%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQITQSP--------------------TYIVRA 101
L +D R++ L+SLA +++GR +T+ P Y +R
Sbjct: 88 LQRDSLRVESLTSLAA-------VSAGRNVTKRPPRSAGGFSGVVISGLSQGSGEYFMRL 140
Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVP 158
+GTPA + M +DT +D W+ C+ C C S VFN A+S TF + C + C+++
Sbjct: 141 GVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRLD 200
Query: 159 NPT-C---GGGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ 213
+ + C AC + ++YG + + S +T++ V GC G V
Sbjct: 201 DSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVALGCGHDNEGLFVGAA 260
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCL----PSFKALSFSGSLRLGPIGQPKRIKYTPL 269
GLLGLGRG LS +QT+N Y FSYCL S + ++ G PK +TPL
Sbjct: 261 GLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKTAVFTPL 320
Query: 270 LKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L NP+ + YY+ LL I V G RV + + + T G IIDSGT TRL AY A
Sbjct: 321 LTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVA 380
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTA 384
+RD FR S FDTC+ + + PT+ F+G V+LP N LI
Sbjct: 381 LRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTGGEVSLPASNYLIPVNN 440
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A + L++I N+QQQ R+ YD+ SR+G C
Sbjct: 441 QGRFCFAFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 136/414 (32%), Positives = 197/414 (47%), Gaps = 35/414 (8%)
Query: 48 PSKPLSWEESVL-EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTP 106
P P + S+L + LA D AR + S + + P+ SG +S Y +GTP
Sbjct: 39 PPPPGAKRGSLLRQRLAADAAR--YASLVDATGRLHSPVFSGIPF-ESGEYFALVGVGTP 95
Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC- 162
+ ++ +DT +D W+ C+ C C VF+ +S+T++ + C + QC+ + P C
Sbjct: 96 STKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCD 155
Query: 163 ----GGGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKATGNSVPPQGLL 216
GG C + + YG S+ +L+ D ++ A D V T GC + G GLL
Sbjct: 156 SGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLL 215
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGPIGQPKRIKYTPLLKNPRR 275
G+GRG +S+ Q Y S F YCL + S S L G +P +T LL NPRR
Sbjct: 216 GVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRR 275
Query: 276 SSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVF 333
SLYYV++ V G RV +L + TG G ++DSGT +R AY A+RD F
Sbjct: 276 PSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAF 335
Query: 334 RRRVGSNLTVTSLGG---FDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLI----- 380
R + G FD CY + AP I L F+ G ++ LP +N +
Sbjct: 336 DARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGG 395
Query: 381 -HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
A CL AA D L+VI N+QQQ R+++DV R+G A + CT
Sbjct: 396 RRRAASYRRCLGFEAADDG----LSVIGNVQQQGFRVVFDVEKERIGFAPKGCT 445
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 142/429 (33%), Positives = 201/429 (46%), Gaps = 34/429 (7%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD-----------QARLQFLSSLAVARKS 81
++QV H S + S+E + E L +D + RL+ A + ++
Sbjct: 115 SVQVVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHEN 174
Query: 82 VVPIAS--GRQITQ-----SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST 134
V +A+ G ++ S Y R +GTP + M +DT +D W+ C C C S
Sbjct: 175 VAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQ 234
Query: 135 V---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA 190
V FN + S +F LGC +A C + C GG C + ++YG S + + + ++
Sbjct: 235 VDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVSYGDGSYTIGSFATEMLTFG 294
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
T V GC G V GLLGLG G LS +Q FSYCL + S S
Sbjct: 295 TTSVRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCLVDRFSES-S 353
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD-IPPGALQFNPTTG-A 308
G+L GP P TPLL NP + YYV L++I VG ++D +PP + + T+G
Sbjct: 354 GTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRG 413
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS---VPIV-APTITL 364
G I+DSGT TRL P Y AVRD F + FDTCY +P+V PT+
Sbjct: 414 GFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVF 473
Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
FS G ++ LP N +I C A A A S L+++ N+QQQ R+ +D NS
Sbjct: 474 HFSNGASLILPAKNYMIPMDFMGTFCFAFAPA----TSDLSIMGNIQQQGIRVSFDTANS 529
Query: 424 RLGVARELC 432
+G A C
Sbjct: 530 LVGFALRQC 538
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 131/432 (30%), Positives = 213/432 (49%), Gaps = 50/432 (11%)
Query: 26 DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPI 85
D ++ ++ + H PC+P S S E S+ E L + +AR +++ S A +P
Sbjct: 53 DEGSNTVSVPLVHRHGPCAPSTRS---SDEPSLSERLRRSRARSKYIMSRASKSNVSIPT 109
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQ 140
G + S Y+V +GTPA + ++ +DT +D +WV C T C +F+ ++
Sbjct: 110 HLGGSV-DSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSR 168
Query: 141 STTFKNLGCQAAQCKQVPNP---------TCGGGACAFNLTYGSSTIAANL-SQDTISLA 190
S+T+ + C C+ + + GG C + +TYG + + S +T+++A
Sbjct: 169 SSTYAPIPCNTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMA 228
Query: 191 TDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
+ V + FGC G + GLLGLG SL+ QT ++Y FSYCLP+ A
Sbjct: 229 PGVTVKDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPA--ANDQ 286
Query: 250 SGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
+G L LG P+ +TP+++ + + Y VN+ I VG +D+PP A
Sbjct: 287 AGFLALGAPVNDASGFVFTPMVR--EQQTFYVVNMTGITVGGEPIDVPPSAFS------G 338
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITL 364
G IIDSGTV T L AY A++ FR+ + + + + G DTCY+ + P + L
Sbjct: 339 GMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPN-GELDTCYNFTGHSNVTVPRVAL 397
Query: 365 MFSG---MNVTLPQDNLLIHSTAGSITCLAM-AAAPDNVNSVLNVIANMQQQNHRILYDV 420
FSG +++ +P D +L+ + CLA A PDN +L N+ Q+ +LYDV
Sbjct: 398 TFSGGATVDLDVP-DGILLDN------CLAFQEAGPDNQPGIL---GNVNQRTLEVLYDV 447
Query: 421 PNSRLGVARELC 432
+ R+G + C
Sbjct: 448 GHGRVGFGADAC 459
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 133/433 (30%), Positives = 198/433 (45%), Gaps = 46/433 (10%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGR 89
V H PCSP + + +LE DQAR+ + + +VV P G
Sbjct: 22 VMHRHGPCSPLQTPDDAPSDADLLE---HDQARVDSIHRMIANETAVVGQDVSLPAERGI 78
Query: 90 QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCT--GCVGCSSTVFNSAQSTTF 144
+ Y+V +GTPA+ L + DT +D +WV PC+ GC +F + S+TF
Sbjct: 79 SVGTG-NYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTF 137
Query: 145 KNLGCQAAQC---KQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT--------- 191
+ C +C +Q + + G C + + YG S +L DT++L T
Sbjct: 138 SAVRCGEPECPRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASEN 197
Query: 192 --DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
+ +PG+ FGC + TG GL GLGRG +SL +Q Y FSYCLPS + +
Sbjct: 198 NSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPS-SSSNA 256
Query: 250 SGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G L LG P P ++TP+L S YYV L+ IRV R + + + A
Sbjct: 257 HGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWP----A 312
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVPIVA------P 360
G I+DSGTV TRL AY+A+R F +G L DTCY A P
Sbjct: 313 GLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIP 372
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ L+F+G + +++ + CLA AP+ ++ N QQ+ ++YDV
Sbjct: 373 AVALVFAGGATISVDFSGVLYVAKVAQACLAF--APNGNGRSAGILGNTQQRTVAVVYDV 430
Query: 421 PNSRLGVARELCT 433
++G A + C+
Sbjct: 431 GRQKIGFAAKGCS 443
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 136/435 (31%), Positives = 203/435 (46%), Gaps = 63/435 (14%)
Query: 31 SSTLQVFHVFSPCSPFKP--SKPLSWEESVLEMLAKDQARLQFL----SSLAVARKSVVP 84
++ + + H PCSP SKP S +E +LA DQ R + + S+ A +R P
Sbjct: 88 TTRMTIVHRHGPCSPLAAAHSKPPSHDE----ILAADQNRAESIQHRVSTTATSRGQ--P 141
Query: 85 IASGRQ------------------ITQSP-------TYIVRAKIGTPAQTLLMAMDTSND 119
S RQ + SP Y+V +GTPA + DT +D
Sbjct: 142 KRSRRQQPSSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYTVVFDTGSD 201
Query: 120 AAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS 175
WV C CV +F+ A+S+T+ N+ C A C + C GG C + + YG
Sbjct: 202 TTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPACSDLDTRGCSGGHCLYGVQYGD 261
Query: 176 STIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
+ + + DT++L++ D V G+ FGC ++ G GLLGLGRG SL QT + Y
Sbjct: 262 GSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKY 321
Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
F++CLP+ + +G L G R+ TP+L + + YYV L IRVG R++
Sbjct: 322 GGVFAHCLPARS--TGTGYLDFGAGSPAARLTTTPMLVD-NGPTFYYVGLTGIRVGGRLL 378
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF----RRRVGSNLTVTSLGGF 349
IP AGTI+DSGTV TRL AY+++R F R SL
Sbjct: 379 YIPQSVFAT-----AGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSL--L 431
Query: 350 DTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
DTCY + PT++L+F G + ++++ + S CLA AA D + + +
Sbjct: 432 DTCYDFAGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGD--VGI 489
Query: 406 IANMQQQNHRILYDV 420
+ N Q + + YD+
Sbjct: 490 VGNTQLKTFGVAYDI 504
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 123/349 (35%), Positives = 168/349 (48%), Gaps = 14/349 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
S Y R +GTP + + M +DT +D W+ C C C S VFN +S +F + C+
Sbjct: 39 SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCR 98
Query: 151 AAQCKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
C+++ +P C C + ++YG S +T++ V GC G
Sbjct: 99 TPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGL 158
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
V GLLGLGRG LS +Q + FSYCL A S S+ G + ++TP
Sbjct: 159 FVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTP 218
Query: 269 LLKNPRRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
LL NPR + YYV LL I VG V I + + T G IID GT TRL PAY
Sbjct: 219 LLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYI 278
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHST 383
A+RD FR S + FDTCY + + PT+ L F G +V+LP N LI
Sbjct: 279 ALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVD 338
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A S L++I N+QQQ R++YD+ +SR+G + C
Sbjct: 339 GSGRFCFAFAG----TTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 132/457 (28%), Positives = 218/457 (47%), Gaps = 59/457 (12%)
Query: 22 NPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK- 80
NP + ++L+V + PC+ ++ + ++ E+LA DQAR+ + + +
Sbjct: 60 NPATKGKRRGASLEVVNRQGPCTLL--NQKGAKAPTLTEILAHDQARVDSIQARITDQSY 117
Query: 81 ---------------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
+ +P SG + + YIV +GTP + L + DT +D
Sbjct: 118 DLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLG-TGNYIVNVGLGTPKKDLSLIFDTGSD 176
Query: 120 AAWVPCTGCV-GCSST---VFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFN 170
W C CV C + +F+ + S T+ N+ C +A C + + P C C +
Sbjct: 177 LTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYG 236
Query: 171 LTYGSSTIAANL-SQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
+ YG S+ ++D ++L D+ G+ FGC Q G GL+GLGR LS++ Q
Sbjct: 237 IQYGDSSFTIGFFAKDKLTLTQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQ 296
Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIK----YTPLLKNPRRSSLYYV 281
T + FSYCLP+ + +G L G + K +K +TP + + ++ Y++
Sbjct: 297 TAQKFGKYFSYCLPTSRGS--NGHLTFGNGNGVKASKAVKNGITFTP-FASSQGTAYYFI 353
Query: 282 NLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL 341
++L I VG + + I P Q AGTIIDSGTV TRL + AY +++ F++ +
Sbjct: 354 DVLGISVGGKALSISPMLFQ-----NAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYP 408
Query: 342 TVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAP 396
T +L DTCY + I P I+ F+G NV L + +LI + A + CLA A
Sbjct: 409 TAPALSLLDTCYDLSNYTSISIPKISFNFNGNANVELDPNGILITNGASQV-CLAFAGNG 467
Query: 397 DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
D + + + N+QQQ ++YDV +LG + C+
Sbjct: 468 D--DDSIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 135/419 (32%), Positives = 205/419 (48%), Gaps = 42/419 (10%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEES-VLEMLAKDQARLQFL-----------SSLAVAR 79
++L+V H PCS + ++ E+L +D+ R++++ SS++
Sbjct: 69 ASLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELD 128
Query: 80 KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT----GCVGCSSTV 135
+P SG I S Y V +GTP + L + DT +D W C C +
Sbjct: 129 SVTLPAKSGSLI-GSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAI 187
Query: 136 FNSAQSTTFKNLGCQAAQCKQVP-----NPTCGGG--ACAFNLTYGSSTIAAN-LSQDTI 187
F+ ++ST++ N+ C + C Q+ P C AC + + YG S+ + S++ +
Sbjct: 188 FDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERL 247
Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
S+ ATDIV + FGC Q G GL+GLGR +S + QT +Y+ FSYCLP+
Sbjct: 248 SVTATDIVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCLPA--T 305
Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
S +G L G +KYTP R SS Y +++ I VG +P + F +T
Sbjct: 306 SSSTGRLSFGTT-TTSYVKYTPFSTISRGSSFYGLDITGISVGG--AKLPVSSSTF--ST 360
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTI 362
G G IIDSGTV TRL AYTA+R FR+ + + L DTCY + P I
Sbjct: 361 G-GAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKI 419
Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
F+ G+ V LP +L ++A + CLA AA D +S + + N+QQ+ ++YDV
Sbjct: 420 DFSFAGGVTVQLPPQGILYVASAKQV-CLAFAANGD--DSDVTIYGNVQQKTIEVVYDV 475
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 126/378 (33%), Positives = 176/378 (46%), Gaps = 33/378 (8%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQ 140
P+ SG S Y +G P L+ +DT +D W+ C C C V ++
Sbjct: 76 PVMSGVPF-DSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRS 134
Query: 141 STTFKNLGCQAAQCKQVPN-PTCGG--GACAFNLTYGS-STIAANLSQDTISLATDI-VP 195
S+T + + C + +C+ V P C G C + + YG S + +L+ D + D V
Sbjct: 135 SSTHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVH 194
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS--L 253
T GC G GLLG+GRG LS Q Y FSYCL + + +GS L
Sbjct: 195 NVTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYL 254
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTI 311
G +P +TPL NPRR SLYYV+++ V G RV +L NP TG G +
Sbjct: 255 VFGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIV 314
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG----FDTCYSV--------PIVA 359
+DSGT +R AY AVRD F + T+ L FD CY + +
Sbjct: 315 VDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRV 374
Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAG---SITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P+I L F+ G ++ LPQ N LI G + CL + AA D LNV+ N+QQQ
Sbjct: 375 PSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDG----LNVLGNVQQQGFG 430
Query: 416 ILYDVPNSRLGVARELCT 433
+++DV R+G C+
Sbjct: 431 LVFDVERGRIGFTPNGCS 448
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 137/426 (32%), Positives = 203/426 (47%), Gaps = 37/426 (8%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEE-SVLEMLAKDQARLQFLSSL-------------AVAR 79
L V H PCSP + + + E+L +DQAR+ + A A
Sbjct: 71 LGVVHRHGPCSPVQARRRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARAS 130
Query: 80 KSVVPIASGRQIT-QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTV 135
+ V + + R I+ + Y+V +GTPA+ + DT +D +WV C C C +
Sbjct: 131 EQGVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPL 190
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-CAFNLTYGS-STIAANLSQDTISL-ATD 192
F+ + S+T+ + C A +C+++ C + C + + YG S NL +DT++L A+D
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250
Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
+PG+ FGC + G GL GLGR +SL +Q Y F+YCLPS + S G
Sbjct: 251 TLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS--SSSGRGY 308
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
L LG G P L + S YY++L+ I+VG R + IP GT+I
Sbjct: 309 LSLG--GAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIP----ATAFAAAGGTVI 362
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS- 367
DSGTV TRL AY +R F R + +L DTCY PT+ L F+
Sbjct: 363 DSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAG 422
Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
G V+L +L S S CLA AP+ +S + ++ N QQ+ + YDV N R+G
Sbjct: 423 GATVSLDFTGVLYVSKV-SQACLAF--APNADDSSIAILGNTQQKTFAVTYDVANQRIGF 479
Query: 428 ARELCT 433
+ C+
Sbjct: 480 GAKGCS 485
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 133/403 (33%), Positives = 195/403 (48%), Gaps = 33/403 (8%)
Query: 55 EESVL-EMLAKDQARLQFLSSLA-VARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
EE +L L + AR+ L SLA +A + A + Y++ IGTP +
Sbjct: 46 EEQLLSRALRRSSARVATLQSLAALAPGDAITAARILVLASDGEYLMEMGIGTPTRYYSA 105
Query: 113 AMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAF 169
+DT +D W PC CV + F+ A+S T+++LGC + C + P C C +
Sbjct: 106 ILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVCVY 165
Query: 170 NLTYG-SSTIAANLSQDTISLATDI----VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
YG S++ A L+ +T + T+ +PG +FGC G+ G++G GRGSLS
Sbjct: 166 QYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGSLANGSGMVGFGRGSLS 225
Query: 225 LLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKR-IKYTPLLKNPRRSSL 278
L++Q L FSYCL SF + L F L ++ TP + NP ++
Sbjct: 226 LVSQ---LGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTM 282
Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
Y++N+ I VG ++ I P N T G GTIIDSGT T L PAY AVR F ++
Sbjct: 283 YFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQI 342
Query: 338 G-SNLTVTSLGGFDTCYSVP------IVAPTITLMFSGMNVTLP-QDNLLIHSTAGSITC 389
L VT DTC+ P + P + L F G + LP Q+ +L+ + G C
Sbjct: 343 TLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLC 402
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LAMA++ S ++I + Q QN +LYD+ NS + C
Sbjct: 403 LAMASS-----SDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 169/373 (45%), Gaps = 34/373 (9%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
P+ASG Y+ +GTPA+ + DT +D W+ PC C +F+
Sbjct: 32 PVASG-----GGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEG 86
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATD-----IV 194
S+++ + C C +P +C C ++ YG S LS +T++L +
Sbjct: 87 SSSYTTMSCGDTLCDSLPRKSCSP-DCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAA 145
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
FGC G+ GL+GLGRG+LS ++Q +L+ FSYCL P A S + +
Sbjct: 146 KNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205
Query: 254 RLGPI------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
G G+ +TP++ NP S YYV L I + R + IP G+ P
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-------PIVAP 360
G I DSGT T L Y V R ++ S G D CY V + P
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASYKMKIP 325
Query: 361 TITLMFSGMNVTLPQDNLLIHST-AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
+ F G + LP +N I + AG+I CLAM ++ N + + NM QQN R++YD
Sbjct: 326 AMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSS----NMDIGIYGNMMQQNFRVMYD 381
Query: 420 VPNSRLGVARELC 432
+ +S++G A C
Sbjct: 382 IGSSKIGWAPSQC 394
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 126/401 (31%), Positives = 193/401 (48%), Gaps = 36/401 (8%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQ 108
K L+ E + + + RLQ L ++ V P+ +G Y++ IGTPAQ
Sbjct: 52 KNLTKFELLERAVERGSRRLQRLEAMLNGPSGVETPVYAGDG-----EYLMNLSIGTPAQ 106
Query: 109 TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
MDT +D W PCT C S+ +FN S++F L C + C+ + +PTC
Sbjct: 107 PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSNN 166
Query: 166 ACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPPQGLLGLGRGSL 223
+C + YG S ++ +T++ + +P TFGC + G GL+G+GRG L
Sbjct: 167 SCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPL 226
Query: 224 SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI------GQPKRIKYTPLLKNPRRSS 277
SL +Q L + FSYC+ + S S +L LG + G P T L+++ + +
Sbjct: 227 SLPSQ---LDVTKFSYCMTPIGS-SNSSTLLLGSLANSVTAGSPN----TTLIQSSQIPT 278
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRR 336
YY+ L + VG + I P + N G G IIDSGT T V AY AVR F +
Sbjct: 279 FYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQ 338
Query: 337 VGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLA 391
+ ++ S GFD C+ +P + PT + F G ++ LP +N I + G I CLA
Sbjct: 339 MNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENYFISPSNGLI-CLA 397
Query: 392 MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
M ++ +++ N+QQQN ++YD NS + C
Sbjct: 398 MGSSSQG----MSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 135/420 (32%), Positives = 206/420 (49%), Gaps = 37/420 (8%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---------LAVARKSV 82
S+L+V H+ CS + +E ++ +DQAR++ + S ++ A+ +
Sbjct: 63 SSLRVVHMHGACSHLSSDARVDHDE----IIRRDQARVESIYSKLSKNSANEVSEAKSTE 118
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-CSSTV---FNS 138
+P SG + S YIV IGTP L + DT +D W C C+G C S FN
Sbjct: 119 LPAKSGITL-GSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 177
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPG 196
+ S+T++N+ C + C+ +C C +++ YG + L+++ +L +D++
Sbjct: 178 SSSSTYQNVSCSSPMCEDAE--SCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLED 235
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
FGC + G GLLGLG G LSL AQT Y + FSYCLPSF + S +G L G
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNS-TGHLTFG 294
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
G + +K+TP+ P + Y ++++ I VG + + I P N + G IIDSGT
Sbjct: 295 SAGISESVKFTPISSFPSAFN-YGIDIIGISVGDKELAITP-----NSFSTEGAIIDSGT 348
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVT 372
VFTRL Y +R VF+ ++ S + + G FDTCY + PTI F+G V
Sbjct: 349 VFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVV 408
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + S CLA A D + + N+QQ ++YDV R+G A C
Sbjct: 409 ELDGSGISLPIKISQVCLAFAGNDD----LPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 136/465 (29%), Positives = 218/465 (46%), Gaps = 73/465 (15%)
Query: 25 CDT---QDHSST-----LQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL-- 72
CDT +H ++ + + H PCSP + KP S +E +LA DQ R++ +
Sbjct: 73 CDTPREHEHGASSSGTRMTIVHRHGPCSPLADAHGKPPSHDE----ILAADQNRVESIHH 128
Query: 73 --SSLAVARKS----------------------------VVPIASGRQITQSPTYIVRAK 102
S+ A R +P +SGR + Y+V
Sbjct: 129 RVSTTATVRGKPKRRPSPSRRQQQPSAPAPAASLSSSTASLPASSGRALGTG-NYVVTIG 187
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCV----GCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
+GTPA + DT +D WV C CV +F+ A+S+T+ N+ C A C +
Sbjct: 188 LGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAPACSDLY 247
Query: 159 NPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLL 216
C GG C +++ YG + + + DT++L++ D V G+ FGC ++ G GLL
Sbjct: 248 TRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLL 307
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI---KYTPLLKNP 273
GLGRG SL QT + Y F++CLP+ S +G L GP G P + + TP+L +
Sbjct: 308 GLGRGKTSLPVQTYDKYGGVFAHCLPARS--SGTGYLDFGP-GSPAAVGARQTTPMLTD- 363
Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
+ YYV + IRVG +++ IP + AGTI+DSGTV TRL AY+++R F
Sbjct: 364 NGPTFYYVGMTGIRVGGQLLSIPQSVF-----STAGTIVDSGTVITRLPPAAYSSLRSAF 418
Query: 334 RRRVGSN--LTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
+ + +L DTCY + P ++L+F G + ++++ + S
Sbjct: 419 ASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASLSQ 478
Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CL AA D+ + + ++ N Q + ++YD+ +G + C
Sbjct: 479 VCLGFAANEDDDD--VGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 136/430 (31%), Positives = 210/430 (48%), Gaps = 48/430 (11%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS-------- 81
+ STL + H PCSP + S EE+ L +DQ R ++ + +R +
Sbjct: 56 NGSTLALSHRHGPCSPVISKEKPSHEET----LRRDQLRAAYIQAKVSSRYNNVAKELQQ 111
Query: 82 ---VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--CSS--- 133
+P +SG + + Y++ IGTPA T +M++DT +D +WV C C CSS
Sbjct: 112 SAVTIPTSSGYSLGTTE-YVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKD 170
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGACAFNLTYGS-STIAANLSQDTISL- 189
+F+ A S T+ C +AQC Q+ + C C + + YG S A DT+SL
Sbjct: 171 KLFDPAMSATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLT 230
Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
++D V + FGC +A G GL+GLG + SL++QT Y FSYCLP + S
Sbjct: 231 SSDAVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPP-PSSSG 289
Query: 250 SGSLRLGPIG--QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
G L LG G R +TP+++ + Y V L I V ++++P +G
Sbjct: 290 GGFLTLGAAGGASSSRYSHTPMVRF-SVPTFYGVFLQGITVAGTMLNVPASVF-----SG 343
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTIT 363
A +++DSGTV T+L AY A+R F++ + + + +G DTC+ I PT+T
Sbjct: 344 A-SVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVT 402
Query: 364 LMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
L FS G + L +L AG CLA A + ++ ++ N+QQ+ +L+DV
Sbjct: 403 LTFSRGAAMDLDISGILY---AG---CLAFTATAHDGDT--GILGNVQQRTFEMLFDVGG 454
Query: 423 SRLGVARELC 432
+G C
Sbjct: 455 RTIGFRSGAC 464
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 121/379 (31%), Positives = 178/379 (46%), Gaps = 32/379 (8%)
Query: 72 LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
+SS AVA VP+ +G +++ IGTPA +DT +D W C CV C
Sbjct: 82 MSSKAVAPALQVPVHAGNG-----EFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVEC 136
Query: 132 ---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTI 187
S+ VF+ + S+T+ L C + C +P+ C C + TYG SS+ L+ +T
Sbjct: 137 FNQSTPVFDPSSSSTYAALPCSSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETF 196
Query: 188 SLATDIVPGYTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
+LA +P FGC G+ GL+GLGRG LSL++Q L + FSYCL S
Sbjct: 197 TLAKTKLPDVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGLNKFSYCLTSLDD 253
Query: 247 LSFSGSLRLGPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
S S L LG + ++ TPL++NP + S YYVNL + VG + +P A
Sbjct: 254 TSKS-PLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSA 312
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP--- 356
G I+DSGT T L Y A++ F ++ S G DTC+ P
Sbjct: 313 FAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASG 372
Query: 357 ---IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ P + G ++ LP +N ++ + CL + + L++I N QQQN
Sbjct: 373 VDQVEVPKLVFHLDGADLDLPAENYMVLDSGSGALCLTVMGSRG-----LSIIGNFQQQN 427
Query: 414 HRILYDVPNSRLGVARELC 432
+ +YDV + L A C
Sbjct: 428 IQFVYDVGENTLSFAPVQC 446
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 145/442 (32%), Positives = 199/442 (45%), Gaps = 47/442 (10%)
Query: 26 DTQDHSSTLQ--VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-SSLAVARKSV 82
D +TL V H + P + + P S+ A A+L+ L S+ A A
Sbjct: 22 DATQRPTTLHIPVVHRDAVFPPRRGAPPGSFR---CRHAAPHTAQLESLHSATAAADLLR 78
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSA 139
P+ SG S Y +G P L+ +DT +D W+ C C C V ++
Sbjct: 79 SPVMSGVPF-DSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPR 137
Query: 140 QSTTFKNLGCQAAQCKQVPN-PTCGG--GACAFNLTYGS-STIAANLSQDTISLATDI-V 194
S T + + C + QC+ V P C G C + + YG S + +L+ DT+ L D V
Sbjct: 138 NSKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTRV 197
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF--KALSFSGS 252
T GC G GLLG GRG LS Q Y FSYCL +A + S
Sbjct: 198 HNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSY 257
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGT 310
L G + +TPL NPRR SLYYV+++ V G RV +L NP TG G
Sbjct: 258 LVFGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGV 317
Query: 311 IIDSGTVFTRLVAPAYTAVRDVF--------RRRVGSNLTVTSLGGFDTCYSVP------ 356
++DSGT +R AY AVRD F RR+ + +V FDTCY V
Sbjct: 318 VVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSV-----FDTCYDVHGNGPGT 372
Query: 357 -IVAPTITLMF-SGMNVTLPQDNLLIHSTAG---SITCLAMAAAPDNVNSVLNVIANMQQ 411
+ P+I L F + ++ LPQ N LI G + CL + AA D LNV+ N+QQ
Sbjct: 373 GVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDG----LNVLGNVQQ 428
Query: 412 QNHRILYDVPNSRLGVARELCT 433
Q +++DV R+G C+
Sbjct: 429 QGFGVVFDVERGRIGFTPNGCS 450
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 115/389 (29%), Positives = 187/389 (48%), Gaps = 26/389 (6%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
E + + + + RLQ LS+ + +S V P+ +G ++++ IGTPA+T
Sbjct: 59 ERLQRAMKRGKLRLQRLSAKTASFESSVEAPVHAGNG-----EFLMKLAIGTPAETYSAI 113
Query: 114 MDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
MDT +D W PC C + +F+ +S++F L C + C +P +C G C +
Sbjct: 114 MDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSDG-CEYL 172
Query: 171 LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQ 228
+YG S+ L+ +T + V FGC + G+ GL+GLGRG LSL++Q
Sbjct: 173 YSYGDYSSTQGVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQ 232
Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
L + FSYCL S SL +G K TPL++NP + S YY++L I V
Sbjct: 233 ---LGEPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGISV 289
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
G ++ I G IIDSGT T L A+ A++ F ++ ++ + G
Sbjct: 290 GDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTG 349
Query: 349 FDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
D C+++P + P + F G ++ LP +N +I + + CL M ++ S +
Sbjct: 350 LDLCFTLPPDASTVDVPQLVFHFEGADLKLPAENYIIADSGLGVICLTMGSS-----SGM 404
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
++ N QQQN +L+D+ + A C
Sbjct: 405 SIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 137/451 (30%), Positives = 208/451 (46%), Gaps = 70/451 (15%)
Query: 34 LQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS------ 81
+ + H PCSP + KP S E+ +LA DQ R + + S+ A AR +
Sbjct: 86 MTIVHRHGPCSPLAAAHGKPPSHED----ILAADQNRAESIQHRVSTTATARGNPKRSRR 141
Query: 82 -----------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
+P +SGR + Y+V +GTPA + DT +
Sbjct: 142 APSRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPASRYTVVFDTGS 200
Query: 119 DAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG 174
D WV C CV +F+ A+S+T+ N+ C A C + C GG C + + YG
Sbjct: 201 DTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAPACFDLDTRGCSGGHCLYGVQYG 260
Query: 175 SSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
+ + + DT++L++ D V G+ FGC ++ G GLLGLGRG SL QT +
Sbjct: 261 DGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDK 320
Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY---TPLLKNPRRSSLYYVNLLAIRVG 289
Y F++CLP+ S +G L GP G P TP+L + + YYV + IRVG
Sbjct: 321 YGGVFAHCLPARS--SGTGYLDFGP-GSPAAAGARLTTPMLTD-NGPTFYYVGMTGIRVG 376
Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF----RRRVGSNLTVTS 345
+++ IP AGTI+DSGTV TRL PAY+++R F R S
Sbjct: 377 GQLLSIPQSVFAT-----AGTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVS 431
Query: 346 LGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
L DTCY + PT++L+F G + + ++++ + S CL AA D +
Sbjct: 432 L--LDTCYDFTGMSQVAIPTVSLLFQGGAILDVDASGIMYAASVSQVCLGFAANEDGGD- 488
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ ++ N Q + + YD+ +G + C
Sbjct: 489 -VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 135/420 (32%), Positives = 206/420 (49%), Gaps = 37/420 (8%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---------LAVARKSV 82
S+L+V H+ CS + +E ++ +DQAR++ + S ++ A+ +
Sbjct: 63 SSLRVVHMHGACSHLSSDARVDHDE----IIRRDQARVESIYSKLSKNSANEVSEAKSTE 118
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-CSSTV---FNS 138
+P SG + S YIV IGTP L + DT +D W C C+G C S FN
Sbjct: 119 LPAKSGITL-GSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 177
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPG 196
+ S+T++N+ C + C+ +C C +++ YG + L+++ +L +D++
Sbjct: 178 SSSSTYQNVSCSSPMCEDAE--SCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSDVLED 235
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
FGC + G GLLGLG G LSL AQT Y + FSYCLPSF + S +G L G
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNS-TGHLTFG 294
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
G + +K+TP+ P + Y ++++ I VG + + I P N + G IIDSGT
Sbjct: 295 SAGISESVKFTPISSFPSAFN-YGIDIIGISVGDKELAITP-----NSFSTEGAIIDSGT 348
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVT 372
VFTRL Y +R VF+ ++ S + + G FDTCY + PTI F+G V
Sbjct: 349 VFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVV 408
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + S CLA A D + + N+QQ ++YDV R+G A C
Sbjct: 409 ELDGSGISLPIKISQVCLAFAGNDD----LPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 131/467 (28%), Positives = 217/467 (46%), Gaps = 71/467 (15%)
Query: 18 SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
S N + ++L+V + PC+ ++ + ++ E+LA DQAR+ + +
Sbjct: 56 SSSCNTATKGKRRGASLEVVNRQGPCTQL--NQKGAKAPTLTEILAHDQARVDSIQARVT 113
Query: 78 AR----------------------------KSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
+ +S +P+ +G YIV +GTP +
Sbjct: 114 DQSYDLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGN-------YIVNVGLGTPKKD 166
Query: 110 LLMAMDTSNDAAWVPCTGCV-GCSST---VFNSAQSTTFKNLGCQAAQCKQVPN-----P 160
L + DT +D W C CV C + +F+ + S T+ N+ C + C + + P
Sbjct: 167 LSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTACSGLKSATGNSP 226
Query: 161 TCGGGACAFNLTYGSSTIAANL-SQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGL 218
C C + + YG S+ ++DT++L D+ G+ FGC Q G GL+GL
Sbjct: 227 GCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDVFDGFMFGCGQNNRGLFGKTAGLIGL 286
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIK----YTPLLK 271
GR LS++ QT + FSYCLP+ + +G L G + K +K +TP
Sbjct: 287 GRDPLSIVQQTAQKFGKYFSYCLPTSRGS--NGHLTFGNGNGVKTSKAVKNGITFTP-FA 343
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
+ + ++ Y++++L I VG + + I P Q AGTIIDSGTV TRL + Y +++
Sbjct: 344 SSQGATFYFIDVLGISVGGKALSISPMLFQ-----NAGTIIDSGTVITRLPSTVYGSLKS 398
Query: 332 VFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDNLLIHSTAGS 386
F++ + T +L DTCY + I P I+ F+G NV L + +LI + A
Sbjct: 399 TFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNFNGNANVDLEPNGILITNGASQ 458
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ CLA A D + + + N+QQQ ++YDV +LG + C+
Sbjct: 459 V-CLAFAGNGD--DDTIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 135/414 (32%), Positives = 195/414 (47%), Gaps = 35/414 (8%)
Query: 48 PSKPLSWEESVL-EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTP 106
P P + S+L + LA D AR + S + + P+ SG +S Y +GTP
Sbjct: 39 PPPPGAKRGSLLRQRLAADAAR--YASLVDATGRLHSPVFSGIPF-ESGEYFALVGVGTP 95
Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC- 162
+ ++ +DT +D W+ C+ C C VF+ +S+T++ + C + QC+ + P C
Sbjct: 96 STKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCD 155
Query: 163 ----GGGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKATGNSVPPQGLL 216
GG C + + YG S+ L+ D ++ A D V T GC + G GLL
Sbjct: 156 SGGAAGGGCRYMVAYGDGSSSTGELATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLL 215
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGPIGQPKRIKYTPLLKNPRR 275
G+ RG +S+ Q Y S F YCL + S S L G +P +T LL NPRR
Sbjct: 216 GVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRR 275
Query: 276 SSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVF 333
SLYYV++ V G RV +L + TG G ++DSGT +R AY A+RD F
Sbjct: 276 PSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAF 335
Query: 334 RRRVGSNLTVTSLGG---FDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLI----- 380
R + G FD CY + AP I L F+ G ++ LP +N +
Sbjct: 336 DARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGG 395
Query: 381 -HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
A CL AA D L+VI N+QQQ R+++DV R+G A + CT
Sbjct: 396 RRRAASYRRCLGFEAADDG----LSVIGNVQQQGFRVVFDVEKERIGFAPKGCT 445
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 168/373 (45%), Gaps = 34/373 (9%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
P+ASG Y+ +GTPA+ + DT +D W+ PC C +F+
Sbjct: 32 PVASG-----GGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEG 86
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATD-----IV 194
S+++ + C C +P +C C ++ YG S LS +T++L +
Sbjct: 87 SSSYTTMSCGDTLCDSLPRKSCSPN-CDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAA 145
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
FGC G+ GL+GLGRG+LS ++Q +L+ FSYCL P A S + +
Sbjct: 146 KNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205
Query: 254 RLGPI------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
G G+ +TP++ NP S YYV L I + R + IP G+ P
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-------PIVAP 360
G I DSGT T L Y V R +V S G D CY V P
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASYKKKIP 325
Query: 361 TITLMFSGMNVTLPQDNLLIHST-AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
+ F G + LP +N I + AG+I CLAM ++ N + + NM QQN R++YD
Sbjct: 326 AMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSS----NMDIGIYGNMMQQNFRVMYD 381
Query: 420 VPNSRLGVARELC 432
+ +S++G A C
Sbjct: 382 IGSSKIGWAPSQC 394
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 133/403 (33%), Positives = 194/403 (48%), Gaps = 33/403 (8%)
Query: 55 EESVL-EMLAKDQARLQFLSSLA-VARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLM 112
EE +L L + AR+ L SLA +A + A + Y++ IGTP +
Sbjct: 46 EEQLLSRALRRSSARVATLQSLAALAPGDAITAARILVLASDGEYLMEMGIGTPTRYYSA 105
Query: 113 AMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAF 169
+DT +D W PC CV + F+ A+S T+++LGC + C + P C C +
Sbjct: 106 ILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVCVY 165
Query: 170 NLTYG-SSTIAANLSQDTISLATDI----VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
YG S++ A L+ +T + T+ +PG +FGC G G++G GRGSLS
Sbjct: 166 QYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGLLANGSGMVGFGRGSLS 225
Query: 225 LLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKR-IKYTPLLKNPRRSSL 278
L++Q L FSYCL SF + L F L ++ TP + NP ++
Sbjct: 226 LVSQ---LGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTM 282
Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
Y++N+ I VG ++ I P N T G GTIIDSGT T L PAY AVR F ++
Sbjct: 283 YFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQI 342
Query: 338 G-SNLTVTSLGGFDTCYSVP------IVAPTITLMFSGMNVTLP-QDNLLIHSTAGSITC 389
L VT DTC+ P + P + L F G + LP Q+ +L+ + G C
Sbjct: 343 TLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLC 402
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LAMA++ S ++I + Q QN +LYD+ NS + C
Sbjct: 403 LAMASS-----SDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 131/430 (30%), Positives = 205/430 (47%), Gaps = 47/430 (10%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS---SLAVARKSVVPIASG 88
+T+ + H PCSP SK EE E+L +DQ R + + ++ A +
Sbjct: 52 TTVALNHRHGPCSPVPSSKKRPTEE---ELLKRDQLRAEHIQRKFAMNAAVDGAGDLQQS 108
Query: 89 RQITQSPT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-----VGCSS 133
+ + PT Y++ +GTPA T + +DT +D +WV C C +
Sbjct: 109 KVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTG 168
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYGS-STIAANLSQDTIS 188
+F+ A+S+T++ + C AA+C Q+ G GA C + + YG ST S+DT++
Sbjct: 169 ALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLT 228
Query: 189 L--ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
L A+D V G+ FGC +G S GL+GLG G+ SL++QT Y ++FSYCLP
Sbjct: 229 LSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSG 288
Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
S +L G T +L++ + + Y L I VG + + + P
Sbjct: 289 SSGFLTLGG--GGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVF------ 340
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTI 362
AG+++DSGT+ TRL AY+A+ F+ + + + DTC+ I PT+
Sbjct: 341 AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTV 400
Query: 363 TLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
L+FSG N +++ CLA AA D + +I N+QQ+ +LYDV +
Sbjct: 401 ALVFSGGAAIDLDPNGIMYG-----NCLAFAATGD--DGTTGIIGNVQQRTFEVLYDVGS 453
Query: 423 SRLGVARELC 432
S LG C
Sbjct: 454 STLGFRSGAC 463
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 124/392 (31%), Positives = 190/392 (48%), Gaps = 36/392 (9%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQ 108
K L+ E + + + RLQ L ++ V P+ +G Y++ IGTPAQ
Sbjct: 52 KNLTKFELLERAVERGSRRLQRLEAMLNGPSGVETPVYAGDG-----EYLMNLSIGTPAQ 106
Query: 109 TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
MDT +D W PCT C S+ +FN S++F L C + C+ + +PTC
Sbjct: 107 PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSNN 166
Query: 166 ACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVPPQGLLGLGRGSL 223
+C + YG S ++ +T++ + +P TFGC + G GL+G+GRG L
Sbjct: 167 SCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPL 226
Query: 224 SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI------GQPKRIKYTPLLKNPRRSS 277
SL +Q L + FSYC+ + S S +L LG + G P T L+++ + +
Sbjct: 227 SLPSQ---LDVTKFSYCMTPIGS-STSSTLLLGSLANSVTAGSPN----TTLIESSQIPT 278
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRR 336
YY+ L + VG + I P + N G G IIDSGT T AY AVR F +
Sbjct: 279 FYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQ 338
Query: 337 VGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLA 391
+ ++ S GFD C+ +P + PT + F G ++ LP +N I + G I CLA
Sbjct: 339 MNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENYFISPSNGLI-CLA 397
Query: 392 MAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
M ++ +++ N+QQQN ++YD NS
Sbjct: 398 MGSSSQG----MSIFGNIQQQNLLVVYDTGNS 425
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 132/433 (30%), Positives = 206/433 (47%), Gaps = 53/433 (12%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS---SLAVARKSVVPIASG 88
+T+ + H PCSP SK EE E+L +DQ R + + ++ A +
Sbjct: 52 TTVALNHRHGPCSPVPSSKKRPTEE---ELLKRDQLRAEHIQRKFAMNAAVDGAGDLQQS 108
Query: 89 RQITQSPT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-----VGCSS 133
+ + PT Y++ +GTPA T + +DT +D +WV C C +
Sbjct: 109 KVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTG 168
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYGS-STIAANLSQDTIS 188
+F+ A+S+T++ + C AA+C Q+ G GA C + + YG ST S+DT++
Sbjct: 169 ALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLT 228
Query: 189 L--ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
L A+D V G+ FGC +G S GL+GLG G+ SL++QT Y ++FSYCLP
Sbjct: 229 LSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLP---- 284
Query: 247 LSFSGSLRLGPIGQPKRIKY---TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
SGS +G T +L++ + + Y L I VG + + + P
Sbjct: 285 -PTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVF--- 340
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVA 359
AG+++DSGT+ TRL AY+A+ F+ + + + DTC+ I
Sbjct: 341 ---AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397
Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
PT+ L+FSG N +++ CLA AA D + +I N+QQ+ +LYD
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG-----NCLAFAATGD--DGTTGIIGNVQQRTFEVLYD 450
Query: 420 VPNSRLGVARELC 432
V +S LG C
Sbjct: 451 VGSSTLGFRSGAC 463
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 132/434 (30%), Positives = 199/434 (45%), Gaps = 51/434 (11%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGR 89
V H PCSP + S ++L +DQAR+ + + S V P G
Sbjct: 91 VMHRHGPCSPLQTPGD---APSDADLLDQDQARVDSILGMITNETSAVGPGVSLPAERGI 147
Query: 90 QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCT--GCVGCSSTVFNSAQSTTF 144
+ Y+V +GTPA+ L + DT +D +WV PC+ GC +F + S+TF
Sbjct: 148 SVGTG-NYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTF 206
Query: 145 KNLGCQAAQCKQVPNPTCGGG----ACAFNLTYGS-STIAANLSQDTISLAT-------- 191
+ C A +C+ +CGG C + + YG S +L DT++L T
Sbjct: 207 SAVRCGARECRA--RQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASA 264
Query: 192 ---DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
+ +PG+ FGC + TG GL GLGRG +SL +Q + FSYCLPS + +
Sbjct: 265 ENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSA 324
Query: 249 FSGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
G L LG P+ P ++TP+L S YYV L+ IRV R + + +P
Sbjct: 325 -PGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVS------SPRVA 377
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCY------SVPIVA 359
I+DSGTV TRL AY A+R F +G L DTCY + +
Sbjct: 378 LPLIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSI 437
Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
P + L+F+G + +++ + CLA AP+ ++ N QQ+ ++YD
Sbjct: 438 PAVALVFAGGATISVDFSGVLYVAKVAQACLAF--APNGDGRSAGILGNTQQRTLAVVYD 495
Query: 420 VPNSRLGVARELCT 433
V ++G A + C+
Sbjct: 496 VARQKIGFAAKGCS 509
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 123/360 (34%), Positives = 180/360 (50%), Gaps = 25/360 (6%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
S Y + +GTPA LM +DT +D W+ C C C S VF+ +S ++ +GC
Sbjct: 137 SGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCA 196
Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
A C+++ + C AC + + YG ++ A + + +T++ A V GC
Sbjct: 197 APLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGHDNE 256
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL----PSFKALSFSGSLRL--GPIGQ 260
G V GLLGLGRGSLS Q Y +FSYCL S S S ++ G +G
Sbjct: 257 GLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFGSGAVGS 316
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVF 318
+TP++KNPR + YYV L+ I V G RV + L+ +P++G G I+DSGT
Sbjct: 317 TVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGVIVDSGTSV 376
Query: 319 TRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVT 372
TRL PAY+A+RD FR G L+ FDTCY + + PT+++ F+ G
Sbjct: 377 TRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAA 436
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP +N LI + C A A V ++I N+QQQ R+++D R+ + C
Sbjct: 437 LPPENYLIPVDSKGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 122/386 (31%), Positives = 178/386 (46%), Gaps = 20/386 (5%)
Query: 62 LAKDQARLQFLS-SLAVARKSVVPIASGRQITQ-----SPTYIVRAKIGTPAQTLLMAMD 115
+ +D R+ L LA + + A G + S Y VR +G+P + + +D
Sbjct: 93 MQRDTKRVAALRRHLAAGKPTYAEEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVID 152
Query: 116 TSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
+ +D WV PCT C S VFN A S+++ + C + C V N C G C + ++
Sbjct: 153 SGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCHEGRCRYEVS 212
Query: 173 YGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
YG S L+ +T++ ++ GC G V GLLGLG G +S + Q
Sbjct: 213 YGDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGG 272
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
TFSYCL S + + SG L+ G P + PL+ NPR S YYV L + VG
Sbjct: 273 QAGGTFSYCLVS-RGIQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGL 331
Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT 351
V I + + G ++D+GT TRL AY A RD F + + + + FDT
Sbjct: 332 RVPISEDVFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDT 391
Query: 352 CYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
CY V + PT++ FSG + TLP N LI C A A + +S L++I
Sbjct: 392 CYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPS----SSGLSII 447
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N+QQ+ I D N +G +C
Sbjct: 448 GNIQQEGIEISVDGANGFVGFGPNVC 473
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 126/404 (31%), Positives = 192/404 (47%), Gaps = 34/404 (8%)
Query: 53 SWEESVLEMLAKDQARLQFLS--------SLAVARKSVVPIASGRQITQSPTYIVRAKIG 104
S ++L + A+D AR+++L + V + V I+ G S Y VR +G
Sbjct: 86 STRHAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVSGISEG-----SGEYFVRVGVG 140
Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
+P + +D+ +D W+ C C C + +F+ A S +F + C + C+ +P +
Sbjct: 141 SPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCDSGVCRTLPGGS 200
Query: 162 CG---GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLL 216
G GAC + ++YG + L+ +T++ V G GC + G V GLL
Sbjct: 201 SGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTPVQGVAIGCGHRNRGLFVGAAGLL 260
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRR 275
GLG G +SL+ Q FSYCL S A + +GSL G P + PLL+N ++
Sbjct: 261 GLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQ 320
Query: 276 SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
S YYV L + VG + + G G G ++D+GT TRL AY A+RD F
Sbjct: 321 PSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFAS 380
Query: 336 RVGSNL-TVTSLGGFDTCYSV----PIVAPTITLMF--SGMNVTLPQDNLLIHSTAGSIT 388
+G +L + DTCY + + PT+ L F G +TLP NLL+ G +
Sbjct: 381 TIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFGRDGAALTLPARNLLVE-MGGGVY 439
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA AA+ S L+++ N+QQQ +I D N +G C
Sbjct: 440 CLAFAAS----ASGLSILGNIQQQGIQITVDSANGYVGFGPSTC 479
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 128/404 (31%), Positives = 195/404 (48%), Gaps = 34/404 (8%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAK 102
S V+ ++A+D AR++ L VA S VVP S Y VR
Sbjct: 80 SRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVD----DGSGEYFVRVG 135
Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
+G+P + +D+ +D WV PC C + +F+ A S++F + C +A C+ +
Sbjct: 136 VGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSG 195
Query: 160 PTCGGGA----CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQG 214
CGGG C +++TYG S L+ +T++L V G GC + +G V G
Sbjct: 196 TGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAG 255
Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-GQPKRIKYTPLLKNP 273
LLGLG G++SL+ Q FSYCL S + +GSL LG P + PL++N
Sbjct: 256 LLGLGWGAMSLIGQLGGAAGGVFSYCLAS-RGAGGAGSLVLGRTEAVPVGAVWVPLVRNN 314
Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
+ SS YYV L I VG + + G Q G ++D+GT TRL AY A+R F
Sbjct: 315 QASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 374
Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSIT 388
+G+ ++ DTCY + + PT++ F G +TLP NLL+ G++
Sbjct: 375 DGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVE-VGGAVF 433
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA A + +S ++++ N+QQ+ +I D N +G C
Sbjct: 434 CLAFAPS----SSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 126/373 (33%), Positives = 184/373 (49%), Gaps = 27/373 (7%)
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
V P+ SG S Y + +GTP LM +DT +D W+ C C C S +F+
Sbjct: 133 VAPVVSG-LAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP 191
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-V 194
S ++ + C A C+++ + C AC + + YG ++ A + + +T++ A+ V
Sbjct: 192 RASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARV 251
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG--- 251
P GC G V GLLGLGRGSLS +Q + +FSYCL + S S
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSR 311
Query: 252 ----SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT 306
+ G +G +TP++KNPR + YYV L+ I V G RV + L+ +P+T
Sbjct: 312 SSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPST 371
Query: 307 G-AGTIIDSGTVFTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAP 360
G G I+DSGT TRL PAY A+RD FR G L+ FDTCY + + P
Sbjct: 372 GRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVP 431
Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
T+++ F+ G LP +N LI + C A A V ++I N+QQQ R+++D
Sbjct: 432 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFD 487
Query: 420 VPNSRLGVARELC 432
RLG + C
Sbjct: 488 GDGQRLGFVPKGC 500
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 139/421 (33%), Positives = 214/421 (50%), Gaps = 39/421 (9%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-------SLAVARKSVVPI 85
T+ + H + PCSP PSK + E E L +DQ R ++ + + + VP
Sbjct: 56 TVPLHHRYDPCSPV-PSKKVPTLE---ERLRRDQLRAAYIKRKFSGAGDIEQSDAATVPT 111
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQST 142
G ++ + Y++ IG+PA T M+MDT +D +WV C C C S V F+ + S+
Sbjct: 112 TLGTSLS-TLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSS 170
Query: 143 TFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYG-SSTIAANLSQDTISLATDIVPGY 197
T+ C +A C Q+ G G C + + YG SS+ S DT++L + + +
Sbjct: 171 TYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTLGSSAMTDF 230
Query: 198 TFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
FGC Q +G + GL+GLG G+ SL +QT + + FSYCLP SG L LG
Sbjct: 231 QFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSG--SSGFLTLG 288
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
G +K TP+L++ + + Y V L +I+VG + +++P AG+++DSGT
Sbjct: 289 -TGSSGFVK-TPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFS------AGSLMDSGT 340
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNV 371
+ TRL AY+A+ F+ + T G DTC+ I PT+TL+FS G V
Sbjct: 341 IITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAV 400
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
L D +++ ++ SI CLA P+ +S L +I N+QQ+ +LYDV +G
Sbjct: 401 DLAFDGIMLEISS-SIRCLAF--TPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGA 457
Query: 432 C 432
C
Sbjct: 458 C 458
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 114/348 (32%), Positives = 171/348 (49%), Gaps = 14/348 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y VR +G+P ++ M +D+ +D WV PCT C + +F+ A S +F + C
Sbjct: 40 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCS 99
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+A C QV N C G C + ++YG S+ L+ +T++L +V GC G
Sbjct: 100 SAVCDQVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTLGRTVVQNVAIGCGHMNQGMF 159
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
V GLLGLG GS+S + Q + FSYCL S + + +G L G P + PL
Sbjct: 160 VGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVS-RVTNSNGFLEFGSEAMPVGAAWIPL 218
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
++NP S YY+ L + VG V I + G ++D+GT TR AY A
Sbjct: 219 IRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAF 278
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
RD F + G+ + + FDTCY+ + + PT++ FSG + TLP +N LI
Sbjct: 279 RDAFIDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDD 338
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A +P S L+++ N+QQ+ +I D N +G +C
Sbjct: 339 AGTFCFAFAPSP----SGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 128/427 (29%), Positives = 203/427 (47%), Gaps = 36/427 (8%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-----------LAVARK 80
++L+V H PCS S S +++ D R++++ S +
Sbjct: 65 ASLEVVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDS 124
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVF 136
+ +P SGR I + Y+V +GTP + L + DT + W C C G +F
Sbjct: 125 TTLPAKSGRLIGSADYYVV-VGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIF 183
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCGG---GACAFNLTYGSSTIAAN-LSQDTISL-AT 191
+ ++S+++ N+ C ++ C Q + C +C +++ YG ++I+ LSQ+ +++ AT
Sbjct: 184 DPSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITAT 243
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
DIV + FGC Q G GL+GL R +S + QT ++Y FSYCLPS S G
Sbjct: 244 DIVHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPS--TPSSLG 301
Query: 252 SLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
L G +KYTP +S Y ++++ I VG +P A+ + + G+
Sbjct: 302 HLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGG--TKLP--AVSSSTFSAGGS 357
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF 366
IIDSGTV TRL AY A+R FR+ + DTCY I P I F
Sbjct: 358 IIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEF 417
Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+ G+ V LP +L +A + CLA AA + + + + N+QQ+ ++YDV R+
Sbjct: 418 AGGVKVELPLVGILYGESAQQL-CLAFAANGNGND--ITIFGNVQQKTLEVVYDVEGGRI 474
Query: 426 GVARELC 432
G C
Sbjct: 475 GFGAAGC 481
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 112/348 (32%), Positives = 171/348 (49%), Gaps = 14/348 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y VR +G+P ++ M +D+ +D WV PCT C + +F+ A S +F + C
Sbjct: 40 SGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCS 99
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+A C +V N C G C + ++YG S L+ +T++ +V GC G
Sbjct: 100 SAVCDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHSNRGMF 159
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
V GLLGLG GS+S + Q + FSYCL S + + +G L G P + PL
Sbjct: 160 VGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVS-RGTNTNGFLEFGSEAMPVGAAWIPL 218
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
++NPR S YY+ LL + VG V + Q N G ++D+GT TR AY A
Sbjct: 219 VRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAF 278
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
R+ F + + + + FDTCY+ + + PT++ FSG + T+P +N LI
Sbjct: 279 RNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVDD 338
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A +P S L+++ N+QQ+ +I D N +G +C
Sbjct: 339 AGTFCFAFAPSP----SGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 143/411 (34%), Positives = 205/411 (49%), Gaps = 42/411 (10%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVA-----RKSVV-PIASGRQITQ-SPTYIVRAKIGTPAQ 108
E + L +D+ R +S A A RK V P+ SG + Q S Y + +GTPA
Sbjct: 83 ELLKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVVSG--LAQGSGEYFTKIGVGTPAT 140
Query: 109 TLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-- 163
LM +DT +D WV C C C S VF+ +S+++ +GC AA C+++ + C
Sbjct: 141 QALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLR 200
Query: 164 GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRG 221
GAC + + YG ++ A + +T++ A V GC G V GLLGLGRG
Sbjct: 201 RGACMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRG 260
Query: 222 SLSLLAQTQNLYQSTFSYCL---PSFKALSFSGSLR-------LGPIGQPKRIKYTPLLK 271
LS Q Y +FSYCL S A + GS R G +G +TP+++
Sbjct: 261 GLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASS-ASFTPMVR 319
Query: 272 NPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAV 329
NPR + YYV L+ I V G RV + L+ +P+TG G I+DSGT TRL +Y+A+
Sbjct: 320 NPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSAL 379
Query: 330 RDVFRRRVGSNLTVTSLGG---FDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIH 381
RD FR L + S GG FDTCY + + PT+++ F+ G LP +N LI
Sbjct: 380 RDAFRAAAAGGLRL-SPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIP 438
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ C A A V ++I N+QQQ R+++D R+G A + C
Sbjct: 439 VDSRGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 125/424 (29%), Positives = 201/424 (47%), Gaps = 37/424 (8%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL-----SSLAVAR------KSV 82
+ + H PCSP + S E+LA DQ R + + ++ V+R +
Sbjct: 89 MPIVHRHGPCSPLADAHDGKLP-SHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPS 147
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV----GCSSTVFNS 138
+P +SG + + Y+V +GTPA + DT +D WV C CV +F+
Sbjct: 148 LPASSGSAL-GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDP 206
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPG 196
A+S+T+ N+ C A C + C GG C + + YG + + + DT++L++ D + G
Sbjct: 207 ARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKG 266
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
+ FGC ++ G GLLGLGRG SL Q + Y F++C P+ S +G L G
Sbjct: 267 FRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARS--SGTGYLDFG 324
Query: 257 PIGQPKRIKY--TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
P P TP+L + + YYV L IRVG +++ IP T +GTI+DS
Sbjct: 325 PGSLPAVSAKLTTPMLVD-NGPTFYYVGLTGIRVGGKLLSIPQSVF-----TTSGTIVDS 378
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSV----PIVAPTITLMFSG 368
GTV TRL AY+++R F + +L DTCY + PT++L+F G
Sbjct: 379 GTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQG 438
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ +I++ + S CL A ++ + + ++ N Q + ++YD+ +G
Sbjct: 439 GASLDVHASGIIYAASVSQACLGFAGNKEDDD--VGIVGNTQLKTFGVVYDIGKKVVGFC 496
Query: 429 RELC 432
C
Sbjct: 497 PGAC 500
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 129/408 (31%), Positives = 188/408 (46%), Gaps = 48/408 (11%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQITQ--------------------SPTYIVRA 101
L +D R++ ++SLA +++GR T+ S Y +R
Sbjct: 87 LQRDSLRVKSITSLAA-------VSTGRNATKRTPRTAGGFSGAVISGLSQGSGEYFMRL 139
Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVP 158
+GTPA + M +DT +D W+ C+ C C + +F+ +S TF + C + C+++
Sbjct: 140 GVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLD 199
Query: 159 NP----TCGGGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ 213
+ T C + ++YG + + S +T++ V GC G V
Sbjct: 200 DSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAA 259
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCL----PSFKALSFSGSLRLGPIGQPKRIKYTPL 269
GLLGLGRG LS +QT+N Y FSYCL S + ++ G PK +TPL
Sbjct: 260 GLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPL 319
Query: 270 LKNPRRSSLYYVNLLAIRVG-RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L NP+ + YY+ LL I VG RV + + + T G IIDSGT TRL PAY A
Sbjct: 320 LTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVA 379
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTA 384
+RD FR S FDTC+ + + PT+ F G V+LP N LI
Sbjct: 380 LRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNT 439
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A + L++I N+QQQ R+ YD+ SR+G C
Sbjct: 440 EGRFCFAFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 117/349 (33%), Positives = 167/349 (47%), Gaps = 14/349 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
S Y R +GTP + + M +DT +D W+ C C C S VF+ +S +F ++ C+
Sbjct: 144 SGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCR 203
Query: 151 AAQCKQVPNPTCGG-GACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGN 208
+ C ++ +P C +C + + YG + S +T++ VP GC G
Sbjct: 204 SPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTRVPKVALGCGHDNEGL 263
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
V GLLGLGRG LS QT + FSYCL A S S+ G + +TP
Sbjct: 264 FVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFGQSAVSRTAVFTP 323
Query: 269 LLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
L+ NP+ + YY+ L I V G RV I + + G IIDSGT TRL AY
Sbjct: 324 LITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYV 383
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHST 383
++RD FR FDTC+ + + PT+ + F G +V+LP N LI
Sbjct: 384 SLRDAFRAGAADLKRAPDYSLFDTCFDLSGKTEVKVPTVVMHFRGADVSLPATNYLIPVD 443
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ C A A S L++I N+QQQ R+++DV SR+G A C
Sbjct: 444 TNGVFCFAFAG----TMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGC 488
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 138/467 (29%), Positives = 209/467 (44%), Gaps = 73/467 (15%)
Query: 25 CDT---QDHSST-----LQVFHVFSPCSPFKPS---KPLSWEESVLEMLAKDQARLQFL- 72
CDT H +T + + H PCSP + KP S EE +L DQ R + +
Sbjct: 73 CDTPREHKHGATSSGTRMPIVHRHGPCSPLADAHGGKPPSHEE----ILDADQNRAESIQ 128
Query: 73 ----SSLAVAR---KSVVPIASGRQITQSP--------------------------TYIV 99
++ AR K P S RQ S Y+V
Sbjct: 129 RRVSTTTTAARGKPKRNRPSPSRRQQPSSSAPAPGASLSSSAASLPASSGRALGTGNYVV 188
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCK 155
+GTPA + DT +D WV C CV +F+ A+S+T N+ C A C
Sbjct: 189 TIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAPACS 248
Query: 156 QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQ 213
+ C GG C + + YG + + + DT++L++ D + G+ FGC ++ G
Sbjct: 249 DLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNEGLFGEAA 308
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY--TPLLK 271
GLLGLGRG SL Q + Y F++C P+ S +G L GP P TP+L
Sbjct: 309 GLLGLGRGKTSLPVQAYDKYGGVFAHCFPARS--SGTGYLDFGPGSSPAVSTKLTTPMLV 366
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
+ + YYV L IRVG +++ IPP T AGTI+DSGTV TRL AY+++R
Sbjct: 367 D-NGLTFYYVGLTGIRVGGKLLSIPPSVF-----TTAGTIVDSGTVITRLPPAAYSSLRS 420
Query: 332 VFRRRVGSN--LTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAG 385
F + + +L DTCY + PT++L+F G + +I++ +
Sbjct: 421 AFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASGIIYAASV 480
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S CL AA ++ + + ++ N Q + ++YD+ +G + C
Sbjct: 481 SQACLGFAANEEDDD--VGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 116/348 (33%), Positives = 168/348 (48%), Gaps = 14/348 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y VR +G+P ++ M +D+ +D WV PCT C S VF+ A S +F + C
Sbjct: 137 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCS 196
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
++ C ++ N C G C + ++YG S L+ +T++ +V GC + G
Sbjct: 197 SSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMF 256
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
V GLLGLG GS+S + Q FSYCL S + SGSL G P + PL
Sbjct: 257 VGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS-RGTDSSGSLVFGREALPAGAAWVPL 315
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
++NPR S YY+ L + VG V I + G ++D+GT TRL AY A
Sbjct: 316 VRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAF 375
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
RD F + + T + FDTCY V + PT++ FSG + TLP N LI
Sbjct: 376 RDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDD 435
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A + S L+++ N+QQ+ +I +D N +G +C
Sbjct: 436 AGTFCFAFAPS----TSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 133/449 (29%), Positives = 207/449 (46%), Gaps = 66/449 (14%)
Query: 34 LQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS------ 81
+ + H PCSP + KP S E+ +LA DQ R + + S+ A R +
Sbjct: 87 MTIVHRHGPCSPLADAHGKPPSHED----ILAADQNRAESIQHRVSTTATGRGNPKRSRR 142
Query: 82 -----------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
+P +SGR + Y+V +GTPA + DT +
Sbjct: 143 APSRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPASRYTVVFDTGS 201
Query: 119 DAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG 174
D WV C CV +F+ A+S+T+ N+ C A C + C GG C + + YG
Sbjct: 202 DTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAPACSDLDTRGCSGGNCLYGVQYG 261
Query: 175 SSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
+ + + DT++L++ D V G+ FGC ++ G GLLGLGRG SL QT +
Sbjct: 262 DGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDK 321
Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY---TPLLKNPRRSSLYYVNLLAIRVG 289
Y F++CLP+ S +G L GP G P TP+L + + YYV + IRVG
Sbjct: 322 YGGVFAHCLPARS--SGTGYLDFGP-GSPAAAGARLTTPMLTD-NGPTFYYVGMTGIRVG 377
Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLG 347
+++ IP T AGTI+DSGTV TRL AY+++R F + + ++
Sbjct: 378 GQLLSIPQSVF-----TTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVS 432
Query: 348 GFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
DTCY + PT++L+F G + ++++ + S CL AA D + +
Sbjct: 433 LLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASVSQVCLGFAANEDGGD--V 490
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
++ N Q + + YD+ +G + C
Sbjct: 491 GIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 130/424 (30%), Positives = 194/424 (45%), Gaps = 39/424 (9%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS----------- 81
+L + H+ + S P E+ L +D R++ + +LA +S
Sbjct: 63 SLHLHHIDALSSNKTP------EQLFQLRLQRDAKRVEGVVALAALNQSHARRSGSSFSS 116
Query: 82 --VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
+ +A G S Y R +GTPA+ + M +DT +D W+ C C C + VF
Sbjct: 117 SIISGLAQG-----SGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVF 171
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTIA-ANLSQDTISLATDI 193
+ +S T+ + C A C+++ +P C C + ++YG + + S +T++
Sbjct: 172 DPTKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTR 231
Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
V GC G + GLLGLGRG LS QT + FSYCL A + S+
Sbjct: 232 VTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSV 291
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPTTGAGTII 312
G + ++TPL+KNP+ + YY+ LL I VG V + + + G II
Sbjct: 292 VFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVII 351
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG 368
DSGT TRL PAY A+RD FR FDTC+ + + PT+ L F G
Sbjct: 352 DSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRG 411
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+V+LP N LI C A A S L++I N+QQQ R+ +D+ SR+G A
Sbjct: 412 ADVSLPATNYLIPVDNSGSFCFAFAG----TMSGLSIIGNIQQQGFRVSFDLAGSRVGFA 467
Query: 429 RELC 432
C
Sbjct: 468 PRGC 471
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 175/371 (47%), Gaps = 35/371 (9%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
VP+ +G +++ IGTPA +DT +D W C CV C S+ VF+ +
Sbjct: 109 VPVHAGNG-----EFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPS 163
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA--CAFNLTYG-SSTIAANLSQDTISLATDIVPG 196
S+T+ L C ++ C +P TC A C + TYG +S+ L+ +T +LA +PG
Sbjct: 164 SSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTKLPG 223
Query: 197 YTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
FGC G+ GL+GLGRG LSL++Q L FSYCL S S S L L
Sbjct: 224 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGLGKFSYCLTSLDDTSKS-PLLL 279
Query: 256 GPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G + I+ TPL+KNP + S YYV L A+ VG + +P A
Sbjct: 280 GSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTG 339
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP------IVAPTI 362
G I+DSGT T L Y ++ F ++ + S G D C+ P + P +
Sbjct: 340 GVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDVEVPKL 399
Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
L F G ++ LP +N ++ +A CL + + L++I N QQQN + +YDV
Sbjct: 400 VLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRG-----LSIIGNFQQQNIQFVYDVD 454
Query: 422 NSRLGVARELC 432
L A C
Sbjct: 455 KDTLSFAPVQC 465
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 116/353 (32%), Positives = 172/353 (48%), Gaps = 30/353 (8%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTPAQ MDT +D W PCT C S+ +FN S++F L C +
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG-NSVP 211
C+ + +PTC C + YG S ++ +T++ + +P TFGC + G
Sbjct: 155 CQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGN 214
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI------GQPKRIK 265
GL+G+GRG LSL +Q L + FSYC+ + S +L LG + G P
Sbjct: 215 GAGLVGMGRGPLSLPSQ---LDVTKFSYCMTPIGS-STPSNLLLGSLANSVTAGSPN--- 267
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAP 324
T L+++ + + YY+ L + VG + I P A N G G IIDSGT T V
Sbjct: 268 -TTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNN 326
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLL 379
AY +VR F ++ + S GFD C+ P + PT + F G ++ LP +N
Sbjct: 327 AYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYF 386
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
I + G I CLAM ++ +++ N+QQQN ++YD NS + A C
Sbjct: 387 ISPSNGLI-CLAMGSSSQG----MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 127/404 (31%), Positives = 194/404 (48%), Gaps = 34/404 (8%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAK 102
S V+ ++A+D AR++ L VA S VVP S Y VR
Sbjct: 80 SRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVD----DGSGEYFVRVG 135
Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
+G+P + +D+ +D WV PC C + +F+ A S++F + C +A C+ +
Sbjct: 136 VGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSG 195
Query: 160 PTCGGGA----CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQG 214
CGGG C +++TYG S L+ +T++L V G GC + +G V G
Sbjct: 196 TGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAG 255
Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-GQPKRIKYTPLLKNP 273
LLGLG G++SL+ Q FSYCL S + +GSL LG P + PL++N
Sbjct: 256 LLGLGWGAMSLVGQLGGAAGGVFSYCLAS-RGAGGAGSLVLGRTEAVPVGAVWVPLVRNN 314
Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
+ SS YYV L I VG + + Q G ++D+GT TRL AY A+R F
Sbjct: 315 QASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 374
Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSIT 388
+G+ ++ DTCY + + PT++ F G +TLP NLL+ G++
Sbjct: 375 DGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVE-VGGAVF 433
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA A + +S ++++ N+QQ+ +I D N +G C
Sbjct: 434 CLAFAPS----SSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 141/441 (31%), Positives = 205/441 (46%), Gaps = 58/441 (13%)
Query: 29 DHSST---LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARL-QFLSSL--------- 75
+HSS+ L + H PCSP P S +L D AR+ F + L
Sbjct: 37 NHSSSAVHLPLHHPRGPCSPLSADIPFS------AVLTHDAARIASFAARLAKKSSPSSA 90
Query: 76 ------AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC- 128
A + + VP+ G + Y+ R +GTPA+ +M +DT + W+ C+ C
Sbjct: 91 SATTQAAGSSLASVPLTPGTSVGVG-NYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCR 149
Query: 129 VGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSSTIA 179
V C S VF+ S+++ + C + QC + T C+ + +YG S+ +
Sbjct: 150 VSCHRQSGPVFDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFS 209
Query: 180 AN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
LS+DT+S + VP + +GC Q G GL+GL R LSLL Q +FS
Sbjct: 210 VGYLSKDTVSFGANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFS 269
Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
YCLPS S SG L +G P YTP++ N SLY+++L + V + P
Sbjct: 270 YCLPS---TSSSGYLSIGSY-NPGGYSYTPMVSNTLDDSLYFISLSGMTVAGK-----PL 320
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY---- 353
A+ + T TIIDSGTV TRL YTA+ + V GS + DTC+
Sbjct: 321 AVSSSEYTSLPTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQA 380
Query: 354 SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
S P +++ FS G + L NLL+ G+ TCLA A A +I N QQQ
Sbjct: 381 SKLRAVPAVSMAFSGGATLKLSAGNLLVD-VDGATTCLAFAPARSAA-----IIGNTQQQ 434
Query: 413 NHRILYDVPNSRLGVARELCT 433
++YDV ++R+G A C+
Sbjct: 435 TFSVVYDVKSNRIGFAAAGCS 455
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 126/404 (31%), Positives = 185/404 (45%), Gaps = 36/404 (8%)
Query: 57 SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT------YIVRAKIGTPAQTL 110
S L++L + R S VAR + V +G Q P +++ IGTPA +
Sbjct: 54 SRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQVPVHAGNGEFLMDVAIGTPALSY 113
Query: 111 LMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC-GGGA 166
+DT +D W C CV C S+ VF+ + S+T+ + C +A C +P TC
Sbjct: 114 AAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALCSDLPTSTCTSASK 173
Query: 167 CAFNLTYG-SSTIAANLSQDTISLATDI--VPGYTFGCIQKATGNS-VPPQGLLGLGRGS 222
C + TYG +S+ L+ +T +L + +PG FGC G+ GL+GLGRG
Sbjct: 174 CGYTYTYGDASSTQGVLASETFTLGKEKKKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGP 233
Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-------IKYTPLLKNPRR 275
LSL++Q L FSYCL S L LG ++ TPL+KNP +
Sbjct: 234 LSLVSQ---LGLDKFSYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQ 290
Query: 276 SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
S YYV+L + VG + +P A G I+DSGT T L Y A++ F
Sbjct: 291 PSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVA 350
Query: 336 RVGSNLTVTSLGGFDTCYSVP------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSIT 388
++ S G D C+ P + P + L F G ++ LP +N ++ +A
Sbjct: 351 QMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSASGAL 410
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CL +A + L++I N QQQN + +YDV L A C
Sbjct: 411 CLTVAPS-----RGLSIIGNFQQQNFQFVYDVAGDTLSFAPVQC 449
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 144/470 (30%), Positives = 231/470 (49%), Gaps = 61/470 (12%)
Query: 4 QLVFFLAFLFLFSLSE--GLNPICDTQDHSSTL-QVFHVFSPCSPFKPSKPLSWEESVLE 60
+LV FL F+ + + S L + + SS L ++HV S +P+ S+ +
Sbjct: 10 KLVCFLTFMIVLATSSFAKLEEYKLSANQSSILLNLYHVHGDASSLEPNSSSSF----CD 65
Query: 61 MLAKDQARLQFLSSLAVARKSV--------------------VPIASGRQITQSPTYIVR 100
+L++D+ ++FLSS + +K V +P+ G I S Y ++
Sbjct: 66 ILSRDEEHVKFLSS-RLRKKDVQGASFSRHKSGHLLEPNSANIPLNPGLSIG-SGNYYLK 123
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCKQ 156
+G+P + M +DT + +W+ C CV C S V F + S T++ L C +++C
Sbjct: 124 LGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSL 183
Query: 157 VP-----NPTC-GGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATGN 208
+ +P C G C + +YG ++ + LS+D ++L + +P +T+GC Q G
Sbjct: 184 LKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQTLPSFTYGCGQDNEGL 243
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
G++GL R LS+LAQ Y FSYCLP+ + S G L +G I P K+TP
Sbjct: 244 FGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTS-SGGGFLSIGKI-SPSSYKFTP 301
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
+++N + SLY++ L AI V R V + Q TIIDSGTV TRL Y A
Sbjct: 302 MIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVP------TIIDSGTVVTRLPISIYAA 355
Query: 329 VRDVFRRRVGSNLT-VTSLGGFDTCYSVPIV----APTITLMF-SGMNVTLPQDNLLIHS 382
+R+ F + + + DTC+ + AP I ++F G +++L N+LI +
Sbjct: 356 LREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEA 415
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
G I CLA A++ + + +I N QQQ + I YDV S++G A C
Sbjct: 416 DKG-IACLAFASS-----NQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 178/361 (49%), Gaps = 23/361 (6%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG + S Y R +G+PA+ L M +DT +D WV C C C S VF+ +
Sbjct: 151 PVVSGVGLG-SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSL 209
Query: 141 STTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLSQDTISLATDI-VPG 196
ST++ ++ C +C + C GAC + + YG S + + +T++L V
Sbjct: 210 STSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSS 269
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
GC G V GLL LG G LS +Q + +TFSYCL + S S +L+ G
Sbjct: 270 VAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISATTFSYCLVDRDSPS-SSTLQFG 325
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+ PL+++PR S+ YYV L I VG +++ IPP A + T G I+DSGT
Sbjct: 326 DAADAEVT--APLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGT 383
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNV 371
TRL + AY A+RD F R S + + FDTCY + + P ++L F+ G +
Sbjct: 384 AVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGEL 443
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
LP N LI CLA A N+ +++I N+QQQ R+ +D S +G
Sbjct: 444 RLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSNK 499
Query: 432 C 432
C
Sbjct: 500 C 500
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 131/415 (31%), Positives = 196/415 (47%), Gaps = 47/415 (11%)
Query: 53 SWEESVLEMLAKDQARLQFLSS-LAVARK------SVVPIASGRQITQSPTYIVRAKIGT 105
S +VL+++A+D AR ++L++ L+ A + S + SG S Y+VR +G+
Sbjct: 121 SLRHAVLDLVARDNARAEYLATRLSPAYQPPGFSGSESKVVSGLD-EGSGEYLVRVSVGS 179
Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC 162
P + +D+ +D WV C C+ C + +F+ A S TF + C +A C+ +P C
Sbjct: 180 PPTEQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAICRILPTSAC 239
Query: 163 GGG---ACAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLG 217
G G C + ++Y GS T A L+ +T++L V G GC + G V GL+G
Sbjct: 240 GDGELGGCEYEVSYADGSYTKGA-LALETLTLGGTAVEGVVIGCGHRNRGLFVGAAGLMG 298
Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSF------KALSFSGSLRLG-PIGQPKRIKYTPLL 270
LG G +SL+ Q FSYCL S A +G L LG P+ + PL+
Sbjct: 299 LGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLV 358
Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
+NPR S YYV L I VG + + G Q ++D+GT TRL AY A+R
Sbjct: 359 RNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALR 418
Query: 331 DVFR--------RRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG-MNVTLPQDN 377
D F R G + +V DTCY + + PT++ F G + L N
Sbjct: 419 DAFVGALAGAVPRAQGVSSSV-----LDTCYDLSGYASVRVPTVSFCFDGDARLILAARN 473
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+L+ G I CLA A + +S L+++ N QQ +I D N +G C
Sbjct: 474 VLLEVDMG-IYCLAFAPS----SSGLSIMGNTQQAGIQITVDSANGYIGFGPANC 523
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/347 (33%), Positives = 172/347 (49%), Gaps = 24/347 (6%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAA 152
Y++ GTP + + DT ++ W+ C CV C +F+ S+T++N+ C +A
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISCTSA 75
Query: 153 QCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSV 210
C + + C G C + +TYG S+ L+ +T +LA ++ + FGC Q G
Sbjct: 76 ACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGNVFNNFIFGCGQNNQGLFT 135
Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI-KYTPL 269
GL+GLGR SL +Q + FSYCLPS S +G L IG P R YT +
Sbjct: 136 GAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTS--SATGYLN---IGNPLRTPGYTAM 190
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
L N R +LY+++L+ I VG + + Q GTIIDSGTV TRL AY A+
Sbjct: 191 LTNSRAPTLYFIDLIGISVGGTRLALSSTVFQ-----SVGTIIDSGTVITRLPPTAYGAL 245
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAG 385
R FR + + DTCY + + PTI L ++G++VT+P + + +
Sbjct: 246 RTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVF-YVISS 304
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S CLA A D+ + +I N+QQ+ + YD R+G A C
Sbjct: 305 SQVCLAFAGNSDSTQ--IGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/386 (32%), Positives = 179/386 (46%), Gaps = 39/386 (10%)
Query: 77 VARKSVVPI----ASGRQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
VAR + VP+ A+G Q P +++ IGTPA +DT +D W C
Sbjct: 75 VARATGVPMTSSKAAGGGDLQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK 134
Query: 127 GCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAAN 181
CV C S+ VF+ + S+T+ + C +A C +P C C + TYG SS+
Sbjct: 135 PCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGV 194
Query: 182 LSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
L+ +T +LA +PG FGC G+ GL+GLGRG LSL++Q L FSYC
Sbjct: 195 LATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQ---LGLDKFSYC 251
Query: 241 LPSFKALSFSGSLRLGPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
L S + S L LG + ++ TPL+KNP + S YYV+L AI VG +
Sbjct: 252 LTSLDDTNNS-PLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRI 310
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
+P A G I+DSGT T L Y A++ F ++ S G D C+
Sbjct: 311 SLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCF 370
Query: 354 SVP------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
P + P + F G ++ LP +N ++ CL + + L++I
Sbjct: 371 RAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-----LSII 425
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N QQQN + +YDV + L A C
Sbjct: 426 GNFQQQNFQFVYDVGHDTLSFAPVQC 451
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 133/420 (31%), Positives = 200/420 (47%), Gaps = 41/420 (9%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
T+ + H PCS + P + ++ +ML +DQ R +++ G +T
Sbjct: 58 TVPLHHRHGPCS----TVPSTNAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVEGSDVT 113
Query: 93 QSPT---------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS---TVFNSAQ 140
T Y++ +G+PA M +DT +D +WV C C C S ++F+ +
Sbjct: 114 VPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSS 173
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
S+T+ C +A C Q+ C C + + YG ST + S DT++L + V + F
Sbjct: 174 SSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLALGSSTVENFQF 233
Query: 200 GCIQKATGNSVPPQGLLGLGRGSL--SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
GC Q +GN + Q +G G SL QT + FSYCLP SG L LG
Sbjct: 234 GCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPG--SSGFLTLGA 291
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+K TP+L++ + S Y V L AIRVG R ++IP A AG+I+DSGT+
Sbjct: 292 STSGFVVK-TPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFS------AGSIMDSGTI 344
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVT- 372
TRL AY+A+ F+ + +G FDTC+ + PT+ L+FSG V
Sbjct: 345 ITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGAVVD 404
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L D +++ S CLA AA D+ + L +I N+QQ+ +LYDV +G C
Sbjct: 405 LASDGIILGS------CLAFAANSDDTS--LGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/386 (32%), Positives = 179/386 (46%), Gaps = 39/386 (10%)
Query: 77 VARKSVVPI----ASGRQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
VAR + VP+ A+G Q P +++ IGTPA +DT +D W C
Sbjct: 65 VARATGVPMTSSKAAGGGDLQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK 124
Query: 127 GCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAAN 181
CV C S+ VF+ + S+T+ + C +A C +P C C + TYG SS+
Sbjct: 125 PCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGV 184
Query: 182 LSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
L+ +T +LA +PG FGC G+ GL+GLGRG LSL++Q L FSYC
Sbjct: 185 LATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQ---LGLDKFSYC 241
Query: 241 LPSFKALSFSGSLRLGPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
L S + S L LG + ++ TPL+KNP + S YYV+L AI VG +
Sbjct: 242 LTSLDDTNNS-PLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRI 300
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
+P A G I+DSGT T L Y A++ F ++ S G D C+
Sbjct: 301 SLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCF 360
Query: 354 SVP------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
P + P + F G ++ LP +N ++ CL + + L++I
Sbjct: 361 RAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-----LSII 415
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N QQQN + +YDV + L A C
Sbjct: 416 GNFQQQNFQFVYDVGHDTLSFAPVQC 441
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 126/427 (29%), Positives = 209/427 (48%), Gaps = 39/427 (9%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF----LSSLAV--ARKSVV 83
+S +L+V H PC + + S +E+L +D+ R+ LSS V +++ +
Sbjct: 61 NSLSLEVVHRSGPCIQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGVFQEKQATL 120
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ--- 140
P+ SG I S Y V +GTP + + DT +D W T C C+ T + +
Sbjct: 121 PVQSGASIG-SGDYAVTVGLGTPKKEFTLIFDTGSDLTW---TQCEPCAKTCYKQKEPRL 176
Query: 141 ----STTFKNLGCQAAQCKQVP---NPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT- 191
ST++KN+ C +A CK + +C C + + YG + + + +T++L++
Sbjct: 177 DPTKSTSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSS 236
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
++ + FGC Q+ +G GLLGLGR LSL +QT Y+ FSYCLP+ + S G
Sbjct: 237 NVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPA--SSSSKG 294
Query: 252 SLRLGPIGQ-PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
L G GQ K +K+TPL ++ + + Y +++ + VG + I + +GT
Sbjct: 295 YLSFG--GQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIF-----STSGT 347
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
+IDSGTV TRL + AY+A+ F++ + + FDTCY I P + + F
Sbjct: 348 VIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKIPKVGVSF 407
Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G+ + + +L CLA A D+V + + N QQ+ ++++YD R+
Sbjct: 408 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAA--IFGNTQQKTYQVVYDDAKGRV 465
Query: 426 GVARELC 432
G A C
Sbjct: 466 GFAPSGC 472
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 140/441 (31%), Positives = 215/441 (48%), Gaps = 64/441 (14%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL--------AVARK----- 80
L+++H+ S SP P S M AKD+ R+++ S A ++K
Sbjct: 33 LKLYHMTSLKSP-----PNSTSLLFAYMFAKDEERIRYFHSRLAKNSDANASSKKVGPKL 87
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC---SSTVF 136
+ +P+ SG + S Y V+ +G+P + M +DT + +W+ C C + C VF
Sbjct: 88 AGIPLKSGLSMG-SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVF 146
Query: 137 NSAQSTTFKNLGCQAAQCKQ-----VPNPTCG--GGACAFNLTYGSSTIA-ANLSQDTIS 188
N + S T+K + C ++QC + PTC AC + +YG S+ + LSQD ++
Sbjct: 147 NPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLT 206
Query: 189 LA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
L + + + +GC Q G G++GL LS+L+Q Y + FSYCLP+
Sbjct: 207 LTPSQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPT---- 262
Query: 248 SFS-------GSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
SFS G L +G + K+TPLLKNP SLY+++L +I V R + +
Sbjct: 263 SFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAAS 322
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYS--- 354
+ + TIIDSGTV TRL P YT +++ + + + DTC+
Sbjct: 323 SYK------VPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSL 376
Query: 355 --VPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
+ VAP I ++F G ++ L N L+ G ITCLAMA + S + +I N QQ
Sbjct: 377 AGISEVAPDIRIIFKGGADLQLKGHNSLVELETG-ITCLAMAGS-----SSIAIIGNYQQ 430
Query: 412 QNHRILYDVPNSRLGVARELC 432
Q ++ YDV NSR+G A C
Sbjct: 431 QTVKVAYDVGNSRVGFAPGGC 451
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 121/361 (33%), Positives = 178/361 (49%), Gaps = 23/361 (6%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG + S Y R +G+PA+ L M +DT +D WV C C C S VF+ +
Sbjct: 155 PVVSGVGL-GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSL 213
Query: 141 STTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLSQDTISLATDI-VPG 196
ST++ ++ C +C + C GAC + + YG S + + +T++L V
Sbjct: 214 STSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSS 273
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
GC G V GLL LG G LS +Q + +TFSYCL + S S +L+ G
Sbjct: 274 VAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISATTFSYCLVDRDSPS-SSTLQFG 329
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+ PL+++PR S+ YYV L + VG +++ IPP A + T G I+DSGT
Sbjct: 330 DAADAEVTA--PLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGT 387
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNV 371
TRL + AY A+RD F R S + + FDTCY + + P ++L F+ G +
Sbjct: 388 AVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGEL 447
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
LP N LI CLA A N+ +++I N+QQQ R+ +D S +G
Sbjct: 448 RLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNK 503
Query: 432 C 432
C
Sbjct: 504 C 504
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 138/454 (30%), Positives = 199/454 (43%), Gaps = 60/454 (13%)
Query: 16 SLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL 75
S+SE + H L F SP FK L +D R++ ++SL
Sbjct: 56 SVSESTTSLSVHLSHVDALSSFSDASPVDLFKL------------RLQRDSLRVKSITSL 103
Query: 76 AVARKSVVPIASGRQITQ--------------------SPTYIVRAKIGTPAQTLLMAMD 115
A +++GR T+ S Y +R +GTPA + M +D
Sbjct: 104 AA-------VSTGRNATKRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLD 156
Query: 116 TSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP----TCGGGACA 168
T +D W+ C+ C C S +F+ +S TF + C + C+++ + T C
Sbjct: 157 TGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCL 216
Query: 169 FNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLA 227
+ ++YG + + S +T++ V GC G V GLLGLGRG LS +
Sbjct: 217 YQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPS 276
Query: 228 QTQNLYQSTFSYCL----PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNL 283
QT++ Y FSYCL S + ++ G PK +TPLL NP+ + YY+ L
Sbjct: 277 QTKSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQL 336
Query: 284 LAIRVG-RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
L I VG RV + + + T G IIDSGT TRL AY A+RD FR
Sbjct: 337 LGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKR 396
Query: 343 VTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
S FDTC+ + + PT+ F G V+LP N LI C A A +
Sbjct: 397 APSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGS 456
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L++I N+QQQ R+ YD+ SR+G C
Sbjct: 457 ----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 131/455 (28%), Positives = 199/455 (43%), Gaps = 68/455 (14%)
Query: 26 DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLA--VARKSVV 83
D ++ + + H PCSP + S E+LA DQ+R + + V
Sbjct: 81 DATSSTTRMTIVHRHGPCSPLAAAH--GEPPSHGEILAADQSRAESIQHRVSTTTTDRVN 138
Query: 84 PIASGRQITQ--------------------SP-------TYIVRAKIGTPAQTLLMAMDT 116
P S + Q SP Y+V +GTPA + DT
Sbjct: 139 PKRSRHRQQQPPSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYTVVFDT 198
Query: 117 SNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
+D WV C CV C +F+ A S+T+ N+ C A C + C GG C + +
Sbjct: 199 GSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQ 258
Query: 173 YGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
YG + + + DT++L++ D V G+ FGC ++ G GLLGLGRG SL QT
Sbjct: 259 YGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTY 318
Query: 231 NLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
Y F++CLP+ + +G L G G P TP+L + YYV + IRVG
Sbjct: 319 GKYGGVFAHCLPARS--TGTGYLDFG-AGSPPATTTTPMLTG-NGPTFYYVGMTGIRVGG 374
Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV---------FRRRVGSNL 341
R++ I P AGTI+DSGTV TRL AY+++R +R+ +L
Sbjct: 375 RLLPIAPSVF-----AAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL 429
Query: 342 TVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
DTCY + PT++L+F G + ++++ + S CLA A D
Sbjct: 430 -------LDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNED 482
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + ++ N Q + + YD+ +G + C
Sbjct: 483 GGD--VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 118/371 (31%), Positives = 173/371 (46%), Gaps = 34/371 (9%)
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
+VP+ +G +++ IGTPA +DT +D W C CV C S+ VF+
Sbjct: 64 LVPVHAGNG-----EFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDP 118
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPG 196
+ S+T+ + C +A C +P C C + TYG SS+ L+ +T +LA +PG
Sbjct: 119 SSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPG 178
Query: 197 YTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
FGC G+ GL+GLGRG LSL++Q L FSYCL S + S L L
Sbjct: 179 VVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQ---LGLDKFSYCLTSLDDTNNS-PLLL 234
Query: 256 GPIG-------QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G + ++ TPL+KNP + S YYV+L AI VG + +P A
Sbjct: 235 GSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTG 294
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP------IVAPTI 362
G I+DSGT T L Y A++ F ++ S G D C+ P + P +
Sbjct: 295 GVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 354
Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
F G ++ LP +N ++ CL + + L++I N QQQN + +YDV
Sbjct: 355 VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-----LSIIGNFQQQNFQFVYDVG 409
Query: 422 NSRLGVARELC 432
+ L A C
Sbjct: 410 HDTLSFAPVQC 420
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 178/352 (50%), Gaps = 25/352 (7%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTG-CVGCSSTVFNSAQSTTFKNLGCQAA 152
++V GTPAQT + DT +D +W+ PC+G C +F+ +S T+ + C
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHP 179
Query: 153 QCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSV 210
QC G C + + YG S+ A LS +T+SL + +PG+ FGC + G+
Sbjct: 180 QCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTSARALPGFAFGCGETNLGDFG 239
Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIKYT 267
GL+GLGRG LSL +Q + + FSYCLPS+ + G L +G P ++YT
Sbjct: 240 DVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYN--TSHGYLTIGTTTPASGSDGVRYT 297
Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
+++ S Y+V+L++I VG V+ +PP + F T GT++DSGTV T L AYT
Sbjct: 298 AMIQKQDYPSFYFVDLVSIVVGGFVLPVPP--ILF---TRDGTLLDSGTVLTYLPPEAYT 352
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLI-- 380
A+RD F+ + + FDTCY I P ++ FS G + L +LI
Sbjct: 353 ALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFDLSPFGVLIFP 412
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
TA + CLA P + ++ N QQ+N ++YDV ++G C
Sbjct: 413 DDTAPATGCLAFVPRPSTMP--FTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 139/431 (32%), Positives = 199/431 (46%), Gaps = 25/431 (5%)
Query: 17 LSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLA 76
LSE L+ +T S L + H+ + S P E+ L +D R++ L +
Sbjct: 40 LSETLSEPQETLSLSLHLHLHHIDALSSNKTP------EQLFHLRLQRDAKRVEALLNQI 93
Query: 77 VARKSVVPIASGRQITQ----SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC- 131
AR+S S I+ S Y R +GTPA+ + M +DT +D W+ C C C
Sbjct: 94 HARRSAGSSFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCY 153
Query: 132 --SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTIA-ANLSQDT 186
+ VF+ +S T+ + C A C+++ +P C C + ++YG + + S +T
Sbjct: 154 TQTDHVFDPTKSRTYAGIPCGAPLCRRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTET 213
Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
++ + V GC G GLLGLGRG LS QT + FSYCL A
Sbjct: 214 LTFRRNRVTRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSA 273
Query: 247 LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPT 305
+ S+ G + +TPL+KNP+ + YY+ LL I VG V + + +
Sbjct: 274 SAKPSSVIFGDSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAA 333
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPT 361
G IIDSGT TRL PAY A+RD FR FDTC+ + + PT
Sbjct: 334 GNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSLFDTCFDLSGLTEVKVPT 393
Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
+ L F G +V+LP N LI C A A S L++I N+QQQ RI YD+
Sbjct: 394 VVLHFRGADVSLPATNYLIPVDNSGSFCFAFAG----TMSGLSIIGNIQQQGFRISYDLT 449
Query: 422 NSRLGVARELC 432
SR+G A C
Sbjct: 450 GSRVGFAPRGC 460
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 168/356 (47%), Gaps = 27/356 (7%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS---TVFNSAQSTTFKNLGCQAAQ 153
Y+ ++GTP + + +DT +D WV C+ C C S ++F ST+F L C
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTEL 62
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAAN------LSQDTISLATDIVPGYTFGCIQKATG 207
C +P P C C + +YG +++ ++ D I+ VP + FGC G
Sbjct: 63 CNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEG 122
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGPIGQPK--RI 264
+ G+LGLG+G LS +Q + ++ FSYCL + A S L G P +
Sbjct: 123 SFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTFPGV 182
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
KY LL NP+ + YYV L I VG ++++I A + AGTI DSGT T+L
Sbjct: 183 KYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVTQLAGE 242
Query: 325 AYTAV--------RDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTLPQD 376
+ V D R+ S+ LGGF +P V P++T F G ++ LP
Sbjct: 243 VHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEG-QLPTV-PSMTFHFEGGDMELPPS 300
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N I + C +M ++PD + +I ++QQQN ++ YD ++G + C
Sbjct: 301 NYFIFLESSQSYCFSMVSSPD-----VTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 186/389 (47%), Gaps = 26/389 (6%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
E + + + + RLQ LS+ + + V P+ +G +++ IGTPA+T
Sbjct: 59 ERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNG-----EFLMNLAIGTPAETYSAI 113
Query: 114 MDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
MDT +D W PC C + +F+ +S++F L C + C +P +C G C +
Sbjct: 114 MDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG-CEYR 172
Query: 171 LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQ 228
+YG S+ L+ +T + V FGC + G + GL+GLGRG LSL++Q
Sbjct: 173 YSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQ 232
Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
L FSYCL S +L +G K TPL++NP R S YY++L I V
Sbjct: 233 ---LGVPKFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISV 289
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
G ++ I G IIDSGT T L A+ A++ F ++ ++ +
Sbjct: 290 GDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTE 349
Query: 349 FDTCYSV-----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
+ C+++ P+ P + F G+++ LP++N +I +A + CL M ++ S +
Sbjct: 350 LELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKENYIIEDSALRVICLTMGSS-----SGM 404
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
++ N QQQN +L+D+ + A C
Sbjct: 405 SIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 131/412 (31%), Positives = 187/412 (45%), Gaps = 45/412 (10%)
Query: 58 VLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT---------YIVRAKIGTPAQ 108
V + L +D R Q S R + + GR + T Y++ IGTP
Sbjct: 65 VRDALRRDMHR-QRSRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPPL 123
Query: 109 TLLMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQAA--QCKQVPNPTC 162
DT +D W C T C + ++N A STTF L C ++ C
Sbjct: 124 PYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAA 183
Query: 163 GGGACA--FNLTYGSSTIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQGL 215
CA +N TYG+ A +T + + VPG FGC ++ + GL
Sbjct: 184 PPPGCACMYNQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGL 243
Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLLKNP 273
+GLGRGSLSL++Q L FSYCL F+ + + +L LGP ++ TP + +P
Sbjct: 244 VGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASP 300
Query: 274 RR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
R S+ YY+NL I +G + + I PGA P G IIDSGT T L AY VR
Sbjct: 301 ARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVR 360
Query: 331 DVFRRRVGSNLTV--TSLGGFDTCYSV-------PIVAPTITLMFSGMNVTLPQDNLLIH 381
+ V + TV + G D C+++ P V P++TL F G ++ LP D+ +I
Sbjct: 361 AAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDGADMVLPADSYMIS 420
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ + CLAM + + ++ N QQQN ILYDV L A C+
Sbjct: 421 GSG--VWCLAMR---NQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 467
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 123/363 (33%), Positives = 171/363 (47%), Gaps = 38/363 (10%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTP + +DT +D W PC CV + F+ AQS ++ L C +
Sbjct: 89 YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPM 148
Query: 154 CKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATD----IVPGYTFGCIQKATGN 208
C + P C C + YG S+ A LS +T + T+ VP FGC G+
Sbjct: 149 CNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCGNLNAGS 208
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRL----GPIG 259
G++G GRG LSL++Q L FSYCL SF + L F L G
Sbjct: 209 LFNGSGMVGFGRGPLSLVSQ---LGSPRFSYCLTSFMSPVPSRLYFGAYATLNSTSASTG 265
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVF 318
+P ++ TP + NP ++YY+N+ I VG ++ I P N G G IIDSG+
Sbjct: 266 EP--VQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTI 323
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLT-VTSLGG-FDTCYSVP------IVAPTITLMFSGMN 370
T L AY V F +VG LT TSL DTC+ P + P + F G N
Sbjct: 324 TYLARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHFEGAN 383
Query: 371 VTLPQDN-LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+ LP +N +LI G++ CLA+AA+ D ++I + Q QN +LYD NS L
Sbjct: 384 MELPLENYMLIDGDTGNL-CLAIAASDDG-----SIIGSFQHQNFHVLYDNENSLLSFTP 437
Query: 430 ELC 432
C
Sbjct: 438 ATC 440
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 123/394 (31%), Positives = 184/394 (46%), Gaps = 32/394 (8%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
+A+ +AR+ L SLA A ++ + ++ Y++ IG+P + +DT +D
Sbjct: 51 VARSRARVAALQSLATAADAITAARILLRFSEG-EYLMDVGIGSPPRYFSAMIDTGSDLI 109
Query: 122 W---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI 178
W PC CV + F A+ST++ +L C +A C + +P C AC + YG S
Sbjct: 110 WTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYSPLCFQNACVYQAFYGDSAS 169
Query: 179 AAN-LSQDTISLATD----IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
+A L+ +T + T+ VP +FGC G G++G GRG+LSL++Q L
Sbjct: 170 SAGVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQ---LG 226
Query: 234 QSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVNLLAI 286
FSYCL SF + L F L ++ TP + NP ++Y++N+ I
Sbjct: 227 SPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGI 286
Query: 287 RVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVT 344
V ++ I P N T G G IIDSGT T L PAY V+ F VG T
Sbjct: 287 SVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANAT 346
Query: 345 SLGGFDTCYSVP------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
FDTC+ P + P + L F G ++ LP +N ++ CLAM + D
Sbjct: 347 PSDTFDTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDG 406
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++I + Q QN +LYD+ NS L C
Sbjct: 407 -----SIIGSFQHQNFHMLYDLENSLLSFVPAPC 435
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 122/400 (30%), Positives = 192/400 (48%), Gaps = 38/400 (9%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--------VPIASGRQITQSPTYIVRA 101
K L+ E V + + ++RLQ L+++ +A S PI +G + Y++
Sbjct: 58 KNLTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHAG-----NGEYLIEL 112
Query: 102 KIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
IGTP + +DT +D W PCT C + +F+ +S++F + C ++ C +P
Sbjct: 113 AIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALP 172
Query: 159 NPTCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI----VPGYTFGCIQKATGNSVP-P 212
+ TC G C + +YG ++ L+ +T + V FGC + G+
Sbjct: 173 SSTCSDG-CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQA 231
Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL--RLGPIGQPKRIKYTPLL 270
GL+GLGRG LSL++Q L + FSYCL S L LG + K + TPLL
Sbjct: 232 SGLVGLGRGPLSLVSQ---LKEQRFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLL 288
Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
KNP + S YY++L AI VG + I + G IIDSGT T + AY A++
Sbjct: 289 KNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALK 348
Query: 331 DVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAG 385
F + L TS G D C+S+P + P + F G ++ LP +N +I +
Sbjct: 349 KEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGDLELPAENYMIGDSNL 408
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+ CLAM A+ S +++ N+QQQN + +D+ +
Sbjct: 409 GVACLAMGAS-----SGMSIFGNVQQQNILVNHDLEKETI 443
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 123/394 (31%), Positives = 184/394 (46%), Gaps = 32/394 (8%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
+A+ +AR+ L SLA A ++ + ++ Y++ IG+P + +DT +D
Sbjct: 54 VARSRARVAALQSLATAADAITAARILLRFSEG-EYLMDVGIGSPPRYFSAMIDTGSDLI 112
Query: 122 W---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI 178
W PC CV + F A+ST++ +L C +A C + +P C AC + YG S
Sbjct: 113 WTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYSPLCFQNACVYQAFYGDSAS 172
Query: 179 AAN-LSQDTISLATD----IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
+A L+ +T + T+ VP +FGC G G++G GRG+LSL++Q L
Sbjct: 173 SAGVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQ---LG 229
Query: 234 QSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVNLLAI 286
FSYCL SF + L F L ++ TP + NP ++Y++N+ I
Sbjct: 230 SPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGI 289
Query: 287 RVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVT 344
V ++ I P N T G G IIDSGT T L PAY V+ F VG T
Sbjct: 290 SVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANAT 349
Query: 345 SLGGFDTCYSVP------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
FDTC+ P + P + L F G ++ LP +N ++ CLAM + D
Sbjct: 350 PSDTFDTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDG 409
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++I + Q QN +LYD+ NS L C
Sbjct: 410 -----SIIGSFQHQNFHMLYDLENSLLSFVPAPC 438
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 129/455 (28%), Positives = 200/455 (43%), Gaps = 68/455 (14%)
Query: 26 DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL----SSLAVAR-- 79
D ++ + + H PCSP + S E+LA DQ+R + + S+ R
Sbjct: 85 DATSSTTRMTIVHRHGPCSPLAAAH--GEPPSHGEILAADQSRAESIQHRVSTTTTGRVN 142
Query: 80 -----------------------KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
+ AS + + Y+V +GTPA + DT
Sbjct: 143 PKRRRHRQQQPPSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYTVVFDT 202
Query: 117 SNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
+D WV C CV C +F+ A S+T+ N+ C A C + C GG C + +
Sbjct: 203 GSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQ 262
Query: 173 YGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
YG + + + DT++L++ D V G+ FGC ++ G GLLGLGRG SL QT
Sbjct: 263 YGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTY 322
Query: 231 NLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
Y F++CLP+ + +G L G G P TP+L + YYV + IRVG
Sbjct: 323 GKYGGVFAHCLPARS--TGTGYLDFG-AGSPPATTTTPMLTG-NGPTFYYVGMTGIRVGG 378
Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV---------FRRRVGSNL 341
R++ I P AGTI+DSGTV TRL AY+++R +R+ +L
Sbjct: 379 RLLPIAPSVF-----AAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL 433
Query: 342 TVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
DTCY + PT++L+F G + ++++ + S CLA A D
Sbjct: 434 -------LDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNED 486
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + ++ N Q + + YD+ +G + C
Sbjct: 487 GGD--VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 133/457 (29%), Positives = 201/457 (43%), Gaps = 72/457 (15%)
Query: 26 DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL----SSLAVARKS 81
D ++ + + H PCSP + S E+LA DQ+R + + S+ R
Sbjct: 82 DATSSTTRMTIVHRHGPCSPLAAAH--GEPPSHGEILAADQSRAESIQHRVSTTTTGR-- 137
Query: 82 VVPIASGRQITQ--------------------SP-------TYIVRAKIGTPAQTLLMAM 114
V P S + Q SP Y+V +GTPA +
Sbjct: 138 VNPKRSRHRQQQPPSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYTVVF 197
Query: 115 DTSNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
DT +D WV C CV C +F+ A S+T+ N+ C A C + C GG C +
Sbjct: 198 DTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYG 257
Query: 171 LTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
+ YG + + + DT++L++ D V G+ FGC ++ G GLLGLGRG SL Q
Sbjct: 258 VQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQ 317
Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
T Y F++CLP + +G L G G P TP+L + YYV + IRV
Sbjct: 318 TYGKYGGVFAHCLPPRS--TGTGYLDFG-AGSPPATTTTPMLTG-NGPTFYYVGMTGIRV 373
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV---------FRRRVGS 339
G R++ I P AGTI+DSGTV TRL AY+++R +R+
Sbjct: 374 GGRLLPIAPSVF-----AAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAV 428
Query: 340 NLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
+L DTCY + PT++L+F G + ++++ + S CLA A
Sbjct: 429 SL-------LDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGN 481
Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D + + ++ N Q + + YD+ +G + C
Sbjct: 482 EDGGD--VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 186/389 (47%), Gaps = 26/389 (6%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
E + + + + RLQ LS+ + + V P+ +G +++ IGTPA+T
Sbjct: 59 ERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNG-----EFLMNLAIGTPAETYSAI 113
Query: 114 MDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
MDT +D W PC C + +F+ +S++F L C + C +P +C G C +
Sbjct: 114 MDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG-CEYR 172
Query: 171 LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQ 228
+YG S+ L+ +T + V FGC + G + GL+GLGRG LSL++Q
Sbjct: 173 YSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQ 232
Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
L FSYCL S +L +G K TPL++NP R S YY++L I V
Sbjct: 233 ---LGVPKFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISV 289
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
G ++ I G IIDSGT T L A+ A++ F ++ ++ +
Sbjct: 290 GDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTE 349
Query: 349 FDTCYSV-----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
+ C+++ P+ P + F G+++ LP++N +I +A + CL M ++ S +
Sbjct: 350 LELCFTLPPDGSPVEVPQLVFHFEGVDLKLPKENYIIEDSALRVICLTMGSS-----SGM 404
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
++ N QQQN +L+D+ + A C
Sbjct: 405 SIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 127/413 (30%), Positives = 189/413 (45%), Gaps = 52/413 (12%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP------------------T 96
+E L +D R++ +++LA +P GR +T +P
Sbjct: 89 DELFSSRLQRDSRRVKSIATLAAQ----IP---GRNVTHAPRPGGFSSSVVSGLSQGSGE 141
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ---------STTFKNL 147
Y R +GTPA+ + M +DT +D W+ C C +Q S T+ +
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR------CYSQSDPIFDPRKSKTYATI 195
Query: 148 GCQAAQCKQVPNPTCGG--GACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQK 204
C + C+++ + C C + ++YG + + S +T++ + V G GC
Sbjct: 196 PCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHD 255
Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
G V GLLGLG+G LS QT + + FSYCL A S S+ G +
Sbjct: 256 NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA 315
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
++TPLL NP+ + YYV LL I V G RV + + + G IIDSGT TRL+
Sbjct: 316 RFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIR 375
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLL 379
PAY A+RD FR + FDTC+ + + PT+ L F G +V+LP N L
Sbjct: 376 PAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYL 435
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
I C A A L++I N+QQQ R++YD+ +SR+G A C
Sbjct: 436 IPVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 123/451 (27%), Positives = 198/451 (43%), Gaps = 60/451 (13%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------SSLAVARKSV-- 82
++ + + H PCSP K S E+L DQ R++++ ++ V R+
Sbjct: 64 ATRMPIVHQHGPCSPLADDKHGKKAPSHTEILVADQRRVEYIHRRVSETTGRVRRQKHSA 123
Query: 83 ----------------------------VPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
+P SG + + Y+V ++GTPA +
Sbjct: 124 PVVELRPGTPSSTRSSSSSLSSSATSTNLPAKSGLSL-NTGNYVVPIRLGTPAARFTVVF 182
Query: 115 DTSNDAAWVPCTGCVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
DT +D WV C CV C +F +S T+ N+ C ++ C + C GG C +
Sbjct: 183 DTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSSYCSDLDTRGCSGGHCLYA 242
Query: 171 LTYGSSTIAANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
+ YG + +QDT++L D V + FGC +K G GL+GLGRG S+ Q
Sbjct: 243 VQYGDGSYTVGFYAQDTLTLGYDTVKDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQA 302
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRL-GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
+ Y F+YC+P+ S +G L + TP+L + + YYV + I+V
Sbjct: 303 YDKYSGVFAYCIPATS--SGTGFLDFGPGAPAAANARLTPMLVD-NGPTFYYVGMTGIKV 359
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSL 346
G ++ IP + AG ++DSGTV TRL AY +R F + + T +
Sbjct: 360 GGHLLSIPATVF-----SDAGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPAF 414
Query: 347 GGFDTCYSV-----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
DTCY + I P ++L+F G + +++ S CLA AA D+ +
Sbjct: 415 SILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAANDDDTD- 473
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ ++ N QQ+ + +LYD+ +G A C
Sbjct: 474 -MTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 113/348 (32%), Positives = 168/348 (48%), Gaps = 14/348 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y VR +G+P + M +D+ +D WV PC+ C S VF+ A S++F + C
Sbjct: 140 SGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCG 199
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+ C ++ N C G C + ++YG S L+ +T+++ ++ GC G
Sbjct: 200 SDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLTVGQVMIRDVAIGCGHTNQGMF 259
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
+ GLLGLG GS+S + Q FSYCL S + +G+L G P + L
Sbjct: 260 IGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVS-RGTGSTGALEFGRGALPVGATWISL 318
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
++NPR S YY+ L I VG V +P Q G ++D+GT TR AY A
Sbjct: 319 IRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTRFPTAAYVAF 378
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTA 384
RD F + + + FDTCY + + PT++ FS G +TLP N LI
Sbjct: 379 RDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPARNFLIPVDG 438
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
G CLA A +P S L++I N+QQ+ +I +D N +G +C
Sbjct: 439 GGTFCLAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 133/423 (31%), Positives = 201/423 (47%), Gaps = 39/423 (9%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---------SSLAVAR-K 80
++T+ + H PCSP P+K + E E L +DQ R ++ + V R
Sbjct: 57 AATVPLHHRHGPCSPL-PTKKMPTLE---ETLHRDQLRAAYIQRKFSGGGGAGGDVQRSD 112
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFN 137
+ VP A G + + Y++ +G+PA + M +DT +D +WV C C C S +F+
Sbjct: 113 ATVPTALGTSL-NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 171
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATDI 193
+ S+T+ C +A C Q+ G C + +TYG S+ S DT++L +
Sbjct: 172 PSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 231
Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
V + FGC +G + GL+GLG G+ SL++QT FSYCLP + S +L
Sbjct: 232 VKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTL 291
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
TP+L++ + + Y V L AIRVG R + IP AGT++D
Sbjct: 292 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMD 345
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGM 369
SGTV TRL AY+A+ F+ + G DTC+ + P++ L+FSG
Sbjct: 346 SGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGG 405
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
V + +I S CLA AA D +S L +I N+QQ+ +LYDV +G
Sbjct: 406 AVVSLDASGIILS-----NCLAFAANSD--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 458
Query: 430 ELC 432
C
Sbjct: 459 GAC 461
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 124/372 (33%), Positives = 185/372 (49%), Gaps = 29/372 (7%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG S Y + +GTP+ LM +DT +D W+ C C C S VF+ +
Sbjct: 128 PVVSG-LAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRR 186
Query: 141 STTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPG 196
S+++ + C A C+++ + C AC + + YG ++ A + + +T++ A V
Sbjct: 187 SSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGARVAR 246
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL--------PSFKALS 248
GC G V GLLGLGRGSLS Q Y +FSYCL + S
Sbjct: 247 VALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRS 306
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG 307
S ++ GP +TP+++NPR + YYV L+ I V G RV + L+ +P+TG
Sbjct: 307 RSSTVTFGPP-SASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTG 365
Query: 308 -AGTIIDSGTVFTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
G I+DSGT TRL P+Y+A+RD FR G L+ FDTCY + + PT
Sbjct: 366 RGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPT 425
Query: 362 ITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+++ F+ G LP +N LI + C A A V ++I N+QQQ R+++D
Sbjct: 426 VSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFDG 481
Query: 421 PNSRLGVARELC 432
R+G A + C
Sbjct: 482 DGQRVGFAPKGC 493
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 129/432 (29%), Positives = 199/432 (46%), Gaps = 53/432 (12%)
Query: 28 QDHSSTLQVFHVFSPCSPFKPS-KPLSWEESVLEMLAKDQARLQF---------LSSLAV 77
+ SS+L++ H F PC+P + S P S S E+L +D+ R+ L+S
Sbjct: 57 NEGSSSLKLVHRFGPCNPHRTSTAPAS---SFNEILRRDKLRVDSIIQARRSMNLTSSVE 113
Query: 78 ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV-- 135
KS VP +IT S YIV IGTP + + + DT + W C C C V
Sbjct: 114 HMKSSVPFYGLSKITAS-DYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKVPV 172
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY-GSSTIAANLSQDTISLATDIV 194
F+ +S +FK L C + C+ + C C + Y +S+ L+ +TIS +
Sbjct: 173 FDPTKSASFKGLPCSSKLCQSI-RQGCSSPKCTYLTAYVDNSSSTGTLATETISFSH--- 228
Query: 195 PGYTF-----GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA--- 246
Y F GC + +G S+ G++GL R +SL +QT N+Y FSYC+PS
Sbjct: 229 LKYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGSTG 288
Query: 247 -LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
L+F G + P ++++P+ K SS Y + + I VG R + I A + T
Sbjct: 289 HLTFGGKV-------PNDVRFSPVSKTA-PSSDYDIKMTGISVGGRKLLIDASAFKIAST 340
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
IDSG V TRL AY+A+R VFR + + DTCY + P+
Sbjct: 341 ------IDSGAVLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPS 394
Query: 362 ITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
I++ F G+ + + ++ + CLA A D V ++ N QQ+ + +++D
Sbjct: 395 ISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEV----SIFGNFQQKTYTVVFDG 450
Query: 421 PNSRLGVARELC 432
R+G A C
Sbjct: 451 AKERIGFAPGGC 462
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 131/414 (31%), Positives = 199/414 (48%), Gaps = 59/414 (14%)
Query: 61 MLAKDQARLQFLSSLAVARKSV-------------VPIASGRQITQSPTYIVRAKIGTPA 107
M AKD+ R+++ S +P+ SG + S Y V+ +G+P
Sbjct: 55 MFAKDEERIRYFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSMG-SGNYYVKMGLGSPT 113
Query: 108 QTLLMAMDTSNDAAWVPCTGC-VGC---SSTVFNSAQSTTFKNLGCQAAQCKQ-----VP 158
+ M +DT + +W+ C C + C VFN + S T+K + C ++QC +
Sbjct: 114 KYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLN 173
Query: 159 NPTCG--GGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQG 214
PTC AC + +YG S+ + LSQD ++L + + + +GC Q G G
Sbjct: 174 EPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDG 233
Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-------GSLRLG--PIGQPKRIK 265
++GL LS+L+Q Y + FSYCLP+ SFS G L +G + K
Sbjct: 234 IIGLANNELSMLSQLSGKYGNAFSYCLPT----SFSTPNSPKEGFLSIGTSSLTPSSSYK 289
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
+TPLLKNP SLY+++L +I V R + + + + TIIDSGTV TRL P
Sbjct: 290 FTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYK------VPTIIDSGTVITRLPTPV 343
Query: 326 YTAVRDVFRRRVGSNL-TVTSLGGFDTCYS-----VPIVAPTITLMFS-GMNVTLPQDNL 378
YT +++ + + + DTC+ + VAP I ++F G ++ L N
Sbjct: 344 YTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGGADLQLKGHNS 403
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L+ G ITCLAMA + S + +I N QQQ ++ YDV NSR+G A C
Sbjct: 404 LVELETG-ITCLAMAGS-----SSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 124/404 (30%), Positives = 196/404 (48%), Gaps = 38/404 (9%)
Query: 58 VLEMLAKDQARLQFLSS---LAVA---RKSVVPIASGRQITQ-------------SPTYI 98
+L LA+D AR++ +++ LAV+ + +VP+ + Q S Y
Sbjct: 102 MLSRLARDSARVKAINTKLQLAVSGTDKSDLVPMDTEILHPQDFSTPVTSGTSQGSGEYF 161
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCK 155
+R IG P++T M +DT +D W+ C C C V F+ A S++F LGCQ QC+
Sbjct: 162 LRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQCR 221
Query: 156 QVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQ 213
+ C +C + ++YG S + + +T+S + V GC G V
Sbjct: 222 NLDVFACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVDKVAIGCGHDNEGLFVGAA 281
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNP 273
GL+GLG G LSL +Q + S+FSYCL + ++ S +L +P P+ KN
Sbjct: 282 GLIGLGGGPLSLTSQIK---ASSFSYCLVNRDSVD-SSTLEFNS-AKPSDSVTAPIFKNS 336
Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
+ + YYV + + VG + IPP + + + G I+D GT TRL AY A+RD F
Sbjct: 337 KVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQTQAYNALRDTF 396
Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSIT 388
+ + + FDTCY++ + PT+ +F G ++ LP N LI +
Sbjct: 397 VKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPPSNYLIPVDSAGTF 456
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA A + L++I N+QQQ R+ YD+ NS++ + C
Sbjct: 457 CLAFAP----TTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 185/370 (50%), Gaps = 38/370 (10%)
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCT-GCVGCSSTVFNSAQSTTFKNLG 148
S Y+V IGTPA+ + DT +D WV PCT C +F+ ++S+T+ ++
Sbjct: 122 HSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVP 181
Query: 149 CQAAQCK--QVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVP--GYTFGCIQ 203
C QCK + TCGG C +++ YG ++ NL+Q+ +L+ P G FGC
Sbjct: 182 CGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSH 241
Query: 204 ------KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ-STFSYCLPSFKALSFSGSLRLG 256
K + GLLGLGRG S+L+QT+ FSYCLP S +G L +G
Sbjct: 242 EYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRG--SSAGYLTIG 299
Query: 257 PIGQPK-RIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
P+ + +TPL+ N + SS+Y VNL+ I V + I A GT+IDS
Sbjct: 300 AAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI------GTVIDS 353
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVP----IVAPTITLMF-S 367
GTV T + A AY +RD FRR +G + L + DTCY V + AP + L F
Sbjct: 354 GTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVALEFGG 413
Query: 368 GMNVTLPQDNLL----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G + + +L + ++ S+T +A P N+ + +I NMQQ+ + +++DV
Sbjct: 414 GARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFV-IIGNMQQRAYNVVFDVEGR 472
Query: 424 RLGVARELCT 433
R+G C+
Sbjct: 473 RIGFGANGCS 482
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 131/403 (32%), Positives = 196/403 (48%), Gaps = 47/403 (11%)
Query: 62 LAKDQARLQFLSSLAVARKSVV-PIASGRQIT--QSPTYIVRAKIGTPAQTLLMAMDTSN 118
+A+ +AR+ L S AV+ V PI + R + S Y+V IGTP MDT +
Sbjct: 51 IARSKARVAALQSAAVSPAPVADPITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGS 110
Query: 119 DAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG- 174
D W C C+ C++ F+ +S T++ L C++++C + +P+C C + YG
Sbjct: 111 DLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSSRCAALSSPSCFKKMCVYQYYYGD 170
Query: 175 SSTIAANLSQDTISL----ATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
+++ A L+ +T + +T + +FGC G G++G GRG LSL++Q
Sbjct: 171 TASTAGVLANETFTFGAASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQ- 229
Query: 230 QNLYQSTFSYCLPSFKA-----LSFSGSLRLGPI----GQPKRIKYTPLLKNPRRSSLYY 280
L S FSYCL S+ + L F L G P ++ TP + NP ++Y+
Sbjct: 230 --LGPSRFSYCLTSYLSPTPSRLYFGVFANLNSTNTSSGSP--VQSTPFVINPALPNMYF 285
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
+++ I +G + + I P N G IIDSGT T L AY AV RR + S
Sbjct: 286 LSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV----RRGLAST 341
Query: 341 LTVTSLG----GFDTCYSVP------IVAPTITLMFSGMNVTLPQDN-LLIHSTAGSITC 389
+ + ++ G DTC+ P + P F G N+TLP +N +LI ST G + C
Sbjct: 342 IPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFDGANMTLPPENYMLIASTTGYL-C 400
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LAMA SV +I N QQQN +LYD+ NS L C
Sbjct: 401 LAMAP-----TSVGTIIGNYQQQNLHLLYDIANSFLSFVPAPC 438
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 130/409 (31%), Positives = 186/409 (45%), Gaps = 42/409 (10%)
Query: 60 EMLAKDQARLQFLSSLAVARKSV----VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMD 115
+ LA D RL FLS + RK + P+ SG + S Y V +IG P Q+LL+ D
Sbjct: 47 QALALDTRRLHFLS---LRRKPIPFVKSPVVSG-AASGSGQYFVDLRIGQPPQSLLLIAD 102
Query: 116 TSNDAAWVPCTGCVGCS----STVFNSAQSTTFKNLGCQAAQCKQVPNP--------TCG 163
T +D WV C+ C CS +TVF S+TF C C+ VP P T
Sbjct: 103 TGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRI 162
Query: 164 GGACAFNLTYGSSTIAANL-SQDTISLATDI-----VPGYTFGCIQKATGNSVP------ 211
C + Y ++ + L +++T SL T + FGC + +G SV
Sbjct: 163 HSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNG 222
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIGQP-KRIKYTPL 269
G++GLGRG +S +Q + + FSYCL + + + L +G G ++ +TPL
Sbjct: 223 ANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPL 282
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
L NP + YYV L ++ V + I P + + + GT++DSGT L PAY +V
Sbjct: 283 LTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSV 342
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLLIHST 383
RRRV + GFD C +V V P + FSG V +P T
Sbjct: 343 IAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIET 402
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
I CLA+ + V +VI N+ QQ +D SRLG +R C
Sbjct: 403 EEQIQCLAIQSVDPKVG--FSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 129/416 (31%), Positives = 200/416 (48%), Gaps = 38/416 (9%)
Query: 49 SKPLSWEESVLEMLAKDQARLQFLSS-LAVARKSVV---------------------PIA 86
+ LS+ E + + L +D AR+ ++S L +A + P+
Sbjct: 76 NNELSYAERMQQRLKRDAARVAAINSRLELAVNGIKRSSLKPDSSSSFTMAESDFQSPVV 135
Query: 87 SGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTT 143
SG S Y R +G P + LM +DT +D W+ PC+ C S ++N A S++
Sbjct: 136 SGMD-QGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSS 194
Query: 144 FKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGC 201
+K +GCQA C+Q+ C G+C + ++YG S N + +T++L + GC
Sbjct: 195 YKLVGCQANLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGAPLQNVAIGC 254
Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
G V GLLGLG GSLS +Q + FSYCL + S S +L+ G P
Sbjct: 255 GHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDRDSES-SSTLQFGRAAVP 313
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
P+LKN R + YYV+L I VG +++ I + + G I+DSGT TRL
Sbjct: 314 NGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTRL 373
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQD 376
AY ++RD FR + + + FDTCY + + PT+ FS G +++LP
Sbjct: 374 QTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPAK 433
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N L+ + C A A +S L+++ N+QQQ R+ +D N+++G A C
Sbjct: 434 NYLVPVDSMGTFCFAFAP----TSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 137/417 (32%), Positives = 207/417 (49%), Gaps = 44/417 (10%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVARKS---VVPIASGRQ-----ITQSPT---YIVRAKIG 104
E + L +D+ R ++ S A A + VV +++GR ++++PT Y+ + +G
Sbjct: 82 ELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVAPVVSRAPTSGEYMAKIAVG 141
Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
TPA L+A+DT++D W+ C C C S VF+ ST++ + A C+ +
Sbjct: 142 TPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRSG 201
Query: 162 CGG---GACAFNLTYG-----SSTIAANLSQDTISLATDIVPGY-TFGCIQKATG-NSVP 211
G G C + + YG +ST +L ++T++ A + Y + GC G P
Sbjct: 202 GGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAP 261
Query: 212 PQGLLGLGRGSLSLLAQTQNL-YQSTFSYCLPSFKALSFSGSLRL----GPIGQPKRIKY 266
G+LGLGRG +S+ Q L Y ++FSYCL F + S S L G + +
Sbjct: 262 AAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASF 321
Query: 267 TPLLKNPRRSSLYYVNLLAIRVGR-RVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAP 324
TP + N + YYV L+ + VG RV + LQ +P TG G I+DSGT TRL P
Sbjct: 322 TPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARP 381
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGG----FDTCYSV----PIVAPTITLMFS-GMNVTLPQ 375
AY A RD FR ++L S GG FDTCY+V + P +++ F+ G+ V+L
Sbjct: 382 AYVAFRDAFRAAA-TSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQP 440
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N LI + C A A D ++VI N+ QQ R++YD+ R+G A C
Sbjct: 441 KNYLIPVDSRGTVCFAFAGTGDR---SVSVIGNILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 132/406 (32%), Positives = 198/406 (48%), Gaps = 54/406 (13%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQIT--QSPTYIVRAKIGTPAQTLLMAMDTSND 119
+A+ +AR+ L S AV V PI + R + S Y+V IGTP MDT +D
Sbjct: 52 IARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSD 111
Query: 120 AAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-S 175
W C C+ C+ + F+ +S T++ L C++++C + +P+C C + YG +
Sbjct: 112 LIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQYYYGDT 171
Query: 176 STIAANLSQDTISL---------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
++ A L+ +T + AT+I FGC G+ G++G GRG LSL+
Sbjct: 172 ASTAGVLANETFTFGAANSTKVRATNIA----FGCGSLNAGDLANSSGMVGFGRGPLSLV 227
Query: 227 AQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPI----GQPKRIKYTPLLKNPRRSS 277
+Q L S FSYCL S+ + L F L G P ++ TP + NP +
Sbjct: 228 SQ---LGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSP--VQSTPFVINPALPN 282
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
+Y+++L AI +G +++ I P N G IIDSGT T L AY AV RR +
Sbjct: 283 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV----RRGL 338
Query: 338 GSNLTVTSLG----GFDTCYSVP------IVAPTITLMFSGMNVT-LPQDNLLIHSTAGS 386
S + + ++ G DTC+ P + P + F N+T LP++ +LI ST G
Sbjct: 339 VSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGY 398
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ CL M AP V + +I N QQQN +LYD+ NS L C
Sbjct: 399 L-CLVM--APTGVGT---IIGNYQQQNLHLLYDIGNSFLSFVPAPC 438
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 121/399 (30%), Positives = 192/399 (48%), Gaps = 37/399 (9%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-------VPIASGRQITQSPTYIVRAK 102
K L+ E V + + ++RLQ L+++ +A ++ PI +G + Y++
Sbjct: 59 KNLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAG-----NGEYLMELA 113
Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
IGTP + +DT +D W PCT C + +F+ +S++F + C ++ C VP+
Sbjct: 114 IGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAVPS 173
Query: 160 PTCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI----VPGYTFGCIQKATGNSVP-PQ 213
TC G C + +YG ++ L+ +T + V FGC + G+
Sbjct: 174 STCSDG-CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQAS 232
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL--RLGPIGQPKRIKYTPLLK 271
GL+GLGRG LSL++Q L + FSYCL S L LG + K + TPLLK
Sbjct: 233 GLVGLGRGPLSLVSQ---LKEPRFSYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLK 289
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
NP + S YY++L I VG + I + G IIDSGT T + A+ A++
Sbjct: 290 NPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKK 349
Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
F + L TS G D C+S+P + P I F G ++ LP +N +I +
Sbjct: 350 EFISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGDLELPAENYMIGDSNLG 409
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+ CLAM A+ S +++ N+QQQN + +D+ +
Sbjct: 410 VACLAMGAS-----SGMSIFGNVQQQNILVNHDLEKETI 443
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 131/420 (31%), Positives = 196/420 (46%), Gaps = 45/420 (10%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV------VPI 85
+T+ + H + PCSP P + ++LE+L DQ R +++ + VP
Sbjct: 63 TTVPLNHRYGPCSP----APSAKVPTILELLEHDQLRAKYIQRKLSGTDGLQPLDLTVPT 118
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFK 145
G + + Y++ IG+PA T M +DT +D +WV C G T+F+ ++STT+
Sbjct: 119 TLGSAL-DTMEYVITVGIGSPAVTQTMMIDTGSDVSWVRCNSTDGL--TLFDPSKSTTYA 175
Query: 146 NLGCQAAQCKQVPN--PTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPGYTFGC 201
C +A C Q+ N C C + + YG S S DT++L A+D V + FGC
Sbjct: 176 PFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASDTVTDFHFGC 235
Query: 202 IQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIG 259
+ GL+GLG + SL++QT Y +FSYCLP SG L G P G
Sbjct: 236 SHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPTNRT--SGFLTFGAPNG 293
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
TP+L+ P+ +LY V L I VG + I P L G+++DSGTV T
Sbjct: 294 TSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLS------NGSVMDSGTVIT 347
Query: 320 RLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNVT- 372
L AY+A+ FR + + LG DTCY V + P ++L+ G V
Sbjct: 348 WLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSIPAVSLVLDGGAVVD 407
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L + ++I CLA AA + ++I N+QQ+ +L+DV G C
Sbjct: 408 LDGNGIMIQD------CLAFAATSGD-----SIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 132/441 (29%), Positives = 201/441 (45%), Gaps = 51/441 (11%)
Query: 27 TQDHSSTLQVFHVFSPCSPFKPS----KPLSWEESVLEMLAKDQARLQFLSSLAVA---- 78
+ D +S++ + H + PCSP P+ +P E + L D R +F S A
Sbjct: 55 SSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGED 114
Query: 79 ---RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC------TGCV 129
K VP G + + Y++ +G+PA T + +DT +D +WV C + C
Sbjct: 115 GQSSKVSVPTTLGSSL-DTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCH 173
Query: 130 GCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYGS-STIAANLS 183
+ +F+ A S+T+ C AA C Q+ + G C + + YG S S
Sbjct: 174 AHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYS 233
Query: 184 QDTISLA-TDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYC 240
D ++L+ +D+V G+ FGC G + + GL+GLG + SL++QT Y +FSYC
Sbjct: 234 SDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYC 293
Query: 241 LPSFKALSFSGSLRLGPIGQPK-----RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
LP+ A S G L LG R TP+L++ + + Y+ L I VG + + +
Sbjct: 294 LPATPASS--GFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGL 351
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
P AG+++DSGTV TRL AY A+ FR + LG DTC++
Sbjct: 352 SPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNF 405
Query: 356 ----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
+ PT+ L+F+G V +L H S CLA AP + I N+QQ
Sbjct: 406 TGLDKVSIPTVALVFAGGAVV----DLDAHGIV-SGGCLAF--APTRDDKAFGTIGNVQQ 458
Query: 412 QNHRILYDVPNSRLGVARELC 432
+ +LYDV G C
Sbjct: 459 RTFEVLYDVGGGVFGFRAGAC 479
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 183/389 (47%), Gaps = 32/389 (8%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQ 108
K L+ E + + + + R++ ++++ + + P+ +G Y++ IGTP
Sbjct: 53 KNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDG-----EYLMNVAIGTPDS 107
Query: 109 TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
+ MDT +D W PCT C + +FN S++F L C++ C+ +P+ TC
Sbjct: 108 SFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN 167
Query: 166 ACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQK----ATGNSVPPQGLLGLGR 220
C + YG ST ++ +T + T VP FGC + GN GL+G+G
Sbjct: 168 ECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGA---GLIGMGW 224
Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI--GQPKRIKYTPLLKNPRRSSL 278
G LSL +Q L FSYC+ S+ + S S +L LG G P+ T L+ + +
Sbjct: 225 GPLSLPSQ---LGVGQFSYCMTSYGSSSPS-TLALGSAASGVPEGSPSTTLIHSSLNPTY 280
Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG 338
YY+ L I VG + IP Q G IIDSGT T L AY AV F ++
Sbjct: 281 YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN 340
Query: 339 SNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
S G TC+ P + P I++ F G + L + N+LI G I CLAM
Sbjct: 341 LPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAEGVI-CLAMG 399
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPN 422
++ +++ N+QQQ ++LYD+ N
Sbjct: 400 SSS---QLGISIFGNIQQQETQVLYDLQN 425
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 131/439 (29%), Positives = 212/439 (48%), Gaps = 56/439 (12%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------SLAVARKSV- 82
S+TL++ H CS K + W + + L D R+Q L + + +SV
Sbjct: 67 ESTTLEMKHR-ELCS----GKTIDWGKKMRRALLLDNIRVQSLQLRIKAMTSSTTEQSVS 121
Query: 83 ---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VF 136
+P+ SG ++ ++ YIV ++G +L++ DT +D WV C C C + ++
Sbjct: 122 ETQIPLTSGIKL-ETLNYIVTVELGGKNMSLIV--DTGSDLTWVQCQPCRSCYNQQGPLY 178
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-----CGG------GACAFNLTYGS-STIAANLSQ 184
+ + S+++K + C ++ C+ + T CGG C + ++YG S +L+
Sbjct: 179 DPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLAS 238
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
++I L + FGC + G GL+GLGR S+SL++QT + FSYCLPS
Sbjct: 239 ESIVLGDTKLENLVFGCGRNNKGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL 298
Query: 245 KALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
+ + SG+L G + YTPL++NP+ S Y +NL +G + L
Sbjct: 299 EDGA-SGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIG----GVELKTL 353
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----P 356
F G G +IDSGTV TRL Y AV+ F ++ + DTC+++
Sbjct: 354 SF----GRGILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYED 409
Query: 357 IVAPTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
I PTI ++F G N L D + S+ CLA+A+ + + + +I N QQ+N
Sbjct: 410 ISIPTIKMIFEG-NAELEVDVTGVFYFVKPDASLVCLALASL--SYENEVGIIGNYQQKN 466
Query: 414 HRILYDVPNSRLGVARELC 432
R++YD RLG+A E C
Sbjct: 467 QRVIYDTTQERLGIAGENC 485
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 137/432 (31%), Positives = 199/432 (46%), Gaps = 38/432 (8%)
Query: 21 LNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARL-QFLSSLAVAR 79
+N C+ +Q+ HV + LS E + M + +AR + LSS A A
Sbjct: 24 INSCCNAAAAPVRMQLTHV-------DAGRGLSGRELMRRMALRSKARAPRLLSSSATAP 76
Query: 80 KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
S G +T+ Y++ IGTP Q + + +DT +D W C C C S +
Sbjct: 77 VSPGAYDDGVPMTE---YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYY 133
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-CGG---GACAFNLTYGS-STIAANLSQDTIS-LA 190
++++S+TF C + QCK P+ T C CAF+ +YG S L +T+S +A
Sbjct: 134 DASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVA 193
Query: 191 TDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
VPG FGC TG + G+ G GRG LSL +Q L FS+C +
Sbjct: 194 GASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVSGRKP 250
Query: 250 SGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
S L P K ++ TPL+KNP + YY++L I VG + +P A
Sbjct: 251 STVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNG 310
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-----P 360
TG GTIIDSGT FT L Y V D F V + ++ G C+S P + P
Sbjct: 311 TG-GTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVP 369
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ L F G + LP++N + + G + +A + + +I N QQQN +LYD+
Sbjct: 370 KLVLHFEGATMHLPRENYVFEAKDGGNCSICLAI----IEGEMTIIGNFQQQNMHVLYDL 425
Query: 421 PNSRLGVARELC 432
NS+L R C
Sbjct: 426 KNSKLSFVRAKC 437
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 132/423 (31%), Positives = 200/423 (47%), Gaps = 39/423 (9%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---------SSLAVAR-K 80
++T+ + H PCSP P+K + E E L +DQ R ++ + V R
Sbjct: 127 AATVPLHHRHGPCSPL-PTKKMPTLE---ETLHRDQLRAAYIQRKFSGGGGAGGDVQRSD 182
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFN 137
+ VP A G + + Y++ +G+PA + M +DT +D +WV C C C S +F+
Sbjct: 183 ATVPTALGTSL-NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 241
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATDI 193
+ S+T+ C +A C Q+ G C + +TYG S+ S DT++L +
Sbjct: 242 PSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 301
Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
V + FGC +G + GL+GLG G+ SL++QT FSYCLP + S +L
Sbjct: 302 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTL 361
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
TP+L++ + + Y V L AIRVG R + IP AGT++D
Sbjct: 362 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMD 415
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGM 369
SGTV TRL AY+A+ F+ + G DTC+ + P++ L+FSG
Sbjct: 416 SGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGG 475
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
V + +I S CLA A D +S L +I N+QQ+ +LYDV +G
Sbjct: 476 AVVSLDASGIILS-----NCLAFAGNSD--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 528
Query: 430 ELC 432
C
Sbjct: 529 GAC 531
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 132/423 (31%), Positives = 200/423 (47%), Gaps = 39/423 (9%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---------SSLAVAR-K 80
++T+ + H PCSP P+K + E E L +DQ R ++ + V R
Sbjct: 57 AATVPLHHRHGPCSPL-PTKKMPTLE---ETLHRDQLRAAYIQRKFSGGGGAGGDVQRSD 112
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFN 137
+ VP A G + + Y++ +G+PA + M +DT +D +WV C C C S +F+
Sbjct: 113 ATVPTALGTSL-NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 171
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATDI 193
+ S+T+ C +A C Q+ G C + +TYG S+ S DT++L +
Sbjct: 172 PSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 231
Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
V + FGC +G + GL+GLG G+ SL++QT FSYCLP + S +L
Sbjct: 232 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTL 291
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
TP+L++ + + Y V L AIRVG R + IP AGT++D
Sbjct: 292 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMD 345
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGM 369
SGTV TRL AY+A+ F+ + G DTC+ + P++ L+FSG
Sbjct: 346 SGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGG 405
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
V + +I S CLA A D +S L +I N+QQ+ +LYDV +G
Sbjct: 406 AVVSLDASGIILS-----NCLAFAGNSD--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 458
Query: 430 ELC 432
C
Sbjct: 459 GAC 461
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 138/426 (32%), Positives = 208/426 (48%), Gaps = 49/426 (11%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-LAVARK-----SVVPIASGR 89
+ H SPCSP PLS + + D AR+ L+S LA K S VP+ASG
Sbjct: 46 LHHPQSPCSP----APLSSDLPFSAFITHDAARIAGLASRLATKDKDWVAASSVPLASGA 101
Query: 90 QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC---SSTVFNSAQSTTFK 145
+ YI R +GTP T +M +D+ + W+ C C V C + +++ S+T+
Sbjct: 102 SVGVG-NYITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYA 160
Query: 146 NLGCQAAQCKQV------PNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD-IVPGY 197
+ C A QC ++ P+ G G C + +YG + + LS+DT+SL++ PG+
Sbjct: 161 AVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGSFPGF 220
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
+GC Q G GL+GL R LSLL+Q ++F+YCLP+ A S +G L G
Sbjct: 221 YYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAAS-AGYLSFGS 279
Query: 258 IGQ---PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
P + YT ++ + +SLY+V+L + V + +P P TIIDS
Sbjct: 280 NSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEYGSLP-----TIIDS 334
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD---TCYSVPIV---APTITLMFS- 367
GTV TRL P YTA+ + VG+ L S + TC+ + P + + F+
Sbjct: 335 GTVITRLPTPVYTAL----SKAVGAALAAPSAPAYSILQTCFKGQVAKLPVPAVNMAFAG 390
Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
G + L N+L+ + TCLA AP + + +I N QQQ ++YDV SR+G
Sbjct: 391 GATLRLTPGNVLVDVNE-TTTCLAF--APTDSTA---IIGNTQQQTFSVVYDVKGSRIGF 444
Query: 428 ARELCT 433
A C+
Sbjct: 445 AAGGCS 450
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 117/390 (30%), Positives = 196/390 (50%), Gaps = 25/390 (6%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
K L+ + + + + RL+ L+++ +A S I S ++ + +++ IGTP +T
Sbjct: 54 KNLTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINS-PVLSGNGEFLMNLAIGTPPET 112
Query: 110 LLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA 166
MDT +D W PCT C S +F+ +S++F L C + CK +P +C +
Sbjct: 113 YSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCSD-S 171
Query: 167 CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLS 224
C + TYG S+ ++ +T + +P FGC + G+ GL+GLGRG LS
Sbjct: 172 CEYLYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLS 231
Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI----GQPKRIKYTPLLKNPRRSSLYY 280
L++Q L ++ FSYCL S S +L +G + G I+ TPL++NP + S YY
Sbjct: 232 LVSQ---LKEAKFSYCLTSIDDTKTS-TLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYY 287
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
++L I VG + I Q G IIDSGT T L A+ V+ F ++G
Sbjct: 288 LSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLP 347
Query: 341 LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
+ + G + CY++P + P + L F+G ++ LP +N +I ++ + CLAM ++
Sbjct: 348 VDNSGATGLELCYNLPSDTSELEVPKLVLHFTGADLELPGENYMIADSSMGVICLAMGSS 407
Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+++ N+QQQN + +D+ L
Sbjct: 408 -----GGMSIFGNVQQQNMFVSHDLEKETL 432
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 135/431 (31%), Positives = 204/431 (47%), Gaps = 44/431 (10%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA-----RKSVVPIA 86
S+L V H PCSP + S S E+L +DQ R+ + A + V +A
Sbjct: 71 SSLTVVHRHGPCSPLRSRG--SGAPSHTEILRRDQDRVDAIRRKVTASSNKPKGGVSLLA 128
Query: 87 SGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTT 143
+ + + Y+ ++GTPA L++ +DT +D +WV C C C VF+ S+T
Sbjct: 129 NWGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASST 188
Query: 144 FKNLGCQAAQCKQVPNPTCGGGA-------CAFNLTYGS-STIAANLSQDTISLA----- 190
+ + C A +C+++ + + C + ++Y S +L++DT++L+
Sbjct: 189 YSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSP 248
Query: 191 --TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
D VPG+ FGC G GLLGLG G SL +Q Y + FSYCLPS + S
Sbjct: 249 SPADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPS--SPS 306
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
+G L G ++T ++ +S YY+NL I V R + +P A T A
Sbjct: 307 AAGYLSFGGAAARANAQFTEMVTGQDPTS-YYLNLTGIVVAGRAIKVPASAF----ATAA 361
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSV----PIVAPTI 362
GTIIDSGT F+RL AY A+R FR +G + FDTCY + P +
Sbjct: 362 GTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAV 421
Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
L+F+ G V L +L + TCLA N L ++ N QQ+ ++YDV
Sbjct: 422 ELVFADGATVHLHPSGVLYTWNDVAQTCLAFVP-----NHDLGILGNTQQRTLAVIYDVG 476
Query: 422 NSRLGVARELC 432
+ R+G R+ C
Sbjct: 477 SQRIGFGRKGC 487
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 117/368 (31%), Positives = 183/368 (49%), Gaps = 28/368 (7%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTG-CVGCSSTVFNS 138
+P ++G + + ++V G+PAQ +++DT +D +W+ PC+G C VF+
Sbjct: 148 IPDSTGTSL-DTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDP 206
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPG 196
+S T+ + C QC G C + +TYG S+ A LS +T+SL+ T +PG
Sbjct: 207 TKSATYSAVPCGHPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRDLPG 266
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
+ FGC Q G GL+GLGRG+LSL +Q + +TFSYCLPS+ G L +G
Sbjct: 267 FAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDTT--HGYLTMG 324
Query: 257 PI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
++YT +++ SLY+V +++I +G ++ +PP T GT+
Sbjct: 325 STTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVF-----TRDGTL 379
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS 367
DSGT+ T L AY ++RD F+ + + FDTCY I P + FS
Sbjct: 380 FDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFS 439
Query: 368 -GMNVTLPQDNLLIH--STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
G L +LI+ TA + CLA P + N+I N QQ+ ++YDV +
Sbjct: 440 DGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMP--FNIIGNTQQRGTEVIYDVAAEK 497
Query: 425 LGVARELC 432
+G + C
Sbjct: 498 IGFGQFTC 505
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 125/407 (30%), Positives = 185/407 (45%), Gaps = 37/407 (9%)
Query: 60 EMLAKDQARLQFL-SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
+ L+ D RL F S+L + P+ SG T S Y V ++GTP Q LL+ DT +
Sbjct: 52 QALSFDSHRLSFFFSALHTPQSLKSPVVSGAS-TGSGQYFVDLRLGTPPQKLLLVADTGS 110
Query: 119 DAAWVPCTGCVGCS----STVFNSAQSTTFKNLGCQAAQCKQVPNPT---CGGG----AC 167
D WV C+ C C+ + F + STTF C + C+ VP P C C
Sbjct: 111 DLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPC 170
Query: 168 AFNLTYGS-STIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVP------PQGL 215
+ +YG S + S++T +L T + G FGC + +G SV G+
Sbjct: 171 RYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGV 230
Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGSLRLGPIGQPKRIKYTPL 269
+GLGRG +SL +Q + + + FSYCL PS + GS + +R+++TPL
Sbjct: 231 MGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPL 290
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
NP + YY+ + ++ V + I P + GTI+DSGT T L PAY +
Sbjct: 291 HINPLSPTFYYIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQI 350
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIHSTAG 385
V +RRV GFD C +V + P ++ G +V P T
Sbjct: 351 LTVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDE 410
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ CLA+ A S +VI N+ QQ + +D +RLG +R C
Sbjct: 411 DVKCLALQAV--MTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 135/434 (31%), Positives = 194/434 (44%), Gaps = 48/434 (11%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSKPLSWE----------ESVLEMLAKDQARLQFLSSLAVA 78
++ + L+V H PCS + + +S+ L+KD LS +
Sbjct: 80 ENKAFLKVVHKHGPCSDLRQGHKAEAQYILLQDQSRVDSIHSKLSKDSG----LSDVKAT 135
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SST 134
+ +P G I S Y V +GTP + + DT +D W C CV
Sbjct: 136 AATTLPAKDG-SIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEA 194
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPT-----CGGGACAFNLTYGSSTIAANL-SQDTIS 188
+FN +QST++ N+ C + C + + T C C + + YG S+ + ++ +S
Sbjct: 195 IFNPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLS 254
Query: 189 L-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA- 246
L ATD+ + FGC Q G GLLGLGR LSL++QT Y FSYCLPS +
Sbjct: 255 LTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLPSSSSS 314
Query: 247 ---LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
L+F GS K +TPL SS Y ++L I VG R + I P
Sbjct: 315 TGFLTFGGS-------TSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVF--- 364
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVA 359
+ AGTIIDSGTV TRL AY+A+ FR+ + +L DTC+ I
Sbjct: 365 --STAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISV 422
Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
P I L FSG V + + + CLA A D S + + N+QQ+ ++YD
Sbjct: 423 PKIGLFFSGGVVVDIDKTGIFYVNDLTQVCLAFAGNSD--ASDVAIFGNVQQKTLEVVYD 480
Query: 420 VPNSRLGVARELCT 433
R+G A C+
Sbjct: 481 GAAGRVGFAPAGCS 494
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 127/427 (29%), Positives = 205/427 (48%), Gaps = 38/427 (8%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQ-FLSSLAVARKSVVPIAS 87
+ +S+L+V + + PC P + S E L +DQ R++ F L++ S V
Sbjct: 66 NRASSLKVVNKYGPCIPVTGAPKTINVPSTAEFLLQDQLRVKSFQVRLSMNPSSGVFKEM 125
Query: 88 GRQITQS--PT---YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C---SSTVFNS 138
I S PT Y+V +GTP + ++ DT +D W C C+G C + F+
Sbjct: 126 QTTIPASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDP 185
Query: 139 AQSTTFKNLGCQAAQCKQV-----PNPTCGGGACAFNLTYGSSTIAANLSQDTISLAT-D 192
ST++KN+ C + CK + P C C + + YGS L+ +T+++A+ D
Sbjct: 186 TTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSGYTIGFLATETLAIASSD 245
Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
+ + FGC +++ G GLLGLGR ++L +QT N Y++ FSYCLP+ + S +G
Sbjct: 246 VFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPA--SPSSTGH 303
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
L G + + K TP+ +P+ LY +N + I V R + I G++ + TII
Sbjct: 304 LSFG-VEVSQAAKSTPI--SPKLKQLYGLNTVGISVRGRELPI-NGSI-------SRTII 352
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP------IVAPTITLMF 366
DSGT FT L +P Y+A+ FR + + F CY + P I++ F
Sbjct: 353 DSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFF 412
Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G+ V + ++I CLA A +S + N QQ+ + ++YDV +
Sbjct: 413 EGGVEVEIDVSGIMIPVNGLKEVCLAFADT--GSDSDFAIFGNYQQKTYEVIYDVAKGMV 470
Query: 426 GVARELC 432
G A + C
Sbjct: 471 GFAPKGC 477
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 131/434 (30%), Positives = 199/434 (45%), Gaps = 54/434 (12%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV----------- 82
L++ H SPCSP P+ + +L D AR+ L++ S
Sbjct: 45 LELHHPRSPCSP----APVPADLPFTAVLTHDDARISSLAARLAKTPSARATSLDADADA 100
Query: 83 --------VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC-- 131
VP++ G + Y+ R +GTPA +M +DT + W+ C+ C V C
Sbjct: 101 GLAGSLASVPLSPGASVGVG-NYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHR 159
Query: 132 -SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSSTIAAN-LS 183
S VFN S+T+ ++GC A QC +P+ T AC+ + +YG S+ + LS
Sbjct: 160 QSGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLS 219
Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
+DT+S + +P + +GC Q G GL+GL R LSLL Q +F+YCLPS
Sbjct: 220 KDTVSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPS 279
Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
+ + P + YTP++ + SLY++ L + V + + A
Sbjct: 280 SSSSGYLSLGSY----NPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSL 335
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SVPIVAP 360
P TIIDSGTV TRL Y+A+ + ++ DTC+ + + AP
Sbjct: 336 P-----TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAP 390
Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
+T+ F+ G + L NLL+ S TCLA A A +I N QQQ ++YD
Sbjct: 391 AVTMSFAGGAALKLSAQNLLVD-VDDSTTCLAFAPARSAA-----IIGNTQQQTFSVVYD 444
Query: 420 VPNSRLGVARELCT 433
V +SR+G A C+
Sbjct: 445 VKSSRIGFAAGGCS 458
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 133/422 (31%), Positives = 196/422 (46%), Gaps = 49/422 (11%)
Query: 50 KPLSWEESVLEMLAKDQAR-----LQFLSSLAVARK-----SVVPIASGRQITQSPTYIV 99
P + + + +LA D++R L+ + A A + VP+ SG + Q+ Y+
Sbjct: 129 DPAAHDRYLRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRF-QTLNYVT 187
Query: 100 RAKIG-----TPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
+G +PA L + +DT +D WV PC+ C +F+ A S T+ + C A
Sbjct: 188 TIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNA 247
Query: 152 AQCKQVPNP------TCGGG--ACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCI 202
+ C +CGGG C + L YG + + L+ DT++L + G+ FGC
Sbjct: 248 SACAASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCG 307
Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK 262
G GL+GLGR LSL++QT Y FSYCLP+ + SGSL LG
Sbjct: 308 LSNRGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSY 367
Query: 263 R----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
R + YT ++ +P + Y++N+ VG AL + +IDSGTV
Sbjct: 368 RNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGT-------ALAAQGLGASNVLIDSGTVI 420
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSV----PIVAPTITLMFS-GMNV 371
TRL Y VR F R+ + T+ G DTCY + + P +TL G V
Sbjct: 421 TRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEV 480
Query: 372 TLPQDNLL-IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
T+ +L + GS CLAMA+ + +I N QQ+N R++YD SRLG A E
Sbjct: 481 TVDAAGMLFVVRKDGSQVCLAMASL--SYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADE 538
Query: 431 LC 432
C
Sbjct: 539 DC 540
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 131/415 (31%), Positives = 186/415 (44%), Gaps = 49/415 (11%)
Query: 52 LSWEESVLEMLAKDQAR-LQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTL 110
+S E V + L +D R +F LA + V + + + YI+ IGTP +
Sbjct: 42 VSATEFVRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEYIMTLAIGTPPLSY 101
Query: 111 LMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQ------AAQCKQVPNP 160
DT +D W C + C + +N + STTF L C AA P P
Sbjct: 102 PAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPPP 161
Query: 161 TCGGGACAFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGL 215
C +C +N TYG+ A S +T + + VPG FGC ++ + GL
Sbjct: 162 GC---SCMYNQTYGTGWTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSDDWNGSAGL 218
Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLLKNP 273
+GLGRGS+SL++Q L FSYCL F+ + + +L LGP + TP + +P
Sbjct: 219 VGLGRGSMSLVSQ---LGAGMFSYCLTPFQDANSTSTLLLGPSAALNGTGVLTTPFVASP 275
Query: 274 RR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
+ S+ YY+NL I +G + IPP A G IIDSGT T LV AY VR
Sbjct: 276 SKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQVR 335
Query: 331 DVFRRRV------GSNLTVTSLGGFDTCY------SVPIVAPTITLMFSGMNVTLPQDNL 378
V GS+ T G D C+ S P P++T F G ++ LP DN
Sbjct: 336 AAIESLVTLPVADGSDST-----GLDLCFALTSETSTPPSMPSMTFHFDGADMVLPVDNY 390
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+I + + CLAM + ++ N QQQN +LYD+ L A C+
Sbjct: 391 MILGSG--VWCLAMR---NQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKCS 440
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 126/415 (30%), Positives = 200/415 (48%), Gaps = 45/415 (10%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAV---------ARKSVVPIASGRQITQSPTYIVR 100
K W + + + L D R++ L S A S +P++SG ++ Q+ YIV
Sbjct: 12 KSTDWNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGVRL-QTLNYIVT 70
Query: 101 AKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV 157
+IG + + + +DT +D WV PC C +FN + S +++ + C ++ C+ +
Sbjct: 71 VEIG--GRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSL 128
Query: 158 PNPTCGGGACAFN-------LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
T G C N + YG S +L + ++L T V + FGC + G
Sbjct: 129 QYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIFGCGRNNKGLF 188
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IK 265
GL+GLG+ LSL++QT +++ FSYCLP+ A SGSL LG + I
Sbjct: 189 GGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPT-TAADASGSLILGGNSSVYKNTTPIS 247
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
YT ++ NP+ + Y++NL I +G ALQ +G +IDSGTV TRL P
Sbjct: 248 YTRMIANPQLPTFYFLNLTGISIGGV-------ALQAPNYRQSGILIDSGTVITRLPPPV 300
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIH 381
Y ++ F ++ + DTC+++ + PTI + F G N L D I
Sbjct: 301 YRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEG-NAELTVDVTGIF 359
Query: 382 ---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
T S CLA+A+ + + + +I N QQ+N R++Y+ S+LG A E C+
Sbjct: 360 YFVKTDASQVCLALASL--SFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACS 412
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 104/352 (29%), Positives = 164/352 (46%), Gaps = 17/352 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
S Y+++ +GTP Q +DT +D WV C C C +F S+++ N C
Sbjct: 5 SGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCT 64
Query: 151 AAQCKQVPNPTCG-GGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
+ C +P PTC C ++ +YG S + + +T++L + FGC G
Sbjct: 65 DSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTLARIGFGCGHNQEGT 124
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
GL+GLG+G LSL +Q + + FSYCL + G + R +TP
Sbjct: 125 FAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRASFTP 184
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
LL+N S YYV + +I VG R V PP A + + G I+DSGT T A+
Sbjct: 185 LLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRLAAFIP 244
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLLIH- 381
+ RR++ + G + CY + V+ P++T+ + ++ +P NL +
Sbjct: 245 ILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVDFEIPVSNLWVLV 304
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
G C AM+ + ++I N+QQQN+ I+ DV NSR+G C+
Sbjct: 305 DNFGETVCTAMSTSDQ-----FSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 135/440 (30%), Positives = 204/440 (46%), Gaps = 55/440 (12%)
Query: 23 PICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV 82
P+ +T+ + H PCSP P + E ++ E+L +DQ R +++ A+ SV
Sbjct: 44 PVTPPSSSGTTVPLSHRHGPCSP----APSTVEPTMAELLRRDQLRAKYIQ----AKLSV 95
Query: 83 VPIASGRQITQS-----PT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
+ + QS PT Y++ IGTPA T + +DT +D +WV C
Sbjct: 96 NSGSGTDGVQQSAAITLPTTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCHA 155
Query: 128 CVGCSSTV-FNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANL 182
G S++ F+ +S+T+ C +A C ++ G C + + YG S
Sbjct: 156 RAGAGSSLFFDPGKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTY 215
Query: 183 SQDTISL-ATDIVPGYTFGCIQKAT-GNSVPP---QGLLGLGRGSLSLLAQTQNLYQSTF 237
DT++L +T+ V + FGC + + G + GL+GLG G+ SL++QT Y S F
Sbjct: 216 GSDTLALNSTEKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAF 275
Query: 238 SYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
SYCLP+ SG L LG TP+ ++ R + Y+V L I VG V I P
Sbjct: 276 SYCLPA--TTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISP 333
Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP- 356
AG+I+DSGT+ TRL AY+A+ FR + + DTC+
Sbjct: 334 TVFA------AGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTG 387
Query: 357 ---IVAPTITLMFSGMNVT-LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
+ P + L+FSG V L D ++ S CLA A A + S +I N+QQ+
Sbjct: 388 QDNVSIPAVELVFSGGAVVDLDADGIMYGS------CLAFAPATGGIGS---IIGNVQQR 438
Query: 413 NHRILYDVPNSRLGVARELC 432
+L+DV S LG C
Sbjct: 439 TFEVLHDVGQSVLGFRPGAC 458
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 139/486 (28%), Positives = 224/486 (46%), Gaps = 73/486 (15%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQDHS----STLQVFHVFSPCSPFKPS----------- 49
L FFL+F+FL+ + N C+ + LQ H F P
Sbjct: 9 LPFFLSFVFLYFIIA--NGGCELEQKKMFKVQMLQRNHQFGSKGCILPESRKEKGAIVLE 66
Query: 50 ---------KPLSWEESVLEMLAKDQARLQFLSSLAVARKS-----------VVPIASGR 89
+ ++W + + L D R++ + + A+ S +P+ASG
Sbjct: 67 MKDRGYCSERKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSSEIQIPLASGI 126
Query: 90 QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKN 146
+ ++ YIV IG Q + + +DT +D WV C C+ C S VFN + S+++ +
Sbjct: 127 NL-ETLNYIV--TIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNS 183
Query: 147 LGCQAAQCKQVPNPTCGGGACAFN--------LTYGSSTIA-ANLSQDTISLATDIVPGY 197
L C ++ C+ + T AC N ++YG + L + +S V +
Sbjct: 184 LLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGISVSNF 243
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
FGC + G G++GLGR +LS+++QT + FSYCLP+ + + SGSL +G
Sbjct: 244 VFGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGA-SGSLVIGN 302
Query: 258 IGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
+ I YT ++ NP+ S+ Y +NL I VG A+Q G +ID
Sbjct: 303 ESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGV-------AIQDTSFGNGGILID 355
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGM 369
SGTV TRL Y A++ F ++ +L DTC+++ + PT+++ F
Sbjct: 356 SGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFEN- 414
Query: 370 NVTLPQD--NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
NV L D +L GS CLA+A+ D + + +I N QQ+N R++YD S++G
Sbjct: 415 NVDLNVDAVGILYMPKDGSQVCLALASLSDEND--MAIIGNYQQRNQRVIYDAKQSKIGF 472
Query: 428 ARELCT 433
ARE C+
Sbjct: 473 AREDCS 478
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 127/376 (33%), Positives = 184/376 (48%), Gaps = 26/376 (6%)
Query: 71 FLSSLAVARKSVVPIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
F +SLA A + P+ SG + Q S Y R IG+PA+ L M +DT +D WV C C
Sbjct: 146 FGASLAAAIQG--PVVSG--VGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCA 201
Query: 130 GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLS 183
C S VF+ + S ++ + C + +C+ + C GAC + + YG S + +
Sbjct: 202 DCYQQSDPVFDPSLSASYAAVSCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFA 261
Query: 184 QDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
+T++L V GC G V GLL LG G LS +Q + STFSYCL
Sbjct: 262 TETLTLGDSTPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISASTFSYCLV 318
Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
+ + S +L+ G G PL+++PR + YYV L I VG + + IP A
Sbjct: 319 DRDSPAAS-TLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAM 377
Query: 303 NPTTGAG-TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PI 357
+ T+G+G I+DSGT TRL + AY A+RD F R S + + FDTCY + +
Sbjct: 378 DATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSV 437
Query: 358 VAPTITLMFSGMN-VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
P ++L F G + LP N LI CLA A N+ +++I N+QQQ R+
Sbjct: 438 EVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRV 493
Query: 417 LYDVPNSRLGVARELC 432
+D +G C
Sbjct: 494 SFDTAKGVVGFTPNKC 509
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 126/413 (30%), Positives = 189/413 (45%), Gaps = 52/413 (12%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP------------------T 96
+E L +D R++ +++LA +P GR +T +P
Sbjct: 89 QELFSSRLQRDSRRVRSIATLAAQ----IP---GRNVTHAPRPGGFSSSVVSGLSQGSGE 141
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ---------STTFKNL 147
Y R +GTPA+ + M +DT +D W+ C C +Q S T+ +
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR------CYSQSDPIFDPRKSKTYATI 195
Query: 148 GCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQK 204
C + C+++ + C C + ++YG + + S +T++ + V G GC
Sbjct: 196 PCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHD 255
Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
G V GLLGLG+G LS QT + + FSYCL A S S+ G +
Sbjct: 256 NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA 315
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
++TPLL NP+ + YYV LL I V G RV + + + G IIDSGT TRL+
Sbjct: 316 RFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIR 375
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLL 379
PAY A+RD FR + + FDTC+ + + PT+ L F +V+LP N L
Sbjct: 376 PAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFRRADVSLPATNYL 435
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
I C A A L++I N+QQQ R++YD+ +SR+G A C
Sbjct: 436 IPVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 115/350 (32%), Positives = 164/350 (46%), Gaps = 29/350 (8%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
IGTPA +DT +D W C CV C S+ VF+ + S+T+ + C +A C +P
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232
Query: 160 PTC-GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLL 216
C C + TYG SS+ L+ +T +LA +PG FGC G+ GL+
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLV 292
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG-------QPKRIKYTPL 269
GLGRG LSL++Q L FSYCL S + S L LG + ++ TPL
Sbjct: 293 GLGRGPLSLVSQ---LGLDKFSYCLTSLDDTNNS-PLLLGSLAGISEASAAASSVQTTPL 348
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
+KNP + S YYV+L AI VG + +P A G I+DSGT T L Y A+
Sbjct: 349 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 408
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVP------IVAPTITLMFS-GMNVTLPQDNLLIHS 382
+ F ++ S G D C+ P + P + F G ++ LP +N ++
Sbjct: 409 KKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLD 468
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CL + + L++I N QQQN + +YDV + L A C
Sbjct: 469 GGSGALCLTVMG-----SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 513
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 185/409 (45%), Gaps = 39/409 (9%)
Query: 53 SWEESVLEMLAKDQARLQFLSS-LAVARKSV-------------------VPIASGRQIT 92
+++ VL LA+D AR+ L++ L +A S+ P++SG
Sbjct: 94 NYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPTETELLRPEDLSTPVSSG-TAQ 152
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGC 149
S Y R +G P++ M +DT +D W+ PC+ C S +F+ S+++ L C
Sbjct: 153 GSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTC 212
Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATGN 208
A QC+ + C G C + ++YG + +T+S V GC G
Sbjct: 213 DAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSFGAGSVNRVAIGCGHDNEGL 272
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
V G GL L+ T + ++FSYCL + S +L +P P
Sbjct: 273 FV---GSAGLLGLGGGPLSLTSQIKATSFSYCLVD-RDSGKSSTLEFNS-PRPGDSVVAP 327
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
LLKN + ++ YYV L + VG +V +PP + + G I+DSGT TRL AY +
Sbjct: 328 LLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYNS 387
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNV-TLPQDNLLIHST 383
VRD F+R+ + + FDTCY + + PT++ FSG LP N LI
Sbjct: 388 VRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWALPAKNYLIPVD 447
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A S +++I N+QQQ R+ +D+ NS +G + C
Sbjct: 448 GAGTYCFAFAP----TTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 123/390 (31%), Positives = 189/390 (48%), Gaps = 32/390 (8%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
L+K+ R + L + +P SG I S Y+V +GTP + L + DT +D
Sbjct: 15 LSKNLGRENTVKDL---DSTTLPAESGSLI-GSANYVVVVGLGTPKRDLSLVFDTGSDLT 70
Query: 122 WVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNP-------TCGGGACAFN 170
W C C G +F+ ++S+++ N+ C ++ C Q+ + + +C ++
Sbjct: 71 WTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDASCIYD 130
Query: 171 LTYG-SSTIAANLSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
YG +ST LSQ+ +++ ATDIV + FGC Q G GL+GLGR +S++ Q
Sbjct: 131 AKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFNGSAGLMGLGRHPISIVQQ 190
Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIR 287
T + Y FSYCLP+ S G L G + YTPL +S Y +++++I
Sbjct: 191 TSSNYNKIFSYCLPATS--SSLGHLTFGASAATNASLIYTPLSTISGDNSFYGLDIVSIS 248
Query: 288 VGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG 347
VG +P A+ + + G+IIDSGTV TRL Y A+R FRR + G
Sbjct: 249 VGG--TKLP--AVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAG 304
Query: 348 GFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV 402
DTCY + I P I FS G+ V L +L + + CLA AA + ++
Sbjct: 305 LLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQV-CLAFAA--NGSDND 361
Query: 403 LNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ V N+QQ+ ++YDV R+G C
Sbjct: 362 ITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 120/369 (32%), Positives = 188/369 (50%), Gaps = 29/369 (7%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGCSST---VFNS 138
+P+ G I S Y V+ +GTP + M +DT + +W+ C C V C + +++
Sbjct: 112 IPLNPGLSIG-SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDP 170
Query: 139 AQSTTFKNLGCQAAQCKQVP-----NPTC--GGGACAFNLTYGSSTIA-ANLSQDTISL- 189
+ S T+K L C + +C ++ +P C AC + +YG ++ + LSQD ++L
Sbjct: 171 SVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLT 230
Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
++ +P +T+GC Q G G++GL R LS+LAQ Y FSYCLP+ + S
Sbjct: 231 SSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSS 290
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
G P K+TP+L + + SLY++ L AI V R +D+ A+ P
Sbjct: 291 GGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDL-AAAMYRVP----- 344
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCYSVPI----VAPTITL 364
T+IDSGTV TRL Y A+R F + + + + DTC+ + P I +
Sbjct: 345 TLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKM 404
Query: 365 MF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
+F G ++TL ++LI + G ITCLA A + + + + +I N QQQ + I YDV S
Sbjct: 405 IFQGGADLTLRAPSILIEADKG-ITCLAFAGS--SGTNQIAIIGNRQQQTYNIAYDVSTS 461
Query: 424 RLGVARELC 432
R+G A C
Sbjct: 462 RIGFAPGSC 470
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 148/470 (31%), Positives = 217/470 (46%), Gaps = 58/470 (12%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQDHSST-LQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
L +FL F +L+ L Q L ++HV S + P S+ + M+
Sbjct: 3 LFWFLVFSAHLALASSLVEFQGMQKQEGMQLNLYHVKGLDSSQTSTSPFSFSD----MIT 58
Query: 64 KDQARLQFLSSLAVARKSV----------------VPIASGRQITQSPTYIVRAKIGTPA 107
KD+ R++FL S ++S P+ SG I S Y V+ +GTPA
Sbjct: 59 KDEERVRFLHSRLTNKESASNSATTDKLGGPSLVSTPLKSGLSI-GSGNYYVKIGVGTPA 117
Query: 108 QTLLMAMDTSNDAAWVPCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCK-------Q 156
+ M +DT + +W+ C CV C V F + S T+K L C ++QC
Sbjct: 118 KYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSSQCSSLKSSTLN 177
Query: 157 VPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVP--GYTFGCIQKATGNSVPPQ 213
P + GAC + +YG ++ + LSQD ++L P G+ +GC Q G
Sbjct: 178 APGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSAAPSSGFVYGCGQDNQGLFGRSA 237
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKAL---SFSGSLRLGPIGQPKR-IKYTP 268
G++GL LS+L Q N Y + FSYCLP SF A S SG L +G K+TP
Sbjct: 238 GIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVSGFLSIGASSLSSSPYKFTP 297
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L+KNP+ SLY++ L I V + + + A +N TIIDSGTV TRL Y A
Sbjct: 298 LVKNPKIPSLYFLGLTTITVAGKPLGV--SASSYN----VPTIIDSGTVITRLPVAIYNA 351
Query: 329 VRDVFRRRVGSNLT-VTSLGGFDTCYSVPI----VAPTITLMF-SGMNVTLPQDNLLIHS 382
++ F + DTC+ + P I ++F G + L N L+
Sbjct: 352 LKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEI 411
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
G+ TCLA+AA+ + + ++I N QQQ + YDV NS++G A C
Sbjct: 412 EKGT-TCLAIAASSNPI----SIIGNYQQQTFTVAYDVANSKIGFAPGGC 456
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 137/424 (32%), Positives = 206/424 (48%), Gaps = 51/424 (12%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVARKS----VVPIASGRQ-----ITQSPT---YIVRAKI 103
E + L +D+ R ++ S A A + VV +++GR ++++PT YI + +
Sbjct: 88 ELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLVAPVVSRAPTSGDYIAKIAV 147
Query: 104 GTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP 160
GTPA L+A+DT++D W+ C C C S VF+ ST++ + A C+ +
Sbjct: 148 GTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRS 207
Query: 161 TCGG---GACAFNLTYG-------SSTIAANLSQDTISLATDIVPGY-TFGCIQKATG-N 208
G G C + + YG +ST +L ++T++ A + Y + GC G
Sbjct: 208 GGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNKGLF 267
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNL-YQSTFSYCLPSFKALSFSGSLRL----GPIGQPKR 263
P G+LGL RG +S+ Q L Y ++FSYCL F + S S L G +
Sbjct: 268 GAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPP 327
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGR-RVVDIPPGALQFNPTTG-AGTIIDSGTVFTRL 321
+TP + N + YYV L+ + VG RV + LQ +P TG G I+DSGT TRL
Sbjct: 328 ASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHGGVILDSGTTVTRL 387
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGG----FDTCYSVP--------IVAPTITLMFS-G 368
PAYTA RD FR + L S GG FDTCY+V + P +++ F+ G
Sbjct: 388 ARPAYTAFRDAFRAAA-TGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGG 446
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ ++L N LI + C A A D ++VI N+ QQ R++YD+ R+G A
Sbjct: 447 VELSLQPKNYLITVDSRGTVCFAFAGTGDR---SVSVIGNILQQGFRVVYDIGGQRVGFA 503
Query: 429 RELC 432
C
Sbjct: 504 PNSC 507
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 120/383 (31%), Positives = 178/383 (46%), Gaps = 38/383 (9%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS----STVFNSA 139
P+ SG + S Y V +IGTP QTLL+ DT +D WV C+ C CS + F +
Sbjct: 74 PVISGAS-SGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFAR 132
Query: 140 QSTTFKNLGCQAAQCKQVPNP-------TCGGGACAFNLTYG-SSTIAANLSQDTISLAT 191
STT+ + C + QC+ VP+P T C + TY SST S++ ++L T
Sbjct: 133 HSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNT 192
Query: 192 DI-----VPGYTFGCIQKATGNSVP------PQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
+ G +FGC + +G S+ QG++GLGR +S +Q + S FSYC
Sbjct: 193 STGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYC 252
Query: 241 L-------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
L P L+ G+ + + + + +TPLL NP + YY+ + + V +
Sbjct: 253 LMDYTLSPPPTSFLTIGGAQNVA-VSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKL 311
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
I P + GTIIDSGT T + PAYT + F++RV GFD C
Sbjct: 312 PINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCM 371
Query: 354 SVPIVA----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
+V V P ++ +G +V P T I CLA+ P + + +V+ N+
Sbjct: 372 NVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAV--QPVSQDGGFSVLGNL 429
Query: 410 QQQNHRILYDVPNSRLGVARELC 432
QQ + +D SRLG R C
Sbjct: 430 MQQGFLLEFDRDKSRLGFTRRGC 452
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 110/351 (31%), Positives = 165/351 (47%), Gaps = 28/351 (7%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST----VFNSAQSTTFKNLGCQAAQCKQ-- 156
+GTP Q + +D +D W C+ VG ++ VF++A+S++F L C + C+
Sbjct: 113 VGTPPQPSKVILDLGSDLLWTQCS-LVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAGT 171
Query: 157 VPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSVPPQG 214
N TC CA+ YG T L+ +T + + TFGC + A G G
Sbjct: 172 FTNKTCTDRKCAYENDYGIMTATGVLATETFTFGAHHGVSANLTFGCGKLANGTIAEASG 231
Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-----KALSFSGSLRLGPIGQPKRIKYTPL 269
+LGL G LS+L Q L + FSYCL F + F LG +++ PL
Sbjct: 232 ILGLSPGPLSMLKQ---LAITKFSYCLTPFADRKTSPVMFGAMADLGKYKTTGKVQTIPL 288
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
LKNP YYV ++ + VG + +D+P L P GT++DS T LV PA+T +
Sbjct: 289 LKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTEL 348
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITLMFSG-MNVTLPQDNLLIH 381
+ + + S+ + C+ +P + P + L F G ++LP+DN
Sbjct: 349 KKAVMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQE 408
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ G + CLA+ AP NVI N+QQQN +LYDV N + A C
Sbjct: 409 PSPG-MMCLAVMQAP--FEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKC 456
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 126/403 (31%), Positives = 191/403 (47%), Gaps = 41/403 (10%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAK 102
S V+ ++A+D AR++ L VA S VVP S Y VR
Sbjct: 80 SRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVD----DGSGEYFVRVG 135
Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
+G+P + +D+ +D WV PC C + +F+ A S++F + C +A C+ +
Sbjct: 136 VGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSG 195
Query: 160 PTCGGGA----CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQG 214
CGGG C +++TYG S L+ +T++L V G GC + +G V G
Sbjct: 196 TGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAG 255
Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
LLGLG G++SL+ Q FSYCL S + +GSL LG R + P + R
Sbjct: 256 LLGLGWGAMSLVGQLGGAAGGVFSYCLAS-RGAGGAGSLVLG------RTEAVP--RGRR 306
Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
SS YYV L I VG + + Q G ++D+GT TRL AY A+R F
Sbjct: 307 ASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFD 366
Query: 335 RRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITC 389
+G+ ++ DTCY + + PT++ F G +TLP NLL+ G++ C
Sbjct: 367 GAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVE-VGGAVFC 425
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LA A + +S ++++ N+QQ+ +I D N +G C
Sbjct: 426 LAFAPS----SSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 171/348 (49%), Gaps = 14/348 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y VR +G+P + M +D+ +D WV PC C S VF+ A+S ++ + C
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
++ C ++ N C G C + + YG S L+ +T++ A +V GC + G
Sbjct: 188 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 247
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
+ GLLG+G GS+S + Q F YCL S + +GSL G P + PL
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 306
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
++NPR S YYV L + VG + +P G T G ++D+GT TRL AY A
Sbjct: 307 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAF 366
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFS-GMNVTLPQDNLLIHSTA 384
RD F+ + + + + FDTCY V + PT++ F+ G +TLP N L+
Sbjct: 367 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 426
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A AA+P + L++I N+QQ+ ++ +D N +G +C
Sbjct: 427 SGTYCFAFAASP----TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 118/365 (32%), Positives = 171/365 (46%), Gaps = 36/365 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQAA 152
Y++ IGTP DT +D W C T C + ++N A STTF L C ++
Sbjct: 114 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 173
Query: 153 --QCKQVPNPTCGGGACA--FNLTYGSSTIAANLSQDTISLATDI-----VPGYTFGCIQ 203
C CA + TYG+ A +T + + VPG FGC
Sbjct: 174 LSMCAGALAGAAPPPGCACMYYQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSN 233
Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QP 261
++ + GL+GLGRGSLSL++Q L FSYCL F+ + + +L LGP
Sbjct: 234 ASSSDWNGSAGLVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLLGPSAALNG 290
Query: 262 KRIKYTPLLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
++ TP + +P R S+ YY+NL I +G + + I PGA P G IIDSGT
Sbjct: 291 TGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTI 350
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTV---TSLGGFDTCYSV-------PIVAPTITLMFSG 368
T L AY VR + ++ + L + G D C+++ P V P++TL F G
Sbjct: 351 TSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDG 410
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
++ LP D+ +I + + CLAM + + ++ N QQQN ILYDV L A
Sbjct: 411 ADMVLPADSYMISGSG--VWCLAMR---NQTDGAMSTFGNYQQQNMHILYDVREETLSFA 465
Query: 429 RELCT 433
C+
Sbjct: 466 PAKCS 470
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 140/455 (30%), Positives = 211/455 (46%), Gaps = 56/455 (12%)
Query: 13 FLFSLSEGLNP--ICDTQD-----HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
++ S L P +C Q + +TL + H PCSP + S EE+ L +D
Sbjct: 33 YMVVASSSLEPSEVCSGQKVTSSKNGATLPLVHRHGPCSPVMSKEKPSHEET----LGRD 88
Query: 66 QARL-QFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
Q R + L+ R S +P +SG + +P Y++ +GTPA T +M++
Sbjct: 89 QLRAANIHAKLSSPRNSSAKELQQSGVTIPTSSGYSL-GTPEYVITVSLGTPAVTQVMSI 147
Query: 115 DTSNDAAWVPCTGCVG--CSS---TVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGAC 167
DT +D +WV C C CSS +F+ A+S T+ C +AQC Q+ C C
Sbjct: 148 DTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSHC 207
Query: 168 AFNLTY-GSSTIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSL 225
+ + Y S DT+ L T D V + FGC +A G GL+GLG + SL
Sbjct: 208 QYIVKYVDHSNTTGTYGSDTLGLTTSDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESL 267
Query: 226 LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG---QPKRIKYTPLLKNPRRSSLYYVN 282
++QT Y FSYCLP + S G L LG R TPL++ + Y V
Sbjct: 268 VSQTAATYGKAFSYCLPP-SSSSAGGFLTLGAAAGGTSSSRYSRTPLVRF-NVPTFYGVF 325
Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
L AI V +++P +GA +++DSGTV T+L AY A+R F++ + + +
Sbjct: 326 LQAITVAGTKLNVPASVF-----SGA-SVVDSGTVITQLPPTAYQALRTAFKKEMKAYPS 379
Query: 343 VTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
+G DTC+ + P +TL FS G + L + AG CLA A
Sbjct: 380 AAPVGILDTCFDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIFY---AG---CLAFTATAQ 433
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ ++ ++ N+QQ+ +L+DV S LG C
Sbjct: 434 DGDT--GILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 171/348 (49%), Gaps = 14/348 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y VR +G+P + M +D+ +D WV PC C S VF+ A+S ++ + C
Sbjct: 129 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 188
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
++ C ++ N C G C + + YG S L+ +T++ A +V GC + G
Sbjct: 189 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 248
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
+ GLLG+G GS+S + Q F YCL S + +GSL G P + PL
Sbjct: 249 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 307
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
++NPR S YYV L + VG + +P G T G ++D+GT TRL AY A
Sbjct: 308 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTGAYAAF 367
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFS-GMNVTLPQDNLLIHSTA 384
RD F+ + + + + FDTCY V + PT++ F+ G +TLP N L+
Sbjct: 368 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 427
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A AA+P + L++I N+QQ+ ++ +D N +G +C
Sbjct: 428 SGTYCFAFAASP----TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 471
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 134/409 (32%), Positives = 194/409 (47%), Gaps = 47/409 (11%)
Query: 57 SVLEMLAKDQARLQFLSSLAVARKSVVPI--ASGRQITQSPT---YIVRAKIGTPAQT-- 109
S ++LA+ R + + K+ P +G +T +PT YI + +GTP +
Sbjct: 81 SAADLLARRLQR-DMRRAAWIITKAATPADPENGTVVTGAPTSGEYIAKITVGTPYENDS 139
Query: 110 ---LLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG 163
L++ D +D W+ C C C V+N +S++ ++GC A C+ + + G
Sbjct: 140 SFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVGCYAPACRALG--SSG 197
Query: 164 G-----GACAFNLTYGS-STIAANLSQDTISLATDI-VPGYTFGCIQKATG-NSVPPQGL 215
G C + + YG S+ A + +T++ + VPG GC G P G+
Sbjct: 198 GCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPPGVRVPGVAIGCGSDNQGLFPAPAAGI 257
Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP-----LL 270
LGLGRGSLS +Q Y +FSYCL S +L G TP +L
Sbjct: 258 LGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSSTLTFGSGASATTTTTTPPSFTPML 317
Query: 271 KNPRRSSLYYVNLLAIRVGR-RVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTA 328
N R + YYV L+ I VG RV + L+ +P+TG G I+DSGT TRL PAY A
Sbjct: 318 TNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAA 377
Query: 329 VRDVFRRRVGSNLTVTSLGG----FDTCYS-----VPIVAPTITLMFS-GMNVTLPQDNL 378
RD FR L S GG FDTCYS V P +++ F+ G+ V LP N
Sbjct: 378 FRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNY 437
Query: 379 LI--HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
LI S G++ C A A + D +++I N+Q Q R++YDV R+
Sbjct: 438 LIPVDSNKGTM-CFAFAGSGDR---GVSIIGNIQLQGFRVVYDVDGQRV 482
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 121/392 (30%), Positives = 193/392 (49%), Gaps = 29/392 (7%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
K L+ E + + + + RLQ L ++A+ S I + + + ++++ IGTP +T
Sbjct: 51 KNLTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEA-PVLPGNGEFLMKLAIGTPPET 109
Query: 110 LLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA 166
+DT +D W PCT C S+ +F+ +S++F L C + C+ +P +C G
Sbjct: 110 YSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCNNG- 168
Query: 167 CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLS 224
C + +YG S+ L+ +T++ VP FGC G+ GL+GLGRG LS
Sbjct: 169 CEYLYSYGDYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLS 228
Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ----PKRIKYTPLLKNPRRSSLYY 280
L++Q L + FSYCL + S +L +G + IK TPL+ +P S YY
Sbjct: 229 LVSQ---LKEPKFSYCLTTVDDTKTS-TLLMGSLASVNASSSAIKTTPLIHSPAHPSFYY 284
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
++L I VG + I G IIDSGT T L A+ V F ++ N
Sbjct: 285 LSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI--N 342
Query: 341 LTVTSLG--GFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
L V S G G D C+++P I P + F G ++ LP +N +I ++ + CLAM
Sbjct: 343 LPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDGADLELPAENYMIGDSSMGVACLAMG 402
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
++ S +++ N+QQQN +L+D+ L
Sbjct: 403 SS-----SGMSIFGNVQQQNMLVLHDLEKETL 429
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 174/366 (47%), Gaps = 23/366 (6%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG + S Y +R +GTP + + + MDT +D W+ C CV C VF+ +
Sbjct: 25 PVISGLSL-GSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYK 83
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYT- 198
S+T+ LGC + QC + C G C + + YG + + + D +SL + G
Sbjct: 84 SSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVV 143
Query: 199 -----FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-GS 252
GC G V GLLGLG+G LS Q + FSYCL S S
Sbjct: 144 LNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSS 203
Query: 253 LRLGPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
L G P +++TP N R S+ YY+ + I VG ++ IP A Q + G I
Sbjct: 204 LIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVI 263
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF- 366
IDSGT TRL AY ++R+ FR + T FDTCY++ + PT+TL F
Sbjct: 264 IDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQ 323
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G ++ LP N L+ S CLA A + ++I N+QQQ R++YD ++++G
Sbjct: 324 GGADLKLPASNYLVPVDNSSTFCLAFAGT-----TGPSIIGNIQQQGFRVIYDNLHNQVG 378
Query: 427 VARELC 432
C
Sbjct: 379 FVPSQC 384
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 124/417 (29%), Positives = 203/417 (48%), Gaps = 45/417 (10%)
Query: 49 SKPLSWEESVLEMLAKDQARLQFLSSL---------AVARKSVVPIASGRQITQSPTYIV 99
K + W + + L D R++ + + A ++ +P++SG + Q+ YIV
Sbjct: 9 EKKIDWNRRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINL-QTLNYIV 67
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQ 156
+G+ T+++ DT +D WV C C+ C + +F + S++++++ C ++ C+
Sbjct: 68 TMGLGSKNMTVII--DTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 157 VPNPTCGGGACA--------FNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG 207
+ T GAC + + YG S L + +S V + FGC + G
Sbjct: 126 LQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVSVSDFVFGCGRNNKG 185
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP----IGQPKR 263
GL+GLGR LSL++QT + FSYCLP+ +A S SGSL +G
Sbjct: 186 LFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGS-SGSLVMGNESSVFKNANP 244
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
I YT +L NP+ S+ Y +NL I VG + P L F G +IDSGTV TRL +
Sbjct: 245 ITYTRMLSNPQLSNFYILNLTGIDVGGVALKAP---LSFG---NGGILIDSGTVITRLPS 298
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG---MNVTLPQD 376
Y A++ F ++ + DTC+++ + PTI+L F G +NV
Sbjct: 299 SVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEGNAQLNVDATGT 358
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
++ A + CLA+A+ D ++ +I N QQ+N R++YD S++G A E C+
Sbjct: 359 FYVVKEDASQV-CLALASLSDAYDTA--IIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 129/395 (32%), Positives = 186/395 (47%), Gaps = 49/395 (12%)
Query: 78 ARKSVVPIASGRQIT----QSPT---YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTG 127
ARK + +SG ++ SPT Y++ IGTP DT +D W PCT
Sbjct: 6 ARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS 65
Query: 128 -CVGCSSTVFNSAQSTTFKNLGCQAA---------QCKQVPNPTCGGGACAFNLTYGSST 177
C + ++N + STTF L C ++ P P C AC +N+TYGS
Sbjct: 66 QCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSGW 122
Query: 178 IAANLSQDTISLATDI-----VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQN 231
+ +T + + VPG FGC ++G N+ GL+GLGRG LSL++Q
Sbjct: 123 TSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ--- 179
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS---SLYYVNLLA 285
L FSYCL ++ + + +L LGP + + TP + +P + + YY+NL
Sbjct: 180 LGVPKFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTG 239
Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
I +G + IPP A N G IIDSGT T L AY VR V T S
Sbjct: 240 ISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGS 299
Query: 346 LG-GFDTCY------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
G D C+ S P P++TL F+G ++ LP D+ ++ +G + CLAM +
Sbjct: 300 ADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSG-LWCLAMQ---NQ 355
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ +N++ N QQQN ILYD+ L A C+
Sbjct: 356 TDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 390
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/354 (33%), Positives = 167/354 (47%), Gaps = 22/354 (6%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
S Y VR IG+P + + MDT +D W+ C+ C C + VF+ S++F+ L C
Sbjct: 11 SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCS 70
Query: 151 AAQCKQVPNPTCGG--GACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATG 207
QCK + C C + ++YG + +L+ D+ S++ FGC G
Sbjct: 71 TPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTSPVVFGCGHDNEG 130
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIGQP--KRI 264
V GLLGLG G LS +Q L FSYCL S + S +L G P
Sbjct: 131 LFVGAAGLLGLGAGKLSFPSQ---LSSRKFSYCLVSRDNGVRASSALLFGDSALPTSASF 187
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVA 323
YT LLKNP+ + YY L I +G ++ IP A + + +TG G IIDSGT TRL
Sbjct: 188 AYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPT 247
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFS-GMNVTLPQDNL 378
AYT +RD FR FDTCY + PT++ F G +V LP N
Sbjct: 248 YAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNY 307
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L+ C A + + L++I N+QQQ R+ D+ +SR+G A C
Sbjct: 308 LVPVDTSGTFCFAFSKTSLD----LSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 138/435 (31%), Positives = 205/435 (47%), Gaps = 59/435 (13%)
Query: 21 LNPICDTQDH--SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------ 72
L+P+ + + S+ L++ H PC+P + S + SV + L DQ R +++
Sbjct: 52 LDPVAQRRRNGTSAVLRLTHKHGPCAPSRASSLAT--PSVADTLRADQRRAEYILRRVSG 109
Query: 73 -------SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
S A A + VP G I + Y+V +GTP + +DT +D +WV C
Sbjct: 110 RGTPQLWDSKAEAATATVPANWGFNI-GTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQC 168
Query: 126 TGCVG--CSST---VFNSAQSTTFKNLGCQAAQCKQ--VPNPTCGGGACAFNLTYGSSTI 178
T C C S +F+ AQS+++ + C C + +C C + ++YG +
Sbjct: 169 TPCAAPACYSQKDPLFDPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGDGSK 228
Query: 179 AANL-SQDTISLA-TDIVPGYTFGCIQKA---TGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
+ S DT++L+ D V G+ FGC TGN GLLGLGR SL+ QT Y
Sbjct: 229 TTGVYSSDTLTLSPNDAVRGFFFGCGHAQSGFTGN----DGLLGLGREEASLVEQTAGTY 284
Query: 234 QSTFSYCLPSFKALSFSGSLRL-GPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
FSYCLP+ S +G L L GP G P T LL +P ++ Y V L I VG +
Sbjct: 285 GGVFSYCLPTRP--STTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQ 342
Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGF 349
+ +P GT++D+GTV TRL AY A+R FR + S + + G
Sbjct: 343 QLSVPSSVFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGIL 396
Query: 350 DTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
DTCY+ + P + L FS G VTL D +L S CLA AP + +
Sbjct: 397 DTCYNFSGYGTVTLPNVALTFSGGATVTLGADGIL------SFGCLAF--APSGSDGGMA 448
Query: 405 VIANMQQQNHRILYD 419
++ N+QQ++ + D
Sbjct: 449 ILGNVQQRSFEVRID 463
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/378 (30%), Positives = 188/378 (49%), Gaps = 37/378 (9%)
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFN 137
S +PI+SG ++ Q+ YIV IG TL++ DT +D WV PC C +FN
Sbjct: 130 SQIPISSGARL-QTLNYIVTVGIGGQNSTLIV--DTGSDLTWVQCLPCRLCYNQQEPLFN 186
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGG---------ACAFNLTYGSSTIA-ANLSQDTI 187
+ S++F +L C + C + PT G +C + + YG + + L + +
Sbjct: 187 PSNSSSFLSLPCNSPTCVAL-QPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKL 245
Query: 188 SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
+L + + FGC + G GL+GL R LSL++QT +L+ S FSYCLP+ +
Sbjct: 246 TLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPT-TGV 304
Query: 248 SFSGSLRLG-----PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
SGSL LG I YT +++NP+ S+ Y++NL I +G +++P +
Sbjct: 305 GSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP----RL 360
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIV 358
+ G +++DSGTV TRL Y A + F ++ T +TC+++ +
Sbjct: 361 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 420
Query: 359 APTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
PT+ +F G M V + + S A I CLA A+ ++ +I N QQ+N R
Sbjct: 421 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTM--IIGNYQQKNQR 477
Query: 416 ILYDVPNSRLGVARELCT 433
++Y+ S++G A E C+
Sbjct: 478 VIYNSKESKVGFAGEPCS 495
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 129/395 (32%), Positives = 186/395 (47%), Gaps = 49/395 (12%)
Query: 78 ARKSVVPIASGRQIT----QSPT---YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTG 127
ARK + +SG ++ SPT Y++ IGTP DT +D W PCT
Sbjct: 66 ARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS 125
Query: 128 -CVGCSSTVFNSAQSTTFKNLGCQAA---------QCKQVPNPTCGGGACAFNLTYGSST 177
C + ++N + STTF L C ++ P P C AC +N+TYGS
Sbjct: 126 QCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSGW 182
Query: 178 IAANLSQDTISLATD-----IVPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQN 231
+ +T + + VPG FGC ++G N+ GL+GLGRG LSL++Q
Sbjct: 183 TSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ--- 239
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS---SLYYVNLLA 285
L FSYCL ++ + + +L LGP + + TP + +P + + YY+NL
Sbjct: 240 LGVPKFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTG 299
Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
I +G + IPP A N G IIDSGT T L AY VR V T S
Sbjct: 300 ISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGS 359
Query: 346 LG-GFDTCY------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
G D C+ S P P++TL F+G ++ LP D+ ++ +G + CLAM +
Sbjct: 360 ADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSG-LWCLAMQ---NQ 415
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ +N++ N QQQN ILYD+ L A C+
Sbjct: 416 TDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 450
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 170/351 (48%), Gaps = 32/351 (9%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S +++ IG PA +DT +D W PCT C + +F+ +S+++ +GC
Sbjct: 105 SGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCS 164
Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKAT 206
+ C +P C +C + TYG S+ L+ +T + + + G FGC +
Sbjct: 165 SGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENE 224
Query: 207 GNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG------ 259
G+ GL+GLGRG LSL++Q L ++ FSYCL S + S SL +G +
Sbjct: 225 GDGFSQGSGLVGLGRGPLSLISQ---LKETKFSYCLTSIEDSEASSSLFIGSLASGIVNK 281
Query: 260 -----QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
+ K LL+NP + S YY+ L I VG + + + + + G IIDS
Sbjct: 282 TGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDS 341
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGM 369
GT T L A+ +++ F R+ + + G D C+ +P I P + F G
Sbjct: 342 GTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFKGA 401
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
++ LP +N ++ ++ + CLAM ++ + +++ N+QQQN +L+D+
Sbjct: 402 DLELPGENYMVADSSTGVLCLAMGSS-----NGMSIFGNVQQQNFNVLHDL 447
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 121/392 (30%), Positives = 191/392 (48%), Gaps = 29/392 (7%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
K L+ E + + + + RLQ ++A+ S I + + ++++ IGTP +T
Sbjct: 51 KNLTKFERIQHGVKRGRHRLQRFKAMALVASSNSEI-DAPVLPGNGEFLMKLAIGTPPET 109
Query: 110 LLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA 166
MDT +D W PCT C + +F+ +S++F L C + C+ +P TC G
Sbjct: 110 YSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCSDG- 168
Query: 167 CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLS 224
C + YG S+ L+ +T++ VP FGC + G+ GL+GLGRG LS
Sbjct: 169 CEYLYGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLS 228
Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK----RIKYTPLLKNPRRSSLYY 280
L++Q L + FSYCL S S +L +G + K IK TPL++N + S YY
Sbjct: 229 LVSQ---LKEPKFSYCLTSVDDTKAS-TLLMGSLASVKASDSEIKTTPLIQNSAQPSFYY 284
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
++L I VG + I G IIDSGT T L A+ V F ++ N
Sbjct: 285 LSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQI--N 342
Query: 341 LTVTSLG--GFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
L V + G G + C+++P I P + F G ++ LP +N +I + + CLAM
Sbjct: 343 LPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGADLELPAENYMIADASMGVACLAMG 402
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
++ S +++ N+QQQN +L+D+ L
Sbjct: 403 SS-----SGMSIFGNIQQQNMLVLHDLEKETL 429
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 169/351 (48%), Gaps = 32/351 (9%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S +++ IG PA +DT +D W PCT C + +F+ +S+++ +GC
Sbjct: 104 SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCS 163
Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKAT 206
+ C +P C AC + TYG S+ L+ +T + + + G FGC +
Sbjct: 164 SGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENE 223
Query: 207 GNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG------ 259
G+ GL+GLGRG LSL++Q L ++ FSYCL S + S SL +G +
Sbjct: 224 GDGFSQGSGLVGLGRGPLSLISQ---LKETKFSYCLTSIEDSEASSSLFIGSLASGIVNK 280
Query: 260 -----QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
+ K LL+NP + S YY+ L I VG + + + + G IIDS
Sbjct: 281 TGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDS 340
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGM 369
GT T L A+ +++ F R+ + + G D C+ +P I P + F G
Sbjct: 341 GTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGA 400
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
++ LP +N ++ ++ + CLAM ++ + +++ N+QQQN +L+D+
Sbjct: 401 DLELPGENYMVADSSTGVLCLAMGSS-----NGMSIFGNVQQQNFNVLHDL 446
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 129/395 (32%), Positives = 186/395 (47%), Gaps = 49/395 (12%)
Query: 78 ARKSVVPIASGRQIT----QSPT---YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTG 127
ARK + +SG ++ SPT Y++ IGTP DT +D W PCT
Sbjct: 64 ARKLALAASSGATVSAPTQNSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS 123
Query: 128 -CVGCSSTVFNSAQSTTFKNLGCQAA---------QCKQVPNPTCGGGACAFNLTYGSST 177
C + ++N + STTF L C ++ P P C AC +N+TYGS
Sbjct: 124 QCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSGW 180
Query: 178 IAANLSQDTISLAT-----DIVPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQN 231
+ +T + + VPG FGC ++G N+ GL+GLGRG LSL++Q
Sbjct: 181 TSVFQGSETFTFGSTPAGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ--- 237
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS---SLYYVNLLA 285
L FSYCL ++ + + +L LGP + + TP + +P + + YY+NL
Sbjct: 238 LGVPKFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTG 297
Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
I +G + IPP A N G IIDSGT T L AY VR V T S
Sbjct: 298 ISLGTTALSIPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGS 357
Query: 346 LG-GFDTCY------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
G D C+ S P P++TL F+G ++ LP D+ ++ +G + CLAM +
Sbjct: 358 AATGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSG-LWCLAMQ---NQ 413
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ +N++ N QQQN ILYD+ L A C+
Sbjct: 414 TDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 448
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 116/379 (30%), Positives = 188/379 (49%), Gaps = 37/379 (9%)
Query: 80 KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVF 136
S +PI+SG ++ Q+ YIV IG TL++ DT +D WV PC C +F
Sbjct: 50 DSQIPISSGARL-QTLNYIVTVGIGGQNSTLIV--DTGSDLTWVQCLPCRLCYNQQEPLF 106
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCGGG---------ACAFNLTYGSSTIA-ANLSQDT 186
N + S++F +L C + C + PT G +C + + YG + + L +
Sbjct: 107 NPSNSSSFLSLPCNSPTCVAL-QPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEK 165
Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
++L + + FGC + G GL+GL R LSL++QT +L+ S FSYCLP+
Sbjct: 166 LTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPT-TG 224
Query: 247 LSFSGSLRLG-----PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
+ SGSL LG I YT +++NP+ S+ Y++NL I +G +++P +
Sbjct: 225 VGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP----R 280
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PI 357
+ G +++DSGTV TRL Y A + F ++ T +TC+++ +
Sbjct: 281 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 340
Query: 358 VAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNH 414
PT+ +F G M V + + S A I CLA A+ ++ +I N QQ+N
Sbjct: 341 NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTM--IIGNYQQKNQ 397
Query: 415 RILYDVPNSRLGVARELCT 433
R++Y+ S++G A E C+
Sbjct: 398 RVIYNSKESKVGFAGEPCS 416
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 129/446 (28%), Positives = 199/446 (44%), Gaps = 59/446 (13%)
Query: 31 SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS--- 81
++ + + H PCSP + KP S E +LA DQ R + + S+ A R
Sbjct: 89 TTRMTIVHRHGPCSPLAAAHRKPPSHGE----ILAADQNRAESIQHRVSTTATGRGKPKR 144
Query: 82 ---------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
+P +SGR + Y+V +GTPA + DT +D
Sbjct: 145 SRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPASRYTVVFDTGSDT 203
Query: 121 AWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
WV C CV +F+ A+S+T+ N+ C A C + C GG C + + YG
Sbjct: 204 TWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYGDG 263
Query: 177 TIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
+ + + DT++L++ D V G+ FGC ++ G GLLGLGRG SL QT + Y
Sbjct: 264 SYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYG 323
Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
F++CLP+ + G + + TP+L + YYV + IRVG +++
Sbjct: 324 GVFAHCLPARSTGTGYLDFGAGSLAAARARLTTPMLTE-NGPTFYYVGMTGIRVGGQLLS 382
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFD 350
IP AGTI+DSGTV TRL AY+++R R SL D
Sbjct: 383 IPQSVFAT-----AGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSL--LD 435
Query: 351 TCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
TCY + PT++L+F G + ++++ + S CLA AA D + + ++
Sbjct: 436 TCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGD--VGIV 493
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N Q + + YD+ +G C
Sbjct: 494 GNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 123/416 (29%), Positives = 202/416 (48%), Gaps = 45/416 (10%)
Query: 49 SKPLSWEESVLEMLAKDQARLQFL---------SSLAVARKSVVPIASGRQITQSPTYIV 99
K + W + + L D R++ + S A ++ +P++SG + Q+ YIV
Sbjct: 9 EKKIDWNRRLQKQLISDDLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINL-QTLNYIV 67
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQ 156
+G+ T+++ DT +D WV C C+ C + +F + S++++++ C ++ C+
Sbjct: 68 TMGLGSTNMTVII--DTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 157 VPNPTCGGGACAFN-------LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
+ T GAC N + YG S L + +S V + FGC + G
Sbjct: 126 LQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGGVSVSDFVFGCGRNNKGL 185
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----I 264
GL+GLGR LSL++QT + FSYCLP+ ++ + SGSL +G + I
Sbjct: 186 FGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGA-SGSLVMGNESSVFKNVTPI 244
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
YT +L NP+ S+ Y +NL I D+ ALQ G +IDSGTV TRL +
Sbjct: 245 TYTRMLPNPQLSNFYILNLTGI-------DVDGVALQVPSFGNGGVLIDSGTVITRLPSS 297
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQD---N 377
Y A++ +F ++ + DTC+++ + PTI++ F G N L D
Sbjct: 298 VYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEG-NAELKVDATGT 356
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ S CLA+A+ D ++ +I N QQ+N R++YD S++G A E C+
Sbjct: 357 FYVVKEDASQVCLALASLSDAYDTA--IIGNYQQRNQRVIYDTKQSKVGFAEESCS 410
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 128/408 (31%), Positives = 183/408 (44%), Gaps = 52/408 (12%)
Query: 67 ARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
AR Q S A A V + + + YI+ IGTP + DT +D W C
Sbjct: 57 AREQLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCA 116
Query: 127 -----------GCVGCSSTVFNSAQSTTFKNLGCQ------AAQCKQVPNPTCGGGACAF 169
C S ++N + STTF L C AA P P C AC +
Sbjct: 117 PCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGC---ACMY 173
Query: 170 NLTYGSSTIAANLSQDTISLATDI------VPGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
N TYG+ A S +T + + VP FGC ++ + GL+GLGRGS+
Sbjct: 174 NQTYGTGWTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSM 233
Query: 224 SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-----IGQPKRIKYTPLLKNPRR--- 275
SL++Q L FSYCL F+ + + +L LGP + ++ TP + P +
Sbjct: 234 SLVSQ---LGAGAFSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPM 290
Query: 276 SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
S+ YY+NL I VG + IPP A G IIDSGT T LV AY VR R
Sbjct: 291 STYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRS 350
Query: 336 RVGSNLTVT----SLGGFDTCYSV-----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAG 385
+ + L + G D C+++ P P++TL F G ++ LP +N +I +
Sbjct: 351 LLVTRLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMILGSG- 409
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ CLAM + ++++ N QQQN +LYDV L A +C+
Sbjct: 410 -VWCLAMR---NQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVCS 453
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 132/429 (30%), Positives = 198/429 (46%), Gaps = 55/429 (12%)
Query: 48 PSKPLSWEESVLEMLAKDQARLQFL---------SSLAVARKSVVPIASGRQITQSPTYI 98
P P++ + + +LA D++R S+ + + VP+ SG ++ Q+ Y+
Sbjct: 87 PEDPVARDRYLRRLLAADESRANSFQPRRNKDRASASTQSASAEVPLTSGIRL-QTLNYV 145
Query: 99 VRAKIG----TPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
+G +PA L + +DT +D WV PC+ C +F+ A S T+ + C A
Sbjct: 146 TTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNA 205
Query: 152 AQCKQ-------VPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTF 199
+ C P GA C + L YG + + L+ DT++L + G+ F
Sbjct: 206 SACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVF 265
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
GC G GL+GLGR LSL++QT + Y FSYCLP+ + SGSL LG
Sbjct: 266 GCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGD 325
Query: 260 QPKR-------IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
+ YT ++ +P + Y++N+ VG AL + +I
Sbjct: 326 DAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGT-------ALAAQGLGASNVLI 378
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF---DTCYSV----PIVAPTITLM 365
DSGTV TRL Y AVR F R+ G+ + GF DTCY + + P +TL
Sbjct: 379 DSGTVITRLAPSVYRAVRAEFMRQFGA-AGYPAAPGFSILDTCYDLTGHDEVKVPLLTLR 437
Query: 366 FS-GMNVTLPQDNLL-IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G +VT+ +L + GS CLAMA+ + +I N QQ+N R++YD S
Sbjct: 438 LEGGADVTVDAAGMLFVVRKDGSQVCLAMASL--SYEDETPIIGNYQQKNKRVVYDTLGS 495
Query: 424 RLGVARELC 432
RLG A E C
Sbjct: 496 RLGFADEDC 504
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 130/428 (30%), Positives = 199/428 (46%), Gaps = 27/428 (6%)
Query: 23 PICDTQDHSS----TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA 78
PI + ++SS L++FH F P P ++E + ++D R+ L L +
Sbjct: 58 PIFELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERI----SRDSKRVSSLLRLLSS 113
Query: 79 RKSVVPIASGRQITQ-----SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVG 130
G + S Y VR +G+P ++ + +D+ +D WV PC+ C
Sbjct: 114 GSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQ 173
Query: 131 CSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL 189
S VF+ A S T+ + C ++ C ++ N C G C + ++YG S L+ +T++
Sbjct: 174 QSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTF 233
Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
++ GC G + GLLGLG G++S + Q FSYCL S +
Sbjct: 234 GRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVS-RGTES 292
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
+G+L G P + PL++NPR S YYV L + VG V IP + G
Sbjct: 293 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 352
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLM 365
++D+GT TRL APAY A RD F + + + FDTCY+ V + PT++
Sbjct: 353 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 412
Query: 366 FSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
FSG + TLP N LI C A AA+ S L++I N+QQ+ +I D N
Sbjct: 413 FSGGPILTLPARNFLIPVDGEGTFCFAFAASA----SGLSIIGNIQQEGIQISIDGSNGF 468
Query: 425 LGVARELC 432
+G +C
Sbjct: 469 VGFGPTIC 476
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 132/426 (30%), Positives = 198/426 (46%), Gaps = 47/426 (11%)
Query: 22 NPICDTQDHSST-LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR- 79
+P+ Q+ + T L++ H PC+P + S + SV + L DQ R + + R
Sbjct: 53 DPVAPQQNDTFTVLRLTHRHGPCAPLRASSLAA--PSVADTLRADQRRAEHILRRVSGRG 110
Query: 80 ----------KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
+ VP G I S Y+V A +GTP + +DT +D +WV C C
Sbjct: 111 APQLWDYKAAAATVPANWGYDIGTS-NYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCA 169
Query: 130 GCS-----STVFNSAQSTTFKNLGCQAAQCKQ--VPNPTCGGGACAFNLTYG-SSTIAAN 181
S +F+ AQS+++ + C + C + C C + ++YG S
Sbjct: 170 APSCYRQKDPLFDPAQSSSYAAVPCGRSACAGLGIYASACSAAQCGYVVSYGDGSNTTGV 229
Query: 182 LSQDTISLATD-IVPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSY 239
S DT++LA + V G+ FGC +G GLLG GR SL+ QT Y FSY
Sbjct: 230 YSSDTLTLAANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSY 289
Query: 240 CLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
CLP+ S +G L LG P G T LL +P + Y V L I VG + + +P
Sbjct: 290 CLPTKS--STTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPAS 347
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-- 356
A AGT++D+GTV TRL AY A+R FR + S + +G DTCYS
Sbjct: 348 AFA------AGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGY 401
Query: 357 --IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ ++ L F SG +TL D ++ S CLA A++ + + ++ N+QQ++
Sbjct: 402 GTVNLTSVALTFSSGATMTLGADGIM------SFGCLAFASS--GSDGSMAILGNVQQRS 453
Query: 414 HRILYD 419
+ D
Sbjct: 454 FEVRID 459
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 125/437 (28%), Positives = 207/437 (47%), Gaps = 46/437 (10%)
Query: 24 ICDTQDH----SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR 79
+CD + +S+L+V + PC+ K S E+L +DQ R++ + +
Sbjct: 53 VCDHSNKVLNKASSLKVVSKYGPCTVTGDPKTF---PSAAEILRRDQLRVKSIRAKHSMN 109
Query: 80 KSVVPIASGRQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C- 131
S + + T+ PT Y V +GTP + + DT +D W C C G C
Sbjct: 110 SSTTGVFN-EMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCF 168
Query: 132 --SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG----GACAFNLTYGSSTIAANLSQD 185
+ F+ +ST++KNL C + CK + + G +C + + YG+ L+ +
Sbjct: 169 PQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYGTGYTVGFLATE 228
Query: 186 TISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
T+++ +D+ + GC ++ G GLLGLGR ++L +QT + Y++ FSYCLP+
Sbjct: 229 TLTITPSDVFENFVIGCGERNGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCLPA- 287
Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
+ S +G L G G + K+TP+ + LY +++ I VG R + I P +
Sbjct: 288 -SSSSTGHLSFGG-GVSQAAKFTPITS--KIPELYGLDVSGISVGGRKLPIDPSVFRT-- 341
Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSVP------I 357
AGTIIDSGT T L + A++A+ F+ + +N T+T G CY I
Sbjct: 342 ---AGTIIDSGTTLTYLPSTAHSALSSAFQEMM-TNYTLTKGTSGLQPCYDFSKHANDNI 397
Query: 358 VAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV-LNVIANMQQQNHR 415
P I++ F G+ V + + I + CLA DN N + + N+QQ+ +
Sbjct: 398 TIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFK---DNGNDTDVAIFGNVQQKTYE 454
Query: 416 ILYDVPNSRLGVARELC 432
++YDV +G A C
Sbjct: 455 VVYDVAKGMVGFAPGGC 471
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 125/410 (30%), Positives = 199/410 (48%), Gaps = 34/410 (8%)
Query: 41 SPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIV 99
SP SPF P +S E + + Q RL+ L K+V P+ +G +++
Sbjct: 64 SPLSPFSPGN-ISSTERFKRAIKRSQDRLEKLQMSVDEVKAVEAPVYAGNG-----EFLM 117
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ 156
+ IGTP+ + +DT +D W PCT C + +++ +QS+T+ + C ++ C+
Sbjct: 118 KMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMCQA 177
Query: 157 VPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGL 215
+P +C G C + +YG S+ LS ++ +L + +P FGC Q+ G G
Sbjct: 178 LPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQSLPHIAFGCGQENEGGGFSQGGG 237
Query: 216 LGLGRGS-LSLLAQTQNLYQSTFSYCLPSF-KALSFSGSLRLGPIG--QPKRIKYTPLLK 271
L LSL++Q + FSYCL S + S + L +G K + TPL++
Sbjct: 238 LVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLNAKTVSSTPLVQ 297
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
+ R + YY++L I VG +++DI G G IIDSGT T L Y D
Sbjct: 298 SRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTYLEQSGY----D 353
Query: 332 VFRRRVGSNLTVTSLG----GFDTCY-----SVPIVAPTITLMFSGMNVTLPQDNLLIHS 382
V ++ V S++ + + G D C+ S PTIT F G + LP++N +
Sbjct: 354 VVKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTSHFPTITFHFEGADFNLPKENYIYTD 413
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++G I CLAM P N +++ N+QQQN++ILYD + L A +C
Sbjct: 414 SSG-IACLAM--LPSN---GMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 131/428 (30%), Positives = 202/428 (47%), Gaps = 49/428 (11%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-------------SLAV 77
++T+ + H PCSP K S E+ L +DQ R ++ + V
Sbjct: 56 ATTVPLHHRHGPCSPLPTKKMPSLED----RLHRDQLRAAYIKRKFSGDVKKDGQGAGGV 111
Query: 78 ARKSV-VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV- 135
+ V VP G + + Y++ ++G+PA+T + +D+ +D +WV C C+ C S V
Sbjct: 112 EQSHVTVPTTLGTSL-NTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVD 170
Query: 136 --FNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTY--GSSTIAANLSQDTIS 188
F+ + S+T+ C +A C Q+ G C + + Y GSST S DT++
Sbjct: 171 PLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTG-TYSSDTLA 229
Query: 189 LATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
L ++ + + FGC +G + GL+GLG G+ SL +QT + + FSYCLP S
Sbjct: 230 LGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLP--PTPS 287
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
SG L LG G +K TP+L++ + Y V L AIRVG + IP A
Sbjct: 288 SSGFLTLG-AGTSGFVK-TPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFS------A 339
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITL 364
G ++DSGT+ TRL AY+A+ F+ + DTC+ + P++ L
Sbjct: 340 GMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVAL 399
Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
+FSG V N +I CLA AA D +S ++ N+QQ+ +LYDV
Sbjct: 400 VFSGGAVVNLDANGIILG-----NCLAFAANSD--DSSPGIVGNVQQRTFEVLYDVGGGA 452
Query: 425 LGVARELC 432
+G C
Sbjct: 453 VGFKAGAC 460
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 130/445 (29%), Positives = 202/445 (45%), Gaps = 64/445 (14%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA-----------RK-- 80
L + H SPCSP PL + +L D AR+ L+S A RK
Sbjct: 46 LTLHHPQSPCSP----APLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQK 101
Query: 81 -----------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
+ VP++ G + Y+ + +GTP+ + M +DT + W+
Sbjct: 102 KAAGGASGGHHLDDDSLASVPLSPGTSVGVG-NYVTQLGLGTPSTSYAMVVDTGSSLTWL 160
Query: 124 PCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCKQV------PNPTCGGGACAFNLTY 173
C+ CV C V F+ S+T+ ++ C A+QC ++ P+ C + +Y
Sbjct: 161 QCSPCVVSCHRQVGPLFDPRASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQASY 220
Query: 174 GSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
G S+ + +LS DT+S + P + +GC Q G GL+GL R LSLL Q
Sbjct: 221 GDSSFSVGSLSTDTVSFGSTRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPS 280
Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
+FSYCLP+ + +G L +GP YTP+ + +SLY++ L + VG
Sbjct: 281 LGYSFSYCLPTAAS---TGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSP 337
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
+ + P P TIIDSGTV TRL +TA+ + + + DTC
Sbjct: 338 LAVSPSEYSSLP-----TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTC 392
Query: 353 Y---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
+ + + PT+ + F+ G ++ L N+LI S TCLA AP + + +I N
Sbjct: 393 FEGQASQLRVPTVAMAFAGGASMKLTTRNVLI-DVDDSTTCLAF--APTDSTA---IIGN 446
Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
QQQ ++YDV SR+G + C+
Sbjct: 447 TQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 121/362 (33%), Positives = 168/362 (46%), Gaps = 34/362 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT----GCVGCSSTVFNSAQSTTFKNLGCQAA 152
+++ IGTP L DT +D W C C + ++N + STTF L C ++
Sbjct: 85 FLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSS 144
Query: 153 QCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDI------VPGYTFGCIQKAT 206
+ P C AC +N+TYGS +T + + VPG FGC ++
Sbjct: 145 --LGLCAPAC---ACMYNMTYGSGWTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASS 199
Query: 207 G-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP---IGQPK 262
G N+ GL+GLGRGSLSL++Q L FSYCL ++ + + +L LGP +
Sbjct: 200 GFNASSASGLVGLGRGSLSLVSQ---LGAPKFSYCLTPYQDTNSTSTLLLGPSASLNDTG 256
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
+ TP + +P S YY+NL I +G + IPP A G IIDSGT T L
Sbjct: 257 VVSSTPFVASPS-SIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLG 315
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLG-GFDTCY------SVPIVAPTITLMFSGMNVTLPQ 375
AY VR V T S G D C+ S P P++TL F G ++ LP
Sbjct: 316 NTAYQQVRAAVLSLVTLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFDGADMVLPA 375
Query: 376 DNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
DN ++ + CLAM D V++++ N QQQN ILYDV L A
Sbjct: 376 DNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAK 435
Query: 432 CT 433
C+
Sbjct: 436 CS 437
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 135/443 (30%), Positives = 211/443 (47%), Gaps = 78/443 (17%)
Query: 46 FKPSKPLSWEESVLEMLAKDQARLQFL---------------SSLAV-ARKSVVPIASGR 89
F P+ S EE +L+ D AR+ L + +AV A K+ VP++SG
Sbjct: 77 FSPAPANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVTASKAQVPVSSGA 136
Query: 90 QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKN 146
++ ++ Y+ +G T+++ DT+++ WV C C C +F+ + S ++
Sbjct: 137 RL-RTLNYVATVGLGGGEATVIV--DTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAA 193
Query: 147 LGCQAAQCKQVPN----------PTCGGG---ACAFNLTYGSSTIAAN-LSQDTISLATD 192
+ C + C + P C G AC++ L+Y + + L+ D +SLA +
Sbjct: 194 VPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE 253
Query: 193 IVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
++ G+ FGC T N PP GL+GLGR LSL++QT + + FSYCLP +
Sbjct: 254 VIDGFVFGC---GTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESD 310
Query: 249 FSGSLRLGPIGQPKR----IKYT-------PLLKNPRRSSLYYVNLLAIRVGRRVVDIPP 297
SGSL LG R + YT PLL+ P Y VNL I VG + V+
Sbjct: 311 ASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGP----FYLVNLTGITVGGQEVE--- 363
Query: 298 GALQFNPTTG--AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
+TG A I+DSGTV T LV Y AVR F ++ DTC+++
Sbjct: 364 -------STGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNM 416
Query: 356 ----PIVAPTITLMFS-GMNVTLPQDNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANM 409
+ P++TL+F G V + +L S+ S CLA+A+ + ++I N
Sbjct: 417 TGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDET--SIIGNY 474
Query: 410 QQQNHRILYDVPNSRLGVARELC 432
QQ+N R+++D S++G A+E C
Sbjct: 475 QQKNLRVVFDTSASQVGFAQETC 497
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 135/432 (31%), Positives = 198/432 (45%), Gaps = 38/432 (8%)
Query: 21 LNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARL-QFLSSLAVAR 79
+N C+ +Q+ HV + LS E + M + +AR + LSS A A
Sbjct: 24 INSCCNAAAAPVRMQLTHV-------DAGRGLSGRELMRRMALRSKARAPRLLSSSATAP 76
Query: 80 KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
S G +T+ Y++ IGTP Q + + +DT + W C C C S +
Sbjct: 77 VSPGAYDDGVPMTE---YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYY 133
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-CGGGA---CAFNLTYGS-STIAANLSQDTIS-LA 190
++++S+TF C + QCK P+ T C CA++ +YG S L +T+S +A
Sbjct: 134 DASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVA 193
Query: 191 TDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
VPG FGC TG + G+ G GRG LSL +Q L FS+C +
Sbjct: 194 GASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVSGRKP 250
Query: 250 SGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
S L P K ++ TPL+KNP + YY++L I VG + +P A
Sbjct: 251 STVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNG 310
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-----P 360
TG GTIIDSGT FT L Y V D F V + ++ G C+S P + P
Sbjct: 311 TG-GTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVP 369
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ L F G + LP++N + + G + +A + + +I N QQQN +LYD+
Sbjct: 370 KLVLHFEGATMHLPRENYVFEAKDGGNCSICLAI----IEGEMTIIGNFQQQNMHVLYDL 425
Query: 421 PNSRLGVARELC 432
NS+L R C
Sbjct: 426 KNSKLSFVRAKC 437
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 117/358 (32%), Positives = 167/358 (46%), Gaps = 19/358 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
PI SG S Y R IG P+ + M +DT +D W+ C C C + +F A
Sbjct: 132 PIISGTS-QGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPAS 190
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
ST++ L C QC+ + C C + ++YG S + +TI+L + V
Sbjct: 191 STSYSPLSCDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSASVDNVAI 250
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
GC G + GLLGLG G LS +Q + S+FSYCL + S S +L
Sbjct: 251 GCGHNNEGLFIGAAGLLGLGGGKLSFPSQ---INASSFSYCLVDRDSDSAS-TLEFNSAL 306
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
P I PLL+N + YYV + + VG ++ IP + + + G IIDSGT T
Sbjct: 307 LPHAIT-APLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVT 365
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVT-LP 374
RL AY A+RD F + + + FDTCY + + PT+T +G V LP
Sbjct: 366 RLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLP 425
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N LI + C A A +S L++I N+QQQ R+ +D+ NS +G C
Sbjct: 426 ATNYLIPVDSDGTFCFAFAP----TSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 123/386 (31%), Positives = 176/386 (45%), Gaps = 20/386 (5%)
Query: 62 LAKDQARL-QFLSSLAVARKSVVPIASGRQITQ-----SPTYIVRAKIGTPAQTLLMAMD 115
+ +D R L LA + + A G + S Y VR +G+P + + MD
Sbjct: 95 MQRDTKRAASLLRRLAAGKPTYAAEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVMD 154
Query: 116 TSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT 172
+ +D WV PCT C S VFN A S++F + C + C V N C G C + ++
Sbjct: 155 SGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCASTVCSHVDNAACHEGRCRYEVS 214
Query: 173 YGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
YG S L+ +TI+ ++ GC G V GLLGLG G +S + Q
Sbjct: 215 YGDGSYTKGTLALETITFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGG 274
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
FSYCL S + + SG L G P + PL+ NPR S YY+ L + VG
Sbjct: 275 QTGGAFSYCLVS-RGIESSGLLEFGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGL 333
Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT 351
V I + + G ++D+GT TRL AY A RD F + + + + FDT
Sbjct: 334 RVSISEDVFKLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDT 393
Query: 352 CYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
CY V + PT++ FSG + TLP N LI C A A + +S L++I
Sbjct: 394 CYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPS----SSGLSII 449
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N+QQ+ +I D N +G +C
Sbjct: 450 GNIQQEGIQISVDGANGFVGFGPNVC 475
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 130/445 (29%), Positives = 201/445 (45%), Gaps = 64/445 (14%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA-----------RK-- 80
L + H SPCSP PL + +L D AR+ L+S A RK
Sbjct: 46 LTLHHPQSPCSP----APLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQK 101
Query: 81 -----------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
+ VP++ G + Y+ + +GTP+ + M +DT + W+
Sbjct: 102 KAAGGASGGHHLDDDSLASVPLSPGTSVGVG-NYVTQLGLGTPSTSYAMVVDTGSSLTWL 160
Query: 124 PCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCKQV------PNPTCGGGACAFNLTY 173
C+ CV C V F+ S+T+ ++ C A+QC ++ P+ C + +Y
Sbjct: 161 QCSPCVVSCHRQVGPLFDPRASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQASY 220
Query: 174 GSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
G S+ + LS DT+S + P + +GC Q G GL+GL R LSLL Q
Sbjct: 221 GDSSFSVGYLSTDTVSFGSTSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPS 280
Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
+FSYCLP+ + +G L +GP YTP+ + +SLY++ L + VG
Sbjct: 281 LGYSFSYCLPTAAS---TGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSP 337
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
+ + P P TIIDSGTV TRL +TA+ + + + DTC
Sbjct: 338 LAVSPSEYSSLP-----TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTC 392
Query: 353 Y---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
+ + + PT+ + F+ G ++ L N+LI S TCLA AP + + +I N
Sbjct: 393 FEGQASQLRVPTVVMAFAGGASMKLTTRNVLI-DVDDSTTCLAF--APTDSTA---IIGN 446
Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
QQQ ++YDV SR+G + C+
Sbjct: 447 TQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 100/346 (28%), Positives = 166/346 (47%), Gaps = 32/346 (9%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
+ IG PA +DT +D W PCT C + +F+ +S+++ +GC + C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60
Query: 156 QVPNPTCG--GGACAFNLTYGS-STIAANLSQDTISLATD-IVPGYTFGCIQKATGNSVP 211
+P C AC + TYG S+ L+ +T + + + G FGC + G+
Sbjct: 61 ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFS 120
Query: 212 P-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG----------- 259
GL+GLGRG LSL++Q L ++ FSYCL S + S SL +G +
Sbjct: 121 QGSGLVGLGRGPLSLISQ---LKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASL 177
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
+ K LL+NP + S YY+ L I VG + + + + G IIDSGT T
Sbjct: 178 DGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTIT 237
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLP 374
L A+ +++ F R+ + + G D C+ +P I P + F G ++ LP
Sbjct: 238 YLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLELP 297
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+N ++ ++ + CLAM ++ + +++ N+QQQN +L+D+
Sbjct: 298 GENYMVADSSTGVLCLAMGSS-----NGMSIFGNVQQQNFNVLHDL 338
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 128/409 (31%), Positives = 184/409 (44%), Gaps = 42/409 (10%)
Query: 60 EMLAKDQARLQFLSSLAVARKSV----VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMD 115
+ LA D RL FLS + RK V P+ SG + S Y V +IG P Q+LL+ D
Sbjct: 46 QALALDTRRLHFLS---LRRKPVPFVKSPVVSGAS-SGSGQYFVDLRIGQPPQSLLLIAD 101
Query: 116 TSNDAAWVPCTGCVGCS----STVFNSAQSTTFKNLGCQAAQCKQVPNP--------TCG 163
T +D WV C+ C CS +TVF S+TF C C+ VP P T
Sbjct: 102 TGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRI 161
Query: 164 GGACAFNLTYGSSTIAANL-SQDTISLATD-----IVPGYTFGCIQKATGNSVP------ 211
C + Y ++ + L +++T SL T + FGC + +G SV
Sbjct: 162 HSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNG 221
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIGQP-KRIKYTPL 269
G++GLGRG +S +Q + + FSYCL + + + L +G G ++ +TPL
Sbjct: 222 ANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPL 281
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
L NP + YYV L ++ V + I P + + + GT++DSGT L PAY V
Sbjct: 282 LTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLV 341
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLLIHST 383
++R+ GFD C +V V P + FSG V +P T
Sbjct: 342 IAAVKQRIKLPNADELTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIET 401
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
I CLA+ + V +VI N+ QQ +D SRLG +R C
Sbjct: 402 EEQIQCLAIQSVDPKVG--FSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 114/348 (32%), Positives = 178/348 (51%), Gaps = 15/348 (4%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-TVFNSAQSTTFKNLGCQAA 152
S Y+++ IGTPA +L MDT +D W C C CS+ ++++ + S+T+ + CQ++
Sbjct: 39 SGEYLIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSSIYDPSSSSTYSKVLCQSS 98
Query: 153 QCKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSV 210
C+ +C G C + YG S+ + LS +T S+++ +P TFGC G
Sbjct: 99 LCQPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSISSQSLPNITFGCGHDNQGFD- 157
Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTP 268
GL+G GRGSLSL++Q + FSYCL S S + L +G + + TP
Sbjct: 158 KVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTP 217
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L+++ ++ YY++L I VG + + IP G G IIDSGT T L AY A
Sbjct: 218 LVQS-SSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDA 276
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIHSTA 384
V++ + NL G D C++ + P++T F G + +P++N L +
Sbjct: 277 VKEAMVSSI--NLPQAD-GQLDLCFNQQGSSNPGFPSMTFHFKGADYDVPKENYLFPDST 333
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
I CLAM N+ + + + N+QQQN++ILYD N+ L A C
Sbjct: 334 SDIVCLAMMPTNSNLGN-MAIFGNVQQQNYQILYDNENNVLSFAPTAC 380
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 119/354 (33%), Positives = 166/354 (46%), Gaps = 22/354 (6%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
S Y VR IG+P + + MDT +D W+ C+ C C + VF+ S++F+ L C
Sbjct: 11 SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCS 70
Query: 151 AAQCKQVPNPTCGG--GACAFNLTYGSSTI-AANLSQDTISLATDIVPGYTFGCIQKATG 207
QCK + C C + ++YG + +L+ D+ ++ FGC G
Sbjct: 71 TPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTSPVVFGCGHDNEG 130
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIGQP--KRI 264
V GLLGLG G LS +Q L FSYCL S + S +L G P
Sbjct: 131 LFVGAAGLLGLGAGKLSFPSQ---LSSRKFSYCLVSRDNGVRASSALLFGDSALPTSASF 187
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVA 323
YT LLKNP+ + YY L I +G ++ IP A + + +TG G IIDSGT TRL
Sbjct: 188 AYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPT 247
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFS-GMNVTLPQDNL 378
AYT +RD FR FDTCY + PT++ F G +V LP N
Sbjct: 248 YAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNY 307
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L+ C A + + L++I N+QQQ R+ D+ +SR+G A C
Sbjct: 308 LVPVDTSGTFCFAFSKTSLD----LSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 186/387 (48%), Gaps = 33/387 (8%)
Query: 52 LSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQTL 110
L+ E + + + + R++ ++++ + + P+ +G S Y++ IGTPA +L
Sbjct: 55 LTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAG-----SGEYLMNVAIGTPASSL 109
Query: 111 LMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGAC 167
MDT +D W PCT C + +FN S++F L C++ C+ +P+ +C C
Sbjct: 110 SAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSESCYN-DC 168
Query: 168 AFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQK----ATGNSVPPQGLLGLGRGS 222
+ YG S+ ++ +T + T VP FGC + GN GL+G+G G
Sbjct: 169 QYTYGYGDGSSTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGA---GLIGMGWGP 225
Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI--GQPKRIKYTPLLKNPRRSSLYY 280
LSL +Q L FSYC+ + S +L LG G P+ T L+ + + YY
Sbjct: 226 LSLPSQ---LGVGQFSYCM-TSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYY 281
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
+ L I VG + IP Q G IIDSGT T L AY AV F ++ +
Sbjct: 282 ITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLS 341
Query: 341 LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
S G TC+ +P + P I++ F G + L ++N+LI S A + CLAM ++
Sbjct: 342 PVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNLGEENVLI-SPAEGVICLAMGSS 400
Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPN 422
+++ N+QQQ ++LYD+ N
Sbjct: 401 SQQ---GISIFGNIQQQETQVLYDLQN 424
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 186/371 (50%), Gaps = 40/371 (10%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GC---SSTVFNSA 139
P+ G I S Y V+ +G+PA+ M +DT + +W+ C CV C + +F+ +
Sbjct: 1 PLNPGASI-GSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPS 59
Query: 140 QSTTFKNLGCQAAQCKQV-----PNPTC--GGGACAFNLTYGSSTIAAN-LSQDTISLA- 190
S T+K+L C ++QC + NP C C + +YG S+ + LSQD ++LA
Sbjct: 60 ASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAP 119
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
+ +PG+ +GC Q + G G+LGLGR LS+L Q + + FSYCLP+ F
Sbjct: 120 SQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGF- 178
Query: 251 GSLRLGPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
L +G K+TP+ +P SLY++ L AI VG R + + A Q+
Sbjct: 179 --LSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGV--AAAQYR----VP 230
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF---DTCYSVPI----VAPTI 362
TIIDSGTV TRL YT + F + + S GF DTC+ + P +
Sbjct: 231 TIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAP--GFSILDTCFKGNLKDMQSVPEV 288
Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
L+F G ++ L N+L+ G +TCLA A N+ + +I N QQQ ++ +D+
Sbjct: 289 RLIFQGGADLNLRPVNVLLQVDEG-LTCLAFAG-----NNGVAIIGNHQQQTFKVAHDIS 342
Query: 422 NSRLGVARELC 432
+R+G A C
Sbjct: 343 TARIGFATGGC 353
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/380 (32%), Positives = 182/380 (47%), Gaps = 50/380 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG---------CSST-VFNSAQSTTFKN 146
Y+V GTP Q +L+ DT +D W+ C+ CS F +++S T
Sbjct: 53 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 112
Query: 147 LGCQAAQCKQVPNPTCGGGAC----------AFNLTYGSSTIAANLSQDTISLATDI--- 193
+ C AAQC VP P G AC A++ GSST L++DT +++
Sbjct: 113 VPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTG-FLARDTATISNGTSGG 171
Query: 194 --VPGYTFGCIQKATGNSVPPQG-LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
V G FGC + G S G ++GLG+G LS AQ+ +L+ TFSYCL +
Sbjct: 172 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 231
Query: 251 GSLRLGPIGQPKR---IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
S +G+P+R YTPL+ NP + YYV ++AIRVG RV+ +P +
Sbjct: 232 RSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGN 291
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRR-----RVGSNLTVTSLGGFDTCYSVPIVA--- 359
GT+IDSG+ T L AY + F R+ S+ T G + CY+V +
Sbjct: 292 GGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATF--FQGLELCYNVSSSSSSA 349
Query: 360 ------PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
P +T+ F+ G+++ LP N L+ A + CLA+ P NV+ N+ QQ
Sbjct: 350 PANGGFPRLTIDFAQGLSLELPTGNYLV-DVADDVKCLAI--RPTLSPFAFNVLGNLMQQ 406
Query: 413 NHRILYDVPNSRLGVARELC 432
+ + +D ++R+G AR C
Sbjct: 407 GYHVEFDRASARIGFARTEC 426
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 128/446 (28%), Positives = 199/446 (44%), Gaps = 59/446 (13%)
Query: 31 SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS--- 81
++ + + H PCSP + KP S E +LA DQ R + + S+ A R
Sbjct: 87 TTRMTIVHRHGPCSPLAAAHRKPPSHGE----ILAADQNRAESIQHRVSTTATGRGKPKR 142
Query: 82 ---------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
+P +SGR + Y+V +GTPA + DT +D
Sbjct: 143 SRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPASRYTVVFDTGSDT 201
Query: 121 AWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
WV C CV +F+ +S+T+ N+ C A C + C GG C + + YG
Sbjct: 202 TWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYGDG 261
Query: 177 TIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
+ + + DT++L++ D V G+ FGC ++ G GLLGLGRG SL QT + Y
Sbjct: 262 SYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYG 321
Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
F++CLP+ + G TP+L + + YY+ + IRVG +++
Sbjct: 322 GVFAHCLPARSTGTGYLDFGAGSPAAASARLTTPMLTD-NGPTFYYIGMTGIRVGGQLLS 380
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFD 350
IP AGTI+DSGTV TRL PAY+++R R SL D
Sbjct: 381 IPQSVFAT-----AGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSL--LD 433
Query: 351 TCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
TCY + PT++L+F G + ++++ + S CLA AA D + + ++
Sbjct: 434 TCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGD--VGIV 491
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N Q + + YD+ +G +C
Sbjct: 492 GNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/361 (33%), Positives = 178/361 (49%), Gaps = 26/361 (7%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y + +GTPA T LM +DT +D W+ PC C S VF+ +S ++ + C
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCV 178
Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
A C+++ + C +C + + YG ++ A + + +T++ A V GC
Sbjct: 179 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNE 238
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------G 259
G + GLLGLGRG LS +Q + +FSYCL + S R +
Sbjct: 239 GLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 298
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTV 317
+TP+ +NPR ++ YYV+LL V G RV + L+ NPTTG G I+DSGT
Sbjct: 299 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTS 358
Query: 318 FTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNV 371
TRL P Y AVRD FR VG ++ FDTCY++ + PT+++ + G +V
Sbjct: 359 VTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASV 418
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
LP +N LI C AMA V ++I N+QQQ R+++D R+G +
Sbjct: 419 ALPPENYLIPVDTSGTFCFAMAGTDGGV----SIIGNIQQQGFRVVFDGDAQRVGFVPKS 474
Query: 432 C 432
C
Sbjct: 475 C 475
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 170/366 (46%), Gaps = 23/366 (6%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG + S Y +R +GTP + + + MDT +D W+ C CV C S +F+ +
Sbjct: 46 PVVSGLSL-GSGEYFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYK 104
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD------I 193
S+T+ LGC QC + TC C + + YG + D +SL + +
Sbjct: 105 SSTYSTLGCSTRQCLNLDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVV 164
Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS- 252
+ GC G V GLLGLG+G LS Q FSYCL + S GS
Sbjct: 165 LNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSS 224
Query: 253 LRLGPIG-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
L G P ++TP N R + YY+ + I VG ++ IP A Q + G I
Sbjct: 225 LVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVI 284
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMF- 366
IDSGT TRL AY ++RD FR FDTCY + +A PT+TL F
Sbjct: 285 IDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQ 344
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G ++ LP N LI + CLA A + ++I N+QQQ R++YD ++++G
Sbjct: 345 GGTDLKLPASNYLIPVDNSNTFCLAFAGT-----TGPSIIGNIQQQGFRVIYDNLHNQVG 399
Query: 427 VARELC 432
C
Sbjct: 400 FVPSQC 405
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 113/345 (32%), Positives = 170/345 (49%), Gaps = 29/345 (8%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAW----VPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
TY+V IGTP L +DT +D W PC C + ++ A+S T+ N+ C++
Sbjct: 91 TYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRS 150
Query: 152 AQCK--QVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKA 205
C+ Q P C CA+ +YG T L+ +T +L +D V G FGC +
Sbjct: 151 PMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTEN 210
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRI 264
G++ GL+G+GRG LSL++Q L + FSYC F A + S L LG +
Sbjct: 211 LGSTDNSSGLVGMGRGPLSLVSQ---LGVTRFSYCFTPFNATAAS-PLFLGSSARLSSAA 266
Query: 265 KYTPLLKNP-----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
K TP + +P RRSS YY++L I VG ++ I P + P G IIDSGT FT
Sbjct: 267 KTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFT 326
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQ 375
L A+ A+ RV L + G C++ + P + L F G ++ L +
Sbjct: 327 ALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRR 386
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
++ ++ + + CL M +A ++V+ +MQQQN ILYD+
Sbjct: 387 ESYVVEDRSAGVACLGMVSARG-----MSVLGSMQQQNTHILYDL 426
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/361 (33%), Positives = 178/361 (49%), Gaps = 26/361 (7%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y + +GTPA T LM +DT +D W+ PC C S VF+ +S ++ + C
Sbjct: 125 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCV 184
Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
A C+++ + C +C + + YG ++ A + + +T++ A V GC
Sbjct: 185 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNE 244
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------G 259
G + GLLGLGRG LS +Q + +FSYCL + S R +
Sbjct: 245 GLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 304
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTV 317
+TP+ +NPR ++ YYV+LL V G RV + L+ NPTTG G I+DSGT
Sbjct: 305 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTS 364
Query: 318 FTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNV 371
TRL P Y AVRD FR VG ++ FDTCY++ + PT+++ + G +V
Sbjct: 365 VTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASV 424
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
LP +N LI C AMA V ++I N+QQQ R+++D R+G +
Sbjct: 425 ALPPENYLIPVDTSGTFCFAMAGTDGGV----SIIGNIQQQGFRVVFDGDAQRVGFVPKS 480
Query: 432 C 432
C
Sbjct: 481 C 481
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 127/374 (33%), Positives = 181/374 (48%), Gaps = 42/374 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y V ++GTPA +++ MDT +D +W VPC CV FN S++F L C ++
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 198
Query: 154 CKQV---PNPTCG--GGACAFNLTYGSSTIAANL-SQDTISLAT----DIVP----GYTF 199
C V P C G C F++ YG ++++ L + +TI+ T D P T
Sbjct: 199 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 258
Query: 200 GCIQ-KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGP 257
GC G GLLG+ R +S +Q + Y FS+C P A L+ SG + G
Sbjct: 259 GCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGLVFFGE 318
Query: 258 --IGQPKRIKYTPLLKNPRRSS----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-T 310
I P ++YTPL++NP S YYV L+ I V + + + TG+G T
Sbjct: 319 SDIISP-YLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGSGGT 377
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--------PIVAPTI 362
IIDSGT FT L PA+ A+R F R V GF CY++ + P+I
Sbjct: 378 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALESTILPSI 437
Query: 363 TLMF-SGMNVTLPQDNLLI---HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
TL F G++V LP++++LI S + CLA + D N+I N QQQN + Y
Sbjct: 438 TLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGD---IPFNIIGNYQQQNLWVEY 494
Query: 419 DVPNSRLGVARELC 432
D+ RLG+A C
Sbjct: 495 DLEKLRLGIAPAQC 508
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 126/415 (30%), Positives = 196/415 (47%), Gaps = 45/415 (10%)
Query: 53 SWEESVLEMLAKDQARLQFL------------SSLAVARK-SVVPIASGRQITQSPTYIV 99
S E +LA D AR+ L S A A K + VP+ SG ++ ++ Y+
Sbjct: 57 SRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASASKLAQVPVTSGARL-RTLNYVA 115
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQ 156
IG T+++ DT+++ WV C C C +F+ + S ++ + C ++ C
Sbjct: 116 TVGIGGGEATVIV--DTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDA 173
Query: 157 VPNPTCGGG--------ACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATG 207
+ T G AC++ L+Y + + L+ D +SLA + + G+ FGC G
Sbjct: 174 LRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGEDIQGFVFGCGTSNQG 233
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR---- 263
GL+GLGR LSL++QT + + FSYCLP ++ S SGSL LG R
Sbjct: 234 PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGS-SGSLVLGDDASVYRNSTP 292
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
I YT ++ +P + Y NL I VG V P F+ G I+DSGT+ T LV
Sbjct: 293 IVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPG----FSAGGGGKAIVDSGTIITSLVP 348
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNL 378
Y AVR F ++ DTC+ + + P++ L+F G V + +
Sbjct: 349 SVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQVPSLKLVFDGGAEVEVDSKGV 408
Query: 379 LIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L T S CLA+A+ ++ +I N QQ+N R+++D S++G A+E C
Sbjct: 409 LYVVTGDASQVCLALASLKSEYDT--PIIGNYQQKNLRVIFDTVGSQIGFAQETC 461
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 124/417 (29%), Positives = 195/417 (46%), Gaps = 44/417 (10%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-------------SLAV 77
S+ L++ H PC+P + L S L+ L DQ R +++ LA
Sbjct: 53 SAVLRLTHRHGPCAPAGKASALGSPPSFLDTLRADQRRAEYIQRRVSGAAAAAPGMQLAG 112
Query: 78 ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--CSST- 134
++ + VP G I + Y+V +GTPA + +DT +D +WV C C C S
Sbjct: 113 SKAATVPANLGFSI-GTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQR 171
Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVP--NPTCGGGACAFNLTYGS-STIAANLSQDTISL 189
+F+ +S+++ + C AA C Q+ + C GG C + ++YG ST S DT++L
Sbjct: 172 DPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTL 231
Query: 190 -ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
++ + G+ FGC G GLLGLGR SL++Q + Y FSYCLP + +
Sbjct: 232 TGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQ--N 289
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G + LG TPLL + Y V L I VG + + I +
Sbjct: 290 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------S 343
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSV----PIVAPTI 362
G ++D+GTV TRL AY+A+R FR + + + G DTCY + PTI
Sbjct: 344 GAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTI 403
Query: 363 TLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
++ F G + T+G +T +A AP +S +++ N+QQ++ + +D
Sbjct: 404 SIAFGGGAA-------MDLGTSGILTSGCLAFAPTGGDSQASILGNVQQRSFEVRFD 453
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 113/345 (32%), Positives = 170/345 (49%), Gaps = 29/345 (8%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAW----VPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
TY+V IGTP L +DT +D W PC C + ++ A+S T+ N+ C++
Sbjct: 91 TYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRS 150
Query: 152 AQCK--QVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKA 205
C+ Q P C CA+ +YG T L+ +T +L +D V G FGC +
Sbjct: 151 PMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTEN 210
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRI 264
G++ GL+G+GRG LSL++Q L + FSYC F A + S L LG +
Sbjct: 211 LGSTDNSSGLVGMGRGPLSLVSQ---LGVTRFSYCFTPFNATAAS-PLFLGSSARLSSAA 266
Query: 265 KYTPLLKNP-----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
K TP + +P RRSS YY++L I VG ++ I P + P G IIDSGT FT
Sbjct: 267 KTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFT 326
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQ 375
L A+ A+ RV L + G C++ + P + L F G ++ L +
Sbjct: 327 ALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRR 386
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
++ ++ + + CL M +A ++V+ +MQQQN ILYD+
Sbjct: 387 ESYVVEDRSAGVACLGMVSARG-----MSVLGSMQQQNTHILYDL 426
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 127/374 (33%), Positives = 181/374 (48%), Gaps = 42/374 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y V ++GTPA +++ MDT +D +W VPC CV FN S++F L C ++
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 197
Query: 154 CKQV---PNPTCG--GGACAFNLTYGSSTIAANL-SQDTISLAT----DIVP----GYTF 199
C V P C G C F++ YG ++++ L + +TI+ T D P T
Sbjct: 198 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 257
Query: 200 GCIQ-KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGP 257
GC G GLLG+ R +S +Q + Y FS+C P A L+ SG + G
Sbjct: 258 GCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGLVFFGE 317
Query: 258 --IGQPKRIKYTPLLKNPRRSS----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-T 310
I P ++YTPL++NP S YYV L+ I V + + + TG+G T
Sbjct: 318 SDIISP-YLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGSGGT 376
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--------PIVAPTI 362
IIDSGT FT L PA+ A+R F R V GF CY++ + P+I
Sbjct: 377 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALESTILPSI 436
Query: 363 TLMF-SGMNVTLPQDNLLI---HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
TL F G++V LP++++LI S + CLA + D N+I N QQQN + Y
Sbjct: 437 TLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGD---IPFNIIGNYQQQNLWVEY 493
Query: 419 DVPNSRLGVARELC 432
D+ RLG+A C
Sbjct: 494 DLEKLRLGIAPAQC 507
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/361 (33%), Positives = 177/361 (49%), Gaps = 26/361 (7%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y + +GTPA T LM +DT +D W+ PC C S VF+ +S ++ + C
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCV 178
Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
A C+++ + C +C + + YG ++ A + + +T++ A V GC
Sbjct: 179 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNE 238
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------G 259
G + GLLGLGRG LS Q + +FSYCL + S R +
Sbjct: 239 GLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 298
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTV 317
+TP+ +NPR ++ YYV+LL V G RV + L+ NPTTG G I+DSGT
Sbjct: 299 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTS 358
Query: 318 FTRLVAPAYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNV 371
TRL P Y AVRD FR VG ++ FDTCY++ + PT+++ + G +V
Sbjct: 359 VTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASV 418
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
LP +N LI C AMA V ++I N+QQQ R+++D R+G +
Sbjct: 419 ALPPENYLIPVDTSGTFCFAMAGTDGGV----SIIGNIQQQGFRVVFDGDAQRVGFVPKS 474
Query: 432 C 432
C
Sbjct: 475 C 475
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 124/417 (29%), Positives = 195/417 (46%), Gaps = 44/417 (10%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS-------------SLAV 77
S+ L++ H PC+P + L S L+ L DQ R +++ LA
Sbjct: 64 SAVLRLTHRHGPCAPAGKASALGSPPSFLDTLRADQRRAEYIQRRVSGAAAAAPGMQLAG 123
Query: 78 ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--CSST- 134
++ + VP G I + Y+V +GTPA + +DT +D +WV C C C S
Sbjct: 124 SKAATVPANLGFSI-GTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQR 182
Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVP--NPTCGGGACAFNLTYGS-STIAANLSQDTISL 189
+F+ +S+++ + C AA C Q+ + C GG C + ++YG ST S DT++L
Sbjct: 183 DPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTL 242
Query: 190 -ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
++ + G+ FGC G GLLGLGR SL++Q + Y FSYCLP + +
Sbjct: 243 TGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQ--N 300
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G + LG TPLL + Y V L I VG + + I +
Sbjct: 301 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------S 354
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSV----PIVAPTI 362
G ++D+GTV TRL AY+A+R FR + + + G DTCY + PTI
Sbjct: 355 GAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTI 414
Query: 363 TLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
++ F G + T+G +T +A AP +S +++ N+QQ++ + +D
Sbjct: 415 SIAFGGGAA-------MDLGTSGILTSGCLAFAPTGGDSQASILGNVQQRSFEVRFD 464
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 165/363 (45%), Gaps = 29/363 (7%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
PI SG S Y R IG P + +DT +D WV C C C + +F A
Sbjct: 137 PIISGTS-QGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPAS 195
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
S +F L C QC+ + C C + ++YG S + +TI+L + V
Sbjct: 196 SASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAI 255
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSFKALSFSGSLR 254
GC G V GLLGLG GSLS +Q + ++FSYCL S L F+ +L
Sbjct: 256 GCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INATSFSYCLVDRDSESASTLEFNSTL- 311
Query: 255 LGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
P PLL+N + YYV L + VG +V IP A Q + + G I+DS
Sbjct: 312 ------PPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDS 365
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF-SGM 369
GT TRL Y ++RD F +R + + FDTCY + + PT++ F G
Sbjct: 366 GTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGK 425
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+ LP N L+ + C A A S L++I N+QQQ R++YD+ N +G
Sbjct: 426 ELPLPAKNYLVPLDSEGTFCFAFAPTA----SSLSIIGNVQQQGTRVVYDLVNHLVGFVP 481
Query: 430 ELC 432
C
Sbjct: 482 NKC 484
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 116/360 (32%), Positives = 176/360 (48%), Gaps = 21/360 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
P+ SG S Y R +G PA+ M +DT +D W+ PCT C + +F+
Sbjct: 149 PVTSGTS-QGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTA 207
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYT 198
S+T+ + CQ+ QC + +C G C + + YG + + + +++S + V
Sbjct: 208 SSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVA 267
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
GC G V GLLGLG G LSL T L ++FSYCL + + + S +L
Sbjct: 268 LGCGHDNEGLFVGAAGLLGLGGGPLSL---TNQLKATSFSYCLVN-RDSAGSSTLDFNSA 323
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
PL+KN + + YYV L + VG ++V IP + + + G I+D GT
Sbjct: 324 QLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAI 383
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----PIVAPTITLMFS-GMNVT 372
TRL AY +RD F R+ NL +TS + FDTCY + + PT++ F+ G +
Sbjct: 384 TRLQTQAYNPLRDAF-VRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWN 442
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP N LI + C A A S L++I N+QQQ R+ +D+ N+R+G + C
Sbjct: 443 LPAANYLIPVDSAGTYCFAFAP----TTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/380 (31%), Positives = 182/380 (47%), Gaps = 50/380 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG---------CSST-VFNSAQSTTFKN 146
Y+V GTP Q +L+ DT +D W+ C+ CS F +++S T
Sbjct: 54 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 113
Query: 147 LGCQAAQCKQVPNPTCGGGAC----------AFNLTYGSSTIAANLSQDTISLATDI--- 193
+ C AAQC VP P G +C A++ GSST L++DT +++
Sbjct: 114 VPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTG-FLARDTATISNGTSGG 172
Query: 194 --VPGYTFGCIQKATGNSVPPQG-LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
V G FGC + G S G ++GLG+G LS AQ+ +L+ TFSYCL +
Sbjct: 173 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 232
Query: 251 GSLRLGPIGQPKR---IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
S +G+P+R YTPL+ NP + YYV ++AIRVG RV+ +P +
Sbjct: 233 RSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGN 292
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRR-----RVGSNLTVTSLGGFDTCYSVPIVA--- 359
GT+IDSG+ T L AY + F R+ S+ T G + CY+V +
Sbjct: 293 GGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATF--FQGLELCYNVSSSSSLA 350
Query: 360 ------PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
P +T+ F+ G+++ LP N L+ A + CLA+ P NV+ N+ QQ
Sbjct: 351 PANGGFPRLTIDFAQGLSLELPTGNYLV-DVADDVKCLAI--RPTLSPFAFNVLGNLMQQ 407
Query: 413 NHRILYDVPNSRLGVARELC 432
+ + +D ++R+G AR C
Sbjct: 408 GYHVEFDRASARIGFARTEC 427
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 117/381 (30%), Positives = 184/381 (48%), Gaps = 31/381 (8%)
Query: 73 SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC- 131
S +A + ++ VP+ SG + Q+ YIV +G+ Q + + +DT +D WV C C C
Sbjct: 99 SQIADSSETQVPLTSGIKF-QTLNYIVTMGLGS--QNMSVIVDTGSDLTWVQCEPCRSCY 155
Query: 132 --SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----GACAFNLTYGS-STIAANLS 183
+ +F + S +++ + C + C+ + CG C + + YG S + L
Sbjct: 156 NQNGPLFKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELG 215
Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
+ + V + FGC + G GL+GLGR LS+++QT + FSYCLPS
Sbjct: 216 IEKLGFGGISVSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSYCLPS 275
Query: 244 FKALSFSGSLRLG-PIGQPKR---IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
SGSL +G G K I YT +L N + S+ Y +NL I VG + +
Sbjct: 276 TDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHV---- 331
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---- 355
Q + G I+DSGTV +RL Y A++ F + + DTC+++
Sbjct: 332 -QASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYD 390
Query: 356 PIVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
+ PTI++ F G +NV L+ A + CLA+A+ D + +I N QQ+
Sbjct: 391 QVNIPTISMYFEGNAELNVDATGIFYLVKEDASRV-CLALASLSDEYE--MGIIGNYQQR 447
Query: 413 NHRILYDVPNSRLGVARELCT 433
N R+LYD S++G A+E CT
Sbjct: 448 NQRVLYDAKLSQVGFAKEPCT 468
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 161/348 (46%), Gaps = 33/348 (9%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y VR +G+P ++ M +D+ +D WV PCT C S VF+ A S +F + C
Sbjct: 198 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCS 257
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
++ C ++ N C G C + ++YG S L+ +T++ +V GC + G
Sbjct: 258 SSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMF 317
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
V GLLGLG GS+S + Q FSYCL S + PL
Sbjct: 318 VGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS--------------------AAWVPL 357
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
++NPR S YY+ L + VG V I + G ++D+GT TRL AY A
Sbjct: 358 VRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAF 417
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHSTA 384
RD F + + T + FDTCY V + PT++ FSG + TLP N LI
Sbjct: 418 RDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDD 477
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A + S L+++ N+QQ+ +I +D N +G +C
Sbjct: 478 AGTFCFAFAPS----TSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 121/363 (33%), Positives = 177/363 (48%), Gaps = 24/363 (6%)
Query: 84 PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
P+ SG + Q S Y R IG+PA+ L M +DT +D WV C C C S VF+ +
Sbjct: 154 PVVSG--VGQGSGEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPS 211
Query: 140 QSTTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLSQDTISLATDI-VP 195
S ++ + C + +C+ + C GAC + + YG S + + +T++L V
Sbjct: 212 LSASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVG 271
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
GC G V GLL LG G LS +Q + STFSYCL + + S +L+
Sbjct: 272 NVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISASTFSYCLVDRDSPAAS-TLQF 327
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-TIIDS 314
G PL+++PR S+ YYV L I VG + + IP A + T+G+G I+DS
Sbjct: 328 GDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDS 387
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMN 370
GT TRL + AY A+RD F + S + + FDTCY + + P ++L F G
Sbjct: 388 GTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGG 447
Query: 371 -VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+ LP N LI CLA A N+ +++I N+QQQ R+ +D +G
Sbjct: 448 ALRLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRVSFDTARGAVGFTP 503
Query: 430 ELC 432
C
Sbjct: 504 NKC 506
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 141/476 (29%), Positives = 216/476 (45%), Gaps = 64/476 (13%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQD-----HSSTLQVFHVFSPCSPFKPSKPLSWEESVL 59
L +F+ F L+ L D + L ++HV S + P S+ +
Sbjct: 3 LFWFIVFSAHLVLASSLVEFQDNDNPRQKQEGMQLNLYHVKGLDSSQTSTSPFSFSD--- 59
Query: 60 EMLAKDQARLQFLSSLAVARKSV------------------VPIASGRQITQSPTYIVRA 101
M+ KD+ R++FL S ++SV P+ SG I S Y V+
Sbjct: 60 -MITKDEERVRFLHSRLTNKESVRNSATTDKLRGGPSLVSTTPLKSGLSI-GSGNYYVKI 117
Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCK-- 155
+GTPA+ M +DT + +W+ C CV C V F + S T+K L C ++QC
Sbjct: 118 GLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSSQCSSL 177
Query: 156 -----QVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVP--GYTFGCIQKATG 207
P + GAC + +YG ++ + LSQD ++L P G+ +GC Q G
Sbjct: 178 KSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPSSGFVYGCGQDNQG 237
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP----SFKALSFSGSLRLGPIGQPKR 263
G++GL +S+L Q Y + FSYCLP + + S SG L +G
Sbjct: 238 LFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLSIGASSLTSS 297
Query: 264 -IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
K+TPL+KN + SLY+++L I V + + + A +N TIIDSGTV TRL
Sbjct: 298 PYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGV--SASSYN----VPTIIDSGTVITRLP 351
Query: 323 APAYTAVRDVFRRRVGSNLT-VTSLGGFDTCYSVPI----VAPTITLMF-SGMNVTLPQD 376
Y A++ F + DTC+ + P I ++F G + L
Sbjct: 352 VAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLELKAH 411
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N L+ G+ TCLA+AA+ + + ++I N QQQ ++ YDV N ++G A C
Sbjct: 412 NSLVEIEKGT-TCLAIAASSNPI----SIIGNYQQQTFKVAYDVANFKIGFAPGGC 462
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 128/435 (29%), Positives = 205/435 (47%), Gaps = 56/435 (12%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV----VPIASG 88
++ + H + PC+P + S + S+ E L + +AR ++ S A + P
Sbjct: 56 SMSLVHRYGPCAPSQYSNVPT--PSISETLRRSRARTNYIMSQASKSMGMGMASTPDDDD 113
Query: 89 RQIT---------QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGC---SST 134
+T S Y+V GTP+ ++ MDT +D +WV CT C C
Sbjct: 114 AAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDP 173
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNP-----TCGGGACAFNLTYGSSTIAANL-SQDTIS 188
+F+ ++S+T+ + C C+++ + T GG C +++ Y + + + S +T++
Sbjct: 174 LFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLT 233
Query: 189 LATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
LA I V + FGC + G S GLLGLG +SL+ QT ++Y FSYCLP+
Sbjct: 234 LAPGITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALN-- 291
Query: 248 SFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
S +G L LG P G +TP+ P ++ Y V + I VG + + IP A +
Sbjct: 292 SEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFR---- 347
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
G IIDSGTV T L AY A+ R+ + + V S FDTCY+ I P
Sbjct: 348 --GGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPS-DDFDTCYNFTGYSNITVPR 404
Query: 362 ITLMFSG---MNVTLPQDNLLIHSTAGSITCLAM-AAAPDNVNSVLNVIANMQQQNHRIL 417
+ FSG +++ +P + +L++ CLA + PD+ L +I N+ Q+ +L
Sbjct: 405 VAFTFSGGATIDLDVP-NGILVND------CLAFQESGPDD---GLGIIGNVNQRTLEVL 454
Query: 418 YDVPNSRLGVARELC 432
YD +G C
Sbjct: 455 YDAGRGNVGFRAGAC 469
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 136/416 (32%), Positives = 190/416 (45%), Gaps = 59/416 (14%)
Query: 58 VLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT---YIVRAKIGTPAQTLLMAM 114
V + L +D R LA + + +++ QI SPT Y++ IGTP +
Sbjct: 47 VRDALRRDMHR-HNARQLAASSSNGTTVSAPTQI--SPTAGEYLMTLAIGTPPVSYQAIA 103
Query: 115 DTSNDAAWVPCTGCVGCSSTVF-------NSAQSTTFKNLGCQ-------AAQCKQVPNP 160
DT +D W T C CSS F N + STTF L C AA P P
Sbjct: 104 DTGSDLIW---TQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTTPPP 160
Query: 161 TCGGGACAFNLTYGSSTIAANLSQDTISLATDI------VPGYTFGCIQKATG-NSVPPQ 213
C C +N+TYGS + +T + + VPG FGC + G N+
Sbjct: 161 GC---TCMYNMTYGSGWTSVYQGSETFTFGSSTPANQTGVPGIAFGCSNASGGFNTSSAS 217
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLL 270
GL+GLGRGSLSL++Q L FSYCL ++ + + +L LGP + + TP +
Sbjct: 218 GLVGLGRGSLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLLLGPSASLNDTGGVSSTPFV 274
Query: 271 KNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
+P S+ YY+NL I +G + IP AL G IIDSGT T L AY
Sbjct: 275 ASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGGFIIDSGTTITLLGNTAYQ 334
Query: 328 AVRDVFRRRVGSNLTVTSLG----GFDTCY------SVPIVAPTITLMFSGMNVTLPQDN 377
VR V L T G G D C+ S P P++TL F G ++ LP D+
Sbjct: 335 QVRAAVVSLV--TLPTTDGGSAATGLDLCFELPSSTSAPPTMPSMTLHFDGADMVLPADS 392
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
++ + ++ CLAM + + ++++ N QQQN ILYDV L A C+
Sbjct: 393 YMMLDS--NLWCLAMQ---NQTDGGVSILGNYQQQNMHILYDVGQETLTFAPAKCS 443
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/339 (30%), Positives = 164/339 (48%), Gaps = 31/339 (9%)
Query: 110 LLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPN--PTCGG 164
+ + +DT +D W+ C C C ++F A S T+K L C + C+Q+ + +C
Sbjct: 1 MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLN 60
Query: 165 GACAFNLTYGS-STIAANLSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQGLLGL 218
+C + ++YG ST + + +T++L +D VP + FGC G GL+GL
Sbjct: 61 SSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAAGLMGL 120
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRIKYTPLLKNPRRSS 277
G+ S+ AQT + FSYCLPS + SG L G +++TPL+ + S
Sbjct: 121 GKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPS 180
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
Y+V++ I VG ++ I A ++DSGTV +R AY +RD F + +
Sbjct: 181 QYFVSMTGINVGDELLPI-----------SATVMVDSGTVISRFEQSAYERLRDAFTQIL 229
Query: 338 GSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
T S+ FDTC+ V V P ITL F +++ + C A A
Sbjct: 230 PGLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMCFAFA 289
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ +S +V+ N QQQN R +YD+P SRLG++ C
Sbjct: 290 PS----SSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 134/472 (28%), Positives = 220/472 (46%), Gaps = 62/472 (13%)
Query: 11 FLFLFSLSEG-----------------LNPICDTQ--DHSS------TLQVFHVFSPCSP 45
FL LFSL +G +N + T +HSS +L+V H PC
Sbjct: 2 FLLLFSLEKGYAVEENEATKSYLHIIKVNSLLPTTACNHSSKVSNSLSLEVVHRHGPCIG 61
Query: 46 FKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR-------KSVVPIASGRQITQSPTYI 98
+ + S +E+ +DQ R+ + + +R + +P+ SG I + Y+
Sbjct: 62 IVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPEKQATTLPVQSGASI-GAGDYV 120
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQC 154
V +GTP + + DT +D W C CV N + ST++KN+ C +A C
Sbjct: 121 VTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALC 180
Query: 155 KQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATG 207
K V + +C C + + YG + + + +T++L++ ++ + FGC Q+ G
Sbjct: 181 KLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNG 240
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRIKY 266
GLLGLGR L+L +QT Y+ FSYCLP+ + S G L LG GQ K +K+
Sbjct: 241 LFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPA--SSSSKGYLSLG--GQVSKSVKF 296
Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
TPL + + Y +++ + VG R + I A AGT+IDSGTV TRL AY
Sbjct: 297 TPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS------AGTVIDSGTVITRLSPTAY 350
Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIH 381
+ + F+ + + + FDTCY + P + + F G+ + + +L
Sbjct: 351 SELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYP 410
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
CLA A D+ ++ ++ N+QQ+ ++++YD R+G A C+
Sbjct: 411 VNGLKKVCLAFAGNDDDSDT--SIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 186/383 (48%), Gaps = 46/383 (12%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSA 139
VP+ SG ++ ++ Y+ +G T+++ DT+++ WV C C C +F+ +
Sbjct: 140 VPVTSGAKL-RTLNYVATVGLGGGEATVIV--DTASELTWVQCAPCESCHDQQDPLFDPS 196
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCG--GGA------------CAFNLTYGSSTIAAN-LSQ 184
S ++ + C ++ C + T G GGA C++ L+Y + + L+
Sbjct: 197 SSPSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAH 256
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
D +SLA +++ G+ FGC T N PP GL+GLGR LSL++QT + + FSYC
Sbjct: 257 DRLSLAGEVIDGFVFGC---GTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYC 313
Query: 241 LPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
LP K SGSL +G R I Y ++ +P + Y+VNL I VG + V+
Sbjct: 314 LP-LKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESS 372
Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV- 355
+ IIDSGTV T LV Y AV+ F + DTC+++
Sbjct: 373 GFSSGGGGGK---AIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMT 429
Query: 356 ---PIVAPTITLMFSGMNVTLPQDN---LLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
+ P++ L+F G V + D+ L S+ S CLAM AP N+I N
Sbjct: 430 GLREVQVPSLKLVFDG-GVEVEVDSGGVLYFVSSDSSQVCLAM--APLKSEYETNIIGNY 486
Query: 410 QQQNHRILYDVPNSRLGVARELC 432
QQ+N R+++D S++G A+E C
Sbjct: 487 QQKNLRVIFDTSGSQVGFAQETC 509
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 131/446 (29%), Positives = 203/446 (45%), Gaps = 58/446 (13%)
Query: 15 FSLSEGLNPIC------DTQDHSSTLQVFHVFSPCSPFKPS----KPLSWEESVLEMLAK 64
F L E L P C + D +S++ + H + PCSP P+ +P E + L
Sbjct: 11 FGLCEEL-PACGAATIPSSSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRA 69
Query: 65 DQARLQFLSSLAVA-------RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTS 117
D R +F S A K VP G + + Y++ +G+PA T + +DT
Sbjct: 70 DYIRRKFSGSNGTAAGEDGQSSKVSVPTTLGSSL-DTLEYVISVGLGSPAVTQRVVIDTG 128
Query: 118 NDAAWVPC------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----- 166
+D +WV C + C + +F+ A S+T+ C AA C Q+ + G
Sbjct: 129 SDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSR 188
Query: 167 CAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGS 222
C + + YG S S D ++L+ +D+V G+ FGC G + + GL+GLG +
Sbjct: 189 CQYIVKYGDGSNTTGTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDA 248
Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK-----RIKYTPLLKNPRRSS 277
S ++QT Y +F YCLP+ A S G L LG R TP+L++ + +
Sbjct: 249 QSPVSQTAARYGKSFFYCLPATPASS--GFLTLGAPASGGGGGASRFATTPMLRSKKVPT 306
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
Y+ L I VG + + + P AG+++DSGTV TRL AY A+ FR +
Sbjct: 307 YYFAALEDIAVGGKKLGLSPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGM 360
Query: 338 GSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
LG DTC++ + PT+ L+F+G V +L H G ++ +A
Sbjct: 361 TRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVV----DLDAH---GIVSGGCLA 413
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYD 419
AP + I N+QQ+ +LYD
Sbjct: 414 FAPTRDDKAFGTIGNVQQRTFEVLYD 439
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 144/452 (31%), Positives = 205/452 (45%), Gaps = 67/452 (14%)
Query: 24 ICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL----------- 72
+ + + HSST V P P + E + +LA D AR L
Sbjct: 107 VLELKHHSSTATV-----------PDHPAARERYLKHLLAADSARAASLQLRKPKPASST 155
Query: 73 -SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTP-AQTLLMAMDTSNDAAWVPCTGCVG 130
++ A A + VP+ SG + Q+ Y+ +G A+ L + +DT +D WV C C G
Sbjct: 156 TTTQASAAAAEVPLGSGIRY-QTLNYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPG 214
Query: 131 CS-----STVFNSAQSTTFKNLGCQAAQCK-QVPNPTCGGGACA-----------FNLTY 173
S +F+ A S TF + C + C + + T G+CA + L+Y
Sbjct: 215 SSCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSY 274
Query: 174 GSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
G + + L+QDT+ L T + G+ FGC G GL+GLGR LSL++QT
Sbjct: 275 GDGSFSRGVLAQDTLGLGTTTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAA 334
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
+ FSYCLP+ + S SL GP + YT ++ +P + Y++N+ VG
Sbjct: 335 RFGGVFSYCLPATTTSTGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGG 394
Query: 292 VVDIPPGALQFNPTTGAGTI-IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF- 349
PG GAG + +DSGTV TRL Y AVR F RR + GF
Sbjct: 395 AALTAPG-------FGAGNVLVDSGTVITRLAPSVYKAVRAEFARR----FEYPAAPGFS 443
Query: 350 --DTCYSV----PIVAPTITLMFS-GMNVTLPQDNLL-IHSTAGSITCLAMAAAPDNVNS 401
D CY + + P +TL G VT+ +L + GS CLAMA+ P
Sbjct: 444 ILDACYDLTGRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLP--YED 501
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+I N QQ+N R++YD SRLG A E CT
Sbjct: 502 QTPIIGNYQQRNKRVVYDTVGSRLGFADEDCT 533
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 116/360 (32%), Positives = 174/360 (48%), Gaps = 21/360 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
P+ SG S Y R +G PA+ M +DT +D W+ PCT C + +F+
Sbjct: 8 PVTSGTS-QGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTA 66
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYT 198
S+T+ + CQ+ QC + +C G C + + YG + + + +++S + V
Sbjct: 67 SSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVA 126
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
GC G V GLLGLG G LSL T L ++FSYCL + + S +L
Sbjct: 127 LGCGHDNEGLFVGAAGLLGLGGGPLSL---TNQLKATSFSYCLVNRDSAG-SSTLDFNSA 182
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
PL+KN + + YYV L + VG ++V IP + + + G I+D GT
Sbjct: 183 QLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAI 242
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----PIVAPTITLMFS-GMNVT 372
TRL AY +RD F R NL +TS + FDTCY + + PT++ F+ G +
Sbjct: 243 TRLQTQAYNPLRDAFVRMT-QNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWN 301
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP N LI + C A A S L++I N+QQQ R+ +D+ N+R+G + C
Sbjct: 302 LPAANYLIPVDSAGTYCFAFAP----TTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 124/416 (29%), Positives = 198/416 (47%), Gaps = 47/416 (11%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSS----------LAVARKSVVPIASGRQITQSPTYIV 99
K L W + + + L D +L+ L S + + + +P+ SG ++ QS YIV
Sbjct: 10 KILDWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNIDDSVDTQIPLTSGIRL-QSLNYIV 68
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ 156
++G T+++ DT +D +WV PC C VFN ++S +++ + C + C+
Sbjct: 69 TVELGGRKMTVIV--DTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRS 126
Query: 157 VPNPTCGGGACAFN-------LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
+ T G C N + YG S + + + ++L V + FGC +K G
Sbjct: 127 LQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTTVNNFIFGCGRKNQGL 186
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----I 264
GL+GLGR LSL++Q ++ FSYCLP+ +A + SGSL +G + I
Sbjct: 187 FGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEA-SGSLVMGGNSSVYKNTTPI 245
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
YT ++ NP Y++NL I VG V P F IIDSGTV +RL
Sbjct: 246 SYTRMIHNPLL-PFYFLNLTGITVGGVEVQAP----SFGKDR---MIIDSGTVISRLPPS 297
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSG---MNVTLPQDN 377
Y A++ F ++ + S D+C+++ + P I + F G +NV +
Sbjct: 298 IYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELNVDVTGVF 357
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ + A + CLA+A+ P + +I N QQ+N RI+YD S LG A E C+
Sbjct: 358 YSVKTDASQV-CLAIASLP--YEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 136/411 (33%), Positives = 202/411 (49%), Gaps = 39/411 (9%)
Query: 55 EESVLEMLAKDQARLQFLSS---LAVARKSVV-------PIASGRQITQSPTYIVRAKIG 104
E+ +LE L +D+ R++++ S LA +K P+ SG + S Y VR +G
Sbjct: 78 EQLLLETLQRDEQRVRWIESKAQLAGKKKDEASSTDLNGPVTSGL-LYGSGEYFVRLGVG 136
Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
TPA++L M +DT +D W+ C C C + +F+ S++F+ + C + CK + +
Sbjct: 137 TPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKALEIHS 196
Query: 162 CGG--GA---CAFNLTYGSSTIA-ANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQG 214
C G GA C++ + YG + + + S D +L T FGC G G
Sbjct: 197 CSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAG 256
Query: 215 LLGLGRGSLSLLAQ-----TQNLYQSTFSYCL--PSFKALSFSGSLRLGPIGQPKRIKYT 267
LLGLG G LS +Q T + ++FSYCL S S SL G P +
Sbjct: 257 LLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGAAAIPSTAALS 316
Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
PLLKNP+ + YY ++ + VG + I +LQ + + G IIDSGT TR Y
Sbjct: 317 PLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYA 376
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMF-SGMNVTLPQDNLLIH- 381
+RD FR + + FDTCY+ + P + L F +G ++ LP N LI
Sbjct: 377 TIRDAFRNATTNLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPI 436
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+TAGS CLA AP ++ L +I N+QQQ+ RI +D+ S L A + C
Sbjct: 437 NTAGSF-CLAF--APTSME--LGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 482
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 116/377 (30%), Positives = 173/377 (45%), Gaps = 55/377 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y+V +GTP + + + +DT +D W C C C + + A S+T+ L C A +
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCGAPR 151
Query: 154 CKQVPNPTCGGG----------ACAFNLTYGSSTI-AANLSQDTISLATDIVPG------ 196
C+ +P +CGGG +CA+ YG ++ ++ D + D G
Sbjct: 152 CRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRLPT 211
Query: 197 --YTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGS 252
TFGC G + G+ G GRG SL +Q L +TFSYC S F++ S +
Sbjct: 212 RRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQ---LNVTTFSYCFTSMFESKSSLVT 268
Query: 253 LRLGPIGQ---------PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
L P ++ TPLLKNP + SLY+++L I VG+ + +P L+
Sbjct: 269 LGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAKLR-- 326
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCYSVPIVA--- 359
TIIDSG T L Y AV+ F +VG T V D C+++P+ A
Sbjct: 327 -----STIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPVTALWR 381
Query: 360 ----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P++TL G + LP+ N + A + C+ + AAP + VI N QQQN
Sbjct: 382 RPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGD----QTVIGNFQQQNTH 437
Query: 416 ILYDVPNSRLGVARELC 432
++YD+ N L A C
Sbjct: 438 VVYDLENDWLSFAPARC 454
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 134/488 (27%), Positives = 218/488 (44%), Gaps = 82/488 (16%)
Query: 3 PQLVFFLAFLFLFSLSEGLN---------------PIC-------DTQDHSSTLQVFHVF 40
P LV + + +SL+ G N P+C D ++ ++ + H
Sbjct: 5 PLLVCIILCTYEYSLAHGGNEHGFVAVPTTASEPEPVCSTSGVTLDPGSNTVSVPLVHRH 64
Query: 41 SPCSPFKPS--KPLSWEESVLEMLAKDQARLQFLSS------LAVARKSVVPIASGRQIT 92
PC+P + S KP S+ + L +++AR +++ S + +P G +
Sbjct: 65 GPCAPTQLSSDKPSSFTD----RLRRNRARSKYIMSRVSKGMMGDDADVSIPTHLGGSV- 119
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFKNL 147
S Y+V +GTP+ + ++ +DT +D +WV C T C +F+ ++S+T+ +
Sbjct: 120 DSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPI 179
Query: 148 GCQAAQCKQVPNPTCGGGA--------CAFNLTYGSSTIAANL-SQDTISLATDI-VPGY 197
C C+ + + GGG C F +TYG + + S +T++LA + V +
Sbjct: 180 PCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKDF 239
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-----ALSFSGS 252
FGC G + GLLGLG SL+ QT ++Y FSYCLP+ G
Sbjct: 240 RFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLALGGGG 299
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
G + +TP+++ + Y VN+ I VG +D+PP A G II
Sbjct: 300 APSGGVVNTSGFVFTPMIR--EEETFYVVNMTGITVGGEPIDVPPSAFS------GGMII 351
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSG 368
DSGTV T L AY A++ FR+ + + V + G DTCY + P + L FSG
Sbjct: 352 DSGTVVTELQHTAYNALQAAFRKAMAAYPLVRN-GELDTCYDFSGYSNVTLPKVALTFSG 410
Query: 369 ---MNVTLPQDNLLIHSTAGSITCLAM-AAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
+++ +P LL CLA + PD+ +L N+ Q+ +LYD R
Sbjct: 411 GATIDLDVPNGILLDD-------CLAFQESGPDDQPGIL---GNVNQRTLEVLYDAGRGR 460
Query: 425 LGVARELC 432
+G +C
Sbjct: 461 VGFRAAVC 468
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 122/425 (28%), Positives = 204/425 (48%), Gaps = 37/425 (8%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR-------KSVVPI 85
+L+V H PC + + S +E+ +DQ R+ + + +R + +P+
Sbjct: 1 SLEVVHRHGPCIGIVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPEKQATTLPV 60
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNSAQS 141
SG I + Y+V +GTP + + DT +D W C CV N + S
Sbjct: 61 QSGASIG-AGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTS 119
Query: 142 TTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIV 194
T++KN+ C +A CK V + +C C + + YG + + + +T++L++ ++
Sbjct: 120 TSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSNVF 179
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLR 254
+ FGC Q+ G GLLGLGR L+L +QT Y+ FSYCLP+ + S G L
Sbjct: 180 KNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPA--SSSSKGYLS 237
Query: 255 LGPIGQ-PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
LG GQ K +K+TPL + + Y +++ + VG R + I A AGT+ID
Sbjct: 238 LG--GQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFS------AGTVID 289
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-G 368
SGTV TRL AY+ + F+ + + + FDTCY + P + + F G
Sbjct: 290 SGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGG 349
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ + + +L CLA A D+ ++ ++ N+QQ+ ++++YD R+G A
Sbjct: 350 VEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDT--SIFGNVQQRTYQVVYDGAKGRVGFA 407
Query: 429 RELCT 433
C+
Sbjct: 408 PGGCS 412
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 123/428 (28%), Positives = 206/428 (48%), Gaps = 37/428 (8%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR-------KSV 82
+S +L+V H PC + + S +E+ +DQ R+ + + +R +
Sbjct: 58 NSLSLEVVHRHGPCIGIVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPEKQATT 117
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNS 138
+P+ SG I + Y+V +GTP + + DT +D W C CV N
Sbjct: 118 LPVQSGASI-GAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNP 176
Query: 139 AQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTISLAT- 191
+ ST++KN+ C +A CK V + +C C + + YG + + + +T++L++
Sbjct: 177 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 236
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
++ + FGC Q+ G GLLGLGR L+L +QT Y+ FSYCLP+ + S G
Sbjct: 237 NVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPA--SSSSKG 294
Query: 252 SLRLGPIGQ-PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
L LG GQ K +K+TPL + + Y +++ + VG R + I A AGT
Sbjct: 295 YLSLG--GQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS------AGT 346
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF 366
+IDSGTV TRL AY+ + F+ + + + FDTCY + P + + F
Sbjct: 347 VIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 406
Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G+ + + +L CLA A D+ ++ ++ N+QQ+ ++++YD R+
Sbjct: 407 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDT--SIFGNVQQRTYQVVYDGAKGRV 464
Query: 426 GVARELCT 433
G A C+
Sbjct: 465 GFAPGGCS 472
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 128/396 (32%), Positives = 189/396 (47%), Gaps = 39/396 (9%)
Query: 60 EMLAKDQARLQFL---------SSLAVAR-KSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
E L +DQ R ++ + V R + VP A G + + Y++ +G+PA +
Sbjct: 6 ETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDATVPTALGTSL-NTLEYLITVGLGSPATS 64
Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG--- 163
M +DT +D +WV C C C S +F+ + S+T+ C +A C Q+ G
Sbjct: 65 QTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGCSS 124
Query: 164 GGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
C + +TYG S+ S DT++L + V + FGC +G + GL+GLG G+
Sbjct: 125 SSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGA 184
Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY--TPLLKNPRRSSLYY 280
SL++QT FSYCLP S SG L LG G + TP+L++ + + Y
Sbjct: 185 QSLVSQTAGTLGRAFSYCLP--PTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYG 242
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
V L AIRVG R + IP AGT++DSGTV TRL AY+A+ F+ +
Sbjct: 243 VRLQAIRVGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQY 296
Query: 341 LTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAP 396
G DTC+ + P++ L+FSG V + +I S CLA A
Sbjct: 297 PPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIILS-----NCLAFAGNS 351
Query: 397 DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D +S L +I N+QQ+ +LYDV +G C
Sbjct: 352 D--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 127/392 (32%), Positives = 184/392 (46%), Gaps = 31/392 (7%)
Query: 61 MLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
M + +AR + LSS A A S G +T+ Y++ IGTP Q + + +DT +
Sbjct: 1 MALRSKARAPRLLSSSATAPVSPGAYDDGVPMTE---YLLHLAIGTPPQPVQLTLDTGSV 57
Query: 120 AAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT-CGGGA---CAFNLT 172
W C C C S +++++S+TF C + QCK P+ T C CA++ +
Sbjct: 58 LVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSYS 117
Query: 173 YGS-STIAANLSQDTIS-LATDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQT 229
YG S L +T+S +A VPG FGC TG + G+ G GRG LSL +Q
Sbjct: 118 YGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ- 176
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLA 285
L FS+C + S L P K ++ TPL+KNP + YY++L
Sbjct: 177 --LKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKG 234
Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
I VG + +P A TG GTIIDSGT FT L Y V D F V + ++
Sbjct: 235 ITVGSTRLPVPESAFALKNGTG-GTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSN 293
Query: 346 LGGFDTCYSVPIVA-----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
G C+S P + P + L F G + LP++N + + G + +A +
Sbjct: 294 ETGPLLCFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFEAKDGGNCSICLAI----IE 349
Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ +I N QQQN +LYD+ NS+L R C
Sbjct: 350 GEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 123/418 (29%), Positives = 193/418 (46%), Gaps = 44/418 (10%)
Query: 44 SPFKPSKPLSWEESVLEMLAKDQARLQFLSSL------AVARKSVVPIASGRQITQ---- 93
+P K K L VL L +D +R+Q +++ V++ + P+ + Q
Sbjct: 93 TPHKDYKAL-----VLSRLHRDSSRVQAITTRLQLILNGVSKSDLKPLQTEIQPQDLSTP 147
Query: 94 --------SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQST 142
S Y R +G PA++ M +DT +D W+ PC+ C S +F A S+
Sbjct: 148 VSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASS 207
Query: 143 TFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFG 200
++ L C + QC + +C G C + + YG + + +T+S + V G
Sbjct: 208 SYSPLTCDSQQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIALG 267
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-ALSFSGSLRLGPIG 259
C G V GLLGLG G LSL +Q L ++FSYCL + A S + P+G
Sbjct: 268 CGHDNEGLFVGAAGLLGLGGGPLSLTSQ---LKATSFSYCLVNRDSAASSTLDFNSAPVG 324
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
PLLK+ + + YYV L + VG ++ IP + + + G I+D GT T
Sbjct: 325 DSV---IAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAIT 381
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLP 374
RL + AY ++RD F + + + FDTCY + + PT++ F G + LP
Sbjct: 382 RLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSWDLP 441
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N LI + C A A S L++I N+QQQ R+ +D+ N+R+G + C
Sbjct: 442 AANYLIPVDSAGTYCFAFAP----TTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 131/439 (29%), Positives = 208/439 (47%), Gaps = 53/439 (12%)
Query: 28 QDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD-------QARLQFLSSLAVARK 80
++ ++ L++ H S CS K L W + + + L D Q+R++ + S
Sbjct: 62 ENGATILEMKHKDS-CS----GKILDWNKKLKKHLIMDDFQLRSLQSRMKSIISGRNIDD 116
Query: 81 SV---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST 134
SV +P+ SG ++ Q+ YIV ++G T+++ DT +D +WV PC C
Sbjct: 117 SVDAPIPLTSGIRL-QTLNYIVTVELGGRKMTVIV--DTGSDLSWVQCQPCKRCYNQQDP 173
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN-------LTYGS-STIAANLSQDT 186
VFN + S +++ + C + C+ + + T G C N + YG S L +
Sbjct: 174 VFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEH 233
Query: 187 ISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK 245
+ L V + FGC + G GL+GLGR SLSL++QT ++ FSYCLP
Sbjct: 234 LDLGNSTAVNNFIFGCGRNNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFSYCLP-IT 292
Query: 246 ALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
SGSL +G + I YT ++ NP+ Y++NL I VG V P
Sbjct: 293 ETEASGSLVMGGNSSVYKNTTPISYTRMIPNPQL-PFYFLNLTGITVGSVAVQAPSFGKD 351
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PI 357
G +IDSGTV TRL Y A++D F ++ + + DTC+++ +
Sbjct: 352 -------GMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEV 404
Query: 358 VAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNH 414
P I + F G +NV + + + A + CLA+A+ + + + +I N QQ+N
Sbjct: 405 EIPNIKMHFEGNAELNVDVTGVFYFVKTDASQV-CLAIASL--SYENEVGIIGNYQQKNQ 461
Query: 415 RILYDVPNSRLGVARELCT 433
R++YD S LG A E CT
Sbjct: 462 RVIYDTKGSMLGFAAEACT 480
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/374 (31%), Positives = 170/374 (45%), Gaps = 48/374 (12%)
Query: 84 PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSA 139
P+ SG ++Q S Y R +GTPA+ + + +DT +D W+ PC+ C S VFN
Sbjct: 150 PVVSG--VSQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPT 207
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTF 199
S+T+K+L C A QC + C C + ++YG + LATD V TF
Sbjct: 208 SSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVG------ELATDTV---TF 258
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQ--------------TQNLYQSTFSYCLPSFK 245
G K N V LG G + L T + ++FSYCL +
Sbjct: 259 GNSGKI--NDVA----LGCGHDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCLVD-R 311
Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
S SL + PLL+N + + YYV L VG + V +P + +
Sbjct: 312 DSGKSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDAS 371
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT--VTSLGGFDTCYSV----PIVA 359
G I+D GT TRL AY ++RD F ++ +NL +S+ FDTCY +
Sbjct: 372 GSGGVILDCGTAVTRLQTQAYNSLRDAF-LKLTTNLKKGTSSISLFDTCYDFSSLSSVKV 430
Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
PT+ F+ G ++ LP N LI C A A +S L++I N+QQQ RI Y
Sbjct: 431 PTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAP----TSSSLSIIGNVQQQGTRITY 486
Query: 419 DVPNSRLGVARELC 432
D+ N +G++ C
Sbjct: 487 DLANKIIGLSGNKC 500
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 128/424 (30%), Positives = 192/424 (45%), Gaps = 39/424 (9%)
Query: 38 HVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL------AVARKSVVPIASGRQI 91
H+ S S KPS ++ L LA+D AR++ L + V+ + P S +
Sbjct: 71 HLRSRASIQKPSH-RDYKSLTLSRLARDSARVKSLQTRLDLVLKRVSNSDLHPAESNAEF 129
Query: 92 T----QSP----------TYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST 134
Q P Y +R IG P + +DT +D +W+ PC+ C S
Sbjct: 130 EANALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDP 189
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDI 193
+F+ S ++ + C A QCK + C G C + ++YG S + +T++L T
Sbjct: 190 IFDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGTAA 249
Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
V GC G V GLLGLG G LS AQ ++FSYCL + + + S
Sbjct: 250 VENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVN---ATSFSYCLVNRDSDAVSTLE 306
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
P+ P+ + PL +NP + YY+ L I VG + IP + + G G IID
Sbjct: 307 FNSPL--PRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIID 364
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF-SG 368
SGT TRL + Y A+RD F + + FDTCY + + PT++ F G
Sbjct: 365 SGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPEG 424
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ LP N LI + C A A S L+++ N+QQQ R+ +D+ NS +G +
Sbjct: 425 RELPLPARNYLIPVDSVGTFCFAFAP----TTSSLSIMGNVQQQGTRVGFDIANSLVGFS 480
Query: 429 RELC 432
+ C
Sbjct: 481 ADSC 484
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/362 (32%), Positives = 176/362 (48%), Gaps = 25/362 (6%)
Query: 84 PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
P+ SG + Q S Y R +G PA+ L M +DT +D W+ C C C S V++ +
Sbjct: 151 PVVSG--VGQGSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPS 208
Query: 140 QSTTFKNLGCQAAQCKQVPNPTC--GGGACAFNLTYGS-STIAANLSQDTISLATDI-VP 195
ST++ +GC + +C+ + C G+C + + YG S + + +T++L V
Sbjct: 209 VSTSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDSAPVS 268
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
GC G V GLL LG G LS +Q + +TFSYCL + S S +L+
Sbjct: 269 NVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISATTFSYCLVDRDSPS-SSTLQF 324
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
G QP PL+++PR ++ YYV L I VG + IP A + G I+DSG
Sbjct: 325 GDSEQPAVT--APLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSG 382
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMN 370
T TRL + AY A+R+ F + S + + FDTCY + + P + L F G
Sbjct: 383 TAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWFEGGGE 442
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+ LP N LI A CLA A + +++I N+QQQ R+ +D + +G +
Sbjct: 443 LKLPAKNYLIPVDAAGTYCLAFA----GTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTAD 498
Query: 431 LC 432
C
Sbjct: 499 KC 500
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 122/416 (29%), Positives = 198/416 (47%), Gaps = 53/416 (12%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLA------------VARKSVVPIASGRQITQSPTYIVR 100
S EE + + + D AR+ L A A VP+ SG ++ ++ Y+
Sbjct: 71 SREEELGGLFSSDAARVSSLQRRAGGGSWAEDEAAAAAATGRVPVTSGARL-RTLNYVAT 129
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCK-- 155
+G T+++ DT+++ WV C C C +F+ A S ++ L C ++ C
Sbjct: 130 VGLGGGEATVIV--DTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDAL 187
Query: 156 QVPNPTCGGG-------ACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATG 207
QV + G +C++ L+Y + + L+ D +SLA +++ G+ FGC G
Sbjct: 188 QVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGTSNQG 247
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR---- 263
GL+GLGR LSL++QT + + FSYCLP K SGSL LG R
Sbjct: 248 PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLP-LKESESSGSLVLGDDTSVYRNSTP 306
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
I YT ++ +P + Y+VNL I +G + V+ G + I+DSGT+ T LV
Sbjct: 307 IVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKV----------IVDSGTIITSLVP 356
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDN-- 377
Y AV+ F + DTC+++ + P++ +F G NV + D+
Sbjct: 357 SVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEG-NVEVEVDSSG 415
Query: 378 -LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L S+ S CLA+A+ + ++I N QQ+N R+++D S++G A+E C
Sbjct: 416 VLYFVSSDSSQVCLALASLKSEYET--SIIGNYQQKNLRVIFDTLGSQIGFAQETC 469
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 122/416 (29%), Positives = 198/416 (47%), Gaps = 53/416 (12%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLA------------VARKSVVPIASGRQITQSPTYIVR 100
S EE + + + D AR+ L A A VP+ SG ++ ++ Y+
Sbjct: 72 SREEELGGLFSSDAARVSSLQRRAGGGSWAEDEAAAAAATGRVPVTSGARL-RTLNYVAT 130
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCK-- 155
+G T+++ DT+++ WV C C C +F+ A S ++ L C ++ C
Sbjct: 131 VGLGGGEATVIV--DTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDAL 188
Query: 156 QVPNPTCGGG-------ACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATG 207
QV + G +C++ L+Y + + L+ D +SLA +++ G+ FGC G
Sbjct: 189 QVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGTSNQG 248
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR---- 263
GL+GLGR LSL++QT + + FSYCLP K SGSL LG R
Sbjct: 249 PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLP-LKESESSGSLVLGDDTSVYRNSTP 307
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
I YT ++ +P + Y+VNL I +G + V+ G + I+DSGT+ T LV
Sbjct: 308 IVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKV----------IVDSGTIITSLVP 357
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDN-- 377
Y AV+ F + DTC+++ + P++ +F G NV + D+
Sbjct: 358 SVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEG-NVEVEVDSSG 416
Query: 378 -LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L S+ S CLA+A+ + ++I N QQ+N R+++D S++G A+E C
Sbjct: 417 VLYFVSSDSSQVCLALASLKSEYET--SIIGNYQQKNLRVIFDTLGSQIGFAQETC 470
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 133/419 (31%), Positives = 204/419 (48%), Gaps = 44/419 (10%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI----TQSPTYIVRAKIGTPAQTLL 111
E V E L++ Q+++Q + + + P + R + + ++ IG+ + L
Sbjct: 55 EQVRESLSRIQSQVQDNQNNHLDLRGNRPTSGVRSVVTPLEDYALFSMQLGIGSLQKNLS 114
Query: 112 MAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----- 166
+DT ++A V C S VF+ A S +++ + C + C V T G +
Sbjct: 115 AIIDTGSEAVLVQCGSR---SRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVN 171
Query: 167 ----CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYT-------FGCIQKATGNSVP--P 212
C ++L+YG S + + SQD I L + G FGC G V
Sbjct: 172 SSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGS 231
Query: 213 QGLLGLGRGSLSLLAQTQN-LYQSTFSYCLPSFKAL-SFSGSLRLGPIGQPK-RIKYTPL 269
G++G RG+LSL +Q ++ L S FSYC PS +G + LG G K ++ YTPL
Sbjct: 232 LGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVGYTPL 291
Query: 270 LKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPA 325
L NP RS LYYV L +I V + + IP A + +P+TG GT++DSGT FTR+V A
Sbjct: 292 LDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDA 351
Query: 326 YTAVRDVF--RRRVGSNLTVTSLGGFDTCY------SVPIVAPTITLMFSGMNVTLPQDN 377
YTA R+ F R G V + GFD CY S+P V + + + + L ++
Sbjct: 352 YTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEH 411
Query: 378 LLIH-STAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
L + S AG+ CLA+ ++ + +NV+ N QQ N+ + YD SR+G R C+
Sbjct: 412 LFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADCS 470
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 128/446 (28%), Positives = 198/446 (44%), Gaps = 59/446 (13%)
Query: 31 SSTLQVFHVFSPCSPFKPS--KPLSWEESVLEMLAKDQARLQFL----SSLAVARKS--- 81
++ + + H PCSP + KP S E +LA DQ R + + S+ A R
Sbjct: 89 TTRMTIVHRHGPCSPLAAAHRKPPSHGE----ILAADQNRAESIQHRVSTTATGRGKPKR 144
Query: 82 ---------------------VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
+P +SGR + Y+V +GTP + DT +D
Sbjct: 145 SRRQQPSSAPAPAASLSSSTASLPASSGRALGTG-NYVVTVGLGTPVSRYTVVFDTGSDT 203
Query: 121 AWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSS 176
WV C CV +F+ A+S+T+ N+ C A C + C GG C + + YG
Sbjct: 204 TWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYGDG 263
Query: 177 TIAANL-SQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
+ + + DT++L++ D V G+ FGC ++ G GLLGLGRG SL QT + Y
Sbjct: 264 SYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYG 323
Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
F++CLP+ + G + TP+L + + YYV + IRVG +++
Sbjct: 324 GVFAHCLPARSTGTGYLDFGAGSLAAASARLTTPMLTD-NGPTFYYVGMTGIRVGGQLLS 382
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFD 350
IP AGTI+DSGTV TRL AY+++R R SL D
Sbjct: 383 IPQSVFAT-----AGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSL--LD 435
Query: 351 TCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
TCY + PT++L+F G + ++++ + S CLA AA D + + ++
Sbjct: 436 TCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGD--VGIV 493
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N Q + + YD+ +G C
Sbjct: 494 GNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 133/434 (30%), Positives = 200/434 (46%), Gaps = 52/434 (11%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
L + H SPCSP PL + +L D AR+ L++ S P R +
Sbjct: 43 LTLHHPRSPCSP----APLPADVPFSAVLTHDHARIASLAARLAKTPSSRPTKLRRGSSS 98
Query: 94 SP-------------------TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC-- 131
SP Y+ R +GTPA++ +M +DT + W+ C+ C V C
Sbjct: 99 SPDAESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHR 158
Query: 132 -SSTVFNSAQSTTFKNLGCQAAQCKQVP----NP-TCG-GGACAFNLTYGSSTIAAN-LS 183
S VFN S+++ ++ C A QC + NP TC C + +YG S+ + LS
Sbjct: 159 QSGPVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLS 218
Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
+DT+S + VP + +GC Q G GL+GL R LSLL Q +FSYCLP+
Sbjct: 219 KDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPT 278
Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
+ S S+ P + YTP+ K+ SLY++ + I V + + + A
Sbjct: 279 SSSSSGYLSIG---SYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSL 335
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SVPIVAP 360
P TIIDSGTV TRL Y+A+ + ++ DTC+ + + P
Sbjct: 336 P-----TIIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQASRLRVP 390
Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
+++ F+ G + L NLL+ + + TCLA A A +I N QQQ ++YD
Sbjct: 391 QVSMAFAGGAALKLKATNLLVDVDSAT-TCLAFAPARSAA-----IIGNTQQQTFSVVYD 444
Query: 420 VPNSRLGVARELCT 433
V NS++G A C+
Sbjct: 445 VKNSKIGFAAGGCS 458
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 129/434 (29%), Positives = 199/434 (45%), Gaps = 52/434 (11%)
Query: 31 SSTLQV--FHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG 88
S+TL V H + PC+ + S + S E L +AR ++ S A + P +
Sbjct: 52 SATLSVPLVHRYGPCAASQYSDMPT--PSFSETLRHSRARTNYIKSRASTGMASTPDDAA 109
Query: 89 RQI-------TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVF 136
+ S Y+V GTP+ ++ MDT +D +WV C T C +F
Sbjct: 110 VTVPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLF 169
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNP-----TCGGGACAFNLTYGS-STIAANLSQDTISLA 190
+ ++S+T+ + C A C ++ + T GG C + + YG S+ S +TI+ A
Sbjct: 170 DPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFA 229
Query: 191 TDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
I V + FGC G S GLLGLG SL+ QT ++Y FSYCLP+ S
Sbjct: 230 PGITVKDFHFGCGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALN--SE 287
Query: 250 SGSLRLG----PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
+G L LG +TP+ P ++ Y VN+ I VG + +DIP A +
Sbjct: 288 AGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFR---- 343
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPT 361
G +IDSGT+ T L AY A+ R+ + V S FDTCY+ + P
Sbjct: 344 --GGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASE-DFDTCYNFTGYSNVTVPR 400
Query: 362 ITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
+ L FSG +++ +P + +L+ CLA + +V L +I N+ Q+ +LY
Sbjct: 401 VALTFSGGATIDLDVP-NGILVKD------CLAFRESGPDVG--LGIIGNVNQRTLEVLY 451
Query: 419 DVPNSRLGVARELC 432
D + ++G C
Sbjct: 452 DAGHGKVGFRAGAC 465
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 165/366 (45%), Gaps = 32/366 (8%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG S Y R +GTPA+ + + +DT +D W+ C C C S VFN
Sbjct: 150 PVVSGAS-QGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTS 208
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
S+T+K+L C A QC + C C + ++YG + LATD V TFG
Sbjct: 209 SSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVG------ELATDTV---TFG 259
Query: 201 CIQK----ATGNSVPPQGLL----GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
K A G +GL GL +L+ T + ++FSYCL + S S
Sbjct: 260 NSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVD-RDSGKSSS 318
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
L + PLL+N + + YYV L VG V +P + + G I+
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378
Query: 313 DSGTVFTRLVAPAYTAVRDVF-RRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS 367
D GT TRL AY ++RD F + V +S+ FDTCY + PT+ F+
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFT 438
Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G ++ LP N LI C A A +S L++I N+QQQ RI YD+ + +G
Sbjct: 439 GGKSLDLPAKNYLIPVDDSGTFCFAFAP----TSSSLSIIGNVQQQGTRITYDLSKNVIG 494
Query: 427 VARELC 432
++ C
Sbjct: 495 LSGNKC 500
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 122/348 (35%), Positives = 176/348 (50%), Gaps = 33/348 (9%)
Query: 112 MAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG--GGA 166
M +DT +D WV C C C S VF+ +S+++ +GC AA C+++ + C GA
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60
Query: 167 CAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
C + + YG ++ A + +T++ A V GC G V GLLGLGRG LS
Sbjct: 61 CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLS 120
Query: 225 LLAQTQNLYQSTFSYCL---PSFKALSFSGSLR-------LGPIGQPKRIKYTPLLKNPR 274
Q Y +FSYCL S A + GS R G +G +TP+++NPR
Sbjct: 121 FPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASS-ASFTPMVRNPR 179
Query: 275 RSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDV 332
+ YYV L+ I V G RV + L+ +P+TG G I+DSGT TRL +Y+A+RD
Sbjct: 180 METFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDA 239
Query: 333 FRRRVGSNLTVTSLGG---FDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTA 384
FR L + S GG FDTCY + + PT+++ F+ G LP +N LI +
Sbjct: 240 FRAAAAGGLRL-SPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDS 298
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A V ++I N+QQQ R+++D R+G A + C
Sbjct: 299 RGTFCFAFAGTDGGV----SIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 116/384 (30%), Positives = 171/384 (44%), Gaps = 38/384 (9%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS----STVFNSA 139
P+ SG T S Y V ++GTP Q+LL+ DT +D WV C+ C CS S+ F
Sbjct: 76 PLISGAS-TGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPR 134
Query: 140 QSTTFKNLGCQAAQCKQVP-------NPTCGGGACAFNLTYGSSTIAANL-SQDTISL-- 189
S++F C C+ +P N T C F +Y ++++ S++T +L
Sbjct: 135 HSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKS 194
Query: 190 --ATDI-VPGYTFGCIQKATGNSVP------PQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
++I + G +FGC + +G SV +G++GLGRGS+S +Q + + FSYC
Sbjct: 195 LSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYC 254
Query: 241 L-------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
L P L G L P+ +I YTPL NP + YY+ + +I + +
Sbjct: 255 LMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKL 314
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
I P + + GT++DSGT T L AY V RRRV GFD C
Sbjct: 315 PINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCV 374
Query: 354 SVPIVA-----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
+ + P + G V P T + CLA+ A + +VI N
Sbjct: 375 NASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAV--ESGNGFSVIGN 432
Query: 409 MQQQNHRILYDVPNSRLGVARELC 432
+ QQ + +D SRLG R C
Sbjct: 433 LMQQGFLLEFDKEESRLGFTRRGC 456
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 136/444 (30%), Positives = 200/444 (45%), Gaps = 63/444 (14%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---------------LAVA 78
L + H SPCSP PL + ++ D AR+ L+S L
Sbjct: 45 LTLHHPQSPCSP----APLPSDLPFSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGH 100
Query: 79 RK-------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
RK S VP+ G + Y+ R +GTPA + +M +DT + W+ C
Sbjct: 101 RKKKAGGVGGSQASSSSVPLTPGASVAVG-NYVTRLGLGTPATSYVMVVDTGSSLTWLQC 159
Query: 126 TGC-VGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGS 175
+ C V C + VF+ S T+ + C +++C ++ T AC+ + +YG
Sbjct: 160 SPCSVSCHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGD 219
Query: 176 STIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ 234
S+ + LS+DT+S + PG+ +GC Q G GL+GL + LSLL Q
Sbjct: 220 SSYSVGYLSKDTVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLG 279
Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
FSYCLP+ A +G L +G P + YTP+ + +SLY+V L I V +
Sbjct: 280 YAFSYCLPTSSAA--AGYLSIGSY-NPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLA 336
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY 353
+PP + P TIIDSGTV TRL YTA+ R V + + DTC+
Sbjct: 337 VPPSEYRSLP-----TIIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCF 391
Query: 354 ---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
+ + P + + F+ G + L N+LI S TCLA A +I N
Sbjct: 392 RGSAAGLRVPRVDMAFAGGATLALSPGNVLI-DVDDSTTCLAFAPTGGTA-----IIGNT 445
Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
QQQ ++YDV SR+G A C+
Sbjct: 446 QQQTFSVVYDVAQSRIGFAAGGCS 469
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 165/366 (45%), Gaps = 32/366 (8%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG S Y R +GTPA+ + + +DT +D W+ C C C S VFN
Sbjct: 150 PVVSGAS-QGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTS 208
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFG 200
S+T+K+L C A QC + C C + ++YG + LATD V TFG
Sbjct: 209 SSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVG------ELATDTV---TFG 259
Query: 201 CIQK----ATGNSVPPQGLL----GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
K A G +GL GL +L+ T + ++FSYCL + S S
Sbjct: 260 NSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVD-RDSGKSSS 318
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
L + PLL+N + + YYV L VG V +P + + G I+
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378
Query: 313 DSGTVFTRLVAPAYTAVRDVF-RRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS 367
D GT TRL AY ++RD F + V +S+ FDTCY + PT+ F+
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFT 438
Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G ++ LP N LI C A A +S L++I N+QQQ RI YD+ + +G
Sbjct: 439 GGKSLDLPAKNYLIPVDDSGTFCFAFAP----TSSSLSIIGNVQQQGTRITYDLSKNVIG 494
Query: 427 VARELC 432
++ C
Sbjct: 495 LSGNKC 500
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 136/411 (33%), Positives = 202/411 (49%), Gaps = 39/411 (9%)
Query: 55 EESVLEMLAKDQARLQFLSS---LAVARKSVV-------PIASGRQITQSPTYIVRAKIG 104
E+ +LE L +D+ R++++ S LA +K P+ SG + S Y VR +G
Sbjct: 3 EQLLLETLQRDERRVRWIESKAKLAGKKKDEASSTDLNGPVTSGL-LYGSGEYFVRLGLG 61
Query: 105 TPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
TPA++L M +DT +D W+ C C C + +F+ S++F+ + C + CK + +
Sbjct: 62 TPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKALEVHS 121
Query: 162 CGG--GA---CAFNLTYGSSTIA-ANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQG 214
C G GA C++ + YG + + + S D +L T FGC G G
Sbjct: 122 CSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAG 181
Query: 215 LLGLGRGSLSLLAQ-----TQNLYQSTFSYCL--PSFKALSFSGSLRLGPIGQPKRIKYT 267
LLGLG G LS +Q T + ++FSYCL S S SL G P +
Sbjct: 182 LLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAAIPSTAALS 241
Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
PLLKNP+ + YY ++ + VG + I +LQ + + G IIDSGT TR Y
Sbjct: 242 PLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYA 301
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMF-SGMNVTLPQDNLLIH- 381
+RD FR + + FDTCY+ + P + L F +G ++ LP N LI
Sbjct: 302 TIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPI 361
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+TAGS CLA AP ++ L +I N+QQQ+ RI +D+ S L A + C
Sbjct: 362 NTAGSF-CLAF--APTSME--LGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 123/435 (28%), Positives = 191/435 (43%), Gaps = 33/435 (7%)
Query: 8 FLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
F + LF F ++ ++H S SP + + E + + +
Sbjct: 15 FCSVLFCFVFNQVFRAELIYREHQS-----------SPLRSETLKTPSEIFIAAVKRGHE 63
Query: 68 RLQFLSSLAVARKSV--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
R L+ +A + P+ASG Y++ G P Q +DT +D WV C
Sbjct: 64 RRARLAKHVLAGDQLFETPVASGNG-----EYLIDISYGNPPQKSTAIVDTGSDLNWVQC 118
Query: 126 TGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAAN 181
C C T+ F+ ++S ++K LGC + C+ +P +C +C ++ YG S+ +
Sbjct: 119 LPCKSCYETLSAKFDPSKSASYKTLGCGSNFCQDLPFQSC-AASCQYDYMYGDGSSTSGA 177
Query: 182 LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
LS D +++ T +P FGC G GL+GLG+G LSL++Q FSYCL
Sbjct: 178 LSTDDVTIGTGKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCL 237
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
+ S L +G + YTP+L N + YY L I V + V+ P
Sbjct: 238 VPLGSTKTS-PLYIGDSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFD 296
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-- 359
T G I+DSGT T L A+ + + + S G + C+S VA
Sbjct: 297 IAATGRGGLILDSGTTLTYLDVDAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVANP 356
Query: 360 --PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
PT+ F+G +V L DN I TCLAMA++ + ++ N+QQ NH I+
Sbjct: 357 TYPTVVFHFNGADVALAPDNTFIALDFEGTTCLAMASS-----TGFSIFGNIQQLNHVIV 411
Query: 418 YDVPNSRLGVARELC 432
+D+ N R+G C
Sbjct: 412 HDLVNKRIGFKSANC 426
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 111/368 (30%), Positives = 169/368 (45%), Gaps = 47/368 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y+VR +GTP + + + +DT +D W C C C V + A S+T+ L C AA+
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAAR 143
Query: 154 CKQVPNPTCG------GGACAFNLTYGSSTI-AANLSQDTISLATDIVPG-------YTF 199
C+ +P +CG +C + YG ++ ++ D + G TF
Sbjct: 144 CRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLTF 203
Query: 200 GC--IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLG 256
GC + K S G+ G GRG SL +Q L ++FSYC S F++ S +L
Sbjct: 204 GCGHLNKGVFQS-NETGIAGFGRGRWSLPSQ---LNVTSFSYCFTSMFESKSSLVTLGGS 259
Query: 257 PI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
P ++ TP+LKNP + SLY+++L I VG+ + +P + TI
Sbjct: 260 PAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR-------STI 312
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-------PTITL 364
IDSG T L Y AV+ F +VG + D C+++P+ A P++TL
Sbjct: 313 IDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTL 372
Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
G + LP+ N + + C+ + AAP VI N QQQN ++YD+ N R
Sbjct: 373 HLEGADWELPRSNYVFEDLGARVMCIVLDAAPGE----QTVIGNFQQQNTHVVYDLENDR 428
Query: 425 LGVARELC 432
L A C
Sbjct: 429 LSFAPARC 436
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 120/389 (30%), Positives = 179/389 (46%), Gaps = 27/389 (6%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
+ S L+ + D+ R Q P+ SG S Y R +GTPA+ + + +
Sbjct: 130 DRSDLKPVDIDETRFQ-------PEDLTTPVVSGTS-QGSGEYFSRIGVGTPAKEMYVVL 181
Query: 115 DTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNL 171
DT +D W+ PC+ C S +F+ S+TFK+L C +C + C C + +
Sbjct: 182 DTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPKCASLDVSACRSNKCLYQV 241
Query: 172 TYGSSTI-AANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
+YG + N + DT++ + V GC G GLLGLG G+LS+ T
Sbjct: 242 SYGDGSFTVGNYATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSM---T 298
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG 289
+ +FSYCL + S SL + PLL+N + + YYV L VG
Sbjct: 299 NQIKAKSFSYCLVDRDSAK-SSSLDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVG 357
Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGG 348
+ V IP + + + G I+D GT TRL AY ++RD F + TS +
Sbjct: 358 GQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISL 417
Query: 349 FDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
FDTCY + PT+T F+ G ++ LP N LI C A A +S L
Sbjct: 418 FDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAP----TSSSL 473
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
++I N+QQQ RI YD+ N+ +G++ C
Sbjct: 474 SIIGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 126/385 (32%), Positives = 187/385 (48%), Gaps = 59/385 (15%)
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS-----STVFNSAQSTTFKNL 147
QS Y+V IGTP + + DT +D WV C C S +F+ ++S+T+ ++
Sbjct: 118 QSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDV 177
Query: 148 GCQAAQCK--QVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLA--TDIVP---GYTF 199
C A +C V CG +C +++ YG S +L+++T +L+ + + P G F
Sbjct: 178 PCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVF 237
Query: 200 GCIQK------ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS---TFSYCLPSFKALSFS 250
GC + TG V GLLGLGRG S+L+QT+ S FSYCLP S +
Sbjct: 238 GCSHEYISVFNDTGMGV--AGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRG--SST 293
Query: 251 GSLRLG-----PIGQPKRIKYTPLLKN-PRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
G L +G P Q + +TPL+ + S Y VNL + V VDIP A
Sbjct: 294 GYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL-- 351
Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSVP----IV 358
G +IDSGTV T + A AY +RD FR +GS L S+ DTCY V +
Sbjct: 352 ----GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVT 407
Query: 359 APTITLMF----------SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
AP + L F SG+ + LP ++ + S+T +A P N ++ L ++ N
Sbjct: 408 APRVALEFGGGARIDVDASGILLVLPAED----GSGQSLTLACLAFLPTN-SAGLVIVGN 462
Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
MQQ+ + +++DV R+G C+
Sbjct: 463 MQQRAYNVVFDVDGGRIGFGPNGCS 487
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 132/444 (29%), Positives = 198/444 (44%), Gaps = 64/444 (14%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL------AVARK------- 80
L + H SPCSP PL + +L D AR L+S A +R+
Sbjct: 47 LTLHHPQSPCSP----APLPSDLPFSTVLTHDDARAAHLASRLATTSNAPSRRPTTSLRK 102
Query: 81 ----------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVP 124
+ VP+ G + Y+ +GTPA + M +DT + W+
Sbjct: 103 PKAAAGASGGPLDDSLASVPLTPGTSVGVG-NYVTELGLGTPATSYAMVVDTGSSLTWLQ 161
Query: 125 CTGCV-GCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYG 174
C+ CV C V ++ S+T+ + C A+QC ++ T AC+ + +YG
Sbjct: 162 CSPCVVSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASYG 221
Query: 175 SSTIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY 233
S+ + LS+DT+S + P + +GC Q G GL+GL R LSLL Q
Sbjct: 222 DSSFSVGYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSL 281
Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
+FSYCLP+ + +G L +GP YTP+ + +SLY+V L + VG +
Sbjct: 282 GYSFSYCLPTPAS---TGYLSIGPYTS-GHYSYTPMASSSLDASLYFVTLSGMSVGGSPL 337
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
+ P P TIIDSGTV TRL YTA+ + + + DTC+
Sbjct: 338 AVSPAEYSSLP-----TIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTCF 392
Query: 354 ---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
+ + P + + F+ G + L N+LI S TCLA AP + + +I N
Sbjct: 393 QGQASQLRVPAVAMAFAGGATLKLATQNVLID-VDDSTTCLAF--APTDSTT---IIGNT 446
Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
QQQ ++YDV SR+G A C+
Sbjct: 447 QQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 168/358 (46%), Gaps = 19/358 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
PI SG S Y R IG P + M +DT +D +WV C C C + +F
Sbjct: 139 PIVSGAS-QGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTS 197
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
S +F +L C+ QCK + C G C + ++YG S + +T++L + +
Sbjct: 198 SASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAI 257
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
GC G + GLLGLG GSLS +Q L S+FSYCL + S S PI
Sbjct: 258 GCGHNNEGLFIGAAGLLGLGGGSLSFPSQ---LNASSFSYCLVDRDSDSTSTLDFNSPI- 313
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
P + PL +NP + +Y+ L + VG V+ IP + Q + G I+DSGT T
Sbjct: 314 TPDAVT-APLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVT 372
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLP 374
RL Y +RD F + T + FDTCY + + PT++ F+ G + LP
Sbjct: 373 RLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLP 432
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N LI + C A A +S L+++ N QQQ R+ +D+ NS +G + C
Sbjct: 433 AKNYLIPVDSEGTFCFAFAP----TDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 111/340 (32%), Positives = 174/340 (51%), Gaps = 28/340 (8%)
Query: 112 MAMDTSNDAAWVPCTGC-VGCSST---VFNSAQSTTFKNLGCQAAQCKQVP-----NPTC 162
M +DT + +W+ C C V C + +++ + S T+K L C + +C ++ +P C
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 163 --GGGACAFNLTYGSSTIA-ANLSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
AC + +YG ++ + LSQD ++L ++ +P +T+GC Q G G++GL
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGL 120
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL 278
R LS+LAQ Y FSYCLP+ + S G P K+TP+L + + SL
Sbjct: 121 ARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSL 180
Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG 338
Y++ L AI V R +D+ A+ P T+IDSGTV TRL Y A+R F + +
Sbjct: 181 YFLRLTAITVSGRPLDL-AAAMYRVP-----TLIDSGTVITRLPMSMYAALRQAFVKIMS 234
Query: 339 SNLTVT-SLGGFDTCYSVPI----VAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM 392
+ + DTC+ + P I ++F G ++TL ++LI + G ITCLA
Sbjct: 235 TKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKG-ITCLAF 293
Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
A + + + + +I N QQQ + I YDV SR+G A C
Sbjct: 294 AGS--SGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 118/374 (31%), Positives = 174/374 (46%), Gaps = 42/374 (11%)
Query: 84 PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
P+ SG + Q S Y R IG+PA+ L M +DT +D W+ C C C S +F+ A
Sbjct: 184 PVVSG--VGQGSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPA 241
Query: 140 QSTTFKNLGCQAAQCKQVPNPTC------GGGACAFNLTYGS-STIAANLSQDTISLATD 192
S+++ + C + C+ + C G +C + + YG S + + +T++L D
Sbjct: 242 LSSSYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGD 301
Query: 193 ---IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSF 244
V GC G V GLL LG G LS +Q + + FSYCL PS
Sbjct: 302 GSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISATEFSYCLVDRDSPSA 358
Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV-DIPPGALQFN 303
L F S PL+++PR ++ YYV L I VG + DIPP A +
Sbjct: 359 STLQFGAS--------DSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMD 410
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVA 359
G I+DSGT TRL + AY+A+RD F R + + + FDTCY + +
Sbjct: 411 EQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQV 470
Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
P ++L F G + LP N LI CLA AA ++++ N+QQQ R+ +
Sbjct: 471 PAVSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFAA----TGGAVSIVGNVQQQGIRVSF 526
Query: 419 DVPNSRLGVARELC 432
D + +G + C
Sbjct: 527 DTAKNTVGFSPNKC 540
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 127/443 (28%), Positives = 192/443 (43%), Gaps = 62/443 (13%)
Query: 36 VFHVFSPCSPF---KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
+ H PCSP KP S E +LA DQ R++ SL S G+ T
Sbjct: 77 IVHRHGPCSPLAGAHAGKPPSHAE----ILAADQNRVE---SLHHRVSSTTTGLGGKPRT 129
Query: 93 QSPT-----------------------------YIVRAKIGTPAQTLLMAMDTSNDAAWV 123
+ T Y+V +GTP + DT +D WV
Sbjct: 130 KKKTPGHSSVPASSSSSSSSVPASSGLSLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWV 189
Query: 124 PCTGCV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA 179
C CV C +F+ A+S+T+ N+ C C + C G C + + YG +
Sbjct: 190 QCRPCVVSCYKQKDRLFDPAKSSTYANVSCADPACADLDASGCNAGHCLYGIQYGDGSYT 249
Query: 180 ANL-SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
++DT+++A D + G+ FGC +K G GLLGLGRG S+ Q Y +FS
Sbjct: 250 VGFFAKDTLAVAQDAIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFS 309
Query: 239 YCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
YCLP+ A +G L P K TP+L + + + YYV L IRVG + +
Sbjct: 310 YCLPASSAA--TGYLEFGPLSPSSSGSNAKTTPMLTD-KGPTFYYVGLTGIRVGGKQLGA 366
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRL--VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
P ++ N +GT++DSGTV TRL A A + + DTCY
Sbjct: 367 IPESVFSN----SGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCY 422
Query: 354 SV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
+ PT++L+F G + ++++ + S CL A+ D+ + + ++ N
Sbjct: 423 DFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCLGFASNGDDES--VGIVGNT 480
Query: 410 QQQNHRILYDVPNSRLGVARELC 432
QQ+ + +LYDV +G A C
Sbjct: 481 QQRTYGVLYDVSKKVVGFAPGAC 503
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 126/371 (33%), Positives = 182/371 (49%), Gaps = 48/371 (12%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC 162
IG+ + L +DT ++A V C S VF+ A S +++ + C + C V T
Sbjct: 5 IGSLQKNLSAIIDTGSEAVLVQCGSR---SRPVFDPAASQSYRQVPCISQLCLAVQQQTS 61
Query: 163 GG---------GACAFNLTYGSS-TIAANLSQDTISLAT-----------DIVPGYTFGC 201
G AC ++L+YG S + SQD I L + D+ FGC
Sbjct: 62 NGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVA----FGC 117
Query: 202 IQKATGNSVP--PQGLLGLGRGSLSLLAQTQN-LYQSTFSYCLPSFKAL-SFSGSLRLGP 257
G V G++G RG+LSL +Q ++ L S FSYC PS +G + LG
Sbjct: 118 AHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGD 177
Query: 258 IGQPK-RIKYTPLLKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTII 312
G K ++ YTPLL NP RS LYYV L +I V + + IP A + +P+TG GT++
Sbjct: 178 SGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVL 237
Query: 313 DSGTVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDTCY------SVPIVAPTITL 364
DSGT FTR+V AYTA R+ F R G V + GFD CY S+P V
Sbjct: 238 DSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLS 297
Query: 365 MFSGMNVTLPQDNLLIH-STAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
+ + + + L ++L + S AG+ CLA+ ++ + +NV+ N QQ N+ + YD
Sbjct: 298 LQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNE 357
Query: 422 NSRLGVARELC 432
SR+G R C
Sbjct: 358 RSRVGFERADC 368
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 173/350 (49%), Gaps = 19/350 (5%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
++V +GTP Q ++ +DT +D W+ PC C + +F+ ++S+T+ + C ++
Sbjct: 25 FLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIACSSSA 84
Query: 154 CKQV-PNPTCGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATG--N 208
C + TC A C + YG ++ S++TI+ FG TG
Sbjct: 85 CADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVKFGASVYNTGTFG 144
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-KALSFSGSLRLGPIGQPK-RIKY 266
+G+LGLG+G +S+ +Q ++ + FSYCL + A S + ++ G P ++Y
Sbjct: 145 DTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFGDAAVPSGEVQY 204
Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
TP++ N + YY+ + I VG ++DI + + GTIIDSGT T L +
Sbjct: 205 TPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVF 264
Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHS 382
A+ + +V T TS G D C++ V P +T+ G+++ LP N I S
Sbjct: 265 NALVAAYTSQV-RYPTTTSATGLDLCFNTRGTGSPVFPAMTIHLDGVHLELPTANTFI-S 322
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+I CLA A+A D + + N+QQQN I+YD+ N R+G A C
Sbjct: 323 LETNIICLAFASALD---FPIAIFGNIQQQNFDIVYDLDNMRIGFAPADC 369
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 121/403 (30%), Positives = 184/403 (45%), Gaps = 54/403 (13%)
Query: 53 SWEESVLEMLAKDQARLQFLSSLAVARKS----------VVPIASGRQITQSPTYIVRAK 102
S V+ ++A+D AR++ L VA S VVP S Y VR
Sbjct: 80 SRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVD----DGSGEYFVRVG 135
Query: 103 IGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
+G+P + +D+ +D WV PC C + +F+ A S++F + C +A C+ +
Sbjct: 136 VGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSG 195
Query: 160 PTCGGGA----CAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQG 214
CGGG C +++TYG S L+ +T++L V G GC + +G V G
Sbjct: 196 TGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAG 255
Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
LLGLG G++SL+ Q FSYCL S + +GSL
Sbjct: 256 LLGLGWGAMSLVGQLGGAAGGVFSYCLAS-RGAGGAGSLA-------------------- 294
Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
SS YYV L I VG + + Q G ++D+GT TRL AY A+R F
Sbjct: 295 -SSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFD 353
Query: 335 RRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITC 389
+G+ ++ DTCY + + PT++ F G +TLP NLL+ G++ C
Sbjct: 354 GAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVE-VGGAVFC 412
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LA A + +S ++++ N+QQ+ +I D N +G C
Sbjct: 413 LAFAPS----SSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 135/429 (31%), Positives = 207/429 (48%), Gaps = 55/429 (12%)
Query: 28 QDHSSTLQV--FHVFSPCSPFKPSKPLSWE-ESVLEMLAKDQARLQFLSSLAVARKSVVP 84
+ + ST+ V H PC+P PS LS + S ++ + +AR ++ +K VP
Sbjct: 48 EQNGSTVYVPLVHRHGPCAP-APS--LSTDTRSFADIFRRSRARPSYI---VRGKKVSVP 101
Query: 85 IASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--C---SSTVFNSA 139
G + S Y+VR GTPA ++ +DT +D +W+ C C C +++ +
Sbjct: 102 AHLGTSV-MSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPS 160
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTY--GSSTIAANLSQDTISLATD 192
S+T+ + C + CK++ G G C F ++Y G+ST+ A SQD ++LA
Sbjct: 161 HSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGA-YSQDKLTLAPG 219
Query: 193 -IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
IV + FGC G+LGLGR SL A+ Y FSYCLPS S G
Sbjct: 220 AIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVS--SKPG 273
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
L LG P +TP+ P + + V L I VG + +D+ P A G I
Sbjct: 274 FLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------GGMI 327
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS 367
+DSGTV T L + AY A+R FR+ + + + + G DTCY++ +V P I L F+
Sbjct: 328 VDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKIALTFT 386
Query: 368 G---MNVTLPQDNLLIHSTAGSITCLAMA-AAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G +N+ +P + +L++ CLA A + PD VL N+ Q+ +L+D S
Sbjct: 387 GGATINLDVP-NGILVNG------CLAFAESGPDGSAGVL---GNVNQRAFEVLFDTSTS 436
Query: 424 RLGVARELC 432
+ G + C
Sbjct: 437 KFGFRAKAC 445
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 112/346 (32%), Positives = 168/346 (48%), Gaps = 30/346 (8%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGC-VGC---SSTVFNSAQSTTFKNLGCQAAQCKQVP 158
+GTPA +M +DT + W+ C+ C V C S VFN S+T+ ++GC A QC +P
Sbjct: 3 LGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDLP 62
Query: 159 NPTCGGGACA------FNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVP 211
+ T AC+ + +YG S+ + LS+DT+S + +P + +GC Q G
Sbjct: 63 SATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSLPNFYYGCGQDNEGLFGR 122
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLK 271
GL+GL R LSLL Q +F+YCLPS + + P + YTP++
Sbjct: 123 SAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSY----NPGQYSYTPMVS 178
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
+ SLY++ L + V + + A P TIIDSGTV TRL Y+A+
Sbjct: 179 SSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLP-----TIIDSGTVITRLPTSVYSALSK 233
Query: 332 VFRRRVGSNLTVTSLGGFDTCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSI 387
+ ++ DTC+ + + AP +T+ F+ G + L NLL+ S
Sbjct: 234 AVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQNLLVD-VDDST 292
Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
TCLA A A +I N QQQ ++YDV +SR+G A C+
Sbjct: 293 TCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 167/358 (46%), Gaps = 19/358 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
PI SG S Y R IG P + M +DT +D +WV C C C + F
Sbjct: 139 PIVSGAS-QGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTS 197
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
S +F +L C+ QCK + C G C + ++YG S + +T++L + +
Sbjct: 198 SASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAI 257
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
GC G + GLLGLG GSLS +Q L S+FSYCL + S S PI
Sbjct: 258 GCGHNNEGLFIGAAGLLGLGGGSLSFPSQ---LNASSFSYCLVDRDSDSTSTLDFNSPI- 313
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
P + PL +NP + +Y+ L + VG V+ IP + Q + G I+DSGT T
Sbjct: 314 TPDAVT-APLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVT 372
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLP 374
RL Y +RD F + T + FDTCY + + PT++ F+ G + LP
Sbjct: 373 RLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLP 432
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N LI + C A A +S L+++ N QQQ R+ +D+ NS +G + C
Sbjct: 433 AKNYLIPVDSEGTFCFAFAP----TDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 131/434 (30%), Positives = 198/434 (45%), Gaps = 68/434 (15%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS---VVPIAS 87
S L + + PCS S+P S +E + +D++R+ F++S S +
Sbjct: 63 SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKCNQYTSGNLKNHAHN 118
Query: 88 GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTF 144
+ ++V GTP + + +DT + W C CV C S+ F+S+ S+T+
Sbjct: 119 NNLFDEDGNFLVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTY 178
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCI 202
C + + +N+TYG ST N DT++L +D+ + FGC
Sbjct: 179 SFGSCIPSTVEN-----------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCG 227
Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IG 259
+ G+ G+LGLG+G LS ++QT + + FSYCLP ++ GSL G
Sbjct: 228 RNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSI---GSLLFGEKATS 284
Query: 260 QPKRIKYTPLLKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
Q +K+T L+ P + S Y+VNL I VG ++IP GTIIDS T
Sbjct: 285 QSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRT 339
Query: 317 VFTRLVAPAYTAVRDVF------------RRRVGSNLTVTSLGGFDTCYSV----PIVAP 360
V TRL AY+A++ F RR+ G L DTCY++ ++ P
Sbjct: 340 VITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL--------DTCYNLSGRKDVLLP 391
Query: 361 TITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
I L F G +V L N++ S A S CLA A S L +I N QQ + +LYD
Sbjct: 392 EIVLHFGGGADVRLNGTNIVWGSDA-SRLCLAFAGT-----SELTIIGNRQQLSLTVLYD 445
Query: 420 VPNSRLGVARELCT 433
+ R+G C+
Sbjct: 446 IQGRRIGFGGNGCS 459
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 169/358 (47%), Gaps = 19/358 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
P+ SG S Y +R IG P + +DT +D +W+ PC+ C S +F+
Sbjct: 137 PVVSGTS-QGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPIS 195
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTF 199
S ++ + C QCK + C G C + ++YG S + +T++L + V
Sbjct: 196 SNSYSPIRCDEPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGSAAVENVAI 255
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
GC G V GLLGLG G LS AQ ++FSYCL + + + S P+
Sbjct: 256 GCGHNNEGLFVGAAGLLGLGGGKLSFPAQVN---ATSFSYCLVNRDSDAVSTLEFNSPL- 311
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
P+ PL++NP + YY+ L I VG + IP + + + G G IIDSGT T
Sbjct: 312 -PRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVT 370
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLP 374
RL + Y A+RD F + + FDTCY + + PT++ F G + LP
Sbjct: 371 RLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLP 430
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N LI + C A A S L++I N+QQQ R+ +D+ NS +G + + C
Sbjct: 431 ARNYLIPVDSVGTFCFAFAP----TTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 135/429 (31%), Positives = 207/429 (48%), Gaps = 55/429 (12%)
Query: 28 QDHSSTLQV--FHVFSPCSPFKPSKPLSWE-ESVLEMLAKDQARLQFLSSLAVARKSVVP 84
+ + ST+ V H PC+P PS LS + S ++ + +AR ++ +K VP
Sbjct: 14 EQNGSTVYVPLVHRHGPCAP-APS--LSTDTRSFADIFRRSRARPSYI---VRGKKVSVP 67
Query: 85 IASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--C---SSTVFNSA 139
G + S Y+VR GTPA ++ +DT +D +W+ C C C +++ +
Sbjct: 68 AHLGTSV-MSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPS 126
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTY--GSSTIAANLSQDTISLATD 192
S+T+ + C + CK++ G G C F ++Y G+ST+ A SQD ++LA
Sbjct: 127 HSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGA-YSQDKLTLAPG 185
Query: 193 -IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
IV + FGC G+LGLGR SL A+ Y FSYCLPS S G
Sbjct: 186 AIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVS--SKPG 239
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
L LG P +TP+ P + + V L I VG + +D+ P A G I
Sbjct: 240 FLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------GGMI 293
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS 367
+DSGTV T L + AY A+R FR+ + + + + G DTCY++ +V P I L F+
Sbjct: 294 VDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKIALTFT 352
Query: 368 G---MNVTLPQDNLLIHSTAGSITCLAMA-AAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G +N+ +P + +L++ CLA A + PD VL N+ Q+ +L+D S
Sbjct: 353 GGATINLDVP-NGILVNG------CLAFAESGPDGSAGVL---GNVNQRAFEVLFDTSTS 402
Query: 424 RLGVARELC 432
+ G + C
Sbjct: 403 KFGFRAKAC 411
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 127/430 (29%), Positives = 201/430 (46%), Gaps = 45/430 (10%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQ-------ARLQFLSSLAVARKS 81
D +S+LQV H + PC + + S +E L +DQ ARL +S + +
Sbjct: 65 DKASSLQVLHKYGPC------MQVLNDRSHVEFLLQDQLRVDSIQARLSKISGHGIFEEM 118
Query: 82 V--VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTV 135
V +P SG I + Y+V +GTP + + DT + W C C+G
Sbjct: 119 VTKLPAQSGIAIG-TGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQK 177
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYGSSTIAANL-SQDTISLA 190
F+ +ST++ N+ C +A C +P G A C + + YG + + + +T++++
Sbjct: 178 FDPTKSTSYNNVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTIS 237
Query: 191 T-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
+ D+ + FGC Q G GLLGL S+SL +QT YQ FSYCLPS S
Sbjct: 238 SSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPS--TPSS 295
Query: 250 SGSLRLGPIGQPKRIK-YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
+G L G G+ + +TP+ +P SS Y ++++ I V + I P T +
Sbjct: 296 TGYLNFG--GKVSQTAGFTPI--SPAFSSFYGIDIVGISVAGSQLPIDPSIF-----TTS 346
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITL 364
G IIDSGTV TRL AY A+++ F ++ + DTCY + P +++
Sbjct: 347 GAIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSV 406
Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
F G+ V + +L + CLA AA D +S + N QQ+ + ++YD
Sbjct: 407 SFKGGVEVDIDASGILYLVNGVKMVCLAFAANKD--DSEFGIFGNHQQKTYEVVYDGAKG 464
Query: 424 RLGVARELCT 433
+G A C+
Sbjct: 465 MIGFAAGACS 474
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 168/352 (47%), Gaps = 30/352 (8%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGC---SSTVFNSAQSTTFKNLGCQAA 152
Y+ R +GTPA+ +M +DT + W+ C+ C V C S VF+ S+++ + C
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTP 196
Query: 153 QCKQVPNPTCGGGACA------FNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCIQKA 205
QC + T AC+ + +YG S+ + LS+DT+S ++ VP + +GC Q
Sbjct: 197 QCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFGSNSVPNFYYGCGQDN 256
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
G GL+GL R LSLL Q +FSYCLPS + + P +
Sbjct: 257 EGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYLSIGSY----NPGQYS 312
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
YTP++ + SLY++ L + V + P A+ + + TIIDSGTV TRL
Sbjct: 313 YTPMVSSTLDDSLYFIKLSGMTVAGK-----PLAVSSSEYSSLPTIIDSGTVITRLPTTV 367
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIH 381
Y A+ + + DTC+ + + P +++ FS G + L NLL+
Sbjct: 368 YDALSKAVAGAMKGTKRADAYSILDTCFVGQASSLRVPAVSMAFSGGAALKLSAQNLLVD 427
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ S TCLA A A +I N QQQ ++YDV ++R+G A CT
Sbjct: 428 VDS-STTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSNRIGFAAGGCT 473
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 124/395 (31%), Positives = 173/395 (43%), Gaps = 46/395 (11%)
Query: 71 FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
F L +S V + SG Y + +GTP + + +DT +D W+ C C
Sbjct: 176 FSGQLVATLESGVSLGSGE-------YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYA 228
Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT----CGGG--ACAFNLTYGSS----- 176
C + ++ S++FKN+ C +C+ V +P C G +C + YG S
Sbjct: 229 CFEQNGPYYDPKDSSSFKNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTG 288
Query: 177 -----TIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
T NL+ IV FGC G GLLGLGRG LS Q Q+
Sbjct: 289 DFALETFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQS 348
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLL---------KNPRRSSLYYVN 282
LY +FSYCL + S S + G+ K + P L +NP + YYV
Sbjct: 349 LYGHSFSYCLVDRNSNSSVSSKLI--FGEDKELLSHPNLNFTSFVGGKENPV-DTFYYVL 405
Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
+ +I VG V+ IP + G GTIIDSGT T PAY +++ F R++
Sbjct: 406 IKSIMVGGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL 465
Query: 343 VTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
V + CY+V V P ++F+ G P +N I + CLA+ P
Sbjct: 466 VETFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTP- 524
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S L++I N QQQN ILYD+ SRLG A C
Sbjct: 525 --RSALSIIGNYQQQNFHILYDLKKSRLGYAPMKC 557
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 138/430 (32%), Positives = 198/430 (46%), Gaps = 68/430 (15%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
S+ L++ H PC+P + S + SV + L DQ R +++ S A A
Sbjct: 65 SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST- 134
+ VP + G I + Y+V A +GTP M +DT +D +WV PC+ C S
Sbjct: 123 AAATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQK 181
Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----------GACAFNLTYG-SSTIAA 180
+F+ AQS+++ + C P C G C + ++YG S
Sbjct: 182 DPLFDPAQSSSYAAVPCG--------GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTG 233
Query: 181 NLSQDTISL-ATDIVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
S DT++L A+ V G+ FGC +G N V GLLGLGR SL+ QT Y F
Sbjct: 234 VYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVF 291
Query: 238 SYCLPSFKALSFSGSLRL-GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
SYCLP+ + + +L L GP G T LL +P + Y V L I VG + + +P
Sbjct: 292 SYCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVP 351
Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGFDTCYS 354
A GT++D+GTV TRL AY A+R FR + S T S G DTCY+
Sbjct: 352 ASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYN 405
Query: 355 VP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
+ P + L F SG V L D +L S CLA AP + + ++ N+
Sbjct: 406 FAGYGTVTLPNVALTFGSGATVMLGADGIL------SFGCLAF--APSGSDGGMAILGNV 457
Query: 410 QQQNHRILYD 419
QQ++ + D
Sbjct: 458 QQRSFEVRID 467
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 165/359 (45%), Gaps = 20/359 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
PI SG S Y R +G PA+ M +DT +D W+ PCT C + +F+
Sbjct: 143 PIISGTS-QGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRS 201
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLA-TDIVPGYT 198
S++F +L C++ QC+ + C C + ++YG + +T++ + ++
Sbjct: 202 SSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVTETLTFGNSGMINDVA 261
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
GC G V G GL L+ T + S+FSYCL + S S L
Sbjct: 262 VGCGHDNEGLFV---GSAGLLGLGGGPLSLTSQMKASSFSYCLVD-RDSSSSSDLEFNSA 317
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
P PLLK+ + + YYV L + VG +++ IPP Q + + G I+DSGT
Sbjct: 318 A-PSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAI 376
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTL 373
TRL AY +RD F R FDTCY + + PT++ F+ G ++ L
Sbjct: 377 TRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQL 436
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
P N LI + C A A S L++I N+QQQ R+ YD+ NS +G + C
Sbjct: 437 PPKNYLIPVDSVGTFCFAFAP----TTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/368 (29%), Positives = 171/368 (46%), Gaps = 22/368 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG + S Y V +GTP Q + +D+ +D WV C C+ C + ++ +
Sbjct: 53 PVVSGSTLG-SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSN 111
Query: 141 STTFKNLGCQAAQCKQVPNPTCG-------GGACAFNLTYGSSTIAANLSQDTISLATDI 193
S+TF + C + +C +P T G GACA+ Y ++++ + + D+
Sbjct: 112 SSTFNPVPCLSPECLLIP-ATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDV 170
Query: 194 -VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-KALSFSG 251
+ FGC + G+ G+LGLG+G LS +Q Y + F+YCL ++ S S
Sbjct: 171 RIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSS 230
Query: 252 SLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
L G I +++TP++ N R +LYYV + + VG + I A + G
Sbjct: 231 WLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGG 290
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLM 365
+I DSGT T + PAY + F + V S+ G D C V V P+ T++
Sbjct: 291 SIFDSGTTVTYWLPPAYRNILAAFDKNV-RYPRAASVQGLDLCVDVTGVDQPSFPSFTIV 349
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G V PQ A ++ CLAMA P +V N I N+ QQN + YD +R+
Sbjct: 350 LGGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGG-FNTIGNLLQQNFLVQYDREENRI 408
Query: 426 GVARELCT 433
G A C+
Sbjct: 409 GFAPAKCS 416
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 117/371 (31%), Positives = 165/371 (44%), Gaps = 37/371 (9%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
S Y + IGTP + + +DT +D W+ C C C + ++ +S++FKN+GC
Sbjct: 189 SGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCH 248
Query: 151 AAQCKQVPNP------TCGGGACAFNLTYGSS----------TIAANLSQDTISLATDIV 194
+C V +P C + YG S T NL+ V
Sbjct: 249 DPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRV 308
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSL 253
FGC G GLLGLGRG LS +Q Q+LY +FSYCL + + S L
Sbjct: 309 ENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKL 368
Query: 254 RLGP----IGQPKRIKYTPLL---KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
G + P+ + +T L+ +NP + YYV + +I VG V+ IP +P
Sbjct: 369 IFGEDKDLLNHPE-VNFTSLVAGKENPV-DTFYYVQIKSIMVGGEVLKIPEETWHLSPEG 426
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTI 362
GTI+DSGT + P+Y ++D F ++V + D CY+V V P
Sbjct: 427 AGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCYNVSGVEKMELPEF 486
Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
++F G P +N I I CLA+ P S L++I N QQQN ILYD
Sbjct: 487 RILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTP---RSALSIIGNYQQQNFHILYDTK 543
Query: 422 NSRLGVARELC 432
SRLG A C
Sbjct: 544 KSRLGYAPMKC 554
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 126/381 (33%), Positives = 176/381 (46%), Gaps = 45/381 (11%)
Query: 84 PIASGRQITQSPT--YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNS 138
P++ G PT Y+V IGTP Q + + +DT +D W C CV C F++
Sbjct: 20 PVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDT 79
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTC------GGGACAFNLTYGSSTIAAN-LSQDTIS-LA 190
++S+T L C++ QCK P T CA+ +YG +++ L+ D + +A
Sbjct: 80 SRSSTNALLPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVA 139
Query: 191 TDIVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC-------L 241
+PG TFGC TG NS G+ G GRG LSL +Q L FS+C +
Sbjct: 140 GTSLPGVTFGCGLNNTGVFNSN-ETGIAGFGRGPLSLPSQ---LKVGNFSHCFTTITGAI 195
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLL---KNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
PS L L G ++ TPL+ KN +LYY++L I VG + +P
Sbjct: 196 PSTVLLDLPADLFSNGQG---AVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPES 252
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV 358
A TG GTIIDSGT T L Y VRD F ++ + + G TC+S P
Sbjct: 253 AFALTNGTG-GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQ 311
Query: 359 A----PTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
A P + L F G + LP++N + SI CLA+ N +I N QQ
Sbjct: 312 AKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAI-----NKGDETTIIGNFQQ 366
Query: 412 QNHRILYDVPNSRLGVARELC 432
QN +LYD+ N+ L C
Sbjct: 367 QNMHVLYDLQNNMLSFVAAQC 387
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 175/359 (48%), Gaps = 21/359 (5%)
Query: 84 PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
P+ SG TQ S Y R IG PA+ + M +DT +D W+ CT C C + +F +
Sbjct: 139 PLISG--TTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPS 196
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT 198
S++++ L C QC + C C + ++YG S + + +T+++ + +V
Sbjct: 197 SSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVA 256
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
GC G V GLLGLG G L+L +Q L ++FSYCL + S S ++ G
Sbjct: 257 VGCGHSNEGLFVGAAGLLGLGGGLLALPSQ---LNTTSFSYCLVDRDSDSAS-TVEFGTS 312
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
P + PLL+N + + YY+ L I VG ++ IP + + + + G IIDSGT
Sbjct: 313 LPPDAV-VAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAV 371
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMF-SGMNVTL 373
TRL Y ++RD F + + FDTCY++ I PT+ F G + L
Sbjct: 372 TRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLAL 431
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
P N +I + CLA A S L +I N+QQQ R+ +D+ NS +G + C
Sbjct: 432 PAKNYMIPVDSVGTFCLAFAPTA----SSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 165/359 (45%), Gaps = 20/359 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQ 140
PI SG S Y R +G PA+ M +DT +D W+ PCT C + +F+
Sbjct: 143 PIISGTS-QGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRS 201
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLA-TDIVPGYT 198
S++F +L C++ QC+ + C C + ++YG + +T++ + ++
Sbjct: 202 SSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVIETLTFGNSGMINNVA 261
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
GC G V G GL L+ T + S+FSYCL + S S L
Sbjct: 262 VGCGHDNEGLFV---GSAGLLGLGGGSLSLTSQMKASSFSYCLVD-RDSSSSSDLEFNSA 317
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
P PLLK+ + + YYV L + VG +++ IPP Q + + G I+DSGT
Sbjct: 318 A-PSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAI 376
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTL 373
TRL AY +RD F R FDTCY + + PT++ F+ G ++ L
Sbjct: 377 TRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQL 436
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
P N LI + C A A S L++I N+QQQ R+ YD+ NS +G + C
Sbjct: 437 PPKNYLIPVDSVGTFCFAFAP----TTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 117/379 (30%), Positives = 185/379 (48%), Gaps = 23/379 (6%)
Query: 66 QARLQFLSSLAVARKSVV--PIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
+A L+ +S++ + + P+ SG TQ S Y R IG PA+ + M +DT +D W
Sbjct: 116 KADLKPISTMYTTEEQDIEAPLISG--TTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNW 173
Query: 123 VPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STI 178
+ CT C C + +F + S++++ L C QC + C C + ++YG S
Sbjct: 174 LQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYT 233
Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
+ + +T+++ + +V GC G V GLLGLG G L+L +Q L ++FS
Sbjct: 234 VGDFATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQ---LNTTSFS 290
Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
YCL + S S ++ G P + PLL+N + + YY+ L I VG ++ IP
Sbjct: 291 YCLVDRDSDSAS-TVDFGTSLSPDAV-VAPLLRNHQLDTFYYLGLTGISVGGELLQIPQS 348
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-- 356
+ + + + G IIDSGT TRL Y ++RD F + + FDTCY++
Sbjct: 349 SFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAK 408
Query: 357 --IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ PT+ F G + LP N +I + CLA A S L +I N+QQQ
Sbjct: 409 TTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTA----SSLAIIGNVQQQG 464
Query: 414 HRILYDVPNSRLGVARELC 432
R+ +D+ NS +G + C
Sbjct: 465 TRVTFDLANSLIGFSSNKC 483
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 124/408 (30%), Positives = 192/408 (47%), Gaps = 39/408 (9%)
Query: 54 WEESVLEMLAKDQARLQFLSS---LAVA----------------RKSVVPIASGRQITQS 94
++ VL L +D R++ L++ LA+A P+ SG S
Sbjct: 94 YKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALETPLVSGAS-QGS 152
Query: 95 PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQA 151
Y R IG+P + + M +DT +D WV C C C + +F + S+++ L C+
Sbjct: 153 GEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCET 212
Query: 152 AQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPGYTFGCIQKATGNS 209
QCK + C +C + ++YG S + + +TI+L + + GC G
Sbjct: 213 HQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGCGHDNEGLF 272
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
V GLLGLG GSLS +Q + S+FSYCL + S S PI P PL
Sbjct: 273 VGAAGLLGLGGGSLSFPSQ---INASSFSYCLVNRDTDSASTLEFNSPI--PSHSVTAPL 327
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
L+N + + YY+ + I VG +++ IP + + + + G I+DSGT TRL + Y ++
Sbjct: 328 LRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSDVYNSL 387
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLPQDNLLIHSTA 384
RD F R + + + FDTCY + + PT++ F G + LP N LI +
Sbjct: 388 RDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLALPAKNYLIPVDS 447
Query: 385 GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A S L++I N+QQQ R+ YD+ NS +G + C
Sbjct: 448 AGTFCFAFAP----TTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 114/352 (32%), Positives = 171/352 (48%), Gaps = 27/352 (7%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGCSST---VFNSAQSTTFKNLGC 149
S Y++ GTP +T + DT +D W+ C C V C + +F+ + S+T++N+ C
Sbjct: 13 SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSC 72
Query: 150 QAAQCKQVPNPTCGGGACAFNLTYG--SSTIAANLSQDTISLA-TDIVPGYTFGCIQKAT 206
C + C C + + YG SSTI L+ DT L + FGC Q T
Sbjct: 73 TEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGF-LAMDTFMLTPAQKFKNFIFGCGQNNT 131
Query: 207 GNSVPPQGLLGLGRGS-LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI- 264
G GL+GLGR S SL +Q + FSYCLPS S +G L IG P+
Sbjct: 132 GLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTS--SATGYLN---IGNPQNTP 186
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
YT +L + R +LY+++L+ I VG + + Q GTIIDSGTV TRL
Sbjct: 187 GYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIIDSGTVITRLPPT 241
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFSGMNVTLPQDNLLI 380
AY+A++ R + ++ DTCY + +V P I L F+G++V +P +
Sbjct: 242 AYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPATGVFF 301
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + CLA A D ++++ +I N+QQ + YD R+G + C
Sbjct: 302 VFNSSQV-CLAFAGNTD--STMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 130/429 (30%), Positives = 201/429 (46%), Gaps = 50/429 (11%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS---VVPIAS 87
S L + + PCS S+P S +E + +D++R+ F++S S +
Sbjct: 62 SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKCNQYTSGNLKNHAHN 117
Query: 88 GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTF 144
+ ++V GTP Q + +DT + W C CV C S F+S S+T+
Sbjct: 118 NNLFDEDGNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTY 177
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCI 202
C +P+ T G +N+TYG ST N DT++L +D+ + FGC
Sbjct: 178 SFGSC-------IPS-TVGN---TYNMTYGDKSTSVGNYGCDTMTLEPSDVFQKFQFGCG 226
Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IG 259
+ G+ G+LGLG+G LS ++QT + ++ FSYCLP ++ GSL G
Sbjct: 227 RNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSI---GSLLFGEKATS 283
Query: 260 QPKRIKYTPLLKNP-----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
Q +K+T L+ P S Y+V LL I VG + ++IP GTIIDS
Sbjct: 284 QSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDS 338
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVG----SNLTVTSLGGFDTCYSV----PIVAPTITLMF 366
GTV TRL AY+A++ F++ + SN DTCY++ ++ P L F
Sbjct: 339 GTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHF 398
Query: 367 -SGMNVTLPQDNLLIHSTAGSITCLAMAA-APDNVNSVLNVIANMQQQNHRILYDVPNSR 424
G +V L ++ + A + CLA A + +N L +I N QQ + +LYD+ R
Sbjct: 399 GDGADVRLNGKRVVWGNDASRL-CLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRR 457
Query: 425 LGVARELCT 433
+G C+
Sbjct: 458 IGFGGNGCS 466
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 133/382 (34%), Positives = 193/382 (50%), Gaps = 39/382 (10%)
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
V P+ S R T S Y+ + +GTPA L+AMDT +D W+ C C C S VF+
Sbjct: 120 VAPVVS-RAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDP 178
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGG-----ACAFNLTYGS--STIAANLSQDTISLAT 191
ST+++ +G A C+ + GGG C + + YG ST + ++T++ A
Sbjct: 179 RHSTSYREMGYDAPDCQALGR--SGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAG 236
Query: 192 DI-VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNL-YQST-FSYCLPSF--- 244
+ VP + GC G + P G+LGLGRG +S +Q L Y T FSYCL F
Sbjct: 237 GVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLS 296
Query: 245 -KALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP-PGA 299
S S +L +G G P +TP ++N ++ YYV L+ + VG V
Sbjct: 297 SPGRSVSSTLTIGDGAAAGSPPP-SFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDD 355
Query: 300 LQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG----FDTCYS 354
L+ +P TG G I+DSGT TRL AY A RD FR +L S+GG FDTCY+
Sbjct: 356 LKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAA-VDLGQVSIGGPSGFFDTCYT 414
Query: 355 V---PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
+ + PT+++ F+ G+ +TLP N LI + C A A D +++I N+Q
Sbjct: 415 MGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDR---SVSIIGNIQ 471
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQ R++Y++ R+G A C
Sbjct: 472 QQGFRVVYNIGGGRVGFAPNSC 493
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 126/438 (28%), Positives = 191/438 (43%), Gaps = 56/438 (12%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK-------SVVP 84
+++ + H PC+P S + S E L D+AR + A R+ + +P
Sbjct: 54 ASVPLAHRHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRMMSEGGGASIP 113
Query: 85 IASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSA 139
G S Y+V IGTPA + +DT +D +WV C + C +F+ +
Sbjct: 114 TYLG-GFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPS 172
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGA----------CAFNLTYGSSTIAANL-SQDTIS 188
+S+TF + C + CKQ+P G C + + YG+ I + S +T++
Sbjct: 173 KSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLA 232
Query: 189 LATD-IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
L + +V + FGC G GLLGLG SL++QT ++Y FSYCLP
Sbjct: 233 LGSSAVVKSFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLN-- 290
Query: 248 SFSGSLRLGPIGQPKR----IKYTPLLK-NPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
S +G L LG +TP+ +P+ ++ Y V L I VG + +DIPP
Sbjct: 291 SGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFA- 349
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSV----PI 357
G I+DSGTV T + AY A+R FR + L + DTCY+ +
Sbjct: 350 -----KGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTV 404
Query: 358 VAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNH 414
P + L F G +++ +P L+ CLA A A D +I N+ +
Sbjct: 405 TVPKVALTFVGGATVDLDVPSGVLVED-------CLAFADAGDG---SFGIIGNVNTRTI 454
Query: 415 RILYDVPNSRLGVARELC 432
+LYD LG C
Sbjct: 455 EVLYDSGKGHLGFRAGAC 472
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 158/346 (45%), Gaps = 25/346 (7%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS---TVFNSAQSTTFKNLG 148
T + ++V+ +G P Q M D D W+ C C+ C ++F+ +QS+++ L
Sbjct: 182 TGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYTLLS 241
Query: 149 CQAAQCKQVPNPTCGG-GACAFNLTYGSSTIAAN-LSQDTISL-ATDIVPGYTFGCIQKA 205
C+ C +PN +C G C +N+TY T L +T+S ++ V + GC K
Sbjct: 242 CETKHCNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSGWVDRVSLGCSNKN 301
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
G V G GLGRGSLS ++ S+ SYCL K S +L +K
Sbjct: 302 QGPFVGSDGTFGLGRGSLSFPSRIN---ASSMSYCLVESKDGYSSSTLEFNSPPCSGSVK 358
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
LL+NP+ +LYYV L I+VG +D+P +P G I+ S ++ T L
Sbjct: 359 -AKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLITMLENDT 417
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYS--------VPIVAPTITLMFSGMNVTLPQDN 377
Y VRD F + + + FDTCY+ +PI+ + G + LP+++
Sbjct: 418 YNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEFEVN---DGKSWLLPKES 474
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
L C A A + + +++ +QQ R+ +D+ NS
Sbjct: 475 YLYAVDKNGTFCFAFAPSKGS----FSILGTLQQYGTRVTFDLVNS 516
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 126/437 (28%), Positives = 192/437 (43%), Gaps = 54/437 (12%)
Query: 26 DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL----SSLAVAR-- 79
D+ +T+ + H PCSP K + + E+L +DQ R ++ S R
Sbjct: 52 DSSSSGATVPLNHRHGPCSPVPSGK--KKQPTFTELLRRDQLRANYIQRQFSDEHYPRTG 109
Query: 80 -----KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST 134
++ VPIA G + + Y++ IG+PA M +DT +D +W+ C S
Sbjct: 110 GLQQSEATVPIALG-SLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRC------KSR 162
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLA 190
+++ S+T+ C A C Q+ G G C +++ YG S DT++LA
Sbjct: 163 LYDPGTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLA 222
Query: 191 ---TDIVPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
++ G+ FGC G GL+GLG + S ++QT Y S FSYCLP
Sbjct: 223 GTSEPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLP--PT 280
Query: 247 LSFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
+ SG L LG TP+L++ + ++ Y + L I VG + ++IP
Sbjct: 281 WNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFS--- 337
Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVP------ 356
AG+I+DSGTV TRL AY A+ FR + G DTC+
Sbjct: 338 ---AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEGN 394
Query: 357 -IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P++ L+ G V N ++ CLA AA D+ + +I N+QQ+
Sbjct: 395 NFTVPSVALVLDGGAVVDLHPNGIVQD-----GCLAFAATDDDGRT--GIIGNVQQRTFE 447
Query: 416 ILYDVPNSRLGVARELC 432
+LYDV S G C
Sbjct: 448 VLYDVGQSVFGFRPGAC 464
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 117/362 (32%), Positives = 181/362 (50%), Gaps = 19/362 (5%)
Query: 85 IASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQS 141
I+SG + S Y R IG P ++ + +DT +D W+ C C C S V ++ + S
Sbjct: 1 ISSGLSLG-SGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNS 59
Query: 142 TTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISL---ATDIVPGY 197
++++ + C +A C+ + C G C++ + YG SS + +L ++ L ++ +
Sbjct: 60 SSYRRVYCGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNI 119
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKAL-SFSGSLRL 255
FGC +G GLLG+G G+LS +Q FSYCL + L S S L
Sbjct: 120 AFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIF 179
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
G P ++TPLLKNPR ++ YY L I VG + IPP G I+DSG
Sbjct: 180 GRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSG 239
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS---VPIVA-PTITLMF-SGMN 370
T TR+V PAY +RD +R + + DTC++ +P V P++ L F +G++
Sbjct: 240 TSVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVD 299
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+ LP N+LI CLA A + + ++VI N+QQQ RI +D+ S + +A
Sbjct: 300 MVLPGGNILIPVDRSGTFCLAFAPS----SMPISVIGNVQQQTFRIGFDLQRSLIAIAPR 355
Query: 431 LC 432
C
Sbjct: 356 EC 357
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 132/425 (31%), Positives = 198/425 (46%), Gaps = 74/425 (17%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-----LAVARKSVVPI 85
S L + + PCS S+P S +E + +D++R+ F++S K P
Sbjct: 97 SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKFNQYAPENLKDHTP- 151
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQST 142
+ + + ++V GTP Q + +DT + W C CV C S F+ + S
Sbjct: 152 -NNKLFDEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASL 210
Query: 143 TFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFG 200
T+ C +P+ T G +N+TYG ST N DT++L +D+ P + FG
Sbjct: 211 TYSLGSC-------IPS-TVGN---TYNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQFG 259
Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-- 257
C + G+ G+LGLG+G LS ++QT + ++ FSYCLP ++ GSL G
Sbjct: 260 CGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSI---GSLLFGEKA 316
Query: 258 IGQPKRIKYTPLLKNP-----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
Q +K+T L+ P S Y+V LL I VG + ++IP GTII
Sbjct: 317 TSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTII 371
Query: 313 DSGTVFTRLVAPAYTAVRDVF------------RRRVGSNLTVTSLGGFDTCYSV----P 356
DSGTV TRL AY+A++ F RR+ G L DTCY++
Sbjct: 372 DSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL--------DTCYNLSGRKD 423
Query: 357 IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
++ P I L F G +V L ++ + A + CLA A NS L +I N QQ +
Sbjct: 424 VLLPEIVLHFGEGADVRLNGKRVIWGNDASRL-CLAFAG-----NSELTIIGNRQQVSLT 477
Query: 416 ILYDV 420
+LYD+
Sbjct: 478 VLYDI 482
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 172/373 (46%), Gaps = 30/373 (8%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG + S Y V +GTP Q + +DT +D A+V C C C ++ +
Sbjct: 22 PLVSGTTLG-SGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSN 80
Query: 141 STTFKNLGCQAAQCKQVPNPT---CGG--------GACAFNLTYG--SSTIAANLSQDTI 187
S+TF + C +A+C +P P C GAC++ YG SST+ + +T
Sbjct: 81 SSTFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGV-FAYETA 139
Query: 188 SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-KA 246
++ V FGC + G+ V G+LGLG+G+LS +Q +++ F+YCL S+
Sbjct: 140 TVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSP 199
Query: 247 LSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
S SL G + +++TPL+ NP S+YYV ++ I G + IP A + +
Sbjct: 200 TSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDS 259
Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-----PIVA 359
GTI DSGT T AY + F + V S G C +V PI
Sbjct: 260 VGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGIDHPIY- 318
Query: 360 PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
P+ T+ F P + +I CLAM ++ + NVI N+ QQN+ + YD
Sbjct: 319 PSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAML---ESSSDGFNVIGNIIQQNYLVQYD 375
Query: 420 VPNSRLGVARELC 432
R+G A C
Sbjct: 376 REEHRIGFAHANC 388
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 123/382 (32%), Positives = 176/382 (46%), Gaps = 46/382 (12%)
Query: 84 PIASGRQITQSP--TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNS 138
PI + R + + Y++ IGTP +DT +D W C CV C+ + F
Sbjct: 77 PITAARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRP 136
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAANLSQDTISLATD---- 192
A+S T++ + C++ C +P P C C + YG ++ A L+ +T +
Sbjct: 137 ARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSK 196
Query: 193 -IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----- 246
+V FGC +G G++GLGRG LSL++Q L S FSYCL SF +
Sbjct: 197 VMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQ---LGPSRFSYCLTSFLSPEPSR 253
Query: 247 LSFSGSLRLGPI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
L+F L G P ++ TPL+ N SLY+++L I +G++ + I P
Sbjct: 254 LNFGVFATLNGTNASSSGSP--VQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRD----VFRRRVGSNLTVTSLGGFDTCY---- 353
N G IDSGT T L AY AVR V R +N T G +TC+
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEI---GLETCFPWPP 368
Query: 354 --SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
SV + P + L F G N+T+P +N ++ A CLAM + D +I N Q
Sbjct: 369 PPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-----TIIGNYQ 423
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQN ILYD+ NS L C
Sbjct: 424 QQNMHILYDIANSLLSFVPAPC 445
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 123/382 (32%), Positives = 176/382 (46%), Gaps = 46/382 (12%)
Query: 84 PIASGRQITQSP--TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNS 138
PI + R + + Y++ IGTP +DT +D W C CV C+ + F
Sbjct: 77 PITAARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRP 136
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTC-GGGACAFNLTYG-SSTIAANLSQDTISLATD---- 192
A+S T++ + C++ C +P P C C + YG ++ A L+ +T +
Sbjct: 137 ARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSK 196
Query: 193 -IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----- 246
+V FGC +G G++GLGRG LSL++Q L S FSYCL SF +
Sbjct: 197 VMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQ---LGPSRFSYCLTSFLSPEPSR 253
Query: 247 LSFSGSLRLGPI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
L+F L G P ++ TPL+ N SLY+++L I +G++ + I P
Sbjct: 254 LNFGVFATLNGTNASSSGSP--VQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFDTCY---- 353
N G IDSGT T L AY AVR V R +N T G +TC+
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEI---GLETCFPWPP 368
Query: 354 --SVPIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
SV + P + L F G N+T+P +N ++ A CLAM + D +I N Q
Sbjct: 369 PPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-----TIIGNYQ 423
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQN ILYD+ NS L C
Sbjct: 424 QQNMHILYDIANSLLSFVPAPC 445
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 130/439 (29%), Positives = 209/439 (47%), Gaps = 56/439 (12%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------SLAVARKSV- 82
S+TL++ H CS K + + + L D R+Q L + + +SV
Sbjct: 16 ESTTLEMKHR-ELCS----GKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVS 70
Query: 83 ---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VF 136
+P+ SG ++ +S YIV ++G +L++ DT +D WV C C C + ++
Sbjct: 71 ETQIPLTSGIKL-ESLNYIVTVELGGKNMSLIV--DTGSDLTWVQCQPCRSCYNQQGPLY 127
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-----CGGGA------CAFNLTYGS-STIAANLSQ 184
+ + S+++K + C ++ C+ + T CGG C + ++YG S +L+
Sbjct: 128 DPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLAS 187
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
++I L + + FGC + G GL+GLGR S+SL++QT + FSYCLPS
Sbjct: 188 ESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL 247
Query: 245 KALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
+ SGSL G + YTPL++NP+ S Y +NL +G G
Sbjct: 248 ED-GASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIG--------GVE 298
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----P 356
+ + G G +IDSGTV TRL Y AV+ F ++ T DTC+++
Sbjct: 299 LKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYED 358
Query: 357 IVAPTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
I P I ++F G N L D + S+ CLA+A+ + + + +I N QQ+N
Sbjct: 359 ISIPIIKMIFQG-NAELEVDVTGVFYFVKPDASLVCLALASL--SYENEVGIIGNYQQKN 415
Query: 414 HRILYDVPNSRLGVARELC 432
R++YD RLG+ E C
Sbjct: 416 QRVIYDTTQERLGIVGENC 434
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 130/433 (30%), Positives = 191/433 (44%), Gaps = 56/433 (12%)
Query: 19 EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF------- 71
E P+ + + TL V + C P S S E+ D+ R+++
Sbjct: 57 EPAGPVIAPRQRNGTLAVLRLAHRCGPSTASA------SFAEVQRADEQRVEYIQRRVSG 110
Query: 72 ---------LSSLAV-ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
L LA +R + VP G Q Y+V +GTP + + +DT +D +
Sbjct: 111 GGARGAKGALQQLATGSRSATVPTTMGVGTFQ---YVVTVSLGTPGVSQTVEVDTGSDVS 167
Query: 122 WVPCTGCVG--CSST---VFNSAQSTTFKNLGCQAAQCKQ--VPNPTCGGGACAFNLTYG 174
WV C C C+S +F+ A+S+T+ + C A C + + C G C + ++YG
Sbjct: 168 WVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYG 227
Query: 175 -SSTIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
S DT++LA + V + FGC G GLL LGR S+SL +Q
Sbjct: 228 DGSNTTGVYGSDTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGA 287
Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
Y FSYCLPS + S +G L LG T LL + Y V L I VG +
Sbjct: 288 YGGVFSYCLPSKQ--SAAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQ 345
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFD 350
V +P A GT++D+GTV TRL AY A+R FR + + + G D
Sbjct: 346 VAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILD 399
Query: 351 TCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
TCY + PT+ L FSG TL + I S+ CLA AP+ + ++
Sbjct: 400 TCYDFSRYGVVTLPTVALTFSG-GATLALEAPGILSSG----CLAF--APNGGDGDAAIL 452
Query: 407 ANMQQQNHRILYD 419
N+QQ++ + +D
Sbjct: 453 GNVQQRSFAVRFD 465
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 120/379 (31%), Positives = 173/379 (45%), Gaps = 42/379 (11%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSA 139
VP+ +G + +++ +GTPA +DT +D W C CV C ++ VF+ A
Sbjct: 107 VPVHAG-----NGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPA 161
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGAC--------AFNLTYG-SSTIAANLSQDTISLA 190
S+T+ L C +A C +P TC + + TYG +S+ L+ +T +LA
Sbjct: 162 ASSTYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLA 221
Query: 191 TDIVPGYTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
VPG FGC G+ GL+GLGRG LSL++Q L FSYCL S +
Sbjct: 222 RQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGIDRFSYCLTSLDDAAG 278
Query: 250 SGSLRLGPIGQPKRI------KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
L LG + TPL+KNP + S YYV+L + VG + +P A
Sbjct: 279 RSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQ 338
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP------- 356
G I+DSGT T L AY A+R F + S G D C+ P
Sbjct: 339 DDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVDQD 398
Query: 357 --IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ P + L F G ++ LP +N ++ +A CL + A + L++I N QQQN
Sbjct: 399 VQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMA-----SRGLSIIGNFQQQN 453
Query: 414 HRILYDVPNSRLGVARELC 432
+ +YDV L A C
Sbjct: 454 FQFVYDVAGDTLSFAPAEC 472
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/395 (30%), Positives = 192/395 (48%), Gaps = 29/395 (7%)
Query: 62 LAKDQARLQFL-----SSLAVARKS-----VVPIASGRQITQSPTYIVRAKIGTPAQTLL 111
+ +D+ARL+++ SS R+ ++SG + S Y R IG+P ++
Sbjct: 1 MERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLG-SGEYFARMGIGSPQRSYY 59
Query: 112 MAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA 168
+ +DT +D W+ C C C S V ++ + S++++ + C +A C+ + C G C+
Sbjct: 60 LELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMGCS 119
Query: 169 FNLTYG-SSTIAANLSQDTISL---ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
+ + YG SS + +L ++ L ++ + FGC +G GLLG+G G+LS
Sbjct: 120 YRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLS 179
Query: 225 LLAQTQNLYQSTFSYCL-PSFKAL-SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVN 282
+Q FSYCL + L S S L G P ++TPLLKNPR + YY
Sbjct: 180 FFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDTFYYAI 239
Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
L I VG + IPP G I+DSGT TR+V AY +RD +R +
Sbjct: 240 LTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPP 299
Query: 343 VTSLGGFDTCYS---VPIVA-PTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
+ DTC++ +P V P++ L F +++ LP N+LI CLA A +
Sbjct: 300 APGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPS-- 357
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ ++VI N+QQQ RI +D+ S + +A C
Sbjct: 358 --SMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 130/439 (29%), Positives = 208/439 (47%), Gaps = 56/439 (12%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------SLAVARKSV- 82
S+TL++ H CS K + + + L D R+Q L + + +SV
Sbjct: 64 ESTTLEMKHR-ELCS----GKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVS 118
Query: 83 ---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VF 136
+P+ SG ++ +S YIV ++G +L++ DT +D WV C C C + ++
Sbjct: 119 ETQIPLTSGIKL-ESLNYIVTVELGGKNMSLIV--DTGSDLTWVQCQPCRSCYNQQGPLY 175
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-----CGGGA------CAFNLTYGS-STIAANLSQ 184
+ + S+++K + C ++ C+ + T CGG C + ++YG S +L+
Sbjct: 176 DPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLAS 235
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
++I L + + FGC + G GL+GLGR S+SL++QT + FSYCLPS
Sbjct: 236 ESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL 295
Query: 245 KALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
+ SGSL G + YTPL++NP+ S Y +NL +G G
Sbjct: 296 ED-GASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIG--------GVE 346
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----P 356
+ + G G +IDSGTV TRL Y AV+ F ++ T DTC+++
Sbjct: 347 LKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYED 406
Query: 357 IVAPTITLMFSGMNVTLPQD---NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
I P I ++F G N L D S+ CLA+A+ + + + +I N QQ+N
Sbjct: 407 ISIPIIKMIFQG-NAELEVDVTGVFYFVKPDASLVCLALASL--SYENEVGIIGNYQQKN 463
Query: 414 HRILYDVPNSRLGVARELC 432
R++YD RLG+ E C
Sbjct: 464 QRVIYDTTQERLGIVGENC 482
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 114/356 (32%), Positives = 158/356 (44%), Gaps = 28/356 (7%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y + +GTP T + DT +D W PCT C + F A S+TF L C ++
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKATGNSV 210
C+ +PN TC C +N YGS A L+ +T+ + P FGC + GNS
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTAGYLATETLKVGDASFPSVAFGCSTENGVGNST 205
Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTP 268
G+ GLGRG+LSL+ Q L FSYCL S A S + G + ++ TP
Sbjct: 206 --SGIAGLGRGALSLIPQ---LGVGRFSYCLRSGSAAGAS-PILFGSLANLTDGNVQSTP 259
Query: 269 LLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAP 324
+ NP S YYVNL I VG D+P F T G GTI+DSGT T L
Sbjct: 260 FVNNPAVHPSYYYVNLTGITVGE--TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 317
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYS------VPIVAPTITLMFS-GMNVTLPQDN 377
Y V+ F + TV G D C+ I P++ L F G +P
Sbjct: 318 GYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYF 377
Query: 378 LLIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + + GS+T + P + ++VI N+ Q + +LYD+ A C
Sbjct: 378 AGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 433
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 130/433 (30%), Positives = 191/433 (44%), Gaps = 56/433 (12%)
Query: 19 EGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF------- 71
E P+ + + TL V + C P S S E+ D+ R+++
Sbjct: 57 EPAGPVIAPRQRNGTLAVLRLAHRCGPSTASA------SFAEVQRADEQRVEYIQRRVSG 110
Query: 72 ---------LSSLAV-ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
L LA +R + VP G Q Y+V +GTP + + +DT +D +
Sbjct: 111 GGARGAKGALQQLATGSRSATVPTTMGVGTFQ---YVVTVSLGTPGVSQTVEVDTGSDVS 167
Query: 122 WVPCTGCVG--CSST---VFNSAQSTTFKNLGCQAAQCKQ--VPNPTCGGGACAFNLTYG 174
WV C C C+S +F+ A+S+T+ + C A C + + C G C + ++YG
Sbjct: 168 WVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYG 227
Query: 175 -SSTIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
S DT++LA + V + FGC G GLL LGR S+SL +Q
Sbjct: 228 DGSNTTGVYGSDTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGA 287
Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
Y FSYCLPS + S +G L LG T LL + Y V L I VG +
Sbjct: 288 YGGVFSYCLPSKQ--SAAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQ 345
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFD 350
V +P A GT++D+GTV TRL AY A+R FR + + + G D
Sbjct: 346 VAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILD 399
Query: 351 TCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
TCY + PT+ L FSG TL + I S+ CLA AP+ + ++
Sbjct: 400 TCYDFSRYGVVTLPTVALTFSG-GATLALEAPGILSSG----CLAF--APNGGDGDAAIL 452
Query: 407 ANMQQQNHRILYD 419
N+QQ++ + +D
Sbjct: 453 GNVQQRSFAVRFD 465
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 130/439 (29%), Positives = 208/439 (47%), Gaps = 56/439 (12%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------SLAVARKSV- 82
S+TL++ H CS K + + + L D R+Q L + + +SV
Sbjct: 64 ESTTLEMKHR-ELCS----GKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVS 118
Query: 83 ---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VF 136
+P+ SG ++ +S YIV ++G +L++ DT +D WV C C C + ++
Sbjct: 119 ETQIPLTSGIKL-ESLNYIVTVELGGKNMSLIV--DTGSDLTWVQCQPCRSCYNQQGPLY 175
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT-----CGGGA------CAFNLTYGS-STIAANLSQ 184
+ + S+++K + C ++ C+ + T CGG C + ++YG S +L+
Sbjct: 176 DPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLAS 235
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
++I L + + FGC + G GL+GLGR S+SL++QT + FSYCLPS
Sbjct: 236 ESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL 295
Query: 245 KALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
+ SGSL G + YTPL++NP+ S Y +NL +G G
Sbjct: 296 ED-GASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIG--------GVE 346
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----P 356
+ + G G +IDSGTV TRL Y AV+ F ++ T DTC+++
Sbjct: 347 LKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYED 406
Query: 357 IVAPTITLMFSGMNVTLPQD---NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
I P I ++F G N L D S+ CLA+A+ + + + +I N QQ+N
Sbjct: 407 ISIPIIKMIFQG-NAELEVDVTGVFYFVKPDASLVCLALASL--SYENEVGIIGNYQQKN 463
Query: 414 HRILYDVPNSRLGVARELC 432
R++YD RLG+ E C
Sbjct: 464 QRVIYDSTQERLGIVGENC 482
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 121/395 (30%), Positives = 173/395 (43%), Gaps = 43/395 (10%)
Query: 70 QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
++ S L +S V + SG Y + IGTP + + +DT +D W+ C C+
Sbjct: 172 EYSSQLVATLESGVSLGSGE-------YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCI 224
Query: 130 GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT----CGGG--ACAFNLTYGSS---- 176
C S ++ +S++F+N+ C +CK V +P C C + YG S
Sbjct: 225 ACFEQSGPYYDPKESSSFENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTT 284
Query: 177 ------TIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
T NL+ V FGC G GLLGLGRG LS +Q Q
Sbjct: 285 GDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQ 344
Query: 231 NLYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLLKNPRRS--SLYYVNL 283
++Y +FSYCL + S S L G + P + +T + S + YYV +
Sbjct: 345 SIYGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHP-NLNFTSFVGGEENSVDTFYYVGI 403
Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
+I V V+ IP + G GTIIDSGT T PAY +++ F +++ V
Sbjct: 404 KSIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELV 463
Query: 344 TSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
CY+V + P ++FS G P +N I + CLA+ P
Sbjct: 464 EGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQ-IEPDLVCLAILGTP-- 520
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
S L++I N QQQN ILYD+ SRLG A CT
Sbjct: 521 -KSALSIIGNYQQQNFHILYDMKKSRLGYAPMKCT 554
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 134/457 (29%), Positives = 218/457 (47%), Gaps = 71/457 (15%)
Query: 27 TQDHSSTLQVFH----VFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV 82
T+ S+ L++ H FSP P +PSK S E +L+ D AR+ L + +S
Sbjct: 35 TESGSTILELRHHISSSFSP-GPNRPSK-TSRGEVDGGVLSSDAARVSSLQRRIESYRSS 92
Query: 83 --------------VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC 128
VPI SG + ++ Y+ A +G A + +DT+++ WV C C
Sbjct: 93 SEGEEEEASKLALQVPITSGANL-RTLNYV--ATVGLGAAEATVVVDTASELTWVQCQPC 149
Query: 129 VGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG-----------ACAFNLTYG 174
C +F+ + S ++ + C ++ C + G AC++ L+Y
Sbjct: 150 ESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYR 209
Query: 175 SSTIAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQT 229
+ + L++D + LA + G+ FGC T N P GL+GLGR +SL++QT
Sbjct: 210 DGSYSRGVLARDKLRLAGQDIEGFVFGC---GTSNQGAPFGGTSGLMGLGRSHVSLVSQT 266
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKN--PRRSSLYYVNL 283
+ + FSYCLP ++ S SGSL LG R I YT ++ + P + Y++NL
Sbjct: 267 MDQFGGVFSYCLPMRESGS-SGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNL 325
Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAG-TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
I VG + V+ +P AG IIDSGT+ T LV Y AVR F ++
Sbjct: 326 TGITVGGQEVE--------SPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQ 377
Query: 343 VTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDN---LLIHSTAGSITCLAMAAA 395
+ DTC+++ + P++ +F G +V + D+ L S+ S CLA+A+
Sbjct: 378 APAFSILDTCFNLTGLKEVQVPSLKFVFEG-SVEVEVDSKGVLYFVSSDASQVCLALASL 436
Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++ ++I N QQ+N R+++D S++G A+E C
Sbjct: 437 KSEYDT--SIIGNYQQKNLRVIFDTLGSQIGFAQETC 471
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 113/376 (30%), Positives = 186/376 (49%), Gaps = 34/376 (9%)
Query: 76 AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTG-CVGC 131
A A + +P +G + ++P ++V G+PAQT DT +D +W+ PC+G C
Sbjct: 92 AEAPSATIPDHTGTNL-KTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQ 150
Query: 132 SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL- 189
VF+ A+S+++ + C +C C G C + + YG S+ L+++T++
Sbjct: 151 HDPVFDPAKSSSYAVVPCGTTECAAA-GGECNGTTCVYGVEYGDGSSTTGVLARETLTFS 209
Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
++ G+ FGC + G+ GLLGLGRGSLSL +Q + FSYCLPS+
Sbjct: 210 SSSEFTGFIFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTPG 269
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
S+ P+ ++YT ++ P S Y++ L++I +G V+ +PP T G
Sbjct: 270 YLSIGATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEF-----TKTG 324
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLM 365
T++DSGT+ T L PAYTA+RD F+ + + DTCY I+ P ++
Sbjct: 325 TLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFN 384
Query: 366 FSGMNV---------TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
FS V T P D T ++ CLA + P ++ +V+ + Q++ +
Sbjct: 385 FSDGAVFNLNFFGIMTFPDD------TKPAVGCLAFVSRPADMP--FSVVGSTTQRSAEV 436
Query: 417 LYDVPNSRLGVARELC 432
+YDVP ++G C
Sbjct: 437 IYDVPAQKIGFIPASC 452
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 122/394 (30%), Positives = 174/394 (44%), Gaps = 44/394 (11%)
Query: 71 FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
F L +S V + SG Y + +GTP + + +DT +D W+ C C
Sbjct: 162 FSGQLIATLESGVSLGSGE-------YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYE 214
Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGSS----- 176
C + ++ QS++++N+GC ++C V +P C + YG S
Sbjct: 215 CFEQNGPHYDPGQSSSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTG 274
Query: 177 -----TIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
T NL+ + V FGC G GLLGLGRG LS +Q Q+
Sbjct: 275 DFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQS 334
Query: 232 LYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL---KNPRRSSLYYVNL 283
LY +FSYCL + + S L G + P+ + +T L+ +NP + YYV +
Sbjct: 335 LYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPE-LNFTTLVAGKENPV-DTFYYVQI 392
Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
+I VG VV+IP Q GTIIDSGT + PAY +++ F +V V
Sbjct: 393 KSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVV 452
Query: 344 TSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
+ CY+V V P ++FS G P +N I + CLA+ P
Sbjct: 453 KDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPP- 511
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S L++I N QQQN ILYD SRLG A C
Sbjct: 512 --SALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 543
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 123/416 (29%), Positives = 180/416 (43%), Gaps = 47/416 (11%)
Query: 57 SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ--------SPTYIVRAKIGTPAQ 108
S L+ L K+Q + F A A S P+ SG+ + S Y + +GTP +
Sbjct: 148 SRLQRLQKEQPKQSFKPVFAPAASSTSPV-SGQLVATLESGVSLGSGEYFMDVFVGTPPK 206
Query: 109 TLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQV-----PNP 160
+ +DT +D W+ C C+ C S ++ S++F+N+ C +C+ V PNP
Sbjct: 207 HFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSSPDPPNP 266
Query: 161 -TCGGGACAFNLTYGS----------STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+C + YG T NL+ V FGC G
Sbjct: 267 CKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLF 326
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
GLLGLG+G LS +Q Q+LY +FSYCL + + S + G+ K + P
Sbjct: 327 HGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLI--FGEDKELLSHPN 384
Query: 270 L--------KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
L K+ + YYV + ++ V V+ IP + GTIIDSGT T
Sbjct: 385 LNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYF 444
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQD 376
PAY +++ F R++ V L CY+V + P ++F+ G P +
Sbjct: 445 AEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGILFADGAVWNFPVE 504
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N I + CLA+ P S L++I N QQQN ILYD+ SRLG A C
Sbjct: 505 NYFIQIDP-DVVCLAILGNP---RSALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 556
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 183/360 (50%), Gaps = 28/360 (7%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C---SSTVFNS 138
+P +G + + ++V GTPAQT + +DT +D +W+ C C G C F+
Sbjct: 124 IPDHTGTNL-DTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDP 182
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPG 196
A+S+++ + C C C G C + + YG S+ LS+DT++ ++ G
Sbjct: 183 AKSSSYAAVPCGTPVCAAA-GGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFTG 241
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
+TFGC +K G+ GLLGLGRG LSL +Q + FSYCLPS+ G L +G
Sbjct: 242 FTFGCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTT--PGYLNIG 299
Query: 257 PIGQPKR---IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
+P ++YT ++K P+ S Y++ L++I +G ++ +PP T GT++D
Sbjct: 300 AT-KPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVF-----TKTGTLLD 353
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-G 368
SGT+ T L PAYT++RD F+ + N DTCY IV P ++ FS G
Sbjct: 354 SGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDG 413
Query: 369 MNVTLPQDNLLIHSTAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
L ++I I CLA + P + +++ N QQ+ ++YDVP+ ++G
Sbjct: 414 AVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMP--FSIVGNTQQRAAEVIYDVPSQKIG 471
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 112/343 (32%), Positives = 157/343 (45%), Gaps = 27/343 (7%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y + +GTP T + DT +D W PCT C + F A S+TF L C ++
Sbjct: 86 YNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKATGNSV 210
C+ +PN TC C +N YGS A L+ +T+ + P FGC + GNS
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTAGYLATETLKVGDASFPSVAFGCSTENGVGNST 205
Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTP 268
G+ GLGRG+LSL+ Q L FSYCL S A S + G + ++ TP
Sbjct: 206 --SGIAGLGRGALSLIPQ---LGVGRFSYCLRSGSAAGAS-PILFGSLANLTDGNVQSTP 259
Query: 269 LLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAP 324
+ NP S YYVNL I VG D+P F T G GTI+DSGT T L
Sbjct: 260 FVNNPAVHPSYYYVNLTGITVGE--TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 317
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFS-GMNVTLPQDNL 378
Y V+ F + + TV G D C+ I P++ L F G +P
Sbjct: 318 GYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFA 377
Query: 379 LIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ + + GS+T + P + ++VI N+ Q + +LYD+
Sbjct: 378 GVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDL 420
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 131/416 (31%), Positives = 178/416 (42%), Gaps = 42/416 (10%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
+ LS E + M A+ +AR L S A V P + + + Y+V IGTP Q
Sbjct: 65 RGLSTRELLHRMAARSKARSARLLSGRAASARVDPGSYTDGVPDT-EYLVHMAIGTPPQP 123
Query: 110 LLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---- 162
+ + +DT +D W C CV C S FN ++S TF L C C+ + +C
Sbjct: 124 VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 183
Query: 163 -GGGACAFNLTYGSSTI-AANLSQDTISLATDI-------VPGYTFGCIQKATGNSVPPQ 213
G G C + Y +I +L DT S A+ VP TFGC G V +
Sbjct: 184 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNE 243
Query: 214 -GLLGLGRGSLSLLAQTQNLYQSTFSYCL-------PSFKALSFSGSLRLGPIGQPKRIK 265
G+ G RG+LS+ AQ L FSYC PS L +L G +
Sbjct: 244 TGIAGFSRGALSMPAQ---LKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV 300
Query: 266 YTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ L S L YY++L + VG + IP GTI+DSGT T L
Sbjct: 301 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 360
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLL 379
Y V D F + + ++ C+SVP A P + L F G + LP++N +
Sbjct: 361 AVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYM 420
Query: 380 IH-STAGSI--TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
AG I TCLA+ A D L+VI N QQQN +LYD+ N L C
Sbjct: 421 FEIEEAGGIRLTCLAINAGED-----LSVIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 163/351 (46%), Gaps = 43/351 (12%)
Query: 106 PAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNP 160
P LM +DT++D AWV C + C + +++ ++S + ++ C + C+Q+ P
Sbjct: 178 PGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQL-GP 236
Query: 161 TCGG--------GACAFNLTY-GSSTIAANLSQDTISLA-TDIVPGYTFGCIQKATGN-- 208
G G C + + Y ST + L D +SL+ T VP + FGC A G+
Sbjct: 237 YANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFEFGCSHAARGSFS 296
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYT 267
G++ LGRG SL++QT Y FSYC P S G LG P R T
Sbjct: 297 RSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFP--PTASHKGFFVLGVPRRSSSRYAVT 354
Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
P+LK P LY V L AI V + +D+PP AG +DS TV TRL AY
Sbjct: 355 PMLKTPM---LYQVRLEAIAVAGQRLDVPPTVF------AAGAALDSRTVITRLPPTAYQ 405
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF--SGMNVTLPQDNLLIH 381
A+R FR ++ + G DTCY I+ PTI+L+F +G V L +L
Sbjct: 406 ALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLFG 465
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S CLA A+ + + +I +Q Q +LY+V +G R C
Sbjct: 466 S------CLAFASTAGD-DRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 175/406 (43%), Gaps = 59/406 (14%)
Query: 83 VPIASGRQ--ITQSPTYIVRAKIGT-PAQTLLMAMDTSNDAAWVPC-------------- 125
+P S RQ + Y + +G+ P+Q++ + MDT +D W PC
Sbjct: 3 LPSPSRRQPISNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNA 62
Query: 126 --------TGCVGCSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYG 174
+ V C S ++A S+ + C A+C + C C F YG
Sbjct: 63 TKPLNITRSHRVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYG 122
Query: 175 SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL-- 232
+ A+L +DT+S++ + +TFGC A P G+ G GRG LSL AQ L
Sbjct: 123 DGSFIAHLHRDTLSMSQLFLKNFTFGCAHTALAE---PTGVAGFGRGLLSLPAQLATLSP 179
Query: 233 -YQSTFSYCLPSF----KALSFSGSLRLGPIGQ--PKRIK--YTPLLKNPRRSSLYYVNL 283
+ FSYCL S + + L LG +R++ YT +L+NP+ S Y V L
Sbjct: 180 NLGNRFSYCLVSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGL 239
Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG----S 339
I VG+R + P + + G ++DSGT FT L A Y +V F RRVG
Sbjct: 240 TGISVGKRTILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKR 299
Query: 340 NLTVTSLGGFDTCYSVP--IVAPTITLMFSG--MNVTLPQDNLLIHSTAGS------ITC 389
V G CY + + PT+T F G NV LP+ N G + C
Sbjct: 300 ASEVEEKTGLGPCYFLEGLVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGC 359
Query: 390 LAMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNSRLGVARELC 432
L + D+ ++ N QQQ ++YD+ N R+G A+ C
Sbjct: 360 LMLMNGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 114/352 (32%), Positives = 173/352 (49%), Gaps = 52/352 (14%)
Query: 114 MDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFN 170
MDT +D W C C+ C+ + F+ +S T++ L C++++C + +P+C C +
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQ 60
Query: 171 LTYG-SSTIAANLSQDTISL---------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGR 220
YG +++ A L+ +T + AT+I FGC G+ G++G GR
Sbjct: 61 YYYGDTASTAGVLANETFTFGAANSTKVRATNIA----FGCGSLNAGDLANSSGMVGFGR 116
Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPI----GQPKRIKYTPLLK 271
G LSL++Q L S FSYCL S+ + L F L G P ++ TP +
Sbjct: 117 GPLSLVSQ---LGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSP--VQSTPFVI 171
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
NP ++Y+++L AI +G +++ I P N G IIDSGT T L AY AV
Sbjct: 172 NPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV-- 229
Query: 332 VFRRRVGSNLTVTSLG----GFDTCYSVP------IVAPTITLMFSGMNVT-LPQDNLLI 380
RR + S + + ++ G DTC+ P + P + F N+T LP++ +LI
Sbjct: 230 --RRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLI 287
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
ST G + CL M AP V + +I N QQQN +LYD+ NS L C
Sbjct: 288 ASTTGYL-CLVM--APTGVGT---IIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 127/448 (28%), Positives = 188/448 (41%), Gaps = 75/448 (16%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT------------------ 96
+ES +E +D AR+Q L + + +K+ I+ ++ + P
Sbjct: 10 KESFVESTNRDLARIQTLHTRIIEKKNQNDISRLKKDKERPEKQIKTVVATAASPESYGT 69
Query: 97 --------------------YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSS 133
Y + IGTP + + +DT +D W VPC C +
Sbjct: 70 GLSGQLMATLESGVTLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNG 129
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPT------CGGGACAFNLTYGSS----------T 177
++ +S++F+N+GC +C V +P C + YG S T
Sbjct: 130 PYYDPKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATET 189
Query: 178 IAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
NL+ T V FGC G GLLGLGRG LS +Q Q+LY +F
Sbjct: 190 FTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSF 249
Query: 238 SYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL---KNPRRSSLYYVNLLAIRVG 289
SYCL + + S L G + P+ + +T L+ +NP + YYV + +I VG
Sbjct: 250 SYCLVDRNSDTNVSSKLIFGEDKDLLNHPE-LNFTTLVGGKENPV-DTFYYVQIKSIMVG 307
Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF 349
V++IP GTI+DSGT + PAY ++D F ++V V
Sbjct: 308 GEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPIL 367
Query: 350 DTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
D CY+V I P ++F+ G P +N I + CLA+ P S L+
Sbjct: 368 DPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTP---RSALS 424
Query: 405 VIANMQQQNHRILYDVPNSRLGVARELC 432
+I N QQQN +LYD SRLG A C
Sbjct: 425 IIGNYQQQNFHVLYDTKKSRLGYAPMNC 452
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 130/407 (31%), Positives = 178/407 (43%), Gaps = 48/407 (11%)
Query: 60 EMLAKDQARLQFLSS--LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTS 117
E+L + ARL F +S A AR P A+G T+ Y+V IGTP Q + + +DT
Sbjct: 379 EVLHRMAARLLFSASGRAASARVDPGPYANGVPDTE---YLVHLAIGTPPQPVQLILDTG 435
Query: 118 NDAAWVPCTGCVGCSSTVF---NSAQSTTFKNLGCQAAQCKQVPNPTCG-----GGACAF 169
+D W C C C S + + S+TF L C + C + +CG C +
Sbjct: 436 SDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWGNQTCVY 495
Query: 170 NLTYGSSTIA-ANLSQDTISLATD------IVPGYTFGCIQKATGNSVPPQ-GLLGLGRG 221
Y +I +L +T + A VP FGC G + G+ G GRG
Sbjct: 496 VYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCGLFNNGIFTSNETGIAGFGRG 555
Query: 222 SLSLLAQTQNLYQSTFSYCL-------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
+LSL +Q L FS+C PS L +L G ++ TPL++N
Sbjct: 556 ALSLPSQ---LKVDNFSHCFTAITGSEPSSVLLGLPANLYSDADGA---VQSTPLVQNFS 609
Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
YY++L I VG + IP GTIIDSGT T L AY V D F
Sbjct: 610 SLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFT 669
Query: 335 RRVG---SNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLL--IHSTAG 385
+V N T +SL +SVP A P + L F G + LP++N + G
Sbjct: 670 AQVRLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFEGATLDLPRENYMFEFEDAGG 729
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S+TCLA+ A D L +I N QQQN +LYD+ + L C
Sbjct: 730 SVTCLAINAGDD-----LTIIGNYQQQNLHVLYDLVRNMLSFVPAQC 771
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 132/442 (29%), Positives = 200/442 (45%), Gaps = 49/442 (11%)
Query: 24 ICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS--------- 74
+ D+ ++L+V H PCS + S + E+L +DQ+R++ + S
Sbjct: 66 VLSNNDNKASLKVVHKHGPCSKLSQDEA-SAAPTHTEILLQDQSRVKSIHSRLSNSKTSG 124
Query: 75 ---LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT----G 127
+ V + +P G + S YIV +GTP + L + DT +D W C
Sbjct: 125 GKDVKVTDSTTIPAKDGSTV-GSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARS 183
Query: 128 CVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL 182
C +F+ +QST++ N+ C ++ C + + P C AC + + YG S+ +
Sbjct: 184 CYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVYGIQYGDSSFSVGF 243
Query: 183 -SQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
+ ++L +TD FGC Q G GLLGLGR LS+++QT Y FSYC
Sbjct: 244 FGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYC 303
Query: 241 LPSFKA----LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
LPS + L+F GS K K+TPL S Y ++ I VG + + I
Sbjct: 304 LPSSSSSTGFLTFGGS-------ASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAIS 356
Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-- 354
+ AG IIDSGTV TRL AY+A+R FR + +L DTCY
Sbjct: 357 ASVF-----STAGAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFS 411
Query: 355 --VPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
I P I F SG+ V + +L S+ + CLA A D + + + N+QQ
Sbjct: 412 SYTTISVPKIGFSFSSGIEVDIDATGILYASSLSQV-CLAFAGNSDATD--VFIFGNVQQ 468
Query: 412 QNHRILYDVPNSRLGVARELCT 433
+ + YD ++G A C+
Sbjct: 469 KTLEVFYDGSAGKVGFAPGGCS 490
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 116/365 (31%), Positives = 174/365 (47%), Gaps = 56/365 (15%)
Query: 5 LVFFLAFLFLFSLSE----------GLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSW 54
LV+FL + +L + + L P + + + HV P S P P+S+
Sbjct: 3 LVWFLGWFYLLATASSFVEKENEAVALGPRVNQSGGVVQMTIHHVHGPGSSLAPQPPVSF 62
Query: 55 EESVLEMLAKDQARLQFLSSLAVAR-----------------KSV-VPIASGRQITQSPT 96
+ +LA D AR++ L+S + KSV VP+ G I S
Sbjct: 63 SD----VLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASI-GSGN 117
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAA 152
Y V+ G+PA+ M +DT + +W+ C CV C + +F+ + S T+K+L C ++
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 153 QCKQV-----PNPTC--GGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQ 203
QC + NP C C + +YG S+ + LSQD ++LA + +PG+ +GC Q
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQ 237
Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG-QPK 262
+ G G+LGLGR LS+L Q + + FSYCLP+ F L +G
Sbjct: 238 DSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGF---LSIGKASLAGS 294
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
K+TP+ +P SLY++ L AI VG R + + A Q+ TIIDSGTV TRL
Sbjct: 295 AYKFTPMTTDPGNPSLYFLRLTAITVGGRALGV--AAAQYR----VPTIIDSGTVITRLP 348
Query: 323 APAYT 327
YT
Sbjct: 349 MSVYT 353
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 121/418 (28%), Positives = 177/418 (42%), Gaps = 71/418 (16%)
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSS--- 133
R+ +P++ G T S +A+ AQ + + MDT +D W PC C+ C
Sbjct: 58 RQLSLPLSPGSDYTLSFNLGPQAQ----AQPITLYMDTGSDLVWFPCAPFKCILCEGKPN 113
Query: 134 -------TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA----------------FN 170
T + + + K+ C AA P+ C C F
Sbjct: 114 EPNASPPTNITQSVAVSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFY 173
Query: 171 LTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
YG ++ A L +DT+SL++ + +TFGC P G+ G GRG LSL AQ
Sbjct: 174 YAYGDGSLIARLYRDTLSLSSLFLRNFTFGCAHTTLAE---PTGVAGFGRGLLSLPAQLA 230
Query: 231 NL---YQSTFSYCLPSF----KALSFSGSLRLGPIGQPKRIK---------YTPLLKNPR 274
L + FSYCL S + + L LG + ++ K YT +L+NP+
Sbjct: 231 TLSPQLGNRFSYCLVSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPK 290
Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
Y V+L+ I VG+R + P + N G ++DSGT FT L A Y +V D F
Sbjct: 291 HPYFYTVSLIGIAVGKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFD 350
Query: 335 RRVGSN----LTVTSLGGFDTCYSVPIVA--PTITLMFSG---MNVTLPQDNLLIHSTAG 385
RRVG + + G CY + VA P +TL F+G +V LP+ N + G
Sbjct: 351 RRVGRDNKRARKIEEKTGLAPCYYLNSVADVPALTLRFAGGKNSSVVLPRKNYFYEFSDG 410
Query: 386 S--------ITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S + CL + D + + N QQQ + YD+ R+G AR C
Sbjct: 411 SDGAKGKRKVGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 130/416 (31%), Positives = 178/416 (42%), Gaps = 42/416 (10%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
+ LS E + M A+ +AR L S A + P + + + Y+V IGTP Q
Sbjct: 65 RGLSTRELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQP 123
Query: 110 LLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---- 162
+ + +DT +D W C CV C S FN ++S TF L C C+ + +C
Sbjct: 124 VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 183
Query: 163 -GGGACAFNLTYGSSTI-AANLSQDTISLATDI-------VPGYTFGCIQKATGNSVPPQ 213
G G C + Y +I +L DT S A+ VP TFGC G V +
Sbjct: 184 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNE 243
Query: 214 -GLLGLGRGSLSLLAQTQNLYQSTFSYCL-------PSFKALSFSGSLRLGPIGQPKRIK 265
G+ G RG+LS+ AQ L FSYC PS L +L G +
Sbjct: 244 TGIAGFSRGALSMPAQ---LKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV 300
Query: 266 YTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ L S L YY++L + VG + IP GTI+DSGT T L
Sbjct: 301 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 360
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLL 379
Y V D F + + ++ C+SVP A P + L F G + LP++N +
Sbjct: 361 AVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYM 420
Query: 380 IH-STAGSI--TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
AG I TCLA+ A D L+VI N QQQN +LYD+ N L C
Sbjct: 421 FEIEEAGGIRLTCLAINAGED-----LSVIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 130/416 (31%), Positives = 178/416 (42%), Gaps = 42/416 (10%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
+ LS E + M A+ +AR L S A + P + + + Y+V IGTP Q
Sbjct: 39 RGLSTRELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQP 97
Query: 110 LLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---- 162
+ + +DT +D W C CV C S FN ++S TF L C C+ + +C
Sbjct: 98 VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 157
Query: 163 -GGGACAFNLTYGSSTI-AANLSQDTISLATDI-------VPGYTFGCIQKATGNSVPPQ 213
G G C + Y +I +L DT S A+ VP TFGC G V +
Sbjct: 158 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNE 217
Query: 214 -GLLGLGRGSLSLLAQTQNLYQSTFSYCL-------PSFKALSFSGSLRLGPIGQPKRIK 265
G+ G RG+LS+ AQ L FSYC PS L +L G +
Sbjct: 218 TGIAGFSRGALSMPAQ---LKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV 274
Query: 266 YTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ L S L YY++L + VG + IP GTI+DSGT T L
Sbjct: 275 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 334
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLL 379
Y V D F + + ++ C+SVP A P + L F G + LP++N +
Sbjct: 335 AVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYM 394
Query: 380 IH-STAGSI--TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
AG I TCLA+ A D L+VI N QQQN +LYD+ N L C
Sbjct: 395 FEIEEAGGIRLTCLAINAGED-----LSVIGNFQQQNMHVLYDLANDMLSFVPARC 445
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 115/370 (31%), Positives = 170/370 (45%), Gaps = 58/370 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS-----STVFNSAQSTTFKNLGCQA 151
Y++ IGTP Q + +DT +D W+ C C C T+F S S+++K L C +
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 152 AQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATD--------IVPGYTF 199
C + + G C + YG S + ++ D IS + G+ F
Sbjct: 65 THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGS- 252
GC +K G+ QGL+GLG+ S SL+ Q + FSYCL PS K+ F GS
Sbjct: 125 GCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLGSS 184
Query: 253 --LRLGPIGQPKRIKYTPLLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-- 307
LR + TP+L +LYYV+L +I +G V + N + G
Sbjct: 185 AALR------GHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPF 238
Query: 308 --AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV---------GSNLTVTSLGGFDTCYSVP 356
T+IDSGT +T L P Y A+R +V G +L S G DT Y
Sbjct: 239 LANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLDLCFNSSG--DTSYGF- 295
Query: 357 IVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P++T F+ + + LP +N+ T+ + CL+M D+ L++I NMQQQN
Sbjct: 296 ---PSVTFYFANQVQLVLPFENIF-QVTSRDVVCLSM----DSSGGDLSIIGNMQQQNFH 347
Query: 416 ILYDVPNSRL 425
ILYD+ S++
Sbjct: 348 ILYDLVASQI 357
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 117/391 (29%), Positives = 181/391 (46%), Gaps = 42/391 (10%)
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGCS----STV 135
S P+ SG + S Y V ++G+P QTLL+ DT +D WV C+ C CS +
Sbjct: 68 SKSPLMSGAS-SGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGST 126
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPT---CGG----GACAFNLTYGS-STIAANLSQDTI 187
F + STTF C ++ C+ VP P C C + Y S + S++T
Sbjct: 127 FLARHSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETT 186
Query: 188 SLATD-----IVPGYTFGCIQKATGNSV------PPQGLLGLGRGSLSLLAQTQNLYQST 236
+L T + FGC A+G S+ G++GLGRG +S +Q + +
Sbjct: 187 TLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRS 246
Query: 237 FSYCLPSFK-ALSFSGSLRLGPIGQPKR-----IKYTPLLKNPRRSSLYYVNLLAIRVGR 290
FSYCL + + + L +G + K+ + +TPLL NP + YY+++ + V
Sbjct: 247 FSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDG 306
Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG----SNLTVTSL 346
+ I P + GT+IDSGT T L PAY + F+R V + ++
Sbjct: 307 VKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTR 366
Query: 347 GGFDTCYSVPIVA----PTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
GFD C +V V+ P ++L G ++ + P N I + G I CLA+ + +
Sbjct: 367 SGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEG-IKCLAIQPV-EAESG 424
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+VI N+ QQ + +D SRLG +R C
Sbjct: 425 RFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 128/422 (30%), Positives = 176/422 (41%), Gaps = 68/422 (16%)
Query: 72 LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCV 129
LS+ R+ +P++ G T S RA+ AQ + + MDT +D W PC C+
Sbjct: 29 LSAKRFRRQLSLPLSPGSDYTLSFNLGPRAQ----AQPITLYMDTGSDLVWFPCAPFKCI 84
Query: 130 GC-----SSTVFNSAQSTTF--KNLGCQAAQCKQVPNPTCGGGACA-------------- 168
C +S N+ +S K+ C AA P+ C C
Sbjct: 85 LCEGKPNASPPVNTTRSVAVSCKSPACSAAHNLASPSDLCAAARCPLESIETSDCANFKC 144
Query: 169 --FNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
F YG ++ A L +DT+SL++ + +TFGC A P G+ G GRG LSL
Sbjct: 145 PPFYYAYGDGSLIARLYRDTLSLSSLFLRNFTFGC---AYTTLAEPTGVAGFGRGLLSLP 201
Query: 227 AQTQNL---YQSTFSYCLPSF----KALSFSGSLRLGPI----------GQPKRIKYTPL 269
AQ L + FSYCL S + + L LG G YTP+
Sbjct: 202 AQLATLSPQLGNRFSYCLVSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPM 261
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
L+NP+ Y V L+ I VG+R+V P + N G ++DSGT FT L A Y +V
Sbjct: 262 LENPKHPYFYTVGLIGISVGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSV 321
Query: 330 RDVFRRRVG----SNLTVTSLGGFDTCYSVPIVA--PTITLMFSGMN--VTLPQDNLLIH 381
D F R VG + G CY + VA P +TL F+G N V LP+ N
Sbjct: 322 VDEFDRGVGRVNERARKIEEKTGLAPCYYLNSVAEVPVLTLRFAGGNSSVVLPRKNYFYE 381
Query: 382 STAG--------SITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDVPNSRLGVARE 430
G + CL + D + N QQQ + YD+ R+G AR
Sbjct: 382 FLDGRDAAKGKRRVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARR 441
Query: 431 LC 432
C
Sbjct: 442 QC 443
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 136/447 (30%), Positives = 209/447 (46%), Gaps = 61/447 (13%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS----------------- 74
+T+ + H PCSP P+K + E E L +D+ R ++
Sbjct: 62 ATVPLHHRHGPCSPL-PNKKMPTLE---ERLHRDKLRAAYIHRKLSRGKKQGGGGAGGDV 117
Query: 75 -LAVARKSVVPIASGRQITQSPTYIVRAKIGT-PAQTLLMAMDTSNDAAWVPCTGCV-GC 131
+ + VP G + + Y++ ++G+ P ++ M +DT +D +WV C C C
Sbjct: 118 VVQQSHAMTVPTTLGTSL-DTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQC 176
Query: 132 SSTV---FNSAQSTTFKNLGCQAAQCKQV-----PNPTCGGGACAFNLTYGSSTIA--AN 181
V F+ + S+T+ C +A C Q+ N G C + YG ++
Sbjct: 177 RPQVDPLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGT 236
Query: 182 LSQDTISLATD----IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST- 236
S DT++L ++ +V + FGC TG + GL+GLG G+ SL++QT + +T
Sbjct: 237 YSSDTLALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTA 296
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKR-IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
FSYCLP S SG L LG G TP+L++ + + Y V L AIRVG R + I
Sbjct: 297 FSYCLP--PTPSSSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSI 354
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGF-DTC 352
P AG I+DSGTV TRL AY+++ F+ + S GGF DTC
Sbjct: 355 PTTVFS------AGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTC 408
Query: 353 YSV----PIVAPTITLMFSGMN---VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
+ + + PT+ L+FSG V L +L+ SI CLA A D+ ++ +
Sbjct: 409 FDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGST--GI 466
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
I N+QQ+ ++LYDV +G C
Sbjct: 467 IGNVQQRTFQVLYDVAGGAVGFKAGAC 493
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 165/376 (43%), Gaps = 46/376 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-------AQSTTFKNLGC 149
Y + GTP QTL MDT + W PCT C++ F S S++ K +GC
Sbjct: 77 YSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGC 136
Query: 150 QAAQCKQVPNPTCGGGACAFN------------LTYGSSTIAANLSQDTISLATDIVPGY 197
+ +C + C N + YGS T +T+ L IVP +
Sbjct: 137 KNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTGGVALSETLHLHGLIVPNF 196
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRL 255
GC + +S P G+ G GRG SL +Q L + FSYCL S F S SL L
Sbjct: 197 LVGC---SVFSSRQPAGIAGFGRGPSSLPSQ---LGLTKFSYCLLSHKFDDTQESSSLVL 250
Query: 256 GPIGQPKR----IKYTPLLKNPR------RSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
+ + YTPL+KNP+ S YYV+L I +G R V IP L +
Sbjct: 251 DSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKD 310
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSV----PIV 358
GTIIDSGT FT + A+ + + F +V + L V +L G C++V +
Sbjct: 311 GNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVSGAKELE 370
Query: 359 APTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAA-APDNVNSVLNVIANMQQQNHRI 416
P + L F G +V LP +N + + C + + + ++ N Q QN +
Sbjct: 371 LPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGNFQMQNFYV 430
Query: 417 LYDVPNSRLGVARELC 432
YD+ N RLG +E C
Sbjct: 431 EYDLQNERLGFKKESC 446
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 131/421 (31%), Positives = 181/421 (42%), Gaps = 61/421 (14%)
Query: 58 VLEMLAKD-QARLQFLSSLAVARKSVVPIAS-----GRQITQSPTYIVRAKIGTPAQTLL 111
V + L +D R +F LA + S P + + + YI+ IGTP Q+
Sbjct: 47 VRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQSYP 106
Query: 112 MAMDTSNDAAWVPCT----GCVGCSSTVFNSAQSTTFKNLGCQ------AAQCK---QVP 158
DT +D W C C S ++N + S TF+ L C AA+ + P
Sbjct: 107 AIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGATP 166
Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ 213
P C AC +N TYG+ + +T + + VPG FGC ++ +
Sbjct: 167 PPGC---ACRYNQTYGTGWTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDW---N 220
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-----IKYTP 268
G GL L+ L FSYCL F+ +L LGP ++ TP
Sbjct: 221 GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTP 280
Query: 269 LLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
+ +P + S+ YY+NL I VG + IPPGA G IIDSGT T LV A
Sbjct: 281 FVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAA 340
Query: 326 YTAVRDVFRRRV------GSNLTVTSLGGFDTCYSV------PIVAPTITLMF-SGMNVT 372
Y VR R V GSN T G D C+++ P P++TL F G ++
Sbjct: 341 YKRVRAAVRSLVKLPVTDGSNAT-----GLDLCFALPSSSAPPATLPSMTLHFGGGADMV 395
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP +N +I G + CLAM + D L+ + N QQQN ILYDV L A C
Sbjct: 396 LPVENYMILD--GGMWCLAMRSQTDG---ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450
Query: 433 T 433
+
Sbjct: 451 S 451
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 116/370 (31%), Positives = 170/370 (45%), Gaps = 58/370 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS-----STVFNSAQSTTFKNLGCQA 151
Y++ IGTP Q + +DT +D W+ C C C T+F S S+++K L C +
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 152 AQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISLATD--------IVPGYTF 199
C + + G C + YG S + ++ D IS + G+ F
Sbjct: 65 THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGS- 252
GC +K G+ QGL+GLG+ S SL+ Q + FSYCL PS K+ F GS
Sbjct: 125 GCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLGSS 184
Query: 253 --LRLGPIGQPKRIKYTPLLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-- 307
LR + TP+L +LYYV+L +I VG V + N + G
Sbjct: 185 AALR------GHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPF 238
Query: 308 --AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV---------GSNLTVTSLGGFDTCYSVP 356
T+IDSGT +T L P Y A+R +V G +L S G DT Y
Sbjct: 239 LANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLDLCFNSSG--DTSYGF- 295
Query: 357 IVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P++T F+ + + LP +N+ T+ + CL+M D+ L++I NMQQQN
Sbjct: 296 ---PSVTFYFANQVQLVLPFENIF-QVTSRDVVCLSM----DSSGGDLSIIGNMQQQNFH 347
Query: 416 ILYDVPNSRL 425
ILYD+ S++
Sbjct: 348 ILYDLVASQI 357
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 123/404 (30%), Positives = 182/404 (45%), Gaps = 50/404 (12%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP------TYIVRAKIGTPAQT 109
E++ ++AK AR+++++ A A S +G +SP Y++ +GTP +
Sbjct: 10 EAIRGLVAKSHARVRWMA--ARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKR 67
Query: 110 LLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC--GG 164
DT +D WV PCTGC G T+F+ QS+TF+ + C + C ++P +C G
Sbjct: 68 FRAIADTGSDLVWVQSEPCTGCSG--GTIFDPRQSSTFREMDCSSQLCTELPG-SCEPGS 124
Query: 165 GACAFNLTYGSSTIAANLSQDTISLAT-----DIVPGYTFGCIQKATG-NSVPPQGLLGL 218
AC+++ YGS ++DTISL T P + GC +G + V GL+GL
Sbjct: 125 SACSYSYEYGSGETEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGFDGV--DGLVGL 182
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLLKNPRRS 276
G+G +SL +Q S FSYCL + S S L GP I+ T +
Sbjct: 183 GQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTY 242
Query: 277 SLYY---VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
YY VN +A+ G +P T TIIDSGT T + + Y V
Sbjct: 243 PTYYLLTVNGIAV----------AGQTMGSPGT---TIIDSGTTLTYVPSGVYGRVLSRM 289
Query: 334 RRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDN-LLIHSTAGSIT 388
V S G D CY P +T+ +G +T P N L+ +G
Sbjct: 290 ESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVDDSGDTV 349
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLAM +A +++I N+ QQ + ILYD +S L + C
Sbjct: 350 CLAMGSAG---GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 131/421 (31%), Positives = 181/421 (42%), Gaps = 61/421 (14%)
Query: 58 VLEMLAKD-QARLQFLSSLAVARKSVVPIAS-----GRQITQSPTYIVRAKIGTPAQTLL 111
V + L +D R +F LA + S P + + + YI+ IGTP Q+
Sbjct: 52 VRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQSYP 111
Query: 112 MAMDTSNDAAWVPCT----GCVGCSSTVFNSAQSTTFKNLGCQ------AAQCK---QVP 158
DT +D W C C S ++N + S TF+ L C AA+ + P
Sbjct: 112 AIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGATP 171
Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ 213
P C AC +N TYG+ + +T + + VPG FGC ++ +
Sbjct: 172 PPGC---ACRYNQTYGTGWTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDW---N 225
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-----IKYTP 268
G GL L+ L FSYCL F+ +L LGP ++ TP
Sbjct: 226 GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTP 285
Query: 269 LLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
+ +P + S+ YY+NL I VG + IPPGA G IIDSGT T LV A
Sbjct: 286 FVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAA 345
Query: 326 YTAVRDVFRRRV------GSNLTVTSLGGFDTCYSV------PIVAPTITLMF-SGMNVT 372
Y VR R V GSN T G D C+++ P P++TL F G ++
Sbjct: 346 YKRVRAAVRSLVKLPVTDGSNAT-----GLDLCFALPSSSAPPATLPSMTLHFGGGADMV 400
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP +N +I G + CLAM + D L+ + N QQQN ILYDV L A C
Sbjct: 401 LPVENYMILD--GGMWCLAMRSQTDG---ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 455
Query: 433 T 433
+
Sbjct: 456 S 456
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 131/421 (31%), Positives = 181/421 (42%), Gaps = 61/421 (14%)
Query: 58 VLEMLAKD-QARLQFLSSLAVARKSVVPIAS-----GRQITQSPTYIVRAKIGTPAQTLL 111
V + L +D R +F LA + S P + + + YI+ IGTP Q+
Sbjct: 47 VRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQSYP 106
Query: 112 MAMDTSNDAAWVPCT----GCVGCSSTVFNSAQSTTFKNLGCQ------AAQCK---QVP 158
DT +D W C C S ++N + S TF+ L C AA+ + P
Sbjct: 107 AIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGATP 166
Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ 213
P C AC +N TYG+ + +T + + VPG FGC ++ +
Sbjct: 167 PPGC---ACRYNQTYGTGWTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDW---N 220
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-----IKYTP 268
G GL L+ L FSYCL F+ +L LGP ++ TP
Sbjct: 221 GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTP 280
Query: 269 LLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
+ +P + S+ YY+NL I VG + IPPGA G IIDSGT T LV A
Sbjct: 281 FVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAA 340
Query: 326 YTAVRDVFRRRV------GSNLTVTSLGGFDTCYSV------PIVAPTITLMF-SGMNVT 372
Y VR R V GSN T G D C+++ P P++TL F G ++
Sbjct: 341 YKRVRAAVRSLVKLPVTDGSNAT-----GLDLCFALPSSSAPPATLPSMTLHFGGGADMV 395
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LP +N +I G + CLAM + D L+ + N QQQN ILYDV L A C
Sbjct: 396 LPVENYMILD--GGMWCLAMRSQTDG---ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450
Query: 433 T 433
+
Sbjct: 451 S 451
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 123/410 (30%), Positives = 183/410 (44%), Gaps = 50/410 (12%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP------TYIVRAKI 103
K + E++ ++AK AR+++++ A A S +G +SP Y++ +
Sbjct: 4 KGVKRSEAIRALVAKSHARVRWMA--ARANSSSWSSMAGTTDVESPLHPDGGGYVMDISV 61
Query: 104 GTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNP 160
GTP + DT +D WV PCTGC G T+F+ QS+TF+ + C + C ++P
Sbjct: 62 GTPGKRFRAIADTGSDLVWVQSEPCTGCSG--GTIFDPRQSSTFREMDCSSQLCAELPG- 118
Query: 161 TC--GGGACAFNLTYGSSTIAANLSQDTISLAT-----DIVPGYTFGCIQKATG-NSVPP 212
+C G C+++ YGS ++DTISL T P + GC +G + V
Sbjct: 119 SCEPGSSTCSYSYEYGSGETEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGFDGV-- 176
Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRIKYTPLL 270
GL+GLG+G +SL +Q S FSYCL + S S L GP I+ T +
Sbjct: 177 DGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKIT 236
Query: 271 KNPRRSSLYY---VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
YY VN +A+ G +P T TIIDSGT T + + Y
Sbjct: 237 PPSDTYPTYYLLTVNGIAV----------AGQTMGSPGT---TIIDSGTTLTYVPSGVYG 283
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDN-LLIHS 382
V V S G D CY P +T+ +G +T P N L+
Sbjct: 284 RVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVD 343
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+G CLAM +A +++I N+ QQ + ILYD +S L + C
Sbjct: 344 DSGDTVCLAMGSAS---GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 128/449 (28%), Positives = 199/449 (44%), Gaps = 72/449 (16%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--------- 82
+++ + H PC+P S + S+ E L +D+AR ++ + A ++
Sbjct: 17 ASVPLVHRHGPCAP---SAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAG 73
Query: 83 ----VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSS 133
+P G + S Y+V IGTPA + +DT +D +WV C C
Sbjct: 74 GGTSIPTFLGDSV-NSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKD 132
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-------GGA---CAFNLTYGS-STIAANL 182
+F+ + S+++ ++ C + C+++ G GGA C + + YG+ +T
Sbjct: 133 PLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVY 192
Query: 183 SQDTISLATDIV-PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
S +T++L +V + FGC G GLLGLG SL++QT + + FSYCL
Sbjct: 193 STETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 252
Query: 242 PSFKALSFSGSLRLGPIGQPKR---------IKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
P SG +G P + +TP+ + P + Y V L I VG
Sbjct: 253 P-----PTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAP 307
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGG-FD 350
+ IPP A +G +IDSGTV T L A AY A+R FR + L S GG D
Sbjct: 308 LAIPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLD 361
Query: 351 TCYS----VPIVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
TCY + PTI+L FSG +++ P L+ CLA A A ++ +
Sbjct: 362 TCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG-------CLAFAGA--GTDNAI 412
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
+I N+ Q+ +LYD +G C
Sbjct: 413 GIIGNVNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 110/349 (31%), Positives = 157/349 (44%), Gaps = 28/349 (8%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y VR IG+PA M +D+ +D W+ PC C + +FN A S +F + C
Sbjct: 126 SGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACS 185
Query: 151 AAQCKQVPNPT-CGGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
+ C Q+ + C G C + + YG S L+ +TI++ ++ GC G
Sbjct: 186 SNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRTVIQDTAIGCGHWNEGM 245
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
V GLLGLG G +S + Q F YCL S R P+G + P
Sbjct: 246 FVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVS----------RAMPVGA----MWVP 291
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L+ NP S YYV+L + VG V I Q G ++D+GT TRL AY A
Sbjct: 292 LIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLPTVAYNA 351
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHST 383
RD F + + + FDTCY V + PT++ FSG + T P N LI +
Sbjct: 352 FRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPAD 411
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
C A A +P S L++I N+QQ+ ++ D N +G +C
Sbjct: 412 DVGTFCFAFAPSP----SGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 111/350 (31%), Positives = 172/350 (49%), Gaps = 35/350 (10%)
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV--- 157
A G+ + + + +DT+ D W+ C C ++ +S+T+ C ++ CKQ+
Sbjct: 154 ATDGSSSPPVTVVLDTAGDVPWMRCVPCTFAQCADYDPTRSSTYSAFPCNSSACKQLGRY 213
Query: 158 PNPTCGGGACAFNL-TYGSS-TIAANLSQDTISLAT-DIVPGYTFGCIQKATGN-SVPPQ 213
N G C + + T G S T + S D +++ + D V G+ FGC Q G+
Sbjct: 214 ANGCDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSGDRVEGFRFGCSQNEQGSFENQAD 273
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKN 272
G++ LGRG SL+AQT + Y FSYCLP + + G ++G PIG R TP+LK
Sbjct: 274 GIMALGRGVQSLMAQTSSTYGDAFSYCLPPTE--TTKGFFQIGVPIGASYRFVTTPMLKE 331
Query: 273 PRRSS-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
+S LY LLAI V + +++P AGT++DS T+ TRL AY
Sbjct: 332 RGGASAAAATLYRALLLAITVDGKELNVPAEVF------AAGTVMDSRTIITRLPVTAYG 385
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMN-VTLPQDNLLIHS 382
A+R FR R+ + DTCY + V P I L+F G V + + +L++
Sbjct: 386 ALRAAFRNRMRYRV-APPQEELDTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSGILLNG 444
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA A+ D +S +++ N+QQQ ++L+DV R+G C
Sbjct: 445 ------CLAFASNDD--DSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 124/426 (29%), Positives = 187/426 (43%), Gaps = 58/426 (13%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT------------Y 97
K LS E + + + +AR LS AV ++ SG+ Q+P Y
Sbjct: 42 KQLSRPELIRRAMRRSKARAAALS--AVRNRARF---SGKNEQQTPAGVLPVRPSGDLEY 96
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQC 154
+V IGTP Q + +DT +D W C C C S +F QS +++ + C C
Sbjct: 97 VVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLC 156
Query: 155 KQVPNPTCGG-GACAFNLTYGSSTIAANL---------SQDTISLATDIVPGYTFGCIQK 204
+ + +C C + YG T+ + S L T VP FGC
Sbjct: 157 SDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP-LGFGCGSV 215
Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS----GSLRLGPIGQ 260
G+ G++G GR LSL++Q L FSYCL S+ + S GSL G G
Sbjct: 216 NVGSLNNGSGIVGFGRNPLSLVSQ---LSIRRFSYCLTSYASRRQSTLLFGSLSDGVYGD 272
Query: 261 PK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
R++ TPLL++P+ + YYV+ + VG R + IP A P G I+DSGT T
Sbjct: 273 ATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALT 332
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSVP-----------IVAPTITLMF 366
L A V FR+++ L + G + C+ VP + P + L F
Sbjct: 333 LLPAAVLAEVVRAFRQQL--RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHF 390
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G ++ LP+ N ++ CL +A + D+ ++ I N+ QQ+ R+LYD+ L
Sbjct: 391 QGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGST----IGNLVQQDMRVLYDLEAETLS 446
Query: 427 VARELC 432
+A C
Sbjct: 447 IAPARC 452
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 165/372 (44%), Gaps = 50/372 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAA 152
Y++ +GTP + + + +DT +D W C C+ C ++ V + A S+T L C A
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPCDAP 149
Query: 153 QCKQVPNPTCGG-----GACAFNLTYGSSTI-AANLSQDTISLATDIVPG------YTFG 200
C+ +P +CGG +C + YG ++ L+ D+ + D G TFG
Sbjct: 150 LCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVTFG 209
Query: 201 CIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-- 257
C G G+ G GRG SL +Q L ++FSYC S S + LG
Sbjct: 210 CGHINKGIFQANETGIAGFGRGRWSLPSQ---LNVTSFSYCFTSMFDTKSSSVVTLGAAA 266
Query: 258 --------IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
++ T L+KNP + SLY+V L I VG V +P L+ +
Sbjct: 267 AELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLR------SS 320
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-------PTI 362
TIIDSG T L Y AV+ F +VG D C+++P+ A P +
Sbjct: 321 TIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWRRPAVPAL 380
Query: 363 TLMFS-GMNVTLPQDNLLIHSTAGSITCLAM-AAAPDNVNSVLNVIANMQQQNHRILYDV 420
TL G + LP+ N + A + C+ + AAA + V VI N QQQN ++YD+
Sbjct: 381 TLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQV-----VIGNYQQQNTHVVYDL 435
Query: 421 PNSRLGVARELC 432
N L A C
Sbjct: 436 ENDVLSFAPARC 447
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 122/436 (27%), Positives = 196/436 (44%), Gaps = 66/436 (15%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVL--EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
VFH CS P+ L + V+ E + + +L F ++++
Sbjct: 30 VFHSIHLCSSLNPALVLPLKTQVIPPESVRRSPDKLPFRHNISLT--------------- 74
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSSTVFNSAQSTTFKNLGCQA 151
V +GTP Q + M +DT ++ +W+ C SS+ FN S+++ + C +
Sbjct: 75 -----VSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCSS 129
Query: 152 AQC----KQVP-NPTCGGGA-CAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQK 204
+ C + P P+C C L+Y +S+ NL+ DT + + +P FGC+
Sbjct: 130 STCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDS 189
Query: 205 ATGNSVPPQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--I 258
++ GL+G+ RGSLS ++Q + FSYC+ + FSG L LG
Sbjct: 190 IFSSNSEEDSKNTGLMGMNRGSLSFVSQ---MGFPKFSYCISEYD---FSGLLLLGDANF 243
Query: 259 GQPKRIKYTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
+ YTPL++ P + Y V L I+V +++ IP + + T T++D
Sbjct: 244 SWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVD 303
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PT 361
SGT FT L+ PAYTA+RD F + +L V G D CY VP P+
Sbjct: 304 SGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPS 363
Query: 362 ITLMFSGMNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
+TL+F G +T+ D +L SI C + D + VI ++ QQN +
Sbjct: 364 VTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNS-DLLGVEAFVIGHLHQQNVWM 422
Query: 417 LYDVPNSRLGVARELC 432
+D+ SR+G+A C
Sbjct: 423 EFDLKKSRIGLAEIRC 438
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 128/449 (28%), Positives = 199/449 (44%), Gaps = 72/449 (16%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--------- 82
+++ + H PC+P S + S+ E L +D+AR ++ + A ++
Sbjct: 97 ASVPLVHRHGPCAPSAAS---GGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAG 153
Query: 83 ----VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSS 133
+P G + S Y+V IGTPA + +DT +D +WV C C
Sbjct: 154 GGTSIPTFLGDSV-NSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKD 212
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-------GGA---CAFNLTYGS-STIAANL 182
+F+ + S+++ ++ C + C+++ G GGA C + + YG+ +T
Sbjct: 213 PLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVY 272
Query: 183 SQDTISLATDIV-PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
S +T++L +V + FGC G GLLGLG SL++QT + + FSYCL
Sbjct: 273 STETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 332
Query: 242 PSFKALSFSGSLRLGPIGQPKR---------IKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
P SG +G P + +TP+ + P + Y V L I VG
Sbjct: 333 P-----PTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAP 387
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGG-FD 350
+ IPP A +G +IDSGTV T L A AY A+R FR + L S GG D
Sbjct: 388 LAIPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLD 441
Query: 351 TCYS----VPIVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
TCY + PTI+L FSG +++ P L+ CLA A A ++ +
Sbjct: 442 TCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG-------CLAFAGA--GTDNAI 492
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
+I N+ Q+ +LYD +G C
Sbjct: 493 GIIGNVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 109/334 (32%), Positives = 161/334 (48%), Gaps = 21/334 (6%)
Query: 112 MAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC--GGGA 166
M +DT +D WV C C C S VF+ + S ++ + C + +C+ + C GA
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60
Query: 167 CAFNLTYGS-STIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
C + + YG S + + +T++L V GC G V GLL LG G LS
Sbjct: 61 CLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLS 120
Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLL 284
+Q + STFSYCL + + S +L+ G PL+++PR S+ YYV L
Sbjct: 121 FPSQ---ISASTFSYCLVDRDSPAAS-TLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALS 176
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAG-TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
I VG + + IP A + T+G+G I+DSGT TRL + AY A+RD F + S
Sbjct: 177 GISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRT 236
Query: 344 TSLGGFDTCYSV----PIVAPTITLMFSGMN-VTLPQDNLLIHSTAGSITCLAMAAAPDN 398
+ + FDTCY + + P ++L F G + LP N LI CLA A
Sbjct: 237 SGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAP---- 292
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
N+ +++I N+QQQ R+ +D +G C
Sbjct: 293 TNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 127/440 (28%), Positives = 196/440 (44%), Gaps = 57/440 (12%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS------------LAVARK- 80
L + H SPCSP PL + +LA D AR+ L++ L +R
Sbjct: 45 LTLHHPQSPCSP----APLPADLPFSAVLAHDGARVASLAARLAKTPSSRPTLLDESRAG 100
Query: 81 -------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
+ VP+ G + Y+ R +GTPA++ +M +DT + W+ C+
Sbjct: 101 SSSSSSPDDESSLASVPLGPGTSVGVG-NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSP 159
Query: 128 CV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSST 177
CV C S VFN S+++ ++ C A QC + T +C+ + +YG S+
Sbjct: 160 CVVSCHRQSGPVFNPKASSSYTSVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSS 219
Query: 178 IAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST 236
+ LS+DT+S + VP + +GC Q G GL+GL R LSLL Q +
Sbjct: 220 FSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYS 279
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
FSYCLP+ + S P + YTP+ + SLY++ + I+V + + +
Sbjct: 280 FSYCLPTSSSSSSGYLSIG--SYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVS 337
Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--- 353
A P TIIDSGTV TRL Y+A+ + ++ DTC+
Sbjct: 338 SSAYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQ 392
Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ + P +T+ F+G L+ + TCLA A A +I N QQQ
Sbjct: 393 AARLRVPEVTMAFAGGAALKLAARNLLVDVDSATTCLAFAPARSAA-----IIGNTQQQT 447
Query: 414 HRILYDVPNSRLGVARELCT 433
++YDV NS++G A C+
Sbjct: 448 FSVVYDVKNSKIGFAAGGCS 467
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 116/361 (32%), Positives = 161/361 (44%), Gaps = 36/361 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y+V IGTP Q + + +DT +D W C C C F+ + S+T C +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 154 CKQVPNPTCG------GGACAFNLTYGSSTIAAN-LSQDTISL--ATDIVPGYTFGCIQK 204
C+ +P +CG C + +YG ++ L D + A VPG FGC
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201
Query: 205 ATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
G + G+ G GRG LSL +Q L FS+C + L S L P K
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKPSTVLLDLPADLYKS 258
Query: 264 ----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
++ TPL++NP + YY++L I VG + +P TG GTIIDSGT T
Sbjct: 259 GRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTG-GTIIDSGTAMT 317
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPIVA----PTITLMFSGMNVTL 373
L Y VRD F +V L V S D C S P+ A P + L F G + L
Sbjct: 318 SLPTRVYRLVRDAFAAQV--KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDL 375
Query: 374 PQDNLL--IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
P++N + + SI CLA+ + + I N QQQN +LYD+ NS+L
Sbjct: 376 PRENYVFEVEDAGSSILCLAIIEGGE-----VTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430
Query: 432 C 432
C
Sbjct: 431 C 431
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 127/440 (28%), Positives = 196/440 (44%), Gaps = 57/440 (12%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS------------LAVARK- 80
L + H SPCSP PL + +LA D AR+ L++ L +R
Sbjct: 45 LTLHHPQSPCSP----APLPADLPFSAVLAHDGARVASLAARLAKTPSSRPTLLDESRAG 100
Query: 81 -------------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
+ VP+ G + Y+ R +GTPA++ +M +DT + W+ C+
Sbjct: 101 SSSSSSPDDESSLASVPLGPGTSVGVG-NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSP 159
Query: 128 CV-GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSST 177
CV C S VFN S+++ ++ C A QC + T +C+ + +YG S+
Sbjct: 160 CVVSCHRQSGPVFNPKASSSYTSVSCSAQQCSDLTTATLSPASCSTSNVCIYQASYGDSS 219
Query: 178 IAAN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST 236
+ LS+DT+S + VP + +GC Q G GL+GL R LSLL Q +
Sbjct: 220 FSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYS 279
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
FSYCLP+ + S P + YTP+ + SLY++ + I+V + + +
Sbjct: 280 FSYCLPTSSSSSSGYLSIG--SYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVS 337
Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--- 353
A P TIIDSGTV TRL Y+A+ + ++ DTC+
Sbjct: 338 SSAYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQ 392
Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ + P +T+ F+G L+ + TCLA A A +I N QQQ
Sbjct: 393 AARLRVPEVTMAFAGGAALKLAARNLLVDVDSATTCLAFAPARSAA-----IIGNTQQQT 447
Query: 414 HRILYDVPNSRLGVARELCT 433
++YDV NS++G A C+
Sbjct: 448 FSVVYDVKNSKIGFAAGGCS 467
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 116/361 (32%), Positives = 161/361 (44%), Gaps = 36/361 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y+V IGTP Q + + +DT +D W C C C F+ + S+T C +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 154 CKQVPNPTCG------GGACAFNLTYGSSTIAAN-LSQDTISL--ATDIVPGYTFGCIQK 204
C+ +P +CG C + +YG ++ L D + A VPG FGC
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201
Query: 205 ATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
G + G+ G GRG LSL +Q L FS+C + L S L P K
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKPSTVLLDLPADLYKS 258
Query: 264 ----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
++ TPL++NP + YY++L I VG + +P TG GTIIDSGT T
Sbjct: 259 GRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTG-GTIIDSGTAMT 317
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPIVA----PTITLMFSGMNVTL 373
L Y VRD F +V L V S D C S P+ A P + L F G + L
Sbjct: 318 SLPTRVYRLVRDAFAAQV--KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDL 375
Query: 374 PQDNLL--IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
P++N + + SI CLA+ + + I N QQQN +LYD+ NS+L
Sbjct: 376 PRENYVFEVEDAGSSILCLAIIEGGE-----VTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430
Query: 432 C 432
C
Sbjct: 431 C 431
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 123/397 (30%), Positives = 198/397 (49%), Gaps = 36/397 (9%)
Query: 61 MLAKDQARLQFL---------SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLL 111
ML +DQ R++ + S ++ +P+ SG + + Y+V+ +GTP +L
Sbjct: 1 MLLQDQLRVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLG-AGNYLVKMALGTPKLSLS 59
Query: 112 MAMDTSNDAAWVPCTGCVGC----SSTVFNSAQSTTFKNLGCQAAQCKQVPNP----TCG 163
+A+DT +D W C CVG + T F+ +S+++KN+ C ++ C+ + + C
Sbjct: 60 LALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCV 119
Query: 164 GGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRG 221
C + + YG + + + + ++++ +D++ + FGC Q+ G GLLGLGRG
Sbjct: 120 SSTCIYKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIAGLLGLGRG 179
Query: 222 SLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-PKRIKYTPLLKNPRRSSLYY 280
LSL QT Y + F+YCLPSF + S +G L LG GQ PK +K+TPL + + Y
Sbjct: 180 KLSLALQTSEKYNNLFTYCLPSFSSSS-TGHLTLG--GQVPKSVKFTPLSPAFKNTPFYG 236
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
+++ + VG V+ I + AG IIDSGTV TRL Y+A+ F++ +
Sbjct: 237 IDIKGLSVGGHVLPIDASVF-----SNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDY 291
Query: 341 LTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAA 395
DTCY I P I+ F G+ V + +L A CLA A
Sbjct: 292 PKTDGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPN 351
Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D+ + V V N QQQ + +++D+ R+G A C
Sbjct: 352 DDDGDFV--VFGNSQQQTYDVVHDLAKGRIGFAPSGC 386
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 127/438 (28%), Positives = 196/438 (44%), Gaps = 55/438 (12%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS------------LAVARK- 80
L + H SPCSP PL + +LA D AR+ L++ L +R
Sbjct: 45 LTLHHPQSPCSP----APLPADLPFSAVLAHDGARIASLAARLAKTPSSRPTLLDESRAG 100
Query: 81 -----------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
+ VP+ G + Y+ R +GTPA++ +M +DT + W+ C+ CV
Sbjct: 101 SSSSSPDDESLASVPLGPGTSVGVG-NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCV 159
Query: 130 -GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSSTIA 179
C S VFN S+++ ++ C A QC + T +C+ + +YG S+ +
Sbjct: 160 VSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFS 219
Query: 180 AN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
LS+DT+S + VP + +GC Q G GL+GL R LSLL Q +FS
Sbjct: 220 VGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFS 279
Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
YCLP+ + S P + YTP+ + SLY++ + I+V + + +
Sbjct: 280 YCLPTSSSSSSGYLSIG--SYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSS 337
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SV 355
A P TIIDSGTV TRL Y+A+ + ++ DTC+ +
Sbjct: 338 AYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAA 392
Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
+ P +T+ F+G L+ + TCLA A A +I N QQQ
Sbjct: 393 RLRVPEVTMAFAGGAALKLAARNLLVDVDSATTCLAFAPARSAA-----IIGNTQQQTFS 447
Query: 416 ILYDVPNSRLGVARELCT 433
++YDV NS++G A C+
Sbjct: 448 VVYDVKNSKIGFAAAGCS 465
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 127/438 (28%), Positives = 196/438 (44%), Gaps = 55/438 (12%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS------------LAVARK- 80
L + H SPCSP PL + +LA D AR+ L++ L +R
Sbjct: 45 LTLHHPQSPCSP----APLPADLPFSAVLAHDGARIASLAARLAKTPSSRPTLLDESRAG 100
Query: 81 -----------SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
+ VP+ G + Y+ R +GTPA++ +M +DT + W+ C+ CV
Sbjct: 101 SSSSSPDDESLASVPLGPGTSVGVG-NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCV 159
Query: 130 -GC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA------FNLTYGSSTIA 179
C S VFN S+++ ++ C A QC + T +C+ + +YG S+ +
Sbjct: 160 VSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFS 219
Query: 180 AN-LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
LS+DT+S + VP + +GC Q G GL+GL R LSLL Q +FS
Sbjct: 220 VGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFS 279
Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
YCLP+ + S P + YTP+ + SLY++ + I+V + + +
Sbjct: 280 YCLPTSSSSSSGYLSIG--SYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSS 337
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SV 355
A P TIIDSGTV TRL Y+A+ + ++ DTC+ +
Sbjct: 338 AYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAA 392
Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
+ P +T+ F+G L+ + TCLA A A +I N QQQ
Sbjct: 393 RLRVPEVTMAFAGGAALKLAARNLLVDVDSATTCLAFAPARSAA-----IIGNTQQQTFS 447
Query: 416 ILYDVPNSRLGVARELCT 433
++YDV NS++G A C+
Sbjct: 448 VVYDVKNSKIGFAAGGCS 465
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 117/400 (29%), Positives = 173/400 (43%), Gaps = 52/400 (13%)
Query: 76 AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV 135
A ARK+VV A + Y+V+ IGTP A+DT++D W C C GC V
Sbjct: 70 ASARKAVV--AETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQV 127
Query: 136 ---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTIS 188
FN S+T+ L C + C ++ CG +C + TY G++T L+ D +
Sbjct: 128 DPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLV 187
Query: 189 LATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
+ D G FGC +TG + PPQ G++GLGRG LSL++Q L F+YCLP A
Sbjct: 188 IGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQ---LSVRRFAYCLPP-PA 243
Query: 247 LSFSGSLRLGPIGQPKRIK----YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG---- 298
G L LG R P+ ++PR S YY+NL + +G R + +PP
Sbjct: 244 SRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTT 303
Query: 299 -------------------ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
A+ G IID + T L A Y + + +
Sbjct: 304 ATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRL 363
Query: 340 NLTVTSLGGFDTCYSVP-------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM 392
S G D C+ +P + P + L F G + L + L + CL +
Sbjct: 364 PRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMV 423
Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
A SV +++ N QQQN ++LY++ R+ + C
Sbjct: 424 GRA--EAGSV-SILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 128/435 (29%), Positives = 193/435 (44%), Gaps = 62/435 (14%)
Query: 55 EESVLEMLAKDQARLQFL-----------------SSLAVARKSVVPIASGRQITQSPTY 97
EES+L++ KD R++ + A++ + V + SG + S Y
Sbjct: 93 EESLLDLAEKDAVRIETMYRRAARSGGGRMPASSSPRRALSERMVATVESGVAVG-SGEY 151
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC 154
++ +GTP + M MDT +D W+ C C+ C VF+ A S++++N+ C +C
Sbjct: 152 LMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHRC 211
Query: 155 KQVPNP---------TC---GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVP 195
V P TC G C + YG S +L+ ++ ++ A+ V
Sbjct: 212 GHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVD 271
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF------KALSF 249
G FGC + G GLLGLGRG LS +Q + +Y TFSYCL K +
Sbjct: 272 GVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVGSKVVFG 331
Query: 250 SGSLRLGPIGQPKRIKYTPL----LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
L P+ +KYT + + YYV L + VG +++I
Sbjct: 332 EDDDALALAAHPQ-LKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGKD 390
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYSVPIVA----P 360
GTIIDSGT + V PAY +R F R+ + V CY+V V P
Sbjct: 391 GSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLSPCYNVSGVERPEVP 450
Query: 361 TITLMFS-GMNVTLPQDNLLIH--STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
++L+F+ G P +N I GSI CLA+ P + +++I N QQQN ++
Sbjct: 451 ELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTP---RTGMSIIGNFQQQNFHVV 507
Query: 418 YDVPNSRLGVARELC 432
YD+ N+RLG A C
Sbjct: 508 YDLQNNRLGFAPRRC 522
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 117/400 (29%), Positives = 173/400 (43%), Gaps = 52/400 (13%)
Query: 76 AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV 135
A ARK+VV A + Y+V+ IGTP A+DT++D W C C GC V
Sbjct: 70 ASARKAVV--AETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQV 127
Query: 136 ---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTIS 188
FN S+T+ L C + C ++ CG +C + TY G++T L+ D +
Sbjct: 128 DPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLV 187
Query: 189 LATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
+ D G FGC +TG + PPQ G++GLGRG LSL++Q L F+YCLP A
Sbjct: 188 IGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQ---LSVRRFAYCLPP-PA 243
Query: 247 LSFSGSLRLGPIGQPKRIK----YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG---- 298
G L LG R P+ ++PR S YY+NL + +G R + +PP
Sbjct: 244 SRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTT 303
Query: 299 -------------------ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
A+ G IID + T L A Y + + +
Sbjct: 304 ATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRL 363
Query: 340 NLTVTSLGGFDTCYSVP-------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM 392
S G D C+ +P + P + L F G + L + L + CL +
Sbjct: 364 PRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMV 423
Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
A SV +++ N QQQN ++LY++ R+ + C
Sbjct: 424 GRA--EAGSV-SILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 119/399 (29%), Positives = 165/399 (41%), Gaps = 66/399 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCV------------------------G 130
Y + +G +Q + + MDT +D W PCT C+
Sbjct: 75 YTLSFNLGPHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDPSPPTNISHSTPIS 134
Query: 131 CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYGSSTIAANLSQDTI 187
C+S + A S+T + C A C + CG C F YG ++ A+L +DT+
Sbjct: 135 CNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLIASLYRDTL 194
Query: 188 SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSYCL--P 242
SL+T + +TFGC P G+ G GRG LSL AQ + FSYCL
Sbjct: 195 SLSTLQLTNFTFGCAHTTFSE---PTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSH 251
Query: 243 SFKA--LSFSGSLRLGPIGQPKR--------IKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
SF++ + L LG K+ YT +L+NP+ S Y V L I VG++
Sbjct: 252 SFRSERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGLKGISVGKKT 311
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV-GSNL---TVTSLGG 348
V P + N G ++DSGT FT L Y +V + F RR SN + G
Sbjct: 312 VPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQKTG 371
Query: 349 FDTCY--SVPIVAPTITLMFSGMN--VTLPQDNLLIHSTAGS--------ITCLAMAAAP 396
CY + + P +TL F GMN V LP+ N G + CL
Sbjct: 372 LSPCYYLNTAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLMFMNGG 431
Query: 397 DNVN---SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D V+ N QQQ + YD+ R+G AR C
Sbjct: 432 DEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 169/370 (45%), Gaps = 42/370 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-----SSTVFNSAQSTTFKNLGCQA 151
Y + +GTP + +DT ++ W C C C + V A+S+TF L C
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 152 AQCKQVPNP----TCGG-GACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKA 205
+ C+ +P TC ACA+N TYGS A L+ +T+++ P FGC +
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSGYTAGYLATETLTVGDGTFPKVAFGCSTENG 210
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-- 263
NS G++GLGRG LSL++Q L FSYCL S A + + G + +
Sbjct: 211 VDNS---SGIVGLGRGPLSLVSQ---LAVGRFSYCLRSDMADGGASPILFGSLAKLTERS 264
Query: 264 -IKYTPLLKNP--RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT-GAGTIIDSGTVFT 319
++ TPLLKNP +RS+ YYVNL I V + + F T G GTI+DSGT T
Sbjct: 265 VVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLT 324
Query: 320 RLVAPAYTAVRDVFRRRVGS-NLTVTSLGG---FDTCYS-------VPIVAPTITLMFS- 367
L Y V+ F+ ++ + N T + G D CY + P + L F+
Sbjct: 325 YLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAG 384
Query: 368 GMNVTLPQDNLLIHSTAGS-----ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
G +P N A S + CL + A D++ +++I N+ Q + +LYD+
Sbjct: 385 GAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PISIIGNLMQMDMHLLYDIDG 442
Query: 423 SRLGVARELC 432
A C
Sbjct: 443 GMFSFAPADC 452
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 173/371 (46%), Gaps = 44/371 (11%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--FNSAQSTTFKNLGCQAAQCKQ 156
V +GTP Q + M +DT ++ +W+ C +S FN +S +++ + C ++ C
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPTTFNQTRSISYRPIPCSSSTCTN 92
Query: 157 ------VPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+P C L+Y +S+ NL+ DT + +PG FGC+ ++
Sbjct: 93 QTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASDIPGMVFGCMDSVFSSN 152
Query: 210 VPPQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKR 263
GL+G+ RGSLS ++Q + FSYC+ FSG L LG
Sbjct: 153 SDEDSKNTGLMGMNRGSLSFVSQ---MGFPKFSYCI---SGTDFSGMLLLGESNFTWAVP 206
Query: 264 IKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
+ YTPL++ P + Y V L I+V R++ IP + + T T++DSGT F
Sbjct: 207 LNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDSGTQF 266
Query: 319 TRLVAPAYTAVRDVFRR------RVGSNLTVTSLGGFDTCYSVPIVA------PTITLMF 366
T L+ PAYTA+R F RV + G D CY VPI PT++L+F
Sbjct: 267 TFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPTVSLVF 326
Query: 367 SGMNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
+G +T+ + +L S+ CL+ + D + VI + QQN + +D+
Sbjct: 327 NGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNS-DLLGVEAYVIGHHHQQNVWMEFDLE 385
Query: 422 NSRLGVARELC 432
SR+G+A+ C
Sbjct: 386 RSRIGLAQVRC 396
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 119/419 (28%), Positives = 182/419 (43%), Gaps = 31/419 (7%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
T+ + H SP SPF S+ + + L + +R+ +A A SV P A+ +T
Sbjct: 33 TVDLIHRDSPLSPFYNSEETDLQR-INNALRRSISRVHHFDPIAAA--SVSPKAAESDVT 89
Query: 93 QS-PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLG 148
+ Y++ +GTP ++ DT +D W C C C V F+ S T+++
Sbjct: 90 SNRGEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFS 149
Query: 149 CQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATDI-----VPGYTFGCI 202
C A QC + TC G C + +YG S N++ DTI+L + P GC
Sbjct: 150 CDARQCSLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCG 209
Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP--- 257
+ G S G++GLG G LSL++Q + FSYCL P S L G
Sbjct: 210 HENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAV 269
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT-IIDSGT 316
+ P ++ TPLL + SS Y++ L A+ VG + +L TG G IIDSGT
Sbjct: 270 VSGPG-VQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLG----TGEGNIIIDSGT 324
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLP 374
T + ++ + +V G CYS + P IT F+G +V L
Sbjct: 325 TLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSDLKVPAITAHFTGADVKLK 384
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
N + + + CLA A+ S +++ N+ Q N + Y++ L CT
Sbjct: 385 PINTFVQ-VSDDVVCLAFAS----TTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDCT 438
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 169/370 (45%), Gaps = 42/370 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-----SSTVFNSAQSTTFKNLGCQA 151
Y + +GTP + +DT ++ W C C C + V A+S+TF L C
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 152 AQCKQVPNP----TCGG-GACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKA 205
+ C+ +P TC ACA+N TYGS A L+ +T+++ P FGC +
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSGYTAGYLATETLTVGDGTFPKVAFGCSTENG 210
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-- 263
NS G++GLGRG LSL++Q L FSYCL S A + + G + +
Sbjct: 211 VDNS---SGIVGLGRGPLSLVSQ---LAVGRFSYCLRSDMADGGASPILFGSLAKLTEGS 264
Query: 264 -IKYTPLLKNP--RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT-GAGTIIDSGTVFT 319
++ TPLLKNP +RS+ YYVNL I V + + F T G GTI+DSGT T
Sbjct: 265 VVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLT 324
Query: 320 RLVAPAYTAVRDVFRRRVGS-NLTVTSLGG---FDTCY-------SVPIVAPTITLMFS- 367
L Y V+ F+ ++ + N T + G D CY + P + L F+
Sbjct: 325 YLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAG 384
Query: 368 GMNVTLPQDNLLIHSTAGS-----ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
G +P N A S + CL + A D++ +++I N+ Q + +LYD+
Sbjct: 385 GAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PISIIGNLMQMDMHLLYDIDG 442
Query: 423 SRLGVARELC 432
A C
Sbjct: 443 GMFSFAPADC 452
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 160/372 (43%), Gaps = 38/372 (10%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y + +GTP + + + +DT +D +W+ C C C + + + S+T++N+ C +
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPR 230
Query: 154 CK-----------QVPNPTC---GGGACAFNLT--YGSSTIAANLSQDTISLATDIVPGY 197
C+ + N TC A N T + S T NL+ V
Sbjct: 231 CQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDV 290
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLG 256
FGC G GLLGLGRG +S +Q Q++Y +FSYCL F S S L G
Sbjct: 291 MFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLIFG 350
Query: 257 PIGQ---PKRIKYTPLLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPT-----T 306
+ + +T LL + YY+ + +I VG V+DI ++
Sbjct: 351 EDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADA 410
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPT 361
G GTIIDSG+ T AY +++ F +++ CY+V + P
Sbjct: 411 GGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQVELPD 470
Query: 362 ITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ F+ V P +N + CLA+ P+ +S L +I N+ QQN ILYDV
Sbjct: 471 FGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPN--HSHLTIIGNLLQQNFHILYDV 528
Query: 421 PNSRLGVARELC 432
SRLG + C
Sbjct: 529 KRSRLGYSPRRC 540
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 128/417 (30%), Positives = 193/417 (46%), Gaps = 49/417 (11%)
Query: 36 VFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP 95
+ H PC+P S S+ EM + ARL ++ S +K VP G + +S
Sbjct: 58 LLHRHGPCAP---SLSTDTPPSMSEMFRRSHARLSYIVS---GKKVSVPAHLGTSV-KSL 110
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--CS---STVFNSAQSTTFKNLGCQ 150
Y+ GTPA ++ +DT +D W+ C C CS +F+ + S+T+ + C
Sbjct: 111 EYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCA 170
Query: 151 AAQCKQVPNPTCGGGA-----CAFNLTY--GSSTIAANLSQDTISLATD-IVPGYTFGCI 202
+ +CK++ G G C F ++Y G+ST+ +D ++LA IV + FGC
Sbjct: 171 SGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGV-YGKDKLTLAPGAIVKDFYFGCG 229
Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK 262
+ GLLGLGR S SL AQ FSYCLP+ S G L G P
Sbjct: 230 HSKSSLPGLFDGLLGLGRLSESLGAQYGG--GGGFSYCLPAVN--SKPGFLAFGAGRNPS 285
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
+TP+ + P + + V L I VG + +D+ P A G I+DSGTV T L
Sbjct: 286 GFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFS------GGMIVDSGTVVTVLQ 339
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSG---MNVTLPQ 375
+ Y A+R FR + + V G DTCY + +V P I L FSG +N+ +P
Sbjct: 340 STVYRALRAAFREAMKAYRLVH--GDLDTCYDLTGYKNVVVPKIALTFSGGATINLDVP- 396
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ +L++ CLA A + V+ N+ Q+ +L+D S+ G + C
Sbjct: 397 NGILVNG------CLAFAET--GKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 107/340 (31%), Positives = 158/340 (46%), Gaps = 23/340 (6%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IG P + DT +D W C C C + V++ + S+TF L C +A
Sbjct: 71 YLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSAT 130
Query: 154 CKQVPNPTCGGGA-CAFNLTYGSSTIAAN-LSQDTISLATDIVP----GYTFGCIQKATG 207
C + + C + C + YG +A L +T++L P G FGC G
Sbjct: 131 CLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDNGG 190
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ----PKR 263
+S+ G +GLGRG+LSLLAQ L FSYCL F + LG + + P
Sbjct: 191 DSLNSTGTVGLGRGTLSLLAQ---LGVGKFSYCLTDFFNSALDSPFLLGTLAELAPGPST 247
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
++ TPLL++P+ S Y+V+L I +G + IP G G I+DSGT FT L
Sbjct: 248 VQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGTTFTILAE 307
Query: 324 PAYTAVRDVFRRRVGS-NLTVTSLGG--FDTCYSVPIVAPTITLMFS-GMNVTLPQDNLL 379
+ V R +G + +SL F P P + L F+ G ++ L +DN +
Sbjct: 308 SGFREVVGRVARVLGQPPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYM 367
Query: 380 IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
++ S CL +A SVL N QQQN ++L+D
Sbjct: 368 SYNEEDSSFCLNIAGTTPESTSVL---GNFQQQNIQMLFD 404
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 173/370 (46%), Gaps = 43/370 (11%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ- 156
IV +GTP Q + M +DT ++ +W+ C + +T F+ +ST+++ + C + C
Sbjct: 32 IVSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYPTT-FDPTRSTSYQTIPCSSPTCTNR 90
Query: 157 -----VPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA----T 206
+P C L+Y +S+ NL+ D + + + G FGC+ +
Sbjct: 91 TQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSDISGLVFGCMDSVFSSNS 150
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRI 264
GL+G+ RGSLS ++Q L FSYC+ FSG L LG + +
Sbjct: 151 DEDSKSTGLMGMNRGSLSFVSQ---LGFPKFSYCI---SGTDFSGLLLLGESNLTWSVPL 204
Query: 265 KYTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
YTPL++ P + Y V L I+V +++ IP + + T T++DSGT FT
Sbjct: 205 NYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQFT 264
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPI------VAPTITLMFS 367
L+ P Y A+R F + S L V G D CY VP+ + PT+TL+F
Sbjct: 265 FLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTLVFR 324
Query: 368 GMNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
G +T+ D +L S+ CL+ + D + VI + QQN + +D+
Sbjct: 325 GAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNS-DLLGVEAYVIGHHHQQNVWMEFDLEK 383
Query: 423 SRLGVARELC 432
SR+G+A+ C
Sbjct: 384 SRIGLAQVRC 393
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 114/406 (28%), Positives = 183/406 (45%), Gaps = 40/406 (9%)
Query: 58 VLEMLAKDQARLQFLS------------------SLAVARKSVVPIASGRQITQSPTYIV 99
V L +D AR+QFL+ + P+ SG+ Y+
Sbjct: 91 VRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLA 150
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------VFNSAQSTTFKNLGCQAAQ 153
+ +G P + + DT +D W+ C C ++ +F+ S+++ L C + Q
Sbjct: 151 QIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQ 210
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSVP 211
CK + C C + + YG + L+ +T+S ++ +P GC G
Sbjct: 211 CKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAG 270
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLK 271
GL+GLG G++SL +Q L S+FSYCL + + S S +L P +PL+K
Sbjct: 271 GAGLIGLGGGAISLSSQ---LKASSFSYCLVNLDSDS-SSTLEFNS-NMPSDSLTSPLVK 325
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
N R S YV ++ I VG + + I P + + + G I+DSGT+ +RL + Y ++R+
Sbjct: 326 NDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLRE 385
Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGS 386
F + S + FDTCY+ + PTI + S G ++ LP N LI
Sbjct: 386 AFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAG 445
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA S L++I + QQQ R+ YD+ NS +G + C
Sbjct: 446 TYCLAFI----KTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 106/355 (29%), Positives = 159/355 (44%), Gaps = 38/355 (10%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAW----VPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
TY+V IGTP L +DT +D W PC C + ++ A+S T+ N+ C +
Sbjct: 99 TYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGS 158
Query: 152 AQCKQVPN-------------PTCGGGACAFNLTYGS-STIAANLSQDTISL-ATDIVPG 196
C +P+ P G C + +YG S+ L+ +T + A V
Sbjct: 159 RLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGAGTTVHD 218
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
FGC G + GL+G+GRG LSL++Q L + FSYC F + S L LG
Sbjct: 219 LAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQ---LGVTKFSYCFTPFNDTTTSSPLFLG 275
Query: 257 PIGQ----PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
K + P PRRSS YY++L I VG ++ I P + + G II
Sbjct: 276 SSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASGRGGLII 335
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITLM 365
DSGT FT L A+ + RV L + G C++ P + P + L
Sbjct: 336 DSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGRGPEAVDVPRLVLH 395
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
F G ++ LP+ + ++ + CL + +A ++V+ +MQQQN + YDV
Sbjct: 396 FDGADMELPRSSAVVEDRVAGVACLGIVSA-----RGMSVLGSMQQQNMHVRYDV 445
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 125/424 (29%), Positives = 182/424 (42%), Gaps = 52/424 (12%)
Query: 55 EESVLEMLAKDQARLQFL-----------------SSLAVARKSVVPIASGRQITQSPTY 97
+ES L+ KD AR+ + A+A + V + SG + S Y
Sbjct: 94 KESFLDSAGKDVARIHTMLRRVAGAGGGRAATNSTPRRALAERIVATVESGVAVG-SGEY 152
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC 154
+V +GTP + M MDT +D W+ C C+ C VF+ A S +++N+ C +C
Sbjct: 153 LVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPRC 212
Query: 155 KQVPNPTC-------GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVPGYTFG 200
V PT C + YG S +L+ + ++ A+ V FG
Sbjct: 213 GLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFG 272
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--- 257
C G GLLGLGRG+LS +Q + +Y FSYCL + S + G
Sbjct: 273 CGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGS-SVGSKIVFGDDDA 331
Query: 258 -IGQPKRIKYTPLLKNPRR--SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
+G P R+ YT + + YYV L + VG ++I P GTIIDS
Sbjct: 332 LLGHP-RLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDS 390
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYSVP----IVAPTITLMFS-G 368
GT + PAY +R F R+ V CY+V + P +L+F+ G
Sbjct: 391 GTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADG 450
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
P +N + I CLA+ P S +++I N QQQN +LYD+ N+RLG A
Sbjct: 451 AVWDFPAENYFVRLDPDGIMCLAVLGTP---RSAMSIIGNFQQQNFHVLYDLQNNRLGFA 507
Query: 429 RELC 432
C
Sbjct: 508 PRRC 511
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 119/381 (31%), Positives = 166/381 (43%), Gaps = 44/381 (11%)
Query: 84 PIASGRQITQSPT--YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNS 138
P++ G PT Y+V IGTP Q + + +DT +D W C C C F+
Sbjct: 20 PVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDP 79
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCG------GGACAFNLTYGSSTIAAN-LSQDTISL-- 189
+ S+T C + C+ +P +CG C + +YG ++ L D +
Sbjct: 80 STSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVG 139
Query: 190 ATDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYC-------L 241
A VPG FGC G + G+ G GRG LSL +Q L FS+C +
Sbjct: 140 AGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTTITGAI 196
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLL---KNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
PS L L G ++ TPL+ KN +LYY++L I VG + +P
Sbjct: 197 PSTVLLDLPADLFSNGQG---AVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPES 253
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV 358
A TG GTIIDSGT T L Y VRD F ++ + + G TC+S P
Sbjct: 254 AFALTNGTG-GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQ 312
Query: 359 A----PTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
A P + L F G + LP++N + SI CLA+ N +I N QQ
Sbjct: 313 AKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAI-----NKGDETTIIGNFQQ 367
Query: 412 QNHRILYDVPNSRLGVARELC 432
QN +LYD+ N+ L C
Sbjct: 368 QNMHVLYDLQNNMLSFVAAQC 388
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 125/424 (29%), Positives = 182/424 (42%), Gaps = 52/424 (12%)
Query: 55 EESVLEMLAKDQARLQFL-----------------SSLAVARKSVVPIASGRQITQSPTY 97
+ES L+ KD AR+ + A+A + V + SG + S Y
Sbjct: 94 KESFLDSAGKDVARIHTMLRRVAGAGGGRAATNSTPRRALAERIVATVESGVAVG-SGEY 152
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC 154
+V +GTP + M MDT +D W+ C C+ C VF+ A S +++N+ C +C
Sbjct: 153 LVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPRC 212
Query: 155 KQVPNPTC-------GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVPGYTFG 200
V PT C + YG S +L+ + ++ A+ V FG
Sbjct: 213 GLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFG 272
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--- 257
C G GLLGLGRG+LS +Q + +Y FSYCL + S + G
Sbjct: 273 CGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGS-SVGSKIVFGDDDA 331
Query: 258 -IGQPKRIKYTPLLKNPRR--SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
+G P R+ YT + + YYV L + VG ++I P GTIIDS
Sbjct: 332 LLGHP-RLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDS 390
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYSVP----IVAPTITLMFS-G 368
GT + PAY +R F R+ V CY+V + P +L+F+ G
Sbjct: 391 GTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADG 450
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
P +N + I CLA+ P S +++I N QQQN +LYD+ N+RLG A
Sbjct: 451 AVWDFPAENYFVRLDPDGIMCLAVLGTP---RSAMSIIGNFQQQNFHVLYDLQNNRLGFA 507
Query: 429 RELC 432
C
Sbjct: 508 PRRC 511
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 124/452 (27%), Positives = 196/452 (43%), Gaps = 68/452 (15%)
Query: 26 DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL---SSLAVARKSV 82
++ + +++ + H PC+P S + S+ E L +D+AR ++ ++ +
Sbjct: 37 NSDPNRASVPLVHRHGPCAP---SAASGGKPSLAERLRRDRARANYIVTKAAGGRTAATA 93
Query: 83 VPIASGRQITQSPT----------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT-----G 127
V A G T PT Y+V IGTPA ++ +DT +D +WV C
Sbjct: 94 VSDAVGGGGTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGE 153
Query: 128 CVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-------CAFNLTYGS-STIA 179
C +F+ + S+++ ++ C + C+++ G G C + + YG+ +T
Sbjct: 154 CYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTT 213
Query: 180 ANLSQDTISLATDIV-PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
S +T++L +V + FGC G GLLGLG SL++QT + + FS
Sbjct: 214 GVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 273
Query: 239 YCLPSFKALSFSGSLRLGPIGQPKRIK---------YTPLLKNPRRSSLYYVNLLAIRVG 289
YCLP SG +G P +TP+ + P + Y V L I VG
Sbjct: 274 YCLP-----PTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVG 328
Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG- 348
+ +PP A +G +IDSGTV T L A AY A+R FR + + G
Sbjct: 329 GAPLAVPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGA 382
Query: 349 -FDTCYSVP----IVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
DTCY + PTI L FSG +++ P L+ CLA A A +
Sbjct: 383 VLDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVLVDG-------CLAFAGA--GTD 433
Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ +I N+ Q+ +LYD +G C
Sbjct: 434 DTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/406 (28%), Positives = 183/406 (45%), Gaps = 40/406 (9%)
Query: 58 VLEMLAKDQARLQFLS------------------SLAVARKSVVPIASGRQITQSPTYIV 99
V L +D AR+QFL+ + P+ SG+ Y+
Sbjct: 91 VRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLA 150
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------VFNSAQSTTFKNLGCQAAQ 153
+ +G P + + DT +D W+ C C ++ +F+ S+++ L C + Q
Sbjct: 151 QIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQ 210
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSVP 211
CK + C C + + YG + L+ +T+S ++ +P GC G
Sbjct: 211 CKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAG 270
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLK 271
GL+GLG G++SL +Q L S+FSYCL + + S S +L P +PL+K
Sbjct: 271 GAGLIGLGGGAISLSSQ---LKASSFSYCLVNLDSDS-SSTLEFNSY-MPSDSLTSPLVK 325
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
N R S YV ++ I VG + + I P + + + G I+DSGT+ +RL + Y ++R+
Sbjct: 326 NDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLRE 385
Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGS 386
F + S + FDTCY+ + PTI + S G ++ LP N LI
Sbjct: 386 AFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAG 445
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA S L++I + QQQ R+ YD+ NS +G + C
Sbjct: 446 TYCLAFI----KTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 110/371 (29%), Positives = 163/371 (43%), Gaps = 38/371 (10%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
S Y + +GTP + + +DT +D W+ C C+ C S ++ S++F+N+ C
Sbjct: 194 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCH 253
Query: 151 AAQCKQV--PNP----TCGGGACAFNLTYGS----------STIAANLSQDTISLATDIV 194
+C+ V P+P +C + YG T NL+ + V
Sbjct: 254 DPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHV 313
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLR 254
FGC G GLLGLG+G LS +Q Q+LY +FSYCL + + S
Sbjct: 314 ENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKL 373
Query: 255 LGPIGQPKRIKYTPLL--------KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
+ G+ K + P L K+ + YYV + ++ V V+ IP +
Sbjct: 374 I--FGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSEG 431
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTI 362
GTIIDSGT T PAY +++ F R++ V L CY+V + P
Sbjct: 432 AGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPDF 491
Query: 363 TLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
++F+ V P +N I + CLA+ P S L++I N QQQN ILYD+
Sbjct: 492 GILFADEAVWNFPVENYFIW-IDPEVVCLAILGNP---RSALSIIGNYQQQNFHILYDMK 547
Query: 422 NSRLGVARELC 432
SRLG A C
Sbjct: 548 KSRLGYAPMKC 558
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 124/437 (28%), Positives = 192/437 (43%), Gaps = 63/437 (14%)
Query: 42 PCSPFKPSKPLSWEESVLEMLAKDQARLQF----LSSLAVARKSVVPI------ASGRQI 91
PCS + +ML DQ R + LS+ +P+ S R
Sbjct: 75 PCSSTSSRASEDMGIDIDDMLMWDQLRTSYIRTQLSTHVGVVGGGMPVIARSTTVSNRDY 134
Query: 92 TQSPTYIVRAKIGT----------------PAQTLLMAMDTSNDAAWVPCTGCV--GC-- 131
T S T V GT A + + +DTS+D WV C C C
Sbjct: 135 TPSSTASVGTNSGTSKTIEKSDQTATNEHQDAVSQTVVVDTSSDIPWVQCLPCPIPQCHL 194
Query: 132 -SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----GACAFNLTYGS-STIAANLSQ 184
+++ A+S+TF + C + CK++ + G C + + YG
Sbjct: 195 QKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTTDECKYIVNYGDGKATTGTYVT 254
Query: 185 DTISLA-TDIVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
DT++++ T +V + FGC G+ S G+L LG G SLL QT + Y + FSYC+P
Sbjct: 255 DTLTMSPTIVVKDFRFGCSHAVRGSFSNQNAGILALGGGRGSLLEQTADAYGNAFSYCIP 314
Query: 243 SFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
+ F L LG P+ + YTPL+KN + Y V+L AI V + + +PP A
Sbjct: 315 KPSSAGF---LSLGGPVEASLKFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAF- 370
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSV----P 356
G ++DSG V T+L Y A+R FR + + + + + DTCY
Sbjct: 371 -----ATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRNLDTCYDFTRFPD 425
Query: 357 IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
+ P ++L+F+ G + L ++++ CLA AA P + + I N+QQQ +
Sbjct: 426 VKVPKVSLVFAGGATLDLEPASIILDG------CLAFAATPGEES--VGFIGNVQQQTYE 477
Query: 416 ILYDVPNSRLGVARELC 432
+LYDV ++G R C
Sbjct: 478 VLYDVGGGKVGFRRGAC 494
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 129/454 (28%), Positives = 201/454 (44%), Gaps = 55/454 (12%)
Query: 26 DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS------------ 73
+ +D S +L++ S SP + + + ++S LE KD R+ +
Sbjct: 62 EQKDRSPSLKLH--MSRRSPAEATAGRTRKDSFLESAQKDGVRIATMHRRVALQAQAQPG 119
Query: 74 --------SLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
A++ + V + SG + S Y+V +GTP + M MDT +D W+ C
Sbjct: 120 RRSASSSPRRALSERLVATVESGVAVG-SGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQC 178
Query: 126 TGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP----TCGGGA---CAFNLTYGS 175
C+ C VF+ ST+++N+ C +C V P TC C + YG
Sbjct: 179 APCLDCFDQRGPVFDPMASTSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGD 238
Query: 176 -STIAANLSQDTISL-----ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
S +L+ + ++ ++ V G GC + G GLLGLGRG LS +Q
Sbjct: 239 QSNTTGDLALEAFTVNLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQL 298
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLA 285
+ +Y FSYCL + + + G + P+ + YT + ++ YYV L
Sbjct: 299 RAVYGHAFSYCLVDHGS-AVGSKIVFGDDNVLLSHPQ-LNYTAFAPSAAENTFYYVQLKG 356
Query: 286 IRVGRRVVDIPPGALQFNPTTGA-GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TV 343
I VG ++DIP + G+ GTIIDSGT + PAY A+R F R+ +
Sbjct: 357 ILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLI 416
Query: 344 TSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
CY+V + P +L+F+ G P +N I I CLA+ P
Sbjct: 417 ADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTP-- 474
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S +++I N QQQN +LYD+ ++RLG A C
Sbjct: 475 -RSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRC 507
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 118/371 (31%), Positives = 171/371 (46%), Gaps = 47/371 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y + IGTP T + DT + W PCT C + F A S+TF L C ++
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149
Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKATGNSV 210
C+ + +P TC C + YG A L+ +T+ + PG FGC + GNS
Sbjct: 150 CQFLTSPYLTCNATGCVYYYPYGMGFTAGYLATETLHVGGASFPGVAFGCSTENGVGNS- 208
Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTP 268
G++GLGR LSL++Q FSYCL S A + + G + + ++ TP
Sbjct: 209 -SSGIVGLGRSPLSLVSQVG---VGRFSYCLRS-DADAGDSPILFGSLAKVTGGNVQSTP 263
Query: 269 LLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA------GTIIDSGTVFTR 320
LL+NP SS YYVNL I VG D+P + F T GA GTI+DSGT T
Sbjct: 264 LLENPEMPSSSYYYVNLTGITVG--ATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTY 321
Query: 321 LVAPAYTAVRDVFRRRVG-SNLTVTSLG---GFDTCYS---------VPIVAPTITLMFS 367
LV Y V+ F ++ +NLT T G GFD C+ VP+ PT+ L F+
Sbjct: 322 LVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPV--PTLVLRFA 379
Query: 368 GMNVTLPQDNLLIHSTA------GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
G + + A ++ CL + A + ++ +++I N+ Q + +LYD+
Sbjct: 380 GGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLS--ISIIGNVMQMDLHVLYDLD 437
Query: 422 NSRLGVARELC 432
A C
Sbjct: 438 GGMFSFAPADC 448
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/441 (26%), Positives = 184/441 (41%), Gaps = 84/441 (19%)
Query: 18 SEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
S +P D ++L+V H PCS +P K S S ++LA+D++R+ + S
Sbjct: 3 SSACSPSPKGHDQRASLEVVHKHGPCSKLRPHKANS--PSHTQILAQDESRVASIQSRLA 60
Query: 78 ----------ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
A K+ +P S + S Y+V +G+P + L DT +D W C
Sbjct: 61 KNLAGGSNLKASKATLPSKSASTLG-SGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEP 119
Query: 128 CVG-C---SSTVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTI 178
CVG C +F+ + S ++ N+ C + C+++ + P C C + + YG +
Sbjct: 120 CVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSY 179
Query: 179 AANL-SQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQST 236
+ +++ +SL +TD+ + FGC Q G GLLGL R LSL++QT Y
Sbjct: 180 SIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKV 239
Query: 237 FSYCLPSFKALSFSGSLRLGP-IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
FSYCLPS + S +G L G G K +K+TP
Sbjct: 240 FSYCLPS--SSSSTGYLSFGSGDGDSKAVKFTP--------------------------- 270
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
RL Y++V+ VFR + V + DTCY +
Sbjct: 271 ------------------------RLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDL 306
Query: 356 P----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
+ P I L FSG +I+ S CLA A D + + +I N+QQ
Sbjct: 307 SKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSD--DDEVAIIGNVQQ 364
Query: 412 QNHRILYDVPNSRLGVARELC 432
+ ++YD R+G A C
Sbjct: 365 KTIHVVYDDAEGRVGFAPSGC 385
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 129/422 (30%), Positives = 189/422 (44%), Gaps = 51/422 (12%)
Query: 57 SVLEMLAKDQARLQFLS--------------SLAVARKSVVPIASGRQITQSPTYIVRAK 102
S L++ KD R++ + +L+ + + V + SG + S Y++
Sbjct: 93 SFLDLAEKDAVRVEAMHRRVASSSSSPRRGRALSESERVVATVESGVAVG-SAEYLMDVY 151
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPN 159
+GTP + M MDT +D W+ C C+ C VF+ A S++++NL C +C V
Sbjct: 152 VGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPRCGHVAP 211
Query: 160 PTC---------GGGACAFNLTYG---SSTIAANLSQDTISL----ATDIVPGYTFGCIQ 203
P G C + YG +ST L T++L A+ V G FGC
Sbjct: 212 PEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVDGVVFGCGH 271
Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQS-TFSYCLPSFKA-----LSFSGSLRLGP 257
+ G GLLGLGRG LS +Q + +Y TFSYCL + + F L
Sbjct: 272 RNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLVDHGSDVASKVVFGEDDALAL 331
Query: 258 IGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
P R+KYT + + YYV L + VG +++I + GTIIDSGT
Sbjct: 332 AAHP-RLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWDASEGGSGGTIIDSGT 390
Query: 317 VFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMN 370
+ V PAY +R F R+ GS V CY+V V P ++L+F+ G
Sbjct: 391 TLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVERPEVPELSLLFADGAV 450
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
P +N I I CLA+ P + +++I N QQQN + YD+ N+RLG A
Sbjct: 451 WDFPAENYFIRLDPDGIMCLAVLGTP---RTGMSIIGNFQQQNFHVAYDLHNNRLGFAPR 507
Query: 431 LC 432
C
Sbjct: 508 RC 509
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 127/440 (28%), Positives = 195/440 (44%), Gaps = 64/440 (14%)
Query: 48 PSKPLSWEESVLEMLAKDQAR---LQFLSSLAVARKS-------------VVPIASGRQI 91
P P + E + +LA D+AR LQ + A + VP+ SG +
Sbjct: 93 PDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAEVPLTSGIRF 152
Query: 92 TQSPTYIVRAKIGTPAQ------TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQST 142
Q+ Y+ +G L + +DT +D WV PC+ C +F+ + S
Sbjct: 153 -QTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSA 211
Query: 143 TFKNLGCQAAQCKQVPNPTCG-GGACA---------------FNLTYGSSTIAAN-LSQD 185
++ + C A+ C+ G G+CA ++L YG + + L+ D
Sbjct: 212 SYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATD 271
Query: 186 TISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK 245
T++L V G+ FGC G GL+GLGR LSL++QT + FSYCLP+
Sbjct: 272 TVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAAT 331
Query: 246 ALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
+ +GSL LG R + YT ++ +P + Y++N+ VG V
Sbjct: 332 SGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAV-------A 384
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS--LGGFDTCYSV---- 355
A ++DSGTV TRL Y AVR F R+ G+ + D CY++
Sbjct: 385 AAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHD 444
Query: 356 PIVAPTITLMFS-GMNVTLPQDNLLIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ P +TL G ++T+ +L + GS CLAMA+ + +I N QQ+N
Sbjct: 445 EVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASL--SFEDQTPIIGNYQQKN 502
Query: 414 HRILYDVPNSRLGVARELCT 433
R++YD SRLG A E C+
Sbjct: 503 KRVVYDTVGSRLGFADEDCS 522
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 118/386 (30%), Positives = 168/386 (43%), Gaps = 57/386 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTV---------FNSAQSTTF 144
Y V GTP QTL MDT +D W PCT C CS + F +S++
Sbjct: 67 YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSS 126
Query: 145 KNLGCQAAQCKQVPNPT------CGGGAC------AFNLTYGSSTIAANLSQDTISLATD 192
K LGC+ +C + + C +C + + YGS T +T+ L +
Sbjct: 127 KLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALSETLHLHSL 186
Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK-----AL 247
P + GC + +S P G+ G GRG SL +Q L FSYCL S +
Sbjct: 187 SKPNFLVGC---SVFSSHQPAGIAGFGRGLSSLPSQ---LGLGKFSYCLLSHRFDDDTKK 240
Query: 248 SFSGSLRLGPIGQPKR---IKYTPLLKNPR---RSSL---YYVNLLAIRVGRRVVDIPPG 298
S S L + + K+ + YTP +KNP+ +SS YY+ L I VG V +P
Sbjct: 241 SSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVKVPYK 300
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG---GFDTCYSV 355
L G IIDSGT FT + A+ + D F R++ V + G C++V
Sbjct: 301 YLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRPCFNV 360
Query: 356 P----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLNVI 406
+ P + L F G +V LP +N G + CL + A P+ V ++
Sbjct: 361 SDAKTVSFPELRLYFKGGADVALPVENYFAF-VGGEVACLTVVTDGVAGPERVGGPGMIL 419
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N Q QN + YD+ N RLG +E C
Sbjct: 420 GNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/368 (28%), Positives = 169/368 (45%), Gaps = 22/368 (5%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQ 140
P+ SG + S Y V +GTP Q + +D+ +D WV C+ C C S ++ +
Sbjct: 52 PVVSGSTLG-SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSN 110
Query: 141 STTFKNLGCQAAQCKQVPNPTCG-------GGACAFNLTYGSSTIAANL-SQDTISLATD 192
S+TF + C ++ C +P T G GACA+ Y ++ + + + ++ ++
Sbjct: 111 SSTFSPVPCLSSDCLLIP-ATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDGV 169
Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-KALSFSG 251
+ FGC G+ G+LGLG+G LS +Q Y + F+YCL ++ S S
Sbjct: 170 RIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSS 229
Query: 252 SLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
SL G I ++YTP++ NP+ +LYYV + + VG + + I A + + G
Sbjct: 230 SLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGG 289
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLM 365
+I DSGT T AY+ + F V S+ G D C + V P+ T+
Sbjct: 290 SIFDSGTTLTYWFPSAYSHILAAFDSGV-HYPRAESVQGLDLCVELTGVDQPSFPSFTIE 348
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
F V P+ A ++ CLAMA + N I N+ QQN + YD + +
Sbjct: 349 FDDGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGG-FNTIGNLLQQNFFVQYDREENLI 407
Query: 426 GVARELCT 433
G A C+
Sbjct: 408 GFAPAKCS 415
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 127/441 (28%), Positives = 195/441 (44%), Gaps = 65/441 (14%)
Query: 48 PSKPLSWEESVLEMLAKDQAR---LQFLSSLAVARKS--------------VVPIASGRQ 90
P P + E + +LA D+AR LQ + A + VP+ SG +
Sbjct: 93 PDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAAGAEVPLTSGIR 152
Query: 91 ITQSPTYIVRAKIGTPAQ------TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQS 141
Q+ Y+ +G L + +DT +D WV PC+ C +F+ + S
Sbjct: 153 F-QTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGS 211
Query: 142 TTFKNLGCQAAQCKQVPNPTCG-GGACA---------------FNLTYGSSTIAAN-LSQ 184
++ + C A+ C+ G G+CA ++L YG + + L+
Sbjct: 212 ASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLAT 271
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
DT++L V G+ FGC G GL+GLGR LSL++QT + FSYCLP+
Sbjct: 272 DTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAA 331
Query: 245 KALSFSGSLRLGPIGQPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
+ +GSL LG R + YT ++ +P + Y++N+ VG V
Sbjct: 332 TSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAV------- 384
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS--LGGFDTCYSV--- 355
A ++DSGTV TRL Y AVR F R+ G+ + D CY++
Sbjct: 385 AAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGH 444
Query: 356 -PIVAPTITLMFS-GMNVTLPQDNLLIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQ 412
+ P +TL G ++T+ +L + GS CLAMA+ + +I N QQ+
Sbjct: 445 DEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASL--SFEDQTPIIGNYQQK 502
Query: 413 NHRILYDVPNSRLGVARELCT 433
N R++YD SRLG A E C+
Sbjct: 503 NKRVVYDTVGSRLGFADEDCS 523
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 165/349 (47%), Gaps = 32/349 (9%)
Query: 102 KIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCK 155
++G P Q +DT +D W+ PC G GC + F+ S+++ + C + QC+
Sbjct: 2 RVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQ 61
Query: 156 QVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQ 213
+ C +C + + YG + L+ +T++ ++ +P + GC G V
Sbjct: 62 LLDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGAD 121
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSFKALSFSGSLRLGPIGQPKRIKYTP 268
GL+GLG G++S+ +Q L S+FSYCL PSF L F+ P +P
Sbjct: 122 GLIGLGGGAISISSQ---LKASSFSYCLVDIDSPSFSTLDFN-------TDPPSDSLISP 171
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L+KN R S YV ++ + VG + + I + + + G I+DSGT T+L + Y
Sbjct: 172 LVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEV 231
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMN-VTLPQDNLLIHST 383
+R+ F + + FDTCY + + PTI + G N + LP N LI
Sbjct: 232 LREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVD 291
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ CLA +A L++I N QQQ R+ YD+ NS +G + C
Sbjct: 292 SAGTFCLAFVSA----TFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
Length = 495
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 143/459 (31%), Positives = 205/459 (44%), Gaps = 84/459 (18%)
Query: 24 ICDTQDHSSTLQVFHVFSPCSPF------KPSKPLSWEESVLEMLAKDQARLQFLSSLAV 77
I + + L + H SPCSP K KP S+ E+L +D RLQ+LS +
Sbjct: 44 ITSGHTNGNKLPLVHRLSPCSPVTGGGAQKKGKP-----SLQEILHRDGLRLQYLSQVQA 98
Query: 78 ARKSV------------------VPIASGRQITQSP---TYIVRAKIGTPAQTLLMAMDT 116
A + VP A+ I+ P Y V A GTPAQ L + D
Sbjct: 99 ATAAAAPAAAPAPSATTPASGLSVP-ATQNIISSLPGVFEYTVLAGYGTPAQQLPLFFDV 157
Query: 117 S--NDAAWVPC-TGCVGCSST-----VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA 168
S ++ PC +G G +T F+ + S++F+++ C + C + GG+C
Sbjct: 158 SGMSNMRCKPCFSGSSGGETTTTCDVAFDPSMSSSFRSVLCGSPDCGG--HSCSAGGSCT 215
Query: 169 FNL-----TYGSSTIAANLSQDTISLA-TDIVPGYTFGCIQKATGNSVPPQGL------L 216
F L +G+ TI DT++L+ + + GC+Q N + G+ L
Sbjct: 216 FTLQNSTFVFGNGTIV----MDTLTLSPSATFENFAVGCMQ--LDNDLFTDGVAVGNIDL 269
Query: 217 GLGRGSLSL-LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP----IGQPKRIKYTPLLK 271
L R SL+ + + + FSYCLP+ G L + P +KY PL+
Sbjct: 270 SLSRHSLATRVLNSSPPGMAAFSYCLPA--DTDTHGFLTIAPALSDYSDHAGVKYVPLVT 327
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
NP + YYV+L+AI + + IPP TG GT+IDS + FT L P Y A+RD
Sbjct: 328 NPTGPNFYYVDLVAIAINGEDLPIPPALF-----TGNGTMIDSQSAFTYLNPPIYAALRD 382
Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLI----HS 382
FR+ + V + GG DTCY+ I P ITL FS G + L + H
Sbjct: 383 EFRKAMLQYQPVPAFGGLDTCYNFTLAENIYLPDITLRFSNGETMDLDDRQFMYFFREHL 442
Query: 383 TAG-SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
T G CLA AAAPD N N + + Q+ I+YDV
Sbjct: 443 TDGFPFGCLAFAAAPDQ-NFPWNYLGSQVQRTKEIVYDV 480
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 112/400 (28%), Positives = 173/400 (43%), Gaps = 68/400 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGC---------SSTVFNSAQSTT 143
Y++ IGTP Q + + MDT +D WVPC C+ C SS++F+ S++
Sbjct: 11 YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSS 70
Query: 144 FKNLGCQAAQCKQV-----PNPTCGGGAC---------------AFNLTYGSSTIAAN-L 182
C ++ C ++ P C C +F TYG + + L
Sbjct: 71 SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGIL 130
Query: 183 SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
++D + T VP ++FGC+ T P G+ G GRG LSL +Q L + FS+C
Sbjct: 131 TRDILKARTRDVPRFSFGCV---TSTYHEPIGIAGFGRGLLSLPSQLGFL-EKGFSHCFL 186
Query: 243 SFKAL---SFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV--V 293
FK + + S L LG I +++TP+L P + YY+ L +I +G +
Sbjct: 187 PFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLESITIGTNITPT 246
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDT 351
+P QF+ G ++DSGT +T L P Y+ + + + + S GFD
Sbjct: 247 QVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRATETESRTGFDL 306
Query: 352 CYSVP--------------IVAPTITLMF-SGMNVTLPQDNLLIHSTAGS----ITCLAM 392
CY VP +V P+IT F + + LPQ N +A S + CL
Sbjct: 307 CYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLF 366
Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D V + QQQN +++YD+ R+G C
Sbjct: 367 QNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 406
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 121/395 (30%), Positives = 171/395 (43%), Gaps = 45/395 (11%)
Query: 71 FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
F L +S V + SG Y + IG+P + + +DT +D W+ C C
Sbjct: 177 FSGQLMATLESGVSLGSGE-------YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFD 229
Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGSS----- 176
C + ++ S +F+N+ C +C+ V +P +C + YG S
Sbjct: 230 CFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTG 289
Query: 177 -----TIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
T NL+ T + V FGC G GLLGLGRG LS +Q Q
Sbjct: 290 DFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 349
Query: 231 NLYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL---KNPRRSSLYYVN 282
+LY +FSYCL + S S L G + P+ + +T L+ +NP + YY+
Sbjct: 350 SLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPE-LNFTSLIAGKENPV-DTFYYLQ 407
Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
+ +I VG + IP + GTIIDSGT + PAY +++ F R+V
Sbjct: 408 IKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKL 467
Query: 343 VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
V CY+V + P + F+ G P +N I I CLAM P
Sbjct: 468 VEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTP- 526
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S L++I N QQQN ILYD NSRLG A C
Sbjct: 527 --KSALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 121/395 (30%), Positives = 171/395 (43%), Gaps = 45/395 (11%)
Query: 71 FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
F L +S V + SG Y + IG+P + + +DT +D W+ C C
Sbjct: 177 FSGQLMATLESGVSLGSGE-------YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFD 229
Query: 131 C---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGSS----- 176
C + ++ S +F+N+ C +C+ V +P +C + YG S
Sbjct: 230 CFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTG 289
Query: 177 -----TIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQ 230
T NL+ T + V FGC G GLLGLGRG LS +Q Q
Sbjct: 290 DFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 349
Query: 231 NLYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL---KNPRRSSLYYVN 282
+LY +FSYCL + S S L G + P+ + +T L+ +NP + YY+
Sbjct: 350 SLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPE-LNFTSLIAGKENPV-DTFYYLQ 407
Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
+ +I VG + IP + GTIIDSGT + PAY +++ F R+V
Sbjct: 408 IKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKL 467
Query: 343 VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD 397
V CY+V + P + F+ G P +N I I CLAM P
Sbjct: 468 VEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTP- 526
Query: 398 NVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S L++I N QQQN ILYD NSRLG A C
Sbjct: 527 --KSALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 111/352 (31%), Positives = 169/352 (48%), Gaps = 32/352 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS--STVFNSAQSTTFKNLGCQAAQC 154
Y V A G PAQ +A DT+ + + C CVG + F ++S++F + C + +C
Sbjct: 88 YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCDPAFEPSRSSSFAAIPCGSPEC 147
Query: 155 KQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSV-- 210
C G +C F + +G+ T+A L +DT++L + G+TFGCI+
Sbjct: 148 AV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTFD 203
Query: 211 PPQGLLGLGRGSLSL----LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKR 263
GL+ L R S SL ++ + FSYCLPS A S G L +G P
Sbjct: 204 GAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGD 263
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
IKY P+ NP + Y+V L+ I VG + +PP + GT++++ T FT L
Sbjct: 264 IKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVFAAH-----GTLLEAATEFTFLAP 318
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNL 378
AY A+RD FRR + DTCY++ + PT+ L F+ G + L +
Sbjct: 319 AAYAALRDAFRRDMAPYPAAPPFRVLDTCYNLTGLASLAVPTVALRFAGGTELELDVRQM 378
Query: 379 LI----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
+ S S+ CLA AAAP V +VI + Q++ ++YD+ R+G
Sbjct: 379 MYFADPSSVFSSVACLAFAAAPLPAFPV-SVIGTLAQRSTEVVYDLRGGRVG 429
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 114/423 (26%), Positives = 183/423 (43%), Gaps = 40/423 (9%)
Query: 47 KPSKPLSWEESVLEMLAKDQARLQ--FLSSLAVARKS-----VVPIASGRQITQSPTYIV 99
K +K +SW++ V + + Q L ++SL ++ + + SG + + Y +
Sbjct: 114 KDTKSMSWKQEVKVITIQQQNNLANAVVASLKSSKDEFSGNIMATLESGASLG-TGEYFI 172
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQ 156
+GTP + + + +DT +D +W+ C C C + +N +S++++N+ C +C+
Sbjct: 173 DMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQL 232
Query: 157 VPNP------TCGGGACAFNLTYGS----------STIAANLSQDTISLATDIVPGYTFG 200
V +P C + Y T NL+ V FG
Sbjct: 233 VSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMFG 292
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIG 259
C G GLLGLGRG LS +Q Q++Y +FSYCL F S S L G
Sbjct: 293 CGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDK 352
Query: 260 Q---PKRIKYTPLLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
+ + +T LL + YY+ + +I VG V+DIP ++ GTIIDS
Sbjct: 353 ELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDS 412
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GM 369
G+ T AY +++ F +++ CY+V + P + F+ G
Sbjct: 413 GSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYGIHFADGA 472
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
P +N + CLA+ P+ +S L +I N+ QQN ILYDV SRLG +
Sbjct: 473 VWNFPAENYFYQYEPDEVICLAILKTPN--HSHLTIIGNLLQQNFHILYDVKRSRLGYSP 530
Query: 430 ELC 432
C
Sbjct: 531 RRC 533
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 180/371 (48%), Gaps = 38/371 (10%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQC- 154
++ KIGTP + +L+ +DT+++ WV T C CS T FN S++F + C ++ C
Sbjct: 1 MQTKIGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCL 60
Query: 155 ---KQVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISL-----ATDIVPGYTFGCIQ 203
K C G+C+F + Y + A ++++ SL A + FGC
Sbjct: 61 GRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCAS 120
Query: 204 KATGNSVP-PQGLLGLGRGSLSLLAQTQNLYQS----TFSYCLPS-FKALSFSGSLRLGP 257
K V G LGL RGS S AQ + +S FSYC P+ + L+ SG + G
Sbjct: 121 KDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGD 180
Query: 258 IGQP-KRIKYTPLLKNPRRSSL---YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
G P +Y L + P +S+ YYV L I VG ++ IP A + + GT D
Sbjct: 181 SGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFD 240
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF--DTCYSVPI------VAPTITLM 365
SGT + LV PA+TA+ + F RRV +L TS F + CY V AP +TL
Sbjct: 241 SGTTVSFLVEPAHTALVEAFGRRV-LHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLH 299
Query: 366 F-SGMNVTLPQDNLLI--HSTAGSIT-CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
F + +++ L + ++ + T +T CLA A +NVI N QQQ++ I +D+
Sbjct: 300 FKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLE 359
Query: 422 NSRLGVARELC 432
SR+G A C
Sbjct: 360 RSRIGFAPANC 370
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/435 (27%), Positives = 186/435 (42%), Gaps = 77/435 (17%)
Query: 63 AKDQARLQF-LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
++ Q R++ LSS+ V + + + G Y++ IGTP Q + + +DT +D
Sbjct: 56 SQTQERIKKPLSSVDVVMEPLREVRDG--------YLITLNIGTPPQAVQVYLDTGSDLT 107
Query: 122 WVPCTG----CVGC---------SSTVFNSAQSTTFKNLGCQAAQCKQV-----PNPTCG 163
WVPC C+ C S +VF+ S+T C ++ C ++ P C
Sbjct: 108 WVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCA 167
Query: 164 GGAC---------------AFNLTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATG 207
C +F TYG I+ L++D + T VP ++FGC+ T
Sbjct: 168 VAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCV---TS 224
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL---SFSGSLRLGP----IGQ 260
P G+ G GRG LSL +Q L + FS+C FK + + S L LG I
Sbjct: 225 TYREPIGIAGFGRGLLSLPSQLGFL-EKGFSHCFLPFKFVNNPNISSPLILGASALSINL 283
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV--VDIPPGALQFNPTTGAGTIIDSGTVF 318
+++TP+L P + YY+ L +I +G + +P QF+ G ++DSGT +
Sbjct: 284 TDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTY 343
Query: 319 TRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYSVP--------------IVAPTI 362
T L P Y+ + + + S GFD CY VP ++ P+I
Sbjct: 344 THLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSI 403
Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGS----ITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
T F + + LPQ N +A S + CL D V + QQQN +++
Sbjct: 404 TFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVV 463
Query: 418 YDVPNSRLGVARELC 432
YD+ R+G C
Sbjct: 464 YDLEKERIGFQAMDC 478
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/349 (29%), Positives = 165/349 (47%), Gaps = 22/349 (6%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTV---FNSAQSTTFKNLGCQ 150
Y R +G P Q+ DT +D +W+ PC G GC + F+ S+++ L C
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISL-ATDIVPGYTFGCIQKATGN 208
+ QC + C +C + + YG + L+ +T S ++ +P GC G
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGL 303
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
V GL+GLG G++SL +Q L ++FSYCL + S S +L QP +P
Sbjct: 304 FVGADGLIGLGGGAISLSSQ---LEATSFSYCLVDLDSES-SSTLDFN-ADQPSDSLTSP 358
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L+KN R + YV ++ + VG + + I + + + + G I+DSGT T + + Y
Sbjct: 359 LVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDV 418
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMN-VTLPQDNLLIHST 383
+RD F + + FDTCY + + PTI + G N + LP N LI
Sbjct: 419 LRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVD 478
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ CLA + L++I N+QQQ R+ YD+ NS +G + + C
Sbjct: 479 SAGTFCLAFLPS----TFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 123/399 (30%), Positives = 181/399 (45%), Gaps = 47/399 (11%)
Query: 64 KDQARLQFLSSLAVAR-KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
+D A + LS VA +S VP+ SG Y+V +GTP + M MDT +D W
Sbjct: 122 RDSAPRRALSERVVATVESGVPVGSGE-------YLVDVYLGTPPRRFRMIMDTGSDLNW 174
Query: 123 VPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG---------GACAFN 170
+ C C+ C S +F+ A S +++N+ C +C+ V P C +
Sbjct: 175 LQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYY 234
Query: 171 LTYGS-STIAANLSQDTISL-----ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
YG S +L+ + ++ T V G FGC + G GLLGLGRG LS
Sbjct: 235 YWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLS 294
Query: 225 LLAQTQNLY-QSTFSYCLPSFKALSFSGSLRLGP----IGQPKRIKYTPLLKNPRRSSLY 279
+Q + +Y FSYCL + + S + G + P+ + YT + Y
Sbjct: 295 FASQLRGVYGGHAFSYCLVEHGSAAGS-KIIFGHDDALLAHPQ-LNYTAFAPTTDADTFY 352
Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG- 338
Y+ L +I VG V+I L + GTIIDSGT + PAY A+R F R+
Sbjct: 353 YLQLKSILVGGEAVNISSDTL-----SAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSP 407
Query: 339 SNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMA 393
S + CY+V + P ++L+F+ G P +N I I CLA+
Sbjct: 408 SYPLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVL 467
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
P S +++I N QQQN +LYD+ ++RLG A C
Sbjct: 468 GTP---RSGMSIIGNYQQQNFHVLYDLEHNRLGFAPRRC 503
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 172/394 (43%), Gaps = 62/394 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST------VFNSAQSTTFKNL 147
Y A +GTP Q L + +DT + WVPCT C CSS VF+ S++ + +
Sbjct: 99 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 158
Query: 148 GCQ-------------AAQCKQVPN-------PTCGGGACA-FNLTYGSSTIAANLSQDT 186
GC+ A +C++ P P C + + YGS + A L DT
Sbjct: 159 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 218
Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK- 245
+ VPG+ GC + PP GL G GRG+ S+ AQ L FSYCL S +
Sbjct: 219 LRAPGRAVPGFVLGCSLVSVHQ--PPSGLAGFGRGAPSVPAQ---LGLPKFSYCLLSRRF 273
Query: 246 --ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPG 298
+ SGSL LG G + ++Y PL+K+ L YY+ L + VG + V +P
Sbjct: 274 DDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPAR 333
Query: 299 ALQFNPTTGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS 354
A N GTI+DSGT FT L P AV R + G C++
Sbjct: 334 AFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLGLHPCFA 393
Query: 355 VP-----IVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN---- 404
+P + P ++ F G V LP +N + + G++ + +A D
Sbjct: 394 LPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFGGGSGAGNEG 453
Query: 405 -----VIANMQQQNHRILYDVPNSRLGVARELCT 433
++ + QQQN+ + YD+ RLG R+ CT
Sbjct: 454 SGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCT 487
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 114/439 (25%), Positives = 188/439 (42%), Gaps = 74/439 (16%)
Query: 52 LSWEESVLEMLAKDQARLQFLSS--LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
L+ E + + + + RL ++ L + ++ V +A ++ Y+V+ +GTP
Sbjct: 41 LTDHELLRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHC 100
Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG--- 163
A+DT++D W C CV C VFN ST++ + C + C ++ C
Sbjct: 101 FTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARDG 160
Query: 164 ----GGACAFNLTY-GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ--GLL 216
AC + +Y G++T L+ D +++ D+ G FGC + G PPQ G++
Sbjct: 161 DSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVFRGVVFGCSSSSVGGP-PPQVSGVV 219
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-----IGQPKRIKYTPLLK 271
GLGRG+LSL++Q L F YCLP + S +G L LG + P+
Sbjct: 220 GLGRGALSLVSQ---LSVRRFMYCLPPPVSRS-AGRLVLGADAAATVRNASERVVVPMST 275
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDI---------PPGALQFNPTT---------------- 306
R S YY+NL I +G R + PG P +
Sbjct: 276 GSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPD 335
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV------GSNLTVTSLGGFDTCYSVP---- 356
G IID + T L Y + D + GS+L G D C+ +P
Sbjct: 336 AYGMIIDIASTITFLEESLYEEMVDDLEEEIRLPRGSGSDL------GLDLCFILPEGVP 389
Query: 357 ---IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ AP ++L F G+ + L ++ + + A + CL M D V +++ N QQQN
Sbjct: 390 MSRVYAPPVSLAFEGVWLRLDKEQMFVEDRASGMMCL-MVGKTDGV----SILGNYQQQN 444
Query: 414 HRILYDVPNSRLGVARELC 432
+++Y++ R+ + C
Sbjct: 445 MQVMYNLRRGRITFIKTAC 463
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 124/424 (29%), Positives = 191/424 (45%), Gaps = 53/424 (12%)
Query: 55 EESVLEMLAKDQARLQFLSSLA----VAR-------------KSVVPIASGRQITQSPTY 97
+ES L+ KD R++ + A VAR + V + SG + S Y
Sbjct: 91 KESFLDKAEKDAVRIETMHRRAARSGVARMPASSSPRRALSERMVATVESGVAVG-SGEY 149
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC 154
++ +GTP + M MDT +D W+ C C+ C VF+ A S++++N+ C +C
Sbjct: 150 LIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQRC 209
Query: 155 KQVPNPTC-------GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVPGYTFG 200
V P +C + YG S +L+ ++ ++ A+ V G FG
Sbjct: 210 GLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFG 269
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--- 257
C + G GLLGLGRG LS +Q + +Y TFSYCL + + S + G
Sbjct: 270 CGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVEHGSDAGS-KVVFGEDYL 328
Query: 258 -IGQPKRIKYTPLLKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
+ P+ +KYT + YYV L + VG +++I GTIIDSG
Sbjct: 329 VLAHPQ-LKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSG 387
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLT--VTSLGGFDTCYSVPIVA----PTITLMFS-G 368
T + V PAY +R F + S L + + CY+V V P ++L+F+ G
Sbjct: 388 TTLSYFVEPAYQVIRQAFVDLM-SRLYPLIPDFPVLNPCYNVSGVERPEVPELSLLFADG 446
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
P +N + I CLA+ P + +++I N QQQN ++YD+ N+RLG A
Sbjct: 447 AVWDFPAENYFVRLDPDGIMCLAVRGTP---RTGMSIIGNFQQQNFHVVYDLQNNRLGFA 503
Query: 429 RELC 432
C
Sbjct: 504 PRRC 507
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 171/369 (46%), Gaps = 43/369 (11%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK--- 155
V +G+P QT+ M +DT ++ +W+ C S VF+ +S+++ + C + C+
Sbjct: 58 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS-VFDPLRSSSYSPIPCTSPTCRTRT 116
Query: 156 ---QVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA----TG 207
+P C ++Y +S+I NL+ DT + +P FGC+ +
Sbjct: 117 RDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSD 176
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIK 265
GL+G+ RGSLS + Q + FSYC+ + SG L G K +K
Sbjct: 177 EDSKTTGLIGMNRGSLSFVTQ---MGLQKFSYCISGQDS---SGILLFGESSFSWLKALK 230
Query: 266 YTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
YTPL++ P + Y V L I+V ++ +P + T T++DSGT FT
Sbjct: 231 YTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTF 290
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSG 368
L+ P YTA+++ F R+ ++L V G D CY VP+ PT+TLMF G
Sbjct: 291 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRG 350
Query: 369 MNVTLPQDNLL-----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
+++ + L+ + + S+ C + + + +I + QQN + +D+ S
Sbjct: 351 AEMSVSAERLMYRVPGVIRGSDSVYCFTFGNS-ELLGVESYIIGHHHQQNVWMEFDLAKS 409
Query: 424 RLGVARELC 432
R+G A C
Sbjct: 410 RVGFAEVRC 418
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 171/369 (46%), Gaps = 43/369 (11%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK--- 155
V +G+P QT+ M +DT ++ +W+ C S VF+ +S+++ + C + C+
Sbjct: 65 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS-VFDPLRSSSYSPIPCTSPTCRTRT 123
Query: 156 ---QVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA----TG 207
+P C ++Y +S+I NL+ DT + +P FGC+ +
Sbjct: 124 RDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSD 183
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIK 265
GL+G+ RGSLS + Q + FSYC+ + SG L G K +K
Sbjct: 184 EDSKTTGLIGMNRGSLSFVTQ---MGLQKFSYCISGQDS---SGILLFGESSFSWLKALK 237
Query: 266 YTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
YTPL++ P + Y V L I+V ++ +P + T T++DSGT FT
Sbjct: 238 YTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTF 297
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSG 368
L+ P YTA+++ F R+ ++L V G D CY VP+ PT+TLMF G
Sbjct: 298 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRG 357
Query: 369 MNVTLPQDNLL-----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
+++ + L+ + + S+ C + + + +I + QQN + +D+ S
Sbjct: 358 AEMSVSAERLMYRVPGVIRGSDSVYCFTFGNS-ELLGVESYIIGHHHQQNVWMEFDLAKS 416
Query: 424 RLGVARELC 432
R+G A C
Sbjct: 417 RVGFAEVRC 425
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 110/358 (30%), Positives = 170/358 (47%), Gaps = 32/358 (8%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS--STVFNSAQSTTFKNLGCQAAQC 154
Y V A G PAQ +A DT+ + + C CVG + F ++S++F + C + +C
Sbjct: 88 YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCDPAFEPSRSSSFAAIPCGSPEC 147
Query: 155 KQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSV-- 210
C G +C F + +G+ T+A L +DT++L + G+TFGCI+
Sbjct: 148 AV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTFD 203
Query: 211 PPQGLLGLGRGSLSL----LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKR 263
GL+ L R S SL ++ + FSYCLPS A S G L +G P
Sbjct: 204 GAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGD 263
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
IKY P+ NP + Y+V+L+ I VG + +PP + GT++++ T FT L
Sbjct: 264 IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAH-----GTLLEAATEFTFLAP 318
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNL 378
AY A+RD FR+ + DTCY++ + P + L F+ G + L +
Sbjct: 319 AAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQM 378
Query: 379 LI----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ S S+ CLA AAAP V +VI + Q++ ++YD+ R+G C
Sbjct: 379 MYFADPSSVFSSVACLAFAAAPLPAFPV-SVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 169/397 (42%), Gaps = 65/397 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSSTVFNSAQSTTFKNLG---- 148
Y++ IGTP Q + + MDT +D WVPC C+ C N +T +
Sbjct: 82 YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSY 141
Query: 149 --------CQAAQCKQVPNPTCGGGAC---------------AFNLTYGSSTIAAN-LSQ 184
C P TC C +F TYG+ + L++
Sbjct: 142 RASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTR 201
Query: 185 DTISL------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
DT+ + +P + FGC+ A P G+ G GRG+LS+++Q L Q FS
Sbjct: 202 DTLRVNGSSPGVAKEIPKFCFGCVGSAYRE---PIGIAGFGRGTLSMVSQLGFL-QKGFS 257
Query: 239 YCLPSFKAL---SFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVNLLAIRVGR-RV 292
+C +FK + S L +G I + +++TP+L +P + YYV L AI VG
Sbjct: 258 HCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGNVSA 317
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFD 350
++P +F+ G IDSGT +T L P Y+ V + + + + + GFD
Sbjct: 318 TEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQTGFD 377
Query: 351 TCYSVP----------IVAPTITLMF-SGMNVTLPQDNLLIHSTA----GSITCLAMAAA 395
CY VP + P+IT F + +++ LPQ N +A + CL +
Sbjct: 378 LCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMFQST 437
Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D + V + QQQN ++YD+ R+G C
Sbjct: 438 DDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 114/395 (28%), Positives = 180/395 (45%), Gaps = 47/395 (11%)
Query: 52 LSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLL 111
+ + ++ L A ++RLQ LS S R + Y++ IGTP +
Sbjct: 29 IGFTKTELMRRAAHRSRLQALSGYDAN--------SPRLHSVQVEYLMELAIGTPPVPFV 80
Query: 112 MAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQC------KQVPNPTC 162
DT +D W C C C + V++ + S+TF + C +A C + NP+
Sbjct: 81 ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPTWRSRNCSNPS- 139
Query: 163 GGGACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYT-------FGCIQKATGNSVPPQG 214
C + +Y + L +T+++ + VPG T FGC G+S+ G
Sbjct: 140 --SPCRYIYSYSDGAYSVGILGTETLTIGSS-VPGQTVSVGSVAFGCGTDNGGDSLNSTG 196
Query: 215 LLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ----PKRIKYTPLL 270
+GLGRG+LSLLAQ L FSYCL F + LG + + P ++ TPLL
Sbjct: 197 TVGLGRGTLSLLAQ---LGVGKFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLL 253
Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
++P S Y+VNL I +G + IP G G ++DSGT FT L + V
Sbjct: 254 QSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVV 313
Query: 331 DVFRRRVGS-NLTVTSLGGFDTCYSVPI---VAPTITLMFS-GMNVTLPQDNLLIHSTAG 385
D + +G + +SL C+ P P + L F+ G ++ L +DN + ++
Sbjct: 314 DRVAQLLGQPPVNASSLD--SPCFPSPDGEPFMPDLVLHFAGGADMRLHRDNYMSYNEDD 371
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
S CL + +P S + + N QQQN ++L+D+
Sbjct: 372 SSFCLNIVGSP----STWSRLGNFQQQNIQMLFDM 402
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 108/375 (28%), Positives = 169/375 (45%), Gaps = 44/375 (11%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
S Y + ++G+P + +DT +D W+ PC+ C S +++ + S+TF C
Sbjct: 1 SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCS 60
Query: 151 AAQCKQVPNPTCGGGA--CAFNLTYG-SSTIAANLSQDTISL-----ATDIVPGYTFGCI 202
+ C+ +P C A C + YG SS+ + + +T++L ++ P + FGC
Sbjct: 61 TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG 120
Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-------LSFSGSLRL 255
+ +G+ G++GLG+G +SL Q + + FSYCL F L F S
Sbjct: 121 RLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSAST 180
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF------------- 302
G TP++ N RS+ Y+V L I VG + + + A+ F
Sbjct: 181 G-----SGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRA 235
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IV 358
GTI DSGT T L Y+ V+ F V S GFD CY V
Sbjct: 236 LEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFK 295
Query: 359 APTITLMFSGMNVTLPQDN-LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
P +TL F G + PQ N +I TA ++ CLAM + + L +I N+ QQN+ ++
Sbjct: 296 FPALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGS---GSLGLGIIGNLMQQNYHVV 352
Query: 418 YDVPNSRLGVARELC 432
YD S + ++ C
Sbjct: 353 YDRGTSTISMSPAQC 367
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 164/349 (46%), Gaps = 22/349 (6%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTV---FNSAQSTTFKNLGCQ 150
Y R +G P Q+ DT +D +W+ PC G GC + F+ S+++ L C
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISL-ATDIVPGYTFGCIQKATGN 208
+ QC + C +C + + YG + L+ +T S ++ +P GC G
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGL 303
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP 268
V GL+GLG G++SL +Q L ++FSYCL + S S +L QP +P
Sbjct: 304 FVGAAGLIGLGGGAISLSSQ---LEATSFSYCLVDLDSES-SSTLDFN-ADQPSDSLTSP 358
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L+KN R + YV ++ + VG + + I + + + + G I+DSGT T + + Y
Sbjct: 359 LVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDV 418
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMN-VTLPQDNLLIHST 383
+RD F + + FDTCY + + PTI + G N + LP N L
Sbjct: 419 LRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVD 478
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ CLA + L++I N+QQQ R+ YD+ NS +G + + C
Sbjct: 479 SAGTFCLAFLPS----TFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 165/382 (43%), Gaps = 41/382 (10%)
Query: 80 KSVVPIASGRQITQSPTYIVRAKIGTPA-QTLLMAMDTSNDAAWVPCTGCVGCSST---V 135
+ P+ASG + Y++ IGTP Q + + +DT +D W C C C +
Sbjct: 75 RVTAPVASGSHVVGYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPR 134
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD-- 192
F+++ S T + C C+ + C G C + + YG +++ L++D+ +
Sbjct: 135 FDTSASDTVHGVLCTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGG 194
Query: 193 ---IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCL------- 241
VP FGC Q TGN + G+ G GRG LSL Q L S+FSYC
Sbjct: 195 GKVTVPDLVFGCGQYNTGNFHSNETGIAGFGRGPLSLPRQ---LGVSSFSYCFTTIFESK 251
Query: 242 --PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
P F + + LR G I TP L P YY++L I VG+ + +P A
Sbjct: 252 STPVFLGGAPADGLRAHATGP---ILSTPFL--PNHPEYYYLSLKGITVGKTRLAVPESA 306
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPI 357
GTIIDSGT T + ++ + F +V T + G T C+S
Sbjct: 307 FVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTES 366
Query: 358 V-------APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
V P +TL G + LP++N + C+ + A D+ +I N Q
Sbjct: 367 VPDASKVPVPKMTLHLEGADWELPRENYMAEYPDSDQLCVVVLAGDDD----RTMIGNFQ 422
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQN I++D+ ++L + C
Sbjct: 423 QQNMHIVHDLAGNKLVIEPAQC 444
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 167/382 (43%), Gaps = 53/382 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST----VFNSAQSTTFKNLGCQAA 152
Y+V +GTP + + + +DT +D W C C+ C V + A S+T + C A
Sbjct: 94 YLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRCDAP 153
Query: 153 QCKQVPNPTCGGG-------ACAFNLTYGSSTI-AANLSQDTISLA-TDIVPG------- 196
C+ +P +CG G +C + YG +I L+ D + D G
Sbjct: 154 VCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVSERR 213
Query: 197 YTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLR 254
TFGC G G+ G GRG SL +Q L ++FSYC S F++ S +L
Sbjct: 214 LTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQ---LGVTSFSYCFTSMFESTSSLVTLG 270
Query: 255 LGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
+ P + +++ TPLL++P + SLY+++L AI VG + IP + A II
Sbjct: 271 VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLRE---ASAII 327
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA------------- 359
DSG T L Y AV+ F +VG ++ D C+++P A
Sbjct: 328 DSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRWRGR 387
Query: 360 --------PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
P + G + LP++N + + CL + AA + + VI N Q
Sbjct: 388 GRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTV-VIGNYQ 446
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQN ++YD+ N L A C
Sbjct: 447 QQNTHVVYDLENDVLSFAPARC 468
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 125/434 (28%), Positives = 189/434 (43%), Gaps = 89/434 (20%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS-----LAVARKSVVPI 85
S L + + PCS S+P S +E + +D++R+ F++S K P
Sbjct: 63 SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKFNQYAPENLKDHTP- 117
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFK 145
+ + + ++V GTP Q + +DT + W C C TV N+
Sbjct: 118 -NNKLFDEDGNFLVDVAFGTPPQNFTLILDTGSSITWTQCKAC-----TVENN------- 164
Query: 146 NLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCIQ 203
+N+TYG ST N DT++L +D+ + FG +
Sbjct: 165 -----------------------YNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGRGR 201
Query: 204 KATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQ 260
G+ G+LGLG+G LS ++QT + + FSYCLP ++ GSL G Q
Sbjct: 202 NNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSI---GSLLFGEKATSQ 258
Query: 261 PKRIKYTPLLKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+K+T L+ P + S Y+VNL I VG ++IP GTIIDS TV
Sbjct: 259 SSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTV 313
Query: 318 FTRLVAPAYTAVRDVF------------RRRVGSNLTVTSLGGFDTCYSV----PIVAPT 361
TRL AY+A++ F RR+ G L DTCY++ ++ P
Sbjct: 314 ITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL--------DTCYNLSGRKDVLLPE 365
Query: 362 ITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAA-APDNVNSVLNVIANMQQQNHRILYD 419
I L F G +V L N++ S + CLA A + +N L +I N QQ + +LYD
Sbjct: 366 IVLHFGGGADVRLNGTNIVWGSDESRL-CLAFAGNSKSTMNPELTIIGNRQQLSLTVLYD 424
Query: 420 VPNSRLGVARELCT 433
+ R+G C+
Sbjct: 425 IQGGRIGFRSNGCS 438
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 176/366 (48%), Gaps = 41/366 (11%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPC--TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
I+ IGTP Q M +DT + +W+ C T F+ + S++F L C CK
Sbjct: 73 IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132
Query: 156 -QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATG 207
++P+ PT C C ++ Y T A NL ++ I+ + T+I P GC +++
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESSD 192
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP---SFKALSFSGSLRLGPIGQPKRI 264
+ +G+LG+ RG LS ++Q + S FSYC+P + + +GS LG
Sbjct: 193 D----RGILGMNRGRLSFVSQAK---ISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245
Query: 265 KYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAG--TIIDSG 315
KY LL P + Y V ++ IR G + ++I F P G T++DSG
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNI--SGSVFRPDAGGSGQTMVDSG 303
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGFDTCYS-----VPIVAPTITLMFS- 367
+ FT LV AY VR RVG L + G D C+ +P + + +F+
Sbjct: 304 SEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTR 363
Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
G+ + +P++ +L++ G I C+ + + + + N+I N+ QQN + +DV N R+G
Sbjct: 364 GVEILVPKERVLVN-VGGGIHCVGIGRS-SMLGAASNIIGNVHQQNLWVEFDVTNRRVGF 421
Query: 428 ARELCT 433
A+ C+
Sbjct: 422 AKADCS 427
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 181/375 (48%), Gaps = 41/375 (10%)
Query: 90 QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTF 144
I S I+ IGTP+Q+ + +DT + +W+ C + +T F+ + S++F
Sbjct: 73 NIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSF 132
Query: 145 KNLGCQAAQCK-QVPNPT----CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPG 196
+L C CK ++P+ T C C ++ Y T A NL ++ + + + P
Sbjct: 133 SDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPP 192
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSL 253
GC +++T +G+LG+ G LS ++Q + S FSYC+P+ L+ +GS
Sbjct: 193 LILGCAKESTDE----KGILGMNLGRLSFISQAK---ISKFSYCIPTRSNRPGLASTGSF 245
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTT 306
LG + KY LL P+ + Y V L IR+G++ ++IP + +
Sbjct: 246 YLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGG 305
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG------FDTCYSVPIVAP 360
T++DSG+ FT LV AY V++ R VGS L + G FD +S+ I
Sbjct: 306 SGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRL 365
Query: 361 TITLMFS---GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
L+F G+ + + + +LL++ G I C+ + + + + N+I N+ QQN +
Sbjct: 366 IGDLVFEFGRGVEILVEKQSLLVN-VGGGIHCVGIGRS-SMLGAASNIIGNVHQQNLWVE 423
Query: 418 YDVPNSRLGVARELC 432
+DV N R+G ++ C
Sbjct: 424 FDVTNRRVGFSKAEC 438
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 140/432 (32%), Positives = 198/432 (45%), Gaps = 72/432 (16%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
S+ L++ H PC+P + S + SV + L DQ R +++ S A A
Sbjct: 65 SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST- 134
+ VP + G I + Y+V A +GTP M +DT +D +WV PC+ C S
Sbjct: 123 AAATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQK 181
Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----------GACAFNLTYG-SSTIAA 180
+F+ AQS+++ + C P C G C + ++YG S
Sbjct: 182 DPLFDPAQSSSYAAVPCG--------GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTG 233
Query: 181 NLSQDTISL-ATDIVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
S DT++L A+ V G+ FGC +G N V GLLGLGR SL+ QT Y F
Sbjct: 234 VYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVF 291
Query: 238 SYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
SYCLP+ S +G L L GP G T LL +P + Y V L I VG + +
Sbjct: 292 SYCLPTKP--STAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 349
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTC 352
+P A T++D+GTV TRL AY A+R FR + S T S G DTC
Sbjct: 350 VPASAFAGG------TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTC 403
Query: 353 YSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIA 407
Y+ + P + L F SG VTL D +L S CLA AP + + ++
Sbjct: 404 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILG 455
Query: 408 NMQQQNHRILYD 419
N+QQ++ + D
Sbjct: 456 NVQQRSFEVRID 467
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 176/366 (48%), Gaps = 41/366 (11%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPC--TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
I+ IGTP Q M +DT + +W+ C T F+ + S++F L C CK
Sbjct: 73 IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132
Query: 156 -QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATG 207
++P+ PT C C ++ Y T A NL ++ I+ + T+I P GC +++
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESSD 192
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP---SFKALSFSGSLRLGPIGQPKRI 264
+ +G+LG+ RG LS ++Q + S FSYC+P + + +GS LG
Sbjct: 193 D----RGILGMNRGRLSFVSQAK---ISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245
Query: 265 KYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAG--TIIDSG 315
KY LL P + Y V ++ IR G + ++I F P G T++DSG
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNI--SGSVFRPDAGGSGQTMVDSG 303
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGFDTCYS-----VPIVAPTITLMFS- 367
+ FT LV AY VR RVG L + G D C+ +P + + +F+
Sbjct: 304 SEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTR 363
Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
G+ + +P++ +L++ G I C+ + + + + N+I N+ QQN + +DV N R+G
Sbjct: 364 GVEIFVPKERVLVN-VGGGIHCVGIGRS-SMLGAASNIIGNVHQQNLWVEFDVTNRRVGF 421
Query: 428 ARELCT 433
A+ C+
Sbjct: 422 AKADCS 427
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 136/454 (29%), Positives = 196/454 (43%), Gaps = 70/454 (15%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEE---SVLEMLAKDQARLQFLSS-LAVARKSVVPI 85
+S+ + H++ PCSP S + + S+ +M+ DQ R ++ L A P+
Sbjct: 61 NSTWAPLHHLYGPCSPAPSSANSTAADVAASMADMVDDDQRRADYIQKRLTGATDDKQPM 120
Query: 86 A-SGR--QITQSPTYIVRAKI--------------------GTPAQTLLMAMDTSNDAAW 122
A S R Q ++ Y + GT A T + +D+ +D +W
Sbjct: 121 AFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTTATTNSAPDGTSAVTQTVIIDSGSDVSW 180
Query: 123 VPCTGC--VGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPT---CGGGA-CAFNLTY 173
V C C C +F+ A STT+ + C +A C Q+ P C A C F + Y
Sbjct: 181 VQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPYRRGCSANAQCQFGINY 239
Query: 174 GS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQT 229
G ST S D ++L D++ G+ FGC G++ G L LG GS SL+ QT
Sbjct: 240 GDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQT 299
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY------TPLLKNPRRSSLYYVNL 283
Y FSYCLP S G L LG P+R + TPLL + + Y V L
Sbjct: 300 ATRYGRVFSYCLP--PTASSLGFLVLG--VPPERAQLIPSFVSTPLLSSSMAPTFYRVLL 355
Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
AI V R + +PP A ++IDS T+ +RL AY A+R FR +
Sbjct: 356 RAIIVAGRPLAVPPAVFS------ASSVIDSSTIISRLPPTAYQALRAAFRSAMTMYRAA 409
Query: 344 TSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
+ DTCY I P+I L+F G V L +L+ S CLA AP
Sbjct: 410 PPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLGS------CLAF--APTA 461
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + I N+QQ+ ++YDVP + C
Sbjct: 462 SDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 169/352 (48%), Gaps = 32/352 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS--STVFNSAQSTTFKNLGCQAAQC 154
Y V A G PAQ +A DT+ + + C CVG + F ++S++F + C + +C
Sbjct: 176 YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCDPAFEPSRSSSFAAIPCGSPEC 235
Query: 155 KQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNSV-- 210
C G +C F + +G+ T+A L +DT++L + G+TFGCI+
Sbjct: 236 AV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTFD 291
Query: 211 PPQGLLGLGRGSLSL----LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKR 263
GL+ L R S SL ++ + FSYCLPS A S G L +G P
Sbjct: 292 GAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGD 351
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
IKY P+ NP + Y+V+L+ I VG + +PP + GT++++ T FT L
Sbjct: 352 IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAH-----GTLLEAATEFTFLAP 406
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNL 378
AY A+RD FR+ + DTCY++ + P + L F+ G + L +
Sbjct: 407 AAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQM 466
Query: 379 LI----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
+ S S+ CLA AAAP V +VI + Q++ ++YD+ R+G
Sbjct: 467 MYFADPSSVFSSVACLAFAAAPLPAFPV-SVIGTLAQRSTEVVYDLRGGRVG 517
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 109/362 (30%), Positives = 163/362 (45%), Gaps = 42/362 (11%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPC-------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
IGTP Q + +DT +D W C S V++ +S+TF L C C+
Sbjct: 97 IGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRLCQ 156
Query: 156 --QVPNPTC-GGGACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSV 210
Q C C + YGS+ L+ +T + + FGC + G+ +
Sbjct: 157 EGQFSFKNCTSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLI 216
Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-----KALSFSGSLRLGPIGQPKRIK 265
G+LGL SLSL+ Q L FSYCL F L F L + I+
Sbjct: 217 GATGILGLSPESLSLITQ---LKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQ 273
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
T ++ NP ++ YYV L+ I +G + + +P +L P G GTI+DSG+ LV A
Sbjct: 274 TTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAA 333
Query: 326 YTAVR----DVFRRRVGSNLTVTSLGGFDTCYSVP----------IVAPTITLMFS-GMN 370
+ AV+ DV R V +N TV ++ C+ +P + P + L F G
Sbjct: 334 FEAVKEAVMDVVRLPV-ANRTVED---YELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAA 389
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+ LP+DN AG + CLA+ D S +++I N+QQQN +L+DV + + A
Sbjct: 390 MVLPRDNYFQEPRAG-LMCLAVGKTTD--GSGVSIIGNVQQQNMHVLFDVQHHKFSFAPT 446
Query: 431 LC 432
C
Sbjct: 447 QC 448
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 115/419 (27%), Positives = 170/419 (40%), Gaps = 61/419 (14%)
Query: 69 LQFLSSLAVARKSVVPIASGRQITQSP-------TYIVRAKIGTPAQTLLMAMDTSNDAA 121
L FL+S + R + + +SP Y GTP QTL + DT +
Sbjct: 46 LTFLASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLV 105
Query: 122 WVPCTGCVGCSSTVFNSAQ-----------STTFKNLGCQAAQCKQVPNP---------- 160
W PCT CS F S++ K +GCQ +C + P
Sbjct: 106 WFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCN 165
Query: 161 ----TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLL 216
C A+ + YGS + A L +T+ +P + GC + P G+
Sbjct: 166 PKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKXIPNFVVGCSFLSIHQ---PSGIA 222
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIG-QPKRIKYTPLLKNP 273
G GRGS SL +Q + F+YCL S F SG L L G + + YTP +NP
Sbjct: 223 GFGRGSESLPSQ---MGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNP 279
Query: 274 RRSS-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
S+ YY+N+ I VG + V +P L P G+IIDSG+ FT + P
Sbjct: 280 SVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEV 339
Query: 329 VRDVFRRRVGSNLT----VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLL 379
V F +++ +N T V +L G C+ + + P + F G LP +N
Sbjct: 340 VAREFEKQL-ANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYF 398
Query: 380 IHSTAGSITCLA-----MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
++ + CL M ++ QQQN + YD+ N RLG ++ C+
Sbjct: 399 ALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 179/368 (48%), Gaps = 41/368 (11%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFKNLGCQAA 152
I+ IGTP+Q+ + +DT + +W+ C + +T F+ + S++F +L C
Sbjct: 82 ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141
Query: 153 QCK-QVPNPT----CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQK 204
CK ++P+ T C C ++ Y T A NL ++ + + + P GC ++
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKE 201
Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQP 261
+T +G+LG+ G LS ++Q + S FSYC+P+ L+ +GS LG
Sbjct: 202 STD----VKGILGMNLGRLSFISQAK---ISKFSYCIPTRSNRPGLASTGSFYLGENPNS 254
Query: 262 KRIKYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
+ KY LL P+ + Y V LL IR+G++ ++IP + + T++DS
Sbjct: 255 RGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDS 314
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG------FDTCYSVPIVAPTITLMFS- 367
G+ FT LV AY V++ R VGS L + G FD + + I L+F
Sbjct: 315 GSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEF 374
Query: 368 --GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G+ + + + LL++ G I C+ + + + + N+I N+ QQN + +DV N R+
Sbjct: 375 GRGVEILVEKQRLLVN-VGGGIHCVGIGRS-SMLGAASNIIGNVHQQNLWVEFDVANRRV 432
Query: 426 GVARELCT 433
G ++ C+
Sbjct: 433 GFSKAECS 440
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 169/369 (45%), Gaps = 43/369 (11%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC---- 154
V +GTP Q + M +DT ++ +W+ C T F+ +S+++ + C + C
Sbjct: 87 VSLTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTTFDPNRSSSYSPVPCSSLTCTDRT 145
Query: 155 KQVPNP-TCGGGA-CAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA----TG 207
+ P P +C C L+Y +S+ NL+ DT + +PG FGC+ + T
Sbjct: 146 RDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSDMPGTIFGCMDSSFSTNTE 205
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIK 265
GL+G+ RGSLS ++Q FSYC+ FSG L LG +
Sbjct: 206 EDSKNTGLMGMNRGSLSFVSQMDF---PKFSYCI---SDSDFSGVLLLGDANFSWLMPLN 259
Query: 266 YTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
YTPL++ P + Y V L I+V +++ +P + T T++DSGT FT
Sbjct: 260 YTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTF 319
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSG 368
L+ P Y+A+R+ F + L V GG D CY VP+ PT++LMF G
Sbjct: 320 LLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFRG 379
Query: 369 MNVTLPQDNLLIH-----STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
+ + D LL + S+ C + D + VI + QQN + +D+ S
Sbjct: 380 AEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNS-DLLAVEAYVIGHHHQQNVWMEFDLEKS 438
Query: 424 RLGVARELC 432
R+G A+ C
Sbjct: 439 RIGFAQVQC 447
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 171/371 (46%), Gaps = 31/371 (8%)
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFN 137
+++PI G + Y V GTP Q M +DT + V C C S++ F+
Sbjct: 134 TIIPI-DGSPDAGALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFD 192
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDI-VPG 196
++QSTTF ++ C + C N G C FNL + + SQD +++A + V
Sbjct: 193 TSQSTTFTHVPCDSPDCPSTAN-CSAGSVCPFNLFF----VEGTFSQDVLTVAPSVAVQD 247
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
+TF C+ + +P G L L R SL ++ + FSYC+P + G L LG
Sbjct: 248 FTFVCLDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYP--DSPGFLSLG 305
Query: 257 PIGQPKR---IKYTPLL--KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ + PLL +P +++Y+++++ + +G + IP G N A TI
Sbjct: 306 DDATVRGDNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPIPSGTFGNN----ASTI 361
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGGFDTCYSV----PIVAPTITLMF 366
+++GT FT L AYT +RD FR+ + N +V FDTCY+ + P + F
Sbjct: 362 VEAGTTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNFTGLQELTVPLVEFKF 421
Query: 367 -SGMNVTLPQDNLLIHSTAG----SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
+G ++ + D +L + ++TCLA + + + V VI ++YDV
Sbjct: 422 GNGDSLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVA 481
Query: 422 NSRLGVARELC 432
+G E C
Sbjct: 482 GGTVGFIPESC 492
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 115/419 (27%), Positives = 170/419 (40%), Gaps = 61/419 (14%)
Query: 69 LQFLSSLAVARKSVVPIASGRQITQSP-------TYIVRAKIGTPAQTLLMAMDTSNDAA 121
L FL+S + R + + +SP Y GTP QTL + DT +
Sbjct: 46 LTFLASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLV 105
Query: 122 WVPCTGCVGCSSTVFNSAQ-----------STTFKNLGCQAAQCKQVPNP---------- 160
W PCT CS F S++ K +GCQ +C + P
Sbjct: 106 WFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCN 165
Query: 161 ----TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLL 216
C A+ + YGS + A L +T+ +P + GC + P G+
Sbjct: 166 PKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQ---PSGIA 222
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIG-QPKRIKYTPLLKNP 273
G GRGS SL +Q + F+YCL S F SG L L G + + YTP +NP
Sbjct: 223 GFGRGSESLPSQ---MGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNP 279
Query: 274 RRSS-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
S+ YY+N+ I VG + V +P L P G+IIDSG+ FT + P
Sbjct: 280 SVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEV 339
Query: 329 VRDVFRRRVGSNLT----VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLL 379
V F +++ +N T V +L G C+ + + P + F G LP +N
Sbjct: 340 VAREFEKQL-ANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYF 398
Query: 380 IHSTAGSITCLA-----MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
++ + CL M ++ QQQN + YD+ N RLG ++ C+
Sbjct: 399 ALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 166/345 (48%), Gaps = 42/345 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y + IGTP Q L DT +D W C C C S + +S++F L C +
Sbjct: 82 YDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSL 141
Query: 154 CKQVPNPTC--GGGACAFNLTYGSSTIAANLSQ-----DTISLATDIVPGYTFGCIQKAT 206
C +P+ C GG C + +YG ++ + +Q +T +L +D VPG FGC +
Sbjct: 142 CSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDAVPGIGFGCTTMSE 201
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS---FSGSLRLGPIGQPKR 263
G GL+GLGRG LSL++Q L FSYCL S A + GS L G
Sbjct: 202 GGYGSGSGLVGLGRGPLSLVSQ---LNVGAFSYCLTSDAAKTSPLLFGSGALTGAG---- 254
Query: 264 IKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
++ TPLL R S+ YY VNL +I + GA T +G I DSGT L
Sbjct: 255 VQSTPLL---RTSTYYYTVNLESISI---------GAATTAGTGSSGIIFDSGTTVAFLA 302
Query: 323 APAYTAVRDVFRRRVGSNLTVTS-LGGFDTCYSVP-IVAPTITLMFSGMNVTLPQDNLLI 380
PAYT ++ + +NLT+ S G++ C+ V P++ L F G ++ LP +N
Sbjct: 303 EPAYTLAKEAVLSQT-TNLTMASGRDGYEVCFQTSGAVFPSMVLHFDGGDMDLPTENYF- 360
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+ S++C + +P L+++ N+ Q N+ I YDV S L
Sbjct: 361 GAVDDSVSCWIVQKSPS-----LSIVGNIMQMNYHIRYDVEKSML 400
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 158/323 (48%), Gaps = 41/323 (12%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQIT--QSPTYIVRAKIGTPAQTLLMAMDTSND 119
+A+ +AR+ L S AV V PI + R + S Y+V IGTP MDT +D
Sbjct: 52 IARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSD 111
Query: 120 AAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-S 175
W C C+ C+ + F+ +S T++ L C++++C + +P+C C + YG +
Sbjct: 112 LIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQYYYGDT 171
Query: 176 STIAANLSQDTISL---------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
++ A L+ +T + AT+I FGC G+ G++G GRG LSL+
Sbjct: 172 ASTAGVLANETFTFGAANSTKVRATNIA----FGCGSLNAGDLANSSGMVGFGRGPLSLV 227
Query: 227 AQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPI----GQPKRIKYTPLLKNPRRSS 277
+Q L S FSYCL S+ + L F L G P ++ TP + NP +
Sbjct: 228 SQ---LGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSP--VQSTPFVINPALPN 282
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
+Y+++L AI +G +++ I P N G IIDSGT T L AY AV RR +
Sbjct: 283 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV----RRGL 338
Query: 338 GSNLTVTSLG----GFDTCYSVP 356
S + +T++ G DTC+ P
Sbjct: 339 VSAIPLTAMNDTDIGLDTCFQWP 361
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 114/403 (28%), Positives = 185/403 (45%), Gaps = 51/403 (12%)
Query: 66 QARL-QFLSSLAVARKSVVPIAS-GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
ARL + L +L+ A V P++ G +T IGTP Q + +DT +D W
Sbjct: 59 NARLARVLGNLSAADVPVAPLSDQGHSLT--------VGIGTPPQPRTLIVDTGSDLIWT 110
Query: 124 PCTGCVGCSST----------VFNSAQSTTFKNLGCQAAQCK--QVPNPTCG-GGACAFN 170
C+ + T ++ +S++F L C C+ Q C C ++
Sbjct: 111 QCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCSDRLCQEGQFSYKNCARNNRCMYD 170
Query: 171 LTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
YGS+ L+ +T + + + FGC + G+ V GL+GL G +SL++Q
Sbjct: 171 ELYGSAEAGGVLASETFTFGVNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQ 230
Query: 229 TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR------IKYTPLLKNP-RRSSLYYV 281
L FSYCL F S L G + +R ++ T +L+NP ++ YYV
Sbjct: 231 ---LSVPRFSYCLTPFAERKTS-PLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYV 286
Query: 282 NLLAIRVGRRVVDIPPGAL-QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-- 338
L+ + +G + +D+P +L P GTI+DSG+ + L A+ AV+ V
Sbjct: 287 PLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLP 346
Query: 339 -SNLTVTSLGGFDTCYSVP-------IVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITC 389
+N T ++ C+++P + P + L F G +TLP+DN AG + C
Sbjct: 347 VANGTDEDYDDYELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAG-LMC 405
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LA+ +PD +++I N+QQQN +L+DV N + A C
Sbjct: 406 LAVGTSPDGFG--VSIIGNVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 107/352 (30%), Positives = 167/352 (47%), Gaps = 30/352 (8%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTP + DT +D W C C C + ++++A S++F + C +A
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASAT 152
Query: 154 CKQV---PNPTCGGGACAFNLTYGSSTIAAN-LSQDTISL--ATDI-VPGYTFGCIQKAT 206
C + N T C + YG +A L +T++ A + V G FGC
Sbjct: 153 CLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGGIAFGCGVDNG 212
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ------ 260
G S G +GLGRGSLSL+AQ L FSYCL F S + G + +
Sbjct: 213 GLSYNSTGTVGLGRGSLSLVAQ---LGVGKFSYCLTDFFNTSLGSPVLFGALAELAAPST 269
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
++ TPL+++P + YYV+L I +G + IP G G I+DSGT FT
Sbjct: 270 GAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFTF 329
Query: 321 LVAPAYTAVRD----VFRRRV--GSNLTVTSLGGFDTCYSVPIVAPTITLMFS-GMNVTL 373
LV A+ V D V R+ V S+L +P + P + L F+ G ++ L
Sbjct: 330 LVESAFRVVVDHVAGVLRQPVVNASSLDSPCFPAATGEQQLPAM-PDMVLHFAGGADMRL 388
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+DN + + S CL +A +P ++ ++++ N QQQN ++L+D+ +L
Sbjct: 389 HRDNYMSFNQEESSFCLNIAGSP---SADVSILGNFQQQNIQMLFDITVGQL 437
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 120/398 (30%), Positives = 179/398 (44%), Gaps = 44/398 (11%)
Query: 64 KDQARLQFLSSLAVARKSVVPIASGRQITQ--SPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
+ ++RL L++ AV+ P S + + S Y + IGTPA L DT +D
Sbjct: 57 RSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGSDLI 116
Query: 122 WVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA--------CAFN 170
W C C CS S + S++ + C C ++P P C A C+++
Sbjct: 117 WTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYH 176
Query: 171 LTYGSSTIAANLSQ-----DTISLATDIV--PGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
YG++ + ++ +T + D PG FGC ++ G GL+GLGRG L
Sbjct: 177 YAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKL 236
Query: 224 SLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL- 278
SL+ Q L F Y L S +SF GSL G TPLL NP L
Sbjct: 237 SLVTQ---LNVEAFGYRLSSDLSAPSPISF-GSLADVTGGNGDSFMSTPLLTNPVVQDLP 292
Query: 279 -YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII-DSGTVFTRLVAPAYTAVRDVFRRR 336
YYV L I VG ++V IP G F+ +TGAG +I DSGT T L PAYT VRD +
Sbjct: 293 FYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQ 352
Query: 337 VGSNLTVTSLGGFD-TCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIH---STAGSIT 388
+G + D C+ S P++ L F G ++ L +N L +
Sbjct: 353 MGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETAR 412
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP-NSRL 425
C ++ + + L +I N+ Q + +++D+ N+R+
Sbjct: 413 CWSVVKS----SQALTIIGNIMQMDFHVVFDLSGNARM 446
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 114/383 (29%), Positives = 162/383 (42%), Gaps = 53/383 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGC-------SSTVFNSAQSTTFKN 146
Y + GTP QTL + MDT +D W PCT C C SS +F S++ K
Sbjct: 90 YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149
Query: 147 LGCQAAQCKQVP--------------NPTCGGGACAFNLTYGSSTIAANLSQDTISLATD 192
LGC +C + +P C + + YGS + +T+ L
Sbjct: 150 LGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGK 209
Query: 193 IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
VP + GC +T P G+ G GRG SL +Q L FSYCL S + + S
Sbjct: 210 GVPNFIVGCSVLSTSQ---PAGISGFGRGPPSLPSQ---LGLKKFSYCLLSRRYDDTTES 263
Query: 253 LRLGPIGQPKR------IKYTPLLKNPRR------SSLYYVNLLAIRVGRRVVDIPPGAL 300
L G+ + YTP ++NP+ S YY+ L I VG + V IP L
Sbjct: 264 SSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYL 323
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSVPIV 358
GTIIDSGT FT + + V F ++V S V + G C+++ +
Sbjct: 324 IPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGL 383
Query: 359 A----PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLNVIANM 409
P +TL F G + LP N + + CL + AA + ++ N
Sbjct: 384 NTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNF 443
Query: 410 QQQNHRILYDVPNSRLGVARELC 432
QQQN + YD+ N RLG ++ C
Sbjct: 444 QQQNFYVEYDLRNERLGFRQQSC 466
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 121/394 (30%), Positives = 174/394 (44%), Gaps = 62/394 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST------VFNSAQSTTFKNL 147
Y A +GTP Q L + +DT + WVPCT C CSS VF+ S++ + +
Sbjct: 67 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 126
Query: 148 GCQ-------------AAQCKQVPN-------PTCGGGACA-FNLTYGSSTIAANLSQDT 186
GC+ A +C++ P P C + + YGS + A L DT
Sbjct: 127 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 186
Query: 187 ISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK- 245
+ VPG+ GC + PP GL G GRG+ S+ AQ L FSYCL S +
Sbjct: 187 LRAPGRAVPGFVLGCSLVSVHQ--PPSGLAGFGRGAPSVPAQ---LGLPKFSYCLLSRRF 241
Query: 246 --ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPG 298
+ SGSL LG G + ++Y PL+K+ L YY+ L + VG + V +P
Sbjct: 242 DDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPAR 301
Query: 299 ALQFNPTTGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS 354
A N GTI+DSGT FT L P AV R + G C++
Sbjct: 302 AFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFA 361
Query: 355 VP-----IVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPD--------NVN 400
+P + P ++ F G V LP +N + + G++ + +A D N
Sbjct: 362 LPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEG 421
Query: 401 SVLNVI-ANMQQQNHRILYDVPNSRLGVARELCT 433
S +I + QQQN+ + YD+ RLG R+ CT
Sbjct: 422 SGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCT 455
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 120/398 (30%), Positives = 179/398 (44%), Gaps = 44/398 (11%)
Query: 64 KDQARLQFLSSLAVARKSVVPIASGRQITQ--SPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
+ ++RL L++ AV+ P S + + S Y + IGTPA L DT +D
Sbjct: 57 RSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGSDLI 116
Query: 122 WVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA--------CAFN 170
W C C CS S + S++ + C C ++P P C A C+++
Sbjct: 117 WTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYH 176
Query: 171 LTYGSSTIAANLSQ-----DTISLATDIV--PGYTFGCIQKATGNSVPPQGLLGLGRGSL 223
YG++ + ++ +T + D PG FGC ++ G GL+GLGRG L
Sbjct: 177 YAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKL 236
Query: 224 SLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL- 278
SL+ Q L F Y L S +SF GSL G TPLL NP L
Sbjct: 237 SLVTQ---LNVEAFGYRLSSDLSAPSPISF-GSLADVTGGNGDSFMSTPLLTNPVVQDLP 292
Query: 279 -YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII-DSGTVFTRLVAPAYTAVRDVFRRR 336
YYV L I VG ++V IP G F+ +TGAG +I DSGT T L PAYT VRD +
Sbjct: 293 FYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQ 352
Query: 337 VGSNLTVTSLGGFD-TCY---SVPIVAPTITLMFS-GMNVTLPQDNLLIH---STAGSIT 388
+G + D C+ S P++ L F G ++ L +N L +
Sbjct: 353 MGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETAR 412
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP-NSRL 425
C ++ + + L +I N+ Q + +++D+ N+R+
Sbjct: 413 CWSVVKS----SQALTIIGNIMQMDFHVVFDLSGNARM 446
>gi|217070596|gb|ACJ83658.1| unknown [Medicago truncatula]
Length = 65
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 60/65 (92%), Positives = 63/65 (96%)
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
MNVTLPQDN+LIHSTAGS TCLAMA APDNVNSVLNVIANMQQQNHR+LYDVPNSR+GVA
Sbjct: 1 MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVA 60
Query: 429 RELCT 433
RELCT
Sbjct: 61 RELCT 65
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 174/376 (46%), Gaps = 42/376 (11%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSST----VFNSAQSTTFKNL 147
S Y V ++GTPA+ + +DT +D W+ C SS+ ++ + S++++ +
Sbjct: 56 SGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREI 115
Query: 148 GCQAAQCKQVPNPTCGGGACAF------NLTYG---SSTIAANLSQDTISLATDIVPG-- 196
C +C+ +P P G +C+ + TYG S L+ +TIS+ + G
Sbjct: 116 PCTDDECQFLPAPI--GSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKR 173
Query: 197 -------------YTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNL-YQSTFSYCL 241
GC +++ G S + G+LGLG+G +SL QT++ FSYCL
Sbjct: 174 AGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCL 233
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD-IPPGAL 300
+ S + S + +++ +TP+++NP S YYVN+ + V + VD I
Sbjct: 234 VDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDW 293
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA- 359
+ GTI DSGT + L PAY+ V + GF+ CY+V +
Sbjct: 294 GIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNVTRMEK 353
Query: 360 --PTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
P + + F G V LP +N ++ A ++ C+A+ S N++ N+ QQ+H I
Sbjct: 354 GMPKLGVEFQGGAVMELPWNNYMVL-VAENVQCVALQKVTTTNGS--NILGNLLQQDHHI 410
Query: 417 LYDVPNSRLGVARELC 432
YD+ +R+G C
Sbjct: 411 EYDLAKARIGFKWSPC 426
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 168/397 (42%), Gaps = 65/397 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CV----------------------- 129
Y++ IGTP Q + + MDT +D WVPC C+
Sbjct: 12 YLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFSPSHSSSSY 71
Query: 130 --GCSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACAFNLTYGS-STIAANLSQ 184
C+S S+ C A C + TC +F TYG+ + L++
Sbjct: 72 RDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTR 131
Query: 185 DTISL------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
DT+ + T +P + FGC+ P G+ G RG+LS +Q L + FS
Sbjct: 132 DTLRVHEGPARVTKDIPKFCFGCVGSTYHE---PIGIAGFVRGTLSFPSQL-GLLKKGFS 187
Query: 239 YCLPSFKAL---SFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-RV 292
+C +FK + S L +G + +++TP+LK+P + YY+ L AI VG
Sbjct: 188 HCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITVGNVSA 247
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFD 350
+P +F+ G +IDSGT +T L P Y+ + +F+ + V GFD
Sbjct: 248 TTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPRATEVEMRAGFD 307
Query: 351 TCYSVPI----------VAPTITLMF-SGMNVTLPQDNLLIHSTAGS----ITCLAMAAA 395
CY VP + P+IT F + ++ LPQ N +A S + CL +
Sbjct: 308 LCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTVVKCLLFQSM 367
Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D+ V + QQQN +I+YD+ R+G C
Sbjct: 368 ADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDC 404
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 147/296 (49%), Gaps = 27/296 (9%)
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLAT-DIVPGYTFGCIQKATGN 208
AA C GG C + + YG + + DT++L++ D + G+ FGC ++ G
Sbjct: 5 AAAWSDXTTRGCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGL 64
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP---KRIK 265
GLLGLGRG SL QT + Y F++C P+ S +G L GP P ++
Sbjct: 65 FGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCFPARS--SGTGYLEFGPGSSPAVSAKLS 122
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
TP+L + + YYV + IRVG +++ IP AGTI+DSGTV TRL A
Sbjct: 123 TTPMLID-TGPTFYYVGMTGIRVGGKLLPIPQSVF-----AAAGTIVDSGTVITRLPPAA 176
Query: 326 YTAVRDVFRRRVGSN--LTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQD-NL 378
Y+++R F + + +L DTCY + + PT++L+F G V+L D +
Sbjct: 177 YSSLRSAFAASMAARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQG-GVSLDVDASG 235
Query: 379 LIHSTAGSITCLAMAA--APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+I++ + S CL A A D+V ++ N Q + ++YD+ + +G C
Sbjct: 236 IIYAASVSQACLGFAGNEAADDV----AIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 131/437 (29%), Positives = 192/437 (43%), Gaps = 52/437 (11%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS---SLAVARKSVVPIASGR 89
+LQ+ H + PS+ +VL + ++D AR+ +L S + + S + SG
Sbjct: 58 SLQLLHRDTVSGTKHPSR----RHAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGG 113
Query: 90 QITQ--SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTF 144
I S Y+VR IG+P + DT +D WV PC+ C +F+ A S +F
Sbjct: 114 TIVSHGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASF 173
Query: 145 KNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGY 197
+ C + C+ GGG C + ++YG + L+ +T++L V G
Sbjct: 174 SPVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTEVQGV 233
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS--LRL 255
GC + G GLLGLG G +SL+ Q FSYCL + + SGS L L
Sbjct: 234 AMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSLVL 293
Query: 256 G-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
G P + PL++NP S YYV + + V + + G G G ++D+
Sbjct: 294 GREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVVMDT 353
Query: 315 GTVFTRLVAPAYTAVRDVFR--------RRVGSNLTVTSLGGFDTCYSV----PIVAPTI 362
GT TRL A AY A+R F R G +L FDTCY + + PT+
Sbjct: 354 GTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSL-------FDTCYDLSGYASVRVPTV 406
Query: 363 TLMF-------SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
L F ++TLP NLL+ G CLA AA V S +++ N+QQQ
Sbjct: 407 ALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAA----VASGPSILGNIQQQGIE 462
Query: 416 ILYDVPNSRLGVARELC 432
I D + +G C
Sbjct: 463 ITVDSASGYVGFGPATC 479
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 123/440 (27%), Positives = 187/440 (42%), Gaps = 70/440 (15%)
Query: 40 FSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTY 97
+ PCSP + + P S++EML DQAR ++ A V P + Q +
Sbjct: 75 YGPCSPSEGTPP-----SLVEMLRWDQARTDYVRRKATGEVDDVLEPDRPHVDMMQM-DF 128
Query: 98 IVRAKIGTPAQT------------------LLMAMDTSNDAAWVPCTGCV--GC---SST 134
++R G + + MA+DT+ D W+ C C+ C +
Sbjct: 129 MLRGTFGIGSGSGYGAVIDGDDDDDPMILSQTMAIDTTEDVPWIQCLPCLIPQCYPQRNA 188
Query: 135 VFNSAQSTTFKNLGCQAAQCKQV---------PNPTCGGGACAFNLTYGSSTIA-ANLSQ 184
F+ +S+T + C + C+ + PN T G C + + Y +
Sbjct: 189 FFDPRRSSTGAPVRCGSRACRTLGGYANGCSKPNST---GDCLYRIEYSDHRLTLGTYMT 245
Query: 185 DTISLA-TDIVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
DT++++ + + FGC G S G + LG G SLL+QT Y + FSYC+P
Sbjct: 246 DTLTISPSTTFLNFRFGCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYGNAFSYCVP 305
Query: 243 SFKA---LSFSGSLRLGPIGQPKRIKYTPLLK--NPRRSSLYYVNLLAIRVGRRVVDIPP 297
A LS G + G TPL++ N ++Y V L I V R +++PP
Sbjct: 306 GPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRSANVINPTIYVVRLQGIEVAGRRLNVPP 365
Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-- 355
GT++DS V T+L AY A+R FR + + T G DTC+
Sbjct: 366 VVFS------GGTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGNLDTCFDFVG 419
Query: 356 --PIVAPTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
+ PT++L+F G V L ++L+ S CLA AP + L I N+QQQ
Sbjct: 420 VSKVTVPTVSLVFDGGAVIELGLLSVLLDS------CLAF--APMAADFALGFIGNVQQQ 471
Query: 413 NHRILYDVPNSRLGVARELC 432
H +LYDV +G C
Sbjct: 472 THEVLYDVAGGAVGFRHGAC 491
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 117/420 (27%), Positives = 183/420 (43%), Gaps = 34/420 (8%)
Query: 33 TLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
++ + H SP SPF KPS L+ + ++ + +L S + K + +I
Sbjct: 30 SIDLIHRDSPLSPFYKPS--LTPSDRIINTALRSIYQLNRASHSDLNEKKTLERV---RI 84
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLG 148
Y++R IGTP L DT++D WV C+ C C + +F +S+TF NL
Sbjct: 85 PNHGEYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLS 144
Query: 149 CQAAQCKQVPNPTCG--GGACAFNLTYGS-STIAANLSQDTISLATDIV--PGYTFGC-- 201
C + C C G C + TYG S+ L ++I + V P FGC
Sbjct: 145 CDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQTVTFPKTIFGCGS 204
Query: 202 ---IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-- 256
N V G++GLG G LSL++Q + FSYCL F + S + L+ G
Sbjct: 205 NNDFMHQISNKV--TGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTS-TIKLKFGND 261
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+ TPL+ +P S Y+++L+ I +G++++ + + T IID GT
Sbjct: 262 TTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQV-----RTTDHTNGNIIIDLGT 316
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSL-GGFDTCY--SVPIVAPTITLMFSGMNVTL 373
V T L Y + R +G + T + FD C+ I P I F+G V L
Sbjct: 317 VLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCFPNQANITFPKIVFQFTGAKVFL 376
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
NL ++ CLA+ PD +V N+ Q + ++ YD ++ A C+
Sbjct: 377 SPKNLFFRFDDLNMICLAV--LPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADCS 434
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 175/376 (46%), Gaps = 42/376 (11%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSST----VFNSAQSTTFKNL 147
S Y V ++GTPA+ + +DT +D W+ C SS+ ++ + S++++ +
Sbjct: 24 SGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREI 83
Query: 148 GCQAAQCKQVPNPTCGGGACAF------NLTYGSSTIAAN---LSQDTISLATDIVPG-- 196
C +C +P P G +C+ + TYG S + L+ +TIS+ + G
Sbjct: 84 PCTDDECLFLPAPI--GSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKR 141
Query: 197 -------------YTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNL-YQSTFSYCL 241
GC +++ G S + G+LGLG+G +SL QT++ FSYCL
Sbjct: 142 AGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCL 201
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD-IPPGAL 300
+ S + S + + +++ +TP+++NP S YYVN+ + V + VD I
Sbjct: 202 VDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDW 261
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA- 359
+ GTI DSGT + L PAY+ V + GF+ CY+V +
Sbjct: 262 GIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNVTRMEK 321
Query: 360 --PTITLMFSGMNV-TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
P + + F G V LP +N ++ A ++ C+A+ S N++ N+ QQ+H I
Sbjct: 322 GMPKLGVEFQGGAVMELPWNNYMVL-VAENVQCVALQKVTTTNGS--NILGNLLQQDHHI 378
Query: 417 LYDVPNSRLGVARELC 432
YD+ +R+G C
Sbjct: 379 EYDLAKARIGFKWSPC 394
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 134/453 (29%), Positives = 202/453 (44%), Gaps = 66/453 (14%)
Query: 22 NPICD-----TQD-HSSTLQVFHVFSPCSPFKPSKPLSWEESV---------LEMLAKDQ 66
NP C T D + +++ + H PC+P S S E + + AK
Sbjct: 44 NPACSPAPQVTSDPNRASMPLAHRHGPCAPATTSSWPSLAERLRRDRARRDHITRKAKAS 103
Query: 67 ARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT 126
R LS +++ P + G + S Y+V IGTPA + +DT +D +WV C
Sbjct: 104 GRTTTLSDVSI------PTSLGAAV-DSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCK 156
Query: 127 GCVGCS-----STVFNSAQSTTFKNLGCQAAQCKQ-VPNP-------TCGGGACAFNLTY 173
C S +++ S+T+ + C + CK VP+ + G C + + Y
Sbjct: 157 PCNSSSCYPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEY 216
Query: 174 GS-STIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
G+ T S +T++L+ + V + FGC G GLLGLG SL++QT
Sbjct: 217 GNRDTTVGVYSTETLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAE 276
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKY--TPLLKNPRRSSLYYVNLLAIRV 288
Y FSYCLP S +G L LG P + TPL P +++ Y VNL + V
Sbjct: 277 TYGGAFSYCLPPGN--STTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSV 334
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSL 346
G + +DIPP L G IIDSGT+ T L AY+A+R FR + + L +
Sbjct: 335 GGKPLDIPPTVLS------GGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNND 388
Query: 347 GGFDTCYSVPIVA----PTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNV 399
DTCY+ +A PT+ L F G +++ +P +LI CLA A +
Sbjct: 389 DVLDTCYNFTGIANVTVPTVALTFDGGATIDLDVPS-GVLIQD------CLAFAGGASDG 441
Query: 400 NSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + +I N+ Q+ +LYD +G C
Sbjct: 442 D--VGIIGNVNQRTFEVLYDSGRGHVGFRPGAC 472
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 129/449 (28%), Positives = 190/449 (42%), Gaps = 48/449 (10%)
Query: 6 VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
+FF LFL S S+ D+ T +FH S SP + S LS + + +
Sbjct: 7 LFFHLILFLISFSQ---TTIINGDNGFTTSLFHRDSLLSPLEFSS-LSHYDRLANAFRRS 62
Query: 66 QARLQFLSSLAVARKSVVPIASGRQITQSP---TYIVRAKIGTPAQTLLMAMDTSNDAAW 122
+R S A+ ++ A G Q + P Y++ IGTP L DT +D W
Sbjct: 63 LSR-----SAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTW 117
Query: 123 VPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTI 178
C C+ C +FN +ST+F ++ C C V + CG G C ++ TYG T
Sbjct: 118 AQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTY 177
Query: 179 A-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL--YQS 235
+ +L + I++ + V GC ++G G++GLG G LSL++Q
Sbjct: 178 SKGDLGFEKITIGSSSVKS-VIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISR 236
Query: 236 TFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG--R 290
FSYCLP+ + + +G + G + P + TPL+ + + YY+ L AI +G R
Sbjct: 237 RFSYCLPTLLSHA-NGKINFGENAVVSGPGVVS-TPLI-SKNTVTYYYITLEAISIGNER 293
Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
+ G + IIDSGT T L Y V + V + G D
Sbjct: 294 HMAFAKQGNV----------IIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLD 343
Query: 351 TCYSVPIVA------PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
C+ I A P IT FS G NV L N A ++ CL + AA
Sbjct: 344 LCFDDGINAAASLGIPVITAHFSGGANVNLLPINTF-RKVADNVNCLTLKAASPTTE--F 400
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
+I N+ Q N I YD+ RL +C
Sbjct: 401 GIIGNLAQANFLIGYDLEAKRLSFKPTVC 429
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 117/426 (27%), Positives = 183/426 (42%), Gaps = 55/426 (12%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGR---QITQSPT---------- 96
K LS E + + + +AR LS AV ++ SG+ Q T PT
Sbjct: 43 KQLSRSELIRRAMQRSKARAAALS--AVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDL 100
Query: 97 -YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAA 152
Y+V IGTP Q + +DT +D W PC C+ +F +S +++ + C
Sbjct: 101 EYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQ 160
Query: 153 QCKQVPNPTCGG-GACAFNLTYGSSTIAANL-SQDTISLATD------IVPGYTFGCIQK 204
C + + C C + YG T+ + + + + + VP FGC
Sbjct: 161 LCSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVP-LGFGCGSM 219
Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPIGQ 260
G+ G++G GR LSL++Q L FSYCL S+ K+ GSL G G
Sbjct: 220 NVGSLNNGSGIVGFGRNPLSLVSQ---LSIRRFSYCLTSYGSGRKSTLLFGSLSGGVYGD 276
Query: 261 PKR-IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
++ TPLL++ + + YYV+L + VG R + IP A P G I+DSGT T
Sbjct: 277 ATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALT 336
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSVP-----------IVAPTITLMF 366
L V FR+++ L + G + C+ VP + P + F
Sbjct: 337 LLPGAVLAEVVRAFRQQL--RLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVPRMVFHF 394
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
++ LP+ N ++ CL +A + D+ ++ I N+ QQ+ R+LYD+ L
Sbjct: 395 QDADLDLPRRNYVLDDHRKGRLCLLLADSGDDGST----IGNLVQQDMRVLYDLEAETLS 450
Query: 427 VARELC 432
A C
Sbjct: 451 FAPAQC 456
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 163/337 (48%), Gaps = 42/337 (12%)
Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC 162
P+ ++A + W C CV C S F+ + S T+ C +P+ T
Sbjct: 84 PSPQEILAEMNPDSITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSC-------IPS-TV 135
Query: 163 GGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCIQKATGN-SVPPQGLLGLG 219
G +N+TYG ST N DT++L +D+ P + FGC + G+ G+LGLG
Sbjct: 136 GN---TYNMTYGDKSTSVGNYGCDTMTLEPSDVFPKFQFGCGRNNEGDFGSGADGMLGLG 192
Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK-RIKYTPLLKNP----- 273
+G LS ++QT + ++ FSYCLP ++ GSL G + +K+T L+ P
Sbjct: 193 QGQLSTVSQTASKFKKVFSYCLPEEDSI---GSLLFGEKATSQSSLKFTSLVNGPGTSGL 249
Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
S Y+V LL I VG + +++P GTIIDSGTV T L AY+A+ F
Sbjct: 250 EESGYYFVKLLDISVGNKRLNVPSSVF-----ASPGTIIDSGTVITCLPQRAYSALTAAF 304
Query: 334 RRRVG----SNLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLPQDNLLIHSTA 384
++ + SN DTCY++ ++ P I L F G +V L ++ + A
Sbjct: 305 KKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDA 364
Query: 385 GSITCLAMAA-APDNVNSVLNVIANMQQQNHRILYDV 420
+ CLA A + +NS L +I N QQ + +LYD+
Sbjct: 365 SRL-CLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDI 400
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 112/354 (31%), Positives = 163/354 (46%), Gaps = 37/354 (10%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG------CVGCSSTVFNSAQSTTFKNLGCQ 150
Y + +GTPA T LM +DT +D W P V S+ + T N C
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN--CV 179
Query: 151 AAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-VPGYTFGCIQKAT 206
A C+++ + C +C + + YG ++ A + + +T++ A V GC
Sbjct: 180 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNE 239
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY 266
G + GLLGLGRG LS +Q + +FSYCL + +
Sbjct: 240 GLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSR-------------RARPS 286
Query: 267 TPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAP 324
PR ++ YYV+LL V G RV + L+ NPTTG G I+DSGT TRL P
Sbjct: 287 RRWGGTPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARP 346
Query: 325 AYTAVRDVFR-RRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNVTLPQDNL 378
Y AVRD FR VG ++ FDTCY++ + PT+++ + G +V LP +N
Sbjct: 347 VYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENY 406
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LI C AMA V ++I N+QQQ R+++D R+G + C
Sbjct: 407 LIPVDTSGTFCFAMAGTDGGV----SIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 158/361 (43%), Gaps = 39/361 (10%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTP + DT +D W PC C G + ++++ S++F L C +A
Sbjct: 83 YLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSAT 142
Query: 154 CKQVPNPTCG--GGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
C + + C C + Y + + ++ G FGC G S
Sbjct: 143 CLPIWSSRCSTPSATCRYRYAYDDGAYSPECAGISVG-------GIAFGCGVDNGGLSYN 195
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR-------- 263
G +GLGRGSLSL+AQ L FSYCL F S S + G + +
Sbjct: 196 STGTVGLGRGSLSLVAQ---LGVGKFSYCLTDFFNTSLSSPVFFGSLAELAASSASADAA 252
Query: 264 -IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA-GTIIDSGTVFTRL 321
++ TPL+++P S YYV+L I +G + IP G N G+ G I+DSGT+FT L
Sbjct: 253 VVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIFTIL 312
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDT-CYSVPIVA-------PTITLMFS-GMNVT 372
V + V D +G V + D C+ P P + L F+ G ++
Sbjct: 313 VETGFRVVVDHVAGVLGQ--PVVNASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGADMR 370
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L +DN + + S CL + SVL N QQQN ++L+D+ +L C
Sbjct: 371 LHRDNYMSFNEEESSFCLNIVGTESASGSVL---GNFQQQNIQMLFDITVGQLSFMPTDC 427
Query: 433 T 433
+
Sbjct: 428 S 428
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 123/428 (28%), Positives = 190/428 (44%), Gaps = 54/428 (12%)
Query: 55 EESVLEMLAKDQARLQFLS-------------------SLAVARKSVVPIASGRQITQSP 95
+ESVL++ KD R++ + A++ + V + SG + S
Sbjct: 91 KESVLDLADKDAVRIETMHRRAARSGGDRTPASPSSSPRRALSERMVATVESGVAVG-SG 149
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAA 152
Y++ +GTP + M MDT +D W+ C C+ C VF+ A S++++N+ C
Sbjct: 150 EYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQ 209
Query: 153 QCKQV--PNP--TC---GGGACAFNLTYGS-STIAANLSQDTISL------ATDIVPGYT 198
+C V P P C G +C + YG S +L+ ++ ++ A+ V
Sbjct: 210 RCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVV 269
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSL 253
FGC G GLLGLGRG LS +Q + +Y TFSYCL + + F
Sbjct: 270 FGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVASKVVFGEDD 329
Query: 254 RLGPIGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGAL--QFNPTTGAGT 310
L ++ YT + + YYV L + VG +++I GT
Sbjct: 330 ALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGEGGSGGT 389
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCYSVPIV----APTITLM 365
IIDSGT + V PAY +R F R+G + + CY+V V P ++L+
Sbjct: 390 IIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGVDRPEVPELSLL 449
Query: 366 FS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F+ G P +N I I CLA+ P + +++I N QQQN ++YD+ N+R
Sbjct: 450 FADGAVWDFPAENYFIRLDPDGIMCLAVLGTP---RTGMSIIGNFQQQNFHVVYDLKNNR 506
Query: 425 LGVARELC 432
LG A C
Sbjct: 507 LGFAPRRC 514
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 172/398 (43%), Gaps = 67/398 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSSTVFNS--------AQSTTF 144
Y++ +GTP + + + MDT +D WVPC C+ C+ N S++
Sbjct: 12 YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 71
Query: 145 KNLGCQAAQCKQVPNPTCGGGACA--------------------FNLTYGS-STIAANLS 183
++L C + C V + CA F TYG+ + L+
Sbjct: 72 RDL-CVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLT 130
Query: 184 QDTISLA------TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
+DT++ T VP + FGC+ P G+ G GRG LSL +Q L Q F
Sbjct: 131 RDTLTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQLGFL-QKGF 186
Query: 238 SYCLPSFKAL---SFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-R 291
S+C FK + S L +G I +++T LLKNP + YY+ L AI VG
Sbjct: 187 SHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNAT 246
Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGF 349
+ +P +F+ G IIDSGT +T L P YT + + + + + GF
Sbjct: 247 AIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGF 306
Query: 350 DTCYSVPI----------VAPTITLMFS-GMNVTLPQDNLLIHSTAGS----ITCLAMAA 394
D CY +P + P+I+ FS +++ LPQ N A S + CL +
Sbjct: 307 DLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQN 366
Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D+ + V + QQQN +++YD+ R+G C
Sbjct: 367 MDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 139/432 (32%), Positives = 196/432 (45%), Gaps = 72/432 (16%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
S+ L++ H PC+P + S + SV + L DQ R +++ S A A
Sbjct: 65 SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---- 134
+ VP + G I + Y+V A +GTP M +DT +D +WV C C S
Sbjct: 123 AVATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQK 181
Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-----------GACAFNLTYG-SSTIAA 180
+F+ AQS+++ + C P C G C + ++YG S
Sbjct: 182 DPLFDPAQSSSYAAVPCG--------GPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTG 233
Query: 181 NLSQDTISL-ATDIVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
S DT++L A+ V G+ FGC +G N V GLLGLGR SL+ QT Y F
Sbjct: 234 VYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVF 291
Query: 238 SYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
SYCLP+ S +G L L GP G T LL +P + Y V L I VG + +
Sbjct: 292 SYCLPTKP--STAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 349
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGFDTC 352
+P A T++D+GTV TRL AY A+R FR + S T S G DTC
Sbjct: 350 VPASAFAGG------TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTC 403
Query: 353 YSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIA 407
Y+ + P + L F SG VTL D +L S CLA AP + + ++
Sbjct: 404 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILG 455
Query: 408 NMQQQNHRILYD 419
N+QQ++ + D
Sbjct: 456 NVQQRSFEVRID 467
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/415 (27%), Positives = 168/415 (40%), Gaps = 77/415 (18%)
Query: 83 VPIASGRQITQSPTYIVRAKIGT-PAQTLLMAMDTSNDAAWVPC---------------- 125
+P+A G Y + +G+ P Q + + MDT +D W PC
Sbjct: 67 LPLAPGSD------YTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTK 120
Query: 126 -------TGCVGCSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYGS 175
T V C S ++A ++ + C ++C + C +C F YG
Sbjct: 121 PANITKQTHSVSCQSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGD 180
Query: 176 STIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL--- 232
+ ANL Q T+SL++ + +TFGC A P G+ G GRG LSL AQ L
Sbjct: 181 GSFVANLYQQTLSLSSLHLQNFTFGCAHTALAE---PTGVAGFGRGILSLPAQLSTLSPH 237
Query: 233 YQSTFSYCLPSFKALSFSGSL--RLGPI--------------GQPKRIKYTPLLKNPRRS 276
+ FSYCL S SF G R P+ G+ YT +L NP+
Sbjct: 238 LGNRFSYCLVSH---SFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHP 294
Query: 277 SLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRR 336
Y V L I VG+R V P + + G ++DSGT FT L Y AV + F +R
Sbjct: 295 YYYCVGLAGISVGKRTVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKR 354
Query: 337 VG----SNLTVTSLGGFDTCYSVPIVA--PTITLMFSGMN--VTLPQDNLLIH------- 381
V + + G CY + ++ P + L F G N V LP+ N
Sbjct: 355 VNRFHKRASEIETKTGLGPCYYLNGLSQIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDG 414
Query: 382 -STAGSITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
G + C+ + D + N QQQ ++YD+ R+G A++ C
Sbjct: 415 IRRKGKVGCMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/401 (28%), Positives = 172/401 (42%), Gaps = 73/401 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSSTVFNS--------AQSTTF 144
Y++ +GTP + + + MDT +D WVPC C+ C+ N S++
Sbjct: 29 YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 88
Query: 145 KNLGCQAAQCKQVPNPTCGGGACA--------------------FNLTYGS-STIAANLS 183
++L C + C V + CA F TYG+ + L+
Sbjct: 89 RDL-CVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLT 147
Query: 184 QDTISLA------TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
+DT++ T VP + FGC+ P G+ G GRG LSL +Q L Q F
Sbjct: 148 RDTLTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQLGFL-QKGF 203
Query: 238 SYCLPSFKAL---SFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR-R 291
S+C FK + S L +G I +++T LLKNP + YY+ L AI VG
Sbjct: 204 SHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNAT 263
Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR-----RVGSNLTVTSL 346
+ +P +F+ G IIDSGT +T L P YT + + + R T
Sbjct: 264 AIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEART-- 321
Query: 347 GGFDTCYSVPI----------VAPTITLMFS-GMNVTLPQDNLLIHSTAGS----ITCLA 391
GFD CY +P + P+I+ FS +++ LPQ N A S + CL
Sbjct: 322 -GFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLL 380
Query: 392 MAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ D+ + V + QQQN +++YD+ R+G C
Sbjct: 381 LQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421
>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
Length = 484
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 121/437 (27%), Positives = 189/437 (43%), Gaps = 51/437 (11%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEE-SVLEMLAKDQARLQFL----SSLAVARKSVVPIA 86
TL V H SPCSP ++ E+ SV ++L +D R + L + + A P A
Sbjct: 63 DTLPVVHRLSPCSPLGAARIQQLEKPSVADILHRDALRFRSLFRDHNHGSAAPAPTSPGA 122
Query: 87 SG------------RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA----WVPCTGCVG 130
G +++ + Y V A GTP Q + DT+ A PC
Sbjct: 123 DGGGLSIPSRGDPIQELPGAFEYHVTAGFGTPVQQFTVGFDTTTTGATQLQCKPCAADEP 182
Query: 131 CSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISL 189
C F+ + S++ ++ C + C N C G +C +++ ++ + A D ++L
Sbjct: 183 CHH-AFDPSASSSIAHVPCGSPDCPF--NKGCSGHSCTLSVSINNTLLGNATFFTDKLTL 239
Query: 190 AT-DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT--QNLYQSTFSYCLPSFKA 246
+IV + F C++ G+L L R S SL ++ + FSYCLPS+
Sbjct: 240 TPWNIVDDFRFVCLEAGFRPDDDSTGILDLSRNSHSLASRAAPSSPDAVAFSYCLPSYP- 298
Query: 247 LSFSGSLRLG---PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
S G L LG P +++ YTPL N +LY V L+ + +G + +P A+
Sbjct: 299 -SDVGFLSLGATKPELLGRKVSYTPLRSNRHNGNLYVVELVGLGLGGVDLPVPRAAI--- 354
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA---- 359
G GTI++ T FT L Y A+RD FR+ + G DTCY+ ++
Sbjct: 355 --AGGGTILELHTTFTYLKPKVYAALRDEFRKSMSQYPVAPPQGSLDTCYNFTALSSYSV 412
Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAG---SITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P +TL F G L D ++ G S+ CLA A VI +M Q +
Sbjct: 413 PAVTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGCLAFVAQDGGA-----VIGSMAQMSTE 467
Query: 416 ILYDVPNSRLGVARELC 432
++YDV ++G C
Sbjct: 468 VVYDVRGGKVGFVPYRC 484
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 155/354 (43%), Gaps = 29/354 (8%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--------FNSAQSTT 143
T + Y++ +GTP Q + +D ++D W+ C+ C C + F + S+T
Sbjct: 92 TNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSST 151
Query: 144 FKNLGCQAAQCKQVPNPTCGGGA--CAFNLTYG---SSTIAANLSQDTISLATDIVPGYT 198
+ + C C+++ TC C ++ YG ++T A L+ D + AT G
Sbjct: 152 IREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVI 211
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
FGC G+ G++GLGRG LSL++Q Q FSY L A+ +
Sbjct: 212 FGCAVATEGDI---GGVIGLGRGELSLVSQLQ---IGRFSYYLAPDDAVDVGSFILFLDD 265
Query: 259 GQPK--RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+P+ R TPL+ N SLYYV L IRV + IP G G ++
Sbjct: 266 AKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITI 325
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV- 371
T L A AY VR ++G S G D CY+ +A P++ L+F+G V
Sbjct: 326 PVTFLDAGAYKVVRQAMASKIGLRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVM 385
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
L N + + CL + +P S+L ++ Q ++YD+ SRL
Sbjct: 386 ELEMGNYFYMDSTTGLECLTILPSPAGDGSLLG---SLIQVGTHMIYDISGSRL 436
>gi|217073832|gb|ACJ85276.1| unknown [Medicago truncatula]
Length = 122
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 74/117 (63%), Positives = 90/117 (76%)
Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
A L QD++ LATD++P Y+FG I +G S+P QGLLGLGRG LSLL+QT +LY FSY
Sbjct: 1 ATLVQDSLRLATDVIPSYSFGSINAISGFSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSY 60
Query: 240 CLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
CLPSFK+ FSGSL+LGP+GQPK I+ TPLL+NPRR SLY+VNL I VG+ V P
Sbjct: 61 CLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFP 117
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 114/372 (30%), Positives = 171/372 (45%), Gaps = 46/372 (12%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG---CSSTVFNSAQSTTFKNLGCQAAQCK 155
V +GTP Q + M +DT ++ +W+ C S+ F S+TF + C +AQC+
Sbjct: 87 VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCR 146
Query: 156 --QVPNPTCGGGA---CAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
+P+P GA C+ +L+Y GSS+ A L+ D ++ + FGC+ A +
Sbjct: 147 SRDLPSPPACDGASSRCSVSLSYADGSSSDGA-LATDVFAVGSGPPLRAAFGCMSSAFDS 205
Query: 209 S---VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
S V GLLG+ RG+LS ++Q FSYC+ +G L LG P +
Sbjct: 206 SPDGVASAGLLGMNRGALSFVSQAST---RRFSYCISDRDD---AGVLLLGHSDLPTFLP 259
Query: 266 ------YTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
Y P L P + Y V LL IRVG + + IP L + T T++DSGT F
Sbjct: 260 LNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQF 319
Query: 319 TRLVAPAYTAVRDVFRRRVG------SNLTVTSLGGFDTCYSVP-------IVAPTITLM 365
T L+ AY+A++ F R+ + + FDTC+ VP P +TL+
Sbjct: 320 TFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLL 379
Query: 366 FSGMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
F+G + + D LL + CL A D V + VI + Q N + YD+
Sbjct: 380 FNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNA-DMVPIMAYVIGHHHQMNVWVEYDL 438
Query: 421 PNSRLGVARELC 432
R+G+A C
Sbjct: 439 ERGRVGLAPVRC 450
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 107/355 (30%), Positives = 150/355 (42%), Gaps = 42/355 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y + +GTP T + DT +D W PCT C + F A S+TF L C ++
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
C+ +PN TC C +N YGS A L+ +T+ + P FGC +T N +
Sbjct: 146 CQFLPNSIRTCNATGCVYNYKYGSGYTAGYLATETLKVGDASFPSVAFGC---STENGLG 202
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTPL 269
Q LG+GR FSYCL S A S + G + ++ TP
Sbjct: 203 -QLDLGVGR----------------FSYCLRSGSAAGAS-PILFGSLANLTDGNVQSTPF 244
Query: 270 LKNPR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAPA 325
+ NP S YYVNL I VG D+P F T G GTI+DSGT T L
Sbjct: 245 VNNPAVHPSYYYVNLTGITVGE--TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDG 302
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYS------VPIVAPTITLMFS-GMNVTLPQDNL 378
Y V+ F + TV G D C+ I P++ L F G +P
Sbjct: 303 YEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFA 362
Query: 379 LIHSTA-GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + + GS+T + P + ++VI N+ Q + +LYD+ A C
Sbjct: 363 GVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 417
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 168/371 (45%), Gaps = 42/371 (11%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
T V +G+P Q + M +DT ++ +W+ C +S VFN S+++ + C + C+
Sbjct: 39 TLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS-VFNPLSSSSYSPIPCSSPVCR 97
Query: 156 ----QVPNP-TCG-GGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA--- 205
+PNP TC C ++Y +S++ NL+ D + + +PG FGC+
Sbjct: 98 TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSS 157
Query: 206 -TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPK 262
+ GL+G+ RGSLS + Q L FSYC+ + SG L G +
Sbjct: 158 NSEEDAKTTGLMGMNRGSLSFVTQ---LGLPKFSYCISGRDS---SGVLLFGDSHLSWLG 211
Query: 263 RIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+ YTPL++ P + Y V L IRVG +++ +P + T T++DSGT
Sbjct: 212 NLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQ 271
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNL------TVTSLGGFDTCYSVPIVA-----PTITLMF 366
FT L+ P YTA+R+ F + L G D CY VP P ++LMF
Sbjct: 272 FTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMF 331
Query: 367 SGMNVTLPQDNLL-----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
G + + + LL + + CL + D + VI + QQN + +D+
Sbjct: 332 RGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNS-DLLGIEAFVIGHHHQQNVWMEFDLV 390
Query: 422 NSRLGVARELC 432
SR+G C
Sbjct: 391 KSRVGFVETRC 401
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 120/375 (32%), Positives = 183/375 (48%), Gaps = 30/375 (8%)
Query: 76 AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS--- 132
A A +P SG + + ++V +GTPAQ + DT +D +WV C C G S
Sbjct: 124 APAPAVTIPDRSGTYL-DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPC-GSSGHC 181
Query: 133 ----STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG--ACAFNLTYGS-STIAANLSQD 185
+F+ ++S+T+ + C QC + C C + + YG S+ LS+D
Sbjct: 182 HPQQDPLFDPSKSSTYAAVHCGEPQCAAAGD-LCSEDNTTCLYLVRYGDGSSTTGVLSRD 240
Query: 186 TISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
T++L ++ + G+ FGC + G+ GLLGLGRG LSL +Q + + FSYCLPS
Sbjct: 241 TLALTSSRALTGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSS 300
Query: 245 KALSFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
S +G L +G P +YT +L+ P+ S Y+V L++I +G V+ +PP
Sbjct: 301 N--STTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVF-- 356
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIV 358
T GT++DSGTV T L A AY +RD FR + D CY +V
Sbjct: 357 ---TRGGTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVV 413
Query: 359 APTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
P ++ F G L ++I ++ CLA AA D L++I N QQ++ ++
Sbjct: 414 VPAVSFRFGDGAVFELDFFGVMIFLDE-NVGCLAFAAM-DTGGLPLSIIGNTQQRSAEVI 471
Query: 418 YDVPNSRLGVARELC 432
YDV ++G C
Sbjct: 472 YDVAAEKIGFVPASC 486
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 127/437 (29%), Positives = 180/437 (41%), Gaps = 68/437 (15%)
Query: 40 FSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVA-RKSVVPIASGRQITQSPTYI 98
F PCSP P S+LEML DQ R +++ A + V+ A R + +
Sbjct: 63 FGPCSPSAGRAP---APSLLEMLRWDQVRTEYVRRKASGGAEDVLNPAKPRVLMSQTDFA 119
Query: 99 VRAKI---------------GTP--AQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVF 136
VR+ G P MA+DT+ D W+ C C +F
Sbjct: 120 VRSPFGVGSGSGSSAWIDADGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLF 179
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCGG-------GACAFNLTYGSS-TIAANLSQDTIS 188
+ S+T + C++ C+ + P G C + + Y A DT++
Sbjct: 180 DPTTSSTAAAVRCRSPACRSL-GPYGNGCSNRSANAECRYLIEYSDDRATAGTYMTDTLT 238
Query: 189 LA-TDIVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
++ T V + FGC G S G + LG G+ SLLAQT + FSYC+P A
Sbjct: 239 ISGTTAVRNFRFGCSHAVRGRFSDLTAGTMSLGGGAQSLLAQTARSLGNAFSYCVPQASA 298
Query: 247 LSFSGSLRLGPIGQPKR------IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
F IG P TPL+++ SLY V L I V R + IPP A
Sbjct: 299 SGFLS------IGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAF 352
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----VP 356
AG ++DS V T+L AY A+R FR + + + G DTCY
Sbjct: 353 S------AGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGATGTLDTCYDFLGLTN 406
Query: 357 IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
+ P ++L+F G V L ++I CLA A ++ L I N+QQQ H
Sbjct: 407 VRVPAVSLVFGGGAVVVLDPPAVMIGG------CLAFTATSSDL--ALGFIGNVQQQTHE 458
Query: 416 ILYDVPNSRLGVARELC 432
+LYDV +G R C
Sbjct: 459 VLYDVAAGGVGFRRGAC 475
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 177/391 (45%), Gaps = 50/391 (12%)
Query: 62 LAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
+A+ +ARL S+ +AR S Y V IGTP Q + DT++D
Sbjct: 68 VARLEARLTGDMSVPLARIS------------DEGYTVTIGIGTPPQLHTLIADTASDLT 115
Query: 122 WVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNP---TCGGGACAFNLTYGS 175
W C + V F+ A+S++F + C + C + NP C C + Y S
Sbjct: 116 WTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCTE-DNPGTKRCSNKTCRYVYPYVS 174
Query: 176 STIAANLSQDTISLATD---IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
A L+ ++ +L+ + I + FGC GN + G+LG+ LS+++Q L
Sbjct: 175 VEAAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGILGMSPAILSMVSQ---L 231
Query: 233 YQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL---YYVNLLAIRVG 289
FSYCL + S L G R K T P + SL YYV L+ + +G
Sbjct: 232 AIPKFSYCLTPYTDRK-SSPLFFGAWADLGRYKTT----GPIQKSLTFYYYVPLVGLSLG 286
Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF 349
R +D+P GT++D G +L PA+TA+++ + LT ++ +
Sbjct: 287 TRRLDVPAATFALKQ---GGTVVDLGCTVGQLAEPAFTALKEAVLHTLNLPLTNRTVKDY 343
Query: 350 DTCYSVP-------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
C+++P + P + L F G ++ LP+DN TAG + CLA+
Sbjct: 344 KVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYFQEPTAG-LMCLALVPG-----G 397
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+++I N+QQQN +L+DV +S+ A +C
Sbjct: 398 GMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 105/366 (28%), Positives = 167/366 (45%), Gaps = 45/366 (12%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNP-- 160
IGTP Q + M +DT ++ +W+ C +S +FN S T+ + C + CK +
Sbjct: 73 IGTPPQNITMVLDTGSELSWLRCKKEPNFTS-IFNPLASKTYTKIPCSSQTCKTRTSDLT 131
Query: 161 ---TCG-GGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ-- 213
TC C F ++Y +S++ +L+ +T + P FGC+ + ++
Sbjct: 132 LPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATVFGCMDSGSSSNTEEDAK 191
Query: 214 --GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIKYTPL 269
GL+G+ RGSLS + Q + FSYC+ L +G L LG K + YTPL
Sbjct: 192 TTGLMGMNRGSLSFVNQ---MGFRKFSYCI---SGLDSTGFLLLGEARYSWLKPLNYTPL 245
Query: 270 LKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
++ P + Y V L I+V +V+ +P + T T++DSGT FT L+ P
Sbjct: 246 VQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGP 305
Query: 325 AYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSGMNVT 372
Y+A+R F + L V + G D CY + + P + LMF G ++
Sbjct: 306 VYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKLMFRGAEMS 365
Query: 373 LPQDNLLIH-----STAGSITCLAMAAAPD-NVNSVLNVIANMQQQNHRILYDVPNSRLG 426
+ LL S+ C + + ++S L I + QQQN + YD+ NSR+G
Sbjct: 366 VSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFL--IGHHQQQNVWMEYDLENSRIG 423
Query: 427 VARELC 432
A C
Sbjct: 424 FAELRC 429
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 111/386 (28%), Positives = 167/386 (43%), Gaps = 56/386 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST--------VFNSAQSTTFK 145
Y V GTP+QT+ DT + W+PCT C GC + F S++ K
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149
Query: 146 NLGCQAAQCKQV--PNPTCGG----------GACAFNLTYGSSTIAANLSQDTISLATDI 193
+GCQ+ +C+ + PN C G G + L YG + A L + +
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLT 209
Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSG 251
VP + GC +T P G+ G GRG +SL +Q NL + FS+CL S F + +
Sbjct: 210 VPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQ-MNLKR--FSHCLVSRRFDDTNVTT 263
Query: 252 SLRL------GPIGQPKRIKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGAL 300
L L + + YTP KNP S+ YY+NL I VGR+ V IP L
Sbjct: 264 DLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYL 323
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV- 355
G+I+DSG+ FT + P + V + F ++ SN T + G C+++
Sbjct: 324 APGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM-SNYTREKDLEKETGLGPCFNIS 382
Query: 356 ---PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAA----PDNVNSVLNVIA 407
+ P + F G + LP N CL + + P ++
Sbjct: 383 GKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILG 442
Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
+ QQQN+ + YD+ N R G A++ C+
Sbjct: 443 SFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 164/386 (42%), Gaps = 38/386 (9%)
Query: 80 KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
K + + SG + S Y + +GTP + + +DT +D W+ C C C + +
Sbjct: 146 KLIATLESGMTLG-SGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFY 204
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPT------CGGGACAFNLTYGS----------STIAA 180
+ S +FKN+ C +C + +P +C + YG T
Sbjct: 205 DPKTSASFKNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTV 264
Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
NL+ + V FGC G GLLGLGRG LS +Q Q+LY +FSYC
Sbjct: 265 NLTTTEGRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYC 324
Query: 241 LPSFKA-LSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS--SLYYVNLLAIRVGRRVVD 294
L + + S L G + + +T + S + YY+ + +I VG +D
Sbjct: 325 LVDRNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALD 384
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCY 353
IP +P GTIIDSGT + PAY +++ F ++ N L D C+
Sbjct: 385 IPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCF 444
Query: 354 SVP------IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
+V I P + + F+ G P +N I + + CLA+ P S ++I
Sbjct: 445 NVSGIEENNIHLPELGIAFADGAVWNFPAENSFIW-LSEDLVCLAILGTP---KSTFSII 500
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N QQQN ILYD SRLG C
Sbjct: 501 GNYQQQNFHILYDTKMSRLGFTPTKC 526
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 133/451 (29%), Positives = 194/451 (43%), Gaps = 74/451 (16%)
Query: 38 HVFSPCSPFK-------PSKPLS----WEESVLEMLAK---------DQARLQFLSSLAV 77
H+ SPCSP P K LS W+E + + D A + S V
Sbjct: 74 HLHSPCSPAAGGRDSAPPPKTLSATLQWDEHRAGHIQRKLSGNAAPMDDAGEETPQSTQV 133
Query: 78 ARKSVVPIASGRQITQS--PTYIVRAKIGTPAQTLL------MAMDTSNDAAWVPCT--- 126
+ G+ T S IV A G Q L M +DT++D WV C
Sbjct: 134 TSSPAANVNVGKSSTDSAFEQGIVPAATGPGGQKKLPGVAQSMVVDTASDVPWVQCAPCP 193
Query: 127 --GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGG----GACAFNLTY--GSS 176
C S +++ +S C + QC+ + C G G C + + Y GS
Sbjct: 194 QPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGAGNTGTCQYRVLYPDGSG 253
Query: 177 TIAANLSQDTISLATD---IVPGYTFGC----IQKATGNSVPPQGLLGLGRGSLSLLAQT 229
T +S D ++L D V + FGC ++ + N+ G + LGRG+ SL +QT
Sbjct: 254 TSGTYVS-DLLTLNADPKGAVSKFQFGCSHALLRPGSFNNKT-AGFMALGRGAQSLSSQT 311
Query: 230 QNLYQ--STFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
+ + + FSYCLP S G L LG P R TP+LK+ +Y V L+ I
Sbjct: 312 KGTFSKGNVFSYCLPPTG--SHKGFLSLGVPQHAASRYAVTPMLKSKMAPMIYMVRLIGI 369
Query: 287 RVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL 346
V + + +PP N +DS T+ TRL AY A+R FR ++ + V
Sbjct: 370 DVAGQRLPVPPAVFAAN------AAMDSRTIITRLPPTAYMALRAAFRAQMRAYRAVAPK 423
Query: 347 GGFDTCYS---VPIVA-PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
G DTCY VP+V P +TL+F V L +++ S CLA AP+ +
Sbjct: 424 GQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVMLDS------CLAF--APNANDF 475
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ +I N+QQQ +LY+V + +G R C
Sbjct: 476 MPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 115/423 (27%), Positives = 197/423 (46%), Gaps = 49/423 (11%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQ 90
S L + + + PCS K S ++ L+ D++R++ +++ + S G
Sbjct: 61 SQGLPITYSYGPCSQLGQKKSPSRQQIFLQ----DRSRVRSINAKIFGQYSTQESKDGWS 116
Query: 91 ------ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GC-SSTVFNSAQS 141
+ + ++V GTP Q + +DT +D W+ C C C + FN + S
Sbjct: 117 PESMDTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKKTFNPSLS 176
Query: 142 TTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQ-DTISLATDIVPGYTFG 200
+++ N C +P+ + + Y ++ + + D ++L D+ P + FG
Sbjct: 177 SSYSNRSC-------IPSTDTN-----YTMKYEDNSYSKGVFVCDEVTLKPDVFPKFQFG 224
Query: 201 CIQKATGNSVPPQGLLGLGRGS-LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-- 257
C G G+LGL +G SL++QT + ++ FSYC P + GSL G
Sbjct: 225 CGDSGGGEFGTASGVLGLAKGEQYSLISQTASKFKKKFSYCFPPKEHTL--GSLLFGEKA 282
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
I +K+T LL NP Y+V L+ I V ++ +++ +L +P GTIIDSGTV
Sbjct: 283 ISASPSLKFTQLL-NPPSGLGYFVELIGISVAKKRLNVS-SSLFASP----GTIIDSGTV 336
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVT---SLGGFDTCYSVP------IVAPTITLMFSG 368
TRL AY A+R F++ + +++ DTCY++ I P I L F G
Sbjct: 337 ITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVG 396
Query: 369 -MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
++V+L +L + + CLA A + S + +I N QQ + +++YD+ RLG
Sbjct: 397 EVDVSLHPSGILWANGDLTQACLAFARKSN--PSHVTIIGNRQQVSLKVVYDIEGGRLGF 454
Query: 428 ARE 430
+
Sbjct: 455 GND 457
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 164/367 (44%), Gaps = 38/367 (10%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC- 154
T + IG+P Q + M +DT ++ +W+ C +ST FN S+++ C ++ C
Sbjct: 58 TLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNST-FNPLLSSSYTPTPCNSSVCM 116
Query: 155 ---KQVPNP-TC--GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA-- 205
+ + P +C C ++Y +S+ L+ +T SLA PG FGC+ A
Sbjct: 117 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGY 176
Query: 206 ---TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-IGQP 261
GL+G+ RGSLSL+ Q + FSYC+ A G L LG P
Sbjct: 177 TSDINEDAKTTGLMGMNRGSLSLVTQ---MVLPKFSYCISGEDAF---GVLLLGDGPSAP 230
Query: 262 KRIKYTPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
++YTPL+ S Y V L I+V +++ +P + T T++DSGT
Sbjct: 231 SPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGT 290
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVP---IVAPTITLMFS 367
FT L+ P Y +++D F + LT G D CY P P +TL+FS
Sbjct: 291 QFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASLAAVPAVTLVFS 350
Query: 368 GMNVTLPQDNLLIHSTAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G + + + LL + G + C + D + VI + QQN + +D+ SR+
Sbjct: 351 GAEMRVSGERLLYRVSKGRDWVYCFTFGNS-DLLGIEAYVIGHHHQQNVWMEFDLVKSRV 409
Query: 426 GVARELC 432
G C
Sbjct: 410 GFTETTC 416
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 102/347 (29%), Positives = 151/347 (43%), Gaps = 31/347 (8%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV 157
I P M++DTS D W+ C C + +F+ +S T + C +A C ++
Sbjct: 155 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 214
Query: 158 PN--PTCGGGACAFNLTYGSS-TIAANLSQDTISL-ATDIVPGYTFGCIQKATGN-SVPP 212
C C + + YG + D ++L + +V + FGC GN S
Sbjct: 215 GRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSAST 274
Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKN 272
G + LG G SLL+QT + + FSYC+P + F G R TPL++N
Sbjct: 275 SGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRN 334
Query: 273 PRR-SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
P +LY V L I VG R +++PP G ++DS + T+L AY A+R
Sbjct: 335 PSIIPTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRL 388
Query: 332 VFRRRVGSNLTVTS-LGGFDTCYS----VPIVAPTITLMFSGMNVT-LPQDNLLIHSTAG 385
FR + + V G DTCY + P ++L+F G V L +++
Sbjct: 389 AFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVEG--- 445
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA P + L I N+QQQ H +LYDV +G R C
Sbjct: 446 ---CLAFVPTPGDF--ALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 167/367 (45%), Gaps = 38/367 (10%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC- 154
T V +G+P Q + M +DT ++ +W+ C +ST FN S+++ C ++ C
Sbjct: 59 TLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNST-FNPLLSSSYTPTPCNSSICT 117
Query: 155 ---KQVPNP-TC--GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA-- 205
+ + P +C C ++Y +S+ L+ +T SLA PG FGC+ A
Sbjct: 118 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGY 177
Query: 206 ---TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP-IGQP 261
GL+G+ RGSLSL+ Q + FSYC+ AL G L LG P
Sbjct: 178 TSDINEDSKTTGLMGMNRGSLSLVTQ---MSLPKFSYCISGEDAL---GVLLLGDGTDAP 231
Query: 262 KRIKYTPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
++YTPL+ S Y V L I+V +++ +P + T T++DSGT
Sbjct: 232 SPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGT 291
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVP---IVAPTITLMFS 367
FT L+ Y++++D F + LT G D CY P P +TL+FS
Sbjct: 292 QFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFAAVPAVTLVFS 351
Query: 368 GMNVTLPQDNLLIHSTAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G + + + LL + GS + C + D + VI + QQN + +D+ SR+
Sbjct: 352 GAEMRVSGERLLYRVSKGSDWVYCFTFGNS-DLLGIEAYVIGHHHQQNVWMEFDLLKSRV 410
Query: 426 GVARELC 432
G + C
Sbjct: 411 GFTQTTC 417
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 102/347 (29%), Positives = 151/347 (43%), Gaps = 31/347 (8%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV 157
I P M++DTS D W+ C C + +F+ +S T + C +A C ++
Sbjct: 139 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 198
Query: 158 P--NPTCGGGACAFNLTYGSS-TIAANLSQDTISL-ATDIVPGYTFGCIQKATGN-SVPP 212
C C + + YG + D ++L + +V + FGC GN S
Sbjct: 199 GRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSAST 258
Query: 213 QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKN 272
G + LG G SLL+QT + + FSYC+P + F G R TPL++N
Sbjct: 259 SGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRN 318
Query: 273 PRR-SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
P +LY V L I VG R +++PP G ++DS + T+L AY A+R
Sbjct: 319 PSIIPTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRL 372
Query: 332 VFRRRVGSNLTVTS-LGGFDTCYS----VPIVAPTITLMFSGMNVT-LPQDNLLIHSTAG 385
FR + + V G DTCY + P ++L+F G V L +++
Sbjct: 373 AFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVEG--- 429
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA P + L I N+QQQ H +LYDV +G R C
Sbjct: 430 ---CLAFVPTPGDF--ALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 118/404 (29%), Positives = 182/404 (45%), Gaps = 50/404 (12%)
Query: 63 AKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
+ DQ L L + + R S ++ +T + V +G+P Q + M +DT ++ +W
Sbjct: 31 SSDQTLLFSLKTQKLPRSSSDKLSFRHNVTLT----VTLAVGSPPQNISMVLDTGSELSW 86
Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK----QVPNP-TCGGGA--CAFNLTYGS 175
+ C S VFN S+T+ + C + C+ +P P +C C ++Y
Sbjct: 87 LHCKKSPNLGS-VFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHFCHVAISYAD 145
Query: 176 ST-IAANLSQDTISLATDIVPGYTFGCIQKA----TGNSVPPQGLLGLGRGSLSLLAQTQ 230
+T I NL+ DT + + PG FGC+ + GL+G+ RGSLS + Q
Sbjct: 146 ATSIEGNLAHDTFVIGSVTRPGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQ-- 203
Query: 231 NLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR--IKYTPLLKN----PRRSSLYY-VNL 283
L S FSYC+ + SG L LG I+YTPL+ P + Y V L
Sbjct: 204 -LGFSKFSYCISGSDS---SGILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQL 259
Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
IRVG +++ +P + T T++DSGT FT L+ P YTA+++ F + S L +
Sbjct: 260 EGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRI 319
Query: 344 TS------LGGFDTCYSVPIVA-------PTITLMFSGMNVTLPQDNLLIH-STAGS--- 386
G D CY V P I+LMF G +++ LL + AGS
Sbjct: 320 VDDPNFVFQGTMDLCYRVGSSTRPNFTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGK 379
Query: 387 --ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ C + D + VI + QQN + +D+ SR+G A
Sbjct: 380 EEVYCFTFGNS-DLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFA 422
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 159/372 (42%), Gaps = 56/372 (15%)
Query: 84 PIASGRQITQSPT--YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNS 138
P++ G PT Y+V IGTP Q + + +DT +D W C C C F+
Sbjct: 74 PVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDP 133
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISL--ATDIVPG 196
+ S+T C + C+ +P +A+ D + A VPG
Sbjct: 134 STSSTLSLTSCDSTLCQGLP-------------------VASLPRSDKFTFVGAGASVPG 174
Query: 197 YTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYC-------LPSFKALS 248
FGC G + G+ G GRG LSL +Q L FS+C +PS L
Sbjct: 175 VAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTTITGAIPSTVLLD 231
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
L G ++ TPL++NP + YY++L I VG + +P TG
Sbjct: 232 LPADLFSNGQGA---VQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTG- 287
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPIVA----PTI 362
GTIIDSGT T L Y VRD F +V L V S D C S P+ A P +
Sbjct: 288 GTIIDSGTAMTSLPTRVYRLVRDAFAAQV--KLPVVSGNTTDPYFCLSAPLRAKPYVPKL 345
Query: 363 TLMFSGMNVTLPQDNLL--IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
L F G + LP++N + + SI CLA+ + + I N QQQN +LYD+
Sbjct: 346 VLHFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGE-----VTTIGNFQQQNMHVLYDL 400
Query: 421 PNSRLGVARELC 432
NS+L C
Sbjct: 401 QNSKLSFVPAQC 412
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 118/373 (31%), Positives = 182/373 (48%), Gaps = 40/373 (10%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-------V 135
+P SG + + ++V +GTPAQ + DT +D +WV C C G S +
Sbjct: 136 IPDRSGTYL-DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPC-GSSGHCHPQQDPL 193
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT-------YGS-STIAANLSQDTI 187
F+ ++S+T+ + C QC GG C+ + T YG S+ LS+DT+
Sbjct: 194 FDPSKSSTYAAVHCGEPQCAAA------GGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTL 247
Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
+L ++ + G+ FGC + G+ GLLGLGRG LSL +Q + + FSYCLPS
Sbjct: 248 ALTSSRALAGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSN- 306
Query: 247 LSFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
S +G L +G P +YT +L+ P+ S Y+V L++I +G ++ +PP
Sbjct: 307 -STTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVF---- 361
Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAP 360
T GT++DSGTV T L A AY +RD FR + D CY ++ P
Sbjct: 362 -TRGGTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVP 420
Query: 361 TITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
++ F G L ++I ++ CLA AA D L++I N QQ++ ++YD
Sbjct: 421 AVSFRFGDGAVFELDFFGVMIFLDE-NVGCLAFAAM-DAGGLPLSIIGNTQQRSAEVIYD 478
Query: 420 VPNSRLGVARELC 432
V ++G C
Sbjct: 479 VAAEKIGFVPASC 491
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 157/363 (43%), Gaps = 36/363 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQAA 152
Y + +GTP +DT +D W C T C + +++ A+S+TF L C +
Sbjct: 96 YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASP 155
Query: 153 QCKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISL--------ATDIVPGYTFGCI 202
C+ +P+ C C ++ Y A L+ DT+++ A+ G FGC
Sbjct: 156 LCQALPSAFRACNATGCVYDYRYAVGFTAGYLAADTLAIGDGDGDGDASSSFAGVAFGCS 215
Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-- 260
G+ G++GLGR +LSLL+Q + FSYCL S A + + + G +
Sbjct: 216 TANGGDMDGASGIVGLGRSALSLLSQ---IGVGRFSYCLRS-DADAGASPILFGALANVT 271
Query: 261 PKRIKYTPLLKNP----RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+++ T LL+NP RR+ YYVNL I VG + + F G I+DSGT
Sbjct: 272 GDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGT 331
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG--GFDTCYSVPIV---APTITLMFS-GMN 370
FT L YT +R F + LT S FD C+ P + F+ G
Sbjct: 332 TFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEAGAADTPVPRLVFRFAGGAE 391
Query: 371 VTLPQDNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+P+ + G + CL + ++VI N+ Q + +LYD+ + A
Sbjct: 392 YAVPRQSYFDAVDEGGRVACLLVLP-----TRGVSVIGNVMQMDLHVLYDLDGATFSFAP 446
Query: 430 ELC 432
C
Sbjct: 447 ADC 449
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 161/349 (46%), Gaps = 34/349 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTP + DT +D W C C C + V++ + S+TF + C +A
Sbjct: 77 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSAT 136
Query: 154 CKQV---PNPTCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI------VPGYTFGCIQ 203
C V N + C + +Y +A L +T++L + + V FGC
Sbjct: 137 CLPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGCGT 196
Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--- 260
G+S+ G +GLGRG+LSLLAQ L FSYCL F + LG + +
Sbjct: 197 DNGGDSLNSTGTVGLGRGTLSLLAQ---LGVGKFSYCLTDFFNSTLDSPFLLGTLAELAP 253
Query: 261 -PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
P ++ TPLL++P S Y V+L I +G + IP + + G ++DSGT F+
Sbjct: 254 GPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFS 313
Query: 320 RLVAPAYTAVRDVFRRRVGS-NLTVTSLGGFDTCYSVPI------VAPTITLMFS-GMNV 371
L + V D + +G + +SL C+ P P + L F+ G ++
Sbjct: 314 ILPESGFRVVVDHVAQVLGQPPVNASSLD--SPCFPAPAGERQLPFMPDLVLHFAGGADM 371
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
L +DN + ++ S CL + S +++ N QQQN ++L+D+
Sbjct: 372 RLHRDNYMSYNQEDSSFCLNIVG----TTSTWSMLGNFQQQNIQMLFDM 416
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 108/372 (29%), Positives = 167/372 (44%), Gaps = 43/372 (11%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC- 154
T V GTP Q + M +DT ++ +W+ C +S +FN S T+ + C + C
Sbjct: 66 TLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEPNFNS-IFNPLASKTYTKIPCSSPTCE 124
Query: 155 ---KQVPNPTCGGGA--CAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA--- 205
+ +P P A C F ++Y +S++ NL+ +T + + P FGC+
Sbjct: 125 TRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATVFGCMDSGFSS 184
Query: 206 -TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPK 262
+ GL+G+ RGSLS + Q + FSYC+ + SG L LG K
Sbjct: 185 NSEEDAKTTGLMGMNRGSLSFVNQ---MGFRKFSYCISDRDS---SGVLLLGEASFSWLK 238
Query: 263 RIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+ YTPL++ P + Y V L IRV +V+ +P + T T++DSGT
Sbjct: 239 PLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQ 298
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSV-PIVA-----PTITLM 365
FT L+ P Y+A++ F + L V + G D CY + P A P + LM
Sbjct: 299 FTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLM 358
Query: 366 FSGMNVTLPQDNLLIH-----STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
F G +++ LL S+ C + D++ VI + QQQN + YD+
Sbjct: 359 FRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNS-DSLGIESFVIGHHQQQNVWMEYDL 417
Query: 421 PNSRLGVARELC 432
SR+G A C
Sbjct: 418 EKSRIGFAEVRC 429
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 177/385 (45%), Gaps = 46/385 (11%)
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-----SSTVFNSAQ 140
AS + + + V +GTP Q + M +DT ++ +W+ C G S+ F
Sbjct: 55 ASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRA 114
Query: 141 STTFKNLGCQAAQCK--QVPNPTCGGGA---CAFNLTY--GSSTIAANLSQDTISLATDI 193
S TF ++ C +AQC+ +P+P GA C +L+Y GSS+ A L+ + ++
Sbjct: 115 SLTFASVPCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGA-LATEVFTVGQGP 173
Query: 194 VPGYTFGCIQKA---TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
FGC+ A + + V GLLG+ RG+LS ++Q FSYC+ +
Sbjct: 174 PLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCISDRDD---A 227
Query: 251 GSLRLGPIGQP-KRIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNP 304
G L LG P + YTPL + P + Y V LL IRVG + + IP L +
Sbjct: 228 GVLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDH 287
Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSV--- 355
T T++DSGT FT L+ AY+A++ F R+ L + FDTC+ V
Sbjct: 288 TGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQG 347
Query: 356 ---PIVAPTITLMFSGMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIA 407
P P +TL+F+G +T+ D LL + CL A D V VI
Sbjct: 348 RAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNA-DMVPITAYVIG 406
Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
+ Q N + YD+ R+G+A C
Sbjct: 407 HHHQMNVWVEYDLERGRVGLAPIRC 431
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 130/447 (29%), Positives = 196/447 (43%), Gaps = 79/447 (17%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
+++ + + PC+P S + S EML +D+AR R ++ ASGR+I
Sbjct: 56 ASMPLMYRHGPCAP--ASAAATNRPSPAEMLRRDRAR----------RNHILRKASGRRI 103
Query: 92 T-------------QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---- 134
T S Y+V GTPA ++ +DT +D +WV C C SST
Sbjct: 104 TLGVSIPTSLGAFVDSLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCN--SSTCYPQ 161
Query: 135 ---VFNSAQSTTFKNLGCQAAQCKQVP---------NPTCGGGACAFNLTYGS-STIAAN 181
VF+ + S+T+ + C + C+ + N + G C + + YG+ T
Sbjct: 162 KDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGV 221
Query: 182 LSQDTISL---ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
S +T++L A +V ++FGC G GLLGLG SL++QT Y FS
Sbjct: 222 YSTETLTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFS 281
Query: 239 YCLPSFKALSFSGSLRLGPIG----QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
YCLP+ S +G L LG ++TPL ++ Y V L I VG + +D
Sbjct: 282 YCLPAGN--STAGFLALGAPATGGNNTAGFQFTPL--QVVETTFYLVKLTGISVGGKQLD 337
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTC 352
I P G IIDSGT+ T L AY+A+R FR + + L DTC
Sbjct: 338 IEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTC 391
Query: 353 Y----SVPIVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
Y + + PT+ L F G +++ +P LL CLA A + ++ +
Sbjct: 392 YDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVLLDG-------CLAFVAGASDGDT--GI 442
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
I N+ Q+ +LYD +G C
Sbjct: 443 IGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 154/349 (44%), Gaps = 34/349 (9%)
Query: 112 MAMDTSNDAAWVPC-------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK--QVPNPTC 162
+ +DT +D W C S V++ +S+TF L C C+ Q C
Sbjct: 28 LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNC 87
Query: 163 -GGGACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNSVPPQGLLGLG 219
C + YGS+ L+ +T + + FGC + G+ + G+LGL
Sbjct: 88 TSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLS 147
Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
SLSL+ Q L FSYCL F L F L + I+ T ++ NP
Sbjct: 148 PESLSLITQ---LKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPV 204
Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
+ YYV L+ I +G + + +P +L P G GTI+DSG+ LV A+ AV++
Sbjct: 205 ETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVM 264
Query: 335 RRVGSNLTVTSLGGFDTCYSVP----------IVAPTITLMFS-GMNVTLPQDNLLIHST 383
V + ++ ++ C+ +P + P + L F G + LP+DN
Sbjct: 265 DVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPR 324
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
AG + CLA+ D S +++I N+QQQN +L+DV + + A C
Sbjct: 325 AG-LMCLAVGKTTD--GSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 124/372 (33%), Positives = 172/372 (46%), Gaps = 38/372 (10%)
Query: 69 LQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC 128
L FL A + VP + G I + Y+V A +GTP M +DT +D +WV C C
Sbjct: 21 LGFLPCSHAAAVATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPC 79
Query: 129 VGCSST------VFNSAQSTTFKNLGCQAAQCKQV---PNPTCGGGACAFNLTYG-SSTI 178
S +F+ AQS+++ + C C + C C + ++YG S
Sbjct: 80 AAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNT 139
Query: 179 AANLSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTF 237
S DT++L A+ V G+ FGC +G GLLGLGR SL+ QT Y F
Sbjct: 140 TGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVF 199
Query: 238 SYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
SYCLP+ S +G L L GP G T LL +P + Y V L I VG + +
Sbjct: 200 SYCLPTKP--STAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 257
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGFDTC 352
+P A T++D+GTV TRL AY A+R FR + S T S G DTC
Sbjct: 258 VPASAFAGG------TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTC 311
Query: 353 YSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIA 407
Y+ + P + L F SG VTL D +L S CLA AP + + ++
Sbjct: 312 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILG 363
Query: 408 NMQQQNHRILYD 419
N+QQ++ + D
Sbjct: 364 NVQQRSFEVRID 375
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 118/427 (27%), Positives = 179/427 (41%), Gaps = 56/427 (13%)
Query: 38 HVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--------VPIASGR 89
H PCSP + L EML +D+ R +++ A + + VP G
Sbjct: 67 HRNGPCSPVRGKGELP----RAEMLRRDRERTEYIIRRASRSRRLQDNNDAVSVPTQLGS 122
Query: 90 QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-----VFNSAQSTTF 144
S Y+ +GTPA + +DT + WV C C +F+ S+++
Sbjct: 123 SY-DSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSY 181
Query: 145 KNLGCQAAQCKQVPNPTCGGG-------ACAFNLTYGS-STIAANLSQDTISLATD-IVP 195
+ C + +C+ + G G CA+ + YGS +T A S D ++L IV
Sbjct: 182 SPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGPGAIVK 241
Query: 196 GYTFGC-IQKATGNSVPPQGLLGLGRGSLSLLAQ-TQNLYQSTFSYCLPSFKALSFSGSL 253
+ FGC + G G+LGLGR SL Q + FS+CLP +G L
Sbjct: 242 RFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV--STGFL 299
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
LG +TPLL + Y + AI V +++DIPP + G I D
Sbjct: 300 ALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFR------EGVITD 353
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGM 369
SGTV + L AYTA+R FR + +G DTC++ + PT++L F G
Sbjct: 354 SGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRG- 412
Query: 370 NVTLPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+H A S CLA ++ D +I ++ Q+ +LYD+P ++
Sbjct: 413 -------GATVHLDASSGVLMDGCLAFWSSGDEYT---GLIGSVSQRTIEVLYDMPGRKV 462
Query: 426 GVARELC 432
G C
Sbjct: 463 GFRTGAC 469
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 130/482 (26%), Positives = 196/482 (40%), Gaps = 84/482 (17%)
Query: 10 AFLFLFSLSEGLNPICDTQDHSSTLQ------VFHVFSPCSPFKPSKPLSWEESVLEMLA 63
A + F + NP+C S L + PCS + P SV E L
Sbjct: 35 ANYYYFVAASSPNPVCQGHRVSPPLSGGGWVPLSRPHGPCSSSMDAPP----SSVAETLR 90
Query: 64 KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQ--------------- 108
DQ R ++ + VPI S +V+ K+GT Q
Sbjct: 91 WDQHRAGYIQR---KLEDQVPITRSVITQVSHQGVVQPKVGTQGQGTGVQPAGEPVGDAP 147
Query: 109 -------TLLMAMDTSNDAAWVPCTGCVG--C---SSTVFNSAQSTTFKNLGCQAAQCKQ 156
M +DT++D WV C C C + +++ ++S++ C + C+
Sbjct: 148 TGGSGGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRN 207
Query: 157 VPNP-----TCGGGACAFNLTY--GSSTIAANLSQDTISL----ATDIVPGYTFGC---I 202
+ P T G C + + Y GS++ +S D ++L + + FGC +
Sbjct: 208 L-GPYANGCTPAGDQCQYRVQYPDGSASAGTYIS-DVLTLNPAKPASAISEFRFGCSHAL 265
Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQP 261
+ S G++ LGRG+ SL QT+ Y FSYCLP SG LG P
Sbjct: 266 LQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPV--HSGFFILGVPRVAA 323
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
R TP+L++ LY V L+AI V + + +PP AG ++DS T+ TRL
Sbjct: 324 SRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVF------AAGAVMDSRTIVTRL 377
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS---------VPIVAPTITLMFSGMN-- 370
AY A+R F + + DTCY + P ITL+F G N
Sbjct: 378 PPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGA 437
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
V L +L+ CLA AP+ + + +I N+QQQ +LY+V + +G R
Sbjct: 438 VELDPSGVLLDG------CLAF--APNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRG 489
Query: 431 LC 432
C
Sbjct: 490 AC 491
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 113/374 (30%), Positives = 164/374 (43%), Gaps = 48/374 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTP + DT +D W C C C + ++++A S +F + C +A
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASAT 154
Query: 154 CKQVPNPTCGGGA-----CAFNLTYGSSTIAAN-LSQDTISLATDI---------VPGYT 198
C + + A C + Y +A L +T++ A V G
Sbjct: 155 CLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVA 214
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
FGC G S G +GLGRGSLSL+AQ L FSYCL F S + G +
Sbjct: 215 FGCGVDNGGLSYNSTGTVGLGRGSLSLVAQ---LGVGKFSYCLTDFFNTSLGSPVLFGSL 271
Query: 259 GQ---PKRI-----KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
+ P I + TPL++ P S YYV+L I +G + IP G G
Sbjct: 272 AELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGGM 331
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGGFDT-CYSVPIVA--------P 360
I+DSGT+FT LV A+ R V G N V + D+ C+ P A P
Sbjct: 332 IVDSGTIFTVLVESAF---RVVVNHVAGVLNQPVVNASSLDSPCF--PATAGEQQLPDMP 386
Query: 361 TITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
+ L F+ G ++ L +DN + + S CL +A AP S+L N QQQN ++L+D
Sbjct: 387 DMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSIL---GNFQQQNIQMLFD 443
Query: 420 VPNSRLGVARELCT 433
+ +L C+
Sbjct: 444 ITVGQLSFVPTDCS 457
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 177/387 (45%), Gaps = 49/387 (12%)
Query: 86 ASGRQIT--QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT 143
+S R+++ + T V +GTP Q++ M +DT ++ +W+ C +S VFN S++
Sbjct: 57 SSTRKVSFYHNVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKKQQNINS-VFNPHLSSS 115
Query: 144 FKNLGCQAAQCKQ------VPNPTCGGGACAFNLTYGSST-IAANLSQDTISLATDIVPG 196
+ + C + CK +P C ++Y T + NL+ DT +++ PG
Sbjct: 116 YTPIPCMSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPG 175
Query: 197 YTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
FG + ++ GL+G+ RGSLS + Q + FSYC+ A SG
Sbjct: 176 IIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQ---MGFPKFSYCISGKDA---SGV 229
Query: 253 LRLGP--IGQPKRIKYTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPT 305
L G +KYTPL+K P + Y V L+ IRVG + + +P + T
Sbjct: 230 LLFGDATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHT 289
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYS----- 354
T++DSGT FT L+ YTA+R+ F + LT+ G D C+
Sbjct: 290 GAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGG 349
Query: 355 -VPIVAPTITLMFSGMNVTLPQDNLL--------IHSTAGSITCLAMAAAPDNVNSVLNV 405
VP V P +T++F G +++ + LL + G + CL + D + V
Sbjct: 350 VVPAV-PAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNS-DLLGIEAYV 407
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
I + QQN + +D+ NSR+G A C
Sbjct: 408 IGHHHQQNVWMEFDLVNSRVGFADTKC 434
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 110/370 (29%), Positives = 168/370 (45%), Gaps = 44/370 (11%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
T V +G P Q + M +DT ++ +W+ C S VFN S+T+ + C + C+
Sbjct: 64 TLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS-VFNPVSSSTYSPVPCSSPICR 122
Query: 156 ----QVPNP-TCGGGA--CAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKA-- 205
+P P +C C ++Y +T I NL+ +T + + PG FGC+
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLS 182
Query: 206 --TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-----FKALSFSGSLRLGPI 258
+ GL+G+ RGSLS + Q L S FSYC+ F L + LGPI
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQ---LGFSKFSYCISGSDSSVFLLLGDASYSWLGPI 239
Query: 259 G-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
P ++ TPL R + Y V L IRVG +++ +P + T T++DSGT
Sbjct: 240 QYTPLVLQSTPLPYFDRVA--YTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQ 297
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA-------PTITL 364
FT L+ P YTA+++ F + S L + G D CY V P ++L
Sbjct: 298 FTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSL 357
Query: 365 MFSGMNVTLPQDNLLIH-STAGS-----ITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
MF G +++ LL + AGS + C + D + VI + QQN + +
Sbjct: 358 MFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNS-DLLGIEAFVIGHHHQQNVWMEF 416
Query: 419 DVPNSRLGVA 428
D+ SR+G A
Sbjct: 417 DLAKSRVGFA 426
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 110/370 (29%), Positives = 168/370 (45%), Gaps = 44/370 (11%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
T V +G P Q + M +DT ++ +W+ C S VFN S+T+ + C + C+
Sbjct: 64 TLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS-VFNPVSSSTYSPVPCSSPICR 122
Query: 156 ----QVPNP-TCGGGA--CAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKA-- 205
+P P +C C ++Y +T I NL+ +T + + PG FGC+
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLS 182
Query: 206 --TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-----FKALSFSGSLRLGPI 258
+ GL+G+ RGSLS + Q L S FSYC+ F L + LGPI
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQ---LGFSKFSYCISGSDSSGFLLLGDASYSWLGPI 239
Query: 259 G-QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
P ++ TPL R + Y V L IRVG +++ +P + T T++DSGT
Sbjct: 240 QYTPLVLQSTPLPYFDRVA--YTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQ 297
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA-------PTITL 364
FT L+ P YTA+++ F + S L + G D CY V P ++L
Sbjct: 298 FTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSL 357
Query: 365 MFSGMNVTLPQDNLLIH-STAGS-----ITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
MF G +++ LL + AGS + C + D + VI + QQN + +
Sbjct: 358 MFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNS-DLLGIEAFVIGHHHQQNVWMEF 416
Query: 419 DVPNSRLGVA 428
D+ SR+G A
Sbjct: 417 DLAKSRVGFA 426
>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 500
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 121/449 (26%), Positives = 194/449 (43%), Gaps = 54/449 (12%)
Query: 22 NPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL--------- 72
+PI + L V H +PCSP S SV ++ + RL+ L
Sbjct: 56 SPIPSGASNGKKLPVLHRLNPCSPLNAGGKQSTTSSV-DVSHRAGRRLRSLFAAVQSGDD 114
Query: 73 ----SSLAVARKSVVPIASGRQITQSP---TYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
+ A A V +G +P Y V GTPAQ L MA DT + V C
Sbjct: 115 AAPAPAPAAASGGVTIPTTGTPEPGAPGFHDYTVVVGYGTPAQQLAMAFDTGLGISLVRC 174
Query: 126 TGC---VGCSSTV-FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN 181
C C F+ ++S+TF + C + C+ C G+ ++
Sbjct: 175 AACRPGAPCDGLASFDPSRSSTFAPVPCGSPDCRS----GCSSGSTPSCPLTSFPFLSGA 230
Query: 182 LSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
++QD ++L V +TFGC++ ++G + GLL L R S S+ ++ TFSYC
Sbjct: 231 VAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYC 290
Query: 241 LPSFKALSFSGSLRLG----PIGQPKRI-KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
LP S G L +G P + R+ PL+ +P + Y ++L + +G R + I
Sbjct: 291 LP-LSTTSSHGFLAIGEADVPHNRTARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIPI 349
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
PP A T A ++D+ +T + Y +RD FRR + ++G DTCY+
Sbjct: 350 PPHAA----TASAAMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYNF 405
Query: 356 -----PIVAPTITLMF-------SGMNVTLPQDNLLIHSTAG---SITCLAMAAAPDNVN 400
++ P + L F G + L D + S G S+TCLA AA P + +
Sbjct: 406 TGVRHEVLIPLVHLTFRGIGGGGGGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDGD 465
Query: 401 S---VLNVIANMQQQNHRILYDVPNSRLG 426
+ + V+ + Q + +++DVP ++G
Sbjct: 466 AEAPLAMVMGTLAQSSMEVVHDVPGGKIG 494
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 168/369 (45%), Gaps = 43/369 (11%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ-- 156
V +G+P Q + M +DT ++ +W+ C +S VFN S T+ + C + CK
Sbjct: 71 VSLTVGSPPQNVTMVLDTGSELSWLHCKKTQFLNS-VFNPLSSKTYSKVPCLSPTCKTRT 129
Query: 157 ----VPNPTCGGGACAFNLTYGSST-IAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
+P C ++Y +T I NL+ +T L + P FGC+ ++
Sbjct: 130 RDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGFSSNSE 189
Query: 212 PQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP--KRIK 265
GL+G+ RGSLS + Q + FSYC+ F + +G L LG P K +
Sbjct: 190 EDSKTTGLIGMNRGSLSFVNQ---MGYPKFSYCISGFDS---AGVLLLGNASFPWLKPLS 243
Query: 266 YTPLLKN----PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
YTPL++ P + Y V L I+V +V+ +P + T T++DSGT FT
Sbjct: 244 YTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTF 303
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCY----SVPIVA--PTITLMFSG 368
L+ P YTA+++ F + L V + G D CY S P + P ++LMF G
Sbjct: 304 LLGPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQG 363
Query: 369 MNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
+++ + LL S+ C + D + VI + QQN + +D+ S
Sbjct: 364 AEMSVSGERLLYRVPGEVRGRDSVWCFTFGNS-DLLGVEAFVIGHHHQQNVWMEFDLEKS 422
Query: 424 RLGVARELC 432
R+G+A C
Sbjct: 423 RIGLADVRC 431
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 177/385 (45%), Gaps = 46/385 (11%)
Query: 86 ASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-----SSTVFNSAQ 140
AS + + + V +GTP Q + M +DT ++ +W+ C G S+ F
Sbjct: 54 ASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRA 113
Query: 141 STTFKNLGCQAAQCK--QVPNPTCGGGA---CAFNLTY--GSSTIAANLSQDTISLATDI 193
S TF ++ C +AQC+ +P+P GA C +L+Y GSS+ A L+ + ++
Sbjct: 114 SLTFASVPCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGA-LATEVFTVGQGP 172
Query: 194 VPGYTFGCIQKA---TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
FGC+ A + + V GLLG+ RG+LS ++Q FSYC+ +
Sbjct: 173 PLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCISDRDD---A 226
Query: 251 GSLRLGPIGQP-KRIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNP 304
G L LG P + YTPL + P + Y V LL IRVG + + IP L +
Sbjct: 227 GVLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDH 286
Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSV--- 355
T T++DSGT FT L+ AY+A++ F R+ L + FDTC+ V
Sbjct: 287 TGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQG 346
Query: 356 ---PIVAPTITLMFSGMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIA 407
P P +TL+F+G +T+ D LL + CL A D V VI
Sbjct: 347 RAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNA-DMVPITAYVIG 405
Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
+ Q N + YD+ R+G+A C
Sbjct: 406 HHHQMNVWVEYDLERGRVGLAPIRC 430
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 138/288 (47%), Gaps = 29/288 (10%)
Query: 162 CGGGA--CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
CG A C + + YG + L + + T +V + FGC + G GL+GL
Sbjct: 126 CGSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGL 185
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKNPR 274
GR LSL++QT ++ FSYCLPS + SGSL LG R I Y +++NP+
Sbjct: 186 GRSDLSLISQTSGIFGGVFSYCLPSTERKG-SGSLILGGNSSVYRNSSPISYAKMIENPQ 244
Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI-IDSGTVFTRLVAPAYTAVRDVF 333
+ Y++NL I +G G P+ G I +DSGTV TRL Y A++ F
Sbjct: 245 LYNFYFINLTGISIG--------GVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEF 296
Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQD----NLLIHSTAG 385
++ + DTC+++ + PTI + F G N L D + S A
Sbjct: 297 LKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEG-NAELTVDVTGVFYFVKSDAS 355
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ CLA+A+ + ++ N QQ+N R++YD +++G A E C+
Sbjct: 356 QV-CLALASL--EYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400
>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
Length = 484
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 132/454 (29%), Positives = 195/454 (42%), Gaps = 58/454 (12%)
Query: 22 NPICDTQDHS--STLQVFHVFSPCSPFKPSKPLSWEE--SVLEMLAKDQARLQFL----- 72
P C + HS S + V H SPCSP + E SV ++L +D RL+ L
Sbjct: 46 KPTCSSA-HSAHSAVPVVHRLSPCSPLAGAARNQQPERRSVADVLHRDALRLRSLLHREE 104
Query: 73 -----SSLAVARKSVVPIAS-GRQITQSP---TYIVRAKIGTPAQTLLMAMDTSNDAA-W 122
+ A V I S G I + P Y V A GTP Q L + DT+ A
Sbjct: 105 DNHRTPAPAAPPGGGVSIPSRGEPIEELPGAFEYHVVAGFGTPMQKLPVGFDTTTTGATL 164
Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG-----ACAFNLTYGSST 177
+ CT C + F+ + S++ + C + C P C G + +FN T +
Sbjct: 165 LQCTPCGSGADHAFDPSASSSVSQVPCGSPDC---PFHGCSGRPSCTLSVSFNNTLLGNA 221
Query: 178 IAANLSQDTISLATDIVPGYTFGCIQ-----KATGNSVPPQGLLGLGRGSLSL---LAQT 229
+ ++ V + F C++ A S G+L L R S SL L +
Sbjct: 222 TFFTDTLTLTPSSSATVDKFRFACLEGIAPGPAEDGSA---GILDLSRNSHSLPSRLVAS 278
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
+ FSYCLP+ A G L LG P +++ YTPL +P +LY V+L+ +
Sbjct: 279 SPPHAVAFSYCLPASTA--DVGFLSLGATKPELLGRKVSYTPLRGSPSNGNLYVVDLVGL 336
Query: 287 RVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL 346
+G + IPP A+ G TI++ T FT L Y +RD FR+ + L
Sbjct: 337 GLGGPDLPIPPAAI-----AGDDTILELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPL 391
Query: 347 GGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAG---SITCLAMAAAPDN 398
G DTCY+ P +TL F+ G +V L D ++ + SI CLA A D+
Sbjct: 392 GSLDTCYNFTGLDAFSVPAVTLKFAGGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDD 451
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ VI +M Q + ++YDV ++G C
Sbjct: 452 CDGG-TVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 167/387 (43%), Gaps = 59/387 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTVFNSAQSTTF--------K 145
Y V GTP+QTL MDT + W PCT C CS + A+ TF K
Sbjct: 90 YSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAK 149
Query: 146 NLGCQAAQCKQVPN-------PTCGGG------AC-AFNLTYGSSTIAANLSQDTISLAT 191
+GC +C V + P C AC + + YG T L +++ A
Sbjct: 150 IVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAE 209
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK----AL 247
P + GC + +S P G+ G GRG SL Q + FSYCL S +
Sbjct: 210 RTEPDFVVGC---SILSSRQPSGIAGFGRGPSSLPKQ---MGLKKFSYCLLSHRFDDSPK 263
Query: 248 SFSGSLRLGPIGQPKR---IKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGA 299
S +L +GP + + + YTP KNP S+ YYV L I VG + V +P
Sbjct: 264 SSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSF 323
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV 355
+ GTI+DSG+ FT + P + AV F R++ +N T V +L G C+++
Sbjct: 324 MVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQM-ANYTRAADVEALSGLKPCFNL 382
Query: 356 ----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN-----V 405
+ P++ F G + LP N S+ CL + + + V S L+ +
Sbjct: 383 SGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSN-EAVGSTLSSGPSII 441
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
+ N Q QN YD+ N R G R+ C
Sbjct: 442 LGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 135/419 (32%), Positives = 188/419 (44%), Gaps = 72/419 (17%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
S+ L++ H PC+P + S + SV + L DQ R +++ S A A
Sbjct: 65 SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSST- 134
+ VP + G I + Y+V A +GTP M +DT +D +WV PC+ C S
Sbjct: 123 AAATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQK 181
Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD 192
+F+ AQS+++ VP CGG CA Y + +
Sbjct: 182 DPLFDPAQSSSYA----------AVP---CGGPVCAGLGIY--------AASACSAAQCG 220
Query: 193 IVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
V G+ FGC +G N V GLLGLGR SL+ QT Y FSYCLP+ S +
Sbjct: 221 AVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKP--STA 276
Query: 251 GSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
G L L GP G T LL +P + Y V L I VG + + +P A
Sbjct: 277 GYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGG---- 332
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP----IVAPT 361
T++D+GTV TRL AY A+R FR + S T S G DTCY+ + P
Sbjct: 333 --TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 390
Query: 362 ITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
+ L F SG VTL D +L S CLA AP + + ++ N+QQ++ + D
Sbjct: 391 VALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILGNVQQRSFEVRID 441
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 103/365 (28%), Positives = 181/365 (49%), Gaps = 38/365 (10%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQC 154
+V IGTP QT M +DT + +W+ C V S+VF+ + S++F L C C
Sbjct: 83 LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLC 142
Query: 155 K-QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKAT 206
K ++P+ PT C C ++ Y T+A NL ++ I+ + + P GC ++++
Sbjct: 143 KPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESS 202
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQPKR 263
+G+LG+ G LS +Q + + FSYC+P+ + + +GS LG
Sbjct: 203 D----AKGILGMNLGRLSFASQAK---LTKFSYCVPTRQVRPGFTPTGSFYLGENPNSGG 255
Query: 264 IKYTPLL---KNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+Y LL ++ R +L Y V + IR+G + ++IP A + +P+ T+IDSG+
Sbjct: 256 FRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGS 315
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCY---SVPIVAPTITLMFS---G 368
FT LV AY VR+ R VG+ L + G D C+ ++ I ++F G
Sbjct: 316 EFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKG 375
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ + + ++ +L G + C+ + + + + + N+I N QQN + +D+ N R+G
Sbjct: 376 VEIVVEKERVLA-DVGGGVHCVGIGRS-EMLGAASNIIGNFHQQNIWVEFDLANRRVGFG 433
Query: 429 RELCT 433
+ C+
Sbjct: 434 KADCS 438
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 173/362 (47%), Gaps = 36/362 (9%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-Q 156
I+ IGTP QT M +DT + +W+ C +++ F+ + S+TF L C CK +
Sbjct: 76 IINLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTAS-FDPSLSSTFSILPCTHPLCKPR 134
Query: 157 VPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI-VPGYTFGCIQKATGNS 209
+P+ PT C C ++ Y T A NL ++ + + + P GC ++T
Sbjct: 135 IPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPLILGCATESTD-- 192
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQPKRIKY 266
P+G+LG+ G LS Q++ + FSYC+P + + +GS LG K KY
Sbjct: 193 --PRGILGMNLGRLSFAKQSK---ITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSKGFKY 247
Query: 267 TPLLKNPRRSS------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
++ + R+ Y + ++ IR+ + ++I P + + T+IDSG+ FT
Sbjct: 248 VGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGSEFTY 307
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGG------FDTCYSVPIVAPTITLMFS---GMNV 371
LV+ AY VR R VG L + G FD+ +V I ++F G+ V
Sbjct: 308 LVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMCFDSVKAVEIGRLIGEMVFEFERGVEV 367
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
+P++ +L G + C+ + ++ D + + N+I N QQN + +D+ R+G +
Sbjct: 368 VIPKERVLA-DVGGGVHCVGIGSS-DKLGAASNIIGNFHQQNLWVEFDLVRRRVGFGKAD 425
Query: 432 CT 433
C+
Sbjct: 426 CS 427
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 166/365 (45%), Gaps = 38/365 (10%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQC 154
IV IGTP Q M +DT + +W+ C + F+ + S+TF L C C
Sbjct: 98 IVDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVC 157
Query: 155 K-QVPNPT----CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI-VPGYTFGCIQKAT 206
K ++P+ T C C ++ Y T A NL ++ + + + P GC ++T
Sbjct: 158 KPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATEST 217
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF---KALSFSGSLRLGPIGQPKR 263
P+G+LG+ RG LS +Q++ + FSYC+P+ + +GS LG
Sbjct: 218 D----PRGILGMNRGRLSFASQSK---ITKFSYCVPTRVTRPGYTPTGSFYLGHNPNSNT 270
Query: 264 IKYTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+Y +L R + Y V L IR+G R ++I P + + T++DSG+
Sbjct: 271 FRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGS 330
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYS------VPIVAPTITLMFSG 368
FT LV AY VR R VG + + G D C+ ++ + G
Sbjct: 331 EFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFEFEKG 390
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ + +P++ +L + G + C+ +A + D + + N+I N QQN + +D+ N R+G
Sbjct: 391 VQIVVPKERVLA-TVEGGVHCIGIANS-DKLGAASNIIGNFHQQNLWVEFDLVNRRMGFG 448
Query: 429 RELCT 433
C+
Sbjct: 449 TADCS 453
>gi|3123349|emb|CAA06698.1| hypothetical protein [Cicer arietinum]
Length = 99
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 57/91 (62%), Positives = 74/91 (81%), Gaps = 2/91 (2%)
Query: 344 TSLGGFDTCY--SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
+SLG FDTC+ + +AP ITL F+ +N+TLP +N LIHS++GS+ CLAMAAAP NVNS
Sbjct: 8 SSLGAFDTCFVKTYETLAPAITLRFTDLNLTLPMENSLIHSSSGSLACLAMAAAPSNVNS 67
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
VLNVIAN QQQN R+L+D N+++G+ARELC
Sbjct: 68 VLNVIANFQQQNLRVLFDTVNNKVGIARELC 98
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 118/377 (31%), Positives = 173/377 (45%), Gaps = 51/377 (13%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPC-TGCVGCSSTV--------FNSAQSTTFKNLGC 149
V +GTP Q + M +DT ++ +W+ C TG G ++ F S TF + C
Sbjct: 65 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPC 124
Query: 150 QAAQC--KQVPNP-TCGGGA--CAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCI 202
+ QC + +P P +C G + C +L+Y GS++ A L+ D ++ FGC+
Sbjct: 125 GSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGA-LATDVFAVGEAPPLRSAFGCM 183
Query: 203 QKATGNS---VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
A +S V GLLG+ RG+LS + Q FSYC+ +G L LG
Sbjct: 184 STAYDSSPDGVATAGLLGMNRGTLSFVTQAST---RRFSYCI---SDRDDAGVLLLGHSD 237
Query: 260 QP-KRIKYTPL----LKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
P + YTPL L P + Y V LL IRVG + + IP L + T T++D
Sbjct: 238 LPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVD 297
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF------DTCYSVP-------IVAP 360
SGT FT L+ AY+A++ F ++ L F DTC+ VP P
Sbjct: 298 SGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLP 357
Query: 361 TITLMFSGMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
+TL+F+G +++ D LL H A + CL A D V VI + Q N
Sbjct: 358 PVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNA-DMVPLTAYVIGHHHQMNLW 416
Query: 416 ILYDVPNSRLGVARELC 432
+ YD+ R+G+A C
Sbjct: 417 VEYDLERGRVGLAPVKC 433
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 106/392 (27%), Positives = 174/392 (44%), Gaps = 68/392 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSSTVFNSAQS-------TTFK 145
Y++ IGTP Q + + MDT +D W PC C+ C + N + ++
Sbjct: 80 YLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSSSH 139
Query: 146 NLGCQAAQCKQV---PNP-----------------TCGGGACAFNLTYGS-STIAANLSQ 184
C + C V NP TC F TYG+ + L++
Sbjct: 140 RDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLTR 199
Query: 185 DTISL------ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
DT+ + T +P + FGC+ + P G+ G GRG+LSL +Q L + FS
Sbjct: 200 DTLRVHGRNLGVTQEIPRFCFGCVASSYRE---PIGIAGFGRGALSLPSQLGFL-RKGFS 255
Query: 239 YCLPSFKAL---SFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVNLLAIRVGR-RV 292
+C +FK + S L +G I + +++TP+LK+P + YYV L AI VG
Sbjct: 256 HCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITVGNVSA 315
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV----GSNLTVTSLGG 348
++P +F+ G ++DSGT +T L P Y+ V V + + +++ + + G
Sbjct: 316 TEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDMEMRT--G 373
Query: 349 FDTCYSVPI---------VAPTITLMF-SGMNVTLPQDNLLIHSTAGS----ITCLAMAA 394
FD CY VP + P+IT F + ++ L + + +A S + CL +
Sbjct: 374 FDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCLLFQS 433
Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
D V+ + QQQ+ ++YD+ R+G
Sbjct: 434 MDDGDYGPAGVLGSFQQQDVEVVYDMEKERIG 465
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 106/396 (26%), Positives = 179/396 (45%), Gaps = 54/396 (13%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT 143
P A+ + + + V +GTP Q + M +DT ++ +W+ C G F+++ S++
Sbjct: 50 PPANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSR--HDAPFDASASSS 107
Query: 144 FKNLGCQAAQC----KQVP-NPTCGGGACAFNLTYGSSTIAANL-SQDTISLATDIVPGY 197
+ + C + C + +P P C AC +L+Y ++ A L + DT L + +P
Sbjct: 108 YAPVPCSSPACTWLGRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSSPMPAL 167
Query: 198 TFGCIQKATGNS----VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA---LSFS 250
FGCI + ++ PP GLLG+ RG LS + QT F+YC+ + + L
Sbjct: 168 -FGCITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTAT---RRFAYCIAAGQGPGILLLG 223
Query: 251 GSLRLGPIGQP--KRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
G+ P+ P +++ YTPL++ + + Y V L IRVG ++ IP L +
Sbjct: 224 GNDTETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPD 283
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----------VTSLGGFDTCY 353
T T++DSGT FT L+ AY A++ F ++ +L G FD C+
Sbjct: 284 HTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACF 343
Query: 354 ----------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGS-------ITCLAMAAAP 396
+ + P + L+ G V + L++ G + CL ++
Sbjct: 344 RGTEARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSS- 402
Query: 397 DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D VI + QQ+ + YD+ N+RLG A C
Sbjct: 403 DMAGVSAYVIGHHHQQDVWVEYDLRNARLGFAAARC 438
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 150/339 (44%), Gaps = 46/339 (13%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAW----VPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
TY+V IGTP L +DT +D W PC C + ++ A+S T+ N+ C++
Sbjct: 91 TYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRS 150
Query: 152 AQCK--QVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGCIQKA 205
C+ Q P C CA+ +YG T L+ +T +L +D V G FGC +
Sbjct: 151 PMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTEN 210
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
G++ GL+G+GRG LSL++Q LG + +P+R
Sbjct: 211 LGSTDNSSGLVGMGRGPLSLVSQ--------------------------LG-VTRPRRSC 243
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
+ L I VG ++ I P + P G IIDSGT FT L A
Sbjct: 244 RARAAARGGGAPTTTSPLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERA 303
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIH 381
+ A+ RV L + G C++ + P + L F G ++ L +++ ++
Sbjct: 304 FVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVE 363
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ + CL M +A ++V+ +MQQQN ILYD+
Sbjct: 364 DRSAGVACLGMVSARG-----MSVLGSMQQQNTHILYDL 397
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 115/370 (31%), Positives = 175/370 (47%), Gaps = 42/370 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y + +G P + L+ +DT +D W+ PC C S VF+ +QST+FK + C AA
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 230
Query: 154 CKQVPNPTCGGGA-------CAFNLTYG-SSTIAANLSQDTISLATDIVPG------YTF 199
C V + C + C + YG SS + +L+ +++S++ P
Sbjct: 231 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 290
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN--LYQSTFSYCL-PSFKALSFSGSLRLG 256
GC G GLLGLG+G+LS +Q ++ + QS FSYCL LS S ++ G
Sbjct: 291 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQS-FSYCLVDRTNNLSVSSAISFG 349
Query: 257 PIGQPKR----IKYTPLLK-NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
R +++TP ++ N + YY+ + I++ + ++ IP P GTI
Sbjct: 350 AGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTI 409
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------SVPIVAPTITLM 365
IDSGT T L AY AV F R+ S CY +VP PT++++
Sbjct: 410 IDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRTAVPF--PTLSIV 466
Query: 366 F-SGMNVTLPQDNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
F +G + LPQ+N I + CLA+ +++I N QQQN LYDV ++
Sbjct: 467 FQNGAELDLPQENYFIQPDPQEAKHCLAILPT-----DGMSIIGNFQQQNIHFLYDVQHA 521
Query: 424 RLGVARELCT 433
RLG A C+
Sbjct: 522 RLGFANTDCS 531
>gi|413916846|gb|AFW56778.1| hypothetical protein ZEAMMB73_865423 [Zea mays]
Length = 130
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 66/131 (50%), Positives = 92/131 (70%), Gaps = 3/131 (2%)
Query: 283 LLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT 342
++ IRVG R V +P AL F P +G TI+++GT+FTRL AP Y VRDVF+ RV + +
Sbjct: 1 MVRIRVGGRPVPVPASALAFEPASGRDTIVEAGTMFTRLSAPVYAVVRDVFQSRVRAPVA 60
Query: 343 VTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDN-VN 400
LGGF+T Y+V I P +T F G ++VTLP+ N++I S++ I CLAMAA P N V+
Sbjct: 61 -GPLGGFNTFYNVTISVPIVTFSFDGRVSVTLPERNVVIRSSSDGIACLAMAAGPSNGVD 119
Query: 401 SVLNVIANMQQ 411
+VLN++A+MQQ
Sbjct: 120 AVLNMLASMQQ 130
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 163/386 (42%), Gaps = 38/386 (9%)
Query: 80 KSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVF 136
K + + SG + S Y + +GTP + + +DT +D W+ C C C + +
Sbjct: 144 KLIATLESGMTLG-SGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFY 202
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGS----------STIAA 180
+ S +FKN+ C +C + +P +C + YG T
Sbjct: 203 DPKTSASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTV 262
Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
NL+ + V FGC G GLLGLGRG LS +Q Q+LY +FSYC
Sbjct: 263 NLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYC 322
Query: 241 LPSFKA-LSFSGSLRLGP---IGQPKRIKYTPLLKNPRRS--SLYYVNLLAIRVGRRVVD 294
L + + S L G + + +T + S + YY+ + +I VG + +D
Sbjct: 323 LVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALD 382
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV-TSLGGFDTCY 353
IP + GTIIDSGT + PAY +++ F ++ N + D C+
Sbjct: 383 IPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCF 442
Query: 354 SVP------IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
+V I P + + F G P +N I + + CLA+ P S ++I
Sbjct: 443 NVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIW-LSEDLVCLAILGTP---KSTFSII 498
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N QQQN ILYD SRLG C
Sbjct: 499 GNYQQQNFHILYDTKRSRLGFTPTKC 524
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 175/394 (44%), Gaps = 34/394 (8%)
Query: 57 SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
SV A + + L+ A +VVPI TQ+ Y+ IGTP Q +D
Sbjct: 15 SVTARAAAFRVHGRLLADAATEGGAVVPI----HWTQAMNYVANFTIGTPPQPASAVIDL 70
Query: 117 SNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGACAFNL 171
+ + W C C C + +F+ S T++ C C+ +P+ + C G CA+
Sbjct: 71 AGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPSDSRNCSGNVCAYQA 130
Query: 172 TYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP-PQGLLGLGRGSLSLLAQTQ 230
+ + + DT ++ T FGC+ + +++ P G++GLGR SL+ QT
Sbjct: 131 STNAGDTGGKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTG 189
Query: 231 NLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY-VNLL 284
+ FSYCL A L S +L G+ + + N S YY V L
Sbjct: 190 ---VAAFSYCLAPHDAGRNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLE 246
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
++ G ++ +PP +G+ ++D+ + + LV AY AV+ VG+ T
Sbjct: 247 GLKAGDAMIPLPP--------SGSTVLLDTFSPISFLVDGAYQAVKKAVTAAVGAPPMAT 298
Query: 345 SLGGFDTCY---SVPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM-AAAPDNV 399
+ FD C+ AP + F G +T+P N L+ G++ CLAM ++A N
Sbjct: 299 PVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGTV-CLAMLSSARLNS 357
Query: 400 NSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ L+++ ++QQ+N L+D+ L CT
Sbjct: 358 TTELSLLGSLQQENIHFLFDLDKETLSFEPADCT 391
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 166/387 (42%), Gaps = 59/387 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTVFNSAQSTTF--------K 145
Y V GTP+QTL MDT + W PCT C CS + A+ TF K
Sbjct: 90 YSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAK 149
Query: 146 NLGCQAAQCKQVPN-------PTCGGG------AC-AFNLTYGSSTIAANLSQDTISLAT 191
+GC +C V + P C AC + + YG T L +++ A
Sbjct: 150 IVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAE 209
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK----AL 247
P + GC + +S P G+ G GRG SL Q + FSYCL S +
Sbjct: 210 RTEPDFVVGC---SILSSRQPSGIAGFGRGPSSLPKQ---MGLKKFSYCLLSHRFDDSPK 263
Query: 248 SFSGSLRLGPIGQPKR---IKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGA 299
S +L +GP + + + YTP KNP S+ YYV L I VG + V P
Sbjct: 264 SSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPYSF 323
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV 355
+ GTI+DSG+ FT + P + AV F R++ +N T V +L G C+++
Sbjct: 324 MVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQM-ANYTRAADVEALSGLKPCFNL 382
Query: 356 ----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN-----V 405
+ P++ F G + LP N S+ CL + + + V S L+ +
Sbjct: 383 SGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSN-EAVGSTLSSGPSII 441
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
+ N Q QN YD+ N R G R+ C
Sbjct: 442 LGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 124/427 (29%), Positives = 187/427 (43%), Gaps = 39/427 (9%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEE-SVLEMLAKDQARLQFLSSLAVARKS-VVPIASGRQ 90
+L++ H +S SPF P +E + L L+K +A +LA+ S P A +
Sbjct: 29 SLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAH-----NLAITTSSGFSPEAFRLR 83
Query: 91 ITQSPT-YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKN 146
I+Q T Y+V+ IG+P L + DT + W C C +FNS S T+++
Sbjct: 84 ISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRD 143
Query: 147 LGCQAAQCKQVPNP-TCGGGACAFNLTY-GSSTIAANLSQDTI-SLATDIVPGYTFGCIQ 203
L CQ C N C C + + Y G S A +QD + S D +P Y FGC +
Sbjct: 144 LPCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDILQSAENDRIPFY-FGCSR 202
Query: 204 KATGNSV-----PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS---LRL 255
S G++GL +SLL Q ++ ++ FSYCL F S S + LR
Sbjct: 203 DNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLRF 262
Query: 256 GPIGQPKRIKY--TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
G + R KY TP + +PR Y++NL+ + V + IPPG P GTIID
Sbjct: 263 GNDIRKSRRKYLSTPFV-SPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTIID 321
Query: 314 SGTVFTRLVAPAY----TAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLM 365
SGT T + AY TA ++ F + G L G+ CY P++
Sbjct: 322 SGTAVTYISQTAYFPVITAFKNYFDQH-GFQRVNIQLSGY-ICYKQQGHTFHNYPSMAFH 379
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
F G + + + + + C+A+ + +I + Q N + +YD N +L
Sbjct: 380 FQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQRT---IIGALNQANTQFIYDAANRQL 436
Query: 426 GVARELC 432
E C
Sbjct: 437 LFTPENC 443
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 173/399 (43%), Gaps = 68/399 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFK 145
Y +GTP Q L + +DT + +WVPCT C + VF+ S++ +
Sbjct: 91 YAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSR 150
Query: 146 NLGCQAAQCKQVPNP---TCG-------GGAC-AFNLTYGSSTIAANLSQDTISLATDIV 194
+GC+ C+ + + TCG G C + + YGS + + L DT+ L+
Sbjct: 151 LVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGSTSGLLISDTLRLSPSSS 210
Query: 195 PGYTFGCIQKATGNSV-----PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---A 246
A G S+ PP GL G GRG+ S+ +Q L FSYCL S +
Sbjct: 211 SSAPAPFRNFAIGCSIVSVHQPPSGLAGFGRGAPSVPSQ---LKVPKFSYCLLSRRFDDN 267
Query: 247 LSFSGSLRLG----PIGQPK-RIKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVDIPP 297
+ SG L LG P G+ K ++Y PLL N P S YY+ L I VG + V++P
Sbjct: 268 SAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKPVNLPS 327
Query: 298 GALQFNPTTGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
A F P++G G IIDSGT FT L P A+ R + V G C+
Sbjct: 328 RA--FVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGLRPCF 385
Query: 354 SVP------IVAPTITLMFSGMNVT-LPQDNLLIHSTAGSIT-------CLAMAA----- 394
++P + P + L F G V LP +N + + CLA+ +
Sbjct: 386 ALPPGPGGAMELPDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVVSDLPAS 445
Query: 395 -APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++ + QQQN+ I YD+ RLG ++ C
Sbjct: 446 GGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPC 484
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 103/348 (29%), Positives = 154/348 (44%), Gaps = 27/348 (7%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
T + Y+ IGTP Q + A+D S+D W C ++ FN +STT ++ C
Sbjct: 95 TNAGMYVFSYGIGTPPQQVSGALDISSDLVWTACG-----ATAPFNPVRSTTVADVPCTD 149
Query: 152 AQCKQVPNPTCGGGA--CAFNLTYGSSTIAAN----LSQDTISLATDIVPGYTFGCIQKA 205
C+Q TCG GA CA+ YG AAN L + + + G FGC K
Sbjct: 150 DACQQFAPQTCGAGASECAYTYMYGGG--AANTTGLLGTEAFTFGDTRIDGVVFGCGLKN 207
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK--R 263
G+ G++GLGRG+LSL++Q Q FSY ++ + G P+
Sbjct: 208 VGDFSGVSGVIGLGRGNLSLVSQLQ---VDRFSYHFAPDDSVDTQSFILFGDDATPQTSH 264
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT-VFTRLV 322
T LL + SLYYV L I+V + + IP G G+G + S T + T L
Sbjct: 265 TLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLE 324
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV-TLPQDN 377
AY +R ++G S G D CY+ +A P++ L+F+G V L N
Sbjct: 325 EAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELELGN 384
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+ + CL + + SVL ++ Q ++YD+ S+L
Sbjct: 385 YFYMDSTTGLACLTILPSSAGDGSVL---GSLIQVGTHMMYDINGSKL 429
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 115/370 (31%), Positives = 173/370 (46%), Gaps = 44/370 (11%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPC-TG-CVGCSSTVFNSAQSTTFKNLGCQAAQC-- 154
V +GTP Q + M +DT ++ +W+ C TG ++ F S TF + C +A+C
Sbjct: 63 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAADSFRPRASATFAAVPCGSARCSS 122
Query: 155 KQVPNP-TCGGGA--CAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKA---T 206
+ +P P +C + C +L+Y GS++ A L+ D ++ FGC+ A +
Sbjct: 123 RDLPAPPSCDAASRRCRVSLSYADGSASDGA-LATDVFAVGDAPPLRSAFGCMSAAYDSS 181
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP-KRIK 265
++V GLLG+ RG+LS + Q FSYC+ +G L LG P +
Sbjct: 182 PDAVATAGLLGMNRGALSFVTQAST---RRFSYCI---SDRDDAGVLLLGHSDLPFLPLN 235
Query: 266 YTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
YTPL + P + Y V LL IRVG + + IPP L + T T++DSGT FT
Sbjct: 236 YTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTF 295
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVP-------IVAPTITLMFS 367
L+ AY+AV+ F ++ L FDTC+ VP P +TL+F+
Sbjct: 296 LLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFN 355
Query: 368 GMNVTLPQDNLLI-----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
G +++ D LL A + CL A D V VI + Q N + YD+
Sbjct: 356 GAQMSVAGDRLLYKVPGERRGADGVWCLTFGNA-DMVPLTAYVIGHHHQMNLWVEYDLER 414
Query: 423 SRLGVARELC 432
R+G+A C
Sbjct: 415 GRVGLAPVKC 424
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 151/363 (41%), Gaps = 34/363 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y+V +GTP Q + +DT +D W C C C +F+ S++++ + C
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGEL 163
Query: 154 CKQVPNPTCGG-GACAFNLTYGSSTIAANL---------SQDTISLATDIVPGYTFGCIQ 203
C + + +C C + +YG T + S + T + FGC
Sbjct: 164 CNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGT 223
Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF----KALSFSGSLRLGPI- 258
G+ G++G GR LSL++Q L FSYCL + K+ GSLR G
Sbjct: 224 MNKGSLNNGSGIVGFGRAPLSLVSQ---LAIRRFSYCLTPYASGRKSTLLFGSLRGGVYD 280
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
++ T LL++ + + YYV + VG R + IP A P G I+DSGT
Sbjct: 281 AATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTAL 340
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSV-------PIVAPTITLMFSGM 369
T AP V FR ++ G D C++ P V P + G
Sbjct: 341 TLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQGA 400
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
++ LP+ N ++ CL +A + D+ + I N QQ+ R+LYD+ L A
Sbjct: 401 DLDLPRRNYVLDDQRKGNLCLLLADSGDSGTT----IGNFVQQDMRVLYDLEADTLSFAP 456
Query: 430 ELC 432
C
Sbjct: 457 AQC 459
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 131/435 (30%), Positives = 188/435 (43%), Gaps = 70/435 (16%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEE---SVLEMLAKDQARLQFLSS-LAVARKSVVPI 85
+S+ + H++ PCSP S + + S+ +M+ DQ R ++ L A P+
Sbjct: 61 NSTWAPLHHLYGPCSPAPSSANSTAADVAASMADMVDDDQRRADYIQKRLTGATDDKQPM 120
Query: 86 A-SGR--QITQSPTYIVRAKI--------------------GTPAQTLLMAMDTSNDAAW 122
A S R Q ++ Y + GT A T + +D+ +D +W
Sbjct: 121 AFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTTATTNSAPDGTSAVTQTVIIDSGSDVSW 180
Query: 123 VPCTGC--VGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPT---CGGGA-CAFNLTY 173
V C C C +F+ A STT+ + C +A C Q+ P C A C F + Y
Sbjct: 181 VQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPYRRGCSANAQCQFGINY 239
Query: 174 GS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQT 229
G ST S D ++L D++ G+ FGC G++ G L LG GS SL+ QT
Sbjct: 240 GDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQT 299
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY------TPLLKNPRRSSLYYVNL 283
Y FSYCLP S G L LG P+R + TPLL + + Y V L
Sbjct: 300 ATRYGRVFSYCLP--PTASSLGFLVLG--VPPERAQLIPSFVSTPLLSSSMAPTFYRVLL 355
Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV 343
AI V R + +PP A ++IDS T+ +RL AY A+R FR +
Sbjct: 356 RAIIVAGRPLAVPPAVFS------ASSVIDSSTIISRLPPTAYQALRAAFRSAMTMYRAA 409
Query: 344 TSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
+ DTCY I P+I L+F G V L +L+ S CLA AP
Sbjct: 410 PPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLGS------CLAF--APTA 461
Query: 399 VNSVLNVIANMQQQN 413
+ + I N+QQ+
Sbjct: 462 SDRMPGFIGNVQQKT 476
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 72/231 (31%), Positives = 98/231 (42%), Gaps = 32/231 (13%)
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP-LLKN 272
G + R L L TQ Y FSYC+P + S G + LG P+R P +
Sbjct: 510 GPYDVDRQGLPLRTATQ--YGRVFSYCIP--PSPSSLGFITLG--VPPQRAALVPTFVST 563
Query: 273 PRRSS------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
P SS Y V L AI V R + +PP + ++I S TV +RL AY
Sbjct: 564 PLLSSSSMPPTFYRVLLRAIIVAGRPLPVPPTVFSTS------SVIASTTVISRLPPTAY 617
Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIH 381
A+R FRR + T + DTCY I P+I L+F G V L +L+
Sbjct: 618 QALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLQ 677
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA AP + + I N+QQ+ ++YDVP + C
Sbjct: 678 G------CLAF--APTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 124/455 (27%), Positives = 200/455 (43%), Gaps = 42/455 (9%)
Query: 1 MKPQLVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSWEESVL 59
MKP + F LAF +S+S + + T+ + H SP SPF PS L+ + ++
Sbjct: 1 MKPFVFFCLAF---YSVSSLFSTEANESPSGFTVDLIHRDSPLSPFYNPS--LTPSQRII 55
Query: 60 EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
+ +RL +S+L + + + +P I + Y++R IGTP L DT +D
Sbjct: 56 NAALRSISRLNRVSNL-LDQNNKLP--QSVLILHNGEYLMRFYIGTPPVERLATADTGSD 112
Query: 120 AAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQV--PNPTCG-GGACAFNLTY 173
WV C+ C C S+ +F +S+TF C++ C + CG G C + Y
Sbjct: 113 LIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKGCGKSGECIYTYKY 172
Query: 174 GS--STIAANLSQDTI------SLATDIVPGYTFGCIQKATGNSVPPQ---GLLGLGRGS 222
G S LS +T+ + T P FGC P G++GLG G
Sbjct: 173 GDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGP 232
Query: 223 LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG--PIGQPKRIKYTPLLKNPRRSSLYY 280
LSL++Q + FSYCL + S + L+ G I + + TP++ P + Y+
Sbjct: 233 LSLVSQIGDQIGHKFSYCLLPLGSTS-TSKLKFGNESIITGEGVVSTPMIIKPWLPTYYF 291
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
+NL A+ V ++ V P G+ T IIDSGT+ T L Y + +
Sbjct: 292 LNLEAVTVAQKTV--PTGS------TDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVE 343
Query: 341 LTVTSLGGFDTC--YSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDN 398
L L C Y V P I F+G V+L NL + + + CL + AP +
Sbjct: 344 LVQDVLSPLPFCFPYRDNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMI--APSS 401
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
V+ + ++ + Q + ++ YD+ ++ C+
Sbjct: 402 VSGI-SIFGSFSQIDFQVEYDLEGKKVSFQPTDCS 435
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 127/460 (27%), Positives = 191/460 (41%), Gaps = 56/460 (12%)
Query: 1 MKPQLVFFLAFLFLFSLS-----EGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSW 54
M P + LA L +LS EGL ++ + H SP SPF PS L+
Sbjct: 1 MHPWVFMILALFSLSTLSSREAREGL--------RGFSVDLIHRDSPSSPFYNPS--LTP 50
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
E ++ + +RLQ +S K + I Y++R IG+P L +
Sbjct: 51 SERIINAALRSMSRLQRVSHFLDENK----LPESLLIPDKGEYLMRFYIGSPPVERLAMV 106
Query: 115 DTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCK--QVPNPTCGG-GACA 168
DT + W+ C+ C C + +F +S+T+K C + C Q CG G C
Sbjct: 107 DTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCI 166
Query: 169 FNLTYGSSTIAAN-LSQDTISLA------TDIVPGYTFGC-----IQKATGNSVPPQGLL 216
+ + YG + + L +T+S T P FGC T N V G+
Sbjct: 167 YGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKV--MGIA 224
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG--PIGQPKRIKYTPLLKNPR 274
GLG G LSL++Q FSYCL + + S S L+ G I + TPL+ P
Sbjct: 225 GLGAGPLSLVSQLGAQIGHKFSYCLLPYDSTSTS-KLKFGSEAIITTNGVVSTPLIIKPS 283
Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
+ Y++NL A+ +G++VV T +IDSGT T L Y +
Sbjct: 284 LPTYYFLNLEAVTIGQKVVS--------TGQTDGNIVIDSGTPLTYLENTFYNNFVASLQ 335
Query: 335 RRVGSNLTVTSLGGFDTCY--SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM 392
+G L TC+ + P I F+G +V L N+LI T +I CLA+
Sbjct: 336 ETLGVKLLQDLPSPLKTCFPNRANLAIPDIAFQFTGASVALRPKNVLIPLTDSNILCLAV 395
Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ S+ IA Q + ++ YD+ ++ A C
Sbjct: 396 VPSSGIGISLFGSIA---QYDFQVEYDLEGKKVSFAPTDC 432
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 110/428 (25%), Positives = 177/428 (41%), Gaps = 56/428 (13%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSV--VPIASGRQITQSP------------ 95
K +S E + + + +AR ++L+VAR VP S +Q Q
Sbjct: 45 KQMSRRELIRRAMQRSKARA---AALSVARSGSGRVPGKSAQQGEQHQQPGVPVRPSGDL 101
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAA 152
Y++ IGTP Q + +DT +D W PC C+ +F A S+++ + C
Sbjct: 102 EYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQ 161
Query: 153 QCKQVPNPTCGG-GACAFNLTYGSSTIAANL-SQDTISLATDI-----VPGYTFGCIQKA 205
C + + +C C + YG T + + + + A+ VP FGC
Sbjct: 162 LCNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVP-LGFGCGTMN 220
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL--------SFSGSLRLGP 257
G+ G++G GR LSL++Q L FSYCL + + S S + G
Sbjct: 221 VGSLNNGSGIVGFGRDPLSLVSQ---LSIRRFSYCLTPYTSTRKSTLMFGSLSDGVFEGD 277
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+++ T LL++ + + YYV + VG R + IP A P G I+DSGT
Sbjct: 278 DAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTA 337
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------------IVAPTITL 364
T A T V FR ++ T +S C++ P + P +
Sbjct: 338 LTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAF 397
Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F G ++ LP+ N ++ C+ +A + D+ + I N QQ+ R+LYD+
Sbjct: 398 HFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGAT----IGNFVQQDMRVLYDLEAET 453
Query: 425 LGVARELC 432
L A C
Sbjct: 454 LSFAPAQC 461
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 163/378 (43%), Gaps = 46/378 (12%)
Query: 91 ITQSPTYIVRAKIGTP-AQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKN 146
+ S Y++ IGTP Q + + MDT +D W CT C C +F+ + S+TF+
Sbjct: 81 VPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRA 140
Query: 147 LGCQAAQCKQVPNPTCGGGACAFNL-------TYGSSTIAAN-LSQDTISLATD------ 192
+ C C+ P+ ACA +YG +I A + +DT + +
Sbjct: 141 VACPDPICR--PSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAP 198
Query: 193 --IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF 249
V G FGC TG + G+ G GRG LSL +Q L FSYCL S
Sbjct: 199 PVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQ---LRVGRFSYCLTSHDETE- 254
Query: 250 SGSLRLGPIGQPKR---------IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
S +G P + TP++ +P + YY++L I VG+ + +
Sbjct: 255 SNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVF 314
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP--- 356
GT+IDSGT T A + +++ F ++ TS G C+ P
Sbjct: 315 ALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGNLLCFQRPKGG 374
Query: 357 --IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNH 414
+ P + + ++ LP++N + T + CL + A V+ VL I N QQQN
Sbjct: 375 KQVPVPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGA--EVDMVL--IGNFQQQNM 430
Query: 415 RILYDVPNSRLGVARELC 432
I+YDV NS+L A C
Sbjct: 431 HIVYDVENSKLLFASAQC 448
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 174/394 (44%), Gaps = 34/394 (8%)
Query: 57 SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
SV A + + L+ A +VVPI TQ+ Y+ IGTP Q +D
Sbjct: 15 SVTARAAAFRVHGRLLADAATEGGAVVPI----HWTQAMNYVANFTIGTPPQPASAVIDL 70
Query: 117 SNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNP--TCGGGACAFNL 171
+ + W C C C + +F+ S T++ C C+ +P+ C G CA+
Sbjct: 71 AGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTPLCESIPSDVRNCSGNVCAYEA 130
Query: 172 TYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP-PQGLLGLGRGSLSLLAQTQ 230
+ + + DT ++ T FGC+ + +++ P G++GLGR SL+ QT
Sbjct: 131 STNAGDTGGKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTG 189
Query: 231 NLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY-VNLL 284
+ FSYCL A L S +L G+ + + N S YY V L
Sbjct: 190 ---VAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLE 246
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
++ G ++ +PP +G+ ++D+ + + LV AY AV+ VG+ T
Sbjct: 247 GLKAGDAMIPLPP--------SGSTVLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMAT 298
Query: 345 SLGGFDTCY---SVPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM-AAAPDNV 399
+ FD C+ AP + F G +T+P N L+ G++ CLAM ++A N
Sbjct: 299 PVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGTV-CLAMLSSARLNS 357
Query: 400 NSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ L+++ ++QQ+N L+D+ L CT
Sbjct: 358 TTELSLLGSLQQENIHFLFDLDKETLSFEPADCT 391
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 117/429 (27%), Positives = 177/429 (41%), Gaps = 97/429 (22%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS---VVPIAS 87
S L + + PCS S+P S +E + +D++R+ F++S S +
Sbjct: 63 SQGLPITQKYGPCSGSGHSQPPSPQE----IFGRDESRVSFINSKCNQYTSGNLKNHAHN 118
Query: 88 GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTF 144
+ ++V GTP Q ++ +DT + W C CV C S FN + S+T+
Sbjct: 119 NNLFDEDGNFLVDVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASSTY 178
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCI 202
+ C + +N+TYG ST N DT++L +D+ + FGC
Sbjct: 179 SSGSCIPGTVEN-----------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCG 227
Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IG 259
+ G+ G+LGLG+G LS ++QT + + FSYCLP ++ GSL G
Sbjct: 228 RNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSI---GSLLFGEKATS 284
Query: 260 QPKRIKYTPLLKNP---RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
Q +K+T L+ P + S Y+VNL I VG ++IP GTIIDS T
Sbjct: 285 QSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRT 339
Query: 317 VFTRLVAPAYTAVRDVF------------RRRVGSNLTVTSLGGFDTCYSVPIVAPTITL 364
V TRL AY+A++ F RR+ G L DTCY+
Sbjct: 340 VITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL--------DTCYNX--------- 382
Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
P+ L +I N QQ + +LYD+ R
Sbjct: 383 --------------------------XXXXXPE-----LTIIGNRQQLSLTVLYDIQGGR 411
Query: 425 LGVARELCT 433
+G C+
Sbjct: 412 IGFRSNGCS 420
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 175/365 (47%), Gaps = 38/365 (10%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQC 154
+V IGTP Q+ M +DT + +W+ C V STVF+ + S++F L C C
Sbjct: 78 LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLC 137
Query: 155 K-QVPN---PT-CG-GGACAFNLTYGSSTIA-ANLSQDTISLAT-DIVPGYTFGCIQKAT 206
K ++P+ PT C C ++ Y T+A NL ++ I+ +T P GC + A+
Sbjct: 138 KPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLILGCAEDAS 197
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQPKR 263
+ +G+LG+ G LS +Q + + FSYC+P+ + + +GS LG
Sbjct: 198 DD----KGILGMNLGRLSFASQAK---ITKFSYCVPTRQVRPGFTPTGSFYLGENPNSAG 250
Query: 264 IKYTPLL---KNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+Y LL ++ R +L + V L IR+G + ++IP A + +P+ ++IDSG+
Sbjct: 251 FQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGS 310
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGFDTCYS------VPIVAPTITLMFSG 368
FT LV AY VR+ R G L + G D C+ ++ + G
Sbjct: 311 EFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSDMCFDGNAMEIGRLIGNMVFEFDKG 370
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ + + + +L G + C+ + + + + + N+I N QQN + +D+ N R+G
Sbjct: 371 VEIVIEKGRVLA-DVGGGVHCVGIGRS-EMLGAASNIIGNFHQQNLWVEFDIANRRVGFG 428
Query: 429 RELCT 433
+ C+
Sbjct: 429 KADCS 433
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 157/391 (40%), Gaps = 68/391 (17%)
Query: 107 AQTLLMAMDTSNDAAWVPCT------------------------GCVGCSSTVFNSAQST 142
+QTL + MDT +D W PC+ + C S ++A ++
Sbjct: 102 SQTLSVYMDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISCKSRACSTAHNS 161
Query: 143 TFKNLGCQAAQC--KQVPNPTCGGGAC-AFNLTYGSSTIAANLSQDTISL-ATDIVP--- 195
+ C A+C ++ C C +F YG ++ A L + + + +T P
Sbjct: 162 PSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNLIMPSTSNKPFSL 221
Query: 196 -GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSYCLPSFK----AL 247
+TFGC A G P G+ G G GSLSL AQ NL + FSYCL S L
Sbjct: 222 KDFTFGCAHSALGE---PIGVAGFGFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDSTKL 278
Query: 248 SFSGSLRLGPIGQPK-----RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
L LG + + + YTP+L NP+ Y V++ AI VG V P ++
Sbjct: 279 HHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSSRVRAPNALIRI 338
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL----TVTSLGGFDTCYSVP-- 356
+ G ++DSGT +T L Y +V RRVG S G CY +
Sbjct: 339 DRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLEGN 398
Query: 357 ------IVAPTITLMFSG-MNVTLPQDNLLIHSTAGS-------ITCLAMAAAPDNVNSV 402
+V P + F G +V LP+ N G + CL + D
Sbjct: 399 GVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLMLMDGGDESEGG 458
Query: 403 LNV-IANMQQQNHRILYDVPNSRLGVARELC 432
+ N QQQ +++YD+ R+G A C
Sbjct: 459 PGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 121/450 (26%), Positives = 188/450 (41%), Gaps = 40/450 (8%)
Query: 7 FFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQ 66
F+ + L LF + TQ++ ++++ H S SPF + ++ M
Sbjct: 3 FYSSLLLLFCFCRV--SVSKTQNNGFSVELIHPISSKSPFYNTAESHFQRMSNNM-KHST 59
Query: 67 ARLQFLSSLAVARKSVVPIASGRQITQSP----TYIVRAKIGTPAQTLLMAMDTSNDAAW 122
R+ +L+ + + VP I SP YI+ IGTP L MDT+ND W
Sbjct: 60 NRVHYLNHVFSFPPNKVP-----NIVVSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIW 114
Query: 123 V---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTYGSS 176
PC C +S +F+ ++S+T+K + C + +CK V N C C ++ TYG
Sbjct: 115 FQCNPCKPCFNTTSPMFDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGE 174
Query: 177 TIA-ANLSQDTISLATDIVPGYTFGCIQKATG--NSVPPQGL----LGLGRGSLSLLAQT 229
+ +LS DT++L ++ +F I G N P +G +GLGRG LS ++Q
Sbjct: 175 AYSQGDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQL 234
Query: 230 QNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL--YYVNLLAI 286
+ FSYCL P F SG L G + + P + Y L A+
Sbjct: 235 NSSIGGKFSYCLVPLFSNEGISGKLHF---GDKSVVSGVGTVSTPITAGEIGYSTTLNAL 291
Query: 287 RVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL 346
VG ++ + N G TIIDSGT T L Y+ + + V +
Sbjct: 292 SVGDHIIKFENSTSK-NDNLG-NTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPN 349
Query: 347 GGFDTCYSVPIV---APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
F CY + P IT F+G +V L N + + C A + N
Sbjct: 350 QQFKLCYKATLKNLDVPIITAHFNGADVHLNSLNTF-YPIDHEVVCFAFVSVG---NFPG 405
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
+I N+ QQN + +D+ + + CT
Sbjct: 406 TIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 123/476 (25%), Positives = 198/476 (41%), Gaps = 85/476 (17%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAK 64
L+ L+ +FL S + I S T ++ H+ SP SPF + E+ LAK
Sbjct: 16 LIIILSTVFLSSFA-----IIQADKFSFTAELIHIDSPNSPF-----FNASETTTHRLAK 65
Query: 65 DQARLQFLSSLAVARKSVVPIASGRQ------ITQSPTYIVRAKIGTPAQTLLMAMDTSN 118
R S+ VAR + P+++ + + Y+++ IGTP + A+DT +
Sbjct: 66 ALQR----SANRVAR--LNPLSNSDEGVHASIFSGDGNYLMKLLIGTPPTEIHAAIDTGS 119
Query: 119 DAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS 175
+ W+PC C C SS++FN S+T+++ C + QC+ + C ++
Sbjct: 120 NVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKH 179
Query: 176 STIAAN--LSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLS 224
N ++ DT++L + +P F C GNS+ G++GLGRG+LS
Sbjct: 180 QLNCPNGRIAVDTMTLTSSDGRPFPLPYSDFVC-----GNSIYKTFAGVGVIGLGRGALS 234
Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY--------------TPLL 270
L ++ +L FSYCL + + QP +I + + L
Sbjct: 235 LTSKLYHLSDGKFSYCLADYYS------------KQPSKINFGLQSFISDDDLEVVSTTL 282
Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
+ R S YYV L I VG + D+ F P G +IDSGT+FT L Y +
Sbjct: 283 GHHRHSGNYYVTLEGISVGEKRQDLYYVDDPFAPPVG-NMLIDSGTMFTLLPKDFYDYLW 341
Query: 331 DVFRRRVGSN-----------LTVTSLGGFDTC--YSVPIVAPTITLMFSGMNVTLPQDN 377
+ N ++ + C Y + P IT+ F+ +V L DN
Sbjct: 342 STVSYAIPENPQNHPHNSRFPFSMDNTLKLSPCFWYYPELKFPKITIHFTDADVELSDDN 401
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
I A + C A AA ++V + QQ N + YD+ + R C+
Sbjct: 402 SFIR-VAEDVVCFAFAATQPGQSTVY---GSWQQMNFILGYDLKRGTVSFKRTDCS 453
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 108/372 (29%), Positives = 171/372 (45%), Gaps = 48/372 (12%)
Query: 96 TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK 155
T V +G+P Q + M +DT ++ +W+ C +S VFN S+++ + C + C+
Sbjct: 999 TLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS-VFNPLSSSSYSPIPCSSPICR 1057
Query: 156 ----QVPNP-TCG-GGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKA--- 205
+PNP TC C ++Y +S++ NL+ D + + +PG FGC+
Sbjct: 1058 TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSS 1117
Query: 206 -TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL---PSFKALSFSGSLRLGPIGQP 261
+ GL+G+ RGSLS + Q L FSYC+ S L F G L L +G
Sbjct: 1118 NSEEDAKTTGLMGMNRGSLSFVTQ---LGLPKFSYCISGRDSSGVLLF-GDLHLSWLGN- 1172
Query: 262 KRIKYTPLLKN----PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+ YTPL++ P + Y V L IRVG +++ +P + T T++DSGT
Sbjct: 1173 --LTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGT 1230
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNL------TVTSLGGFDTCYSVPI-----VAPTITLM 365
FT L+ P YTA+R+ F + L G D CYSV P+++LM
Sbjct: 1231 QFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLM 1290
Query: 366 FSGMNVTLPQDNLL-----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
F G + + + LL + + CL + D + VI + QQN + +D+
Sbjct: 1291 FRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNS-DLLGIEAFVIGHHHQQNVWMEFDL 1349
Query: 421 PNSRLGVARELC 432
+ A +LC
Sbjct: 1350 ----VAFAADLC 1357
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 151/386 (39%), Gaps = 55/386 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ-----------STTFK 145
Y V GTP Q L DT + W PCT CS F S++ K
Sbjct: 132 YSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVK 191
Query: 146 NLGCQAAQCKQVPNPT--------------CGGGACAFNLTYGSSTIAANLSQDTISLAT 191
+GC+ +C + P C + L YGS A L +T+ L
Sbjct: 192 VVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATAGILLSETLDLEN 251
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSF 249
VP + GC + P G+ G GRG SL +Q + FS+CL S F
Sbjct: 252 KRVPDFLVGCSVMSVHQ---PAGIAGFGRGPESLPSQMR---LKRFSHCLVSRGFDDSPV 305
Query: 250 SGSLRLGPIGQPKRIK-----YTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGA 299
S L L + K Y P +NP S+ YY++L I +G + V P
Sbjct: 306 SSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKY 365
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR---RVGSNLTVTSLGGFDTCYSVP 356
L + T G IIDSG+ FT L P + A+ D + + V + G C+++P
Sbjct: 366 LVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNIP 425
Query: 357 IVA-----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM---AAAPDNVNSVLNVIA 407
P + L F G ++L +N L T + CL M A ++
Sbjct: 426 KEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGGGGPAIILG 485
Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
QQQN + YD+ R+G ++ CT
Sbjct: 486 AFQQQNVLVEYDLAKQRIGFRKQKCT 511
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 153/356 (42%), Gaps = 34/356 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y++R +GTP+ L DT +D +W+ CT C C + +F+ QS+T+ ++ C++
Sbjct: 88 YLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQP 147
Query: 154 CKQVPNP--TCGGGA-CAFNLTYGSSTIA-ANLSQDTISLATD-------IVPGYTFGCI 202
C P CG C + YG+ + L DTIS ++ P FGC
Sbjct: 148 CTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGCA 207
Query: 203 QKATGN---SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
+ S G +GLG G LSL +Q + FSYC+ F + S +G L+ G +
Sbjct: 208 FYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTS-TGKLKFGSMA 266
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
+ TP + NP S Y +NL I VG++ V G IIDS + T
Sbjct: 267 PTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV--------LTGQIGGNIIIDSVPILT 318
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP--IVAPTITLMFSGMNVTLPQDN 377
L YT + + + + F+ C P + P F+G +V L N
Sbjct: 319 HLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLNFPEFVFHFTGADVVLGPKN 378
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ I + ++ C+ + + +++ N Q N ++ YD+ ++ A C+
Sbjct: 379 MFI-ALDNNLVCMTVVPSKG-----ISIFGNWAQVNFQVEYDLGEKKVSFAPTNCS 428
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 123/439 (28%), Positives = 179/439 (40%), Gaps = 69/439 (15%)
Query: 57 SVLEMLAKDQARLQFLSSLAVARKS----------------VVPIASG--RQITQ----- 93
SVLE+ +D R+Q L +A+K+ P+AS Q Q
Sbjct: 85 SVLELQIRDLTRIQTLHKRVLAKKNQNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVATL 144
Query: 94 -------SPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTT 143
S Y + +G+P + + +DT +D W+ PC C + ++ S +
Sbjct: 145 ESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASAS 204
Query: 144 FKNLGCQAAQCKQV--PNP----TCGGGACAFNLTYGSS----------TIAANLSQDTI 187
+KN+ C +C V P+P +C + YG S T NL+
Sbjct: 205 YKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGG 264
Query: 188 SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
S V FGC G GLLGLGRG LS +Q Q+LY +FSYCL +
Sbjct: 265 SSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSD 324
Query: 248 SFSGSLRLGPIGQPKRIKYTPLL--------KNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
+ S + G+ K + P L K + YYV + +I V V++IP
Sbjct: 325 TNVSSKLI--FGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEET 382
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD-VFRRRVGSNLTVTSLGGFDTCYSV--- 355
+ GTIIDSGT + PAY +++ + + G D C++V
Sbjct: 383 WNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGI 442
Query: 356 -PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
I P + + F+ G P +N I + CLA+ P S ++I N QQQN
Sbjct: 443 DSIQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAILGTP---KSAFSIIGNYQQQN 498
Query: 414 HRILYDVPNSRLGVARELC 432
ILYD SRLG A C
Sbjct: 499 FHILYDTKRSRLGYAPTKC 517
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 166/386 (43%), Gaps = 56/386 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST--------VFNSAQSTTFK 145
Y V GTP+QT+ DT + +PCT C GC + F S++ K
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149
Query: 146 NLGCQAAQCKQV--PNPTCGG----------GACAFNLTYGSSTIAANLSQDTISLATDI 193
+GCQ+ +C+ + PN C G G + L YG + A L + +
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLT 209
Query: 194 VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSG 251
VP + GC +T P G+ G GRG +SL +Q NL + FS+CL S F + +
Sbjct: 210 VPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQ-MNLKR--FSHCLVSRRFDDTNVTT 263
Query: 252 SLRL------GPIGQPKRIKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGAL 300
L L + + YTP KNP S+ YY+NL I VGR+ V IP L
Sbjct: 264 DLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYL 323
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV- 355
G+I+DSG+ FT + P + V + F ++ SN T + G C+++
Sbjct: 324 APGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM-SNYTREKDLEKETGLGPCFNIS 382
Query: 356 ---PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAA----PDNVNSVLNVIA 407
+ P + F G + LP N CL + + P ++
Sbjct: 383 GKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILG 442
Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
+ QQQN+ + YD+ N R G A++ C+
Sbjct: 443 SFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 134/448 (29%), Positives = 200/448 (44%), Gaps = 60/448 (13%)
Query: 10 AFLFLFSLSEGLNPI--CDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQA 67
A L LFS + G PI D+ D+ + V + + + + + + LA+D A
Sbjct: 50 AALPLFSAASGEAPILELDSDDNGNASTVRFLLAH-REAFAAPNATAAQLLAHRLARDAA 108
Query: 68 RLQFLSSLA--VARKS---VVPIASGRQITQ-SPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
R + +S A V R P+ SG + Q S Y +GTP L+ +DT +D
Sbjct: 109 RAEAISVSARNVTRAGGGFSAPVVSG--LAQGSGEYFASVGVGTPPTPALLVLDTGSDVV 166
Query: 122 WV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTY 173
W+ PC C S VF+ +S ++ + C A C+ + GG C + + Y
Sbjct: 167 WLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAY 226
Query: 174 GSSTI-AANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN 231
G ++ A +L+ +T+ A VP GC G V GLLGLGRG LSL QT
Sbjct: 227 GDGSVTAGDLATETLWFARGARVPRVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTAR 286
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
Y FSYC F GS + + ++ R+ +V G R
Sbjct: 287 RYGRRFSYC--------FQGS----------DLDHRTII----RTVHQHVG------GAR 318
Query: 292 VVDIPPGALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS-NLTVTSLGGF 349
V + +L+ +P+TG G I+DSGT TRL P Y AVR+ FR G L F
Sbjct: 319 VRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGGFSLF 378
Query: 350 DTCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
DTCY + + PT+++ + G V LP +N LI CLA+A V +
Sbjct: 379 DTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDGGV----S 434
Query: 405 VIANMQQQNHRILYDVPNSRLGVARELC 432
++ N+QQQ R+++D R+ + + C
Sbjct: 435 IVGNIQQQGFRVVFDGDRQRVALVPKSC 462
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 170/364 (46%), Gaps = 39/364 (10%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-Q 156
IV IGTP QT M +DT + +W+ C T F+ S++F L C + CK +
Sbjct: 79 IVSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSFSVLPCNHSLCKPR 138
Query: 157 VPNPT----CGGGA-CAFNLTYGSSTIA-ANLSQDTISL-ATDIVPGYTFGCIQKATGNS 209
VP+ T C C ++ Y T A NL ++ + ++ P GC ++
Sbjct: 139 VPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPLILGCATDSSDT- 197
Query: 210 VPPQGLLG--LGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS---GSLRLGPIGQPKRI 264
QG+LG LGR S S LA+ S FSYC+P ++ S S GS LGP
Sbjct: 198 ---QGILGMNLGRLSFSSLAKI-----SKFSYCVPPRRSQSGSSPTGSFYLGPNPSSAGF 249
Query: 265 KYTPLL---KNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
KY L+ ++ R +L Y + +L IR+ + ++I A + +P+ T+IDSGT
Sbjct: 250 KYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLIDSGTW 309
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSL--GGFDTCY--SVPIVAPTITLMF----SGM 369
FT LV AY+ V++ + G L + G D C+ ++ I M +G+
Sbjct: 310 FTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAFEFENGV 369
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+ + ++ +L G + CL + + D + N+I N QQ+ + +D+ R+G R
Sbjct: 370 EIVVEREKMLA-DVGGGVQCLGIGRS-DLLGVASNIIGNFHQQDLWVEFDLVGRRVGFGR 427
Query: 430 ELCT 433
C+
Sbjct: 428 TDCS 431
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 112/406 (27%), Positives = 161/406 (39%), Gaps = 73/406 (17%)
Query: 97 YIVRAKIGT-PAQTLLMAMDTSNDAAWVPCT--GCVGCSSTV------------FNSAQS 141
Y + +G+ P Q + + MDT +D W PC C+ C S+ S
Sbjct: 73 YTLSFNLGSHPPQPISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSAS 132
Query: 142 TTFKNLGCQAAQCKQVPNPTCGGGACAFNL----------------TYGSSTIAANLSQD 185
+ K+ C AA + C C L YG ++ A L +D
Sbjct: 133 VSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLYRD 192
Query: 186 TISLATD---IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSY 239
++S+ ++ +TFGC A G P G+ G GRG LSL AQ + + FSY
Sbjct: 193 SLSMPASSPLVLHNFTFGCAHTALGE---PVGVAGFGRGVLSLPAQLASFSPHLGNQFSY 249
Query: 240 CL--PSFKA--LSFSGSLRLGPIG----QPKRIK-------YTPLLKNPRRSSLYYVNLL 284
CL SF A + L LG + KR+ YT +L NP+ Y V L
Sbjct: 250 CLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLE 309
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-- 342
I VG R + +P + + G ++DSGT FT L A Y ++ F R+G
Sbjct: 310 GITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRA 369
Query: 343 --VTSLGGFDTCYSVPIVA---PTITLMFSGMN-VTLPQDNLLIHSTAG--------SIT 388
+ G CY A P + L F G + V LP++N G +
Sbjct: 370 TQIEERTGLGPCYYSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVG 429
Query: 389 CLAMAAAPDNVNS--VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CL + D S + N QQQ ++YD+ R+G AR C
Sbjct: 430 CLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 153/354 (43%), Gaps = 29/354 (8%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--------FNSAQSTT 143
T + Y++ +GTP Q + +D ++D W+ C+ C C + F + S+T
Sbjct: 92 TNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSST 151
Query: 144 FKNLGCQAAQCKQVPNPTCGGGA--CAFNLTYG---SSTIAANLSQDTISLATDIVPGYT 198
+ + C C+++ TC C ++ YG ++T A L+ D + AT G
Sbjct: 152 IREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVI 211
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
FGC G+ G++GLGRG LS ++Q Q FSY L A+ +
Sbjct: 212 FGCAVATEGDI---GGVIGLGRGELSPVSQLQ---IGRFSYYLAPDDAVDVGSFILFLDD 265
Query: 259 GQPK--RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+P+ R TPL+ + SLYYV L IRV + IP G G ++
Sbjct: 266 AKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITI 325
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV- 371
T L A AY VR ++ S G D CY+ +A P++ L+F+G V
Sbjct: 326 PVTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVM 385
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
L N + + CL + +P S+L ++ Q ++YD+ SRL
Sbjct: 386 ELEMGNYFYMDSTTGLECLTILPSPAGDGSLLG---SLIQVGTHMIYDISGSRL 436
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 105/422 (24%), Positives = 169/422 (40%), Gaps = 48/422 (11%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAK------- 102
K L E + + + +AR LS + IA R+ + P VRA
Sbjct: 41 KELPKRELIRRAMQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVL 100
Query: 103 ---IGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQ 156
+GTP Q + +DT +D W C C C +F+ S++++ + C C
Sbjct: 101 DLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGD 160
Query: 157 VPNPTC-GGGACAFNLTYGSSTIA------ANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+ + +C C + +YG T + + S T VP FGC G+
Sbjct: 161 ILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSL 219
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-------PK 262
G++G GR LSL++Q L FSYCL + A S +L+ G +
Sbjct: 220 NNASGIVGFGRDPLSLVSQ---LSIRRFSYCLTPY-ASSRKSTLQFGSLADVGLYDDATG 275
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
++ TP+L++ + + YYV + VG R + IP A P G IIDSGT T
Sbjct: 276 PVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFP 335
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------------SVPIVAPTITLMFSGMN 370
A V FR ++ S C+ + + P + F G +
Sbjct: 336 AAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGAD 395
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+ LP++N ++ C+ + + D+ + I N QQ+ R++YD+ L A
Sbjct: 396 LDLPRENYVLEDHRRGHLCVLLGDSGDDGAT----IGNFVQQDMRVVYDLERETLSFAPV 451
Query: 431 LC 432
C
Sbjct: 452 EC 453
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 115/442 (26%), Positives = 177/442 (40%), Gaps = 72/442 (16%)
Query: 57 SVLEMLAKDQARLQFLSSLAVAR----------------KSVVPIASGRQITQSPTYIVR 100
S+ ++ D+ R+ F++S R +P+ SG T Y VR
Sbjct: 42 SLADLARSDRQRMAFIASHGRRRTRETAAGSSSASSAAAAFAMPLTSG-AYTGIGQYFVR 100
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV------------FNSAQSTTFKNLG 148
++GTPAQ L+ DT +D WV C +S++ F S T+ +
Sbjct: 101 FRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTWAPIS 160
Query: 149 CQAAQC-KQVPN--PTC--GGGACAFNLTYGSSTIA---ANLSQDTISLA-----TDIVP 195
C + C K +P TC G CA++ Y + A TI+L+ +
Sbjct: 161 CASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERKAKLK 220
Query: 196 GYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
G GC TG S G+L LG +S + + + FSYCL + + L
Sbjct: 221 GLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRNATSYL 280
Query: 254 RLGP---IGQPK-----------RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA 299
GP + P+ R + TPLL + R Y V+L AI V + IP
Sbjct: 281 TFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLKIPRAV 340
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS----- 354
++ G G I+DSGT T L PAY AV + + + L ++ F+ CY+
Sbjct: 341 --WDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGL-AGLPRVTMDPFEYCYNWTSPS 397
Query: 355 ---VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
+ P + + F+G P + A + C+ + P ++VI N+ Q
Sbjct: 398 GKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGP---WPGISVIGNILQ 454
Query: 412 QNHRILYDVPNSRLGVARELCT 433
Q H +D+ N RL R CT
Sbjct: 455 QEHLWEFDIKNRRLKFQRSRCT 476
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 162/384 (42%), Gaps = 40/384 (10%)
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
V + SG + S Y + +G+P + + +DT +D W+ C C C + ++
Sbjct: 156 VATLESGMTLG-SGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDP 214
Query: 139 AQSTTFKNLGCQAAQCKQVPNP------TCGGGACAFNLTYGSS----------TIAANL 182
S ++KN+ C +C V +P +C + YG S T NL
Sbjct: 215 KASASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNL 274
Query: 183 SQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP 242
+ + S V FGC G GLLGLGRG LS +Q Q+LY +FSYCL
Sbjct: 275 TTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 334
Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLL--------KNPRRSSLYYVNLLAIRVGRRVVD 294
+ + S + G+ K + P L K + YYV + +I V V++
Sbjct: 335 DRNSDTNVSSKLI--FGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLN 392
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD-VFRRRVGSNLTVTSLGGFDTCY 353
IP + GTIIDSGT + PAY +++ + + G D C+
Sbjct: 393 IPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCF 452
Query: 354 SVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
+V + P + + F+ G P +N I + CLAM P S ++I N
Sbjct: 453 NVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTP---KSAFSIIGN 508
Query: 409 MQQQNHRILYDVPNSRLGVARELC 432
QQQN ILYD SRLG A C
Sbjct: 509 YQQQNFHILYDTKRSRLGYAPTKC 532
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 170/366 (46%), Gaps = 39/366 (10%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPC----TGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
IV IGTP QT M +DT + +W+ C +T F+ + S++F L C
Sbjct: 81 IVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPL 140
Query: 154 CK-QVPN----PTCGGGA-CAFNLTYGSSTIA-ANLSQDTISLAT-DIVPGYTFGCIQKA 205
CK ++P+ TC C ++ Y T A +L ++ I+ ++ P GC + +
Sbjct: 141 CKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLILGCAEAS 200
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA---LSFSGSLRLGPIGQPK 262
T +G+LG+ G S +Q + S FSYC+P+ +A LS +GS LG
Sbjct: 201 TDE----KGILGMNLGRRSFASQAK---ISKFSYCVPTRQARAGLSSTGSFYLGNNPNSG 253
Query: 263 RIKY------TPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
R +Y TP ++P L Y + + IR+G ++I + +P+ TIIDSG
Sbjct: 254 RFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDSG 313
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYS------VPIVAPTITLMFS 367
+ FT LV AY VR+ R VG L + G D C+ ++ +
Sbjct: 314 SEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDMCFDGNPMEIGRLIGNMVFEFEK 373
Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
G+ + + + +L G + C+ + + + + + N+I N QQN + YD+ N R+G+
Sbjct: 374 GVEIVIDKWRVLA-DVGGGVHCIGIGRS-EMLGAASNIIGNFHQQNLWVEYDLANRRIGL 431
Query: 428 ARELCT 433
+ C+
Sbjct: 432 GKADCS 437
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 173/370 (46%), Gaps = 42/370 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y + +G P + L+ +DT +D W+ PC C S VF+ +QST+FK + C AA
Sbjct: 87 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 146
Query: 154 CKQVPNPTCGGGA-------CAFNLTYG-SSTIAANLSQDTISLATDIVPG------YTF 199
C V + C + C + YG SS + +L+ +++S++ P
Sbjct: 147 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 206
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQN--LYQSTFSYCL-PSFKALSFSGSLRLG 256
GC G GLLGLG+G+LS +Q ++ + QS FSYCL LS S ++ G
Sbjct: 207 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQS-FSYCLVDRTNNLSVSSAISFG 265
Query: 257 PIGQPKR----IKYTPLLK-NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
R +K+TP ++ N + YY+ + I++ + ++ IP GTI
Sbjct: 266 AGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGTI 325
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------SVPIVAPTITLM 365
IDSGT T L AY AV F R+ S CY +VP P ++++
Sbjct: 326 IDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRAAVPF--PALSIV 382
Query: 366 F-SGMNVTLPQDNLLIH-STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
F +G + LPQ+N I + CLA+ +++I N QQQN LYDV ++
Sbjct: 383 FQNGAELDLPQENYFIQPDPQEAKHCLAILPT-----DGMSIIGNFQQQNIHFLYDVQHA 437
Query: 424 RLGVARELCT 433
RLG A C+
Sbjct: 438 RLGFANTDCS 447
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 155/385 (40%), Gaps = 55/385 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----------AQSTTFK 145
Y + +GTP QT +DT + W PCT CS F + S+T K
Sbjct: 92 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAK 151
Query: 146 NLGCQAAQCKQV--------------PNPTCGGGACAFNLTYGSSTIAANLSQDTISLAT 191
LGC+ +C + + C A+ + YG + A L D ++
Sbjct: 152 LLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGSTAGFLLLDNLNFPG 211
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSG 251
VP + GC + P G+ G GRG SL +Q NL + FSYCL S +
Sbjct: 212 KTVPQFLVGCSILSIRQ---PSGIAGFGRGQESLPSQ-MNLKR--FSYCLVSHRFDDTPQ 265
Query: 252 S----LRLGPIGQPKR--IKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGAL 300
S L++ G K + YTP NP ++ YY+ L + VG + V IP L
Sbjct: 266 SSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFL 325
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSVP 356
+ GTI+DSG+ FT + P Y V F +++ N + + G C+++
Sbjct: 326 EPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNIS 385
Query: 357 ----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLNVIA 407
+ P +T F G +T P N + CL + A P ++
Sbjct: 386 GVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILG 445
Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
N QQQN I YD+ N R G C
Sbjct: 446 NYQQQNFYIEYDLENERFGFGPRSC 470
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 109/438 (24%), Positives = 194/438 (44%), Gaps = 45/438 (10%)
Query: 9 LAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQAR 68
+ LFL S + + + +D+ T+++ H SP SP S ++ ++ L + R
Sbjct: 5 FSLLFLISTASVFSAVT-ARDYGFTVELIHRDSPKSPMYNSSETHFDR-IVNALRRSSHR 62
Query: 69 LQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PC 125
+ A PI Y+V +GTP +++ DT +D W PC
Sbjct: 63 NTVVLESDTAE---API-----FNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPC 114
Query: 126 TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN-PTCGGGA-CAFNLTYGSSTIA-ANL 182
+ C ++ +F+ ++STT+KN+ C + C + +C + C +++ YG + + NL
Sbjct: 115 SNCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNL 174
Query: 183 SQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQST 236
+ DT+++ + P GC G + G++GLGRG SL+ Q
Sbjct: 175 AVDTVTMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGK 234
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIK-----YTPLLKNPRRSSLYYVNLLAIRVGRR 291
FSYCL S + S +L G + TP+ + + + Y + L A+ VG
Sbjct: 235 FSYCLIPIGTGSTNDSTKLN-FGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDT 293
Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--- 348
+ P GA + + IIDSGT T L +A+ + F + ++++
Sbjct: 294 KFNFPEGASKLGGES--NIIIDSGTTLTYLP----SALLNSFGSAISQSMSLPHAQDPSE 347
Query: 349 -FDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
D C++ P +T+ F G +V L ++NL + + +I CLA + PD+ +
Sbjct: 348 FLDYCFATTTDDYEMPPVTMHFEGADVPLQRENLFVRLSDDTI-CLAFGSFPDD---NIF 403
Query: 405 VIANMQQQNHRILYDVPN 422
+ N+ Q N + YD+ N
Sbjct: 404 IYGNIAQSNFLVGYDIKN 421
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 104/394 (26%), Positives = 174/394 (44%), Gaps = 34/394 (8%)
Query: 57 SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
SV A + + L+ A +VVPI TQ+ Y+ IGTP Q +D
Sbjct: 15 SVTARAAAFRVHGRLLADAATEGGAVVPI----HWTQAMNYVANFTIGTPPQPASAVIDL 70
Query: 117 SNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPT--CGGGACAFNL 171
+ + W C C C + +F+ S T++ C C+ +P+ + C G CA+
Sbjct: 71 AGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPSDSRNCSGNVCAYQA 130
Query: 172 TYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQ 230
+ + + DT ++ T FGC+ + +++ P G++GLGR SL+ QT
Sbjct: 131 STNAGDTGGKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTG 189
Query: 231 NLYQSTFSYCLPSFK-----ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY-VNLL 284
+ FSYCL AL S +L G+ + + N S YY V L
Sbjct: 190 ---VAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLE 246
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
++ G ++ +PP +G+ ++D+ + + LV AY AV+ VG+ T
Sbjct: 247 GLKAGDAMIPLPP--------SGSTVLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMAT 298
Query: 345 SLGGFDTCY---SVPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM-AAAPDNV 399
+ FD C+ AP + F G +T+ N L+ G++ CLAM ++A N
Sbjct: 299 PVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVAASNYLLDYKNGTV-CLAMLSSARLNS 357
Query: 400 NSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ L+++ ++QQ+N L+D+ L CT
Sbjct: 358 TTELSLLGSLQQENIHFLFDLDKETLSFEPADCT 391
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 155/386 (40%), Gaps = 57/386 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----------AQSTTFK 145
Y + +GTP QT +DT + W PCT CS F + S+T K
Sbjct: 88 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147
Query: 146 NLGCQ---------------AAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLA 190
LGC+ QCK+ + C ++ + YG A L D ++
Sbjct: 148 LLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNLNFP 207
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
VP + GC + P G+ G GRG SL +Q NL + FSYCL S +
Sbjct: 208 GKTVPQFLVGCSILSIRQ---PSGIAGFGRGQESLPSQ-MNLKR--FSYCLVSHRFDDTP 261
Query: 251 GS----LRLGPIGQPKR--IKYTPLLKNPRRSSL----YYVNLLAIRVGRRVVDIPPGAL 300
S L++ G K + YTP NP +S+ YYV L + VG V IP L
Sbjct: 262 QSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFL 321
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSVP 356
+ GTI+DSG+ FT + P Y V F R++G + V + G C+++
Sbjct: 322 EPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFNIS 381
Query: 357 ----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM-----AAAPDNVNSVLNVI 406
I P T F G ++ P N + C + A P + ++
Sbjct: 382 GVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGPAI-IL 440
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N QQQN + YD+ N R G C
Sbjct: 441 GNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 102/352 (28%), Positives = 154/352 (43%), Gaps = 31/352 (8%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
T + Y+ IGTP Q + A+D S+D W C ++ FN +STT ++ C
Sbjct: 95 TNAGMYVFSYGIGTPPQQVSGALDISSDLVWTACG-----ATAPFNPVRSTTVADVPCTD 149
Query: 152 AQCKQVPNPTCGGGA------CAFNLTYGSSTIAAN----LSQDTISLATDIVPGYTFGC 201
C+Q TCG GA CA+ YG AAN L + + + G FGC
Sbjct: 150 DACQQFAPQTCGAGAGAGSSECAYTYMYGGG--AANTTGLLGTEAFTFGDTRIDGVVFGC 207
Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
+ G+ G++GLGRG+LSL++Q Q FSY ++ + G P
Sbjct: 208 GLQNVGDFSGVSGVIGLGRGNLSLVSQLQ---VDRFSYHFAPDDSVDTQSFILFGDDATP 264
Query: 262 K--RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT-VF 318
+ T LL + SLYYV L I+V + + IP G G+G + S T +
Sbjct: 265 QTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLV 324
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV-TL 373
T L AY +R ++G S G D CY+ +A P++ L+F+G V L
Sbjct: 325 TVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMEL 384
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
N + + CL + + SVL ++ Q ++YD+ S+L
Sbjct: 385 ELGNYFYMDSTTGLACLTILPSSAGDGSVL---GSLIQVGTHMMYDINGSKL 433
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 134/419 (31%), Positives = 186/419 (44%), Gaps = 72/419 (17%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFL------------SSLAVA 78
S+ L++ H PC+P + S + SV + L DQ R +++ S A A
Sbjct: 65 SAVLRLTHRHGPCAPSRASSLAA--PSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---- 134
+ VP + G I + Y+V A +GTP M +DT +D +WV C C S
Sbjct: 123 AVATVPASWGYDI-GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQK 181
Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD 192
+F+ AQS+++ VP CGG CA Y + +
Sbjct: 182 DPLFDPAQSSSYA----------AVP---CGGPVCAGLGIY--------AASACSAAQCG 220
Query: 193 IVPGYTFGCIQKATG--NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS 250
V G+ FGC +G N V GLLGLGR SL+ QT Y FSYCLP+ S +
Sbjct: 221 AVQGFFFGCGHAQSGLFNGV--DGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKP--STA 276
Query: 251 GSLRL---GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
G L L GP G T LL +P + Y V L I VG + + +P A
Sbjct: 277 GYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGG---- 332
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP----IVAPT 361
T++D+GTV TRL AY A+R FR + S T S G DTCY+ + P
Sbjct: 333 --TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 390
Query: 362 ITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
+ L F SG VTL D +L S CLA AP + + ++ N+QQ++ + D
Sbjct: 391 VALTFGSGATVTLGADGIL------SFGCLAF--APSGSDGGMAILGNVQQRSFEVRID 441
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 111/349 (31%), Positives = 167/349 (47%), Gaps = 41/349 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQ 153
Y + +GTP QTL DT +D W C C C+ S + +S++F L C +A
Sbjct: 81 YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSAL 140
Query: 154 CKQVPN---PTCGG-----GACAFNLTYGSSTIAANLSQ-----DTISLATDIVPGYTFG 200
C+ + + TCGG C++ +YG S+ + +Q +T +L +D V G FG
Sbjct: 141 CRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGFG 200
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
C + G GL+GLGRG LSL+ Q L FSYCL S + S G +
Sbjct: 201 CTTMSEGGYGSGSGLVGLGRGKLSLVRQ---LKVGAFSYCLTSDPSTSSPLLFGAGALTG 257
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTVFT 319
P ++ TPL+ N + S+ Y VNL +I +G P TG G I DSGT T
Sbjct: 258 PG-VQSTPLV-NLKTSTFYTVNLDSISIGAAKT----------PGTGRHGIIFDSGTTLT 305
Query: 320 RLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY--SVPIVAPTITLMFSGMNVTLPQD 376
L PAYT + +NLT V G++ C+ S V P++ L F G ++ L +
Sbjct: 306 FLAEPAYTLAEAGLLSQT-TNLTRVPGTDGYEVCFQTSGGAVFPSMVLHFDGGDMALKTE 364
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
N + S++C + +P S ++++ N+ Q ++ I YD+ S L
Sbjct: 365 NYF-GAVNDSVSCWLVQKSP----SEMSIVGNIMQMDYHIRYDLDKSVL 408
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 76/248 (30%), Positives = 116/248 (46%), Gaps = 19/248 (7%)
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-----KALSFSGS 252
TFGC + G G++G+ G LS+L Q L + FSYCL F + F
Sbjct: 25 TFGCGKLTNGTIAGASGIMGVSPGPLSVLKQ---LSITKFSYCLTPFTDHKTSPVMFGAM 81
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
LG +++ PLLKNP YYV ++ I +G + +D+P L P GT++
Sbjct: 82 ADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGSKRLDVPEAILALRPDGTGGTVL 141
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITLM 365
DS T LV PA+ ++ + S+ + C+ +P + P + L
Sbjct: 142 DSATTLAYLVEPAFKELKKAVMEGMKLPAANRSIDDYPVCFELPRGMSMEGVQVPPLVLH 201
Query: 366 FSG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F+G ++LP+D+ + G + CLA+ AP NVI N+QQQN +LYD+ N +
Sbjct: 202 FAGDAEMSLPRDSYFQEPSPG-MMCLAVMQAP--FEGAPNVIGNVQQQNMHVLYDLGNRK 258
Query: 425 LGVARELC 432
A C
Sbjct: 259 FSYAPTKC 266
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 109/398 (27%), Positives = 159/398 (39%), Gaps = 77/398 (19%)
Query: 106 PAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVFNSAQST-------TFKNLGCQAAQC-- 154
P Q + + +DT +D W PC C+ C N+ ST T +++ C+++ C
Sbjct: 92 PPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACSA 151
Query: 155 ------------------KQVPNPTCGGGAC-AFNLTYGSSTIAANLSQDTISLATDI-- 193
+ + C +C +F YG ++ A L D+I L
Sbjct: 152 AHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATPS 211
Query: 194 --VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSYCLPSFKALS 248
+ +TFGC A P G+ G GRG LSL AQ + + FSYCL S S
Sbjct: 212 LSLHNFTFGCAHTALAE---PVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSH---S 265
Query: 249 F-SGSLRL-GPI------GQPKRIK-------YTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
F S LRL P+ + KR+ YT +L NP+ Y V L I +G++ +
Sbjct: 266 FNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKI 325
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL----TVTSLGGF 349
P + + G ++DSGT FT L A Y +V F RVG V G
Sbjct: 326 PAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTGL 385
Query: 350 DTCYSVPIVA--PTITLMFSG--MNVTLPQDNLLIHSTAGS--------ITCLAMAAAPD 397
CY V P++ L F G +V LP+ N G + CL + +
Sbjct: 386 GPCYYYDTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGE 445
Query: 398 NVNSVLN---VIANMQQQNHRILYDVPNSRLGVARELC 432
+ N QQ ++YD+ R+G AR C
Sbjct: 446 EAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 483
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 116/425 (27%), Positives = 201/425 (47%), Gaps = 51/425 (12%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG-- 88
S L + + + PCS K S ++ L+ D++R++ +++ + + S G
Sbjct: 61 SQGLPITYSYGPCSQLGQKKSPSRQQIFLQ----DRSRVRSINARILGQYSTEESKDGGS 116
Query: 89 ----RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GCSST---VFNSA 139
+ + ++V G P Q L + +DT +D W+ C C C + FN +
Sbjct: 117 PESMHSLNEDGFFLVNVGFGKPQQNLNLIIDTGSDTTWIRCNSCSLGNCHNKKIPTFNPS 176
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQ-DTISLATDIVPGYT 198
S+++ N C +P+ + + Y ++ + + D ++L D+ P +
Sbjct: 177 LSSSYSNRSC-------IPSTKTN-----YTMNYEDNSYSKGVFVCDEVTLKPDVFPKFQ 224
Query: 199 FGCIQKATGNSVPPQGLLGLGRGS-LSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP 257
FGC G+ G+LGL +G SL++QT + ++ FSYC P + + GSL G
Sbjct: 225 FGCGDSGGGDFGSASGVLGLAQGEQYSLISQTASKFKKKFSYCFPHNE--NTRGSLLFGE 282
Query: 258 --IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
I +K+T LL NP S+Y+V L+ I V ++ +++ +L +P GTIIDSG
Sbjct: 283 KAISASPSLKFTRLL-NPSSGSVYFVELIGISVAKKRLNVS-SSLFASP----GTIIDSG 336
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVT---SLGGFDTCYSVP------IVAPTITLMF 366
TV T L AY A+R F++ + +V+ DTCY++ I P I L F
Sbjct: 337 TVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHF 396
Query: 367 SG-MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G ++V+L +L G +T +A A + S + +I N QQ + +++YD+ RL
Sbjct: 397 VGEVDVSLHPSGILW--ANGDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRL 454
Query: 426 GVARE 430
G +
Sbjct: 455 GFGND 459
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 166/399 (41%), Gaps = 65/399 (16%)
Query: 97 YIVRAKIGTP--AQTLLMAMDTSNDAAWVPCTG-----CVG------------------- 130
Y + +G P A ++ + +DT +D W PC C G
Sbjct: 88 YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDSR 147
Query: 131 ---CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYGSSTIAANLSQ 184
C+S + ++A S+ + C AA+C + +C AC YG ++ ANL +
Sbjct: 148 RISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANLRR 207
Query: 185 DTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-- 241
+ LA + V +TF C A P G+ G GRG LSL AQ FSYCL
Sbjct: 208 GRVGLAASMAVENFTFACAHTALAE---PVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVA 264
Query: 242 PSFKA--LSFSGSLRLG------PIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
SF+A L S L LG IG + YTPLL NP+ Y V L A+ VG +
Sbjct: 265 HSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKR 324
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG----- 347
+ P + G ++DSGT FT L + + V D F R + + + G
Sbjct: 325 IQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQT 384
Query: 348 GFDTCYSV---PIVAPTITLMFSG-MNVTLPQDNLLI--HSTAG-SITCLAMAAAPDNVN 400
G CY P + L F G V LP+ N + S G S+ CL + N +
Sbjct: 385 GLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNND 444
Query: 401 SVLN------VIANMQQQNHRILYDVPNSRLGVARELCT 433
+ + N QQQ ++YDV R+G AR CT
Sbjct: 445 DGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/422 (24%), Positives = 168/422 (39%), Gaps = 48/422 (11%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAK------- 102
K L E + + + +AR LS + IA R+ + P VRA
Sbjct: 41 KELPKRELIRRAMQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVL 100
Query: 103 ---IGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQ 156
+GTP Q + +DT +D W C C C +F+ S++++ + C C
Sbjct: 101 DLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGD 160
Query: 157 VPNPTC-GGGACAFNLTYGSSTIA------ANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+ + +C C + +YG T + + S T VP FGC G+
Sbjct: 161 ILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSL 219
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-------PK 262
G++G GR LSL++Q L FSYCL + A S +L+ G +
Sbjct: 220 NNASGIVGFGRDPLSLVSQ---LSIRRFSYCLTPY-ASSRKSTLQFGSLADVGLYDDATG 275
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
++ TP+L++ + + YYV + VG R + IP A P G IIDSGT T
Sbjct: 276 PVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFP 335
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------------SVPIVAPTITLMFSGMN 370
V FR ++ S C+ + + P + F G +
Sbjct: 336 VAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGAD 395
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+ LP++N ++ C+ + + D+ + I N QQ+ R++YD+ L A
Sbjct: 396 LDLPRENYVLEDHRRGHLCVLLGDSGDDGAT----IGNFVQQDMRVVYDLERETLSFAPV 451
Query: 431 LC 432
C
Sbjct: 452 EC 453
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 78/249 (31%), Positives = 128/249 (51%), Gaps = 20/249 (8%)
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
FGC + G+ V GL+GL G++SL++Q L FSYCL F S L G +
Sbjct: 96 FGCGALSAGSLVGASGLMGLSPGTMSLISQ---LSVPRFSYCLTPFAERKTSPML-FGAM 151
Query: 259 GQPKR------IKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
++ I+ T +L+NP + YY V L+ + +G + + +P +L NP GTI
Sbjct: 152 ADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINPDGTGGTI 211
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITL 364
+DSG+ L A+ AV+ V + ++ ++ C++VP + P + L
Sbjct: 212 VDSGSTMAHLAGKAFDAVKKAVLEAVKLPVFNGTVEDYELCFAVPSGVAMAAVKTPPLVL 271
Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
F G + LP+DN AG + CLA+A +P+++ + +++I N+QQQN +L+DV N
Sbjct: 272 HFDGGAAMALPRDNYFQEPRAG-LMCLAVARSPEDLGAPISIIGNVQQQNMHVLFDVHNQ 330
Query: 424 RLGVARELC 432
+ A C
Sbjct: 331 KFSFAPTKC 339
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 162/381 (42%), Gaps = 62/381 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y+V+ GTP A+DT++D W+ C CV C VFN S+++ + C +
Sbjct: 92 YLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDT 151
Query: 154 CKQVPNPTC---GGGACAFNLTY-GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN- 208
C Q+ C GAC + Y G L+ D +++ D+ FGC + G
Sbjct: 152 CAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGP 211
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP-----KR 263
+ GL+GLGRG LSL++Q L F YCLP + + SG L LG R
Sbjct: 212 AAQASGLVGLGRGPLSLVSQ---LSVHRFMYCLPPPMSRT-SGKLVLGAGADAVRNMSDR 267
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT----------------- 306
+ T + + R S YY+NL + VG D PG + N T+
Sbjct: 268 VTVT-MSSSTRYPSYYYLNLDGLAVG----DQTPGTTR-NATSPPSGGAGGGGGGGGGGI 321
Query: 307 -------GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG-GFDTCYSVP-- 356
G I+D + + L Y + D + SL G D C+ +P
Sbjct: 322 VGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEIRLPRATPSLRLGLDLCFILPEG 381
Query: 357 -----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
+ PT++L F G + L +D L + T G + CL + S ++++ N Q
Sbjct: 382 VGMDRVYVPTVSLSFDGRWLELDRDRLFV--TDGRMMCLMIGR-----TSGVSILGNFQL 434
Query: 412 QNHRILYDVPNSRLGVARELC 432
QN R+L+++ ++ A+ C
Sbjct: 435 QNMRVLFNLRRGKITFAKASC 455
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 106/400 (26%), Positives = 173/400 (43%), Gaps = 62/400 (15%)
Query: 76 AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST- 134
A RK+VV A + + Y+V+ IGTP A+DT++D W+ C CV C
Sbjct: 69 ARNRKAVVGEAP--LVPRGGEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQL 126
Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTIS 188
+FN S+++ + C + C Q+ C AC +N Y G++ L+ D ++
Sbjct: 127 DPIFNPRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLA 186
Query: 189 LATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
+ ++ GC + G PPQ GL+GL RG LSLL+Q L F YCLP +
Sbjct: 187 VGGNVFHAVVLGCSDSSVGGP-PPQASGLVGLARGPLSLLSQ---LSVRRFMYCLPPPMS 242
Query: 247 LSFSGSLRLGPIGQPKRIKYTP------LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
+ G L LG ++ + + R S YY+N + VG D PG +
Sbjct: 243 RT-PGKLVLGAGAGADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVG----DQTPGTI 297
Query: 301 QFNPTT--------------------GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-S 339
+ PT+ G I+D + + L A Y + D +
Sbjct: 298 R-RPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEIRLP 356
Query: 340 NLTVTSLGGFDTCYSVP-------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM 392
T ++ G D C+ +P + PT+++ F G + L +D L + G + CL +
Sbjct: 357 RATPSTRLGLDLCFILPEGVGIDRVYVPTVSMSFDGRWLELERDRLFLED--GRMMCLMI 414
Query: 393 AAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S ++++ N QQQN +LY++ ++ A+ C
Sbjct: 415 GR-----TSGVSILGNYQQQNMHVLYNLRRGKITFAKASC 449
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 119/432 (27%), Positives = 182/432 (42%), Gaps = 63/432 (14%)
Query: 51 PLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT------QSPTYIVRAKIG 104
PLS S L+ + L LSSL+ AR P ++T Y V +G
Sbjct: 24 PLSISPSALDKW--ESINLAALSSLSRARHLKRPPTLTGKVTLPAYPRSYGGYSVIFSLG 81
Query: 105 TPAQTLLMAMDTSNDAAWVPCT------GCVGCSST--------VFNSAQSTTFKNLGCQ 150
TP Q + + +DT + W PCT C C+ + ++ +S+T ++L C+
Sbjct: 82 TPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCR 141
Query: 151 AAQCKQV----PNPTCGGGACAFNLTYGSSTIAANLSQDTISLAT-DIVPGYTFGCIQKA 205
+ +C V N + + L YG + L D + L+ + +P + FGC +
Sbjct: 142 SPKCNWVFGSDLNCSTTKRCPYYGLEYGLGSTTGQLVSDVLGLSKLNRIPDFLFGC---S 198
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSL-----RLGPI 258
++ P+G+ G GRG S+ AQ L + FSYCL S F SG L R
Sbjct: 199 LVSNRQPEGIAGFGRGLASIPAQ---LGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHAD 255
Query: 259 GQPKRIKYTPLLKNPR---RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
+ Y P K+P S YY++L I VG + V IPP L + G I+DSG
Sbjct: 256 AAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSG 315
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLT-------VTSLGGFDTCYSV----PIVAPTITL 364
+ FT + + D R + ++T + G CY++ + P +T
Sbjct: 316 STFTFMER----IIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVPKLTF 371
Query: 365 MFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDV 420
F G N+ LP + T G + C+ + PD S ++ N QQQN I YD+
Sbjct: 372 SFKGGANMDLPLTDYFSLVTDG-VVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEYDL 430
Query: 421 PNSRLGVARELC 432
R G + C
Sbjct: 431 KKQRFGFKPQQC 442
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 109/354 (30%), Positives = 149/354 (42%), Gaps = 42/354 (11%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC---KQVPN 159
+GTP + + ++ N+ W C F + TF G A C K PN
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSR-GLPFASCGSPKFWPN 59
Query: 160 PTCGGGACAFNLTYGSSTIAAN-LSQDTISL--ATDIVPGYTFGCIQKATGNSVPPQ-GL 215
TC + +YG ++ L D + A VPG FGC G + G+
Sbjct: 60 QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNETGI 114
Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYC-------LPSFKALSFSGSLRLGPIGQPKRIKYTP 268
G GRG LSL +Q L FS+C +PS L L G ++ TP
Sbjct: 115 AGFGRGPLSLPSQ---LKVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQG---AVQTTP 168
Query: 269 LL---KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
L+ KN +LYY++L I VG + +P A TG GTIIDSGT T L
Sbjct: 169 LIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTG-GTIIDSGTSITSLPPQV 227
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIH 381
Y VRD F ++ + + G TC+S P A P + L F G + LP++N +
Sbjct: 228 YQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFE 287
Query: 382 ---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
SI CLA+ N +I N QQQN +LYD+ N+ L C
Sbjct: 288 VPDDAGNSIICLAI-----NKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 157/370 (42%), Gaps = 47/370 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-----CVGCSSTVFNSAQSTTFKNLGCQA 151
YI +G P Q +DT + W CT CV FN++ S +F + CQ
Sbjct: 86 YIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQD 145
Query: 152 AQCKQVPNPTCG-GGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV 210
C C G C F +TYG+ I L D + + FGC+ T +
Sbjct: 146 KACAGNYLHFCALDGTCTFRVTYGAGGIIGFLGTDAFTFQSGGAT-LAFGCV-SFTRFAA 203
Query: 211 P-----PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPI----GQ 260
P GL+GLGRG LSL +QT FSYCL P F S L +G G
Sbjct: 204 PDVLHGASGLIGLGRGRLSLASQTG---AKRFSYCLTPYFHNNGASSHLFVGAAASLSGG 260
Query: 261 PKRIKYTPLLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNPTT----GAGTIID 313
+ +++P+ S+ YY+ L+ I VG + IP A G IID
Sbjct: 261 GGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIID 320
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLT---VTSLGGFDTCYS---VPIVAPTITLMFS 367
SG+ FT LV AY + R++ +L GG C + + V PT+ L FS
Sbjct: 321 SGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPTLVLHFS 380
Query: 368 -GMNVTLPQDNL---LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G ++ LP +N L STA C+A+ + ++I N QQQN IL+DV
Sbjct: 381 GGADMALPPENYWAPLEKSTA----CMAIVRG-----YLQSIIGNFQQQNMHILFDVGGG 431
Query: 424 RLGVARELCT 433
RL C+
Sbjct: 432 RLSFQNADCS 441
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 179/382 (46%), Gaps = 53/382 (13%)
Query: 88 GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTF 144
GR+ + Y K+G+P Q ++ +DT ++ W+ C C C+ T++++A+S ++
Sbjct: 94 GRKFGE---YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSY 150
Query: 145 KNLGCQAAQ-CKQVPNPT---CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI----- 193
K + C +Q C T C G+ C F YG + + +LS DT+ + T +
Sbjct: 151 KPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210
Query: 194 -VPGYTFGCIQKA-----TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA- 246
V + FGC Q TG S G+LGL G ++L Q + FS+C P +
Sbjct: 211 TVQDFAFGCAQGDLELVPTGAS----GILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSH 266
Query: 247 LSFSGSLRLGPIGQP-KRIKYT--PLLKNPRRSSLYYVNLLAIRVG-RRVVDIPPGALQF 302
L+ +G + G P ++++YT L + + Y+V L + + +V +P G++
Sbjct: 267 LNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSV-- 324
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF-RRRVGS--NLTVTSLGGFDTCYSVP--- 356
I+DSG+ F+ V P ++ +R+ F + R S +L S G TC+ V
Sbjct: 325 -------VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDD 377
Query: 357 -----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
P+++L+F G+ + +P +L+ A D + +NVI N Q
Sbjct: 378 IDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQ 437
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQN + YD+ SR+G AR C
Sbjct: 438 QQNLWVEYDIQRSRVGFARASC 459
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 166/399 (41%), Gaps = 65/399 (16%)
Query: 97 YIVRAKIGTP--AQTLLMAMDTSNDAAWVPCTG-----CVG------------------- 130
Y + +G P A ++ + +DT +D W PC C G
Sbjct: 88 YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDSR 147
Query: 131 ---CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FNLTYGSSTIAANLSQ 184
C+S + ++A S+ + C AA+C + +C AC YG ++ ANL +
Sbjct: 148 RISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANLRR 207
Query: 185 DTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-- 241
+ LA + V +TF C A P G+ G GRG LSL AQ FSYCL
Sbjct: 208 GRVGLAASMAVENFTFACAHTALAE---PVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVA 264
Query: 242 PSFKA--LSFSGSLRLG------PIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
SF+A L S L LG IG + YTPLL NP+ Y V L A+ VG +
Sbjct: 265 HSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKR 324
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG----- 347
+ P + G ++DSGT FT L + + V D F R + + + G
Sbjct: 325 IQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQT 384
Query: 348 GFDTCYSV---PIVAPTITLMFSG-MNVTLPQDNLLI--HSTAG-SITCLAMAAAPDNVN 400
G CY P + L F G V LP+ N + S G S+ CL + N +
Sbjct: 385 GLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNND 444
Query: 401 SVLN------VIANMQQQNHRILYDVPNSRLGVARELCT 433
+ + N QQQ ++YDV R+G AR CT
Sbjct: 445 DGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 167/374 (44%), Gaps = 50/374 (13%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV----FNSAQSTTFKNLGCQAAQC 154
+ +GTP Q + M +DT ++ +W+ C ++T+ FN S+++ + C + C
Sbjct: 68 ISITVGTPPQNMSMVIDTGSELSWLHCN--TNTTATIPYPFFNPNISSSYTPISCSSPTC 125
Query: 155 ----KQVPNP-TC-GGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATG 207
+ P P +C C L+Y +S+ NL+ DT + PG FGC+ +
Sbjct: 126 TTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNPGIVFGCMNSSYS 185
Query: 208 NSVPPQ----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQP 261
+ GL+G+ GSLSL++Q L FSYC+ FSG L LG
Sbjct: 186 TNSESDSNTTGLMGMNLGSLSLVSQ---LKIPKFSYCI---SGSDFSGILLLGESNFSWG 239
Query: 262 KRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG-TIIDSG 315
+ YTPL++ S Y V L I++ ++++I G L TGAG T+ D G
Sbjct: 240 GSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNIS-GNLFVPDHTGAGQTMFDLG 298
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF------DTCYSVPI------VAPTIT 363
T F+ L+ P Y A+RD F + L F D CY VP+ P+++
Sbjct: 299 TQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVNQSELPELPSVS 358
Query: 364 LMFSGMNVTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
L+F G + + D LL S+ C + D + +I + QQ+ + +
Sbjct: 359 LVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNS-DLLGVEAFIIGHHHQQSMWMEF 417
Query: 419 DVPNSRLGVARELC 432
D+ R+G+A C
Sbjct: 418 DLVEHRVGLAHARC 431
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 152/336 (45%), Gaps = 41/336 (12%)
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GCS---STVFNSAQSTTFKNLGCQAAQCK 155
A GT A T + +D+ +D +WV C C C +F+ A STT+ + C +A C
Sbjct: 68 APDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACA 127
Query: 156 QVPNPTCGGGA---CAFNLTYGS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGNSV 210
Q+ G A C F + YG ST S D ++L D++ G+ FGC G++
Sbjct: 128 QLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAF 187
Query: 211 PPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY-- 266
G L LG GS SL+ QT Y FSYCLP S G L LG P+R +
Sbjct: 188 DYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLP--PTASSLGFLVLGV--PPERAQLIP 243
Query: 267 ----TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
TPLL + + Y V L AI V R + +PP A ++IDS T+ +RL
Sbjct: 244 SFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFS------ASSVIDSSTIISRLP 297
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDN 377
AY A+R FR + + DTCY I P+I L+F G V L
Sbjct: 298 PTAYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAG 357
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+L+ S CLA AP + + I N+QQ+
Sbjct: 358 ILLGS------CLAF--APTASDRMPGFIGNVQQKT 385
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 72/231 (31%), Positives = 97/231 (41%), Gaps = 32/231 (13%)
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTP-LLKN 272
G + R L L TQ Y FSYC+P + S G + LG P+R P +
Sbjct: 419 GPYDVDRQGLPLRTATQ--YGRVFSYCIP--PSPSSLGFITLGV--PPQRAALVPTFVST 472
Query: 273 PRRSS------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
P SS Y V L AI V R + +PP ++I S TV +RL AY
Sbjct: 473 PLLSSSSMPPTFYRVLLRAIIVAGRPLPVPPTVFS------TSSVIASTTVISRLPPTAY 526
Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIH 381
A+R FRR + T + DTCY I P+I L+F G V L +L+
Sbjct: 527 QALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLQ 586
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA AP + + I N+QQ+ ++YDVP + C
Sbjct: 587 G------CLAF--APTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 107/349 (30%), Positives = 151/349 (43%), Gaps = 42/349 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPC---TGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y + IGTP Q L DT +D W C G S+ ++ S+TF L C
Sbjct: 100 YDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRL 159
Query: 154 CKQVPNPT-----CGGGACAFNLTYGSST----IAANLSQDTISLATDIVPGYTFGCIQK 204
C + + + GG C + YG L +T +L D VPG FGC
Sbjct: 160 CAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVGFGCTTA 219
Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI----GQ 260
G+ GL+GLGRG LSL++Q L TF YCL + S + L G + G
Sbjct: 220 LEGDYGEGAGLVGLGRGPLSLVSQ---LDAGTFMYCLTA--DASKASPLLFGALATMTGA 274
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
++ T LL + ++ Y VNL +I +G A G + DSGT T
Sbjct: 275 GAGVQSTGLLAS---TTFYAVNLRSITIGS--------ATTAGVGGPGGVVFDSGTTLTY 323
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA---PTITLMFS-GMNVTLPQD 376
L PAYT + F + S V GF+ CY P A P + L F G ++ LP
Sbjct: 324 LAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYEKPDSARLIPAMVLHFDGGADMALPVA 383
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
N ++ G + C + +P L++I N+ Q N+ +L+DV S L
Sbjct: 384 NYVVEVDDG-VVCWVVQRSPS-----LSIIGNIMQMNYLVLHDVRKSVL 426
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 115/415 (27%), Positives = 171/415 (41%), Gaps = 62/415 (14%)
Query: 70 QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV 129
+ LSS A A +VV S Y V GTP+QT+ DT + W PCT
Sbjct: 65 EALSSTATASATVV--KSHLSPKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRY 122
Query: 130 GCSSTVFNS-----------AQSTTFKNLGCQAAQCKQV--PNPTCGGGACAFN------ 170
CS F+ S++ + +GCQ +C+ + N C G C N
Sbjct: 123 LCSDCNFSGLDPTQIPRFIPKNSSSSRVIGCQNPKCQFLFGANVQCRG--CDPNTRNCTV 180
Query: 171 ------LTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
L YG + A L + + VP + GC +T P G+ G GRG S
Sbjct: 181 PCPPYILQYGLGSTAGILISEKLDFPDLTVPDFVVGCSVISTRT---PAGIAGFGRGPES 237
Query: 225 LLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIGQPKR------IKYTPLLKNPRRS 276
L +Q + +FS+CL S F + + L L K + YTP KNP S
Sbjct: 238 LPSQMK---LKSFSHCLVSRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVS 294
Query: 277 S-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
+ YY+NL I VG + V IP L G+I+DSG+ FT + P + V +
Sbjct: 295 NTAFLEYYYLNLRRIYVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAE 354
Query: 332 VFRRRVGSNLT----VTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHS 382
F ++ SN T + + G C+++ + P + F G + LP N
Sbjct: 355 EFATQM-SNYTREKDLEKVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFV 413
Query: 383 TAGSITCLAMAAA----PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
CL + + P ++ + QQQN+ + YD+ N R G A++ C+
Sbjct: 414 GNADTVCLTVVSDNTVNPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 104/382 (27%), Positives = 179/382 (46%), Gaps = 53/382 (13%)
Query: 88 GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTF 144
GR+ + Y K+G+P Q ++ +DT ++ W+ C C C+ T++++A+S ++
Sbjct: 94 GRKFGE---YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASY 150
Query: 145 KNLGCQAAQ-CKQVPNPT---CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI----- 193
+ + C +Q C T C G+ C F YG + + +LS DT+ + T +
Sbjct: 151 RPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210
Query: 194 -VPGYTFGCIQKA-----TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA- 246
V + FGC Q TG S G+LGL G ++L Q + FS+C P +
Sbjct: 211 TVQDFAFGCAQGDLELVPTGAS----GILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSH 266
Query: 247 LSFSGSLRLGPIGQP-KRIKYT--PLLKNPRRSSLYYVNLLAIRVG-RRVVDIPPGALQF 302
L+ +G + G P ++++YT L + + Y+V L + + +V +P G++
Sbjct: 267 LNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSV-- 324
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF-RRRVGS--NLTVTSLGGFDTCYSVP--- 356
I+DSG+ F+ V P ++ +R+ F + R S +L S G TC+ V
Sbjct: 325 -------VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDD 377
Query: 357 -----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
P+++L+F G+ + +P +L+ A D + +NVI N Q
Sbjct: 378 IDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPVNVIGNYQ 437
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQN + YD+ SR+G AR C
Sbjct: 438 QQNLWVEYDIQRSRVGFARASC 459
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 103/328 (31%), Positives = 152/328 (46%), Gaps = 43/328 (13%)
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDI 193
F A S+TF L C ++ C+ + +P TC C + YG A L+ +T+ +
Sbjct: 96 FQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMGFTAGYLATETLHVGGAS 155
Query: 194 VPGYTFGC-IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
PG FGC + GNS G++GLGR LSL++Q FSYCL S A +
Sbjct: 156 FPGVAFGCSTENGVGNS--SSGIVGLGRSPLSLVSQVG---VGRFSYCLRS-DADAGDSP 209
Query: 253 LRLGPIGQPKRIKYTP-LLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA- 308
+ G + + K +P +L+NP SS YYVNL I VG D+P + F T GA
Sbjct: 210 ILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVG--ATDLPVTSTTFGFTRGAG 267
Query: 309 -----GTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLG---GFDTCYS----- 354
GTI+DSGT T LV Y V+ F ++ +NLT T G GFD C+
Sbjct: 268 AGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDANAAG 327
Query: 355 ----VPIVAPTITLMFSGMNVTLPQDNLLIHSTA------GSITCLAMAAAPDNVNSVLN 404
VP+ PT+ L F+G + + ++ CL + A + ++ ++
Sbjct: 328 GGSGVPV--PTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEKLS--IS 383
Query: 405 VIANMQQQNHRILYDVPNSRLGVARELC 432
+I N+ Q + +LYD+ A C
Sbjct: 384 IIGNVMQMDLHVLYDLDGGMFSFAPADC 411
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 107/369 (28%), Positives = 164/369 (44%), Gaps = 53/369 (14%)
Query: 114 MDTSNDAAWVPCT---GCVGC-----SSTVFNSAQSTTFKNLGCQAAQCK-------QVP 158
MDT +D WVPCT C+ C S+ VF S++ + C + CK ++
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60
Query: 159 NPTCGGG--ACA-----FNLTYGSSTIAANLSQDTISLATDIVPG------YTFGCIQKA 205
+C G C+ + + YG + A L +T++L + G + GC +
Sbjct: 61 CQSCAGSLKNCSETCPPYGIQYGRGSTAGLLLTETLNLPLENGEGARAITHFAVGC---S 117
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQ-TQNLYQSTFSYCLPS--FKALSFSGSLRLGPIGQPK 262
+S P G+ G GRG+LS+ +Q +++ + F+YCL S F + + LG P
Sbjct: 118 IVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDKALPN 177
Query: 263 RI--KYTPLLKNPR------RSSLYYVNLLAIRVG-RRVVDIPPGALQFNPTTGAGTIID 313
I YTP L N R YY+ L + +G +R+ +P L+F+ GTIID
Sbjct: 178 NIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGGTIID 237
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNL--TVTSLGGFDTCYSVP----IVAPTITLMFS 367
SGT FT + + F ++G V G CY V IV P F
Sbjct: 238 SGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDVTGLENIVLPEFAFHFK 297
Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAMAAAPD--NVNSVLNVI-ANMQQQNHRILYDVPNS 423
G ++ LP N + ++ CL M ++ V+S VI N QQQ+ +LYD +
Sbjct: 298 GGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFYLLYDREKN 357
Query: 424 RLGVARELC 432
RLG ++ C
Sbjct: 358 RLGFTQQTC 366
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 162/380 (42%), Gaps = 53/380 (13%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQC---- 154
V +GTP Q + M +DT ++ + + C G FN++ S T+ + C + C
Sbjct: 67 VSVVVGTPPQNVTMVLDTGSELSGLLCNGSSLSPPAPFNASASLTYSAVDCSSPACVWRG 126
Query: 155 KQVP-NPTCGG---GACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCI------- 202
+ +P P C +C +++Y ++ A +L DT L T VP FGCI
Sbjct: 127 RDLPVRPFCDAPPSTSCRVSISYADASSADGHLVADTFILGTQAVPAL-FGCITSYSSST 185
Query: 203 ---QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
AT S GLLG+ RGSLS + QT L F+YC+ + G
Sbjct: 186 AINSSATDPSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPGQGPGILLLGGDGGAA 242
Query: 260 QPKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
P + YTPL++ + Y V L IRVG ++ IP L + T T++DS
Sbjct: 243 PP--LNYTPLIEISQPLPYFDRVAYSVQLEGIRVGSALLQIPKSVLTPDHTGAGQTMVDS 300
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLT------VTSLGGFDTCYSVPI--------VAP 360
GT FT L+A AY A++ F + S L G FD C+ P + P
Sbjct: 301 GTQFTFLLADAYAALKAEFLNQARSLLAPLGEPGFVFQGAFDACFRGPEERVSAASRLLP 360
Query: 361 TITLMFSGMNVTLPQDNLLI--------HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
+ L+ G V + + LL A ++ CL + D VI + QQ
Sbjct: 361 EVGLVLRGAEVAVAGEKLLYSVPGERRGEEGAEAVWCLTFGNS-DMAGMSAYVIGHHHQQ 419
Query: 413 NHRILYDVPNSRLGVARELC 432
+ + YD+ N R+G A C
Sbjct: 420 DVWVEYDLQNGRVGFAPARC 439
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 110/423 (26%), Positives = 184/423 (43%), Gaps = 42/423 (9%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
T+ + H SP SPF S S + + ++ LQF + A I S R
Sbjct: 27 TIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNRG-- 84
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGC 149
Y++ IGTP +L DT +D W C C C +S +F+ +S+T++ + C
Sbjct: 85 ---EYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSC 141
Query: 150 QAAQCKQVPNPTCG--GGACAFNLTYG-SSTIAANLSQDTISLATD-----IVPGYTFGC 201
++QC+ + + +C C++ +TYG +S +++ DT+++ + + GC
Sbjct: 142 SSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGC 201
Query: 202 IQKATGNSVPPQGLLGLGRGSL-SLLAQTQNLYQSTFSYCLPSFKALS-FSGSLRLGPIG 259
+ TG P + G SL++Q + FSYCL F + + + + G G
Sbjct: 202 GHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNG 261
Query: 260 ---QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT---TGAGTI-I 312
+ + + K+P ++ Y++NL AI VG + +QF T TG G I I
Sbjct: 262 IVSGDGVVSTSMVKKDP--ATYYFLNLEAISVGSK-------KIQFTSTIFGTGEGNIVI 312
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPTITLMFSGMN 370
DSGT T L + Y + V + + G CY S P IT+ F G +
Sbjct: 313 DSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSSFKVPDITVHFKGGD 372
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
V L N + + + ++C A AA N L + N+ Q N + YD + + +
Sbjct: 373 VKLGNLNTFV-AVSEDVSCFAFAA-----NEQLTIFGNLAQMNFLVGYDTVSGTVSFKKT 426
Query: 431 LCT 433
C+
Sbjct: 427 DCS 429
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 126/457 (27%), Positives = 187/457 (40%), Gaps = 54/457 (11%)
Query: 7 FFLAFLFLFSLSEGLNPICDTQDHSSTLQVF-----HVFSPCSPF-----KPSKPLSWEE 56
F FL L S S I + S TL F H SP SPF PS+ + +
Sbjct: 4 FVFCFLLLCSHS-----IASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERI--KN 56
Query: 57 SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDT 116
+VL A+ + RL+ LS + I IT+ Y++R IGTP DT
Sbjct: 57 TVLRSFARSKRRLR-LSQNDDRSPGTITIPD-EPITE---YLMRFYIGTPPVERFAIADT 111
Query: 117 SNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP--NPTCGG--GACAF 169
+D WV PC CV ++ +F+ +S+TFK + C + C +P C G G C +
Sbjct: 112 GSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYY 171
Query: 170 NLTYGSSTIAAN-LSQDTISLATD----IVPGYTFGCI---QKATGNSVPPQGLLGLGRG 221
YG T+ + L ++I+ + P TFGC S GL+GLG G
Sbjct: 172 QYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVG 231
Query: 222 SLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK---YTPLLKNPRRSSL 278
LSL++Q FSYC P + S S +R G K+IK TPL+ S
Sbjct: 232 PLSLISQLGYQIGRKFSYCFPPLSSNSTS-KMRFGNDAIVKQIKGVVSTPLIIKSIGPSY 290
Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG 338
YY+NL + +G + V T +IDSGT FT L Y + + G
Sbjct: 291 YYLNLEGVSIGNKKVKTSESQ------TDGNILIDSGTSFTILKQSFYNKFVALVKEVYG 344
Query: 339 SNLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAA 395
++ C+ P + +F+G V + NL + ++ C+
Sbjct: 345 VEAVKIPPLVYNFCFENKGKRKRFPDVVFLFTGAKVRVDASNLF-EAEDNNLLCMVALPT 403
Query: 396 PDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
D +S+ N Q +++ YD+ + A C
Sbjct: 404 SDEDDSIF---GNHAQIGYQVEYDLQGGMVSFAPADC 437
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 120/465 (25%), Positives = 189/465 (40%), Gaps = 77/465 (16%)
Query: 26 DTQDHSSTLQVFHVFSPCSPFKPSK------PLSWEESVLEMLAKDQARLQFLSSL---- 75
D+++++++ F +F SP S+ P S + ++L D AR Q +SSL
Sbjct: 34 DSKNNNNSGVWFEMFHMHSPKLKSQSKFLGPPKSRLDGTRQLLQSDNARRQMISSLRHGT 93
Query: 76 -----AVARKSVVPIASGRQITQSPTYIVRAKIGTPA-QTLLMAMDTSNDAAWVPCT-GC 128
V+ + +PI SG QS Y V +IGTP Q ++ DT +D W+ C C
Sbjct: 94 RRKAFEVSHTAQIPIHSGADSGQS-QYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWC 152
Query: 129 VGCSS------TVFNSAQSTTFKNLGCQAAQCK----------QVPNPTCGGGACAFNLT 172
C VF + S++F+ + C + CK + PNP C F+
Sbjct: 153 KSCPKPNPHPGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPN---APCLFDYR 209
Query: 173 Y----------GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
Y + T+ L+ D++ GC + + P G++GLG
Sbjct: 210 YLNGPRAIGVFANETVTVGLNDHKKIRLFDVL----IGCTESFNETNGFPDGVMGLGYRK 265
Query: 223 LSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPK--RIKYTPLLKNPRRSSLY 279
SL + ++ + FSYCL + + L G I + K ++++T LL ++ Y
Sbjct: 266 HSLALRLAEIFGNKFSYCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLG-YINAFY 324
Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
VN+ I VG ++ I +N T G I+DSGT T L AY V D +
Sbjct: 325 PVNVSGISVGGSMLSISSDI--WNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDK 382
Query: 340 NLTVTSL------------GGFDTCYSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSI 387
+ V + GFD P + + F+ + P I A I
Sbjct: 383 HKKVVPIELPELNNFCFEDKGFDRA-----AVPRLLIHFADGAIFKPPVKSYIIDVAEGI 437
Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CL + A +S+L N+ QQNH YD+ +LG C
Sbjct: 438 KCLGIIKADFPGSSIL---GNVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 104/420 (24%), Positives = 175/420 (41%), Gaps = 32/420 (7%)
Query: 33 TLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
++ + H SP SPF PSK + E + + + +R+ A+ + R +
Sbjct: 33 SVDLIHRDSPHSPFFDPSK--TRTERLTDAFHRSASRVGRFRQSAMTSDGI----QSRLV 86
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLG 148
+ YI+ IGTP ++ +DT +D W C C C V F+ S+T+++
Sbjct: 87 PSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSS 146
Query: 149 CQAAQCKQVPNP-TC-GGGACAFNLTYGSSTI-AANLSQDTISLATDI-----VPGYTFG 200
C + C + N +C G C F +Y + NL+ +T+++A+ PG+ FG
Sbjct: 147 CGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFG 206
Query: 201 CIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC-LPSFKALSFSGSLRLGPI 258
C+ ++ G G++GLG LS+++Q ++ FSYC LP F S S + G
Sbjct: 207 CVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRS 266
Query: 259 G--QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
G TPL+ + Y + L VG++ + + + G I+DSGT
Sbjct: 267 GIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEG-NIIVDSGT 325
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---PIVAPTITLMFSGMNVTL 373
+T L Y + + + G CY+ I AP IT F NV L
Sbjct: 326 TYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQIDAPIITAHFKDANVEL 385
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
N + + C + D + ++ N+ Q N + +D+ R+ CT
Sbjct: 386 QPWNTFLRMQE-DLVCFTVLPTSD-----IGILGNLAQVNFLVGFDLRKKRVSFKAADCT 439
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 113/440 (25%), Positives = 185/440 (42%), Gaps = 38/440 (8%)
Query: 13 FLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQF 71
FLF L E + + ++ + H SP SPF PSK + E + + + +R+
Sbjct: 17 FLFQLLE----VALARGGGFSVDLIHRDSPHSPFFDPSK--TQAERLTDAFRRSVSRVGR 70
Query: 72 LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
A+ + R + + Y++ IGTP ++ +DT +D W C C C
Sbjct: 71 FRPTAMTSDGI----QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHC 126
Query: 132 SSTV---FNSAQSTTFKNLGCQAAQCKQV-PNPTCGG-GACAFNLTYGSSTI-AANLSQD 185
V F+ S+T+++ C + C + + +C C F +Y + NL+ +
Sbjct: 127 YKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASE 186
Query: 186 TISLATDI-----VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
T+++ + PG+ FGC + G G++GLG G LSL++Q ++ FSY
Sbjct: 187 TLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSY 246
Query: 240 C-LPSFKALSFSGSLRLGPIGQPKRIK--YTPLL-KNPRRSSLYYVNLLAIRVGRRVVDI 295
C LP S S + G G+ TPL+ K+P + YY+ L I VG++ +
Sbjct: 247 CLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSP--DTFYYLTLEGISVGKKRLPY 304
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-- 353
G + I+DSGT +T L Y+ + + G F CY
Sbjct: 305 -KGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNT 363
Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ I AP IT F NV L N + + C +A D + V+ N+ Q N
Sbjct: 364 TAEINAPIITAHFKDANVELQPLNTFMRMQE-DLVCFTVAPTSD-----IGVLGNLAQVN 417
Query: 414 HRILYDVPNSRLGVARELCT 433
+ +D+ R+ CT
Sbjct: 418 FLVGFDLRKKRVSFKAADCT 437
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 160/357 (44%), Gaps = 44/357 (12%)
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCS---STVFNSAQSTTFKNLGCQAAQCK 155
A GT A + + +D+ +D WV C C + C +F+ A STT+ + C +A C
Sbjct: 72 APDGTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACA 131
Query: 156 QVPNPTCGG----GACAFNLTYGS-STIAANLSQDTISLAT-DIVPGYTFGCIQKATGN- 208
++ P G C F +TY + +T S D ++L D+V G+ FGC G+
Sbjct: 132 RL-GPYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFLFGCAHADQGST 190
Query: 209 -SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY- 266
S G L LG GS S + QT + Y FSYC+P + S G + G P+R
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQYSRVFSYCVP--PSTSSFGFIMFGV--PPQRAALV 246
Query: 267 -----TPLLKNPRRSSLYYVNLL-AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
TPLL + S +Y LL +I V R + +PP A ++IDS TV +R
Sbjct: 247 PTFVSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFS------ASSVIDSATVISR 300
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQ 375
+ AY A+R FR + + DTCY I P+I L+F G V L
Sbjct: 301 IPPTAYQALRAAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDA 360
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+L+ CLA AP + + I N+QQ+ ++YDVP + C
Sbjct: 361 AGILLQG------CLAF--APTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 128/451 (28%), Positives = 193/451 (42%), Gaps = 50/451 (11%)
Query: 6 VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
+FF L L S S+ D+ T +FH S SP + S LS + + +
Sbjct: 7 IFFHLILLLISFSQ---TTIINGDNGFTTSLFHRDSLLSPLEFSS-LSHYDRLTNAFRRS 62
Query: 66 QARLQFLSSLAVARKSV---VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW 122
+R L + A ++ P+ G S Y++ IGTP + DT +D W
Sbjct: 63 LSRSATLLNRAATNGALDLQAPLTPG-----SGEYLMSVSIGTPPVDYIGMADTGSDLMW 117
Query: 123 VPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYGSSTI 178
C C+ C S +F+ +ST+F ++ C + CK + + CG G C ++ TYG T
Sbjct: 118 AQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTY 177
Query: 179 A-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL--YQS 235
+L + I++ + V GC ++ G G++GLG G LSL++Q
Sbjct: 178 TKGDLGFEKITIGSSSVKS-VIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISR 236
Query: 236 TFSYCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVG-- 289
FSYCLP+ + + +G + G + P + TPL+ KNP + YYV L AI +G
Sbjct: 237 RFSYCLPTLLSHA-NGKINFGQNAVVSGPGVVS-TPLISKNP--VTYYYVTLEAISIGNE 292
Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF 349
R + G + IIDSGT + L Y V + V + +
Sbjct: 293 RHMASAKQGNV----------IIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFW 342
Query: 350 DTCYSVPI-VA-----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV 402
D C+ I VA P IT FS G NV L N A ++ CL + P +
Sbjct: 343 DLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTF-QKVANNVNCLTL--TPASPTDE 399
Query: 403 LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+I N+ N I YD+ RL +CT
Sbjct: 400 FGIIGNLALANFLIGYDLEAKRLSFKPTVCT 430
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 161/384 (41%), Gaps = 55/384 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSS----TVFNSAQSTTFKNLGC 149
Y + + GTP+QT +DT + W+PC+ C C+S F S++ K +GC
Sbjct: 86 YSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGC 145
Query: 150 QAAQCKQVPNPTCGGGAC---------------AFNLTYGSSTIAANLSQDTISLATDIV 194
+C V P C A+ + YG + A L + ++ T
Sbjct: 146 TNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGSTAGFLLSENLNFPTKKY 205
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSG 251
+ GC + + P G+ G GRG SL +Q NL + FSYCL S + + + +
Sbjct: 206 SDFLLGC---SVVSVYQPAGIAGFGRGEESLPSQ-MNL--TRFSYCLLSHQFDDSATITS 259
Query: 252 SLRLGPI----GQPKRIKYTPLLKNPRRS------SLYYVNLLAIRVGRRVVDIPPGALQ 301
+L L G+ + YTP LKNP + YY+ L I VG + V +P L+
Sbjct: 260 NLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLE 319
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG--GFDTCYSVPIVA 359
N G I+DSG+ FT + P + V F ++V + G C+ + A
Sbjct: 320 PNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFGLSPCFVLAGGA 379
Query: 360 PTIT---LMFS---GMNVTLPQDNLLIHSTAGSITCLAM-----AAAPDNVNSVLNVIAN 408
T + L F G + LP N G + CL + A + V + ++ N
Sbjct: 380 ETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAV-ILGN 438
Query: 409 MQQQNHRILYDVPNSRLGVARELC 432
QQQN + YD+ N R G + C
Sbjct: 439 YQQQNFYVEYDLENERFGFRSQSC 462
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 163/351 (46%), Gaps = 38/351 (10%)
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GCSSTV---FNSAQSTTFKNLGCQAAQC 154
R+K+ QT+++ D+++D WV C C C V ++ ++S T C + C
Sbjct: 21 RSKLPGVIQTVVL--DSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTC 78
Query: 155 KQVPNP---TCGGGACAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN- 208
+ P C C + + Y GSST A ++ A + V G+ FGC G+
Sbjct: 79 TAL-GPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSF 137
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYT 267
G++ LG G SLL+QT + Y + FSYC+P+ S SG LG P R T
Sbjct: 138 DARAAGIMALGGGPESLLSQTASRYGNAFSYCIPA--TASDSGFFTLGVPRRASSRYVVT 195
Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
P+++ + ++ Y V L I VG + + + P AG+++DS T TRL AY
Sbjct: 196 PMVRFRQAATFYGVLLRTITVGGQRLGVAPAVF------AAGSVLDSRTAITRLPPTAYQ 249
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNVTLPQD--NLLIH 381
A+R FR + + G DTCY V I P I+L+F N LP D +L +
Sbjct: 250 ALRAAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD-RNAVLPLDPSGILFN 308
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA + D + + V+ ++QQQ +LYDV +G + C
Sbjct: 309 D------CLAFTSNAD--DRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 107/367 (29%), Positives = 158/367 (43%), Gaps = 68/367 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ 156
Y +G+P + + MDT +D WV C C S+ F+ S T+K L C
Sbjct: 3 YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASNTYKALTC------- 55
Query: 157 VPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLAT------DIVPGYTFGCIQKATGNS 209
A ++ YG + +LS DT+ +A + PG+ FGC G
Sbjct: 56 ---------ADDYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSLLKGLI 106
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI----------- 258
G+L L GSLS +Q Y + FSYCL A SL+ P+
Sbjct: 107 SGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTA---QNSLKKSPMVFGEAAVELKE 163
Query: 259 ---GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGAL---QFNPTTGAGTI 311
G+ + ++YTP+ SS+YY V L I VG + +D+ P A Q P TI
Sbjct: 164 PGSGKLQELQYTPI----GESSIYYTVRLDGISVGNQRLDLSPSAFLNGQDKP-----TI 214
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS 367
DSGT T L +++ V S ++ G D C+ VP + P IT F+
Sbjct: 215 FDSGTTLTMLPPGVCDSIKQSLASMV-SGAEFVAIKGLDACFRVPPSSGQGLPDITFHFN 273
Query: 368 GMN--VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G VT P + ++ GS+ CL P N +++ N+QQQ+ +L+D+ N R+
Sbjct: 274 GGADFVTRPSNYVI---DLGSLQCLIF--VPTN---EVSIFGNLQQQDFFVLHDMDNRRI 325
Query: 426 GVARELC 432
G C
Sbjct: 326 GFKETDC 332
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 118/415 (28%), Positives = 184/415 (44%), Gaps = 43/415 (10%)
Query: 51 PLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTL 110
P + E + E+ A D AR L V P+ Y + K+GTP +
Sbjct: 38 PPNHELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREF 97
Query: 111 LMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVPN 159
+ +DT +D WV CT C GC T F+ S++ + C +C Q +
Sbjct: 98 NVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES 157
Query: 160 PTCGGGACAFNLTYGSST------IAANLSQDTI---SLATDIVPGYTFGCIQKATGNSV 210
C+++ YG + I+ +S DT+ +LA + + FGC TG+
Sbjct: 158 GCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSAPFVFGCSNLQTGDLQ 217
Query: 211 PPQ----GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
P+ G+ GLG+GSLS+++Q Q L FS+CL K S G + LG I +P +
Sbjct: 218 RPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK--SGGGIMVLGQIKRPDTV 275
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
YTPL+ + Y VNL +I V +++ I P F TG GTIID+GT L
Sbjct: 276 -YTPLVPSQPH---YNVNLQSIAVNGQILPIDPSV--FTIATGDGTIIDTGTTLAYLPDE 329
Query: 325 AYTAVRDVFRRRV---GSNLTVTSLGGFDTCYSVPIVAPTITLMFSG--MNVTLPQDNLL 379
AY+ V G +T S F+ V P ++L F+G V P L
Sbjct: 330 AYSPFIQAIANAVSQYGRPITYESYQCFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQ 389
Query: 380 IHSTAG-SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
I S++G SI C+ + + ++ ++ ++ ++YD+ R+G A C+
Sbjct: 390 IFSSSGSSIWCIGFQRMS---HRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 125/261 (47%), Gaps = 23/261 (8%)
Query: 52 LSWEESVLEMLAKDQARLQFLS----SLAVARKSVVPIASGRQITQSPTYIVRAKIGTPA 107
L+ E + + + + RL + A ARK+VV A + Y+V+ IGTP
Sbjct: 42 LTEHELLRRAIQRSRYRLAGIGMARGEAASARKAVV--AETPIMPAGGEYLVKLGIGTPP 99
Query: 108 QTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
A+DT++D W C C GC V FN S+T+ L C + C ++ CG
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159
Query: 165 G---ACAFNLTY-GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ--GLLGL 218
+C + TY G++T L+ D + + D G FGC +TG + PPQ G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYT----PLLKNPR 274
GRG LSL++Q L F+YCLP A G L LG R P+ ++PR
Sbjct: 220 GRGPLSLVSQ---LSVRRFAYCLPP-PASRIPGKLVLGADADAARNATNRIAVPMRRDPR 275
Query: 275 RSSLYYVNLLAIRVGRRVVDI 295
S YY+NL + +G R + +
Sbjct: 276 YPSYYYLNLDGLLIGDRTMSL 296
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 106/346 (30%), Positives = 150/346 (43%), Gaps = 41/346 (11%)
Query: 109 TLLMAMDTSNDAAWVPCTGC-----VGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG 163
T M +DT++D WV C+ C +++ +S++ C + C Q+ P
Sbjct: 143 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GPYAN 201
Query: 164 G----GACAFNLTY--GSSTIAANLSQD-TISLATDIVPGYTFGCIQKATGN---SVPPQ 213
G C + + Y G+ST +S TI+ AT V + FGC G+
Sbjct: 202 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTITPAT-AVRSFQFGCSHGVQGSFSFGSSAA 260
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKN 272
G++ LG G SL++QT Y FS+C P F LG P R TP+LKN
Sbjct: 261 GIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGF---FTLGVPRVAAWRYVLTPMLKN 317
Query: 273 PR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
P + Y V L AI V + + +PP AG +DS T TRL AY A+R
Sbjct: 318 PAIPPTFYMVRLEAIAVAGQRIAVPPTVF------AAGAALDSRTAITRLPPTAYQALRQ 371
Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGS 386
FR R+ G DTCY + V P ITL+F V L +L
Sbjct: 372 AFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQG---- 427
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA A P+ + V +I N+Q Q +LY++P + +G C
Sbjct: 428 --CLAFTAGPN--DQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 106/346 (30%), Positives = 150/346 (43%), Gaps = 41/346 (11%)
Query: 109 TLLMAMDTSNDAAWVPCTGC-----VGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG 163
T M +DT++D WV C+ C +++ +S++ C + C Q+ P
Sbjct: 168 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GPYAN 226
Query: 164 G----GACAFNLTY--GSSTIAANLSQD-TISLATDIVPGYTFGCIQKATGN---SVPPQ 213
G C + + Y G+ST +S TI+ AT V + FGC G+
Sbjct: 227 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTITPAT-AVRSFQFGCSHGVQGSFSFGSSAA 285
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYTPLLKN 272
G++ LG G SL++QT Y FS+C P F LG P R TP+LKN
Sbjct: 286 GIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGF---FTLGVPRVAAWRYVLTPMLKN 342
Query: 273 PR-RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
P + Y V L AI V + + +PP AG +DS T TRL AY A+R
Sbjct: 343 PAIPPTFYMVRLEAIAVAGQRIAVPPTVF------AAGAALDSRTAITRLPPTAYQALRQ 396
Query: 332 VFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGS 386
FR R+ G DTCY + V P ITL+F V L +L
Sbjct: 397 AFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQG---- 452
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA A P+ + V +I N+Q Q +LY++P + +G C
Sbjct: 453 --CLAFTAGPN--DQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 111/427 (25%), Positives = 170/427 (39%), Gaps = 94/427 (22%)
Query: 21 LNPICDTQDHSSTLQVFHVFSPCSPFKPS----KPLSWEESVLEMLAKDQARLQFLSSLA 76
L I + D +S++ + H + PCSP P+ +P E + L D R +F S
Sbjct: 20 LATIPSSSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNG 79
Query: 77 VA-------RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC---- 125
A K VP G + + Y++ +G+PA T + +DT +D +WV C
Sbjct: 80 TAAGEDGQSSKVSVPTTLGSSL-DTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCP 138
Query: 126 --TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-----CAFNLTYGSSTI 178
+ C + +F+ A S+T+ C AA C Q+ + G C + + YG +
Sbjct: 139 APSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSN 198
Query: 179 AANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQST 236
G+ FGC G + + GL+GLG + SL++QT
Sbjct: 199 TTGT-------------GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTA------ 239
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
R K P + Y+ L I VG + + +
Sbjct: 240 -------------------------ARSKKVP--------TYYFAALEDIAVGGKKLGLS 266
Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV- 355
P AG+++DSGTV TRL AY A+ FR + LG DTC++
Sbjct: 267 PSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFT 320
Query: 356 ---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
+ PT+ L+F+G V +L H S CLA AP + I N+QQ+
Sbjct: 321 GLDKVSIPTVALVFAGGAVV----DLDAHGIV-SGGCLAF--APTRDDKAFGTIGNVQQR 373
Query: 413 NHRILYD 419
+LYD
Sbjct: 374 TFEVLYD 380
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 111/392 (28%), Positives = 163/392 (41%), Gaps = 64/392 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQ-----------STTFK 145
Y + +GTP+QT+ + MDT + W PCT C+S F + S++ K
Sbjct: 84 YSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSK 143
Query: 146 NLGCQAAQCKQV-----------PNP---TCGGGACAFNLTYGSSTIAANLSQDTISLAT 191
+GC+ +C V NP C + + YG + A L +TI+
Sbjct: 144 LIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGSTAGLLLSETINFPN 203
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK----AL 247
+ + GC +T P+G+ G GR SL Q L FSYCL S + +
Sbjct: 204 KTISDFLAGCSLLSTRQ---PEGIAGFGRSQESLPLQ---LGLKKFSYCLVSRRFDDSPV 257
Query: 248 SFSGSLRLGPIGQPKR---IKYTPLLK------NPRRSSLYYVNLLAIRVGRRVVDIPPG 298
S L +GP + + YTP K NP YYV L I VG+ V +P
Sbjct: 258 SSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYS 317
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS----LGGFDTCYS 354
L GTI+DSG+ FT + + + F +++ +N TV + L G C+
Sbjct: 318 FLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQM-ANYTVATNVQKLTGLRPCFD 376
Query: 355 V----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLN- 404
+ +V P +T F G + LP N G + CL + AAA V +
Sbjct: 377 ISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMG-VVCLTIVSDNAAALGGDGGVRSS 435
Query: 405 ----VIANMQQQNHRILYDVPNSRLGVARELC 432
++ N QQQN I YD+ N R G + C
Sbjct: 436 GPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/283 (31%), Positives = 136/283 (48%), Gaps = 29/283 (10%)
Query: 162 CGGGA--CAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
CG A C + + YG + L + + T +V + FGC + G GL+GL
Sbjct: 69 CGSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGL 128
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IKYTPLLKNPR 274
GR LSL++QT ++ FSYCLPS + SGSL LG R I Y +++NP+
Sbjct: 129 GRSDLSLISQTSGIFGGVFSYCLPSTERKG-SGSLILGGNSSVYRNSSPISYAKMIENPQ 187
Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI-IDSGTVFTRLVAPAYTAVRDVF 333
+ Y++NL I +G ALQ P+ G I +DSGTV TRL Y A++ F
Sbjct: 188 LYNFYFINLTGISIGGV-------ALQ-APSVGPSRILVDSGTVITRLPPTIYKALKAEF 239
Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQD----NLLIHSTAG 385
++ + DTC+++ + PTI + F G N L D + S A
Sbjct: 240 LKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEG-NAELTVDVTGVFYFVKSDAS 298
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ CLA+A+ + ++ N QQ+N R++YD ++ V+
Sbjct: 299 QV-CLALASL--EYQDEVAILGNYQQKNLRVIYDTKETKADVS 338
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/351 (30%), Positives = 163/351 (46%), Gaps = 38/351 (10%)
Query: 100 RAKIGTPAQTLLMAMDTSNDAAWVPCTGCV--GCSSTV---FNSAQSTTFKNLGCQAAQC 154
R+K+ QT+++ D+++D WV C C C V ++ ++S + C + C
Sbjct: 151 RSKLPGVIQTVVL--DSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTC 208
Query: 155 KQVPNP---TCGGGACAFNLTY--GSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN- 208
+ P C C + + Y GSST A ++ A + V G+ FGC G+
Sbjct: 209 TAL-GPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSF 267
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG-PIGQPKRIKYT 267
G++ LG G SLL+QT + Y + FSYC+P+ S SG LG P R T
Sbjct: 268 DARAAGIMALGGGPESLLSQTASRYGNAFSYCIPA--TASDSGFFTLGVPRRASSRYVVT 325
Query: 268 PLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYT 327
P+++ + ++ Y V L I VG + + + P AG+++DS T TRL AY
Sbjct: 326 PMVRFRQAATFYGVLLRTITVGGQRLGVAPAVF------AAGSVLDSRTAITRLPPTAYQ 379
Query: 328 AVRDVFRRRVGSNLTVTSLGGFDTCYS----VPIVAPTITLMFSGMNVTLPQD--NLLIH 381
A+R FR + + G DTCY V I P I+L+F N LP D +L +
Sbjct: 380 ALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD-RNAVLPLDPSGILFN 438
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA + D + + V+ ++QQQ +LYDV +G + C
Sbjct: 439 D------CLAFTSNAD--DRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 116/448 (25%), Positives = 185/448 (41%), Gaps = 46/448 (10%)
Query: 6 VFFLAFLFLFS--LSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
+F L F+ FS LS G ++++ H SP SP+ ++ V
Sbjct: 11 LFSLCFIASFSHALSNGF-----------SVELIHRDSPKSPYYKPTENKYQHFV----- 54
Query: 64 KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
D AR + + S I Y++ +GTP + DT +D W+
Sbjct: 55 -DAARRSINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWL 113
Query: 124 ---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYGSSTIA 179
PC C ++ +FN ++S+++KN+ C + C V + +C +C + ++YG S+ +
Sbjct: 114 QCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHS 173
Query: 180 -ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNL 232
+LS DT+SL + P GC G G++GLG G +SL+ Q +
Sbjct: 174 QGDLSVDTLSLESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSS 233
Query: 233 YQSTFSYCLPSF--KALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
FSYCL K + S L G + + TPL+K + Y++ L A V
Sbjct: 234 IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSV 291
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
G + V+ G IIDSGT T + + YT + V +
Sbjct: 292 GNKRVEF--GGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQ 349
Query: 349 FDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
F CYS+ P IT+ F G +V L + + T G I C A +P + ++
Sbjct: 350 FSLCYSLKSNEYDFPIITVHFKGADVELHSISTFVPITDG-IVCFAFQPSP----QLGSI 404
Query: 406 IANMQQQNHRILYDVPNSRLGVARELCT 433
N+ QQN + YD+ + CT
Sbjct: 405 FGNLAQQNLLVGYDLQQKTVSFKPTDCT 432
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/306 (32%), Positives = 136/306 (44%), Gaps = 29/306 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y+V IGTP Q + + +DT +D W C C C F+ + S+T C +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 154 CKQVPNPTCG------GGACAFNLTYGSSTIAAN-LSQDTISL--ATDIVPGYTFGCIQK 204
C+ +P +CG C + +YG ++ L D + A VPG FGC
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201
Query: 205 ATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
G + G+ G GRG LSL +Q L FS+C + L S L P K
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKPSTVLLDLPADLYKS 258
Query: 264 ----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
++ TPL++NP + YY++L I VG + +P TG GTIIDSGT T
Sbjct: 259 GRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTG-GTIIDSGTAMT 317
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPIVA----PTITLMFSGMNVTL 373
L Y VRD F +V L V S D C S P+ A P + L F G + L
Sbjct: 318 SLPTRVYRLVRDAFAAQV--KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDL 375
Query: 374 PQDNLL 379
P++N +
Sbjct: 376 PRENYV 381
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/411 (24%), Positives = 170/411 (41%), Gaps = 51/411 (12%)
Query: 69 LQFLSSLAVARKSVVPIASGRQITQSP-------TYIVRAKIGTPAQTLLMAMDTSNDAA 121
LQ L++ +++R + + Q+ + + GTP Q L MDT +
Sbjct: 52 LQHLATASMSRSHHLKHGKASPLIQTSLFPHSYGAHTIPLSFGTPPQKLSFLMDTGSHVV 111
Query: 122 WVPCT---GCVGCSST------VFNSAQSTTFKNLGCQAAQCKQV--PN-----PTCGGG 165
W PCT C CS + +FN S++ K LGC+ +C PB P C G
Sbjct: 112 WAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGXPRCNGN 171
Query: 166 ------AC-AFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
AC + L YG+ + + + + + GC A L G
Sbjct: 172 SKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSAD-REPSSDALAGF 230
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRR 275
GR SL Q + F+YCL S + SG L L G+ + + Y P KNP
Sbjct: 231 GRTMFSLPMQ---MGVKKFAYCLNSHDYDDTRNSGKLILDYSDGETQGLSYAPFXKNPPD 287
Query: 276 SSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
+ YY+ + +++G +V+ IP L + G +IDSG ++ + P + V + +
Sbjct: 288 YPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELK 347
Query: 335 RRVGS---NLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGS 386
+++ +L + + G CY+ I P + F+ G N+ +P N + + S
Sbjct: 348 KQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEAS 407
Query: 387 ITCLAMAAAPDNVNSVLN-----VIANMQQQNHRILYDVPNSRLGVARELC 432
+ C + N ++ N QQ +H + +D+ N RLG ++ C
Sbjct: 408 LGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 112/437 (25%), Positives = 186/437 (42%), Gaps = 49/437 (11%)
Query: 26 DTQDHSSTLQVFHVFSPCSPFKPSKPLSWEE--SVLEMLAKDQARLQFLSSLA---VARK 80
++Q+ ++++ H S SPF + + +V+ K L + SL+ + +
Sbjct: 21 ESQNRGFSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKP 80
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFN 137
+++P A Y++ IGTP L +DT +D W PC C+ +S +FN
Sbjct: 81 TIIPYAGSY-------YVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFN 133
Query: 138 SAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTISLATDI 193
++S+T+KN+ C + CK+ C C + +TY S ++S+DT++L ++
Sbjct: 134 PSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSND 193
Query: 194 -----VPGYTFGCIQKATGNSVPPQGL----LGLGRGSLSLLAQTQNLYQSTFSYCLPS- 243
P GC K NS+ +GL +G GRG+ S+++Q + FSYCL S
Sbjct: 194 GSPISFPKIVIGCGHK---NSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASL 250
Query: 244 FKALSFSGSLRLGPIG--QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
F + S L G + + TPL+++ + Y+ NL A VG ++ + +L
Sbjct: 251 FSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGN-YFTNLEAFSVGDHIIKLKDSSLI 309
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI---V 358
P +IDSG+ T+L Y+ + V CY +
Sbjct: 310 --PDNEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYE 367
Query: 359 APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNVIANMQQQNHRI 416
P IT F G +V L N I + C A +A P V N+ QQN +
Sbjct: 368 VPIITAHFRGADVKLNAFNTFIQMNH-EVMCFAFNSSAFP------WVVYGNIAQQNFLV 420
Query: 417 LYDVPNSRLGVARELCT 433
YD + + CT
Sbjct: 421 GYDTLKNIISFKPTNCT 437
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 114/443 (25%), Positives = 189/443 (42%), Gaps = 55/443 (12%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
T+++ H SP SP P L E +L+ A A L +S+ K+V+ +
Sbjct: 15 TMELIHKDSPQSPLYPGN-LPPGEQILQPAACPFAGLHHQTSMMSTNKAVMNRMMSPLTS 73
Query: 93 QSPTYIVRAKIG----------TPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVF------ 136
++ A++G T +T +DT N+ +W+ C GC + F
Sbjct: 74 YGDPFLFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPP 133
Query: 137 -NSAQSTTFKNLGC-QAAQCKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLATD- 192
S+QS ++K + C Q + C+ PN C G CA+N+TYG S + NL+ +T + ++
Sbjct: 134 YTSSQSKSYKPVSCNQHSFCE--PN-QCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNH 190
Query: 193 ----IVPGYTFGCIQKATG-------NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL 241
+ +FGC + + P G+LG+G G S LAQ ++ FSYC+
Sbjct: 191 GKHTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCI 250
Query: 242 PSFKALSFSGSLRLGP-IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
+ + + LR G + + K ++ T +++ + S+ Y+VNLL I V ++I L
Sbjct: 251 TANN--THNTYLRFGKHVVKSKNLQTTKIMQ-VKPSAAYHVNLLGISVNGVKLNITKTDL 307
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSVP 356
G IID+GT+ T LV P + + + SN V D CY
Sbjct: 308 AVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQL 367
Query: 357 IVA-----PTITLMFSGMNVTL-PQDNLLIHSTAG-SITCLAMAAAPDNVNSVLNVIANM 409
A P +T ++ + P+ L G ++ CL+M + +I
Sbjct: 368 SDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDDSKT-----IIGAY 422
Query: 410 QQQNHRILYDVPNSRLGVARELC 432
QQ + +YD L E C
Sbjct: 423 QQMKQKFVYDTKARVLSFGPEDC 445
>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
Length = 452
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 161/369 (43%), Gaps = 62/369 (16%)
Query: 122 WVPCTG---CVGCSST------VFNSAQSTTFKNLGCQ-------------AAQCKQVPN 159
WVPCT C CSS VF+ S++ + +GC+ A +C++ P
Sbjct: 85 WVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPC 144
Query: 160 -------PTCGGGACA-FNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVP 211
P C + + YGS + A L DT+ VPG+ GC + P
Sbjct: 145 SPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVSVHQ--P 202
Query: 212 PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSLRLGPIGQPKRIKYTP 268
P GL G GRG+ S+ AQ L FSYCL S + + SGSL LG G + ++Y P
Sbjct: 203 PSGLAGFGRGAPSVPAQ---LGLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVP 259
Query: 269 LLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL-- 321
L+K+ L YY+ L + VG + V +P A N GTI+DSGT FT L
Sbjct: 260 LVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDP 319
Query: 322 --VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNV-TL 373
P AV R + G C+++P + P ++ F G V L
Sbjct: 320 TVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGAVMQL 379
Query: 374 PQDNLLIHSTAGSITCLAMAAAPD--------NVNSVLNVI-ANMQQQNHRILYDVPNSR 424
P +N + + G++ + +A D N S +I + QQQN+ + YD+ R
Sbjct: 380 PVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKER 439
Query: 425 LGVARELCT 433
LG R+ CT
Sbjct: 440 LGFRRQSCT 448
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 157/385 (40%), Gaps = 58/385 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST---TF--------K 145
Y + GTP QT MDT + W PCT CS F + + T TF K
Sbjct: 83 YSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSK 142
Query: 146 NLGCQAAQCKQVPNP--------------TCGGGACAFNLTYGSSTIAANLSQDTISLAT 191
+GC+ +C + P C + + YGS + A L +T+
Sbjct: 143 LIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGSTAGLLLSETLDFPN 202
Query: 192 D-IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALS 248
+P + GC + + P+G+ G GR SL +Q L FSYCL S F
Sbjct: 203 KKTIPDFLVGC---SIFSIKQPEGIAGFGRSPESLPSQ---LGLKKFSYCLVSHAFDDTP 256
Query: 249 FSGSLRLGP-----IGQPKRIKYTPLLKNPRRS--SLYYVNLLAIRVGRRVVDIPPGALQ 301
S L L + + + +TP LKNP + YYV L I +G V +P L
Sbjct: 257 TSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLV 316
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT---SLGGFDTCYSV--- 355
GTI+DSGT FT + P Y V F +++ T +L G CY++
Sbjct: 317 PGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLRPCYNISGE 376
Query: 356 -PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN------VIA 407
+ P + F G + LP N +G I CL + + DNV ++
Sbjct: 377 KSLSVPDLIFQFKGGAKMALPLSNYFSIVDSGVI-CLTIVS--DNVAGPGLGGGPAIILG 433
Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
N QQ+N + +D+ N + G ++ C
Sbjct: 434 NYQQRNFYVEFDLENEKFGFKQQSC 458
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 121/426 (28%), Positives = 180/426 (42%), Gaps = 64/426 (15%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPA-QTLLMA 113
E + M+A+ +ARL L S A P+ G S Y++ IGTP Q +++
Sbjct: 52 HELLRRMVARSKARLASLRSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLH 111
Query: 114 MDTSNDAAWV--PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ-VPNPTCGGGA---- 166
+DT +D W CT C VF ++ S TF + C C V P G A
Sbjct: 112 LDTGSDLVWTQCACTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRS 171
Query: 167 CAFNLTYGSSTI-AANLSQDTISL-ATD------IVPGYTFGCIQKATGNSVPPQ-GLLG 217
C + Y +I +++DT + A D VP FGC G P Q G+ G
Sbjct: 172 CFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAG 231
Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI---GQPKRIKY-------- 266
G G LSL +Q L FSYC F A+ S R+ P+ G+P+ I+
Sbjct: 232 FGTGPLSLPSQ---LKVRRFSYC---FTAMEES---RVSPVILGGEPENIEAHATGPIQS 282
Query: 267 TPLLKNPRRSSL-----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
TP P + + Y+++L + VG + GT IDSGT T
Sbjct: 283 TPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFF 342
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDT-----CYSVPI-----VAPTITLMFSGMNV 371
+ ++R+ F +V L V G+ C+SVP P + L G +
Sbjct: 343 PQAVFRSLREAFVAQV--PLPVAK--GYTDPDNLLCFSVPAKKKAPAVPKLILHLEGADW 398
Query: 372 TLPQDNLLIH-----STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
LP++N ++ S AG C+ + +A NS +I N QQQN I+YD+ ++++
Sbjct: 399 ELPRENYVLDNDDDGSGAGRKLCVVILSAG---NSNGTIIGNFQQQNMHIVYDLESNKMV 455
Query: 427 VARELC 432
A C
Sbjct: 456 FAPARC 461
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/280 (34%), Positives = 138/280 (49%), Gaps = 35/280 (12%)
Query: 164 GGACAFNLTY--GSSTIAANLSQDTISLATD-IVPGYTFGCIQKATGNSVPPQGLLGLGR 220
G C F ++Y G+ST+ A SQD ++LA IV + FGC G+LGLGR
Sbjct: 34 GKQCGFAISYADGTSTVGA-YSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGR 92
Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYY 280
SL A+ Y FSYCLPS S G L LG P +TP+ P + +
Sbjct: 93 LRESLGAR----YGGVFSYCLPSVS--SKPGFLALGAGKNPSGFVFTPMGTVPGQPTFST 146
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
V L I VG + +D+ P A G I+DSGTV T L + AY A+R FR+ + +
Sbjct: 147 VTLAGINVGGKKLDLRPSAFS------GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAY 200
Query: 341 LTVTSLGGFDTCYSVP----IVAPTITLMFSG---MNVTLPQDNLLIHSTAGSITCLAMA 393
+ + G DTCY++ +V P I L F+G +N+ +P + +L++ CLA A
Sbjct: 201 RLLPN-GDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVP-NGILVNG------CLAFA 252
Query: 394 -AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ PD VL N+ Q+ +L+D S+ G + C
Sbjct: 253 ESGPDGSAGVL---GNVNQRAFEVLFDTSTSKFGFRAKAC 289
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 163/393 (41%), Gaps = 33/393 (8%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
EE V +A + RL FL + + + + + Y+ IG P Q +
Sbjct: 49 EELVRRAVAAGKQRLAFLDAAMAGGGDGGGVGAPVRWA-TLQYVAEYLIGDPPQRAEALI 107
Query: 115 DTSNDAAWVPCTGCVG--CSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPT--CG-GGA 166
DT +D W C+ C+ C+ +NS+ S+TF + C A C + C
Sbjct: 108 DTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAARICAANDDIIHFCDLAAG 167
Query: 167 CAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGNSVPPQGLLGLGRGSL 223
C+ YG+ +A L + + + FGC+ + G GL+GLGRG L
Sbjct: 168 CSVIAGYGAGVVAGTLGTEAFAFQSGTAE-LAFGCVTFTRIVQGALHGASGLIGLGRGRL 226
Query: 224 SLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLY 279
SL++QT + FSYCL P F +G L +G +G + T +K P+ S Y
Sbjct: 227 SLVSQTG---ATKFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMTTQFVKGPKGSPFY 283
Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTT----GAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
Y+ L+ + VG + IP G IIDSG+ FT LV AY A+
Sbjct: 284 YLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAA 343
Query: 336 RVGSNLTVTSLGGFDTCY-----SVPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITC 389
R+ +L D V V P + F G ++ +P ++ +
Sbjct: 344 RLNGSLVAPPPDADDGALCVARRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDKAAACM 403
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
+A P S VI N QQQN R+LYD+ N
Sbjct: 404 AIASAGPYRRQS---VIGNYQQQNMRVLYDLAN 433
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 122/457 (26%), Positives = 196/457 (42%), Gaps = 60/457 (13%)
Query: 6 VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAK 64
+ F + F+ S S Q + ++++ H S SP +KP++ ++ V
Sbjct: 9 LLFFSICFIVSFSHA-------QKNGFSVELIHRDSLKSPLYKPTQN-KYQYFV------ 54
Query: 65 DQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV- 123
D AR + + S+ I I Y++ +GTP L +DT +D W+
Sbjct: 55 DAARRSINRANHFYKYSLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQ 114
Query: 124 --PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYG-SSTIA 179
PC C ++ +FN ++S+++KN+ C + C+ + + +C C ++ YG +S
Sbjct: 115 CEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSG 174
Query: 180 ANLSQDTISLA-----TDIVPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQTQ 230
+LS DT++L T P GC T N + + G++G G G S + Q
Sbjct: 175 GDLSVDTLTLESTNGLTVSFPNIVIGC---GTNNILSYEGASSGIVGFGSGPASFITQLG 231
Query: 231 NLYQSTFSYCL-PSFKALSF----SGSLRLGPIG--QPKRIKYTPLLKNPRRSSLYYVNL 283
+ FSYCL P F + + L G + TP+LK + YY+ L
Sbjct: 232 SSTGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPE-TFYYLTL 290
Query: 284 LAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY----TAVRDVFRRRVGS 339
A VG R V+I G + N IIDSGT T L Y +AV D+ +
Sbjct: 291 EAFSVGNRRVEI--GGVP-NGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVD 347
Query: 340 NLTVTSLGGFDTCYSVPIVA---PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAP 396
+ T T + CYSV P IT+ F G +V L + + S A + CLA ++
Sbjct: 348 DPTQT----LNLCYSVKAEGYDFPIITMHFKGADVDLHPISTFV-SVADGVFCLAFESSQ 402
Query: 397 DNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
D+ + N+ QQN + YD+ + CT
Sbjct: 403 DHA-----IFGNLAQQNLMVGYDLQQKIVSFKPSDCT 434
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 119/398 (29%), Positives = 169/398 (42%), Gaps = 69/398 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSST------VFNSAQSTTFKNL 147
Y A +GTP Q L + +DT + WVPCT C CSS VF+ S++ + +
Sbjct: 103 YAFTASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAAVPVFHPKNSSSSRLV 162
Query: 148 GCQAAQCKQVPN---------PTCGGGACA--------FNLTYGSSTIAANLSQDTISLA 190
GC+ C V + P G C + + YGS + A L DT+
Sbjct: 163 GCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVYGSGSTAGLLIADTLRAP 222
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---AL 247
V G+ GC + PP GL G GRG+ S+ AQ L S FSYCL S +
Sbjct: 223 GRAVSGFVLGCSLVSVHQ--PPSGLAGFGRGAPSVPAQ---LGLSKFSYCLLSRRFDDNA 277
Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGALQF 302
+ SGSL LG G ++Y PL+K+ YY+ L + VG + V +P A
Sbjct: 278 AVSGSLVLG--GDNDGMQYVPLVKSAAGDKQPYAVYYYLALSGVTVGGKAVRLPARAFAA 335
Query: 303 NPTTGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-- 356
N G I+DSGT FT L P AV R + V G C+++P
Sbjct: 336 NAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDVEEGLGLHPCFALPQG 395
Query: 357 ---IVAPTITLMFSGMNVT-LPQDNLLIHSTAGSI------------TCLAM------AA 394
+ P ++L F G V LP +N + + + CLA+ +
Sbjct: 396 AKSMALPELSLHFKGGAVMQLPLENYFVVAGRAPVPGAGAGAGAAEAICLAVVTDFGGSG 455
Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
A D ++ + QQQN+ + YD+ RLG R+ C
Sbjct: 456 AGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQPC 493
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 168/363 (46%), Gaps = 37/363 (10%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-Q 156
+V IGTP Q M +DT + +W+ C +++ F+ + S++F L C CK +
Sbjct: 89 VVTLPIGTPPQPQQMVLDTGSQLSWIQCHNKTPPTAS-FDPSLSSSFYVLPCTHPLCKPR 147
Query: 157 VPN----PTCGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTFGCIQKATGNS 209
VP+ TC C ++ Y T A NL ++ ++ + + P GC + S
Sbjct: 148 VPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGC----SSES 203
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF----SGSLRLGPIGQPKRIK 265
+G+LG+ G LS Q + + FSYC+P+ + + +GS LG R +
Sbjct: 204 RDARGILGMNLGRLSFPFQAK---VTKFSYCVPTRQPANNNNFPTGSFYLGNNPNSARFR 260
Query: 266 YTPLLKNPRRSSL-------YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
Y +L P+ + Y V + IR+G R ++IPP + N T++DSG+ F
Sbjct: 261 YVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMVDSGSEF 320
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCY-----SVPIVAPTITLMFS-GMN 370
T LV AY VR+ R +G + + G D C+ + + + F G+
Sbjct: 321 TFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFEFEKGVE 380
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+ +P++ +L G + C+ + + + + + N+I N QQN + +D+ N R+G
Sbjct: 381 IVVPKERVLA-DVGGGVHCVGIGRS-ERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVA 438
Query: 431 LCT 433
C+
Sbjct: 439 DCS 441
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 107/400 (26%), Positives = 162/400 (40%), Gaps = 68/400 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGC---------------SSTVFNSA 139
Y + +G+ + + + MDT +D W PC+ C+ C + +V SA
Sbjct: 76 YTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSCSA 135
Query: 140 QSTTFKNLG-------CQAAQC--KQVPNPTCGGGACA-FNLTYGSSTIAANLSQDTISL 189
+ + + G C ++C + + C +C F YG ++ A L +D++SL
Sbjct: 136 AACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSL 195
Query: 190 ATDI------VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL---YQSTFSYC 240
T V +TFGC G P G+ G GRG LS+ +Q + FSYC
Sbjct: 196 PTPAPSPPINVRNFTFGCAHTTLGE---PVGVAGFGRGVLSMPSQLATFSPQLGNRFSYC 252
Query: 241 LPSFKALSFSGSLRLGPI-------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
L S + + R P+ G+ + I YT LL+NP+ Y V L I VG +
Sbjct: 253 LVS-HSFAADRVRRPSPLILGRYYTGETEFI-YTSLLENPKHPYFYSVGLAGISVGNIRI 310
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS----NLTVTSLGGF 349
P + + G ++DSGT FT L A Y +V F R G + G
Sbjct: 311 PAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGL 370
Query: 350 DTCYSVP--IVAPTITLMFSGM--NVTLPQDNLLIHSTAG---------SITCLAMAAAP 396
CY + P + L F G NV LP+ N G + CL +
Sbjct: 371 SPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMNGG 430
Query: 397 DNVNSVLN---VIANMQQQNHRILYDVPNSRLGVARELCT 433
D + N QQQ ++YD+ +R+G AR C+
Sbjct: 431 DEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 470
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 158/373 (42%), Gaps = 60/373 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVFNSAQSTTFKNLGCQA 151
YI IG P Q +DT ++ W C+ GC G T ++ ++S T K + C
Sbjct: 84 YIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACND 143
Query: 152 AQCKQVPNPTCG--GGACAFNLTYGSSTIAANL------------SQDTISLATDIVPGY 197
C C G ACA YG+ I L S++ +SLA
Sbjct: 144 TACLLGSETRCARDGKACAVLTAYGAGAIGGFLGTEVFTFGHGQSSENNVSLA------- 196
Query: 198 TFGCIQKAT---GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
FGCI + G+ G++GLGRG LSL +Q L + FSYCL P F + + +L
Sbjct: 197 -FGCITASRLTPGSLDGASGIIGLGRGKLSLPSQ---LGDNKFSYCLTPYFSDAANTSTL 252
Query: 254 RLGPIGQPKRIKY----TPLLKNPRRS---SLYYVNLLAIRVGRRVVDIPPGALQFN--- 303
+G P LKNP S YY+ L I VG +D+P A
Sbjct: 253 FVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVA 312
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG--GFDTCY------SV 355
P GT+IDSG+ FT L+ AY A+RD R++G+++ G G D C
Sbjct: 313 PAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDA 372
Query: 356 PIVAPTITLMF-----SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN---VIA 407
+ P + L F G +V +P +N + + ++ N LN +I
Sbjct: 373 GKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIG 432
Query: 408 NMQQQNHRILYDV 420
N QQ+ +LYD+
Sbjct: 433 NYMQQDMHLLYDL 445
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 117/415 (28%), Positives = 185/415 (44%), Gaps = 43/415 (10%)
Query: 51 PLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTL 110
P + E + E+ A D AR L V P+ Y + K+GTP +
Sbjct: 38 PPNHELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREF 97
Query: 111 LMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVPN 159
+ +DT +D WV CT C GC T F+ S++ + C +C Q +
Sbjct: 98 NVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES 157
Query: 160 PTCGGGACAFNLTYGSST------IAANLSQDTI---SLATDIVPGYTFGCIQKATGNSV 210
C+++ YG + I+ +S DT+ +LA + + FGC +G+
Sbjct: 158 GCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQ 217
Query: 211 PPQ----GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRI 264
P+ G+ GLG+GSLS+++Q Q L FS+CL K S G + LG I +P +
Sbjct: 218 RPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK--SGGGIMVLGQIKRPDTV 275
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
YTPL+ + Y VNL +I V +++ I P F TG GTIID+GT L
Sbjct: 276 -YTPLVPSQPH---YNVNLQSIAVNGQILPIDPSV--FTIATGDGTIIDTGTTLAYLPDE 329
Query: 325 AYTAVRDVFRRRV---GSNLTVTSLGGFDTCYSVPIVAPTITLMFSG--MNVTLPQDNLL 379
AY+ V G +T S F+ V P ++L F+G V P+ L
Sbjct: 330 AYSPFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQ 389
Query: 380 IHSTAG-SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
I S++G SI C+ + + ++ ++ ++ ++YD+ R+G A C+
Sbjct: 390 IFSSSGSSIWCIGFQRMS---HRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 117/409 (28%), Positives = 176/409 (43%), Gaps = 52/409 (12%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMD 115
E + DQ RL+ + VA PI+ + Y R +GTP Q + +D
Sbjct: 11 EYYRTLREHDQRRLRRILPEVVA----FPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVD 66
Query: 116 TSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLGCQAAQCKQVPNPTC--GGG 165
T +D AWV C C C ++F+ +ST+ ++ C +C N C
Sbjct: 67 TGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNSKCSFNSM 126
Query: 166 ACAFNLTYGS-STIAANLSQDTISL---------ATDIVPGYTFGCIQKATGNSVPPQGL 215
+C ++ YG S+ A L D +S AT TFGC TG + GL
Sbjct: 127 SCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTWL-TDGL 185
Query: 216 LGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNP 273
+G G+ +SL +Q QN+ + F++CL SG+L +G I +P + YTP++ P
Sbjct: 186 VGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDN--KGSGTLVIGHIREPGLV-YTPIV--P 240
Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY----TAV 329
++S Y V LL I V V P F+ + G I+DSGT T LV PAY V
Sbjct: 241 KQSH-YNVELLNIGVSGTNVTTPTA---FDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKV 296
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMNVTL--PQDNLL--IHSTAG 385
RD R V L V F ++ P +TL F+G L P L + +T
Sbjct: 297 RDCMRSGV---LPV----AFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGL 349
Query: 386 SITCLAMAAAPDNVNSV-LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
S C + + + + + ++ ++YD N+R+G CT
Sbjct: 350 SAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDCT 398
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 109/389 (28%), Positives = 165/389 (42%), Gaps = 66/389 (16%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------TVFNSAQSTTFKNLGCQAA 152
V +G P Q + M +DT ++ +W+ C G S+ FN + S+T+ C +
Sbjct: 62 VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 121
Query: 153 QCKQ------VPNPTCGG---GACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCI 202
+C+ VP P C G +C +L+Y ++ A L+ DT L FGC+
Sbjct: 122 ECQWRGRDLPVP-PFCAGPPSXSCRVSLSYADASSADGILAADTFLLGGAPPVXALFGCV 180
Query: 203 QKATG-------NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSFKALSFS 250
+ +S GLLG+ RGSLS + QT L F+YC+ P L
Sbjct: 181 TSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPGDGPGLLVLGGD 237
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
G+ L P ++ YTPL++ R Y V L IRVG ++ IP L + T
Sbjct: 238 GAA-LAP-----QLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHT 291
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT------VTSLGGFDTCY------ 353
T++DSGT FT L+A AY ++ F + + L G FD C+
Sbjct: 292 GAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEAR 351
Query: 354 --SVPIVAPTITLMFSGMNVTLPQDNLLI--------HSTAGSITCLAMAAAPDNVNSVL 403
+ + P + L+ G V + + LL A ++ CL + D
Sbjct: 352 VAAASXMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNS-DMAGMSA 410
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
VI + QQN + YD+ N R+G A C
Sbjct: 411 YVIGHHHQQNVWVEYDLQNGRVGFAPARC 439
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 109/389 (28%), Positives = 165/389 (42%), Gaps = 66/389 (16%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------TVFNSAQSTTFKNLGCQAA 152
V +G P Q + M +DT ++ +W+ C G S+ FN + S+T+ C +
Sbjct: 64 VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 123
Query: 153 QCKQ------VPNPTCGG---GACAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTFGCI 202
+C+ VP P C G +C +L+Y ++ A L+ DT L FGC+
Sbjct: 124 ECQWRGRDLPVP-PFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVRALFGCV 182
Query: 203 QKATG-------NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-----PSFKALSFS 250
+ +S GLLG+ RGSLS + QT L F+YC+ P L
Sbjct: 183 TSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPGDGPGLLVLGGD 239
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
G+ L P ++ YTPL++ R Y V L IRVG ++ IP L + T
Sbjct: 240 GAA-LAP-----QLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHT 293
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT------VTSLGGFDTCY------ 353
T++DSGT FT L+A AY ++ F + + L G FD C+
Sbjct: 294 GAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEAR 353
Query: 354 --SVPIVAPTITLMFSGMNVTLPQDNLLI--------HSTAGSITCLAMAAAPDNVNSVL 403
+ + P + L+ G V + + LL A ++ CL + D
Sbjct: 354 VAAASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNS-DMAGMSA 412
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
VI + QQN + YD+ N R+G A C
Sbjct: 413 YVIGHHHQQNVWVEYDLQNGRVGFAPARC 441
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 115/448 (25%), Positives = 184/448 (41%), Gaps = 46/448 (10%)
Query: 6 VFFLAFLFLFS--LSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLA 63
+F L F+ FS LS G ++++ H SP SP+ ++ V
Sbjct: 11 LFSLCFIASFSHALSNGF-----------SVELIHRDSPKSPYYKPTENKYQHFV----- 54
Query: 64 KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
D AR + + S I Y++ +GTP + DT +D W+
Sbjct: 55 -DAARRSINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWL 113
Query: 124 ---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYGSSTIA 179
PC C ++ +FN ++S+++KN+ C + C V + +C +C + ++YG S+ +
Sbjct: 114 QCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHS 173
Query: 180 -ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNL 232
+LS DT+SL + P GC G G++GLG G +SL+ Q +
Sbjct: 174 QGDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSS 233
Query: 233 YQSTFSYCLPSF--KALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
FSYCL K + S L G + + TPL+K + Y++ L A V
Sbjct: 234 IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSV 291
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
G + V+ G IIDSGT T + + YT + V +
Sbjct: 292 GNKRVEF--GGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQ 349
Query: 349 FDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
F CYS+ P IT F G ++ L + + T G I C A +P + ++
Sbjct: 350 FSLCYSLKSNEYDFPIITAHFKGADIELHSISTFVPITDG-IVCFAFQPSP----QLGSI 404
Query: 406 IANMQQQNHRILYDVPNSRLGVARELCT 433
N+ QQN + YD+ + CT
Sbjct: 405 FGNLAQQNLLVGYDLQQKTVSFKPTDCT 432
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 174/404 (43%), Gaps = 64/404 (15%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-VFNSAQST 142
P A+ + + + V +GTP Q + M +DT ++ +W+ C G T FN++ S+
Sbjct: 42 PAANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSS 101
Query: 143 TFKNLGCQAAQC----KQVPNP----TCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI 193
++ + C + C + +P P T AC +L+Y ++ A L+ DT L
Sbjct: 102 SYGAVPCPSTACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGA 161
Query: 194 VP---GYTFGCI----------QKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFS 238
P G FGCI TG V GLLG+ RG+LS + QT F+
Sbjct: 162 PPVAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGT---RRFA 218
Query: 239 YCLPSFKALSFSGSLRLGPIGQ-PKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRV 292
YC+ + G L LG G + YTPL++ + Y V L IRVG +
Sbjct: 219 YCIAPGEG---PGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCAL 275
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL------ 346
+ IP L + T T++DSGT FT L+A AY A++ F + + L + L
Sbjct: 276 LPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ--ARLLLAPLGEPGFV 333
Query: 347 --GGFDTCYSVPI--------VAPTITLMFSGMNVTLPQDNLLI--------HSTAGSIT 388
G FD C+ P + P + L+ G V + + LL A ++
Sbjct: 334 FQGAFDACFRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVW 393
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CL + D VI + QQN + YD+ N R+G A C
Sbjct: 394 CLTFGNS-DMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 116/370 (31%), Positives = 162/370 (43%), Gaps = 52/370 (14%)
Query: 94 SPTYIVRAKIGTP-AQTLLMAMDTSNDAAWVPCTGCVGCS------STVFNSAQSTTFKN 146
+P ++ +GTP AQT+ +D ++ W C C + +T F S TF
Sbjct: 85 APPLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSP 144
Query: 147 LGCQAAQCKQVPNPTCGGGAC-----------AFNLTYGSSTIAAN----LSQDTISLAT 191
L C + C V TCG +++LTYG S AAN L+ DT +
Sbjct: 145 LPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGS--AANTSGYLATDTFTFGA 202
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL---S 248
VPG FGC + G+ G++G+GRG+LSL++Q Q FSY L + +A S
Sbjct: 203 TAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGS 259
Query: 249 FSGSLRLGPIGQP--KRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPT 305
+R G P KR + TPLL + YYVNL +RV G R+ IP G
Sbjct: 260 ADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRAN 319
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYSVPIVA- 359
G I+ S T T L AY DV R V S + + ++ G D CY+ +A
Sbjct: 320 GTGGVILSSTTPVTYLEQAAY----DVVRAAVASRIGLPAVNGSAALELDLCYNASSMAK 375
Query: 360 ---PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P +TL+F G ++ L N + CL M P SVL + Q
Sbjct: 376 VKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTM--LPSQGGSVL---GTLLQTGTN 430
Query: 416 ILYDVPNSRL 425
++YDV RL
Sbjct: 431 MIYDVDAGRL 440
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 174/404 (43%), Gaps = 64/404 (15%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-VFNSAQST 142
P A+ + + + V +GTP Q + M +DT ++ +W+ C G T FN++ S+
Sbjct: 42 PAANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSS 101
Query: 143 TFKNLGCQAAQC----KQVPNP----TCGGGACAFNLTYGSSTIAAN-LSQDTISLATDI 193
++ + C + C + +P P T AC +L+Y ++ A L+ DT L
Sbjct: 102 SYGAVPCPSTACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGA 161
Query: 194 VP---GYTFGCI----------QKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFS 238
P G FGCI TG V GLLG+ RG+LS + QT F+
Sbjct: 162 PPVAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGT---RRFA 218
Query: 239 YCLPSFKALSFSGSLRLGPIGQ-PKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRV 292
YC+ + G L LG G + YTPL++ + Y V L IRVG +
Sbjct: 219 YCIAPGEG---PGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCAL 275
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL------ 346
+ IP L + T T++DSGT FT L+A AY A++ F + + L + L
Sbjct: 276 LPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ--ARLLLAPLGEPGFV 333
Query: 347 --GGFDTCYSVPI--------VAPTITLMFSGMNVTLPQDNLLI--------HSTAGSIT 388
G FD C+ P + P + L+ G V + + LL A ++
Sbjct: 334 FQGAFDACFRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVW 393
Query: 389 CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CL + D VI + QQN + YD+ N R+G A C
Sbjct: 394 CLTFGNS-DMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
Length = 225
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 85/229 (37%), Positives = 111/229 (48%), Gaps = 12/229 (5%)
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
V GLLGLG G +S + Q TFSYCL S + SGSL G P + L
Sbjct: 3 VGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVS-RGTESSGSLEFGRESVPVGASWVSL 61
Query: 270 LKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
+ NPR S YY+ L + VG V I + N G ++D+GT TRL A AY A
Sbjct: 62 IHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAYNAF 121
Query: 330 RDVFRRRVGSNLTVTS-LGGFDTCYS----VPIVAPTITLMFSGMNV-TLPQDNLLIHST 383
RD F + +NL TS + FDTCY V + PTI+ F G + TLP N LI
Sbjct: 122 RDAFVAQT-TNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIPVD 180
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ C A A + +S L++I N+QQ+ I D N +G +C
Sbjct: 181 SVGTFCFAFAPS----SSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 161/382 (42%), Gaps = 39/382 (10%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG--------CSST 134
+P+ SG T + Y V+ ++GTPAQ ++ DT +D WV C G S
Sbjct: 97 MPLTSG-AYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPR 155
Query: 135 VFNSAQSTTFKNLGCQAAQCKQ-VPN--PTCGGGA-----CAFNLTYGSSTIAANL---S 183
VF A S ++ + C + CK VP C G C ++ Y + A +
Sbjct: 156 VFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTD 215
Query: 184 QDTISLATD------IVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQST 236
TI+L+ + GC G S G+L LG ++S ++ +
Sbjct: 216 AATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGR 275
Query: 237 FSYCLPSFKALSFSGS-LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
FSYCL A + S L GP+G TPLL + + + Y V + A+ V + ++I
Sbjct: 276 FSYCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNI 335
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
P A ++ G I+DSGT T L PAY AV +++ VT + F+ CY+
Sbjct: 336 P--AEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVT-MDPFEYCYNW 392
Query: 356 -----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
P P + + F+G P + A + C+ + + V ++VI N+
Sbjct: 393 TATRRPPAVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQ---EGVWPGVSVIGNIL 449
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQ H +D+ N L C
Sbjct: 450 QQEHLWEFDLANRWLRFQESRC 471
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 114/448 (25%), Positives = 177/448 (39%), Gaps = 79/448 (17%)
Query: 57 SVLEMLAKDQARLQFLSSLAVARKS------VVPIASGRQITQSPTYIVRAKIGTPAQTL 110
S+ ++ D+ R+ F+SS R + +P++SG T + Y VR ++GTPAQ
Sbjct: 42 SLADLARMDRERMAFISSRGRRRAAETASAFAMPLSSG-AYTGTGQYFVRFRVGTPAQPF 100
Query: 111 LMAMDTSNDAAWVPCTGCV------------------GCSSTVFNSAQSTTFKNLGCQAA 152
L+ DT +D WV C F +S T+ + C +A
Sbjct: 101 LLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRPDKSRTWAPIPCSSA 160
Query: 153 QCKQ---VPNPTCGGGA--CAFNLTYGSSTIA---ANLSQDTISLATDI-----VPGYTF 199
C++ C A CA++ Y + A + TI+L+ + G
Sbjct: 161 TCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVL 220
Query: 200 GCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGP 257
GC G S + G+L LG ++S ++ + + FSYCL A + S L GP
Sbjct: 221 GCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGP 280
Query: 258 ------------IGQPKRI-------------KYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
I K + TPL+ + R Y V + + V +
Sbjct: 281 NPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGEL 340
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
+ IP ++ G G I+DSGT T L PAY AV +R+ + L ++ FD C
Sbjct: 341 LKIPRAV--WDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRL-AGLPRVTMDPFDYC 397
Query: 353 YS--------VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN 404
Y+ V P + + F+G P + A + C+ + P L+
Sbjct: 398 YNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQEGP---WPGLS 454
Query: 405 VIANMQQQNHRILYDVPNSRLGVARELC 432
VI N+ QQ H YD+ N RL R C
Sbjct: 455 VIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
Length = 340
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 125/262 (47%), Gaps = 26/262 (9%)
Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-AN 181
PC G C F+ ++S++F + C + +C C G +C F + +G+ T+A
Sbjct: 21 APCVGGAPCD-VAFDPSRSSSFAAIPCGSPECAV----ECTGASCPFTIQFGNVTVANGT 75
Query: 182 LSQDTISLA-TDIVPGYTFGCIQKATGNSV--PPQGLLGLGRGSLSLLAQT-----QNLY 233
L +DT++L+ + G+TFGCI+ GL+ L R S SL ++
Sbjct: 76 LVRDTLTLSPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTTT 135
Query: 234 QSTFSYCLPSFKALSFSGSLRLG---PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
+ FSYCLPS + G L +G P IKY P+ NP + Y+V+L+ I VG
Sbjct: 136 TAAFSYCLPSLSSTRSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGG 195
Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
+ +PP L + GT++++ T FT L AY A+RD FR + D
Sbjct: 196 EDLPVPPAVLAAH-----GTLLEAATEFTFLAPAAYAALRDAFRNDMAQYPAAPPFRVLD 250
Query: 351 TCYSV----PIVAPTITLMFSG 368
TCY++ + P + L F+G
Sbjct: 251 TCYNLTGLASLAVPAVALRFAG 272
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 164/374 (43%), Gaps = 59/374 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS---TVFNSAQSTTFKNLGCQA 151
YI IG P Q +DT ++ W C+ C GC S + ++ ++S T + + C
Sbjct: 71 YIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACND 130
Query: 152 AQCKQVPNPTCG--GGACAFNLTYGSSTIAANLSQDTISLA--TDIVPGYTFGCIQKAT- 206
C C ACA YG+ I L + + ++ V FGCI
Sbjct: 131 TACALGSETRCARDNKACAVLTAYGAGVIGGVLGTEAFTFQPQSENV-SLAFGCIAATRL 189
Query: 207 --GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKA------LSFSGSLRLGP 257
G+ G++GLGRG+LSL++Q L + FSYCL P F L S L
Sbjct: 190 TPGSLDGASGIIGLGRGNLSLVSQ---LGDNKFSYCLTPYFSQSTNTSRLFVGASAGLSS 246
Query: 258 IGQPKRIKYTPLLKNPRR---SSLYYVNLLAIRVGRRVVDIPPGALQFNP-TTG--AGTI 311
G P P LKNP S+ YY+ L I VG + +P A TG AGT+
Sbjct: 247 GGAPA--TSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLWAGTL 304
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG--GFDTCYSVP-----IVAPTITL 364
IDSG+ FT LV AY A+RD +++G+++ G G D C +V + P + L
Sbjct: 305 IDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGKLVPPLVL 364
Query: 365 MF--SGMNVTLPQDN-----------LLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQ 411
F G +V +P +N +++ S+ G + L M + +I N Q
Sbjct: 365 HFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPM--------NETTIIGNYMQ 416
Query: 412 QNHRILYDVPNSRL 425
Q+ +LYD+ L
Sbjct: 417 QDMHLLYDLEKGML 430
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 116/370 (31%), Positives = 162/370 (43%), Gaps = 52/370 (14%)
Query: 94 SPTYIVRAKIGTP-AQTLLMAMDTSNDAAWVPCTGCVGCS------STVFNSAQSTTFKN 146
+P ++ +GTP AQT+ +D ++ W C C + +T F S TF
Sbjct: 85 APPLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSP 144
Query: 147 LGCQAAQCKQVPNPTCGGGAC-----------AFNLTYGSSTIAAN----LSQDTISLAT 191
L C + C V TCG +++LTYG S AAN L+ DT +
Sbjct: 145 LPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGS--AANTSGYLATDTFTFGA 202
Query: 192 DIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL---S 248
VPG FGC + G+ G++G+GRG+LSL++Q Q FSY L + +A S
Sbjct: 203 TAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGS 259
Query: 249 FSGSLRLGPIGQP--KRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPT 305
+R G P KR + TPLL + YYVNL +RV G R+ IP G
Sbjct: 260 ADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRAN 319
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYSVPIVA- 359
G I+ S T T L AY DV R V S + + ++ G D CY+ +A
Sbjct: 320 GTGGVILSSTTPVTYLEQAAY----DVVRAAVASRIGLPAVNGSAALELDLCYNASSMAK 375
Query: 360 ---PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P +TL+F G ++ L N + CL M P SVL + Q
Sbjct: 376 VKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTM--LPSQGGSVL---GTLLQTGTN 430
Query: 416 ILYDVPNSRL 425
++YDV RL
Sbjct: 431 MIYDVDAGRL 440
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 134/280 (47%), Gaps = 46/280 (16%)
Query: 5 LVFFLAFLFLFSLSE----------GLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSW 54
LV+FL + +L + + L P + + + HV P S P P+S+
Sbjct: 3 LVWFLGWFYLLATASSFVEKENEAVALGPRVNQSGGVVQMTIHHVHGPGSSLAPQPPVSF 62
Query: 55 EESVLEMLAKDQARLQFLSSLAVAR-----------------KSV-VPIASGRQITQSPT 96
+ +LA D AR++ L+S + KSV VP+ G I S
Sbjct: 63 SD----VLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIG-SGN 117
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCV-GC---SSTVFNSAQSTTFKNLGCQAA 152
Y V+ G+PA+ M +DT + +W+ C CV C + +F+ + S T+K+L C ++
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 153 QCKQV-----PNPTC--GGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQ 203
QC + NP C C + +YG S+ + LSQD ++LA + +PG+ +GC Q
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQ 237
Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS 243
+ G G+LGLGR LS+L Q + + FSYCLP+
Sbjct: 238 DSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPT 277
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/411 (23%), Positives = 167/411 (40%), Gaps = 51/411 (12%)
Query: 69 LQFLSSLAVARKSVVPIASGRQITQSPTY-------IVRAKIGTPAQTLLMAMDTSNDAA 121
LQ L++ +++R + + Q+ + + GTP Q L +DT +
Sbjct: 52 LQHLATASMSRSHHLKHGKASPLIQTSLFPHSHGGHTIPLSFGTPPQKLSFLVDTGSHVV 111
Query: 122 WVPCT---GCVGCSST------VFNSAQSTTFKNLGCQAAQCKQVPNPT-------CGGG 165
W PCT C CS + +FN S++ K LGC+ +C +P C G
Sbjct: 112 WAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGCPRCNGN 171
Query: 166 ------AC-AFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
AC + L YG+ + + + + + GC A L G
Sbjct: 172 SKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSAD-REPSSDALAGF 230
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLG-PIGQPKRIKYTPLLKNPRR 275
GR SL Q + F+YCL S + SG L L G+ + + Y P LKNP
Sbjct: 231 GRTMFSLPMQ---MGVKKFAYCLNSHDYDDTRNSGKLILDYSDGETQGLSYAPFLKNPPD 287
Query: 276 SSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
YY+ + +++G +++ IP L + G +IDSG + + P + V + +
Sbjct: 288 YPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELK 347
Query: 335 RRVGS---NLTVTSLGGFDTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGS 386
+++ +L + G CY+ I P + F+ G N+ +P N + + S
Sbjct: 348 KQMSKYRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEAS 407
Query: 387 ITCLAMAAAPDNVNSVLN-----VIANMQQQNHRILYDVPNSRLGVARELC 432
+ C + N ++ N QQ +H + +D+ N RLG ++ C
Sbjct: 408 LGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/305 (28%), Positives = 138/305 (45%), Gaps = 34/305 (11%)
Query: 76 AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST- 134
A R +V A G + Y+V +GTP + + + +DT +D W C C C
Sbjct: 68 ARVRAGLVAAAGGIATNE---YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQG 124
Query: 135 --VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI-AANLSQDTISLAT 191
+ + A S+T+ L C A +C+ +P +CGG +C + YG ++ ++ D +
Sbjct: 125 IPLLDPAASSTYAALPCGAPRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGD 184
Query: 192 D-------IVPG---YTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYC 240
+ +P TFGC G + G+ G GRG SL +Q L ++FSYC
Sbjct: 185 NGRRNGDGSLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQ---LNATSFSYC 241
Query: 241 LPS-FKALSFSGSLRLGPI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
S F + S +L P ++ TPL KNP + SLY+++L I VG+ +
Sbjct: 242 FTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLP 301
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS 354
+P + TIIDSG T L Y AV+ F +VG + D C++
Sbjct: 302 VPETKFR-------STIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFA 354
Query: 355 VPIVA 359
+P+ A
Sbjct: 355 LPVSA 359
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 125/447 (27%), Positives = 187/447 (41%), Gaps = 54/447 (12%)
Query: 6 VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
+FF LFL S S+ D+ T +FH S SP + S LS + + +
Sbjct: 7 LFFHLILFLISFSQ---TTIINGDNGFTTSLFHRDSLLSPLEFSS-LSHYDRLANAFRRS 62
Query: 66 QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC 125
+R S A+ ++ A G Q + IGTP L DT +D W C
Sbjct: 63 LSR-----SAALLNRAATSGAVGLQ---------SSIIGTPPVDYLGIADTGSDLTWAQC 108
Query: 126 TGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIA-A 180
C+ C +FN +ST+F ++ C C V + CG G C ++ TYG T +
Sbjct: 109 LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKG 168
Query: 181 NLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL--YQSTFS 238
+L + I++ + V GC ++G G++GLG G LSL++Q FS
Sbjct: 169 DLGFEKITIGSSSVKS-VIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFS 227
Query: 239 YCLPSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG--RRVV 293
YCLP+ + + +G + G + P + TPL+ + + YY+ L AI +G R +
Sbjct: 228 YCLPTLLSHA-NGKINFGQNAVVSGPGVVS-TPLI-SKNTVTYYYITLEAISIGNERHMA 284
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
G + IIDSGT + L Y V + V + +D C+
Sbjct: 285 FAKQGNV----------IIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCF 334
Query: 354 SVPI-VA-----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
I VA P IT FS G NV L N A ++ CL + P + +I
Sbjct: 335 DDGINVATSSGIPIITAQFSGGANVNLLPVNTF-QKVANNVNCLTL--TPASPTDEFGII 391
Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
N+ N I YD+ RL +CT
Sbjct: 392 GNLALANFLIGYDLEAKRLSFKPTVCT 418
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 111/421 (26%), Positives = 183/421 (43%), Gaps = 53/421 (12%)
Query: 51 PLSWEESVLEMLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
P + E + ++ A+D+AR + L SL P+ Y + ++GTP +
Sbjct: 36 PANHEMELSQLKARDEARHGRLLQSLGGVID--FPVDGTFDPFVVGLYYTKLRLGTPPRD 93
Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVP 158
+ +DT +D WV C C GC T F+ S T + C +C Q
Sbjct: 94 FYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSS 153
Query: 159 NPTCG--GGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYT----FGCIQKATG 207
+ C CA+ YG S +++ Q + + + +VP T FGC TG
Sbjct: 154 DSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213
Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
+ V G+ G G+ +S+++Q +Q + FS+CL G L LG I +P
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENG--GGGILVLGEIVEP 271
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ +TPL+ + Y VNLL+I V + + I P F+ + G GTIID+GT L
Sbjct: 272 NMV-FTPLVPSQPH---YNVNLLSISVNGQALPINPSV--FSTSNGQGTIIDTGTTLAYL 325
Query: 322 VAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTITLMFSGMNVTL--P 374
AY + V ++ V S G + CY SV + P ++L F+G P
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKG--NQCYVITTSVGDIFPPVSLNFAGGASMFLNP 383
Query: 375 QDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
QD L+ + G ++ C+ N + ++ ++ ++ +YD+ R+G A C
Sbjct: 384 QDYLIQQNNVGGTAVWCIGFQRIQ---NQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
Query: 433 T 433
+
Sbjct: 441 S 441
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 113/403 (28%), Positives = 162/403 (40%), Gaps = 77/403 (19%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTP--AQTLLMAMDTSNDAAWVPCTG-----CVG----- 130
+P+A G Y + +G P A ++ + +DT +D W PC C G
Sbjct: 80 LPLAPGSD------YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPG 133
Query: 131 -----------------CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGGACA-FN 170
C+S + ++A S+ + C AA+C + +C AC
Sbjct: 134 GNHSSPLPPPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLY 193
Query: 171 LTYGSSTIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT 229
YG ++ ANL + + LA + V +TF C A P G+ G GRG LSL AQ
Sbjct: 194 YAYGDGSLVANLRRGRVGLAASMAVENFTFACAHTALAE---PVGVAGFGRGPLSLPAQL 250
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRV 288
A S SGS IG + YTPLL NP+ Y V L A+ V
Sbjct: 251 ----------------APSLSGSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSV 294
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG- 347
G + + P + G ++DSGT FT L + + V D F R + + + G
Sbjct: 295 GGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGA 354
Query: 348 ----GFDTCYSV---PIVAPTITLMFSG-MNVTLPQDNLLI--HSTAG-SITCLAMAAAP 396
G CY P + L F G V LP+ N + S G S+ CL +
Sbjct: 355 EAQTGLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVG 414
Query: 397 DNVNSVLN------VIANMQQQNHRILYDVPNSRLGVARELCT 433
N + + + N QQQ ++YDV R+G AR CT
Sbjct: 415 GNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 457
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 164/361 (45%), Gaps = 40/361 (11%)
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQ 150
Q+ Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C
Sbjct: 78 QTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCG 136
Query: 151 AAQC-KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQ 203
+ C +P C C F ++Y + + L QDT++ + +PG++FGC
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNM 196
Query: 204 KATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLG 256
+ G + GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG
Sbjct: 197 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLG 255
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+ ++YT ++ + + L++V+L AI V + + P + G + DSG+
Sbjct: 256 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRKGVVFDSGS 310
Query: 317 VFT----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-S 367
+ R ++ +R++ +R + CY + V P I+L F
Sbjct: 311 ELSYIPDRALSVLSQRIRELLLKRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDD 365
Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
G L + + + +A AP +++I ++ Q + ++YD+ +G+
Sbjct: 366 GARFDLGSHGVFVERSVQEQDVWCLAFAP---TESVSIIGSLMQTSKEVVYDLKRQLIGI 422
Query: 428 A 428
Sbjct: 423 G 423
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 119/427 (27%), Positives = 198/427 (46%), Gaps = 43/427 (10%)
Query: 30 HSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQAR----LQFLSSLAVA--RKSVV 83
H T +F SP SP + LS +S+++ + +R L L+S++ A R ++
Sbjct: 26 HGFTTSLFRRDSPLSPLH-NPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPII 84
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQ 140
P S +++ IGTP ++ DT +D W +PC C S +FN +
Sbjct: 85 P--------DSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRR 136
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGG--ACAFNLTYGSSTIA-ANLSQDTISLATDIVPGY 197
S++++ + C + C+ + + CG +C++ +YG + +L+ D I++ + +P
Sbjct: 137 SSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKT 196
Query: 198 TFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNL--YQSTFSYCLPS-FKALSFSGSL 253
GC + G G++GLG GSLSL++Q + + + FSYCLP+ F + +G++
Sbjct: 197 VIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTI 256
Query: 254 RLG--PIGQPKRIKYTPLLKNPRR-SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT 310
G + +++ TPL+ PR + Y++ L AI VG++ G T
Sbjct: 257 SFGRKAVVSGRQVVSTPLV--PRSPDTFYFLTLEAISVGKKRFKAANGISAM--TNHGNI 312
Query: 311 IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMF 366
IIDSGT T L Y V R + + G + CYS V P IT F
Sbjct: 313 IIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHF 372
Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+ G +V L N A ++TCL A A + + + N+ Q N + YD+ N RL
Sbjct: 373 AGGADVKLLPVNTFA-PVADNVTCLTFAPA-----TQVAIFGNLAQINFEVGYDLGNKRL 426
Query: 426 GVARELC 432
+LC
Sbjct: 427 SFEPKLC 433
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 115/411 (27%), Positives = 172/411 (41%), Gaps = 59/411 (14%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
EE V + RL + + PI G Q YI IG P Q +
Sbjct: 39 EERVRRATERTHRRLASMGGV------TAPIHWGGQ----SQYIAEYLIGDPPQRAEAII 88
Query: 115 DTSNDAAWVPCTGCVGCSSTVF-------NSAQSTTFKNLGCQAAQCKQVPNPTC--GGG 165
DT ++ W T C C T F + ++S + +GC A C C
Sbjct: 89 DTGSNLIW---TQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAACALGSETQCLSDNK 145
Query: 166 ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGNSVPPQGLLGLGRGS 222
CA YG+ IA L+ + ++ ++ V FGCI + + G+ G++GLGRG
Sbjct: 146 TCAVVTGYGAGNIAGTLATENLTFQSETV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGK 204
Query: 223 LSLLAQTQNLYQSTFSYCL---------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNP 273
LSL +Q L + FSYCL PS + S L G + P +++P
Sbjct: 205 LSLPSQ---LGDTRFSYCLTPYFEDTIEPSHMVVGASAGLINGS-ASSTPVTTVPFVRSP 260
Query: 274 RR---SSLYYVNLLAIRVGRRVVDIPPGAL---QFNPTTGAGTIIDSGTVFTRLVAPAYT 327
S+ YY+ L I G+ + +P A Q P GT IDSG T LV AY
Sbjct: 261 SDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQ 320
Query: 328 AVRDVFRRRVGSNLTVTSLG--GFDTCYSV---PIVAPTITLMF-----SGMNVTLPQDN 377
A+R R++G+ L G GFD C ++ + P + L F +G ++ +P N
Sbjct: 321 ALRAELARQLGAALVQPLAGTTGFDLCVALKDAERLVPPLVLHFGGGSGTGTDLVVPPAN 380
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNSRL 425
+ + C+ + ++ D + +N VI N QQN +LYD+ L
Sbjct: 381 YWAPVDSAT-ACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVL 430
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 165/362 (45%), Gaps = 43/362 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y R IGTP QT + +DT + +VPC+ C C F S+T++ L C + +
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SME 150
Query: 154 CKQVPNPTCGGG--ACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATG 207
C TC C ++ Y S+ + L +D +S +++ P T FGC TG
Sbjct: 151 C------TCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVETG 204
Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
+ S G++GLGRG LS++ Q + + ++FS C G++ LG I P
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDV--GGGAMVLGGISPPAG 262
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ +T +P RS+ Y ++L I + + + I P GTI+DSGT + L
Sbjct: 263 MVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYLPE 316
Query: 324 PAYTAVRDVFRRRVGSNLTVT--SLGGFDTCYS--------VPIVAPTITLMFS-GMNVT 372
PA+ A +D + + S + D C+S + P + L+FS G ++
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376
Query: 373 L-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
L P++ L HS A CL + N N ++ + +N ++YD + ++G +
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIF---QNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTN 433
Query: 432 CT 433
C+
Sbjct: 434 CS 435
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 106/418 (25%), Positives = 171/418 (40%), Gaps = 60/418 (14%)
Query: 69 LQFLSSLAVARKS-VVPIASGR-----QITQSPT----YIVRAKIGTPAQTLLMAMDTSN 118
L+FL LA A S + G+ QI+ SP + + GTP Q L +DT +
Sbjct: 49 LRFLQHLATASLSRAHHLKHGKTSPLTQISLSPHSYGGHSIPLSFGTPPQKLSFLVDTGS 108
Query: 119 DAAWVPCT---GCVGCSST--------VFNSAQSTTFKNLGCQAAQCKQVPNPT------ 161
W PCT C CS + +FN S++ K LGC+ +C +P
Sbjct: 109 HVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGCP 168
Query: 162 -CGGG------AC-AFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQ 213
C G AC ++L YG+ + + + ++ + + GC A G V
Sbjct: 169 PCNGNSKNCSHACPPYSLQYGTGASSGDFLLENLNFPGKTIHEFLVGCTTSAVGE-VTSA 227
Query: 214 GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLL 270
L G GR SL Q + F+YCL S S +L G+ K + Y P L
Sbjct: 228 ALAGFGRSMFSLPMQ---MGVKKFAYCLNSHDYDDTRNSSKLILDYSDGETKGLSYAPFL 284
Query: 271 KNPRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV 329
KNP + YY+ + I++G +++ IP L G +IDSG + + P + V
Sbjct: 285 KNPPDFPIYYYLGVKDIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKV 344
Query: 330 RDVFRRRVGS---NLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLPQDNLLIH 381
+ ++R+ +L + G CY+ I P + F G + +P N +
Sbjct: 345 TNELKKRMSKYRRSLEAEAEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFVL 404
Query: 382 STAGSITCLAMAAAPDNVNSVLN-------VIANMQQQNHRILYDVPNSRLGVARELC 432
S+ C + D + L ++ N Q ++ + +D+ N RLG ++ C
Sbjct: 405 IPEISLACFPLTT--DAGTNTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQTC 460
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 164/369 (44%), Gaps = 46/369 (12%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKN----------L 147
+V IGTP Q M +DT + +W+ C T TT L
Sbjct: 83 VVTLPIGTPPQLQQMVLDTGSQLSWIQCHN----KKTPQKKQPPTTSSFDPSLSSSFFVL 138
Query: 148 GCQAAQCK-QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLA-TDIVPGYTF 199
C CK +VP+ PT C + C ++ Y T A NL ++ I+ + + P
Sbjct: 139 PCNHPLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPIIL 198
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
GC + S +G+LG+ G L +Q + + FSYC+P+ +A SGS LG
Sbjct: 199 GCATQ----SDDARGILGMNLGRLGFPSQAK---ITKFSYCVPTKQAQPASGSFYLGNNP 251
Query: 260 QPKRIKYTPLL---KNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
+Y LL ++ R +L Y + L I +G + ++IPP + N T+I
Sbjct: 252 ASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMI 311
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVP------IVAPTITL 364
DSG+ FT LV AY +R+ ++VG + + G D C+ +V +
Sbjct: 312 DSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDGDAIEIGRLVGDMVFE 371
Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
G+ + +P++ +L + G + CL M + + + + N+I N QQN + +D+ N R
Sbjct: 372 FEKGVQIVIPKERVLA-TVDGGVHCLGMGRS-ERLGAGGNIIGNFHQQNLWVEFDLANRR 429
Query: 425 LGVARELCT 433
+G C+
Sbjct: 430 VGFGEADCS 438
>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
Length = 429
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 109/414 (26%), Positives = 174/414 (42%), Gaps = 81/414 (19%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNS-------- 138
T + Y++ +G P Q + +DT +D WVPC C+ C + S
Sbjct: 20 TYTDGYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSP 79
Query: 139 -AQSTTFKNLGCQAAQCKQV-----PNPTCGGGACA---------------FNLTYGSST 177
S+ K L C + C + + C CA F+ TYG
Sbjct: 80 SQSSSNMKEL-CGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSGLCTRPCPPFSYTYGGGA 138
Query: 178 IA-ANLSQDTISLATDI--------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLA 227
+ +L++D ++L I VPG+ FGC+ G+S+ P G+ G G+G LSL +
Sbjct: 139 LVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGCV----GSSIREPIGIAGFGKGILSLPS 194
Query: 228 QTQNLYQSTFSYCLPSFKAL---SFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVN 282
Q L FS+C F+ +F+ SL +G + + +TP+LK+ + YY+
Sbjct: 195 QLGFL-DKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPNFYYIG 253
Query: 283 LLAIRVGR-RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV----RDVFRRRV 337
L + +G + PP + G I+D+GT +T L P YTA+ V
Sbjct: 254 LEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLASVILYER 313
Query: 338 GSNLTVTSLGGFDTCYSVPIVA--------PTITLMFSG-MNVTLPQDNLLIHSTAGS-- 386
+L + + GFD C+ +P P I F G + +TLP+D+ TA
Sbjct: 314 SYDLEMRT--GFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAPKNS 371
Query: 387 --ITCLAM--AAAPDNVNSVLN----VIANMQQQNHRILYDVPNSRLGVARELC 432
+ CL D+V N V+ + Q QN ++YD+ R+G + C
Sbjct: 372 VVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDC 425
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 165/362 (45%), Gaps = 43/362 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y R IGTP QT + +DT + +VPC+ C C F S+T++ L C + +
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SME 150
Query: 154 CKQVPNPTCGGG--ACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATG 207
C TC C ++ Y S+ + L +D +S +++ P T FGC TG
Sbjct: 151 C------TCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVETG 204
Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
+ S G++GLGRG LS++ Q + + ++FS C G++ LG I P
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDV--GGGAMVLGGISPPAG 262
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ +T +P RS+ Y ++L I + + + I P GTI+DSGT + L
Sbjct: 263 MVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYLPE 316
Query: 324 PAYTAVRDVFRRRVGSNLTVT--SLGGFDTCYS--------VPIVAPTITLMFS-GMNVT 372
PA+ A +D + + S + D C+S + P + L+FS G ++
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376
Query: 373 L-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
L P++ L HS A CL + N N ++ + +N ++YD + ++G +
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIF---QNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTN 433
Query: 432 CT 433
C+
Sbjct: 434 CS 435
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 158/370 (42%), Gaps = 58/370 (15%)
Query: 91 ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
T Y +G+P + + MDT +D WV C C S+ F+ S T+K L C
Sbjct: 118 FTNGGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASNTYKALTC- 176
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTIAANLS-QDTISLA------TDIVPGYTFGCIQ 203
A ++P L + S +DT+ +A + PG+ FGC
Sbjct: 177 -ADDLRLP----------VLLRLWRRLFHSGRSLRDTLKMAGAASDELEEFPGFVFGCGS 225
Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI----- 258
G G+L L GSLS +Q Y + FSYCL A SL+ P+
Sbjct: 226 LLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTA---QNSLKKSPMVFGEA 282
Query: 259 ---------GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G+P+ ++YTP+ SS+YY V L I VG + +D+ P F
Sbjct: 283 AVELKEPGSGKPQELQYTPI----GESSIYYTVRLDGISVGNQRLDLSPST--FLNGQDK 336
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITL 364
TI DSGT T L + +++ V S ++ G D C+ VP + P IT
Sbjct: 337 PTIFDSGTTLTMLPSGVCDSIKQSLASMV-SGAEFVAIKGLDACFRVPPSSGQGLPDITF 395
Query: 365 MFSGMN--VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
F+G VT P + ++ GS+ CL P N +++ N+QQQ+ +L+D+ N
Sbjct: 396 HFNGGADFVTRPSNYVI---DLGSLQCLIF--VPTN---EVSIFGNLQQQDFFVLHDMDN 447
Query: 423 SRLGVARELC 432
R+G C
Sbjct: 448 RRIGFKETDC 457
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 111/421 (26%), Positives = 183/421 (43%), Gaps = 53/421 (12%)
Query: 51 PLSWEESVLEMLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
P + E + ++ A+D+AR + L SL P+ Y + ++GTP +
Sbjct: 36 PANHEMELSQLKARDEARHGRLLQSLGGVID--FPVDGTFDPFVVGLYYTKLRLGTPPRD 93
Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVP 158
+ +DT +D WV C C GC T F+ S T + C +C Q
Sbjct: 94 FYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSS 153
Query: 159 NPTCG--GGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYT----FGCIQKATG 207
+ C CA+ YG S +++ Q + + + +VP T FGC TG
Sbjct: 154 DSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213
Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
+ V G+ G G+ +S+++Q +Q + FS+CL G L LG I +P
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENG--GGGILVLGEIVEP 271
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ +TPL+ + Y VNLL+I V + + I P F+ + G GTIID+GT L
Sbjct: 272 NMV-FTPLVPSQPH---YNVNLLSISVNGQALPINPSV--FSTSNGQGTIIDTGTTLAYL 325
Query: 322 VAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTITLMFSGMNVTL--P 374
AY + V ++ V S G + CY SV + P ++L F+G P
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKG--NQCYVITTSVGDIFPPVSLNFAGGASMFLNP 383
Query: 375 QDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
QD L+ + G ++ C+ N + ++ ++ ++ +YD+ R+G A C
Sbjct: 384 QDYLIQQNNVGGTAVWCIGFQRIQ---NQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
Query: 433 T 433
+
Sbjct: 441 S 441
>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 111/429 (25%), Positives = 163/429 (37%), Gaps = 87/429 (20%)
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVF 136
R+ +P++ G T S T +Q + + +DT +D W PC C+ C
Sbjct: 70 RQVSLPLSPGSDYTLSFT--------LDSQPIFLYLDTGSDLVWFPCQPFECILCEGKAE 121
Query: 137 NSAQSTT-------------FKNLGCQAAQC---------------KQVPNPTCGGGAC- 167
N++ ++T K+ C AA + + C +C
Sbjct: 122 NTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIETSDCQKHSCP 181
Query: 168 AFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
F YG ++ A L +D+ISL IV +TFGC A P G+ G GRG
Sbjct: 182 QFYYAYGDGSLIARLYRDSISLPLSNPTNLIVNNFTFGCAHTALAE---PIGVAGFGRGV 238
Query: 223 LSLLAQTQNL---YQSTFSYCLPSFKALSF----------------SGSLRLGPIGQPKR 263
LSL AQ L + FSYCL S S R+ + +P R
Sbjct: 239 LSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKP-R 297
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
YT +L N Y V L I +GR+ + P + + G ++DSGT FT L A
Sbjct: 298 FVYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPA 357
Query: 324 PAYTAVRDVFRRRVG----SNLTVTSLGGFDTCY-----SVPIVAPTITLMFSGMNVTLP 374
Y +V F RVG + G CY V + + + + +G +V LP
Sbjct: 358 SLYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLP 417
Query: 375 QDNLLIH--------STAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNS 423
+ N + CL + D + N QQQ ++YD+ N
Sbjct: 418 RRNYFYEFLDGGDGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENK 477
Query: 424 RLGVARELC 432
R+G AR C
Sbjct: 478 RVGFARRQC 486
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 114/415 (27%), Positives = 171/415 (41%), Gaps = 46/415 (11%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIAS--GRQITQ-SPTYIVRAKIGTP 106
KPLS E V+ DQ R +S R S V + G I + Y ++GTP
Sbjct: 62 KPLSRIEDVI---GADQKRHSLISR---KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTP 115
Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCK--------- 155
A+ + +DT ++ WV C VF + +S +FK +GC CK
Sbjct: 116 AKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSL 175
Query: 156 -QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLA-----TDIVPGYTFGCIQKATGN 208
P P+ C+++ Y + A + +++TI++ +PG+ GC TG
Sbjct: 176 TTCPTPST---PCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQ 232
Query: 209 SVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKR-IK 265
S G+LGL S + +LY + FSYCL + S L G K +
Sbjct: 233 SFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFR 292
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
T L R Y +N++ I +G ++DIP ++ T+G GTI+DSGT T L A
Sbjct: 293 RTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQV--WDATSGGGTILDSGTSLTLLADAA 350
Query: 326 YTAVRDVFRRRVGSNLTVTSLG-------GFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
Y V R + V G F + ++V + P +T G P
Sbjct: 351 YKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKL-PQLTFHLKGGARFEPHRKS 409
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ A + CL +A NVI N+ QQN+ +D+ S L A CT
Sbjct: 410 YLVDAAPGVKCLGFVSAG---TPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 161/384 (41%), Gaps = 63/384 (16%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
+P+ SGR Y K+G+P Q + +DT ++ W+ C S
Sbjct: 100 MPMHSGRDDALGE-YFAEVKVGSPGQRFWLVVDTGSEFTWLNC---------------SK 143
Query: 143 TFKNLGCQAAQCKQ----------VPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLA- 190
+F+ + C + +CK P P+ C ++++Y + A D+I++
Sbjct: 144 SFEAVTCASRKCKVDLSELFSLSVCPKPS---DPCLYDISYADGSSAKGFFGTDSITVGL 200
Query: 191 TDIVPG----YTFGCIQKATGNSV----PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL- 241
T+ G T GC K+ N V G+LGLG S + + N Y + FSYCL
Sbjct: 201 TNGKQGKLNNLTIGCT-KSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLV 259
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL------YYVNLLAIRVGRRVVDI 295
S S +L +G K LL RR+ L Y VN++ I +G +++ I
Sbjct: 260 DHLSHRSVSSNLTIGGHHNAK------LLGEIRRTELILFPPFYGVNVVGISIGGQMLKI 313
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT--SLGGFDTCY 353
PP FN GT+IDSGT T L+ PAY AV + + + VT + C+
Sbjct: 314 PPQVWDFNAE--GGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCF 371
Query: 354 SVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
V P + F+G P I A + C+ + P + +VI N+
Sbjct: 372 DAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGI--VPIDGIGGASVIGNI 429
Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
QQNH +D+ + +G A CT
Sbjct: 430 MQQNHLWEFDLSTNTVGFAPSTCT 453
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 114/415 (27%), Positives = 171/415 (41%), Gaps = 46/415 (11%)
Query: 50 KPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIAS--GRQITQ-SPTYIVRAKIGTP 106
KPLS E V+ DQ R +S R S V + G I + Y ++GTP
Sbjct: 40 KPLSRIEDVI---GADQKRHSLISR---KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTP 93
Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCK--------- 155
A+ + +DT ++ WV C VF + +S +FK +GC CK
Sbjct: 94 AKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSL 153
Query: 156 -QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLA-----TDIVPGYTFGCIQKATGN 208
P P+ C+++ Y + A + +++TI++ +PG+ GC TG
Sbjct: 154 TTCPTPST---PCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQ 210
Query: 209 SVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKR-IK 265
S G+LGL S + +LY + FSYCL + S L G K +
Sbjct: 211 SFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFR 270
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
T L R Y +N++ I +G ++DIP ++ T+G GTI+DSGT T L A
Sbjct: 271 RTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQV--WDATSGGGTILDSGTSLTLLADAA 328
Query: 326 YTAVRDVFRRRVGSNLTVTSLG-------GFDTCYSVPIVAPTITLMFSGMNVTLPQDNL 378
Y V R + V G F + ++V + P +T G P
Sbjct: 329 YKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKL-PQLTFHLKGGARFEPHRKS 387
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ A + CL +A NVI N+ QQN+ +D+ S L A CT
Sbjct: 388 YLVDAAPGVKCLGFVSAG---TPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 106/368 (28%), Positives = 161/368 (43%), Gaps = 48/368 (13%)
Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGCSS-TVFNSAQSTTFKNLGCQAAQCKQ------VP 158
P Q + M +DT ++ +W+ C + F+ +S+++ + C + C+ +P
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141
Query: 159 NPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQ--- 213
C L+Y +S+ NL+ + FGC+ +G S P +
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSG-SDPEEDTK 200
Query: 214 --GLLGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGSLRLGPIGQPKRIK 265
GLLG+ RGSLS ++Q + FSYC+ P F L S L P+ I+
Sbjct: 201 TTGLLGMNRGSLSFISQ---MGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIR 257
Query: 266 Y-TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
TPL R + Y V L I+V +++ IP L + T T++DSGT FT L+ P
Sbjct: 258 ISTPLPYFDRVA--YTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGP 315
Query: 325 AYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA---------PTITLMFSGM 369
YTA+R F R LTV G D CY + V PT++L+F G
Sbjct: 316 VYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGA 375
Query: 370 NVTLPQDNLLI---HSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
+ + LL H T G S+ C + D + VI + QQN I +D+ SR
Sbjct: 376 EIAVSGQPLLYRVPHLTVGNDSVYCFTFGNS-DLMGMEAYVIGHHHQQNMWIEFDLQRSR 434
Query: 425 LGVARELC 432
+G+A C
Sbjct: 435 IGLAPVEC 442
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 83/264 (31%), Positives = 131/264 (49%), Gaps = 28/264 (10%)
Query: 50 KPLSWEESVLEMLAKD-------QARL-QFLSSLAVARKSV-VPIASGRQITQSPTYIVR 100
K ++W + L D Q RL + +SS +V + +P+ASG Q+ YIV
Sbjct: 90 KKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQIPLASGVNF-QTLNYIVT 148
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQV 157
++G Q + + +DT +D WV C C+ C + VF + S++++++ C ++ C+ +
Sbjct: 149 MELG--GQDMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSL 206
Query: 158 PNPTCGGGACAFN-------LTYGS-STIAANLSQDTISLATDIVPGYTFGCIQKATGNS 209
T GAC N + YG S L + +S V + FGC + G
Sbjct: 207 QLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLF 266
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR----IK 265
GL+GLGR +LSL++QT + + FSYCLP A + SGSL +G + I
Sbjct: 267 GGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGA-SGSLAMGNESSVFKNLTPIA 325
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVG 289
YT ++ NP+ S+ Y +NL I VG
Sbjct: 326 YTRMVPNPQLSNFYMLNLTGIDVG 349
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 102/399 (25%), Positives = 178/399 (44%), Gaps = 46/399 (11%)
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
+L +D RL+ L +L S + + + Y R IG+P Q + +DT +
Sbjct: 54 VLDRDH-RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTV 112
Query: 121 AWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-S 176
+VPC+ CV C + F S+T++ + C A C N G C + Y S
Sbjct: 113 TYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN-ADCNCDEN----GVQCTYERRYAEMS 167
Query: 177 TIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN--SVPPQGLLGLGRGSLSLLAQT-- 229
T + L++D +S +++VP FGC +G+ + G++GLGRG+LS++ Q
Sbjct: 168 TSSGVLAEDVMSFGKESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVG 227
Query: 230 QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG 289
+ + ++FS C G++ LG I P + ++ +P RS Y + L I V
Sbjct: 228 KGVVSNSFSLCYGGMDV--GGGAMVLGGISSPPGMVFSH--SDPSRSPYYNIELKEIHVA 283
Query: 290 RRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTVT 344
+ L+ NP T G I+DSGT + AY A +D +++ ++
Sbjct: 284 GK-------PLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGP 336
Query: 345 SLGGFDTCYS--------VPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAA 394
D C+S +P V P + ++F+ G ++L P++ L H+ CL +
Sbjct: 337 DPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFK 396
Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
N N ++ + +N + Y+ NS +G + C+
Sbjct: 397 ---NGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 113/444 (25%), Positives = 183/444 (41%), Gaps = 46/444 (10%)
Query: 6 VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
+F L + +F +S + D+ T+++ H SP SP PL E+ +A
Sbjct: 4 IFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMY--NPL---ENHYHRVADT 58
Query: 66 QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAW--- 122
R ++ V PI + R Y+++ +GTP ++ DT +D W
Sbjct: 59 LRRSISHNTGLVTNTVEAPIYNNRG-----EYLMKLSVGTPPFPIIAVADTGSDIIWTQC 113
Query: 123 VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV--PNPTCGGGACAFNLTYGSSTIA- 179
VPCT C +FN ++STT++ + C + C N C ++++YG ++ +
Sbjct: 114 VPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQ 173
Query: 180 ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY 233
+ + DT+++ + P GC G+ G++GLG G SL+ Q +
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAV 233
Query: 234 QSTFSYCLPSF-------KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
FSYCL L+F + + G TP+ + + S Y + L A+
Sbjct: 234 GGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVS----TPIYISDKFKSFYSLKLKAV 289
Query: 287 RVGRRVVDIPPGALQFNPTTG--AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
VGR N G A IIDSGT T L Y + T
Sbjct: 290 SVGRNNTFYSTA----NSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDD 345
Query: 345 SLGGFDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
+ C+ P I + F G N+ L ++N+LI + ++ CLA A A DN
Sbjct: 346 PNQFLEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIR-VSDNVICLAFAGAQDN--- 401
Query: 402 VLNVIANMQQQNHRILYDVPNSRL 425
+++ N+ Q N + YDV N L
Sbjct: 402 DISIYGNIAQINFLVGYDVTNMSL 425
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 157/363 (43%), Gaps = 42/363 (11%)
Query: 95 PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQA 151
P Y+ IGTP Q + + + W C+ C C +FN + S+T++ C
Sbjct: 26 PLYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGT 85
Query: 152 AQCKQVPNPTCGG-GACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-----IQKA 205
A C+ VP TC G G C++ + + DT ++ T FGC I++
Sbjct: 86 ALCESVPASTCSGDGVCSYEVETMFGDTSGIGGTDTFAIGTATA-SLAFGCAMDSNIKQL 144
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPK 262
G S G++GLGR SL+ Q + + FSYCL A +L LG + K
Sbjct: 145 LGAS----GVVGLGRTPWSLVGQ---MNATAFSYCLAPHGAAGKKSALLLGASAKLAGGK 197
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
TPL+ SS Y ++L I+ G ++ PP G+ ++D+ + LV
Sbjct: 198 SAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPP--------NGSVVLVDTIFGVSFLV 249
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-----------SVPIVAPTITLMFSG-MN 370
A+ A++ VG+ T FD C+ S+P+ P + L F G
Sbjct: 250 DAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPL--PDVVLTFQGAAA 307
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+T+P + + G++ M++A N+ + L+++ + Q+N L+D+ L
Sbjct: 308 LTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPA 367
Query: 431 LCT 433
C+
Sbjct: 368 DCS 370
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/400 (25%), Positives = 178/400 (44%), Gaps = 48/400 (12%)
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
+L +D RL+ L +L S + + + Y R IG+P Q + +DT +
Sbjct: 54 VLDRDH-RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTV 112
Query: 121 AWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQA-AQCKQVPNPTCGGGACAFNLTYGS- 175
+VPC+ CV C + F S+T++ + C A C + G C + Y
Sbjct: 113 TYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNADCNCDE------NGVQCTYERRYAEM 166
Query: 176 STIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN--SVPPQGLLGLGRGSLSLLAQT- 229
ST + L++D +S +++VP FGC +G+ + G++GLGRG+LS++ Q
Sbjct: 167 STSSGVLAEDVMSFGKESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLV 226
Query: 230 -QNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
+ + ++FS C G++ LG I P + ++ +P RS Y + L I V
Sbjct: 227 GKGVVSNSFSLCYGGMDV--GGGAMVLGGISSPPGMVFSH--SDPSRSPYYNIELKEIHV 282
Query: 289 GRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SNLTV 343
+ L+ NP T G I+DSGT + AY A +D +++ ++
Sbjct: 283 AGK-------PLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISG 335
Query: 344 TSLGGFDTCYS--------VPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMA 393
D C+S +P V P + ++F+ G ++L P++ L H+ CL +
Sbjct: 336 PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIF 395
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
N N ++ + +N + Y+ NS +G + C+
Sbjct: 396 K---NGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 83/254 (32%), Positives = 122/254 (48%), Gaps = 30/254 (11%)
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF-----KALSFSGSL 253
FGC + G+ + G+LGL SLSL+ Q L FSYCL F L F
Sbjct: 129 FGCGALSAGSLIGATGILGLSPESLSLITQ---LKIQRFSYCLTPFADKKTSPLLFGAMA 185
Query: 254 RLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
L + I+ T ++ NP + YYV L+ I +G + + +P +L P G GTI+D
Sbjct: 186 DLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVD 245
Query: 314 SGTVFTRLVAPAYTAVR----DVFRRRVGSNLTVTSLGGFDTCYSVP----------IVA 359
SG+ LV A+ AV+ DV R V +N TV ++ C+ +P +
Sbjct: 246 SGSTVAYLVEAAFEAVKEAVMDVVRLPV-ANRTVED---YELCFVLPRRTAAAAMEAVQV 301
Query: 360 PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
P + L F G + LP+DN AG + CLA+ D S +++I N+QQQN +L+
Sbjct: 302 PPLVLHFDGGAAMVLPRDNYFQEPRAG-LMCLAVGKTTD--GSGVSIIGNVQQQNMHVLF 358
Query: 419 DVPNSRLGVARELC 432
DV + + A C
Sbjct: 359 DVQHHKFSFAPTQC 372
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 151/385 (39%), Gaps = 58/385 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----------AQSTTFK 145
Y + GTP QT MDT + W PCT CS F + QS++
Sbjct: 92 YSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSN 151
Query: 146 NLGCQAAQCKQVPNP--------------TCGGGACAFNLTYGSSTIAANLSQDTISL-A 190
+GC+ +C + P C + + YG + A L +T+
Sbjct: 152 LIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDFPH 211
Query: 191 TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALS 248
+PG+ GC + P+G+ G GR SL +Q L FSYCL S F
Sbjct: 212 KKTIPGFLVGCSLFSIRQ---PEGIAGFGRSPESLPSQ---LGLKKFSYCLVSHAFDDTP 265
Query: 249 FSGSLRLGPIGQPKRIK-----YTPLLKNPRRS--SLYYVNLLAIRVGRRVVDIPPGALQ 301
S L L K YTP KNP + YYV L I +G V +P L
Sbjct: 266 ASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLV 325
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG---GFDTCYSV--- 355
GTI+DSGT FT + P Y V F ++V T + G C+++
Sbjct: 326 PGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGE 385
Query: 356 -PIVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV------LNVIA 407
+ P F G + LP N +G I CL + + DN++ ++
Sbjct: 386 KSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVI-CLTIVS--DNMSGSGIGGGPAIILG 442
Query: 408 NMQQQNHRILYDVPNSRLGVARELC 432
N QQ+N + +D+ N R G ++ C
Sbjct: 443 NYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 111/421 (26%), Positives = 183/421 (43%), Gaps = 53/421 (12%)
Query: 51 PLSWEESVLEMLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
P + E + ++ A+D+AR + L SL P+ Y + ++G+P +
Sbjct: 36 PANHEMELSQLKARDKARHGRLLQSLGGVID--FPVDGTFDPFVVGLYYTKIRLGSPPRD 93
Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVP 158
+ +DT +D WV C C GC T F+ S T + C +C Q
Sbjct: 94 FYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVSCSDQRCSWGIQSS 153
Query: 159 NPTCG--GGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYT----FGCIQKATG 207
+ C CA+ YG S +++ Q + + + +VP T FGC TG
Sbjct: 154 DSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213
Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
+ V G+ G G+ +S+++Q +Q L FS+CL G L LG I +P
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGENG--GGGILVLGEIVEP 271
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ +TPL+ + Y VNLL+I V + + I P F+ + G GTIID+GT L
Sbjct: 272 NMV-FTPLVPSQPH---YNVNLLSISVNGQALPINPSV--FSTSNGQGTIIDTGTTLAYL 325
Query: 322 VAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTITLMFSGMNVTL--P 374
AY + V ++ V S G + CY SV + P ++L F+G P
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKG--NQCYVIATSVADIFPPVSLNFAGGASMFLNP 383
Query: 375 QDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
QD L+ + G ++ C+ N + ++ ++ ++ +YD+ R+G A C
Sbjct: 384 QDYLIQQNNVGGTAVWCIGFQRIQ---NQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
Query: 433 T 433
+
Sbjct: 441 S 441
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 121/439 (27%), Positives = 183/439 (41%), Gaps = 89/439 (20%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSL----AVARKSVVPIASGR 89
LQ+ HV + L+ E + M + +AR L S R + P+ G
Sbjct: 26 LQLSHV-------DAGRGLTHWELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGA 78
Query: 90 QITQSP--TYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQST 142
P Y+V GTP Q + + +DT +D W C + C + +F+ + S+
Sbjct: 79 YDDGFPFTEYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASS 138
Query: 143 TFKNLGCQAAQCKQVPNPTCGGG------ACAFNLTYGSSTIA-ANLSQDTISLATDI-- 193
+F +L C + C+ P CGGG C ++++YG +++ + ++ + A+
Sbjct: 139 SFASLPCSSPACETT--PPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGE 196
Query: 194 -----VPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKAL 247
VPG FGC G + G+ G GRGSLSL +Q L FS+C +
Sbjct: 197 GSSAAVPGLVFGCGHANRGVFTSNETGIAGFGRGSLSLPSQ---LKVGNFSHCFTTITGS 253
Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
S L LG P + P S L GRR G+ + T
Sbjct: 254 KTSAVL-LG----------LPGVAPPSASPL----------GRRR-----GSYRCRSTPR 287
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSVPIVA-----P 360
+ +SGT T L Y AVR+ F +V L V D TC+S P+ P
Sbjct: 288 SS---NSGTSITSLPPRTYRAVREEFAAQV--KLPVVPGNATDPFTCFSAPLRGPKPDVP 342
Query: 361 TITLMFSGMNVTLPQDNLLIH----STAGS---ITCLAMAAAPDNVNSVLNVIANMQQQN 413
T+ L F G + LPQ+N + AG+ I CLA+ + ++ N+QQQN
Sbjct: 343 TMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAV------IEGGEIILGNIQQQN 396
Query: 414 HRILYDVPNSRLGVARELC 432
+LYD+ NS+L C
Sbjct: 397 MHVLYDLQNSKLSFVPAQC 415
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 163/361 (45%), Gaps = 40/361 (11%)
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQ 150
Q+ Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C
Sbjct: 78 QTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCG 136
Query: 151 AAQC-KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQ 203
+ C +P C C F ++Y + + L QDT++ + +P +TFGC
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNL 196
Query: 204 KATGNSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLG 256
+ G + GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG
Sbjct: 197 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLG 255
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+ ++YT ++ + + L++V+L AI V + + P + G + DSG+
Sbjct: 256 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGS 310
Query: 317 VFT----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-S 367
+ R ++ +R++ RR + CY + V P I+L F
Sbjct: 311 ELSYIPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDD 365
Query: 368 GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
G L + + + +A AP +++I ++ Q + ++YD+ +G+
Sbjct: 366 GARFDLGSHGVFVERSVQEQDVWCLAFAP---TESVSIIGSLMQTSKEVVYDLKRQLIGI 422
Query: 428 A 428
Sbjct: 423 G 423
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 130/298 (43%), Gaps = 29/298 (9%)
Query: 149 CQAAQCKQVPNPTCGG------GACAFNLTYGSSTIAANLSQ-DTISL-ATDIVPGYTFG 200
C + C+ + +CG C + Y ++ L + D + A VPG FG
Sbjct: 38 CDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDKFTFGAGASVPGVAFG 97
Query: 201 CIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
C G + G+ G GRG LSL +Q L FS+C + L S L P
Sbjct: 98 CGLFNNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKQSTVLLDLPAD 154
Query: 260 QPKR----IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
K ++ TPL++N + YY++L I VG + +P A TG GTIIDSG
Sbjct: 155 LYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFALTNGTG-GTIIDSG 213
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV 371
T T L Y VRD F ++ + + G TC+S P A P + L F G +
Sbjct: 214 TSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATM 273
Query: 372 TLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
LP++N + SI CLA+ N +I N QQQN +LYD+ N G
Sbjct: 274 DLPRENYVFEVPDDAGNSIICLAI-----NKGDETTIIGNFQQQNMHVLYDLQNMHRG 326
>gi|242086416|ref|XP_002443633.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
gi|241944326|gb|EES17471.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
Length = 503
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 116/441 (26%), Positives = 196/441 (44%), Gaps = 58/441 (13%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQF--------LSSLAVARKSVVPI 85
L + H SPCSP L+ + + + + R SSLAV +++P
Sbjct: 79 LPIVHQQSPCSPLHGLPSLTAADGLHHDASLIRRRFSSKSSPVAPPASSLAV---TIIPT 135
Query: 86 ASGRQITQSPT---YIVRAKIGTPAQTLLMAMDTSN-DAAWVPCTGCVGCSST---VFNS 138
T+ P Y V GTP Q + +DTS+ + + C C S F++
Sbjct: 136 NGSSDPTRKPVTLQYSVLVSYGTPEQQFPVLLDTSSIGMSLLRCKPCASGSDDCHLAFDT 195
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGA----CAFNLTYGSSTIAANLSQDTISLA--TD 192
++S+TF ++ C + C P G G C + TY S I ++D ++LA +
Sbjct: 196 SRSSTFAHVLCGSPDC---PTNCSGDGDGDSFCPLDSTY--SIIDGAFAEDVLTLAPSSK 250
Query: 193 IVPGYTFGCIQ-KATGNSVPPQGLLGLGRG-SLSLLAQTQNLYQST--FSYCLPSFKALS 248
+ + F C+ + +P G L L R + + + Q+T FSYCLP K+ S
Sbjct: 251 AIENFRFVCLDVDEPDDDLPVAGTLDLSRDRNSLPSQLSSSPGQATAAFSYCLP--KSPS 308
Query: 249 FSGSLRLG---PIGQPKRIKYTPLLKN---PRRSSLYYVNLLAIRVGRRVVDIPP-GALQ 301
G L L + K + PL+ N P +S+Y+++L+ + +G + IPP G+
Sbjct: 309 SQGYLSLAVDATVRHDKVTAHAPLVSNGGDPELASMYFIDLVGMSLGVDDIPIPPAGSFG 368
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSV----P 356
N G +D GT FT+L Y +RD FR+++ +N ++ GFDTC+++
Sbjct: 369 NN-----GVNLDLGTTFTKLTPEVYMTLRDSFRKQMSQNNHSLLGFDGFDTCFNLTGVRD 423
Query: 357 IVAPTITLMFS-GMNVTLPQDNLLIHSTAG----SITCLAMAAAPDNVNSVLNVIANMQQ 411
+ P + FS G + + D +L + ++ CLA ++ D +S VI
Sbjct: 424 LAMPLLWFKFSNGERLLIDLDQMLYYDDPAAAPFTMACLAFSSL-DAGDSFSAVIGTHTL 482
Query: 412 QNHRILYDVPNSRLGVARELC 432
+ ++YDV ++G C
Sbjct: 483 ASTEVIYDVAGGKVGFIPRSC 503
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 110/329 (33%), Positives = 153/329 (46%), Gaps = 37/329 (11%)
Query: 112 MAMDTSNDAAWVPCTGCVGCSST------VFNSAQSTTFKNLGCQAAQCKQV---PNPTC 162
M +DT +D +WV C C S +F+ AQS+++ + C C + C
Sbjct: 1 MEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASAC 60
Query: 163 GGGACAFNLTYG-SSTIAANLSQDTISL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGR 220
C + ++YG S S DT++L A+ V G+ FGC +G GLLGLGR
Sbjct: 61 SAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGR 120
Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL---GPIGQPKRIKYTPLLKNPRRSS 277
SL+ QT Y FSYCLP+ S +G L L GP G T LL +P +
Sbjct: 121 EQPSLVEQTAGTYGGVFSYCLPTKP--STAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPT 178
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
Y V L I VG + + +P A T++D+GTV TRL AY A+R FR +
Sbjct: 179 YYVVMLTGISVGGQQLSVPASAFAGG------TVVDTGTVVTRLPPTAYAALRSAFRSGM 232
Query: 338 GSNL--TVTSLGGFDTCYSVP----IVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCL 390
S T S G DTCY+ + P + L F SG VTL D +L S CL
Sbjct: 233 ASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL------SFGCL 286
Query: 391 AMAAAPDNVNSVLNVIANMQQQNHRILYD 419
A AP + + ++ N+QQ++ + D
Sbjct: 287 AF--APSGSDGGMAILGNVQQRSFEVRID 313
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 165/372 (44%), Gaps = 61/372 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
Y +GTPA + L+A+DT +D WVPC C+ C+ ++ A+STT
Sbjct: 66 YYAWVDVGTPATSFLVALDTGSDLFWVPCD-CIQCAPLSGYRGNLDRDLRIYRPAESTTS 124
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISL--ATDIVP---GY 197
++L C C+ VP T C +N+ Y S ++ L +DT+ L D VP
Sbjct: 125 RHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 184
Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
GC QK +G + + P GLLGLG +S+ LA+ L Q++FS C FK S SG
Sbjct: 185 IIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARA-GLVQNSFSMC---FKEDS-SG 239
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G G P + + TP + + Y VN+ +G + ++ T +
Sbjct: 240 RIFFGDQGVPSQ-QSTPFVPLYGKLQTYAVNVDKSCIGHKCLE----------GTSFKAL 288
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
+DSGT FT L Y A F +++ + + CYS +P V PTITL F
Sbjct: 289 VDSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDV-PTITLTF 347
Query: 367 S------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ +N LP + G++ +A P + +IA + +++D
Sbjct: 348 AADKSLQAVNPILP-----FNDKQGALAGFCLAVLPS--TEPIGIIAQNFLVGYHVVFDR 400
Query: 421 PNSRLGVARELC 432
+ +LG R C
Sbjct: 401 ESMKLGWYRSEC 412
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 165/372 (44%), Gaps = 61/372 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
Y +GTPA + L+A+DT +D WVPC C+ C+ ++ A+STT
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCD-CIQCAPLSGYRGNLDRDLRIYRPAESTTS 154
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISL--ATDIVP---GY 197
++L C C+ VP T C +N+ Y S ++ L +DT+ L D VP
Sbjct: 155 RHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214
Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
GC QK +G + + P GLLGLG +S+ LA+ L Q++FS C FK S SG
Sbjct: 215 IIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARA-GLVQNSFSMC---FKEDS-SG 269
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G G P + + TP + + Y VN+ +G + ++ T +
Sbjct: 270 RIFFGDQGVPSQ-QSTPFVPLYGKLQTYAVNVDKSCIGHKCLE----------GTSFKAL 318
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
+DSGT FT L Y A F +++ + + CYS +P V PTITL F
Sbjct: 319 VDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDV-PTITLTF 377
Query: 367 S------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ +N LP + G++ +A P + +IA + +++D
Sbjct: 378 AADKSLQAVNPILP-----FNDKQGALAGFCLAVLPS--TEPIGIIAQNFLVGYHVVFDR 430
Query: 421 PNSRLGVARELC 432
+ +LG R C
Sbjct: 431 ESMKLGWYRSEC 442
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 158/375 (42%), Gaps = 36/375 (9%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
+P++SG + + Y V+ ++GTP Q + DT +D WV C G VF S
Sbjct: 103 LPMSSG-AYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAG-ASPPGRVFRPKTSR 160
Query: 143 TFKNLGCQAAQCK-QVP----NPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPG- 196
++ + C + CK VP N + C ++ Y + A T S AT +PG
Sbjct: 161 SWAPIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTES-ATIALPGG 219
Query: 197 -------YTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-L 247
GC G S G+L LG +S Q + +FSYCL A
Sbjct: 220 KVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPR 279
Query: 248 SFSGSLRLGPIGQPKRI--KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
+ +G L GP GQ R T L +P Y V + AI V + +DIP A ++
Sbjct: 280 NATGYLAFGP-GQVPRTPATQTKLFLDPEM-PFYGVKVDAIHVAGKALDIP--AEVWDAK 335
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------VPIV 358
+G G I+DSG T L APAY AV + + + S F+ CY+ P +
Sbjct: 336 SG-GVILDSGNTLTVLAAPAYKAVVAALSKHL-DGVPKVSFPPFEHCYNWTARRPGAPEI 393
Query: 359 APTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
P + + F+G P + + C+ + + L+VI N+ QQ H +
Sbjct: 394 IPKLAVQFAGSARLEPPAKSYVIDVKPGVKCIGVQ---EGEWPGLSVIGNIMQQEHLWEF 450
Query: 419 DVPNSRLGVARELCT 433
D+ N ++ + CT
Sbjct: 451 DLKNMQVRFKQSNCT 465
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 151/371 (40%), Gaps = 45/371 (12%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCT---GCVGCSST--------VFNSAQSTTFKNLGCQA 151
GTP Q L +DT +D W PCT C CS + +F+ S++ K L C+
Sbjct: 84 FGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRN 143
Query: 152 AQC-------KQVPNPTCGGG------ACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
+C + P C G AC ++ YG+ + + + + +
Sbjct: 144 PKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYFLLENLKFPRKTIRNFL 203
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLG 256
GC A + L G GR SL Q + F+YCL S + SG L L
Sbjct: 204 LGCTTSA-ARELSSDALAGFGRSMFSLPIQ---MGVKKFAYCLNSHDYDDTRNSGKLILD 259
Query: 257 -PIGQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
G+ K + YTP LK+P S+ YY + + I++G +++ IP L +G IIDS
Sbjct: 260 YRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDS 319
Query: 315 GTVFT-RLVAPAYTAVRDVFRRRVGS---NLTVTSLGGFDTCYSVP-----IVAPTITLM 365
G + P + V + ++++ +L + G CY+ + P I
Sbjct: 320 GYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQF 379
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN----VIANMQQQNHRILYDVP 421
G N+ +P N S S+ C M N + ++ N Q ++ + YD+
Sbjct: 380 RGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLK 439
Query: 422 NSRLGVARELC 432
N R G R+ C
Sbjct: 440 NDRFGFRRQTC 450
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 110/429 (25%), Positives = 163/429 (37%), Gaps = 87/429 (20%)
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVF 136
R+ +P++ G T S T +Q + + +DT +D W PC C+ C
Sbjct: 70 RQVSLPLSPGSDYTLSFT--------LDSQPIFLYLDTGSDLVWFPCQPFECILCEGKAE 121
Query: 137 NSAQSTT-------------FKNLGCQAAQC---------------KQVPNPTCGGGAC- 167
N++ ++T K+ C AA + + C +C
Sbjct: 122 NTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIETSDCQKHSCP 181
Query: 168 AFNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
F YG ++ A L +D+ISL IV +TFGC A P G+ G GRG
Sbjct: 182 QFYYAYGDGSLIARLYRDSISLPLSNPTNLIVNNFTFGCAHTALAE---PIGVAGFGRGV 238
Query: 223 LSLLAQTQNL---YQSTFSYCLPSFKALSF----------------SGSLRLGPIGQPKR 263
LSL AQ L + FSYCL S S R+ + +P R
Sbjct: 239 LSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKP-R 297
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
YT +L N Y V L I +GR+ + P + + G ++DSGT FT L A
Sbjct: 298 FVYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPA 357
Query: 324 PAYTAVRDVFRRRVG----SNLTVTSLGGFDTCY-----SVPIVAPTITLMFSGMNVTLP 374
Y +V F RVG + G CY V + + + + +G +V LP
Sbjct: 358 SLYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLP 417
Query: 375 QDNLLIH--------STAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNS 423
+ N + CL + + + N QQQ ++YD+ N
Sbjct: 418 RRNYFYEFLDGGDGKGKKRKVGCLMLMNGGEEAELSGGPGATLGNYQQQGFEVVYDLENK 477
Query: 424 RLGVARELC 432
R+G AR C
Sbjct: 478 RVGFARRQC 486
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/372 (28%), Positives = 166/372 (44%), Gaps = 61/372 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
Y +GTPA + L+A+DT +D WVPC C+ C+ ++ A+STT
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCD-CIQCAPLSGYRGNLDRDLRIYRPAESTTS 154
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS--STIAANLSQDTISL--ATDIVP---GY 197
++L C C+ VP T C +N+ Y S +T + L +DT+ L D VP
Sbjct: 155 RHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214
Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
GC QK +G + + P GLLGLG +S+ LA+ L Q++FS C FK S SG
Sbjct: 215 IIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARA-GLVQNSFSMC---FKEDS-SG 269
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G G P + + TP + + Y VN+ +G + ++ T +
Sbjct: 270 RIFFGDQGVPSQ-QSTPFVPLYGKLQTYAVNVDKSCIGHKCLE----------GTSFKAL 318
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
+DSGT FT L Y A F +++ + + CYS +P V PTITL F
Sbjct: 319 VDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDV-PTITLTF 377
Query: 367 S------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ +N LP + G++ +A P + +IA + +++D
Sbjct: 378 AADKSLQAVNPILP-----FNDKQGALAGFCLAVLPS--TEPIGIIAQNFLVGYHVVFDR 430
Query: 421 PNSRLGVARELC 432
+ +LG R C
Sbjct: 431 ESMKLGWYRSEC 442
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 135/304 (44%), Gaps = 28/304 (9%)
Query: 149 CQAAQCKQVPNPTCGG------GACAFNLTYGSSTIAANLSQ-DTISL-ATDIVPGYTFG 200
C + C+ + +CG C + Y ++ L + D + A VPG FG
Sbjct: 190 CDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDKFTFGAGASVPGVAFG 249
Query: 201 CIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLR--LGP 257
C G + G+ G GRG LSL +Q L FS+C + L S L L
Sbjct: 250 CGLFNNGVFKSNETGIAGFGRGPLSLPSQ---LKVGNFSHCFTAVNGLKQSTVLLDLLAD 306
Query: 258 IGQPKR--IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
+ + R ++ TPL++N +LYY++L I VG + +P A TG GTIIDSG
Sbjct: 307 LYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALTNGTG-GTIIDSG 365
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV 371
T T L Y VRD F ++ + + G TC+S P A P + L F G +
Sbjct: 366 TSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATM 425
Query: 372 TLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
LP++N + S+ CLA+ D + I N QQQN +LYD+ N+ L
Sbjct: 426 DLPRENYVFEVPDDAGNSMICLAINELGDERAT----IGNFQQQNMHVLYDLQNNMLSFV 481
Query: 429 RELC 432
C
Sbjct: 482 AAQC 485
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 62/140 (44%), Gaps = 13/140 (9%)
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
I VG + +P A TG GTIIDSGT T L Y VRD F ++ +
Sbjct: 41 GITVGSTRLPVPESAFALTNGTG-GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPG 99
Query: 345 SLGGFDTCYSVPIVA----PTITLMFSGMNVTLPQDNLLIH---STAGSITCLAMAAAPD 397
+ G TC+S P A P + L F G + LP++N + SI CLA+
Sbjct: 100 NATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAI----- 154
Query: 398 NVNSVLNVIANMQQQNHRIL 417
N +I N QQQN L
Sbjct: 155 NKGDETTIIGNFQQQNMHAL 174
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/404 (25%), Positives = 162/404 (40%), Gaps = 60/404 (14%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-------- 134
+P++SG T + Y VR ++GTPAQ ++ DT +D WV C G S
Sbjct: 97 MPLSSG-AYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAA 155
Query: 135 ----------VFNSAQSTTFKNLGCQAAQCK-----QVPNPTCGGGACAFNLTYGSSTIA 179
VF S T+ + C + CK + N + AC+++ Y ++ A
Sbjct: 156 APSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAA 215
Query: 180 AN-LSQDTISLATDI-------------VPGYTFGCIQKATGNSVPP-QGLLGLGRGSLS 224
+ D+ ++A + G GC G G+L LG ++S
Sbjct: 216 RGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNIS 275
Query: 225 LLAQTQNLYQSTFSYCLPSFKA-------LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSS 277
++ + + FSYCL A L+F P TPLL + R
Sbjct: 276 FASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRP 335
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
Y V + ++ V +DIP A ++ + GTIIDSGT T L PAY AV ++
Sbjct: 336 FYAVAVDSVSVDGVALDIP--AEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQL 393
Query: 338 GSNLTVTSLGGFDTCYSV--------PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITC 389
+ L ++ FD CY+ + P + + F+G P + A + C
Sbjct: 394 -AGLPRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKC 452
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ + + ++VI N+ QQ H +D+ N L + CT
Sbjct: 453 IGVQ---EGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCT 493
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/367 (29%), Positives = 162/367 (44%), Gaps = 50/367 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
++V IG+P T L+ MDT++D W+ C C+ C S +F+ ++S T +N C+ +Q
Sbjct: 85 FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144
Query: 154 CKQVPNPTCGGG--ACAFNLTY----GSSTIAA------NLSQDTISLAT--DIVPGYTF 199
+P+ +C +++ Y GS I A N D S A D+V F
Sbjct: 145 -YSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVV----F 199
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF-SGSLRLGPI 258
GC G + G+LGLG G SL+ + + + FSYC S S+ L LG
Sbjct: 200 GCGHDNYGEPLVGTGILGLGYGEFSLVHR----FGTKFSYCFGSLDDPSYPHNVLVLGDD 255
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDSGTV 317
G TPL + YYV + AI V ++ I P N TG GTIID+G
Sbjct: 256 GANILGDTTPL---EIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNS 312
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD----TCYS-------VPIVAPTITLMF 366
T LV AY +++ T + D CY+ V P +T F
Sbjct: 313 LTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHF 372
Query: 367 S-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
S G ++L ++ + + ++ CLA+ P N+NS I QQ++ I YD+ ++
Sbjct: 373 SDGAELSLDVKSVFM-KLSPNVFCLAV--TPGNMNS----IGATAQQSYNIGYDLEAKKI 425
Query: 426 GVARELC 432
R C
Sbjct: 426 SFERIDC 432
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 168/357 (47%), Gaps = 30/357 (8%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTG--CVGCSS-TVFNSAQSTTFKNLGCQAAQCKQVP- 158
+GTP Q L + + +WV C+ + C++ ++F ST+ L C + C
Sbjct: 5 LGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCSAFSA 64
Query: 159 -NPTCG-GGACAFNLTYGSS-TIAANLSQDTISLAT----DIVPGYTFGCIQKATG--NS 209
+ +CG +C++N +YG++ + A +L D ++ + + + GC + + G
Sbjct: 65 VSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCGRDSGGLLEL 124
Query: 210 VPPQGLLGLGRGSLSLLAQTQNL-YQSTFSYCLPS--FKALSFSGSLRLGPIGQPKRIKY 266
+ G +G +G++S + Q L Y+S F YCLPS F+ G+ +L + Y
Sbjct: 125 LDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDTFRGKLVIGNYKLRNASISSSMAY 184
Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
TP++ NP+ + LY++NL I + + +P N T GT+ID+ T + L + Y
Sbjct: 185 TPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGT--GGTVIDTTTFLSYLTSDFY 242
Query: 327 T----AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAP-----TITLMF-SGMNVTLPQD 376
T A+++ V + +V G + CY++ + T+T F G V +
Sbjct: 243 TQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPATLTYHFLGGAGVEVSTW 302
Query: 377 NLLIHS-TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LL S + + C+A+ + ++V LNVI QQ + + YD+ R G + C
Sbjct: 303 FLLDDSDSVNNTICMAIGRS-ESVGPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGC 358
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 163/383 (42%), Gaps = 40/383 (10%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-------- 134
+P+ SG T + Y VR ++GTPAQ ++ DT +D WV C+ SS+
Sbjct: 91 MPLTSG-AYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQR 149
Query: 135 VFNSAQSTTFKNLGCQAAQCKQ-VP----NPTCGGGACAFNLTYGSSTIA---ANLSQDT 186
VF A S ++ L C + CK VP N + C+++ Y ++ A L T
Sbjct: 150 VFRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSAT 209
Query: 187 ISLATD------IVPGYTFGCIQKATGNSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
+SL+ + + GC G S G+L LG ++S ++ + + FSY
Sbjct: 210 VSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSY 269
Query: 240 CLPSFKA-------LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
CL A L+F R LL++ R Y+V++ A+ V
Sbjct: 270 CLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGER 329
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
++I P F GA I+DSGT T L PAY AV ++ + + ++ F+ C
Sbjct: 330 LEILPDVWDFRKNGGA--ILDSGTSLTILATPAYDAVVKAISKQF-AGVPRVNMDPFEYC 386
Query: 353 YS---VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
Y+ V P + L F+G P + TA + C+ + + ++VI N+
Sbjct: 387 YNWTGVSAEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVV---EGAWPGVSVIGNI 443
Query: 410 QQQNHRILYDVPNSRLGVARELC 432
QQ H +D+ N L + C
Sbjct: 444 LQQEHLWEFDLANRWLRFKQSRC 466
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 112/444 (25%), Positives = 182/444 (40%), Gaps = 46/444 (10%)
Query: 6 VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKD 65
+F L + +F +S + D+ T+++ H SP SP PL E+ +A
Sbjct: 4 IFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMY--NPL---ENHYHRVADT 58
Query: 66 QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV-- 123
R ++ V PI + R Y+++ +GTP ++ DT +D W
Sbjct: 59 LRRSISHNTGLVTNTVEAPIYNNRG-----EYLMKLSVGTPPFPIIAVADTGSDIIWTQC 113
Query: 124 -PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK--QVPNPTCGGGACAFNLTYGSSTIA- 179
PCT C +FN ++STT++ + C + C N C ++++YG ++ +
Sbjct: 114 EPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQ 173
Query: 180 ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY 233
+ + DT+++ + P GC G+ G++GLG G SL+ Q +
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAV 233
Query: 234 QSTFSYCLPSF-------KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAI 286
FSYCL L+F + + G TP+ + + S Y + L A+
Sbjct: 234 GGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVS----TPIYISDKFKSFYSLKLKAV 289
Query: 287 RVGRRVVDIPPGALQFNPTTG--AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
VGR N G A IIDSGT T L Y + T
Sbjct: 290 SVGRNNTFYSTA----NSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDD 345
Query: 345 SLGGFDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
+ C+ P I + F G N+ L ++N+LI + ++ CLA A A DN
Sbjct: 346 PNQFLEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIR-VSDNVICLAFAGAQDN--- 401
Query: 402 VLNVIANMQQQNHRILYDVPNSRL 425
+++ N+ Q N + YDV N L
Sbjct: 402 DISIYGNIAQINFLVGYDVTNMSL 425
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 139/312 (44%), Gaps = 38/312 (12%)
Query: 149 CQAAQCKQVPNPTCGG-GACAFNLTYGSSTIAANL-SQDTISLATDI--------VPGYT 198
C C + + +C C + YG T+ + + + + A+ VP
Sbjct: 3 CAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP-LG 61
Query: 199 FGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS----GSLR 254
FGC G+ G++G GR LSL++Q L FSYCL S+ + S GSL
Sbjct: 62 FGCGSVNVGSLNNGSGIVGFGRNPLSLVSQ---LSIRRFSYCLTSYASRRQSTLLFGSLS 118
Query: 255 LGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
G G R++ TPLL++P+ + YYV+ + VG R + IP A P G I+D
Sbjct: 119 DGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVD 178
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD--TCYSVPIV-----------AP 360
SGT T L A V FR+++ L + G + C+ VP P
Sbjct: 179 SGTALTLLPAAVLAEVVRAFRQQL--RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVP 236
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ L F G ++ LP+ N ++ CL +A + D+ ++ I N+ QQ+ R+LYD+
Sbjct: 237 RMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGST----IGNLVQQDMRVLYDL 292
Query: 421 PNSRLGVARELC 432
L +A C
Sbjct: 293 EAETLSIAPARC 304
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 166/356 (46%), Gaps = 34/356 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--FNSAQSTTFKNLGCQAAQC 154
++ IG P L+ +DT +D W+ C C T+ F+ ++S+T++N C++A
Sbjct: 88 FLANISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYPQTIPFFHPSRSSTYRNASCESAP- 146
Query: 155 KQVPN--PTCGGGACAFNLTYGS-STIAANLSQDTISLATDIV-----PGYTFGCIQKAT 206
+P G C ++L Y S L+++ ++ T P FGC Q +
Sbjct: 147 HAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNS 206
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY 266
G + G+LGLG G+ S++ + + S FSYC S ++ + + +G RI+
Sbjct: 207 GFT-QYSGVLGLGPGTFSIVTRN---FGSKFSYCFGSLIDPTYPHNFLI--LGNGARIEG 260
Query: 267 --TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
TPL R YY++L AI +G +++DI PG Q + G GT+ID+G T L
Sbjct: 261 DPTPLQIFQDR---YYLDLQAISLGEKLLDIEPGIFQRYRSKG-GTVIDTGCSPTILARE 316
Query: 325 AYTAVRDVFRRRVGSNL-TVTSLGGF-DTCYSVPIVA-----PTITLMFS-GMNVTLPQD 376
AY + + +G L V + + CY + P +T F+ G + L +
Sbjct: 317 AYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAELALDVE 376
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+L + S +G CLAM N ++VI M QQN+ + Y++ ++ R C
Sbjct: 377 SLFVSSESGDSFCLAMTM---NTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/334 (29%), Positives = 144/334 (43%), Gaps = 43/334 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTVFNSAQSTTF--------K 145
Y V GTP+QTL MDT + W PCT C CS + A+ TF K
Sbjct: 106 YSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAK 165
Query: 146 NLGCQAAQCKQVPN----PTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC 201
+GC +C V + C + + YG T L +++ A P + GC
Sbjct: 166 IVGCLNPKCGFVMDSENSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGC 225
Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK----ALSFSGSLRLGP 257
+ +S P G+ G GRG SL Q + FSYCL S + S +L +GP
Sbjct: 226 ---SILSSRQPSGIAGFGRGPSSLPKQ---MGLKKFSYCLLSHRFDDSPKSSKMTLYVGP 279
Query: 258 IGQPKR---IKYTPLLKNPRRSS-----LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
+ + + YTP KNP S+ YYV L I VG + V +P + G
Sbjct: 280 DSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGG 339
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV----PIVAPT 361
TI+DSG+ FT + P + AV F R++ +N T V +L G C+++ + P+
Sbjct: 340 TIVDSGSTFTFMEKPVFEAVATEFDRQM-ANYTRAADVEALSGLKPCFNLSGVGSVALPS 398
Query: 362 ITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAA 394
+ F G + LP N S+ CL + +
Sbjct: 399 LVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVS 432
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/438 (25%), Positives = 167/438 (38%), Gaps = 67/438 (15%)
Query: 57 SVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSP----------TYIVRAKIGTP 106
S+ ++ D+ R+ F++S R S + P Y VR ++GTP
Sbjct: 44 SLADLARSDRQRMAFIASHGRRRARETAAGSSAAAFEMPLTSGAYTGIGQYFVRFRVGTP 103
Query: 107 AQTLLMAMDTSNDAAWVPC-------TGCVGCSSTVFNSAQSTTFKNLGCQAAQC-KQVP 158
AQ L+ DT +D WV C + S F S T+ + C + C K +P
Sbjct: 104 AQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTCTKSLP 163
Query: 159 N--PTC--GGGACAFNLTYGSSTIA---ANLSQDTISLA-------TDIVPGYTFGCIQK 204
TC G CA++ Y + A TI+L+ + G GC
Sbjct: 164 FSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLKGLVLGCTSS 223
Query: 205 ATGNSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP----- 257
TG S G+L LG +S + + + FSYCL + + L GP
Sbjct: 224 YTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSYLTFGPNPAVA 283
Query: 258 -----------------IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGAL 300
R + TPLL + R Y V + A+ V + + IP
Sbjct: 284 SSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVAGQFLKIPRAVW 343
Query: 301 QFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-----SV 355
+ G G I+DSGT T L PAY AV + + L ++ F+ CY S
Sbjct: 344 DVD--AGGGVILDSGTSLTVLAKPAYRAVVAALSEGL-AGLPRVTMDPFEYCYNWTSPSG 400
Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
+ P + + F+G P + A + C+ + P ++VI N+ QQ H
Sbjct: 401 DVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGP---WPGISVIGNILQQEHL 457
Query: 416 ILYDVPNSRLGVARELCT 433
+D+ N RL R CT
Sbjct: 458 WEFDIKNRRLKFQRSRCT 475
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 154/353 (43%), Gaps = 47/353 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-----CVGCSSTVFNSAQSTTFKNLGCQA 151
Y + +GTP Q L DT +D W C G C S + S+TF L C
Sbjct: 91 YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150
Query: 152 AQCKQVPNPT-----CGGGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYTFGC 201
C + + + G C + +YG L+++T +L D VP FGC
Sbjct: 151 RLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGADAVPSVRFGC 210
Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ- 260
+ G GL+GLGRG LSL++Q L STF YCL S S + L G +
Sbjct: 211 TTASEGGYGSGSGLVGLGRGPLSLVSQ---LNASTFMYCLTS--DASKASPLLFGSLASL 265
Query: 261 -PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
+++ T LL + ++ Y VNL +I +G PG + G + DSGT T
Sbjct: 266 TGAQVQSTGLLAS---TTFYAVNLRSISIGSATT---PGVGE-----PEGVVFDSGTTLT 314
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPTITLMFSGMNVT 372
L PAY+ + F + + V GF+ C+ P PT+ L F G ++
Sbjct: 315 YLAEPAYSEAKAAFLSQTSLD-QVEDTDGFEACFQKPANGRLSNAAVPTMVLHFDGADMA 373
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
LP N ++ G + C + +P L++I N+ Q N+ +L+DV S L
Sbjct: 374 LPVANYVVEVEDG-VVCWIVQRSPS-----LSIIGNIMQVNYLVLHDVHRSVL 420
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 119/438 (27%), Positives = 184/438 (42%), Gaps = 69/438 (15%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLA 63
L+FF F F+ SLS LN + TL++ H S SPF +P++ + E + +
Sbjct: 9 LLFFTIFCFIISLSHALN-------NGFTLELIHRDSSKSPFYQPTQ--NKYERIANAVR 59
Query: 64 KDQARLQ--FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAA 121
+ R+ + SL +S V G Y++ IGTP + +DT +D
Sbjct: 60 RSINRVNHFYKYSLTSTPQSTVNSDKGE-------YLMSYSIGTPPFKVFGFVDTGSDLV 112
Query: 122 WVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTI 178
W+ C C C + +F+ + S++++N+ C + C + +C +
Sbjct: 113 WLQCEPCKQCYPQITPIFDPSLSSSYQNIPCLSDTCHSMRTTSC--------------DV 158
Query: 179 AANLSQDTISLATDIVPGYT-------FGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQ 230
LS +T++L D GY+ GC + TG P G++GLG G +SL +Q
Sbjct: 159 RGYLSVETLTL--DSTTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLG 216
Query: 231 NLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
FSYCL + S S L G I TP++K +S YY+ L A V
Sbjct: 217 TSIGGKFSYCLGPWLPNSTS-KLNFGDAAIVYGDGAMTTPIVKKDAQSG-YYLTLEAFSV 274
Query: 289 GRRVVDIPPGALQFNPTTGAGT---IIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS 345
G ++++ PT G +IDSGT FT L Y +
Sbjct: 275 GNKLIEFG------GPTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDP 328
Query: 346 LGGFDTCYSVP---IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV 402
G F CY+V AP IT F G ++ L + I + G I CLA + S
Sbjct: 329 NGTFKLCYNVAYHGFEAPLITAHFKGADIKLYYISTFIKVSDG-IACLAF------IPSQ 381
Query: 403 LNVIANMQQQNHRILYDV 420
+ N+ QQN + Y++
Sbjct: 382 TAIFGNVAQQNLLVGYNL 399
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 161/368 (43%), Gaps = 48/368 (13%)
Query: 106 PAQTLLMAMDTSNDAAWVPCTGCVGCSS-TVFNSAQSTTFKNLGCQAAQCKQ------VP 158
P Q + M +DT ++ +W+ C + F+ +S+++ + C + C+ +P
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141
Query: 159 NPTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVP-GYTFGCIQKATGNSVPPQ--- 213
C L+Y +S+ NL+ + FGC+ +G S P +
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSG-SDPEEDTK 200
Query: 214 --GLLGLGRGSLSLLAQTQNLYQSTFSYCL------PSFKALSFSGSLRLGPIGQPKRIK 265
GLLG+ RGSLS ++Q + FSYC+ P F L S L P+ I+
Sbjct: 201 TTGLLGMNRGSLSFISQ---MGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIR 257
Query: 266 Y-TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
TPL R + Y V L I+V +++ IP L + T T++DSGT FT L+ P
Sbjct: 258 ISTPLPYFDRVA--YTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFTFLLGP 315
Query: 325 AYTAVRDVFRRRVGSNLTVTS------LGGFDTCYSVPIVA---------PTITLMFSGM 369
YTA+R F + LTV G D CY + PT++L+F G
Sbjct: 316 VYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVSLVFEGA 375
Query: 370 NVTLPQDNLLI---HSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
+ + LL H TAG S+ C + D + VI + QQN I +D+ SR
Sbjct: 376 EIAVSGQPLLYRVPHLTAGNDSVYCFTFGNS-DLMGMEAYVIGHHHQQNMWIEFDLQRSR 434
Query: 425 LGVARELC 432
+G+A C
Sbjct: 435 IGLAPVQC 442
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 129/289 (44%), Gaps = 28/289 (9%)
Query: 167 CAFNLTYGSS----------TIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLL 216
C + YG S T NL+ + V FGC G GLL
Sbjct: 74 CPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLL 133
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGP----IGQPKRIKYTPLL- 270
GLGRG LS +Q Q+LY +FSYCL + + S L G + P+ + +T L+
Sbjct: 134 GLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPE-LNFTTLVA 192
Query: 271 --KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
+NP + YYV + +I VG VV+IP Q GTIIDSGT + PAY
Sbjct: 193 GKENPV-DTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQV 251
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHST 383
+++ F +V V + CY+V V P ++FS G P +N I
Sbjct: 252 IKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIE 311
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ CLA+ P S L++I N QQQN ILYD SRLG A C
Sbjct: 312 PREVVCLAILGTPP---SALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 113/425 (26%), Positives = 174/425 (40%), Gaps = 72/425 (16%)
Query: 75 LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS- 133
L VA + P A+ + + V +G P Q + M +DT ++ +W+ C G S+
Sbjct: 37 LVVAPPTRSPAANRLRFRHDVSLTVPVAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTP 96
Query: 134 ------TVFNSAQSTTFKNLGCQAA-QCKQ------VPNPTCGG---GACAFNLTYGSST 177
FN + S+T+ C ++ +C+ VP P C G +C +L+Y ++
Sbjct: 97 PQPQAPAAFNGSASSTYAAAHCSSSPECQWRGRDLPVP-PFCAGPPSNSCRVSLSYADAS 155
Query: 178 IAAN-LSQDTISLATDIVPGYTFGCI-----------------QKATGNSVPPQGLLGLG 219
A L+ DT L FGCI AT +S GLLG+
Sbjct: 156 SADGVLAADTFLLGGAPPVRALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMN 215
Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKA---LSFSGSLRLGPIGQPKRIKYTPLLKNPR-- 274
RGSLS + QT L F+YC+ L G + ++ YTPL++ +
Sbjct: 216 RGSLSFVTQTGTL---RFAYCIAPGDGPGLLVLGGDGDGAALSAAPQLNYTPLIEMSQPL 272
Query: 275 ---RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
Y V L IRVG ++ IP L + T T++DSGT FT L+A AY ++
Sbjct: 273 PYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKG 332
Query: 332 VFRRRVGSNLT------VTSLGGFDTCY----------SVPIVAPTITLMFSGMNVTLPQ 375
F + + L G FD C+ + + P + L+ G V +
Sbjct: 333 EFLNQTSALLAPLGEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLRGAEVAVGG 392
Query: 376 DNLLI--------HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
+ LL + ++ CL + D VI + QQN + YD+ NSR+G
Sbjct: 393 EKLLYMVPGERRGEGGSEAVWCLTFGNS-DMAGMSAYVIGHHHQQNVWVEYDLQNSRVGF 451
Query: 428 ARELC 432
A C
Sbjct: 452 APARC 456
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 164/368 (44%), Gaps = 54/368 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTF 144
Y +GTP + ++A+DT +D W+PC C+ C+ ++ A+STT
Sbjct: 208 YYTWVDVGTPNTSFMVALDTGSDLFWIPCD-CIECAPLSGYHGSLDRDLGIYKPAESTTS 266
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTY--GSSTIAANLSQDTISLATD-----IVPGY 197
++L C C + T C +N Y ++T + L +D + L + +
Sbjct: 267 RHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAPVKASV 326
Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
GC +K +G + + P GLLGLG +S+ LA+ L +++FS C SG
Sbjct: 327 IIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARA-GLVRNSFSMCF-----TKDSG 380
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G G + + TP + + Y VN+ VG + + +T I
Sbjct: 381 RIFFGDQGVSTQ-QSTPFVPLYGKLQTYTVNVDKSCVGHKCFE----------STSFQAI 429
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV-PIV---APTITLMFS 367
+DSGT FT L Y AV F ++V ++ FD CYS P+V PT+TL F+
Sbjct: 430 VDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATSFDYCYSASPLVMPDVPTVTLTFA 489
Query: 368 GMNVTLPQD-NLLIHSTAGSIT--CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
G P + L+H G++ CLA+ +P+ + +IA + +++D N +
Sbjct: 490 GNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPI----GIIAQNFLLGYHVVFDRENMK 545
Query: 425 LGVARELC 432
LG R C
Sbjct: 546 LGWYRSEC 553
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 153/372 (41%), Gaps = 59/372 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC------------SSTVFNSAQSTTF 144
+ +GTPA + L+A+DT +D W+PC C C + ++++ +S+T
Sbjct: 113 HFANVSVGTPASSYLVALDTGSDLFWLPCN-CTKCVHGIQLSTGQKIAFNIYDNKESSTS 171
Query: 145 KNLGCQAAQCKQVPN-PTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD-------IV 194
KN+ C ++ C+Q + GG C + + Y S + L +D + L TD
Sbjct: 172 KNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQTQHAN 231
Query: 195 PGYTFGCIQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSF 249
P TFGC Q TG + P GL GLG +S+ + Q L ++FS C A
Sbjct: 232 PLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMCF----AADG 287
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
G + G TP P S+ Y + + I VG D L+FN
Sbjct: 288 LGRITFGDNNSSLDQGKTPFNIRPSHST-YNITVTQIIVGGNSAD-----LEFN------ 335
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYSV----PIVAP 360
I D+GT FT L PAY + F ++ L S F+ CY + I P
Sbjct: 336 AIFDTGTSFTYLNNPAYKQITQSFDSKI--KLQRHSFSNSDDLPFEYCYDLRTNQTIEVP 393
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
I L G + D +I S G+ L +A N +N+I +RI++D
Sbjct: 394 NINLTMKGGDNYFVMD-PIITSGGGNNGVLCLAVLKSN---NVNIIGQNFMTGYRIVFDR 449
Query: 421 PNSRLGVARELC 432
N LG C
Sbjct: 450 ENMTLGWKESNC 461
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 165/368 (44%), Gaps = 55/368 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F+ S+T+K + C
Sbjct: 83 YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC---- 138
Query: 154 CKQVPNPTC----GGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKA 205
N C G C + Y ST + L +D IS ++++P FGC
Sbjct: 139 -----NIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193
Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQ 260
TG+ S G++GLG G LSL+ Q + +FS C + + G++ LG I
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLC---YGGMDIGGGAMVLGGISP 250
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
P + +T +P RS Y V+L I V + + + G F+ GA ++DSGT +
Sbjct: 251 PSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGI--FDGRYGA--VLDSGTTYAY 304
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMF- 366
L A A++A +D + S + + G D C+S + PT+ ++F
Sbjct: 305 LPAEAFSAFKDAIMDEIHS---LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFE 361
Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+G ++L P++ HS CL + +N N ++ + +N ++YD NS++
Sbjct: 362 NGQKLSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGIVVRNTLVMYDRANSKI 418
Query: 426 GVARELCT 433
G + C+
Sbjct: 419 GFWKTNCS 426
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 165/368 (44%), Gaps = 55/368 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F+ S+T+K + C
Sbjct: 83 YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC---- 138
Query: 154 CKQVPNPTC----GGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKA 205
N C G C + Y ST + L +D IS ++++P FGC
Sbjct: 139 -----NIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193
Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQ 260
TG+ S G++GLG G LSL+ Q + +FS C + + G++ LG I
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLC---YGGMDIGGGAMVLGGISP 250
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
P + +T +P RS Y V+L I V + + + G F+ GA ++DSGT +
Sbjct: 251 PSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGI--FDGRYGA--VLDSGTTYAY 304
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMF- 366
L A A++A +D + S + + G D C+S + PT+ ++F
Sbjct: 305 LPAEAFSAFKDAIMDEIHS---LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFE 361
Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+G ++L P++ HS CL + +N N ++ + +N ++YD NS++
Sbjct: 362 NGQKLSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGIVVRNTLVMYDRANSKI 418
Query: 426 GVARELCT 433
G + C+
Sbjct: 419 GFWKTNCS 426
>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
Length = 454
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 111/382 (29%), Positives = 154/382 (40%), Gaps = 64/382 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCS-------STVFNSAQSTTFKN 146
Y + GTP QTL + MDT +D W PCT C CS S +F S++ K
Sbjct: 90 YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149
Query: 147 LGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGY-------- 197
LGC NP CG +GS + + T T I P Y
Sbjct: 150 LGCV--------NPKCG-------WIHGSKVQSRCRDCEPTSPNCTQICPPYLNFLRFWD 194
Query: 198 ----TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
F + + + G GRG SL +Q L FSYCL S + + S
Sbjct: 195 HRRSQFHRRMLCPLHQSTRREISGFGRGPPSLPSQ---LGLKKFSYCLLSRRYDDTTESS 251
Query: 254 RLGPIGQPKR------IKYTPLLKNPRR------SSLYYVNLLAIRVGRRVVDIPPGALQ 301
L G+ + YTP ++NP+ S YY+ L I VG + V IP L
Sbjct: 252 SLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLI 311
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSVPIVA 359
GTIIDSGT FT + + V F ++V S V + G C+++ +
Sbjct: 312 PGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGLN 371
Query: 360 ----PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAM----AAAPDNVNSVLNVIANMQ 410
P +TL F G + LP N + + CL + AA + ++ N Q
Sbjct: 372 TPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQ 431
Query: 411 QQNHRILYDVPNSRLGVARELC 432
QQN + YD+ N RLG ++ C
Sbjct: 432 QQNFYVEYDLRNERLGFRQQSC 453
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 108/424 (25%), Positives = 169/424 (39%), Gaps = 73/424 (17%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
LQ+ H SP SPF P K L+ E + ++ + R S + P+
Sbjct: 34 LQLIHRDSPESPFYPGK-LTNSERISRLVEFSKIRAHNFDSGFSSEAFRPPV-----FQD 87
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y+V+ +IG P L + DT + W TV N F+
Sbjct: 88 FTCYLVKVRIGNPGIPLYLVPDTGSALIW-----------TVNNQ---NIFQ-------- 125
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTI--SLATDIVPGYTFGCIQKATGNSVP 211
C C++ Y +I ++ I S ++ +P Y FGC + SV
Sbjct: 126 --------CRNNKCSYTRRYDDGSITTGVAAQDILQSEGSERIPFY-FGCSRDNQNFSVF 176
Query: 212 PQ-----GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA---------LSFSGSLRLGP 257
G++GL +SLL Q ++ Q FSYCL ++ L F +R G
Sbjct: 177 EHTGKSGGVMGLNTSPVSLLQQLSHITQRRFSYCLNPYQHGSEPPPSSLLRFGNDIRKGR 236
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+R + TPL+ +P R + Y++NLL + V + + +PPG GTIIDSGT
Sbjct: 237 ----RRFQSTPLMSSPDRPN-YFLNLLDMTVAGQRLHLPPGTFALRQDGTGGTIIDSGTG 291
Query: 318 FTRLVAPAY----TAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-----IVAPTITLMFSG 368
T + AY +A ++ F R + + FD CYS ++T F
Sbjct: 292 LTFITQTAYPRLISAFQNYFDHRGFQRVHIPE---FDLCYSFRGNHTFHDHASMTFHFER 348
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
+ T+ D + + + C+A+ P +V+ I Q N R +YD +L
Sbjct: 349 ADFTVQADYVYLPMEDDNAFCVALQPTPPQQRTVIGAI---NQGNTRFIYDAAAHQLLFI 405
Query: 429 RELC 432
E C
Sbjct: 406 AENC 409
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 164/372 (44%), Gaps = 61/372 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
Y +GTPA + L+A+DT +D WVPC C+ C+ ++ A+STT
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCD-CIQCAPLSGYRGNLDRDLRIYRPAESTTS 154
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISL--ATDIVP---GY 197
++L C C+ VP T C +N+ Y S ++ L +DT+ L D VP
Sbjct: 155 RHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214
Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
GC QK +G + + P GLL LG +S+ LA+ L Q++FS C FK S SG
Sbjct: 215 IIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARA-GLVQNSFSMC---FKEDS-SG 269
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G G P + + TP + + Y VN+ +G + ++ T +
Sbjct: 270 RIFFGDQGVPSQ-QSTPFVPLYGKLQTYAVNVDKSCIGHKCLE----------GTSFKAL 318
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
+DSGT FT L Y A F +++ + + CYS +P V PTITL F
Sbjct: 319 VDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDV-PTITLTF 377
Query: 367 S------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+ +N LP + G++ +A P + +IA + +++D
Sbjct: 378 AADKSLQAVNPILP-----FNDKQGALAGFCLAVLPS--TEPIGIIAQNFLVGYHVVFDR 430
Query: 421 PNSRLGVARELC 432
+ +LG R C
Sbjct: 431 ESMKLGWYRSEC 442
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/308 (30%), Positives = 141/308 (45%), Gaps = 39/308 (12%)
Query: 149 CQAAQCKQVPNPTCGGGA--------CAFNLTYGSSTIAANLSQ-----DTISLATDIV- 194
C C ++P P C A C+++ YG++ + ++ +T + D
Sbjct: 28 CGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAA 87
Query: 195 -PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----LSF 249
PG FGC ++ G GL+GLGRG LSL+ Q L F Y L S + +SF
Sbjct: 88 FPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQ---LNVEAFGYRLSSDLSAPSPISF 144
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
GSL G TPLL NP L YYV L I VG ++V IP G F+ +TG
Sbjct: 145 -GSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTG 203
Query: 308 AGTII-DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD-TCY---SVPIVAPTI 362
AG +I DSGT T L PAYT VRD ++G + D C+ S P++
Sbjct: 204 AGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSM 263
Query: 363 TLMFS-GMNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
L F G ++ L +N L + C ++ + + L +I N+ Q + +++
Sbjct: 264 VLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKS----SQALTIIGNIMQMDFHVVF 319
Query: 419 DVP-NSRL 425
D+ N+R+
Sbjct: 320 DLSGNARM 327
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 158/364 (43%), Gaps = 47/364 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F S+T++ + C
Sbjct: 77 YTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKC---- 132
Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKA 205
NP+C G C + Y S+ + +++D +S +++ P FGC
Sbjct: 133 -----NPSCNCDDEGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVE 187
Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
TG+ S G++GLGRG LS++ Q + + +FS C G++ LG I P
Sbjct: 188 TGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDV--GGGAMVLGQISPP 245
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ ++ NP RS Y + L + V + + + P GT++DSGT +
Sbjct: 246 PNMVFS--HSNPYRSPYYNIELKELHVAGKPLKLKPKVFD----EKHGTVLDSGTTYAYF 299
Query: 322 VAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMN 370
A+ A++D + + + D C+S + V P + ++F SG
Sbjct: 300 PEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQK 359
Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
++L P++ L H+ CL + N N + ++ + +N + YD N ++G +
Sbjct: 360 LSLSPENYLFRHTKVSGAYCLGIF---QNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWK 416
Query: 430 ELCT 433
C+
Sbjct: 417 TNCS 420
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 108/420 (25%), Positives = 175/420 (41%), Gaps = 37/420 (8%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
++++ H S SPF + ++ V + + R + ++V +V S +
Sbjct: 28 SVEIIHRDSSRSPFYRATETQFQR-VTNAVRRSMNRANHFNQISVYSNAV---ESPVTLL 83
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGC 149
Y++ +GTP + +DT++D WV C C C +S +F+ + S T+KNL C
Sbjct: 84 DDGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPC 143
Query: 150 QAAQCKQVPNPTCGGGA---CAFNLTYGS-STIAANLSQDTISLATDIVPGYTF-----G 200
+ CK V +C C + Y S +L +T++L + P F G
Sbjct: 144 SSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIG 203
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--- 257
CI+ T S G++GLG G +SL+ Q + FSYCL S L+ G
Sbjct: 204 CIRN-TNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDR--SSKLKFGDAAM 260
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT-IIDSGT 316
+ + + K+ ++ YY+ L A VG ++ + + ++G G IIDSGT
Sbjct: 261 VSGDGTVSTRIVFKDWKK--FYYLTLEAFSVGNNRIEFRSSSSR---SSGKGNIIIDSGT 315
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---PIVAPTITLMFSGMNVTL 373
FT L Y+ + V L F CY + P IT FSG +V L
Sbjct: 316 TFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKVDVPVITAHFSGADVKL 375
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
N I + + CLA ++ + N+ QQN + YD+ + CT
Sbjct: 376 NALNTFI-VASHRVVCLAFLSSQSGA-----IFGNLAQQNFLVGYDLQRKIVSFKPTDCT 429
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 158/363 (43%), Gaps = 46/363 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F +S+T+ + C
Sbjct: 88 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN-MD 146
Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN- 208
C N G C + Y S+ + L +D IS +++VP FGC TG+
Sbjct: 147 C----NCDHDGVNCVYERRYAEMSSSSGVLGEDIISFGNQSEVVPQRAVFGCENVETGDL 202
Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
S G++GLGRG LS++ Q +N+ +FS C G++ LG I P +
Sbjct: 203 YSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHV--GGGAMVLGGIPPPPDMV 260
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
++ +P RS Y + L I V + + + P GT++DSGT + L A
Sbjct: 261 FS--RSDPYRSPYYNIELKEIHVAGKPLKLSPSTFD----RKHGTVLDSGTTYAYLPEEA 314
Query: 326 YTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMFS-GMNV 371
+ A RD ++ + + + G D C+S + P + ++FS G +
Sbjct: 315 FVAFRDAIIKK---SHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSNGQKL 371
Query: 372 TL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+L P++ L H+ CL + D+ + +I +N + YD N ++G +
Sbjct: 372 SLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIV----RNTLVTYDRENEKIGFWKT 427
Query: 431 LCT 433
C+
Sbjct: 428 NCS 430
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 164/387 (42%), Gaps = 44/387 (11%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST----VFNS 138
+P++SG T + Y VR ++GTPAQ ++ DT +D WV C+G + VF +
Sbjct: 99 MPLSSG-AYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRA 157
Query: 139 AQSTTFKNLGCQAAQCKQ-VP----NPTCGGGACAFNLTYGSSTIAANL---SQDTISLA 190
A S ++ + C + C VP N + CA++ Y + A + TI+L+
Sbjct: 158 AASRSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALS 217
Query: 191 TD----------IVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
+ G GC G S G+L LG ++S ++ + FSY
Sbjct: 218 GSESRDGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSY 277
Query: 240 CLPSFKALSFSGS-LRLGPIG----------QPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
CL A + S L GP G TPLL + R S Y V + A+ V
Sbjct: 278 CLVDHLAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHV 337
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
+DIP A ++ G G I+DSGT T L PAY AV R+ + L S+
Sbjct: 338 AGEALDIP--ADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERL-AGLPRVSMDP 394
Query: 349 FDTCYSVPIVA---PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
F+ CY+ A P + + F+G P + A + C+ + + ++V
Sbjct: 395 FEYCYNWTAAALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQ---EGAWPGVSV 451
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
I N+ QQ+H +D+ + L C
Sbjct: 452 IGNILQQDHLWEFDLRDRWLRFKHTRC 478
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 119/418 (28%), Positives = 184/418 (44%), Gaps = 63/418 (15%)
Query: 34 LQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQ 93
+ +H++S KP +E+ +E L +A+ + VPI I Q
Sbjct: 35 VHSYHIYSR----KPPHVYHIKEASVERLEYLKAKTT--GDIIAHLSPNVPI-----IPQ 83
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQ 150
+ ++V IG+P T L+ MDT++D W+ C C+ C S +F+ ++S T +N C+
Sbjct: 84 A--FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCR 141
Query: 151 AAQCKQVPNPTCGGG--ACAFNLTY----GSSTIAA------NLSQDTISLAT--DIVPG 196
+Q +P+ +C +++ Y GS I A N D S A D+V
Sbjct: 142 TSQ-YSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVV-- 198
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF-SGSLRL 255
FGC G + G+LGLG G SL+ + + FSYC S S+ L L
Sbjct: 199 --FGCGHDNYGEPLVGTGILGLGYGEFSLVHR----FGKKFSYCFGSLDDPSYPHNVLVL 252
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-AGTIIDS 314
G G TPL + YYV + AI V ++ I P N TG GTIID+
Sbjct: 253 GDDGANILGDTTPL---EIHNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDT 309
Query: 315 GTVFTRLVAPAY----TAVRDVFRRRVG----SNLTVTSLGGFDTCYSVPIVA---PTIT 363
G T LV AY + D+F R S + + ++ + +V P +T
Sbjct: 310 GNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVT 369
Query: 364 LMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
FS G ++L +L + + ++ CLA+ P N+NS I QQ++ I YD+
Sbjct: 370 FHFSEGAELSLDVKSLFM-KLSPNVFCLAV--TPGNLNS----IGATAQQSYNIGYDL 420
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 158/373 (42%), Gaps = 48/373 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R ++G P + + +DT +D WV C C GC +T F+ STT +
Sbjct: 83 YYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVS 142
Query: 149 CQAAQCK---QVPNPTCGGGA--CAFNLTYGS-STIAANLSQDTISLATDI--------V 194
C C Q + C G + CA+ YG S + D I L I
Sbjct: 143 CSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSS 202
Query: 195 PGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
FGC TG+ G+ G G+ LS+++Q ++ + FS+CL S
Sbjct: 203 ASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDD--S 260
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G L LG I +P + YTPL+ + Y +NL +I V +V+ I P F ++
Sbjct: 261 GGGILVLGEIVEPN-VVYTPLVPSQPH---YNLNLQSISVNGQVLPISPAV--FATSSSQ 314
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITL 364
GTIIDSGT L AY A V + L G + CY SV + P ++L
Sbjct: 315 GTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKG-NRCYVTSSSVSDIFPQVSL 373
Query: 365 MFSGMN--VTLPQDNLLIHSTAGSIT--CLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
F+G V QD L+ ++ G T C+ P + ++ ++ ++ +YD+
Sbjct: 374 NFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIP---GQGITILGDLVLKDKIFIYDL 430
Query: 421 PNSRLGVARELCT 433
N R+G C+
Sbjct: 431 ANQRIGWTNYDCS 443
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 110/407 (27%), Positives = 164/407 (40%), Gaps = 39/407 (9%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
EE V +A + RL + R S A T+ YI IG P Q +
Sbjct: 44 EERVRRAVAVSRERLAYTQQQQQLRASGDVSAPVHLATRQ--YIAEYLIGDPPQRAAALI 101
Query: 115 DTSNDAAWVPCT---GCVGCSST---VFNSAQSTTFKNLGC--QAAQCKQVPNPTCG-GG 165
DT ++ W C G C+ +N ++S+TF + C A C CG G
Sbjct: 102 DTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLCGLDG 161
Query: 166 ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGNSVPPQGLLGLGRGS 222
+C F +YG+ ++ +L + + + FGC+ + G GL+GLGRG
Sbjct: 162 SCTFAASYGAGSVFGSLGTEAFTFQSGAAK-LGFGCVSLTRITKGALNGASGLIGLGRGR 220
Query: 223 LSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP----IGQPKRIKYTPLLKNPRR-- 275
LSL++QT + FSYCL P + S L +G G + P +K+P
Sbjct: 221 LSLVSQTG---ATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYP 277
Query: 276 -SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG----AGTIIDSGTVFTRLVAPAYTAVR 330
S+ YY+ L+ I VG + IP A + G IID+G+ T L AY+A+
Sbjct: 278 YSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALS 337
Query: 331 DVFRRRVGSNLTV-TSLGGFDTCYS---VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGS 386
D R++ +L + G D C + V V P + F G S
Sbjct: 338 DEVARQLNRSLVQPPADTGLDLCVARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKS 397
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
C+ + VI N QQQ+ +LYD+ L C+
Sbjct: 398 TACMLI-----EEGGYETVIGNFQQQDVHLLYDIGKGELSFQTADCS 439
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 160/368 (43%), Gaps = 45/368 (12%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLG 148
T P ++V +G PA L MDT ++ WV C C C+ + + ++S+T+ +L
Sbjct: 94 TYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLP 153
Query: 149 CQAAQCKQVPNPTCGG-GACAFNLTYGSSTIAANL--SQDTISLATD----IVPGYTFGC 201
C C P+ C C +NL+Y + +A + ++ I ++D VP FGC
Sbjct: 154 CTNTMCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGC 213
Query: 202 IQKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-------FKALSFSGS 252
+ G+ + G+ GLG+G S + + S FSYCL + + L F
Sbjct: 214 SHE-NGDYKDRRFTGVFGLGKGITSFVTRM----GSKFSYCLGNIADPHYGYNQLVFGEK 268
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
P ++ + YYV L I VG + +DI A + +I
Sbjct: 269 ANFEGYSTPLKVV----------NGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSA-LI 317
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-----PTITLMFS 367
DSGT T L A+ A+ + R+ + L G F CY + P +T FS
Sbjct: 318 DSGTALTWLAESAFRALDNEVRQLLDGVLMPFWRGSF-ACYKGTVSQDLIGFPVVTFHFS 376
Query: 368 -GMNVTLPQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
G ++ L +++ +T I C+A+ A+A N +VI M QQ + + YD+ +++
Sbjct: 377 GGADLDLDTESMFYQATP-DILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNK 435
Query: 425 LGVARELC 432
L R C
Sbjct: 436 LFFQRIDC 443
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 115/405 (28%), Positives = 171/405 (42%), Gaps = 75/405 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSS-------TVFNSAQSTTFKN 146
Y +GTP Q L + +DT + +WVPCT C CSS VF+ S++ +
Sbjct: 89 YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148
Query: 147 LGCQAAQCKQVPNP----------TCGGGACA------------FNLTYGSSTIAANLSQ 184
+GC+ C + +P +C G C + + YGS + A L
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLIS 208
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
DT+ V + GC + PP GL G GRG+ S+ +Q L + FSYCL S
Sbjct: 209 DTLRTPGRAVRNFVIGCSLASVHQ--PPSGLAGFGRGAPSVPSQ---LGLTKFSYCLLSR 263
Query: 245 K---ALSFSGSLRL---GPIGQPKRIKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVD 294
+ + SG L L G ++Y PL ++ P S YY+ L AI VG + V
Sbjct: 264 RFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKSVQ 323
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTR----LVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
+P A G G I+DSGT F+ + P AV R + V G
Sbjct: 324 LPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLS 382
Query: 351 TCYSVP-----IVAPTITLMFSGMNV-TLPQDNLLI---HSTAGSITCLAMAAAPDNVNS 401
C+++P + P ++L F G +V LP +N + + +G +A A V+
Sbjct: 383 PCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSD 442
Query: 402 VLN--------------VIANMQQQNHRILYDVPNSRLGVARELC 432
V ++ + QQQN+ I YD+ RLG R+ C
Sbjct: 443 VPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 162/367 (44%), Gaps = 54/367 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F ST+++ L C
Sbjct: 76 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC---- 131
Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLATD--IVPGY-TFGCIQKA 205
NP C G C + Y S+ + LS+D IS + + P FGC +
Sbjct: 132 -----NPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEE 186
Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
TG+ S G++GLGRG LS++ Q + + + FS C + G++ LG I P
Sbjct: 187 TGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV--GGGAMVLGKISPP 244
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ ++ +P RS Y ++L + V + + + P FN GT++DSGT +
Sbjct: 245 PGMVFS--HSDPFRSPYYNIDLKQMHVAGKSLKLNPKV--FN--GKHGTVLDSGTTYAYF 298
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMF-S 367
A+ A++D + + S + + G D C+S + P I + F +
Sbjct: 299 PKEAFIAIKDAVIKEIPS---LKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGN 355
Query: 368 GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G + L P++ L H+ CL + PD ++ L + + +N + YD N +LG
Sbjct: 356 GQKLILSPENYLFRHTKVRGAYCLGI--FPDRDSTTL--LGGIVVRNTLVTYDRENDKLG 411
Query: 427 VARELCT 433
+ C+
Sbjct: 412 FLKTNCS 418
>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
Length = 432
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 109/417 (26%), Positives = 174/417 (41%), Gaps = 84/417 (20%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNS-------- 138
T + Y++ +G P Q + +DT +D WVPC C+ C + S
Sbjct: 20 TYTDGYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSP 79
Query: 139 -AQSTTFKNLGCQAAQCKQV-----PNPTCGGGACA---------------FNLTYGSST 177
S+ K L C + C + + C CA F+ TYG
Sbjct: 80 SQSSSNMKEL-CGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSDLCTRPCPPFSYTYGGGA 138
Query: 178 IA-ANLSQDTISLATDI--------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLA 227
+ +L++D ++L I VPG+ FGC+ G+S+ P G+ G G+G LSL +
Sbjct: 139 LVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGCV----GSSIREPIGIAGFGKGILSLPS 194
Query: 228 QTQNLYQSTFSYCLPSFKAL---SFSGSLRLGPIGQPKR--IKYTPLLKNPRRSSLYYVN 282
Q L FS+C F+ +F+ SL +G + + +TP+LK+ + YY+
Sbjct: 195 QLGFL-DKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPNFYYIG 253
Query: 283 LLAIRVGR-RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV----RDVFRRRV 337
L + +G + PP + G I+D+GT +T L P YTA+ V
Sbjct: 254 LEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLASVILYER 313
Query: 338 GSNLTVTSLGGFDTCYSVPIVA--------PTITLMFSG-MNVTLPQDNLLIHSTAGS-- 386
+L + + GFD C+ +P P I F G + +TLP+D+ TA
Sbjct: 314 SYDLEMRT--GFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAPKNS 371
Query: 387 --ITCLAM-----AAAPDNVNSVLN----VIANMQQQNHRILYDVPNSRLGVARELC 432
+ CL D+V N V+ + Q QN ++YD+ R+G + C
Sbjct: 372 VVVKCLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDC 428
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 124/431 (28%), Positives = 185/431 (42%), Gaps = 81/431 (18%)
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTY----IVRAKIGTPAQTLLMAMDT 116
M+ +D R+ LA R + + A+G + Q + +GTP L+A+DT
Sbjct: 75 MVHRD--RVFHGRRLADDRDTPITFAAGNETHQIAAFGFLHFANVSVGTPPLWFLVALDT 132
Query: 117 SNDAAWVP--CTGCVGCSST---------VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
+D W+P CT CV T ++ +S+T KN+ C + CKQ + G
Sbjct: 133 GSDLFWLPCNCTSCVRGLKTQNGKVIDLNIYELDKSSTRKNVPCNSNMCKQTQCHS-SGS 191
Query: 166 ACAFNLTYGSSTIAAN--LSQDTISLAT------DIVPGYTFGCIQKATG---NSVPPQG 214
+C + + Y S+ +++ L +D + L T DI T GC Q TG N P G
Sbjct: 192 SCRYEVEYLSNDTSSSGFLVEDVLHLITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNG 251
Query: 215 LLGLGRGSL---SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLK 271
L GLG ++ S+LAQ + L +FS C S SG + G G + K TP
Sbjct: 252 LFGLGMENVSVPSILAQ-KGLISDSFSMCFGS----DGSGRITFGDTGSSDQGK-TPF-- 303
Query: 272 NPRRSS-LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
N R S Y V + I VG D +F+ I DSGT FT L PAYT +
Sbjct: 304 NLRESHPTYNVTITQIIVGGYAAD-----HEFH------AIFDSGTSFTYLNDPAYTLIS 352
Query: 331 DVFRRRVGSN----LTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHS 382
+ F V +N L+ S F+ CY + I P + L G + D ++ S
Sbjct: 353 EKFNSLVKANRHSPLSPDSDLPFEYCYDMSPDQTIEVPFLNLTMKGGDDYYVTDPIVPVS 412
Query: 383 T--AGSITCLAMAAAPDNVNSVLN--------------VIANMQQQN----HRILYDVPN 422
+ G++ CL + + DN+N + +I Q+N +RI++D N
Sbjct: 413 SEVEGNLLCLGIQKS-DNLNIIGREYTTEEEFLHLKHMIIKFFIQKNFMTGYRIVFDREN 471
Query: 423 SRLGVARELCT 433
LG CT
Sbjct: 472 MNLGWKESNCT 482
>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
Length = 507
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 96/348 (27%), Positives = 151/348 (43%), Gaps = 62/348 (17%)
Query: 112 MAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN---PTCG 163
+ +DT++D WV C + SS+ ++ A+S+T+ L C +A C ++ C
Sbjct: 126 VVLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYYALACNSAACTELGRLYRGACV 185
Query: 164 GGACAFNL-------------TYGSSTIAANLSQDTISLATDIVPG----YTFGCIQ--- 203
C + + TYGS D + L D G + FGC
Sbjct: 186 NNQCQYRVPIPSSPASSSSSGTYGS---------DLLKLTADPADGASMSFKFGCSHGEA 236
Query: 204 KATGNSV---PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP---SFKALSFSGSLRLGP 257
K G G++ LG G SL++Q +Y S FSYC+P S + F +G
Sbjct: 237 KQGGEGSIDNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFVLGGGVGD 296
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
+ TP+L+ R +LY V LLAI V + +++ P +G+++DS T
Sbjct: 297 LSGAGGYAVTPMLRYARVPTLYRVRLLAIAVDGQQLNVTPSVF------ASGSVLDSRTA 350
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVT- 372
TRL AY A+R+ FR R+ G DTCY ++ P + L+ G V
Sbjct: 351 ITRLPPTAYQALREAFRSRMAMYREAPPQGNLDTCYDFAGAFLVMVPRVALLLDGNAVVA 410
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
L + +L H CL + D + + ++ N+QQQ +LY+V
Sbjct: 411 LDRQGILFHD------CLVFTSNTD--DRMPGILGNVQQQTMEVLYNV 450
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 108/424 (25%), Positives = 180/424 (42%), Gaps = 40/424 (9%)
Query: 33 TLQVFHVFSPCSPFKPSKPLS---WEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGR 89
T ++ H SP SP S+ W +++ +++ + ++++ IA+G
Sbjct: 32 TTELVHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANGG 91
Query: 90 QITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKN 146
+ Y++ +GTP +L DT +D W CT C C + F+ S T+++
Sbjct: 92 E------YLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRD 145
Query: 147 LGCQAAQCKQV-PNPTCGGGA-CAFNLTYGSSTIA-ANLSQDTISL-ATDIVPGY----T 198
L C QC+ + + +C C ++ YG + NL+ DT++L +T+ P Y
Sbjct: 146 LSCDTRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTV 205
Query: 199 FGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL-- 255
GC ++ G G++GLG G +SL++Q + FSYCL F + S S +L
Sbjct: 206 IGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHF 265
Query: 256 --GPIGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
+ ++ TPL+ KNP + YY+ L A+ VG + ++ + II
Sbjct: 266 GRNAVVSGSGVQSTPLISKNP--DTFYYLTLEAMSVGDKKIEF---GGSSFGGSEGNIII 320
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCY--SVPIVAPTITLMFSGM 369
DSGT T +T V T + G CY + + P IT F+G
Sbjct: 321 DSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPDLKVPVITAHFNGA 380
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+V L N I + + CLA + + N+ Q N I YD+ +
Sbjct: 381 DVVLQTLNTFIL-ISDDVLCLAFNSTQSGA-----IFGNVAQMNFLIGYDIQGKSVSFKP 434
Query: 430 ELCT 433
CT
Sbjct: 435 TDCT 438
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 113/420 (26%), Positives = 171/420 (40%), Gaps = 64/420 (15%)
Query: 59 LEMLAKDQARLQFLSSLAVARKSVV-----PIASGRQITQS---------------PTYI 98
+EM+ +D +R F S + V I + QS Y+
Sbjct: 31 VEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRANHLNQSFVSPNSPETTVISALGEYL 90
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCK 155
+ +GTP+ + +DT +D W+ C C C ++ +F+S++S T+K L C + C+
Sbjct: 91 ISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPSNTCQ 150
Query: 156 QVPNPTCGG-GACAFNLTY--GSSTIAANLSQDTISLATD-----IVPGYTFGCIQ-KAT 206
V C C +++ Y GS ++ +LS +T++L + PG GC + A
Sbjct: 151 SVQGTFCSSRKHCLYSIHYVDGSQSL-GDLSVETLTLGSTNGSPVQFPGTVIGCGRYNAI 209
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKRIK 265
G G++GLGRG +SL+ Q FSYCL P S + + +
Sbjct: 210 GIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTASSKLNFGNAAVVSGRGTV 269
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDI-PPGALQFNPTTGAGT-IIDSGTVFTRLVA 323
TPL Y++ L A VGR ++ PG+ G G IIDSGT T L
Sbjct: 270 STPLFSK-NGLVFYFLTLEAFSVGRNRIEFGSPGS------GGKGNIIIDSGTTLTALPN 322
Query: 324 PAYTAV-----RDVFRRRVGSNLTVTSLGGFDTCYSV-----PIVAPTITLMFSGMNVTL 373
Y+ + + V +RV V L CY V P IT FSG +VTL
Sbjct: 323 GVYSKLEAAVAKTVILQRVRDPNQVLGL-----CYKVTPDKLDASVPVITAHFSGADVTL 377
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
N + A + C A V N+ QQN + YD+ + + CT
Sbjct: 378 NAINTFVQ-VADDVVCFAFQPTETGA-----VFGNLAQQNLLVGYDLQMNTVSFKHTDCT 431
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 113/452 (25%), Positives = 185/452 (40%), Gaps = 59/452 (13%)
Query: 1 MKPQLVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLE 60
M P L FFLA LF + ++ S+TL+ H+ S + + E +
Sbjct: 13 MLPYL-FFLAILFAWPVT------------SATLRA-HL----SHVDDGRGFTKRELLRR 54
Query: 61 MLAKDQARLQFLS--SLAVARKSVVPIASGRQITQSPTYIVRAKIGTP-AQTLLMAMDTS 117
M+ + +AR L S A AR + P+ S Y++ IG P +Q +++ +DT
Sbjct: 55 MVVRSRARAANLCPYSGATARPATAPVGRANTDVNS-EYLIHLSIGAPRSQPVVLTLDTG 113
Query: 118 NDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG 174
+D W C C C + F++A S T +++ C C C C + YG
Sbjct: 114 SDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYG 173
Query: 175 SSTIA-ANLSQDTISLATD------IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLL 226
+++ + +D+ + VP FGC G + + G+ G GRG LSL
Sbjct: 174 DGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLP 233
Query: 227 AQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL------- 278
+Q L FSYC + F+A S + LG G K P+L P SL
Sbjct: 234 SQ---LKVRQFSYCFTTRFEAK--SSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNS 288
Query: 279 -YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
Y ++ + VG+ + +P + T IDSGT T + ++ F +
Sbjct: 289 HYVLSFKGVTVGKTRLPVP----EIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQA 344
Query: 338 GSNLTVTSLGGFDTCYSVP----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
+ T+ D C+S P + G + LP++N + C+A++
Sbjct: 345 ALPVNKTADED-DICFSWDGKKTAAMPKLVFHLEGADWDLPRENYVTEDRESGQVCVAVS 403
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+ + +I N QQQN I+YD+ +L
Sbjct: 404 TSGQMDRT---LIGNFQQQNTHIVYDLAAGKL 432
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 112/431 (25%), Positives = 161/431 (37%), Gaps = 91/431 (21%)
Query: 79 RKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVF 136
R+ +P++ G T S T +Q + + +DT +D W PC C+ C
Sbjct: 70 RQVSLPLSPGSDYTLSFT--------INSQPISLYLDTGSDLVWFPCQPFECILCEGKAE 121
Query: 137 NSAQ--------STTFKNLGCQAAQCKQVPNPTCGGGACA-------------------- 168
N++ S T + C+++ C V + CA
Sbjct: 122 NASLASTPPPKLSKTATPVSCKSSACSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCP 181
Query: 169 -FNLTYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGLLGLGRGS 222
F YG ++ A L +D+I L I +TFGC P G+ G GRG
Sbjct: 182 QFYYAYGDGSLIARLYRDSIRLPLSNQTNLIFNNFTFGCAHTTLAE---PIGVAGFGRGV 238
Query: 223 LSLLAQTQNL---YQSTFSYCLPSFKALSFSGS-------LRLGPIGQPKRIK------- 265
LSL AQ L + FSYCL S SF L LG ++ +
Sbjct: 239 LSLPAQLATLSPQLGNQFSYCLVSH---SFDSDRVRRPSPLILGRYDHDEKERRVNGVKK 295
Query: 266 ----YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
YT +L NPR Y V L I +GR+ + P + + G ++DSGT FT L
Sbjct: 296 PSFVYTSMLDNPRHPYFYCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTML 355
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSL----GGFDTCY-----SVPIVAPTITLMFSGMNVT 372
A Y V F RVG S+ G CY V + + + +G +V
Sbjct: 356 PASLYDFVVAEFENRVGRVNERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVV 415
Query: 373 LPQDNLLIH--------STAGSITCLAMAAAPDNVNSVLN---VIANMQQQNHRILYDVP 421
LP+ N + CL + D + N QQQ ++YD+
Sbjct: 416 LPRRNYFYEFLDGGHGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLE 475
Query: 422 NSRLGVARELC 432
N R+G AR C
Sbjct: 476 NRRVGFARRQC 486
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 175/387 (45%), Gaps = 54/387 (13%)
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNS 138
V P+ SG S Y + +GTP LM +DT +D W+ C C C S +F+
Sbjct: 133 VAPVVSG-LAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP 191
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCG--GGACAFNLTYGSSTI-AANLSQDTISLATDI-V 194
S ++ + C A C+++ + C AC + + YG ++ A + + +T++ A+ V
Sbjct: 192 RASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARV 251
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS---- 250
P GC G V GLLGLGRGSLS +Q + +FSYCL + S S
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSR 311
Query: 251 ------GSLRLGPIGQPKRIKYTPLLKNPR------RSSLYYVNLLAIRVGRRVVDIPPG 298
GS G +G +R+ + P + P+ R++ + R GR V PP
Sbjct: 312 SSTVTFGSGARGALG--RRVLH-PDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPP- 367
Query: 299 ALQFNPTTG-AGTIIDSGTVFTRLVAPAYTAVRD----VFRRRVGSNLTVTSLGG---FD 350
+P+TG G I+DSG +PA+ R R + S GG FD
Sbjct: 368 ----DPSTGRGGVIVDSGR-----PSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSLFD 418
Query: 351 TCYSVP----IVAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
TCY + + PT+++ F+ G LP +N LI + C A A + +++
Sbjct: 419 TCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFA----GTDGGVSI 474
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
I N+QQQ R+++D RLG + C
Sbjct: 475 IGNIQQQGFRVVFDGDGQRLGFVPKGC 501
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 160/364 (43%), Gaps = 48/364 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F ST+++ L C
Sbjct: 76 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC---- 131
Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLATD--IVPGY-TFGCIQKA 205
NP C G C + Y S+ + LS+D IS + + P FGC +
Sbjct: 132 -----NPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEE 186
Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
TG+ S G++GLGRG LS++ Q + + + FS C + G++ LG I P
Sbjct: 187 TGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV--GGGAMVLGKISPP 244
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ ++ +P RS Y ++L + V + + + P FN GT++DSGT +
Sbjct: 245 PGMVFS--HSDPFRSPYYNIDLKQMHVAGKSLKLNPKV--FN--GKHGTVLDSGTTYAYF 298
Query: 322 VAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMN 370
A+ A++D + + S + D C+S + P I + F +G
Sbjct: 299 PKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQK 358
Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+ L P++ L H+ CL + PD ++ L + + +N + YD N +LG +
Sbjct: 359 LILSPENYLFRHTKVRGAYCLGI--FPDRDSTTL--LGGIVVRNTLVTYDRENDKLGFLK 414
Query: 430 ELCT 433
C+
Sbjct: 415 TNCS 418
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 115/405 (28%), Positives = 171/405 (42%), Gaps = 75/405 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSS-------TVFNSAQSTTFKN 146
Y +GTP Q L + +DT + +WVPCT C CSS VF+ S++ +
Sbjct: 89 YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148
Query: 147 LGCQAAQCKQVPNP----------TCGGGACA------------FNLTYGSSTIAANLSQ 184
+GC+ C + +P +C G C + + YGS + A L
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLIS 208
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
DT+ V + GC + PP GL G GRG+ S+ +Q L + FSYCL S
Sbjct: 209 DTLRTPGRAVRNFVIGCSLASVHQ--PPSGLAGFGRGAPSVPSQ---LGLTKFSYCLLSR 263
Query: 245 K---ALSFSGSLRL---GPIGQPKRIKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVD 294
+ + SG L L G ++Y PL ++ P S YY+ L AI VG + V
Sbjct: 264 RFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKSVQ 323
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFT----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
+P A G G I+DSGT F+ + P AV R + V G
Sbjct: 324 LPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLS 382
Query: 351 TCYSVP-----IVAPTITLMFSGMNV-TLPQDNLLI---HSTAGSITCLAMAAAPDNVNS 401
C+++P + P ++L F G +V LP +N + + +G +A A V+
Sbjct: 383 PCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSD 442
Query: 402 VLN--------------VIANMQQQNHRILYDVPNSRLGVARELC 432
V ++ + QQQN+ I YD+ RLG R+ C
Sbjct: 443 VPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 161/367 (43%), Gaps = 54/367 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F S+++K L C
Sbjct: 80 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKC---- 135
Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLATD--IVPGY-TFGCIQKA 205
NP C G C + Y S+ + LS+D IS + + P FGC
Sbjct: 136 -----NPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCENVE 190
Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
TG+ S G++GLGRG LS++ Q + + + FS C + G++ LG I P
Sbjct: 191 TGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV--GGGAMVLGKISPP 248
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ ++ +P RS Y ++L + V + + + P FN GT++DSGT +
Sbjct: 249 AGMVFSH--SDPFRSPYYNIDLKQMHVAGKSLKLNPKV--FNGK--HGTVLDSGTTYAYF 302
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMF-S 367
A+ A++D + + S + + G D C+S + P I + F +
Sbjct: 303 PKEAFIAIKDAIIKEIPS---LKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFGN 359
Query: 368 GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G + L P++ L H+ CL + PD ++ L + + +N + YD N +LG
Sbjct: 360 GQKLILSPENYLFRHTKVRGAYCLGI--FPDRDSTTL--LGGIVVRNTLVTYDRENDKLG 415
Query: 427 VARELCT 433
+ C+
Sbjct: 416 FLKTNCS 422
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 116/438 (26%), Positives = 165/438 (37%), Gaps = 82/438 (18%)
Query: 66 QARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIG--TPAQTLLMAMDTSNDAAWV 123
+ R L S R+ +P+A G Y + +G + A + + +DT +D W
Sbjct: 58 RHRTHHLPSSRRHRQLSLPLAPGSD------YTLSLSVGPLSTANPVSLFLDTGSDLVWF 111
Query: 124 PCTG-----CVG------------------------CSSTVFNSAQSTTFKNLGCQAAQC 154
PC C G C+S ++A S+ C AA+C
Sbjct: 112 PCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHSSAPPADLCAAARC 171
Query: 155 --KQVPNPTCGGG-ACA-FNLTYGSSTIAANLSQDTISLATDI-VPGYTFGCIQKATGNS 209
+ +C AC YG ++ A L + + +A + V +TF C A G
Sbjct: 172 PLDDIETGSCAASHACPPLYYAYGDGSLVARLRRGRVGIAASVAVENFTFACAHTALGE- 230
Query: 210 VPPQGLLGLGRGSLSLLAQ-TQNLYQSTFSYCL--PSFKALSFSGSLRLGPI-------- 258
P G+ G GRG LSL AQ FSYCL SF+A +R P+
Sbjct: 231 --PVGVAGFGRGPLSLPAQLAPAALSGRFSYCLVAHSFRA---DRPIRPSPLILGRSPGE 285
Query: 259 --GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
I YTPLL NP+ Y V L A+ VG + P + G ++DSGT
Sbjct: 286 DPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRVGRAGDGGMVVDSGT 345
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG-----GFDTCYSVPIVA-----------P 360
FT L Y V + F R + + + G CY A P
Sbjct: 346 TFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDHDASAAEEGSARAVP 405
Query: 361 TITLMFSG-MNVTLPQDNLLI---HSTAGSITCLA-MAAAPDNVNSVLNVIANMQQQNHR 415
+ + F G V LP+ N + + CL M D+ + N QQQ
Sbjct: 406 PLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGGGPAGTLGNFQQQGFE 465
Query: 416 ILYDVPNSRLGVARELCT 433
++YDV R+G AR CT
Sbjct: 466 VVYDVDAGRVGFARRRCT 483
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 101/199 (50%), Gaps = 5/199 (2%)
Query: 94 SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQ 150
S Y R +GTP + M +DT +D AW+ C C C S +FN + S +F +GC
Sbjct: 154 SGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCD 213
Query: 151 AAQCKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTFGCIQKATGNS 209
+A C Q+ C G C + +YG + + + + +T++ T V GC K G
Sbjct: 214 SAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTFGTTSVANVAIGCGHKNVGLF 273
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPL 269
+ GLLGLG G+LS Q TFSYCL ++ S SG L+ GP P +TPL
Sbjct: 274 IGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRESDS-SGPLQFGPKSVPVGSIFTPL 332
Query: 270 LKNPRRSSLYYVNLLAIRV 288
KNP + YY+++ AI +
Sbjct: 333 EKNPHLPTFYYLSVTAISI 351
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 119/427 (27%), Positives = 185/427 (43%), Gaps = 50/427 (11%)
Query: 33 TLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
++ + H SP SPF PS L+ E + + +RL +S + I
Sbjct: 33 SIDLIHRDSPLSPFYDPS--LTPSERITNAAFRSSSRLNRVSHFLDENN----LPESLLI 86
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLG 148
++ Y++ IGTP L DT +D WV C+ C C + +F +S+TFK
Sbjct: 87 PENGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAAT 146
Query: 149 CQAAQCKQVP--NPTCGG-GACAFNLTYGSSTIAAN-LSQDTISLA------TDIVPGYT 198
C + C VP CG G C ++ +YG + + +T+S T P
Sbjct: 147 CDSQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSI 206
Query: 199 FGC-----IQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSL 253
FGC T + V LG G SL Q Y+ FSYCL F + S + L
Sbjct: 207 FGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYK--FSYCLLPFSSNS-TSKL 263
Query: 254 RLG--PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G I + TPL+ P S Y++NL A+ +G++VV P G T I
Sbjct: 264 KFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVV--PTGR------TDGNII 315
Query: 312 IDSGTVFTRLVAPAYT----AVRDVFRRRVGSNLTVTSLGGFDTCYSV-PIVAPTITLMF 366
IDSGTV T L Y ++++V +L F C+ + P I F
Sbjct: 316 IDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFP----FKFCFPYRDMTIPVIAFQF 371
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
+G +V L NLLI ++ CLA+ P +++ + ++ N+ Q + +++YD+ ++
Sbjct: 372 TGASVALQPKNLLIKLQDRNMLCLAV--VPSSLSGI-SIFGNVAQFDFQVVYDLEGKKVS 428
Query: 427 VARELCT 433
A CT
Sbjct: 429 FAPTDCT 435
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 168/369 (45%), Gaps = 41/369 (11%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---------CVGCSSTVFNSAQSTTFKNLG 148
+V IGTP Q + +DT + +W+ C +T F+ + S++F L
Sbjct: 67 VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPPLPKPKTTSFDPSLSSSFSLLP 126
Query: 149 CQAAQCK-QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI-VPGYTFG 200
C CK ++P+ PT C C ++ Y T+A NL ++ + + + P G
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILG 186
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
C Q +T N +G+LG+ RG LS ++Q + S FSYC+PS + +G LG
Sbjct: 187 CAQASTEN----RGILGMNRGRLSFISQAK---ISKFSYCVPSRTGSNPTGLFYLGDNPN 239
Query: 261 PKRIKYTPLLKNPRRSS-------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
+ KY +L P S Y + + AI++ + +++PP A + + T+ID
Sbjct: 240 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGGSGQTMID 299
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVPIVAPT------ITLM 365
SG+ T LV AY V++ R VG+ + + D C+ + A I+
Sbjct: 300 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 359
Query: 366 F-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F +G+ + + + ++ + C+ + + + + N+I + QQN + YD+ N R
Sbjct: 360 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS-ERLGIGSNIIGTVHQQNMWVEYDLANKR 418
Query: 425 LGVARELCT 433
+G C+
Sbjct: 419 VGFGGAECS 427
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/417 (26%), Positives = 166/417 (39%), Gaps = 87/417 (20%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG----CVGCSS----------TVFNSAQST 142
Y++ IGTP Q + + MDT +D WVPC C C F S+
Sbjct: 21 YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 80
Query: 143 TFKNLGCQAAQCKQV---PNP-----------------TCGGGACAFNLTYGSS-TIAAN 181
T C ++ C + NP TC +F TYG+S + +
Sbjct: 81 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 140
Query: 182 LSQDTI---------SLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
L++D + + +P + FGC+ P G+ G GRG LSL Q
Sbjct: 141 LTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPFQL-GF 196
Query: 233 YQSTFSYCLPSFK---ALSFSGSLRLGPIG---QPKRIKYTPLLKNPRRSSLYYVNLLAI 286
FS+C FK +FS L LG + + + +++TPLLK+P + YY+ L +I
Sbjct: 197 SHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLESI 256
Query: 287 RVGRRVVDIPPGA----LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG--SN 340
+G + G + + G +IDSGT +T L P Y+ + +G
Sbjct: 257 TIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGYPRA 316
Query: 341 LTVTSLGGFDTCYSVPIVA-----------PTITLMF-SGMNVTLPQDNLL------IHS 382
V GFD CY VP P+IT F + ++V LPQ N I+S
Sbjct: 317 KQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINS 376
Query: 383 TAGSITCLAMAAAPDNVNSVL-------NVIANMQQQNHRILYDVPNSRLGVARELC 432
T + CL + + + + QQQN ++YD+ RLG C
Sbjct: 377 TV--VKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 431
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/400 (25%), Positives = 171/400 (42%), Gaps = 40/400 (10%)
Query: 60 EMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSND 119
++ ++ R + + +A + +P++SG + Y V+ +GTPAQ + DT ++
Sbjct: 55 QLPSRRGGRQRVAAEVASSSAVSLPMSSG-AYAGTGQYFVKVLVGTPAQEFTLVADTGSE 113
Query: 120 AAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-QVP----NPTCGGGACAFNLTY- 173
WV C G VF S ++ + C + CK VP N + C+++ Y
Sbjct: 114 LTWVKCAGGASPPGLVFRPEASKSWAPVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYK 173
Query: 174 -GSSTIAANLSQDTISLATDIVPG--------YTFGCIQKATGNSVPP-QGLLGLGRGSL 223
GS+ + D+ ++A +PG GC G S G+L LG +
Sbjct: 174 EGSAGALGVVGTDSATIA---LPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKI 230
Query: 224 SLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGPIGQPKRI--KYTPLLKNPRRSSLYY 280
S ++ + +FSYCL A + +G L GP GQ R T L +P Y
Sbjct: 231 SFASRAAARFGGSFSYCLVDHLAPRNATGYLAFGP-GQVPRTPATQTKLFLDPAM-PFYG 288
Query: 281 VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
V + A+ V + +DIP A ++P +G G I+DSGT T L PAY AV + + +
Sbjct: 289 VKVDAVHVAGQALDIP--AEVWDPKSG-GVILDSGTTLTVLATPAYKAVVAALTKLL-AG 344
Query: 341 LTVTSLGGFDTCYS-------VPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMA 393
+ F+ CY+ P + P + + F+G P + + C+ +
Sbjct: 345 VPKVDFPPFEHCYNWTAPRPGAPEI-PKLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQ 403
Query: 394 AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ ++VI N+ QQ H +D+ N + CT
Sbjct: 404 ---EGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTCT 440
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 122/424 (28%), Positives = 185/424 (43%), Gaps = 55/424 (12%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLS---------------SL 75
S+ L++ H PC+ PS+ S S E+L D+ R +++ +
Sbjct: 422 SAVLRLTHRHGPCA--GPSRSAS-APSFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTA 478
Query: 76 AVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC--- 131
A + KSV +P G I + Y+V +GTP + +DT +D +WV C C
Sbjct: 479 ASSSKSVTIPANIGHSIG-TLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACY 537
Query: 132 --SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQD 185
+F+ A+S+++ + C A C ++ G G C + ++YG S D
Sbjct: 538 AQKDQLFDPAKSSSYSAVPCAADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSD 597
Query: 186 TISLA-TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLY-QSTFSYCLPS 243
T++L D V G+ FGC G GLL LGR +SL +QT Y FSYCLP
Sbjct: 598 TLTLTDADAVTGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLP- 656
Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQF 302
+ S +G L LG T LL + Y V L I V G+++ +P A
Sbjct: 657 -PSPSSTGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFA- 714
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL---TVTSLGGFDTCYSV---- 355
GT++D+GTV TRL P A R + + G DTCY+
Sbjct: 715 -----GGTVVDTGTVITRL-PPTAYAALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYG 768
Query: 356 PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
+ PT++L FSG TL D S+ CLA A + + ++ N+QQ++
Sbjct: 769 TVTLPTVSLTFSG-GATLKLDAPGFLSSG----CLAFATNSGDGDPA--ILGNVQQRSFA 821
Query: 416 ILYD 419
+ +D
Sbjct: 822 VRFD 825
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 122/452 (26%), Positives = 191/452 (42%), Gaps = 61/452 (13%)
Query: 9 LAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQAR 68
L F+F FSL+ T + +++ H S SP+ SK W+ ++L + +
Sbjct: 23 LPFIFHFSLTTATIT---TSTINLVIKLIHHESSLSPYN-SKDTIWDHYSHKILKQTFSN 78
Query: 69 LQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC 128
++S+L + + VV +++ IG P L MDT + WV C C
Sbjct: 79 -DYISNLVPSPRYVV-------------FLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPC 124
Query: 129 VGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY---GSST---IA 179
CS +F+ ++S+T+ NL C V N G C +++ Y GSS
Sbjct: 125 SSCSQQSVPIFDPSKSSTYSNLSCSECNKCDVVN-----GECPYSVEYVGSGSSQGIYAR 179
Query: 180 ANLSQDTISLATDIVPGYTFGCIQK--ATGNSVPPQGL---LGLGRGSLSLLAQTQNLYQ 234
L+ +TI + VP FGC +K + N P QG+ GLG G SLL +
Sbjct: 180 EQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLGSGRFSLLPS----FG 235
Query: 235 STFSYCLPSFKALSFS-GSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
FSYC+ + + ++ L LG + T + N LYYVNL AI +G R +
Sbjct: 236 KKFSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLNVIN----GLYYVNLEAISIGGRKL 291
Query: 294 DIPPGALQFNPT-TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG---F 349
DI P + + T +G IIDSG T L + + + L + +
Sbjct: 292 DIDPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPY 351
Query: 350 DTCYSVPIVA-----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAP---DNVN 400
CYS + P +T F+ G + L ++ I +T C+AM D+
Sbjct: 352 TLCYSGVVSQDLSGFPLVTFHFAEGAVLDLDVTSMFIQTTENEF-CMAMLPGNYFGDDYE 410
Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
S + I + QQN+ + YD+ R+ R C
Sbjct: 411 S-FSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 155/392 (39%), Gaps = 64/392 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPC--------------TGCVGCSSTVFNSAQST 142
Y VR ++GTPAQ L+ DT +D WV C + F +S
Sbjct: 95 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 143 TFKNLGCQAAQCKQ--------VPNPTCGGGACAFNLTYGSSTIA---ANLSQDTISLAT 191
T+ + C + C + P P G CA++ Y + A TI+L++
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTP---GSPCAYDYRYKDGSAARGTVGTESATIALSS 211
Query: 192 DI-----------VPGYTFGCIQKATGNSVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSY 239
+ G GC TG S G+L LG ++S + + + FSY
Sbjct: 212 SSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFSY 271
Query: 240 CLPSFKA-------LSFSGSLRLG---PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVG 289
CL + L+F + L P + TPL+ + R Y V++ AI V
Sbjct: 272 CLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVD 331
Query: 290 RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGF 349
++ IP + + G G I+DSGT T L PAY AV +++ + ++ F
Sbjct: 332 GELLKIPRDVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKL-ARFPRVAMDPF 388
Query: 350 DTCYSVPIVA--------PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS 401
+ CY+ + P + + F+G P + A + C+ + P
Sbjct: 389 EYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGP---WP 445
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
++VI N+ QQ H +D+ N RL R CT
Sbjct: 446 GISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477
>gi|147794033|emb|CAN68918.1| hypothetical protein VITISV_035156 [Vitis vinifera]
Length = 398
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 105/416 (25%), Positives = 165/416 (39%), Gaps = 94/416 (22%)
Query: 31 SSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKS---VVPIAS 87
S L + + PCS S+P S +E + +D++R+ F++S S +
Sbjct: 63 SQGLPITQKYGPCSGSGHSQPPSPQE----IXGRDESRVSFINSKCNQYTSGNLKNHAHN 118
Query: 88 GRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTF 144
+ ++V GTP Q + +DT + W C CV C S FB + S+T+
Sbjct: 119 NNLFDEDGNFLVDVAFGTPPQXFXLILDTGSSITWTQCKACVNCLQDSXRYFBXSASSTY 178
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA-TDIVPGYTFGCI 202
C + +N+TYG ST N T++L +D+ + FG
Sbjct: 179 SXGSCIPXTVEN-----------NYNMTYGDDSTSVGNYGCXTMTLEPSDVFQKFQFGXG 227
Query: 203 QKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IG 259
+ G+ G+LGLG+G LS ++QT + + FSYCLP ++ GSL G
Sbjct: 228 RNNKGDFGSGADGMLGLGQGQLSTVSQTASKFXKVFSYCLPEEDSI---GSLLFGEKATS 284
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
Q +K+T L+ P S L +SG F
Sbjct: 285 QSSSLKFTSLVNGPGTSGL---------------------------------XESGYYFV 311
Query: 320 RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVTLPQDNL 378
+L+ D+ SV ++ P I L F G +V L N+
Sbjct: 312 KLL--------DI---------------------SVDVLLPEIVLHFGGGADVRLNGTNI 342
Query: 379 LIHSTAGSITCLAMAA-APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ S A S CLA A + +N L +I N QQ + +LYD+ R+G C+
Sbjct: 343 VWGSDA-SRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 397
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 154/361 (42%), Gaps = 34/361 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTP +DT +D W +PCT C + +F+ S+T+ N+ +
Sbjct: 59 YLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSES 118
Query: 154 CKQVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATDI-----VPGYTFGCIQKA 205
C ++ + +C C + +Y +I L+Q+T++L + + G FGC
Sbjct: 119 CSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNN 178
Query: 206 TG-NSVPPQGLLGLGRGSLSLLAQTQNLY-QSTFSYCLPSFKA-------LSF-SGSLRL 255
G + G++GLGRG LSL++Q + + FS CL F +SF GS L
Sbjct: 179 NGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVL 238
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
G + TPL+ + Y+V LL I V + G+ P T +IDSG
Sbjct: 239 G-----NGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFNDGS-SLEPITKGNMVIDSG 292
Query: 316 TVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVP--IVAPTITLMFSGMNVT 372
T T L Y + + R +V + + + G+ CY P + T+T F G +V
Sbjct: 293 TPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTPTNLKGTTLTAHFEGADVL 352
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
L + I G I C A + N + N Q N+ I +D+ + C
Sbjct: 353 LTPTQIFIPVQDG-IFCFAFTSTFSN---EYGIYGNHAQSNYLIGFDLEKQLVSFKATDC 408
Query: 433 T 433
T
Sbjct: 409 T 409
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 115/421 (27%), Positives = 164/421 (38%), Gaps = 68/421 (16%)
Query: 57 SVLEMLAKDQARLQFLSSLAVAR--------------KSVV---PIASG--RQITQ---- 93
SVLE+ +D R+Q L + + K VV P+AS Q Q
Sbjct: 99 SVLELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVAT 158
Query: 94 --------SPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFK 145
S Y + +G+P + + +DT +D W+ C C C
Sbjct: 159 LESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDC-------------- 204
Query: 146 NLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKA 205
Q + P G + + T NL+ + S V FGC
Sbjct: 205 ---FQQNDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWN 261
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
G GLLGLGRG LS +Q Q+LY +FSYCL + + S + G+ K +
Sbjct: 262 RGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLI--FGEDKDLL 319
Query: 266 YTPLL--------KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
P L K + YYV + +I V V++IP + GTIIDSGT
Sbjct: 320 SHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTT 379
Query: 318 FTRLVAPAYTAVRD-VFRRRVGSNLTVTSLGGFDTCYSVP----IVAPTITLMFS-GMNV 371
+ PAY +++ + + G D C++V + P + + F+ G
Sbjct: 380 LSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADGAVW 439
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
P +N I + CLAM P S ++I N QQQN ILYD SRLG A
Sbjct: 440 NFPTENSFIWLNE-DLVCLAMLGTP---KSAFSIIGNYQQQNFHILYDTKRSRLGYAPTK 495
Query: 432 C 432
C
Sbjct: 496 C 496
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 152/363 (41%), Gaps = 34/363 (9%)
Query: 91 ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSSTVFNSAQSTTFK 145
I + Y++R IGTP+ L DT +D WV C T C ++ +++ S+TF
Sbjct: 90 IPNNGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFT 149
Query: 146 NLGCQAAQCKQVP--NPTCGG-GACAFNLTYGSSTIA-ANLSQDTISLA---TDIVPGYT 198
L C + C Q+P C G C + TYG ++ + LS D+I L
Sbjct: 150 LLPCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKIC 209
Query: 199 FGC--IQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRL 255
FGC K T + S G++GLG G LSL++Q + FSYCL F + S S L+
Sbjct: 210 FGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNS-KLKF 268
Query: 256 GP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
G I Q + TPL+ P YY+NL I VG + V T IID
Sbjct: 269 GEAAIVQGNGVVSTPLIIKPDL-PFYYLNLEGITVGAKTVK--------TGQTDGNIIID 319
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPI---VAPTITLMFSGMN 370
SG+ T L Y + + V FD C++ P + F+G +
Sbjct: 320 SGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGGD 379
Query: 371 VTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
V L N L+ I + + D + + N+ Q + + YD+ ++ A
Sbjct: 380 VVLKPMNTLVLIEDNLICSTVVPSHFDGI----AIFGNLGQIDFHVGYDIQGGKVSFAPT 435
Query: 431 LCT 433
C+
Sbjct: 436 DCS 438
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/380 (26%), Positives = 158/380 (41%), Gaps = 42/380 (11%)
Query: 75 LAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST 134
LA +VVP ++ + IGTP Q +D + + W C+ C+ C
Sbjct: 6 LADGGGAVVPFHWSPELYN----VANFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQ 61
Query: 135 ---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYG-SSTIAANLSQDTISLA 190
VF S+TFK C CK +P P C CAF+ G ++ DT ++
Sbjct: 62 DLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIG 121
Query: 191 TDIVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA--- 246
T FGC+ + +++ P G +GLGR SL+AQ + + FSYCL
Sbjct: 122 TAAPASLGFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQ---MKLTRFSYCLAPHDTGKN 178
Query: 247 --LSFSGSLRLGPIGQPKRIKYTPLLK---NPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
L S +L G +TP +K N S Y + L I+ G + +P
Sbjct: 179 SRLFLGASAKLAGGG-----AWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMP----- 228
Query: 302 FNPTTGAGTIIDSGTV--FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG-FDTCYSVPIV 358
G T++ V + LV Y + VG+ T T +G F+ C+ V
Sbjct: 229 ----RGRNTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGEPFEVCFPKAGV 284
Query: 359 --APTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSV--LNVIANMQQQN 413
AP + F +G +T+P N L ++ M+ A N+ ++ LN++ + QQ+N
Sbjct: 285 SGAPDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQEN 344
Query: 414 HRILYDVPNSRLGVARELCT 433
+L+D+ L C+
Sbjct: 345 VHLLFDLDKDMLSFEPADCS 364
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 98/358 (27%), Positives = 159/358 (44%), Gaps = 38/358 (10%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV--FNSAQSTTFKNLGCQAAQC 154
++ IG P L+ +DT +D W+ C C T+ F+ ++S+T++N C +A
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQTIPFFHPSRSSTYRNASCVSAP- 136
Query: 155 KQVPN--PTCGGGACAFNLTY----------GSSTIAANLSQDTISLATDIVPGYTFGCI 202
+P G C ++L Y + S D + +IV FGC
Sbjct: 137 HAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIV----FGCG 192
Query: 203 QKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK 262
Q +G + G+LGLG G+ S++ + + S FSYC S ++ ++ + G
Sbjct: 193 QDNSGFT-KYSGVLGLGPGTFSIVTRN---FGSKFSYCFGSLTNPTYPHNILILGNGAKI 248
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
TPL R YY++L AI G +++DI PG Q + G GT+ID+G T L
Sbjct: 249 EGDPTPLQIFQDR---YYLDLQAISFGEKLLDIEPGTFQRYRSQG-GTVIDTGCSPTILA 304
Query: 323 APAYTAVRDVFRRRVGSNL-TVTSLGGFDT-CYSVPIVA-----PTITLMFS-GMNVTLP 374
AY + + +G L V + T CY + P +T F+ G + L
Sbjct: 305 REAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALD 364
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++L + S +G CLAM N ++VI M QQN+ + Y++ ++ R C
Sbjct: 365 VESLFVSSESGDSFCLAMTM---NTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/363 (24%), Positives = 152/363 (41%), Gaps = 36/363 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT---GCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
+ + IGTP Q + +DT +D W C +++ A+S++F C
Sbjct: 89 HTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRL 148
Query: 154 CK--QVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATD--IVPGYTFGCIQKATGNS 209
C+ C C + YGS+T L+ +T + + FGC + +G+
Sbjct: 149 CETGSFNTKNCSRNKCIYTYNYGSATTKGELASETFTFGEHRRVSVSLDFGCGKLTSGSL 208
Query: 210 VPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR------ 263
G+LG+ LSL++Q Q FSYCL F + + + G + +
Sbjct: 209 PGASGILGISPDRLSLVSQLQ---IPRFSYCLTPFLDRNTTSHIFFGAMADLSKYRTTGP 265
Query: 264 IKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
I+ T L+ NP S+ YY V L+ I VG + +++P + GT +DSG L
Sbjct: 266 IQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDTTGMLP 325
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDT--CYSVPI-----------VAPTITLMFSGM 369
+ A+++ V + + G++ C+ +P V P + G
Sbjct: 326 SVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGA 385
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
+ L +D+ ++ +AG + CL +++ +I N QQQN +L+DV N A
Sbjct: 386 AMLLRRDSYMVEVSAGRM-CLVISSGARGA-----IIGNYQQQNMHVLFDVENHEFSFAP 439
Query: 430 ELC 432
C
Sbjct: 440 TQC 442
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 166/398 (41%), Gaps = 45/398 (11%)
Query: 59 LEMLAKDQAR--------LQFLSSLAVARKSV--------VPIASGRQITQSP-TYIVRA 101
+EM+ +D +R QF +SV A+ ITQ+ Y++
Sbjct: 31 VEMIHRDSSRSPFFRPTETQFQRVANAVHRSVNRANHFHKAHKAAKATITQNDGEYLISY 90
Query: 102 KIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
+G P L +DT +D W+ PC C ++ +F+ ++S T+K L + C+ V
Sbjct: 91 SVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTTCQSVE 150
Query: 159 NPTCGGG---ACAFNLTYGSSTIA-ANLSQDTISLATDIVPGYTF-----GCIQKATGN- 208
+ +C C + + YG + + +LS +T++L + F GC + T +
Sbjct: 151 DTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIGCGRNNTVSF 210
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQS---TFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
G++GLG G +SL+ Q + S FSYCL S +S + +
Sbjct: 211 EGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNFGDAAVVSGDGTV 270
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
TP++ + + YY+ L A VG ++ + +F IIDSGT T L
Sbjct: 271 STPIVTHDPK-VFYYLTLEAFSVGNNRIEFTSSSFRFGEK--GNIIIDSGTTLTLLPNDI 327
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHS 382
Y+ + V + L CY + AP I FSG +V L N I
Sbjct: 328 YSKLESAVADLVELDRVKDPLKQLSLCYRSTFDELNAPVIMAHFSGADVKLNAVNTFIEV 387
Query: 383 TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
G +TCLA ++ + + NM QQN + YD+
Sbjct: 388 EQG-VTCLAFISSK-----IGPIFGNMAQQNFLVGYDL 419
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 163/378 (43%), Gaps = 53/378 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLG 148
Y R ++G+P + + +DT +D WV C+ C GC T F+ STT +
Sbjct: 84 YFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVS 143
Query: 149 CQAAQCK---QVPNPTCGG--GACAFNLTYGSST------IAANLSQDTISLA----TDI 193
C +C Q + C C + YG + +A + DT+ L+ + I
Sbjct: 144 CSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQI 203
Query: 194 VPGY----TFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPS 243
Y +F C TG+ G+ G G+ +S+++Q +Q + FS+CL
Sbjct: 204 CQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKG 263
Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
S G L LG I +P I YTPL+ + +LY L +I V + + I P F
Sbjct: 264 DD--SGGGVLVLGEIVEPN-IVYTPLVPSQPHYNLY---LQSISVAGQTLAIDPSV--FG 315
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVA 359
++ GTI+DSGT L AY V N T L + CY SV V
Sbjct: 316 ASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNAR-TYLSKGNQCYLVTSSVNDVF 374
Query: 360 PTITLMFSGMNVTL--PQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P ++L F+G + PQD LL ++ G ++ C+ P + ++ ++ ++
Sbjct: 375 PQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTP---GQQITILGDLVLKDKI 431
Query: 416 ILYDVPNSRLGVARELCT 433
+YD+ N R+G C+
Sbjct: 432 FVYDIANQRVGWTNYDCS 449
>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
Length = 337
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/347 (27%), Positives = 155/347 (44%), Gaps = 48/347 (13%)
Query: 112 MAMDT------SNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGG 165
MA DT + AA P C G +S F+ ++S+TF + C + C+ C G
Sbjct: 1 MAFDTGLGISLARCAACRPGAPCDGLAS--FDPSRSSTFAPVPCGSPDCRS----GCSSG 54
Query: 166 ACAFNLTYGSSTIAANLSQDTISLATDI-VPGYTFGCIQKATGNSVPPQGLLGLGRGSLS 224
+ ++ ++QD ++L V +TFGC++ ++G + GLL L R S S
Sbjct: 55 STPSCPLTSFPFLSGAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRS 114
Query: 225 LLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG----PIGQPKRI-KYTPLLKNPRRSSLY 279
L ++ TFSYCLP S G L +G P + R+ PL+ +P + Y
Sbjct: 115 LASRLAAGAGGTFSYCLP-LSTTSSHGFLVIGEADVPHNRSARVTAVAPLVYDPAFPNHY 173
Query: 280 YVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGS 339
++L + +G R + IPP A ++D+ +T + Y +RD FRR +
Sbjct: 174 VIDLAGVSLGGRDIPIPP---------HAAMVLDTALPYTYMKPSMYAPLRDAFRRAMAR 224
Query: 340 NLTVTSLGGFDTCYSV-----PIVAPTITLMF---------SGMNVTLPQDNLLIHSTAG 385
++G DTCY+ ++ P + L F G + L D +L S G
Sbjct: 225 YPRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGADQMLYMSEPG 284
Query: 386 ---SITCLAMAAAP---DNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
S+TCLA AA P D + V+ + Q + +++DV ++G
Sbjct: 285 NFFSVTCLAFAALPSDGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIG 331
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 103/405 (25%), Positives = 167/405 (41%), Gaps = 33/405 (8%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
T + H SP SPF P+ L F K P +
Sbjct: 32 TADLIHRDSPKSPFY--NPMETSSQRLRNAIHRSVNRVF----HFTEKDNTPQPQIDLTS 85
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGC 149
S Y++ IGTP ++ DT +D W C C C + V F+ S+T+K++ C
Sbjct: 86 NSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSC 145
Query: 150 QAAQCKQVPNP---TCGGGACAFNLTYG-SSTIAANLSQDTISL-ATDIVP----GYTFG 200
++QC + N + C+++L+YG +S N++ DT++L ++D P G
Sbjct: 146 SSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIG 205
Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLG-- 256
C G + G++GLG G +SL+ Q + FSYCL P + + G
Sbjct: 206 CGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTN 265
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
I + TPL+ + + YY+ L +I VG + + + + ++ IIDSGT
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDSGT 322
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLP 374
T L Y+ + D + + G CYS + P IT+ F G +V L
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLD 382
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
N + + + C A +P ++ N+ Q N + YD
Sbjct: 383 SSNAFVQ-VSEDLVCFAFRGSPS-----FSIYGNVAQMNFLVGYD 421
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 103/405 (25%), Positives = 167/405 (41%), Gaps = 33/405 (8%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
T + H SP SPF P+ L F K P +
Sbjct: 32 TADLIHRDSPKSPFY--NPMETSSQRLRNAIHRSVNRVF----HFTEKDNTPQPQIDLTS 85
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGC 149
S Y++ IGTP ++ DT +D W C C C + V F+ S+T+K++ C
Sbjct: 86 NSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSC 145
Query: 150 QAAQCKQVPNP---TCGGGACAFNLTYG-SSTIAANLSQDTISL-ATDIVP----GYTFG 200
++QC + N + C+++L+YG +S N++ DT++L ++D P G
Sbjct: 146 SSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIG 205
Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLG-- 256
C G + G++GLG G +SL+ Q + FSYCL P + + G
Sbjct: 206 CGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTN 265
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
I + TPL+ + + YY+ L +I VG + + + + ++ IIDSGT
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDSGT 322
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNVTLP 374
T L Y+ + D + + G CYS + P IT+ F G +V L
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLD 382
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
N + + + C A +P ++ N+ Q N + YD
Sbjct: 383 SSNAFVQ-VSEDLVCFAFRGSPS-----FSIYGNVAQMNFLVGYD 421
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 164/384 (42%), Gaps = 62/384 (16%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSSTVFNSAQSTT 143
T + Y K+GTP + + +DT +D WV C C +G T+++ S+T
Sbjct: 81 TDTGLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASST 140
Query: 144 FKNLGCQAAQCKQVPN---PTCGGGA-CAFNLTY--GSSTIAANLSQ----DTISLATDI 193
+ C A C P CG C +++TY GSSTI + ++ D ++
Sbjct: 141 GSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQT 200
Query: 194 VPG---YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSF 244
P FGC + G+ + G+LG G + S+L+Q T + F++CL +
Sbjct: 201 QPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTI 260
Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
K G +G + QPK +K TPL+ + Y VNL I VG + +P A F P
Sbjct: 261 KG---GGIFSIGDVVQPK-VKTTPLVADKPH---YNVNLKTIDVGGTTLQLP--AHIFEP 311
Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVR-DVFRRRVGSNLTVTSLGGFDTCYSVPIVA---- 359
GTIIDSGT T L + V VF + ++T + GF C+ P
Sbjct: 312 GEKKGTIIDSGTTLTYLPELVFKEVMLAVFNKH--QDITFHDVQGF-LCFQYPGSVDDGF 368
Query: 360 PTITLMFSGMNVTLPQDNLLIH--------STAGSITCLAM--AAAPDNVNSVLNVIANM 409
PTIT F +D+L +H + + C+ A+ + ++ ++
Sbjct: 369 PTITFHF--------EDDLALHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDL 420
Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
N ++YD+ N +G C+
Sbjct: 421 VLSNKLVIYDLENRVIGWTDYNCS 444
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 162/376 (43%), Gaps = 48/376 (12%)
Query: 51 PLSWEESVLEMLAKDQARL-QFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
P + E + ++ A+D+AR + L SL P+ Y + ++GTP +
Sbjct: 36 PANHEMELSQLKARDEARHGRLLQSLGGVID--FPVDGTFDPFVVGLYYTKLRLGTPPRD 93
Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVP 158
+ +DT +D WV C C GC T F+ S T + C +C Q
Sbjct: 94 FYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSS 153
Query: 159 NPTCG--GGACAFNLTYG-----SSTIAANLSQDTISLATDIVPGYT----FGCIQKATG 207
+ C CA+ YG S +++ Q + + + +VP T FGC TG
Sbjct: 154 DSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTG 213
Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
+ V G+ G G+ +S+++Q +Q + FS+CL G L LG I +P
Sbjct: 214 DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENG--GGGILVLGEIVEP 271
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ +TPL+ + Y VNLL+I V + + I P F+ + G GTIID+GT L
Sbjct: 272 NMV-FTPLVPSQPH---YNVNLLSISVNGQALPINPSV--FSTSNGQGTIIDTGTTLAYL 325
Query: 322 VAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTITLMFSGMNVTL--P 374
AY + V ++ V S G + CY SV + P ++L F+G P
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKG--NQCYVITTSVGDIFPPVSLNFAGGASMFLNP 383
Query: 375 QDNLLIHSTAGSITCL 390
QD L+ + S C
Sbjct: 384 QDYLIQQNNVASALCF 399
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 149/349 (42%), Gaps = 43/349 (12%)
Query: 108 QTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
Q +A+D +W+ C C C S VF+ +S TF N+ + P
Sbjct: 109 QNYQLALDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLAN 168
Query: 165 GACAFNLTYGSSTIAAN-LSQDTISLAT---DIVP--GYTFGCIQKATG--NSVPPQGLL 216
GAC F++ Y +T A+ L++DT S D VP FGC + N G+L
Sbjct: 169 GACGFDIAYRDNTHASGYLARDTFSFPAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGIL 228
Query: 217 GLGRGSL-----SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPKRI--KY 266
GLG G + Q + FSYC P +S LR G P P + +
Sbjct: 229 GLGMGPAGKPPTAFTKQVLPAHGGRFSYC-PFVPGMSMYSYLRFGSDIPSHPPPNVHRQS 287
Query: 267 TPLLKNPRRSSLYYVNLLAIRVG-RRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
TP+L S Y+V L + VG R+ + P + N G ++D GT T + A
Sbjct: 288 TPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSA 347
Query: 326 YT----AVRDVFRRRVGSNLTVTSLGGFDTCYSVPI----VAPTITLMF-SGMNVTLPQD 376
Y AVR +RR G+++ V +TC P V P++TL F +G + + +
Sbjct: 348 YVHIDHAVRQHLQRR-GAHIVVVRG---NTCVQQPAPHHDVLPSMTLHFENGAWLRVMPE 403
Query: 377 NLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
++ + G C ++ D L VI QQ NHR ++D+ ++
Sbjct: 404 HVFMPFVVGGHHYQCFGFVSSTD-----LTVIGARQQVNHRFIFDLHDT 447
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 149/352 (42%), Gaps = 38/352 (10%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPN 159
IGTP Q +D + + W C+ C+ C VF S+TFK C CK +P
Sbjct: 60 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 119
Query: 160 PTCGGGACAFNLTYG-SSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSV-PPQGLLG 217
P C CA++ G ++ DT ++ T FGC+ + +++ P G +G
Sbjct: 120 PKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDTMGGPSGFIG 179
Query: 218 LGRGSLSLLAQTQNLYQSTFSYCLPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLK- 271
LGR SL+AQ + + FSYCL L S +L G +TP +K
Sbjct: 180 LGRTPWSLVAQMK---LTRFSYCLAPHDTGKNSRLFLGASAKLAGGG-----AWTPFVKT 231
Query: 272 --NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV--FTRLVAPAYT 327
N S Y + L I+ G + +P G T++ V + LV Y
Sbjct: 232 SPNDGMSQYYPIELEEIKAGDATITMP---------RGRNTVLVQTAVVRVSLLVDSVYQ 282
Query: 328 AVRDVFRRRVGSNLTVTSLGG-FDTCYSVPIV--APTITLMF-SGMNVTLPQDNLLIHST 383
+ VG+ T T +G F+ C+ V AP + F +G +T+P N L
Sbjct: 283 EFKKAVMASVGAAPTATPVGAPFEVCFPKAGVSGAPDLVFTFQAGAALTVPPANYLFDVG 342
Query: 384 AGSITCLAMAAAPDNVNSV--LNVIANMQQQNHRILYDVPNSRLGVARELCT 433
++ M+ A N+ ++ LN++ + QQ+N +L+D+ L C+
Sbjct: 343 NDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 394
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 108/430 (25%), Positives = 179/430 (41%), Gaps = 55/430 (12%)
Query: 45 PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGRQITQSPTYI 98
P + + PL + E+ A+D+ R + L R+S V P+ Y
Sbjct: 43 PLQRAFPLDEPVELSELRARDRVRHARIL-LGGGRQSSVGGVVDFPVQGSSDPYLVGLYF 101
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQ 150
+ K+G+P + +DT +D WV C+ C C + F++ S T ++ C
Sbjct: 102 TKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVTCS 161
Query: 151 AAQCKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTI--------SLATDIVPGY 197
C V T C ++ YG S + DT SL +
Sbjct: 162 DPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPI 221
Query: 198 TFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
FGC +G+ G+ G G+G LS+++Q ++ + FS+CL S G
Sbjct: 222 VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG--SGGG 279
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
LG I P + Y+PLL + Y +NLL+I V ++ +P A F + GTI
Sbjct: 280 VFVLGEILVPGMV-YSPLLPSQPH---YNLNLLSIGVNGQI--LPIDAAVFEASNTRGTI 333
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFS 367
+D+GT T LV AY + V +T+ G + CY S+ + P ++L F+
Sbjct: 334 VDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNG-EQCYLVSTSISDMFPPVSLNFA 392
Query: 368 GMNVTL--PQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G + PQD L + S+ C+ AP+ ++ ++ ++ +YD+
Sbjct: 393 GGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEE----QTILGDLVLKDKVFVYDLARQ 448
Query: 424 RLGVARELCT 433
R+G A C+
Sbjct: 449 RIGWANYDCS 458
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 103/389 (26%), Positives = 156/389 (40%), Gaps = 61/389 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---CVGCSSTVFNS------AQSTTFKNL 147
Y + K GTP QT +DT + W+PC C C+S N+ S + K +
Sbjct: 216 YSIDLKFGTPPQTFPFVLDTGSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFV 275
Query: 148 GCQAAQCKQV------------------PNPTCGGGACAFNLTYGSSTIAANLSQDTISL 189
GC+ +C V N C A+ + YG + A L + ++
Sbjct: 276 GCRNPKCAWVFGSDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGLGSTAGFLLSENLNF 335
Query: 190 ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKAL 247
V + GC + + P G+ G GRG SL AQ NL + FSYCL S F
Sbjct: 336 PAKNVSDFLVGC---SVVSVYQPGGIAGFGRGEESLPAQ-MNL--TRFSYCLLSHQFDES 389
Query: 248 SFSGSLRLGPI--GQPKR---IKYTPLLKNPRRS-----SLYYVNLLAIRVGRRVVDIPP 297
+ L + G+ K+ + YT LKNP + YY+ L I VG + V +P
Sbjct: 390 PENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPR 449
Query: 298 GALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN--LTVTSLGGFDTCYSV 355
L+ + G I+DSG+ T + P + V + F ++V + G C+ +
Sbjct: 450 RMLEPDVNGDGGFIVDSGSTLTFMERPIFDLVAEEFVKQVNYTRARELEKQFGLSPCFVL 509
Query: 356 PIVA-----PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN----- 404
A P + F G + LP N G + CL + + D+V
Sbjct: 510 AGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVS--DDVAGQGGAVGPA 567
Query: 405 -VIANMQQQNHRILYDVPNSRLGVARELC 432
++ N QQQN + D+ N R G + C
Sbjct: 568 VILGNYQQQNFYVECDLENERFGFRSQSC 596
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 127/277 (45%), Gaps = 36/277 (12%)
Query: 48 PSKPLSWEESVLEMLAKDQARLQFL---------SSLAVARKSVVPIASGRQITQSPTYI 98
P P++ + + +LA D++R S+ + + VP+ SG ++ Q+ Y+
Sbjct: 35 PEDPVARDRYLRRLLAADESRANSFQPRRNKDRASASTQSASAEVPLTSGIRL-QTLNYV 93
Query: 99 VRAKIG----TPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
+G +PA L + +DT +D WV PC+ C +F+ A S T+ + C A
Sbjct: 94 TTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNA 153
Query: 152 AQCKQ-------VPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLATDIVPGYTF 199
+ C P GA C + L YG + + L+ DT++L + G+ F
Sbjct: 154 SACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVF 213
Query: 200 GCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
GC G GL+GLGR LSL++QT + Y FSYCLP+ + SGSL LG
Sbjct: 214 GCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGD 273
Query: 260 QPKR-------IKYTPLLKNPRRSSLYYVNLLAIRVG 289
+ YT ++ +P + Y++N+ VG
Sbjct: 274 DAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVG 310
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 162/386 (41%), Gaps = 43/386 (11%)
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS-------ST 134
+P++SG T + Y VR ++GTPAQ ++ DT +D WV C G + +
Sbjct: 87 AMPLSSG-AYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPAR 145
Query: 135 VFNSAQSTTFKNLGCQAAQCKQ-VP----NPTCGGGACAFNLTYGSSTIAANL---SQDT 186
VF +A S ++ + C + C VP N + CA++ Y + A + T
Sbjct: 146 VFRTAASKSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSAT 205
Query: 187 ISLATDI--------------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQN 231
I+L++ + G GC G S G+L LG ++S ++
Sbjct: 206 IALSSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAA 265
Query: 232 LYQSTFSYCLPSFKALSFSGS-LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
+ FSYCL A + S L GP G TPLL + R + Y V + A+ V
Sbjct: 266 RFGGRFSYCLVDHLAPRNATSYLTFGP-GATAPAAQTPLLLDRRMTPFYAVTVDAVYVAG 324
Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
+DIP + GA I+DSGT T L PAY AV + + + L ++ F+
Sbjct: 325 EALDIPADVWDVDRNGGA--ILDSGTSLTILATPAYRAVVTALSKHL-AGLPRVTMDPFE 381
Query: 351 TCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
CY+ + P + + F+G P + A + C+ + + ++VI
Sbjct: 382 YCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQ---EGSWPGVSVI 438
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N+ QQ H +D+ + L C
Sbjct: 439 GNILQQEHLWEFDLRDRWLRFKHTRC 464
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 93/275 (33%), Positives = 125/275 (45%), Gaps = 34/275 (12%)
Query: 171 LTYGSSTIAAN----LSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLL 226
LTYG S AAN L+ DT + VPG FGC + G+ G++G+GRG+LSL+
Sbjct: 120 LTYGGS--AANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLI 177
Query: 227 AQTQNLYQSTFSYCLPSFKAL---SFSGSLRLGPIGQP--KRIKYTPLLKNPRRSSLYYV 281
+Q Q FSY L + +A S +R G P KR + TPLL + YYV
Sbjct: 178 SQLQF---GKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYV 234
Query: 282 NLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN 340
NL +RV G R+ IP G G I+ S T T L AY DV R V S
Sbjct: 235 NLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAY----DVVRAAVASR 290
Query: 341 LTVTSLGG-----FDTCYSVPIVA----PTITLMFS-GMNVTLPQDNLLIHSTAGSITCL 390
+ + ++ G D CY+ +A P +TL+F G ++ L N + CL
Sbjct: 291 IGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECL 350
Query: 391 AMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
M P SVL + Q ++YDV RL
Sbjct: 351 TM--LPSQGGSVL---GTLLQTGTNMIYDVDAGRL 380
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 160/368 (43%), Gaps = 55/368 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F S T++ + C
Sbjct: 89 YTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC---- 144
Query: 154 CKQVPNPTCGGGA--CAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATG 207
P+ C G C ++ Y S+ + L +D +S +++ P FGC TG
Sbjct: 145 ---TPDCNCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCENDETG 201
Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
+ S G++GLGRG LS++ Q + + +FS C G++ LG I P+
Sbjct: 202 DLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV--GGGAMILGGISPPED 259
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTR 320
+ +T +P RS Y +NL + V + LQ NP GT++DSGT +
Sbjct: 260 MVFTH--SDPDRSPYYNINLKEMHVAGK-------KLQLNPKVFDGKHGTVLDSGTTYAY 310
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----------SVPIVA---PTITLMF- 366
L A+ A + + S + + G D Y V +A P + ++F
Sbjct: 311 LPETAFLAFKRAIMKERNS---LKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFE 367
Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+G ++L P++ L HS CL + + N ++ + +N ++YD NS++
Sbjct: 368 NGHKLSLSPENYLFRHSKVRGAYCLGVFS---NGRDPTTLLGGIFVRNTLVMYDRENSKI 424
Query: 426 GVARELCT 433
G + C+
Sbjct: 425 GFWKTNCS 432
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 107/408 (26%), Positives = 180/408 (44%), Gaps = 36/408 (8%)
Query: 33 TLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
T + H SP SPF P++ S + + + + +R+ + +++K A +
Sbjct: 32 TADLIHRDSPKSPFYNPTETSS--QRLRNAIHRSVSRVFHFTD--ISQKDASDNAPQIDL 87
Query: 92 T-QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
T S Y++ +GTP ++ DT +D W C C C + V F+ S+T+K++
Sbjct: 88 TSNSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDV 147
Query: 148 GCQAAQCKQVPN-PTCG--GGACAFNLTYGS-STIAANLSQDTISL-ATDIVP----GYT 198
C ++QC + N +C C+++ +YG S N++ DT++L +TD P
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207
Query: 199 FGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLG 256
GC G + G++GLG G++SL+ Q + FSYCL P + + G
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFG 267
Query: 257 --PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT-IID 313
+ + TPL+ + + YY+ L +I VG + V P + +G G IID
Sbjct: 268 TNAVVSGTGVVSTPLIAKSQE-TFYYLTLKSISVGSKEVQYPGS----DSGSGEGNIIID 322
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLMFSGMNV 371
SGT T L Y+ + D + + G CYS + P IT+ F G +V
Sbjct: 323 SGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKVPAITMHFDGADV 382
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
L N + + + C A +P ++ N+ Q N + YD
Sbjct: 383 NLKPSNCFVQ-ISEDLVCFAFRGSPS-----FSIYGNVAQMNFLVGYD 424
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 167/376 (44%), Gaps = 69/376 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTF 144
Y +GTP + L+A+DT +D WVPC C+ C+ ++ ++STT
Sbjct: 102 YYTWVDVGTPNTSFLVALDTGSDLFWVPCD-CIQCAPLSSYHGSLDRDLGIYKPSESTTS 160
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATDIVPGYT---- 198
++L C C T C +N+ Y S ++ L +D + L D G+
Sbjct: 161 RHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHL--DSREGHAPVNA 218
Query: 199 ---FGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSF 249
GC +K +G + P GLLGLG +S+ LA+ L +++FS C FK
Sbjct: 219 SVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARA-GLVRNSFSMC---FKKDD- 273
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
SG + G G P + + TP + + Y VN+ +G + T GAG
Sbjct: 274 SGRIFFGDQGVPTQ-QSTPFVPMNGKLQTYAVNVDKYCIGHKC------------TEGAG 320
Query: 310 --TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTI 362
++D+GT FT L AY ++ F +++ ++ + F+ CYS +P V PTI
Sbjct: 321 FQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDV-PTI 379
Query: 363 TLMFS------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
TL F+ +N LP ++ ++ CLA+ +P+ V +I + +
Sbjct: 380 TLTFAENKSFQAVNPILPFND---RQGEFAVFCLAVLPSPEPV----GIIGQNFMVGYHV 432
Query: 417 LYDVPNSRLGVARELC 432
++D N +LG R C
Sbjct: 433 VFDRENMKLGWYRSEC 448
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 167/376 (44%), Gaps = 69/376 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTF 144
Y +GTP + L+A+DT +D WVPC C+ C+ ++ ++STT
Sbjct: 102 YYTWVDVGTPNTSFLVALDTGSDLFWVPCD-CIQCAPLSSYHGSLDRDLGIYKPSESTTS 160
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATDIVPGYT---- 198
++L C C T C +N+ Y S ++ L +D + L D G+
Sbjct: 161 RHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHL--DSREGHAPVNA 218
Query: 199 ---FGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSF 249
GC +K +G + P GLLGLG +S+ LA+ L +++FS C FK
Sbjct: 219 SVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARA-GLVRNSFSMC---FKKDD- 273
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
SG + G G P + + TP + + Y VN+ +G + T GAG
Sbjct: 274 SGRIFFGDQGVPTQ-QSTPFVPMNGKLQTYAVNVDKYCIGHKC------------TEGAG 320
Query: 310 --TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTI 362
++D+GT FT L AY ++ F +++ ++ + F+ CYS +P V PTI
Sbjct: 321 FQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDV-PTI 379
Query: 363 TLMFS------GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
TL F+ +N LP ++ ++ CLA+ +P+ V +I + +
Sbjct: 380 TLTFAENKSFQAVNPILPFND---RQGEFAVFCLAVLPSPEPV----GIIGQNFMVGYHV 432
Query: 417 LYDVPNSRLGVARELC 432
++D N +LG R C
Sbjct: 433 VFDRENMKLGWYRSEC 448
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 107/433 (24%), Positives = 180/433 (41%), Gaps = 34/433 (7%)
Query: 21 LNPICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK 80
L PI D T+++ + SP SPF + + ++ + + +R+ S +
Sbjct: 19 LVPI-DAAKDGFTVELINRDSPKSPFYNPRETPTQR-IVSAVRRSMSRVHHFSPTKNS-D 75
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFN 137
A I+ Y+++ +GTPA +L DT +D W C C C + +F+
Sbjct: 76 IFTDTAQSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFD 135
Query: 138 SAQSTTFKNLGCQAAQCKQVPN-PTCGGGA---CAFNLTYGS-STIAANLSQDTISLATD 192
S+T++++ C QC + +C G C ++ +YG S + N++ DTI+L +
Sbjct: 136 PKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGST 195
Query: 193 -----IVPGYTFGCIQKATGNSVPPQGLLGLGRGS-LSLLAQTQNLYQSTFSYCL-PSFK 245
++P GC G+ + G +SL++Q + FSYCL P
Sbjct: 196 SGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSS 255
Query: 246 ALSFSGSLRLGPIG--QPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
+ S L G G ++ TPL+ K+P + Y++ L A+ VG + P +
Sbjct: 256 NATNSSKLNFGSNGIVSGGGVQSTPLISKDP--DTFYFLTLEAVSVGSERIKFPGSSFG- 312
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAP 360
T+ IIDSGT T ++ + + V G CYS+ + P
Sbjct: 313 --TSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDADLKFP 370
Query: 361 TITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
+IT F G +V L N + S T L A P N ++ N+ Q N + YD+
Sbjct: 371 SITAHFDGADVKLNPLNTFVQV---SDTVLCFAFNPINSGAIF---GNLAQMNFLVGYDL 424
Query: 421 PNSRLGVARELCT 433
+ CT
Sbjct: 425 EGKTVSFKPTDCT 437
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 160/372 (43%), Gaps = 43/372 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R ++GTP + + +DT +D WV C C C T F+ S+T L
Sbjct: 41 YYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLS 100
Query: 149 CQAAQC---KQVPNPTCGGGA-CAFNLTY--GSSTIAANLSQD-------TISLATDIVP 195
C ++C Q+ C C ++ Y GS T+ +S + + +
Sbjct: 101 CIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASA 160
Query: 196 GYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF 249
TFGC +G+ P G+ G G+ LS+++Q +Q L FS+CL A
Sbjct: 161 KITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEG--ADPG 218
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAG 309
G L LG I +P + YTP++ + Y +NL I V + + I P F T G
Sbjct: 219 GGILVLGEITEPGMV-YTPIVPSQPH---YNLNLQGIAVNGQQLSIDPQV--FATTNTRG 272
Query: 310 TIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVPIVAPTITLMF 366
TIID GT L AY + V + L G F T +S+ + P++TL F
Sbjct: 273 TIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVHSIDEIFPSVTLYF 332
Query: 367 SGMNVTL-PQDNLLIHSTAGS--ITCLAMAAAPDNV--NSVLNVIANMQQQNHRILYDVP 421
G + L P+D L+ + S + C+ + +S + ++ ++ ++ +YD+
Sbjct: 333 EGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDKVFVYDLE 392
Query: 422 NSRLGVARELCT 433
N R+G C+
Sbjct: 393 NQRIGWTSFDCS 404
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 106/436 (24%), Positives = 169/436 (38%), Gaps = 94/436 (21%)
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC---------------- 125
+P++SG T + Y VR ++GTPA+ L+ DT +D WV C
Sbjct: 93 AMPLSSG-AYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAA 151
Query: 126 --------------TGCVGCSSTVFNSAQSTTFKNLGCQAAQC--------KQVPNPTCG 163
+ VF +S T+ + C + C P P
Sbjct: 152 PASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTP--- 208
Query: 164 GGACAFNLTYGSSTIA-ANLSQDTISLATD-----------IVPGYTFGCIQKATGNS-V 210
G CA++ Y + A + D+ ++A + G GC TG+S +
Sbjct: 209 GSPCAYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFL 268
Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS-LRLGP-----IGQPKRI 264
G+L LG ++S ++ + FSYCL A + S L GP P +
Sbjct: 269 ASDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKT 328
Query: 265 ------------------KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
+ TPLL + R Y V + I V ++ IP L ++
Sbjct: 329 ACAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIP--RLVWDVAK 386
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS---------VPI 357
G G I+DSGT T LV+PAY AV +++ + L ++ FD CY+ + +
Sbjct: 387 GGGAILDSGTSLTVLVSPAYRAVVAALNKKL-AGLPRVTMDPFDYCYNWTSPSTGEDLTV 445
Query: 358 VAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
P + + F+G P + A + C+ + + ++VI N+ QQ H
Sbjct: 446 AMPELAVHFAGSARLQPPAKSYVIDAAPGVKCIGLQ---EGEWPGVSVIGNILQQEHLWE 502
Query: 418 YDVPNSRLGVARELCT 433
+D+ N RL R CT
Sbjct: 503 FDLKNRRLRFKRSRCT 518
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 166/369 (44%), Gaps = 41/369 (11%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTG---------CVGCSSTVFNSAQSTTFKNLG 148
+V IGTP Q + +DT + +W+ C + F+ + S++F L
Sbjct: 67 VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLP 126
Query: 149 CQAAQCK-QVPN---PT-CGGGA-CAFNLTYGSSTIA-ANLSQDTISLATDI-VPGYTFG 200
C CK ++P+ PT C C ++ Y T+A NL ++ + + + P G
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILG 186
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
C Q +T N +G+LG+ G LS ++Q + S FSYC+PS + +G LG
Sbjct: 187 CAQASTEN----RGILGMNHGRLSFISQAK---ISKFSYCVPSRTGSNPTGLFYLGDNPN 239
Query: 261 PKRIKYTPLLKNPRRSS-------LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIID 313
+ KY +L P S Y + + AI++ + ++IPP A + + T+ID
Sbjct: 240 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 299
Query: 314 SGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVPIVAPT------ITLM 365
SG+ T LV AY V++ R VG+ + + D C+ + A I+
Sbjct: 300 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 359
Query: 366 F-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F +G+ + + + ++ + C+ + + + + N+I + QQN + YD+ N R
Sbjct: 360 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS-ERLGIGSNIIGTVHQQNMWVEYDLANKR 418
Query: 425 LGVARELCT 433
+G C+
Sbjct: 419 VGFGGAECS 427
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/301 (31%), Positives = 133/301 (44%), Gaps = 43/301 (14%)
Query: 61 MLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
+ DQ RL+ + V+ PI+ I Y R +GTP Q + +DT ++
Sbjct: 9 LRKHDQRRLRRMLPEVVS----FPISGDNDIFAMGLYYTRISLGTPPQQFYVDVDTGSNV 64
Query: 121 AWVPCTGCVGCSS--------TVFNSAQSTTFKNLGCQAAQCKQVPNP-TCGGG--ACAF 169
AWV C C GC + F+ +STT ++ C A+C + C +C +
Sbjct: 65 AWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQCSPERLSCPY 124
Query: 170 NLTYGSSTIAANLSQDTI----------SLATDIVPGYTFGCIQKATGNSVPPQGLLGLG 219
+L YG + A + + S A FGC TG S GLLG G
Sbjct: 125 SLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTG-SWSVDGLLGFG 183
Query: 220 RGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRS 276
++SL LAQ QN+ + F++CL +S GSL +G I +P + YTP++
Sbjct: 184 PTTVSLPNQLAQ-QNISVNIFAHCLQG--DVSGRGSLVIGTIREPDLV-YTPMVFGEDH- 238
Query: 277 SLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRR 336
Y V LL I + R V P F+ G IIDSGT T LV PAY D FRR
Sbjct: 239 --YNVQLLNIGISGRNVTTPA---SFDLEYTGGVIIDSGTTLTYLVQPAY----DEFRRG 289
Query: 337 V 337
V
Sbjct: 290 V 290
>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 99/206 (48%), Gaps = 16/206 (7%)
Query: 237 FSYCLPSFKALSFSGSLRLGPIG----QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
FSYCL S + + SL G + P +I TPL++NP S YY+ L I VG +
Sbjct: 180 FSYCLTSIHE-NKTSSLLFGSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTL 238
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
+ IP A Q G I+DSGT T L A+ +++ F + + +S G D C
Sbjct: 239 LPIPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKNAFISQTELQVANSSTTGLDLC 298
Query: 353 YSVP------IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
+ +P + P + F G+++ LP +N ++ + CLA+ A L++
Sbjct: 299 FHLPVKNAAEVKVPKLIFHFKGLDLALPVENYMVSDPEMGLICLAIDAT-----GSLSIF 353
Query: 407 ANMQQQNHRILYDVPNSRLGVARELC 432
N+QQQN +L+D+ S L + C
Sbjct: 354 GNIQQQNMLVLHDLKKSTLSLVPTQC 379
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 91/358 (25%), Positives = 152/358 (42%), Gaps = 82/358 (22%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
V +G+P QT+ M +DT ++ +W+ C S VF+ +S+++ + C + C+
Sbjct: 377 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS-VFDPLRSSSYSPIPCTSPTCR--- 432
Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
T+ +T GL+G+
Sbjct: 433 -----------TRTHSKTT------------------------------------GLIGM 445
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKN---- 272
RGSLS + Q + FSYC+ + SG L G K +KYTPL++
Sbjct: 446 NRGSLSFVTQ---MGLQKFSYCISGQDS---SGILLFGESSFSWLKALKYTPLVQISTPL 499
Query: 273 PRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
P + Y V L I+V ++ +P + T T++DSGT FT L+ P YTA+++
Sbjct: 500 PYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN 559
Query: 332 VFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLL 379
F R+ ++L V G D CY VP+ PT+TLMF G +++ + L+
Sbjct: 560 EFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERLM 619
Query: 380 -----IHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ + S+ C + + + +I + QQN + +D+ SR+G A C
Sbjct: 620 YRVPGVIRGSDSVYCFTFGNS-ELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 676
>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
Length = 468
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 137/343 (39%), Gaps = 42/343 (12%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCT-----GCVGCSSTVFNSAQSTTFKNLGCQAAQCKQV 157
I P M++DTS D W+ C C + +F+ +S T + C +A C ++
Sbjct: 155 IDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL 214
Query: 158 PNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN-SVPPQGLL 216
YG + + GN S G +
Sbjct: 215 GR-------------YGRWLLQQPVPVLRRLRRRQGQ--PRGRTCHAVRGNFSASTSGTM 259
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRR- 275
LG G SLL+QT + + FSYC+P + F G R TPL++NP
Sbjct: 260 SLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSII 319
Query: 276 SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRR 335
+LY V L I VG R +++PP G ++DS + T+L AY A+R FR
Sbjct: 320 PTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRLAFRS 373
Query: 336 RVGSNLTVTS-LGGFDTCYS----VPIVAPTITLMFSGMNVT-LPQDNLLIHSTAGSITC 389
+ + V G DTCY + P ++L+F G V L +++ C
Sbjct: 374 AMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVEG------C 427
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LA P + L I N+QQQ H +LYDV +G R C
Sbjct: 428 LAFVPTPGDF--ALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 152/357 (42%), Gaps = 31/357 (8%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTP + DT +D W +PCT C + +F+ S+++ N+ C
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTES 119
Query: 154 CKQVPNPTCGGG--ACAFNLTYGSSTIAAN-LSQDTISLATDI-----VPGYTFGCIQKA 205
C ++ + C C + +Y ++I L+Q+T++L + G FGC
Sbjct: 120 CNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNN 179
Query: 206 TGNSVPPQGLLGLGRGSLSLLAQTQNLYQS---TFSYCLPSFKAL-SFSGSLRLGPIGQ- 260
+G + GL+GLGRG LSL++Q + + FS CL F S + + G +
Sbjct: 180 SGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSEV 239
Query: 261 -PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP-PGALQFNPTTGAGTIIDSGTVF 318
TPL+ + + Y+ LL I V +++P T +IDSGT
Sbjct: 240 LGNGTVSTPLIS--KDGTGYFATLLGISV--EDINLPFSNGSSLGTITKGNILIDSGTTI 295
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP--IVAPTITLMFSGMNVTLPQD 376
T L Y + + R +V L + G++ CY P + PT+T+ F G +V L
Sbjct: 296 TYLPEEFYHRLIEQVRNKVA--LEPFRIDGYELCYQTPTNLNGPTLTIHFEGGDVLLTPA 353
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ I + C A+ + N N Q N+ I +D+ + CT
Sbjct: 354 QMFIPVQDDNF-CFAVF----DTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATDCT 405
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 108/430 (25%), Positives = 189/430 (43%), Gaps = 58/430 (13%)
Query: 45 PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT----YIVR 100
P + + PL+ + + + A+D+AR + V VV + Q T P Y +
Sbjct: 31 PLERAIPLNQQVELEALRARDRARHGRILQGVVG--GVVDFS--VQGTSDPYFVGLYFTK 86
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAA 152
K+G+PA+ + +DT +D W+ C C C + F++A S+T + C
Sbjct: 87 VKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCADP 146
Query: 153 QCK---QVPNPTCGGGA--CAFNLTYGSST------IAANLSQDTISLATDIVPGYT--- 198
C Q C A C++ YG + ++ + DT+ L +V +
Sbjct: 147 ICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTI 206
Query: 199 -FGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
FGC +G+ G+ G G G+LS+++Q ++ + FS+CL + + G
Sbjct: 207 VFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE--NGGG 264
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
L LG I +P I Y+PL+ + Y +NL +I V ++ +P + F T GTI
Sbjct: 265 VLVLGEILEPS-IVYSPLVPSLPH---YNLNLQSIAVNGQL--LPIDSNVFATTNNQGTI 318
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMF- 366
+DSGT LV AY D V S + + + CY SV + P ++L F
Sbjct: 319 VDSGTTLAYLVQEAYNPFVDAITAAV-SQFSKPIISKGNQCYLVSNSVGDIFPQVSLNFM 377
Query: 367 SGMNVTLPQDNLLIHS---TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G ++ L ++ L+H + ++ C+ V ++ ++ ++ +YD+ N
Sbjct: 378 GGASMVLNPEHYLMHYGFLDSAAMWCIGF----QKVERGFTILGDLVLKDKIFVYDLANQ 433
Query: 424 RLGVARELCT 433
R+G A C+
Sbjct: 434 RIGWADYNCS 443
>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 127/452 (28%), Positives = 189/452 (41%), Gaps = 69/452 (15%)
Query: 48 PSKPLSWEESVLEMLAKDQ-ARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTP 106
P P + + L LA+ AR L + + P+ + Y +GTP
Sbjct: 36 PLPPAAAQHHPLSRLARASLARASRLRGHHQGQAASSPVRAALYPHSYGGYAFSLSLGTP 95
Query: 107 AQTLLMAMDTSNDAAWVPCTG---CVGCSST-----VFNSAQSTT------------FKN 146
Q L + +DT + WVPCT C CS+ VF+ S++ + +
Sbjct: 96 PQPLPVLLDTGSHLTWVPCTSNYQCQNCSAAAGSFPVFHPKSSSSSLLVSCSSPSCLWIH 155
Query: 147 LGCQAAQCKQVPNP----TCGGGACAFN------LTYGSSTIAANLSQDTISLATDIVPG 196
+ C + P T A A N + YGS + A L DT+ L+
Sbjct: 156 SKSHLSDCARDSAPCRPSTANCSATATNVCPPYLVVYGSGSTAGLLVSDTLRLSPRGAAS 215
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK---ALSFSGSL 253
F PP GL G GRG+ S+ AQ L + FSYCL S + + SG L
Sbjct: 216 RNFAVGCSLASVHQPPSGLAGFGRGAPSVPAQ---LGVNKFSYCLLSRRFDDDAAISGEL 272
Query: 254 RLGP--IGQPK-RIKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVDIPPGALQ-FNPT 305
LG G+ K ++Y PLLKN P S YY++L I VG + V +P AL +
Sbjct: 273 VLGASSAGKAKAMMQYAPLLKNAGARPPYSVYYYLSLTGIAVGGKSVALPARALAPVSGG 332
Query: 306 TGAGTIIDSGTVFTRL----VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-- 359
G G IIDSGT FT L P A+ R + V G C+++P A
Sbjct: 333 GGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSKDVEGALGLRPCFALPAGART 392
Query: 360 ---PTITLMFS-GMNVTLPQDNLLIHS-----TAGSITCLAMAAAPDNVNSVLN------ 404
P ++L FS G + LP +N + + A CLA+ + + +
Sbjct: 393 MDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVVSDVSSASGGAGVSGGGG 452
Query: 405 ---VIANMQQQNHRILYDVPNSRLGVARELCT 433
++ + QQQN+++ YD+ +RLG ++ C+
Sbjct: 453 PAIILGSFQQQNYQVEYDLEKNRLGFRQQPCS 484
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/413 (24%), Positives = 164/413 (39%), Gaps = 44/413 (10%)
Query: 51 PLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPI-ASGRQITQSPTYIVRAKIGTPAQT 109
P++ E+ + M AR ++L + V Q ++ + V +G P
Sbjct: 21 PVTPEDHIQHMTDISSARFKYLQNSIVKELGSSDFQVDVHQAIKTSLFFVNFSVGQPPVP 80
Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST-----VFNSAQSTTFKNLGCQAAQCKQVPNPTCGG 164
MDT + W+ C C CSS VFN A S+TF C C+ PN C
Sbjct: 81 QFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCSS 140
Query: 165 GACAFNLTYGSSTIAAN-LSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ--GLL 216
C + Y S T + L+++ ++ T + FGC + G + + G+L
Sbjct: 141 NKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHE-NGEQLESEFTGIL 199
Query: 217 GLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-GSLRLGP----IGQPKRIKYTPLLK 271
GLG SL Q S FSYC+ ++ L LG +G P I++
Sbjct: 200 GLGAKPTSLAVQL----GSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFET--- 252
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
+ +YY+NL I VG + ++I P + + G I+D+GT++T L AY + +
Sbjct: 253 ---ENGIYYMNLEGISVGDKQLNIEPVVFK-RRGSRTGVILDTGTLYTWLADIAYRELYN 308
Query: 332 VFRRRVGSNLTVTSLGGFDTCY-----SVPIVAPTITLMFSG-----MNVTLPQDNLLIH 381
+ + L F CY I P +T F+G M T +
Sbjct: 309 EIKSILDPKLERFWFRDF-LCYHGRVNEELIGFPVVTFHFAGGAELAMEATSMFYPMTES 367
Query: 382 STAGSITCLAMAAAPDNVNSV--LNVIANMQQQNHRILYDVPNSRLGVARELC 432
T ++ C+++ ++ I M QQ + I YD+ + + R C
Sbjct: 368 DTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERNIYLQRIDC 420
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 161/368 (43%), Gaps = 55/368 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F S+T+K + C
Sbjct: 88 YTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQC---- 143
Query: 154 CKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKA 205
NP+C G C + Y S+ + L++D +S +++ P FGC
Sbjct: 144 -----NPSCNCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETVE 198
Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
TG S G++GLGRG LS++ Q + + ++FS C + G++ LG I P
Sbjct: 199 TGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVV--GGAMVLGNIPPP 256
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT---GAGTIIDSGTV 317
+ + +P RS+ Y + L + V G+R L+ NP GT++DSGT
Sbjct: 257 PDMVFA--HSDPYRSAYYNIELKELHVAGKR--------LKLNPRVFDGKHGTVLDSGTT 306
Query: 318 FTRLVAPAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS--------VPIVAPTITLMF- 366
+ L A+ A +D + + + D C+S + + P + ++F
Sbjct: 307 YAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVFG 366
Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+G ++L P++ L H+ CL + + ++L I +N + YD N ++
Sbjct: 367 NGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIV---VRNTLVTYDRDNDKI 423
Query: 426 GVARELCT 433
G + C+
Sbjct: 424 GFWKTNCS 431
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 160/372 (43%), Gaps = 48/372 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R ++G+P + + +DT +D WV C+ C GC + F+ S T +
Sbjct: 90 YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149
Query: 149 CQAAQCK---QVPNPTCGG--GACAFNLTYGSST------IAANLSQDTI---SLATDIV 194
C +C Q + C C + YG + ++ L DTI S+ +
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209
Query: 195 PGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
FGC TG+ P G+ G G+ +S+++Q +Q + FS+CL S
Sbjct: 210 APIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDD--S 267
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G L LG I +P I YTPL+ + Y +NL +I V + + I P F ++
Sbjct: 268 GGGILVLGEIVEPN-IVYTPLVPSQPH---YNLNLQSIYVNGQTLAIDPSV--FATSSNQ 321
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITL 364
GTIIDSGT L AY V +++ L + CY S+ V P ++L
Sbjct: 322 GTIIDSGTTLAYLTEAAYDPFISAITSTVSPSVS-PYLSKGNQCYLTSSSINDVFPQVSL 380
Query: 365 MFSGMN--VTLPQDNLLIHST--AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
F+G + +PQD L+ S+ ++ C+ + ++ ++ ++ +YD+
Sbjct: 381 NFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQ---GQEITILGDLVLKDKIFVYDI 437
Query: 421 PNSRLGVARELC 432
R+G A C
Sbjct: 438 AGQRIGWANYDC 449
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 151/363 (41%), Gaps = 48/363 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y R KIGTP + +DT + +VPC+ C C + F+ A S+++K L C +
Sbjct: 35 YTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGSE- 93
Query: 154 CKQVPNPTCGGGACAFNLTY-----GSSTIAANLSQDTISLATDIVPG---YTFGCIQKA 205
C G C + Y ST + L +D I + G FGC
Sbjct: 94 --------CSTGFCDGSRKYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGCETAE 145
Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
TG+ G++GLGRG LS++ Q +N + FS C G++ LG P
Sbjct: 146 TGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMD--EGGGAMILGGFQPP 203
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
K + +T +P RS Y + L IRVG + + P GT++DSGT +
Sbjct: 204 KDMVFTA--SDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGK----YGTVLDSGTTYAYF 257
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYS--------VPIVAPTITLMF-SGMN 370
A+ A + + +VGS V D CY+ + P++ +F G +
Sbjct: 258 PGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQS 317
Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
VTL P++ L H+ CL + D + +I +N + Y+ + +G +
Sbjct: 318 VTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIV----RNMLVTYNRGKASIGFLK 373
Query: 430 ELC 432
C
Sbjct: 374 TKC 376
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 112/445 (25%), Positives = 182/445 (40%), Gaps = 69/445 (15%)
Query: 31 SSTLQVFHVFSPCS----PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR----KSV 82
S T + H++SP PF S P +E L+ A F+ S + + + +
Sbjct: 57 SFTFNIHHLYSPAVRQILPFH-SFP---DEGTLDYYAAMVRTDHFVHSRRLGQVQDHRPL 112
Query: 83 VPIASGRQITQSPT---YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------ 133
++ + SP Y +GTP L+A+DT +D W+PC CV C +
Sbjct: 113 TFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQ 171
Query: 134 -----TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDT 186
+++ S+T K + C ++ C + + C + ++Y S ++ L +D
Sbjct: 172 GPVNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDI 231
Query: 187 ISLATDIVPG------YTFGCIQKATG---NSVPPQGLLGLGRGSLSLLAQTQN--LYQS 235
+ L T+ V T GC + +G +S P GL GLG ++S+ + N L +
Sbjct: 232 LHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISN 291
Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
+FS C + G + G G P + + TP RR Y V++ I VG + D+
Sbjct: 292 SFSLCFGPARM----GRIEFGDKGSPGQNE-TPFNLG-RRHPTYNVSITQIGVGGHISDL 345
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYS 354
I DSGT FT L PAY+ D F V T+ S F+ CY
Sbjct: 346 -----------DVAVIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYE 394
Query: 355 VPIVAPTITL------MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
+ T T M G + + +LI + + + CLA+A + +N+I
Sbjct: 395 LSPNQTTFTYPLMNLTMKGGGHFVINHPIVLISTESKRLFCLAIARS-----DSINIIGQ 449
Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
+ I++D LG CT
Sbjct: 450 NFMTGYHIVFDREKMVLGWKESNCT 474
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 104/428 (24%), Positives = 173/428 (40%), Gaps = 53/428 (12%)
Query: 13 FLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSPF-KPSKPLSWEESVLEMLAKDQARLQF 71
FLF L E + + ++ + H SP SPF PSK + E + + + +R+
Sbjct: 17 FLFQLLE----VALARGGGFSVDLIHRDSPHSPFFDPSK--TQAERLTDAFRRSVSRVGR 70
Query: 72 LSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC 131
A+ + R + + Y++ IGTP ++ +DT +D W C C C
Sbjct: 71 FRPTAMTSDGI----QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHC 126
Query: 132 SSTV---FNSAQSTTFKNLGCQAAQCKQV-PNPTCGG-GACAFNLTYGSSTI-AANLSQD 185
V F+ S+T+++ C + C + + +C C F +Y + NL+ +
Sbjct: 127 YKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASE 186
Query: 186 TISLATDI-----VPGYTFGCIQKATG-NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
T+++ + PG+ FGC + G G++GLG G LSL++Q ++ FSY
Sbjct: 187 TLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSY 246
Query: 240 C-LPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
C LP S S + G G R+ + P R L + + ++ G
Sbjct: 247 CLLPVSTDSSISSRINFGASG---RVSGYGTVSTPLR--------LPYKGYSKKTEVEEG 295
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVP 356
+ I+DSGT +T L Y+ + + G F CY +
Sbjct: 296 NI----------IVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAE 345
Query: 357 IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
I AP IT F NV L N + + C +A D + V+ N+ Q N +
Sbjct: 346 INAPIITAHFKDANVELQPLNTFMRMQE-DLVCFTVAPTSD-----IGVLGNLAQVNFLV 399
Query: 417 LYDVPNSR 424
+D+ R
Sbjct: 400 GFDLRKKR 407
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 100/378 (26%), Positives = 164/378 (43%), Gaps = 47/378 (12%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTT 143
+Q Y + K+GTP + L + +DT +D WV C C GC T F+ S+T
Sbjct: 72 SQVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSST 131
Query: 144 FKNLGCQAAQCK---QVPNPTCGG--GACAFNLTYGSSTIAANLSQDTI---------SL 189
+ C +C+ Q + +C G C + YG + + + +L
Sbjct: 132 SSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTL 191
Query: 190 ATDIVPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPS 243
T+ FGC TG+ + G+ G G+ +S+++Q +Q + FS+CL
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
S G L LG I +P I Y+PL+ + Y +NL +I V ++V I P F
Sbjct: 252 DN--SGGGVLVLGEIVEPN-IVYSPLVPSQPH---YNLNLQSISVNGQIVRIAPSV--FA 303
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYT----AVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA 359
+ GTI+DSGT L AY A+ V + V S L+ + T S +
Sbjct: 304 TSNNRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQCYLITTSSNVDIF 363
Query: 360 PTITLMFSGMN--VTLPQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P ++L F+G V PQD L+ + GS+ C+ + ++ ++ ++
Sbjct: 364 PQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQKIS---GQSITILGDLVLKDKI 420
Query: 416 ILYDVPNSRLGVARELCT 433
+YD+ R+G A C+
Sbjct: 421 FVYDLAGQRIGWANYDCS 438
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 155/368 (42%), Gaps = 37/368 (10%)
Query: 91 ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
I+ Y++ +GTP +L DT +D W C C C V F+ +S T+K L
Sbjct: 88 ISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTL 147
Query: 148 GCQAAQCKQVPNP-TCGG-GACAFNLTYGS-STIAANLSQDTISLATDI-----VPGYTF 199
C C+ + +C C ++ +YG S +LS DT+++ + PG F
Sbjct: 148 DCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAF 207
Query: 200 GCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP 257
GC G + GL+GLG G LSL+ Q + FSYCL P + S + G
Sbjct: 208 GCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGK 267
Query: 258 IG--QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP--------PGALQFNPTTG 307
G TPL+K + YY+ L + VG V P A++
Sbjct: 268 SGVVSGSGTVSTPLIKG-TPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVE-----E 321
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV--PIVAPTITLM 365
IIDSGT T L YT V +G T G F CYS + PTIT
Sbjct: 322 GNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYSSVNNLEIPTITAH 381
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
F+G +V LP N + + C +M + S L + N+ Q N + YD+ N+++
Sbjct: 382 FTGADVQLPPLNTFVQ-VQEDLVCFSMIPS-----SNLAIFGNLAQINFLVGYDLKNNKV 435
Query: 426 GVARELCT 433
+ CT
Sbjct: 436 SFKQTDCT 443
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 111/469 (23%), Positives = 176/469 (37%), Gaps = 112/469 (23%)
Query: 65 DQARLQFLSSLAVARKS----------------------VVPIASGRQITQSPTYIVRAK 102
DQ R F+SS A R + +P++SG T + Y VR +
Sbjct: 2 DQERTAFISSHARRRATEAGRAKPKPKAKAKAAPADEAFAMPLSSG-AYTGTGQYFVRFR 60
Query: 103 IGTPAQTLLMAMDTSNDAAWVPC-------------------------------TGCVGC 131
+GTPA+ L+ DT +D WV C +
Sbjct: 61 VGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSVSAAASS 120
Query: 132 SSTVFNSAQSTTFKNLGCQAAQC--------KQVPNPTCGGGACAFNLTYGSSTIA-ANL 182
+ VF +S T+ + C + C P P G CA+ Y + A +
Sbjct: 121 PARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTP---GSPCAYEYRYKDGSAARGTV 177
Query: 183 SQDTISLATD-----------IVPGYTFGCIQKATGNS-VPPQGLLGLGRGSLSLLAQTQ 230
D+ ++A + G GC TG S + G+L LG ++S ++
Sbjct: 178 GTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFASRAA 237
Query: 231 NLYQSTFSYCL------------------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKN 272
+ FSYCL P+ + S S + G P + TPLL +
Sbjct: 238 ARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPG-ARQTPLLLD 296
Query: 273 PRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
R Y V + + V ++ IP L ++ G G I+DSGT T LV+PAY AV
Sbjct: 297 HRMRPFYAVAVNGVSVDGELLRIP--RLVWDVQKGGGAILDSGTSLTVLVSPAYRAVVAA 354
Query: 333 FRRRVGSNLTVTSLGGFDTCYS---------VPIVAPTITLMFSGMNVTLPQDNLLIHST 383
+++ L ++ FD CY+ + + P + + F+G P +
Sbjct: 355 LGKKL-VGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDA 413
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
A + C+ + + ++VI N+ QQ H +D+ N RL R C
Sbjct: 414 APGVKCIGLQ---EGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 165/379 (43%), Gaps = 52/379 (13%)
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTF 144
Q Y + ++GTP + +DT +D WV C C GC T F+ S+T
Sbjct: 74 QVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTS 133
Query: 145 KNLGCQAAQC---KQVPNPTCG--GGACAFNLTYGSST------IAANLSQDTI---SLA 190
+ C +C KQ + TC C++ YG + ++ + +TI S+
Sbjct: 134 SMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMT 193
Query: 191 TDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSF 244
T+ FGC + TG+ G+ G G+ +S+++Q +Q + FS+CL
Sbjct: 194 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGD 253
Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
S G L LG I +P I YT L+ P Y +NL +I V + + I F
Sbjct: 254 S--SGGGILVLGEIVEPN-IVYTSLVPAQPH----YNLNLQSISVNGQTLQIDSSV--FA 304
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCY----SVPIV 358
+ GTI+DSGT L AY + ++ TV S G + CY SV V
Sbjct: 305 TSNSRGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRG--NQCYLITSSVTDV 362
Query: 359 APTITLMFSG--MNVTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNH 414
P ++L F+G + PQD L+ ++ G ++ C+ + ++ ++ ++
Sbjct: 363 FPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQ---GQGITILGDLVLKDK 419
Query: 415 RILYDVPNSRLGVARELCT 433
++YD+ R+G A C+
Sbjct: 420 IVVYDLAGQRIGWANYDCS 438
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 68/225 (30%), Positives = 111/225 (49%), Gaps = 22/225 (9%)
Query: 32 STLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSS---------LAVARKSV 82
S+L+V H+ CS +K + E+L +D+AR++ + S ++ A+ +
Sbjct: 63 SSLRVVHMHGACSHLSSNKDARLDHD--EILRRDEARVESIHSKLSKNIADEVSKAKSTK 120
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-CSSTV---FNS 138
+P +G I SP YIV IGTP + + DT +D W C C+G C S FN
Sbjct: 121 LPAKNG-IILGSPNYIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 179
Query: 139 AQSTTFKNLGCQAAQCKQVPNP-TCGGGACAFNLTYGSSTIAAN-LSQDTISLA-TDIVP 195
+ S+++ N+ C + C NP +C C + + YG ++ L+++ +L +D++
Sbjct: 180 SSSSSYHNVSCSSPMC---GNPESCSASNCLYGIGYGDGSVTVGFLAKEKFTLTNSDVLD 236
Query: 196 GYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
FGC + G + G+LGLG G S QT Y + FSYC
Sbjct: 237 DIYFGCGENNKGVFIGSAGILGLGPGKFSFPLQTTTTYNNIFSYC 281
>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 524
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 160/379 (42%), Gaps = 71/379 (18%)
Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCS-----STVFNSAQSTTFKNLGCQAAQCKQVPNPT 161
AQT MA+DT+ D W+ C C + +F+ +S + + C + C+ + N
Sbjct: 164 AQT--MAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFSAAAVPCGSRACRALGNYG 221
Query: 162 CG-------------------GGACAFNLTYGSSTIAANLSQD---TISLATDIVPGYTF 199
G G C + + Y +++ TIS T + + F
Sbjct: 222 NGCSNNSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSGTYMTDILTISPGTSFL-NFRF 280
Query: 200 GCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA---LSFSGSLRL 255
GC G+ S G + LG G SLL+QT Y + FSYC+P A LS G++
Sbjct: 281 GCSHGVRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAFSYCVPKPSASGFLSLGGAIND 340
Query: 256 GPIGQPKRIKY--TPLLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
G + TPL++N R + Y V L I V R +++PP GT+
Sbjct: 341 GDSDSDSPSSFVTTPLMRNARIVNPTYYVVRLQGIDVAGRRLNVPPVVFS------GGTL 394
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRR---------RVGSNLTVTSLGG---FDTCYSVP--- 356
+DS V T+L AY A+R FR R GS + T GG DTCY
Sbjct: 395 MDSSAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGST-SSTPAGGEMILDTCYDFEGLD 453
Query: 357 -IVAPTITLMFSGMNVT--LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ PT++L+F G V P +++ CLA P + + L I N+QQQ
Sbjct: 454 NVTVPTVSLVFFGGAVVDLDPTTAVMMEG------CLAFVPTPADFD--LGFIGNVQQQT 505
Query: 414 HRILYDVPNSRLGVARELC 432
H +LYDV +G R C
Sbjct: 506 HEVLYDVGARNVGFRRGAC 524
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 107/430 (24%), Positives = 179/430 (41%), Gaps = 55/430 (12%)
Query: 45 PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGRQITQSPTYI 98
P + + PL + E+ A+D+ R + L R+S V P+ Y
Sbjct: 43 PLQRAFPLDELVELSELRARDRVRHARIL-LGGGRQSSVGGVVDFPVQGSSDPYLVGLYF 101
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQ 150
+ K+G+P + +DT +D WV C+ C C + F++ S T ++ C
Sbjct: 102 TKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCS 161
Query: 151 AAQCKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTI--------SLATDIVPGY 197
C V T C ++ YG S + DT SL +
Sbjct: 162 DPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPI 221
Query: 198 TFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
FGC +G+ G+ G G+G LS+++Q ++ + FS+CL S G
Sbjct: 222 VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG--SGGG 279
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
LG I P + Y+PL+ + Y +NLL+I V ++ +P A F + GTI
Sbjct: 280 VFVLGEILVPGMV-YSPLVPSQPH---YNLNLLSIGVNGQM--LPLDAAVFEASNTRGTI 333
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFS 367
+D+GT T LV AY + V S L + + CY S+ + P+++L F+
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFA 392
Query: 368 GMNVTL--PQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G + PQD L + S+ C+ AP+ ++ ++ ++ +YD+
Sbjct: 393 GGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE----QTILGDLVLKDKVFVYDLARQ 448
Query: 424 RLGVARELCT 433
R+G A C+
Sbjct: 449 RIGWASYDCS 458
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 107/430 (24%), Positives = 187/430 (43%), Gaps = 58/430 (13%)
Query: 45 PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPT----YIVR 100
P + + PL+ + + + A+D+AR + V VV + Q T P Y +
Sbjct: 31 PLERAIPLNQQVELEALRARDRARHGRILQGVVG--GVVDFS--VQGTSDPYFVGLYFTK 86
Query: 101 AKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAA 152
K+G+PA+ + +DT +D W+ C C C + F++A S+T + C
Sbjct: 87 VKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCGDP 146
Query: 153 QCK---QVPNPTCGGGA--CAFNLTYGSST------IAANLSQDTISLATDIVPGYT--- 198
C Q C A C++ YG + ++ + DT+ L +V +
Sbjct: 147 ICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANSSSTI 206
Query: 199 -FGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
FGC +G+ G+ G G G+LS+++Q ++ + FS+CL + + G
Sbjct: 207 IFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE--NGGG 264
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
L LG I +P I Y+PL+ + Y +NL +I V ++ +P + F T GTI
Sbjct: 265 VLVLGEILEPS-IVYSPLVPSQPH---YNLNLQSIAVNGQL--LPIDSNVFATTNNQGTI 318
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMF- 366
+DSGT LV AY V S + + + CY SV + P ++L F
Sbjct: 319 VDSGTTLAYLVQEAYNPFVKAITAAV-SQFSKPIISKGNQCYLVSNSVGDIFPQVSLNFM 377
Query: 367 SGMNVTLPQDNLLIHS---TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G ++ L ++ L+H ++ C+ V ++ ++ ++ +YD+ N
Sbjct: 378 GGASMVLNPEHYLMHYGFLDGAAMWCIGF----QKVEQGFTILGDLVLKDKIFVYDLANQ 433
Query: 424 RLGVARELCT 433
R+G A C+
Sbjct: 434 RIGWADYDCS 443
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 164/376 (43%), Gaps = 54/376 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLG 148
Y R ++GTP + + +DT +D WV C C GC F+ S T +
Sbjct: 52 YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLIS 111
Query: 149 CQAAQCK---QVPNPTCGG--GACAFNLTYGSST------IAANLSQDTI---SLATDIV 194
C +C Q + C C +N YG + ++ L DT+ S+ +
Sbjct: 112 CSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSS 171
Query: 195 PGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
FGC TG+ G+ G G+ +S+++Q +Q + FS+CL S
Sbjct: 172 APIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDD--S 229
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G L LG I +P I YTPL+ + Y +N+ +I V + + I P F ++
Sbjct: 230 GGGILVLGEIVEPN-IVYTPLVPSQPH---YNLNMQSISVNGQTLAIDPSV--FGTSSSQ 283
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT---SLGGFDTCY----SVPIVAPT 361
GTIIDSGT L AY D F + S ++ + L + CY S+ + P
Sbjct: 284 GTIIDSGTTLAYLAEAAY----DPFISAITSIVSPSVRPYLSKGNHCYLISSSINDIFPQ 339
Query: 362 ITLMFSG--MNVTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
++L F+G + +PQD L+ S+ G ++ C+ + ++ ++ ++ +
Sbjct: 340 VSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQ---GQGITILGDLVLKDKIFV 396
Query: 418 YDVPNSRLGVARELCT 433
YD+ N R+G A C+
Sbjct: 397 YDIANQRIGWANYDCS 412
>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
Length = 492
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 87/323 (26%), Positives = 134/323 (41%), Gaps = 22/323 (6%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQA 151
T + Y++ +GTP Q + +D ++D W+ C+ C C + + + F
Sbjct: 92 TNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAF-LSF 150
Query: 152 AQCKQVPNPTCGGGACAFNLTYG---SSTIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
+ P CG ++ YG ++T A L+ D + AT G FGC G+
Sbjct: 151 HDTRAPTTPPCG-----YSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFGCAVATEGD 205
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK--RIKY 266
G++GLGRG LS ++Q Q FSY L A+ + +P+ R
Sbjct: 206 I---GGVIGLGRGELSPVSQLQ---IGRFSYYLAPDDAVDVGSFILFLDDAKPRTSRAVS 259
Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAY 326
TPL+ + SLYYV L IRV + IP G G ++ T L A AY
Sbjct: 260 TPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIPVTFLDAGAY 319
Query: 327 TAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMFSGMNV-TLPQDNLLIH 381
VR ++ S G D CY+ +A P++ L+F+G V L N
Sbjct: 320 KVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVMELEMGNYFYM 379
Query: 382 STAGSITCLAMAAAPDNVNSVLN 404
+ + CL + +P S+L
Sbjct: 380 DSTTGLECLTILPSPAGDGSLLG 402
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/407 (27%), Positives = 177/407 (43%), Gaps = 43/407 (10%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
EE VL +A +R Q L + V R Q YI IG+P Q +
Sbjct: 49 EERVLRAVAV--SRQQQQQRLMAGAEDDVSAQVHRATRQ---YIASYLIGSPPQRTEALI 103
Query: 115 DTSNDAAWVPC-TGCV--GCSST---VFNSAQSTTFKNLGC--QAAQCKQVPNPTCG-GG 165
DT +D W C T C+ C+ +N +QS+TF + C +A C CG G
Sbjct: 104 DTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPCADKAGFCAANGVHLCGLDG 163
Query: 166 ACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGNSVPPQGLLGLGRGS 222
+C F +YG+ + +L ++ + + FGC+ + +G GL+GLGRG
Sbjct: 164 SCTFIASYGAGRVIGSLGTESFAFESGTTS-LAFGCVSLTRITSGALNDASGLIGLGRGR 222
Query: 223 LSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKRIKYT-PLLKNPRR---SS 277
LSL++Q + + FSYCL P F + S L +G + P +K+P+ S+
Sbjct: 223 LSLVSQ---IGATRFSYCLTPYFHSSGASSHLFVGASASLGGGGASMPFVKSPKDYPYST 279
Query: 278 LYYVNLLAIRVGR-RVVDIPPGALQ----FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
YY+ L I VG+ R+ + Q F G IID+G+ T+L + AY A+++
Sbjct: 280 FYYLPLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEE 339
Query: 333 FRRRVGSNLTVTS--LGGFDTCYS---VPIVAPTITLMF-SGMNVTLPQDNLLIHSTAGS 386
++G+ V + G + C + V P + F G ++ +P + +
Sbjct: 340 VAAQLGNGSLVPAPEDSGLELCVAREGFQKVVPALVFHFGGGADMAVPAASYWAPVDKAA 399
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ + D ++I N QQQ+ +LYD+ R CT
Sbjct: 400 ACMMILEGGYD------SIIGNFQQQDMHLLYDLRRGRFSFQTADCT 440
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 109/437 (24%), Positives = 182/437 (41%), Gaps = 64/437 (14%)
Query: 45 PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV-------------PIASGRQI 91
P + + PL + E+ A+D+ R + L R+S V P G ++
Sbjct: 43 PLQRAFPLDELVELSELRARDRVRHARIL-LGGGRQSSVGGVVDFPVQGSSDPYLVGSKM 101
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTT 143
T Y + K+G+P + +DT +D WV C+ C C + F++ S T
Sbjct: 102 TM--LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLT 159
Query: 144 FKNLGCQAAQCKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTI--------SLA 190
++ C C V T C ++ YG S + DT SL
Sbjct: 160 AGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLV 219
Query: 191 TDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSF 244
+ FGC +G+ G+ G G+G LS+++Q ++ + FS+CL
Sbjct: 220 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 279
Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNP 304
S G LG I P + Y+PL+ + Y +NLL+I V ++ +P A F
Sbjct: 280 G--SGGGVFVLGEILVPGMV-YSPLVPSQPH---YNLNLLSIGVNGQM--LPLDAAVFEA 331
Query: 305 TTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAP 360
+ GTI+D+GT T LV AY + V S L + + CY S+ + P
Sbjct: 332 SNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFP 390
Query: 361 TITLMFSGMNVTL--PQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
+++L F+G + PQD L + S+ C+ AP+ ++ ++ ++
Sbjct: 391 SVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE----QTILGDLVLKDKVF 446
Query: 417 LYDVPNSRLGVARELCT 433
+YD+ R+G A C+
Sbjct: 447 VYDLARQRIGWASYDCS 463
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 119/459 (25%), Positives = 189/459 (41%), Gaps = 59/459 (12%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVLEMLA 63
L F L+ +FL G + + + S T ++ H SP SP F S+ + + + +
Sbjct: 11 LSFALSIIFLTVSMSGFS-LVQAEKLSFTTELIHRDSPNSPLFNASE--TTDIRLANAVE 67
Query: 64 KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
+ R+ + L + + A I + ++++ IG P LL+ + T +D W+
Sbjct: 68 RSADRVNRFNDLI---SNSITAAEFPSILDNGDFLMKISIGIPPTELLVNVATGSDLVWI 124
Query: 124 PCTG---CV-GCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLT--YGSST 177
PC C C F+ +S+T+KN+ C + +C+ TC C ++ + S
Sbjct: 125 PCLSFKPCTHNCDLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRHQDSC 184
Query: 178 IAANLSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNL 232
+L+ DT++L + ++P F C + G+ P G+LGLG GSLSLL + +L
Sbjct: 185 PDGDLAMDTLTLNSTTGKSFMLPNTGFICGNRIGGD-YPGVGILGLGHGSLSLLNRISHL 243
Query: 233 YQSTFSYCLPSFKA-----LSFSG----------SLRLGPIGQPKRIKYTPLLKNPRRSS 277
FS+C+ + + LSF S RL G P
Sbjct: 244 IDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYS-------------- 289
Query: 278 LYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRV 337
Y ++ I VG + I G + + G +DSGT+FT Y+ + R +
Sbjct: 290 -YTLSFYGISVGNK--SISAGGIGSDYYMN-GLGMDSGTMFTYFPEYFYSQLEYDVRYAI 345
Query: 338 GSN-LTVTSLGGFDTC--YSVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAA 394
L C YS PTIT+ F G +V L N I T I CLA A
Sbjct: 346 QQEPLYPDPTRRLRLCYRYSPDFSPPTITMHFEGGSVELSSSNSFIRMTE-DIVCLAFAT 404
Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ ++V QQ N I YD+ L + CT
Sbjct: 405 SSSEQDAVFGY---WQQTNLLIGYDLDAGFLSFLKTDCT 440
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 150/332 (45%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTP++T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +PG++FGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G++S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGAMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 107/429 (24%), Positives = 178/429 (41%), Gaps = 55/429 (12%)
Query: 45 PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVV------PIASGRQITQSPTYI 98
P + + PL + E+ A+D+ R + L R+S V P+ Y
Sbjct: 43 PLQRAFPLDELVELSELRARDRVRHARIL-LGGGRQSSVGGVVDFPVQGSSDPYLVGLYF 101
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQ 150
+ K+G+P + +DT +D WV C+ C C + F++ S T ++ C
Sbjct: 102 TKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCS 161
Query: 151 AAQCKQVPNPTCG----GGACAFNLTYGS-STIAANLSQDTI--------SLATDIVPGY 197
C V T C ++ YG S + DT SL +
Sbjct: 162 DPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPI 221
Query: 198 TFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSG 251
FGC +G+ G+ G G+G LS+++Q ++ + FS+CL S G
Sbjct: 222 VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG--SGGG 279
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
LG I P + Y+PL+ + Y +NLL+I V ++ +P A F + GTI
Sbjct: 280 VFVLGEILVPGMV-YSPLVPSQPH---YNLNLLSIGVNGQM--LPLDAAVFEASNTRGTI 333
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFS 367
+D+GT T LV AY + V S L + + CY S+ + P+++L F+
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFA 392
Query: 368 GMNVTL--PQDNLLIHS--TAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
G + PQD L + S+ C+ AP+ ++ ++ ++ +YD+
Sbjct: 393 GGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE----QTILGDLVLKDKVFVYDLARQ 448
Query: 424 RLGVARELC 432
R+G A C
Sbjct: 449 RIGWASYDC 457
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 151/332 (45%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTP++T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +PG++FGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G++S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGAMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + + +A AP S++
Sbjct: 289 DLGRGGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 112/445 (25%), Positives = 182/445 (40%), Gaps = 69/445 (15%)
Query: 31 SSTLQVFHVFSPCS----PFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVAR----KSV 82
S T + H++SP PF S P +E L+ A F+ S + + + +
Sbjct: 34 SFTFNIHHLYSPAVRQILPFH-SFP---DEGTLDYYAAMVRTDXFVHSRRLGQVQDHRPL 89
Query: 83 VPIASGRQITQSPT---YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------ 133
++ + SP Y +GTP L+A+DT +D W+PC CV C +
Sbjct: 90 TFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQ 148
Query: 134 -----TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDT 186
+++ S+T K + C ++ C + + C + ++Y S ++ L +D
Sbjct: 149 GPVNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDI 208
Query: 187 ISLATDIVPG------YTFGCIQKATG---NSVPPQGLLGLGRGSLSLLAQTQN--LYQS 235
+ L T+ V T GC + +G +S P GL GLG ++S+ + N L +
Sbjct: 209 LHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISN 268
Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
+FS C + G + G G P + + TP RR Y V++ I VG + D+
Sbjct: 269 SFSLCFGPARM----GRIEFGDKGSPGQNE-TPFNLG-RRHPTYNVSITQIGVGGHISDL 322
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYS 354
I DSGT FT L PAY+ D F V T+ S F+ CY
Sbjct: 323 -----------DVAVIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYE 371
Query: 355 VPIVAPTITL------MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
+ T T M G + + +LI + + + CLA+A + +N+I
Sbjct: 372 LSPNQTTFTYPLMNLTMKGGGHFVINHPIVLISTESKRLFCLAIARS-----DSINIIGQ 426
Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
+ I++D LG CT
Sbjct: 427 NFMTGYHIVFDREKMVLGWKESNCT 451
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 99/387 (25%), Positives = 161/387 (41%), Gaps = 51/387 (13%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
+P+ +++ Y + IGTP++ + +DT +D WV C GC +G T
Sbjct: 141 LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLT 200
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISL- 189
+++ STT +GC C P G G C +++ YG S+ QD +
Sbjct: 201 LYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYN 260
Query: 190 -------ATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQST 236
T FGC K +G S G+LG G+ + S+L+Q + +
Sbjct: 261 RISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKV 320
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
FS+CL + G +G + +PK + TPL++N + Y V + I VG +D+P
Sbjct: 321 FSHCLDNVDG---GGIFAIGEVVEPK-VNITPLVQN---QAHYNVVMKEIEVGGDPLDVP 373
Query: 297 PGALQFNPTTGAGTIIDSGT--------VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
A F GTIIDSGT V+ L+ + D+ V T
Sbjct: 374 SDA--FESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFTC----- 426
Query: 349 FDTCYSVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
FD +V PT+TL F +++T+ P + L H I A + L ++
Sbjct: 427 FDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGAQTKDGKD-LTLL 485
Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
++ N ++YD+ +G C+
Sbjct: 486 GDLVLSNKLVVYDLEKQGIGWVEYNCS 512
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 166/386 (43%), Gaps = 51/386 (13%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-------SSTV 135
+P+ Q Y + +GTP++ + +DT +D WV C GC+ C T
Sbjct: 71 IPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP 130
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPT-CGGGA-CAFNLTYGS-STIAANLSQDTISLATD 192
++ S+T K++ C C V + C G+ C + + YG S+ L +D + L D
Sbjct: 131 YDVDASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHL--D 188
Query: 193 IVPG----------YTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQ--TQNLYQST 236
+V G FGC K +G Q G++G G+ + S ++Q +Q + +
Sbjct: 189 LVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRS 248
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
F++CL + G +G + PK +K TP+L +S+ Y VNL AI VG V+++
Sbjct: 249 FAHCLDNNNG---GGIFAIGEVVSPK-VKTTPMLS---KSAHYSVNLNAIEVGNSVLELS 301
Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYT-AVRDVFRRRVGSNLTVTSLGGFDTCYSV 355
A F+ G IIDSGT L Y + ++ LT+ ++ TC+
Sbjct: 302 SNA--FDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASH--PELTLHTVQESFTCFHY 357
Query: 356 PIVA---PTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNVIA 407
PT+T F +V+L P++ L C + L ++
Sbjct: 358 TDKLDRFPTVTFQFD-KSVSLAVYPREYLF--QVREDTWCFGWQNGGLQTKGGASLTILG 414
Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
+M N ++YD+ N +G C+
Sbjct: 415 DMALSNKLVVYDIENQVIGWTNHNCS 440
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 153/355 (43%), Gaps = 55/355 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTF 144
+ K+GTP ++A+DT +D WVPC C C+ T ++N STT
Sbjct: 107 HYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKVSTTN 165
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATDI-----VPGY 197
K + C + C Q C + ++Y S+ + + L +D + L T+ V Y
Sbjct: 166 KKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAY 225
Query: 198 -TFGCIQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSG 251
TFGC Q +G + P GL GLG +S+ + + L +FS C G
Sbjct: 226 VTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCF----GHDGVG 281
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G G + + TP NP + Y + + +RVG ++D AL
Sbjct: 282 RISFGDKGSSDQ-EETPFNLNPSHPN-YNITVTRVRVGTTLIDDEFTAL----------- 328
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCYSVPIVA-----PTITLM 365
D+GT FT LV P YT V + F + + S F+ CY + A P+++L
Sbjct: 329 FDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLIPSLSLT 388
Query: 366 FSGMNVTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
G + D +++ ST G + CLA+ + S LN+I +R+++D
Sbjct: 389 MKGNSHFTINDPIIVISTEGELVYCLAIVKS-----SELNIIGQNYMTGYRVVFD 438
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 115/447 (25%), Positives = 180/447 (40%), Gaps = 97/447 (21%)
Query: 48 PSKPLSWEESVLEMLAKDQAR---LQFLSSLAVARKS--------------VVPIASGRQ 90
P P + E + +LA D+AR LQ + A + VP+ SG +
Sbjct: 38 PDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAAGAEVPLTSGIR 97
Query: 91 ITQSPTYIVRAKIGTPAQ------TLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQS 141
Q+ Y+ +G L + +DT +D WV PC+ C +F+ + S
Sbjct: 98 F-QTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGS 156
Query: 142 TTFKNLGCQAAQCKQVPNPTCG-GGACA---------------FNLTYGSSTIAAN-LSQ 184
++ + C A+ C+ G G+CA ++L YG + + L+
Sbjct: 157 ASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLAT 216
Query: 185 DTISLATDIVPGYTFGCIQKATGNSVP----------PQGLLGLGRGSLSLLAQTQNLYQ 234
DT++L V G+ FGC G P P G G GSLSL T +
Sbjct: 217 DTVALGGASVDGFVFGCGLSNRGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRN 276
Query: 235 STFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVD 294
+T + YT ++ +P + Y++N+ VG V
Sbjct: 277 AT--------------------------PVSYTRMIADPAQPPFYFMNVTGASVGGAAV- 309
Query: 295 IPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS--LGGFDTC 352
A ++DSGTV TRL Y AVR F R+ G+ + D C
Sbjct: 310 ------AAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDAC 363
Query: 353 YSV----PIVAPTITL-MFSGMNVTLPQDNLLIHSTA-GSITCLAMAAAPDNVNSVLNVI 406
Y++ + P +TL + +G ++T+ +L + GS CLAMA+ + +I
Sbjct: 364 YNLTGHDEVKVPLLTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASL--SFEDQTPII 421
Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
N QQ+N R++YD SRLG A E C+
Sbjct: 422 GNYQQKNKRVVYDTVGSRLGFADEDCS 448
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 152/390 (38%), Gaps = 73/390 (18%)
Query: 114 MDTSNDAAWVPCTG-----CVG--------------------CSSTVFNSAQSTTFKNLG 148
+DT +D W PC C G C+S + ++A ++ +
Sbjct: 109 LDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRRIPCASPLCSAAHASAPPSDL 168
Query: 149 CQAAQC--KQVPNPTCGGG-ACA-FNLTYGSSTIAANLSQDTISLATDI-------VPGY 197
C AA+C + + +CG AC YG ++ A+L + ++L V +
Sbjct: 169 CAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAVAVDNF 228
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL--PSFKA--------- 246
TF C A G P G+ G GRG LSL Q FSYCL SF+A
Sbjct: 229 TFACAHTALGE---PVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLIRPSPL 285
Query: 247 -LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
L S + YTPLL NP+ Y V L A+ VG + P + +
Sbjct: 286 ILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPELARVDRA 345
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-----VTSLGGFDTCYSVPIV-- 358
G ++DSGT FT L Y V + F R + + G CY
Sbjct: 346 GNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRYAASDR 405
Query: 359 -APTITLMFSG-MNVTLPQDNLLI-----HSTAGS----ITCLAMA----AAPDNVNSVL 403
P + L F G V LP+ N + + AG+ + CL + A+ + +
Sbjct: 406 GVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPA 465
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ N QQQ ++YDV R+G AR CT
Sbjct: 466 GTLGNFQQQGFEVVYDVDAGRVGFARRRCT 495
>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
gi|194703714|gb|ACF85941.1| unknown [Zea mays]
Length = 208
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 79/223 (35%), Positives = 109/223 (48%), Gaps = 21/223 (9%)
Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKY--TPLLKNP 273
+GLG G+ SL++QT FSYCLP S SG L LG G + TP+L++
Sbjct: 1 MGLGGGAQSLVSQTAGTLGRAFSYCLP--PTPSSSGFLTLGAAGGSGTSGFVKTPMLRSS 58
Query: 274 RRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF 333
+ + Y V L AIRVG R + IP AGT++DSGTV TRL AY+A+ F
Sbjct: 59 QVPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAF 112
Query: 334 RRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITC 389
+ + G DTC+ + P++ L+FSG V + +I S C
Sbjct: 113 KAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIILS-----NC 167
Query: 390 LAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
LA A D +S L +I N+QQ+ +LYDV +G C
Sbjct: 168 LAFAGNSD--DSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 159/364 (43%), Gaps = 47/364 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C S F S T++ + C Q
Sbjct: 93 YTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKC-TWQ 151
Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATGN- 208
C N C + Y ST + L +D +S T++ P FGC TG+
Sbjct: 152 C----NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGCENDETGDI 207
Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
+ G++GLGRG LS++ Q + + +FS C G++ LG I P +
Sbjct: 208 YNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCY--GGMGVGGGAMVLGGISPPADMV 265
Query: 266 YTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRL 321
+T +P RS Y ++L I V G+R L NP GT++DSGT + L
Sbjct: 266 FT--RSDPVRSPYYNIDLKEIHVAGKR--------LHLNPKVFDGKHGTVLDSGTTYAYL 315
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGF--DTCYS--------VPIVAPTITLMF-SGMN 370
A+ A + + S ++ D C+S + P + ++F +G
Sbjct: 316 PESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVEMVFGNGHK 375
Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
++L P++ L HS CL + + N N ++ + +N ++YD ++++G +
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFS---NGNDPTTLLGGIVVRNTLVMYDREHTKIGFWK 432
Query: 430 ELCT 433
C+
Sbjct: 433 TNCS 436
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 145/327 (44%), Gaps = 27/327 (8%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +PG+TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP F +G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRKGVVFDSGSELSY 233
Query: 321 LVAPAYTAVRDVFRR---RVGSNLTVTSLGGFDTCYSVPIVAPTITLMF-SGMNVTLPQD 376
+ A + +R R + G+ + +D P I+L F G L
Sbjct: 234 IPDRALSVLRQRIRELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARFDLGSH 293
Query: 377 NLLIHSTAGSITCLAMAAAPDNVNSVL 403
+ + + +A AP S++
Sbjct: 294 GVFVERSVQEQDVWCLAFAPTKSVSII 320
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 107/427 (25%), Positives = 166/427 (38%), Gaps = 52/427 (12%)
Query: 24 ICDTQDHSSTLQVFHVFSPCSPFKPSKPLSWEE--SVLEMLAKDQARLQFLSSLAVARKS 81
+ TQ+H +++ H S SPF K + S+L L + S + +
Sbjct: 19 LTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVRYLNHVFSFSPNKIQ 78
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNS 138
VP++S Y++ IGTP L +DT ND W PC C+ +S +F+
Sbjct: 79 DVPLSS----FMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHP 134
Query: 139 AQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYT 198
++S+T+K + C + CK G N G+ N+
Sbjct: 135 SKSSTYKTIPCTSPICKNADGHYLGVDTLTLNSNNGTPISFKNI---------------V 179
Query: 199 FGCIQKATGNSVPPQGL----LGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSL 253
GC + G P +G +GL RG LS ++Q + FSYCL P F + S L
Sbjct: 180 IGCGHRNQG---PLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKL 236
Query: 254 RLGPIGQPKRIKY--TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
G + TP+ + + Y+V+L A VG ++ + N +I
Sbjct: 237 HFGDKSTVSGLGTVSTPI----KEENGYFVSLEAFSVGDHIIKLE------NSDNRGNSI 286
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY---SVPIVAPT--ITLMF 366
IDSGT T L Y+ + V V F+ CY S ++ IT F
Sbjct: 287 IDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHF 346
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
SG V L N + + C A + + S L + N+ QQN + +D+ +
Sbjct: 347 SGSEVHLNALNTF-YPITDEVICFAFVSGGN--FSSLAIFGNVVQQNFLVGFDLNKKTIS 403
Query: 427 VARELCT 433
CT
Sbjct: 404 FKPTDCT 410
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 152/350 (43%), Gaps = 55/350 (15%)
Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTFKNLGC 149
K+GTP ++A+DT +D WVPC C C+ T ++N STT K + C
Sbjct: 110 KLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKISTTNKKVTC 168
Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATDI-----VPGY-TFGC 201
+ C Q C + ++Y S+ + + L +D + L T+ V Y TFGC
Sbjct: 169 NNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGC 228
Query: 202 IQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
Q +G + P GL GLG +S+ + + L +FS C G + G
Sbjct: 229 GQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCF----GHDGVGRISFG 284
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
G + + TP NP + Y + + +RVG ++D AL D+GT
Sbjct: 285 DKGSSDQ-EETPFNLNPSHPN-YNITVTRVRVGTTLIDDEFTAL-----------FDTGT 331
Query: 317 VFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCYSVPIVA-----PTITLMFSGMN 370
FT LV P YT V + F + + S F+ CY + A P+++L G +
Sbjct: 332 SFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNS 391
Query: 371 VTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
D +++ ST G + CLA+ + S LN+I +R+++D
Sbjct: 392 HFTINDPIIVISTEGELVYCLAIVKS-----SELNIIGQNYMTGYRVVFD 436
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 104/393 (26%), Positives = 149/393 (37%), Gaps = 79/393 (20%)
Query: 114 MDTSNDAAWVPCTG-----CVG--------------------------CSSTVFNSAQST 142
+DT +D W PC C G C+S + ++A ++
Sbjct: 113 LDTGSDLVWFPCAPFTCMLCEGKPTPSGGHSSSAPLPLPPPPDSRRVPCASPLCSAAHAS 172
Query: 143 TFKNLGCQAAQC--KQVPNPTCGGGACA---FNLTYGSSTIAANLSQDTISLATDI-VPG 196
+ C AA C + + +C G + A YG ++ A+L + + L + V
Sbjct: 173 APPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGSLVAHLRRGRVGLGASVAVDN 232
Query: 197 YTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL--PSFKALSFSGSLR 254
+TF C A G P G+ G GRG LSL Q FSYCL SF+A +R
Sbjct: 233 FTFACAHTALGE---PVGVAGFGRGPLSLPGQLAPQLSGRFSYCLVSHSFRADRL---IR 286
Query: 255 LGPI---------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
P+ + YTPLL NP+ Y V L A+ VG + P + +
Sbjct: 287 PSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARVDRA 346
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-----VTSLGGFDTCYSVPIV-- 358
G ++DSGT FT L Y V + F R + + G CY
Sbjct: 347 GNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTGLTPCYHYAASDR 406
Query: 359 -APTITLMFSG-MNVTLPQDNLLI----HSTAG------SITCLAMAAAPD------NVN 400
P + L F G V LP+ N + AG + CL + D +
Sbjct: 407 GVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMNGGDVSGEDGGDD 466
Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ N QQQ ++YDV R+G AR CT
Sbjct: 467 GPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 499
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 149/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + A+WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 158/360 (43%), Gaps = 41/360 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F S+T++ + C
Sbjct: 84 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC-TID 142
Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN- 208
C N C + Y ST + L +D IS +++ P FGC TG+
Sbjct: 143 C----NCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGCENVETGDL 198
Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
S G++GLGRG LS++ Q +N+ +FS C G++ LG I P +
Sbjct: 199 YSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDV--GGGAMVLGGISPPSDMA 256
Query: 266 YTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
+ +P RS Y ++L I V G+R +P A F+ GT++DSGT + L
Sbjct: 257 FA--YSDPVRSPYYNIDLKEIHVAGKR---LPLNANVFD--GKHGTVLDSGTTYAYLPEA 309
Query: 325 AYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVPIVA--------PTITLMF-SGMNVTL 373
A+ A +D + + S ++ D C+S + P + ++F +G TL
Sbjct: 310 AFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKYTL 369
Query: 374 -PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
P++ + HS CL + N N ++ + +N ++YD +++G + C
Sbjct: 370 SPENYMFRHSKVRGAYCLGVF---QNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNC 426
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 149/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +PG++FGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ +R + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLKRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 156/398 (39%), Gaps = 73/398 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC----------VGC---SSTVFNSAQSTT 143
YI IG P Q +DT +D W C+ C GC + +N + S T
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 144 FKNLGCQ---AAQCKQVPNPT-C--GGG----ACAFNLTYGSSTIAANLSQDTISLATDI 193
+ + C A C P C GGG AC +YG+ L D + +
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGVALGVLGTDAFTFPSSS 197
Query: 194 VPGYTFGCIQK---ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSF 249
FGC+ + + G G++GLGRG+LSL++Q L + FSYCL P F+
Sbjct: 198 SVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQ---LNATEFSYCLTPYFRDTVS 254
Query: 250 SGSLRLGPIGQPKR--------------IKYTPLLKNPRR---SSLYYVNLLAIRVGRRV 292
L +G G+ + P KNP+ S+ YY+ L+ + G
Sbjct: 255 PSHLFVGD-GELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNAT 313
Query: 293 VDIPPGALQFNPTT----GAGTIIDSGTVFTRLVAPAYTAVRDVFRR------------- 335
V +P GA G +IDSG+ FTRLV PA+ A+ R
Sbjct: 314 VALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPA 373
Query: 336 RVGSNLTVTSLGGFDTCYSVPIVAPTITLMFS-----GMNVTLPQDNLLIHSTAGSITCL 390
++G L + G D P + L F G + +P + A +
Sbjct: 374 KLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMA 433
Query: 391 AMAAAPDNVNSVLN---VIANMQQQNHRILYDVPNSRL 425
+++A N N +I N QQ+ R+LYD+ N L
Sbjct: 434 VVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLL 471
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 102/433 (23%), Positives = 182/433 (42%), Gaps = 43/433 (9%)
Query: 6 VFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAK 64
+ F + F+ S S L +S + ++ H S SP +KP++ +
Sbjct: 9 LLFFSLCFIISFSHSLR-------NSFSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSI 61
Query: 65 DQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVP 124
++A F SL+ +S V + G Y++ +GTP + +DT +D W+
Sbjct: 62 NRANRLFKDSLSNTPESTVYVNGGE-------YLMTYSVGTPPFNVYGVVDTGSDIVWLQ 114
Query: 125 CTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAFNLTYGSSTIA- 179
C C C ++ +FN ++S+++KN+ C + C+ V +C +C + + + + +
Sbjct: 115 CKPCEQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQ 174
Query: 180 ANLSQDTISLATDI-----VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY 233
LS +T++L + P GC G G++GLG G +SL Q ++
Sbjct: 175 GELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSI 234
Query: 234 QSTFSYC-LPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
FSYC LP + + L G + + TP +K + + YY+ L A VG
Sbjct: 235 GGKFSYCLLPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQ-AFYYLTLEAFSVGN 293
Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
+ ++ + + I+DSGT T L + YT + + V + +
Sbjct: 294 KRIEFEV----LDDSEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLN 349
Query: 351 TCYSV---PIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIA 407
CYS+ P IT F G ++ L + H A + CLA ++ +
Sbjct: 350 LCYSITSDQYDFPIITAHFKGADIKLNPISTFAH-VADGVVCLAFTSSQTGP-----IFG 403
Query: 408 NMQQQNHRILYDV 420
N+ Q N + YD+
Sbjct: 404 NLAQLNLLVGYDL 416
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 154/357 (43%), Gaps = 33/357 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
++V +G P L+ +DT +D WV C C C S+ +F+ ++S+T+ +L +
Sbjct: 91 FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 150
Query: 154 CKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATD-----IVPGYTFGCIQKAT 206
C P C +N +Y ST + NL+ + I T V FGC
Sbjct: 151 CPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 210
Query: 207 GNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIGQPKRI 264
G Q G+LGL G S++++ S FSYC+ F L LG G
Sbjct: 211 GRFDGQQSGILGLSAGDQSIVSRLG----SRFSYCIGDLFDPHYTHNQLVLGD-GVKMEG 265
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
TP + YYV L I VG +DI P Q + G ++DSGT T L
Sbjct: 266 SSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKD 322
Query: 325 AYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSVPIVA-----PTITLMFS-GMNVTLPQ 375
+ + + +R V + + ++ G+ CY + P + F+ G ++ L
Sbjct: 323 GFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGADLVLDA 381
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++L + + CLA+ + N+ ++ +VI M QQ++ + YD+ R+ R C
Sbjct: 382 NSLFVQKNQ-DVFCLAVLES--NLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 154/357 (43%), Gaps = 33/357 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
++V +G P L+ +DT +D WV C C C S+ +F+ ++S+T+ +L +
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118
Query: 154 CKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATD-----IVPGYTFGCIQKAT 206
C P C +N +Y ST + NL+ + I T V FGC
Sbjct: 119 CPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178
Query: 207 GNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIGQPKRI 264
G Q G+LGL G S++++ S FSYC+ F L LG G
Sbjct: 179 GRFDGQQSGILGLSAGDQSIVSRLG----SRFSYCIGDLFDPHYTHNQLVLGD-GVKMEG 233
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
TP + YYV L I VG +DI P Q + G ++DSGT T L
Sbjct: 234 SSTPFHT---FNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKD 290
Query: 325 AYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSVPIVA-----PTITLMFS-GMNVTLPQ 375
+ + + +R V + + ++ G+ CY + P + F+ G ++ L
Sbjct: 291 GFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGADLVLDA 349
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++L + + CLA+ + N+ ++ +VI M QQ++ + YD+ R+ R C
Sbjct: 350 NSLFVQKNQ-DVFCLAVLES--NLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 65/222 (29%), Positives = 106/222 (47%), Gaps = 24/222 (10%)
Query: 28 QDHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAV---------- 77
D ++L+V H PCS K S S +ML +D++R+ + S
Sbjct: 62 DDKRASLEVIHKHGPCSKLSQDKGRS--PSRTQMLDQDESRVNSIRSRLAKNPADGGKLK 119
Query: 78 ARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-C---SS 133
K +P SG I + Y+V +GTP + L DT +D W C C C
Sbjct: 120 GSKVTLPSKSGSTIG-TGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQE 178
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPN-----PTCGGGACAFNLTYGSSTIAANL-SQDTI 187
+FN ++ST++ N+ C + C ++ + P+C C + + YG + + +QD +
Sbjct: 179 PIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKL 238
Query: 188 SL-ATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
+L +TD+ + FGC Q G V GL+GLGR +LSL+++
Sbjct: 239 ALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLMSK 280
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 160/372 (43%), Gaps = 43/372 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R K+G PA+ + +DT +D WV C+ C GC ++ FN S+T +
Sbjct: 5 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64
Query: 149 CQAAQCK---QVPNPTC-----GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT- 198
C +C Q C C + TYG S + DT+ T + T
Sbjct: 65 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 124
Query: 199 -------FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFK 245
FGC +G+ G+ G G+ LS+++Q +L S FS+CL
Sbjct: 125 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG-- 182
Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
+ + G L LG I +P + YTPL+ + Y +NL +I V + +P + F +
Sbjct: 183 SDNGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIAVNGQ--KLPIDSSLFTTS 236
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTI 362
GTI+DSGT L AY V ++ ++ S G F T SV PT+
Sbjct: 237 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTV 296
Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
TL F G+ +++ +N L+ + + L N + ++ ++ ++ +YD+
Sbjct: 297 TLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 356
Query: 422 NSRLGVARELCT 433
N R+G A C+
Sbjct: 357 NMRMGWADYDCS 368
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 165/374 (44%), Gaps = 52/374 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y + K+G+P + + +DT +D WV C C C T F+S+ S+T +
Sbjct: 66 YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVR 125
Query: 149 CQAAQCKQVPNPT---CGG--GACAFNLTYGSST------IAANLSQDTI---SLATDIV 194
C C T C C++ YG + ++ L D I SL +
Sbjct: 126 CSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSS 185
Query: 195 PGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
FGC +G+ G+ G G+G LS+++Q T+ + FS+CL S
Sbjct: 186 ALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDG--S 243
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G L LG I +P I Y+PL+ + Y +NLL+I V +++ I P A F +
Sbjct: 244 GGGILVLGEILEPG-IVYSPLVPSQPH---YNLNLLSIAVNGQLLPIDPAA--FATSNSQ 297
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPIVAPTIT 363
GTI+DSGT LVA AY V ++T +TS G + CY SV + P +
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSKG--NQCYLVSTSVSQMFPLAS 355
Query: 364 LMFSG--MNVTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
F+G V P+D L+ ++G ++ C+ + ++ ++ ++ +YD
Sbjct: 356 FNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQG-----VTILGDLVLKDKIFVYD 410
Query: 420 VPNSRLGVARELCT 433
+ R+G A C+
Sbjct: 411 LVRQRIGWANYDCS 424
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 154/357 (43%), Gaps = 33/357 (9%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
++V +G P L+ +DT +D WV C C C S+ +F+ ++S+T+ +L +
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118
Query: 154 CKQVPNPTCGG-GACAFNLTYGS-STIAANLSQDTISLATD-----IVPGYTFGCIQKAT 206
C P C +N +Y ST + NL+ + I T V FGC
Sbjct: 119 CPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178
Query: 207 GNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPS-FKALSFSGSLRLGPIGQPKRI 264
G Q G+LGL G S++++ S FSYC+ F L LG G
Sbjct: 179 GRFDGQQSGILGLSAGDQSIVSRLG----SRFSYCIGDLFDPHYTHNQLVLGD-GVKMEG 233
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
TP + YYV L I VG +DI P Q + G ++DSGT T L
Sbjct: 234 SSTPFHT---FNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKD 290
Query: 325 AYTAVRDVFRRRVGSN---LTVTSLGGFDTCYSVPIVA-----PTITLMFS-GMNVTLPQ 375
+ + + +R V + + ++ G+ CY + P + F+ G ++ L
Sbjct: 291 GFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGADLVLDA 349
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
++L + + CLA+ + N+ ++ +VI M QQ++ + YD+ R+ R C
Sbjct: 350 NSLFVQKNQ-DVFCLAVLES--NLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 103/210 (49%), Gaps = 18/210 (8%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y + IGTP T + DT + W PCT C + F A S+TF L C ++
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149
Query: 154 CKQVPNP--TCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGC-IQKATGNSV 210
C+ + +P TC C + YG A L+ +T+ + PG TFGC + GNS
Sbjct: 150 CQFLTSPYRTCNATGCVYYYPYGMGFTAGYLATETLHVGGASFPGVTFGCSTENGVGNS- 208
Query: 211 PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ--PKRIKYTP 268
G++GLGR LSL++Q + FSYCL S A + + G + + ++ TP
Sbjct: 209 -SSGIVGLGRSPLSLVSQVG---VARFSYCLRS-NADAGDSPILFGSLAKVTGGNVQSTP 263
Query: 269 LLKNPR--RSSLYYVNLLAIRVGRRVVDIP 296
LL+NP SS YYVNL I VG D+P
Sbjct: 264 LLENPEMPSSSYYYVNLTGITVG--ATDLP 291
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 164/385 (42%), Gaps = 49/385 (12%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC-------SSTV 135
+P+ Q Y + +GTP++ + +DT +D WV C GC+ C T
Sbjct: 71 LPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP 130
Query: 136 FNSAQSTTFKNLGCQAAQCKQVPNPT-CGGGA-CAFNLTYGS-STIAANLSQDTISLATD 192
+++ S+T K++ C C V + C G+ C + + YG S+ L +D + L D
Sbjct: 131 YDADASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHL--D 188
Query: 193 IVPG----------YTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQ--TQNLYQST 236
+V G FGC K +G Q G++G G+ + S ++Q +Q + +
Sbjct: 189 LVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRS 248
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
F++CL + G +G + PK +K TP+L +S+ Y VNL AI VG V+ +
Sbjct: 249 FAHCLDNNNG---GGIFAIGEVVSPK-VKTTPMLS---KSAHYSVNLNAIEVGNSVLQLS 301
Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP 356
A F+ G IIDSGT L Y + + L + ++ TC+
Sbjct: 302 SDA--FDSGDDKGVIIDSGTTLVYLPDAVYNPLMNQILAS-HQELNLHTVQDSFTCFHYI 358
Query: 357 IVA---PTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNVIAN 408
PT+T F +V+L PQ+ L C + L ++ +
Sbjct: 359 DRLDRFPTVTFQFD-KSVSLAVYPQEYLF--QVREDTWCFGWQNGGLQTKGGASLTILGD 415
Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
M N ++YD+ N +G C+
Sbjct: 416 MALSNKLVVYDIENQVIGWTNHNCS 440
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 167/383 (43%), Gaps = 60/383 (15%)
Query: 93 QSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTF 144
Q Y + ++GTP + +DT +D WV C C GC T F+ S+T
Sbjct: 71 QVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTS 130
Query: 145 KNLGCQAAQCK---QVPNPTCG--GGACAFNLTYGSST------IAANLSQDTI---SLA 190
+ C +C Q + TC C++ YG + ++ + +TI S+
Sbjct: 131 SMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVT 190
Query: 191 TDIVPGYTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSF 244
T+ FGC + TG+ G+ G G+ +S+++Q +Q + FS+CL
Sbjct: 191 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGD 250
Query: 245 KALSFSGSLRLGPIGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
S G L LG I +P I YT L+ P Y +NL +I V + + I F
Sbjct: 251 S--SGGGILVLGEIVEPN-IVYTSLVPAQPH----YNLNLQSIAVNGQTLQIDSSV--FA 301
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-----TVTSLGGFDTCY----S 354
+ GTI+DSGT L AY D F + +++ TV S G + CY S
Sbjct: 302 TSNSRGTIVDSGTTLAYLAEEAY----DPFVSAITASIPQSVHTVVSRG--NQCYLITSS 355
Query: 355 VPIVAPTITLMFSG--MNVTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQ 410
V V P ++L F+G + PQD L+ ++ G ++ C+ + ++ ++
Sbjct: 356 VTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQ---GQGITILGDLV 412
Query: 411 QQNHRILYDVPNSRLGVARELCT 433
++ ++YD+ R+G A C+
Sbjct: 413 LKDKIVVYDLAGQRIGWANYDCS 435
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 160/372 (43%), Gaps = 43/372 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R K+G PA+ + +DT +D WV C+ C GC ++ FN S+T +
Sbjct: 89 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 148
Query: 149 CQAAQCK---QVPNPTC-----GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT- 198
C +C Q C C + TYG S + DT+ T + T
Sbjct: 149 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 208
Query: 199 -------FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFK 245
FGC +G+ G+ G G+ LS+++Q +L S FS+CL
Sbjct: 209 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG-- 266
Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
+ + G L LG I +P + YTPL+ + Y +NL +I V + +P + F +
Sbjct: 267 SDNGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIAVNGQ--KLPIDSSLFTTS 320
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTI 362
GTI+DSGT L AY V ++ ++ S G F T SV PT+
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTV 380
Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
TL F G+ +++ +N L+ + + L N + ++ ++ ++ +YD+
Sbjct: 381 TLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 440
Query: 422 NSRLGVARELCT 433
N R+G A C+
Sbjct: 441 NMRMGWADYDCS 452
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 160/372 (43%), Gaps = 43/372 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R K+G PA+ + +DT +D WV C+ C GC ++ FN S+T +
Sbjct: 91 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 150
Query: 149 CQAAQCK---QVPNPTC-----GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT- 198
C +C Q C C + TYG S + DT+ T + T
Sbjct: 151 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 210
Query: 199 -------FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFK 245
FGC +G+ G+ G G+ LS+++Q +L S FS+CL
Sbjct: 211 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG-- 268
Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
+ + G L LG I +P + YTPL+ + Y +NL +I V + +P + F +
Sbjct: 269 SDNGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIAVNGQ--KLPIDSSLFTTS 322
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTI 362
GTI+DSGT L AY V ++ ++ S G F T SV PT+
Sbjct: 323 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTV 382
Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
TL F G+ +++ +N L+ + + L N + ++ ++ ++ +YD+
Sbjct: 383 TLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 442
Query: 422 NSRLGVARELCT 433
N R+G A C+
Sbjct: 443 NMRMGWADYDCS 454
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 159/349 (45%), Gaps = 39/349 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y++ +GTP + MDT ++ W+ PC C +S +FN ++S+++KN+ C ++
Sbjct: 89 YLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTSST 148
Query: 154 CKQVPNP--TC--GGGACAFNLTYGSSTIA-ANLSQDTISL-----ATDIVPGYTFGC-- 201
CK + +C GG C +++TYG + +LS D+++L ++ + P GC
Sbjct: 149 CKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGH 208
Query: 202 IQKATGNSVPPQGLLGLGRGSLSLLAQT-QNLYQSTFSYCLPSFKALSFSGS-LRLGP-- 257
I NS G++G+GRG +SL+ Q + S FSYCL + + S S S L G
Sbjct: 209 INVLQDNS-QSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLIFGEDV 267
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGT---IIDS 314
+ + + TP++K + + Y++ L A VG +++ + A T +IDS
Sbjct: 268 VVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNN-------RIEYGERSNASTQNILIDS 320
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV---PIVAPTITLMFSGMNV 371
GT T L + + + V CY+ + P IT F+G +V
Sbjct: 321 GTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTGKQLNVPDITAHFNGADV 380
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
L + G I C ++ + L + N+ Q N I YD+
Sbjct: 381 KLNSNGTFFPFEDG-IMCFGFISS-----NGLEIFGNIAQNNLLIDYDL 423
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 157/372 (42%), Gaps = 38/372 (10%)
Query: 89 RQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSA 139
+ I Q+P +++ IGTP + +DT +D W+ C C+GC + F+
Sbjct: 54 QNIVQAPINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPL 113
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAAN-LSQDTISLATDI---- 193
+S+T+ N+ C + C ++ C C + YG +++ L+QDT + ++
Sbjct: 114 KSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPV 173
Query: 194 -VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY-QSTFSYCL-PSFKALSF 249
+ + FGC TG + GL+GLG G SL++Q L+ FS CL P +
Sbjct: 174 SLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKI 233
Query: 250 SGSLRLGPIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
S + G Q + TPL+ + +S Y+V LL I V + N T G
Sbjct: 234 SSRMSFGKGSQVLGNGVVTTPLVPREKDTS-YFVTLLGISVEDTYFPM-------NSTIG 285
Query: 308 -AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCYSVP--IVAPTIT 363
A ++DSGT L Y V R +V +T G CY + PT+T
Sbjct: 286 KANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTNLKGPTLT 345
Query: 364 LMFSGMNVTLPQDNLLIHST--AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
F G NV L I T I CLA+ + NS V N Q N+ I +D+
Sbjct: 346 FHFVGANVLLTPIQTFIPPTPQTKGIFCLAIY---NRTNSDPGVYGNFAQSNYLIGFDLD 402
Query: 422 NSRLGVARELCT 433
+ CT
Sbjct: 403 RQVVSFKPTDCT 414
>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 89/312 (28%), Positives = 129/312 (41%), Gaps = 52/312 (16%)
Query: 169 FNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ 228
F YG ++ A L D++SL + V +TFGC P G+ G GRG LSL AQ
Sbjct: 182 FYYAYGDGSLVAKLFSDSLSLPSVSVANFTFGCAHTTLAE---PIGVAGFGRGRLSLPAQ 238
Query: 229 ---TQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------GQPKRIK------------- 265
++FSYCL S + R P+ + KR+
Sbjct: 239 LSVHSPHLGNSFSYCLVS-HSFDSDRVRRPSPLILGRFVDKKEKRVATTDDDDDGDETKK 297
Query: 266 ------YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
+T +L NP+ Y V+L I +G+R + P + + G G ++DSGT FT
Sbjct: 298 KKNEFVFTEMLVNPKHPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFT 357
Query: 320 RLVAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV--PIVAPTITLMFS--GMNV 371
L A Y +V + F RVG V G CY + + P + L F+ G V
Sbjct: 358 MLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNGSTV 417
Query: 372 TLPQDNLLIHSTAG--------SITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDV 420
TLP+ N G + CL + D ++ N QQQ ++YD+
Sbjct: 418 TLPRRNYFYEFMDGGDGKEEKRKVGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDL 477
Query: 421 PNSRLGVARELC 432
N R+G A+ C
Sbjct: 478 LNRRVGFAKRKC 489
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/428 (25%), Positives = 165/428 (38%), Gaps = 95/428 (22%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG--------------- 127
+P+ +GR Y K+G+P Q +A DT ++ W C
Sbjct: 98 MPMRAGRDDALGE-YFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKN 156
Query: 128 ------------------------------CVGCSSTVFNSAQSTTFKNLGCQAAQCK-- 155
C G VF +S +F+ + C + +CK
Sbjct: 157 KTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKG----VFCPHRSKSFQAVTCASQKCKID 212
Query: 156 --------QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLATDIVPG-------YTF 199
P P+ C ++++Y + A DTI++ D+ G T
Sbjct: 213 LSQLFSLSLCPKPS---DPCLYDISYADGSSAKGFFGTDTITV--DLKNGKEGKLNNLTI 267
Query: 200 GCIQKATGNSV----PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLR 254
GC K+ N V G+LGLG S + + Y + FSYCL + S L
Sbjct: 268 GCT-KSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHLSHRNVSSYLT 326
Query: 255 LGPIGQPK---RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+G K IK T L+ P Y VN++ I +G +++ IPP FN + GT+
Sbjct: 327 IGGHHNAKLLGEIKRTELILFP---PFYGVNVVGISIGGQMLKIPPQVWDFN--SQGGTL 381
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT--SLGGFDTCYSVP----IVAPTITLM 365
IDSGT T L+ PAY V + + + VT G D C+ V P +
Sbjct: 382 IDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDFGALDFCFDAEGFDDSVVPRLVFH 441
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
F+G P I A + C+ + D + +VI N+ QQNH +D+ + +
Sbjct: 442 FAGGARFEPPVKSYIIDVAPLVKCIGIVPI-DGIGGA-SVIGNIMQQNHLWEFDLSTNTI 499
Query: 426 GVARELCT 433
G A +CT
Sbjct: 500 GFAPSICT 507
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 158/367 (43%), Gaps = 63/367 (17%)
Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTFKNLGC 149
++GTP ++A+DT +D WVPC C C+ T ++N +S+T K + C
Sbjct: 102 ELGTPGVKFMVALDTGSDLFWVPCD-CSRCAPTHGASYASDFELSIYNPRESSTSKKVTC 160
Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT-----DIVPGY-TFGC 201
C Q +C + ++Y S+ + + L +D + L T + V Y TFGC
Sbjct: 161 NNDMCAQRNRCLGTFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREFVEAYVTFGC 220
Query: 202 IQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
Q +G + P GL GLG +S+ + + L +FS C G + G
Sbjct: 221 GQVQSGSFLDIAAPNGLFGLGMEKISVPSVLSREGLIADSFSMCF----GHDGIGRISFG 276
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
G P + + TP NP + Y V + RVG ++D+ AL DSGT
Sbjct: 277 DKGSPDQ-EETPFNVNPAHPT-YNVTVTQARVGTMLIDVEFTAL-----------FDSGT 323
Query: 317 VFTRLVAPAYTAVRDVFR-----RRVGSNLTVTSLGGFDTCYSV-----PIVAPTITLMF 366
FT +V PAY+ V + F +R + + F+ CY + + P+++L
Sbjct: 324 SFTYMVDPAYSRVSEKFHSLARDKRRPPDPRIP----FEYCYDMSPDANASLVPSMSLTM 379
Query: 367 SGMNVTLPQDNLLIHSTAGSIT-CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
G D +++ ST I CLA+ + + LN+I +R+++D L
Sbjct: 380 KGGRHFTVYDPIIVISTQNEIVYCLAVVKSTE-----LNIIGQNFMTGYRVVFDREKLVL 434
Query: 426 GVARELC 432
G + C
Sbjct: 435 GWKKFDC 441
>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
Length = 431
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 166/398 (41%), Gaps = 68/398 (17%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTT 143
P A+ + + + V +GTP Q + M +DT ++ +W+ C G + A T
Sbjct: 42 PAANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNG---------SYAPPLT 92
Query: 144 FKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGSSTIAAN-LSQDTISLATDIVP---G 196
++ + VP P C AC +L+Y ++ A L+ DT L P G
Sbjct: 93 RRSTRRWRGRDLPVP-PFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVG 151
Query: 197 YTFGCI----------QKATGNSVPPQ--GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSF 244
FGCI TG V GLLG+ RG+LS + QT F+YC+
Sbjct: 152 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGT---RRFAYCIAPG 208
Query: 245 KALSFSGSLRLGPIGQ-PKRIKYTPLLKNPR-----RSSLYYVNLLAIRVGRRVVDIPPG 298
+ G L LG G + YTPL++ + Y V L IRVG ++ IP
Sbjct: 209 EG---PGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKS 265
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSL--------GGFD 350
L + T T++DSGT FT L+A AY A++ F + + L + L G FD
Sbjct: 266 VLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ--ARLLLAPLGEPGFVFQGAFD 323
Query: 351 TCYSVPI--------VAPTITLMFSGMNVTLPQDNLLI--------HSTAGSITCLAMAA 394
C+ P + P + L+ G V + + LL A ++ CL
Sbjct: 324 ACFRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGN 383
Query: 395 APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ D VI + QQN + YD+ N R+G A C
Sbjct: 384 S-DMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 420
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 95/328 (28%), Positives = 144/328 (43%), Gaps = 42/328 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQ 153
YI++ IG P + +DT +D WV C+ C GC+ S +++ A+S + L C +
Sbjct: 87 YIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQL 146
Query: 154 CK-----QVPNPTCGGGA--CAFNLTYGSS---TIAANLSQDTISLATDIVPG-YTFGCI 202
C+ ++ + C C ++ YG S + L +T + V +FG
Sbjct: 147 CQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANNVSFGRS 206
Query: 203 QKATGNSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL---PSFKALSFSGSLRLGPI 258
G+ GL+GLGRG LSL++Q L F+YCL P+ + GSL
Sbjct: 207 DTIDGSQFGGTAGLVGLGRGHLSLVSQ---LGAGRFAYCLAADPNVYSTILFGSLAALDT 263
Query: 259 GQPKRIKYTPLLKNPR--RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
+ TPL+ NP+ R + YYVNL I VG + I G N G DSG
Sbjct: 264 -SAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGA 322
Query: 317 VFTRLVAPAYTAVRDVFR---RRVGSNLTVTSLGGFDTCY------SVPIVAPTITLMFS 367
+ T L AY VR +R+G + G DTC+ +V + P +
Sbjct: 323 IDTSLKDAAYQVVRQAITSEIQRLGYD------AGDDTCFVAANQQAVAQMPPLVLHFDD 376
Query: 368 GMNVTLPQDNLLIHSTAGS---ITCLAM 392
G +++L N L ST G + C+A+
Sbjct: 377 GADMSLNGRNYLKTSTKGPSEVLVCMAI 404
>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 441
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/431 (25%), Positives = 176/431 (40%), Gaps = 90/431 (20%)
Query: 82 VVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC-----TGCVGCSST-- 134
+ PIA T + Y++ +GTP Q + +DT +D WVPC C+ C +
Sbjct: 15 IEPIA-----TYTDGYLLSLNLGTPPQVFQVYLDTGSDLTWVPCGTNTSYQCLECGNEHS 69
Query: 135 ------VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACA-------------------- 168
F+ +QS + C + C V + ACA
Sbjct: 70 ISKPTPAFSLSQSYSSTRDLCGSRFCVDVHSSDNSHDACAAAGCSIPVFMSGLCTRLCPP 129
Query: 169 FNLTYGS-STIAANLSQDTISLATDI--------VPGYTFGCIQKATGNSV-PPQGLLGL 218
F TYG + + +L++DTI+L I PG+ FGC+ G+S+ P G+ G
Sbjct: 130 FAYTYGGRALVLGSLARDTIALHGSIYGISVPIEFPGFCFGCV----GSSIREPIGIAGF 185
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSF---KALSFSGSLRLGPIGQPKR--IKYTPLLKNP 273
G+G LSL +Q L FS+C F + + + + +G + + +TP+LK+
Sbjct: 186 GKGKLSLPSQLGFL-DKGFSHCFLGFWFARNPNITSPMVIGDLALSVKDGFLFTPMLKSL 244
Query: 274 RRSSLYYVNLLAIRVGRR-VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
+ YY+ L + +G + PP + G I+D+GT +T L P Y +V
Sbjct: 245 TYPNFYYIGLEGVTIGDNAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLSDPFYASVLSS 304
Query: 333 FRRRVGSN--LTVTSLGGFDTCYSVPIVA--------PTITLMFSG-MNVTLPQDNLLIH 381
V N + GFD C VP + P IT+ G + + LP+++
Sbjct: 305 LSSTVPYNRSYELEIRTGFDLCLKVPCMHAPCNDDELPPITVHLGGDVTLALPKESCYYA 364
Query: 382 STAGS----ITCLAMAAAPDN-VNSVLN---------------VIANMQQQNHRILYDVP 421
TA I CL D+ V S N V+ + Q QN ++YD+
Sbjct: 365 VTAPRNSVVIKCLLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEVVYDLE 424
Query: 422 NSRLGVARELC 432
+ R+G C
Sbjct: 425 SGRVGFQPRDC 435
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTP++T ++ +DT + A+WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 149/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +PG++FGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L+AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ +R + CY + V P I+L F
Sbjct: 234 IPDRALSVLSQRIRELLLKRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDAARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/358 (25%), Positives = 146/358 (40%), Gaps = 83/358 (23%)
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVP 158
V +GTP Q + M +DT ++ +W+ C T F+ +S+++ + C +
Sbjct: 70 VSLTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTTFDPNRSSSYSPVPCSS------- 121
Query: 159 NPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGL 218
C + + N+ GL+G+
Sbjct: 122 ----------------------------------------LTCTDQDSKNT----GLMGM 137
Query: 219 GRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKN---- 272
RGSLS ++Q FSYC+ FSG L LG + YTPL++
Sbjct: 138 NRGSLSFVSQMDF---PKFSYCI---SDSDFSGVLLLGDANFSWLMPLNYTPLIQISTPL 191
Query: 273 PRRSSL-YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRD 331
P + Y V L I+V +++ +P + T T++DSGT FT L+ P Y+A+R+
Sbjct: 192 PYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRN 251
Query: 332 VFRRRVGSNLTVTS------LGGFDTCYSVPIVA------PTITLMFSGMNVTLPQDNLL 379
F + L V GG D CY VP+ PT++LMF G + + D LL
Sbjct: 252 EFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFRGAEMKVSGDRLL 311
Query: 380 IH-----STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ S+ C + D + VI + QQN + +D+ SR+G A+ C
Sbjct: 312 YRVPGEVRGSDSVYCFTFGNS-DLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 368
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 171/395 (43%), Gaps = 58/395 (14%)
Query: 81 SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------- 133
SVV +A G + + KIG + + +DT +D WV C GC C
Sbjct: 58 SVVDVALGGNGRPTSNGLYYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMD 117
Query: 134 -TVFNSAQSTTFKNLGCQAAQCK-----QVPNPTCGGGACAFNLTYGS------STIAAN 181
T+++ S T K + C C Q+ T G +C +++TYG S I +
Sbjct: 118 LTLYDPNLSKTSKAVPCDDEFCTSTYDGQISGCT-KGMSCPYSITYGDGSTTSGSYIKDD 176
Query: 182 LSQDTISLATDIVPGYT---FGCIQKATG-----NSVPPQGLLGLGRGSLSLLAQ--TQN 231
L+ D + VP T FGC K +G G++G G+ + S+L+Q
Sbjct: 177 LTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAG 236
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRR 291
+ FS+CL S +S G +G + QPK +K TPLL+ Y V L I V
Sbjct: 237 KVKRIFSHCLDS---ISGGGIFAIGEVVQPK-VKTTPLLQGMAH---YNVVLKDIEVAGD 289
Query: 292 VVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA-VRDVFRRRVGSNLTVTSLGGFD 350
+ +P L + ++G GTIIDSGT L Y + + +R G L + F
Sbjct: 290 PIQLPSDIL--DSSSGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVE-DQF- 345
Query: 351 TCY------SVPIVAPTITLMF-SGMNV-TLPQDNLLIHSTAGSITCL----AMAAAPDN 398
TC+ SV + PT+ F G+ + T P+D L + + C+ +MA D
Sbjct: 346 TCFHYSDEESVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKE--DMWCVGWQKSMAQTKDG 403
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+L + ++ N ++YD+ N +G A C+
Sbjct: 404 KELIL--LGDLVLANKLVVYDLDNMAIGWADYNCS 436
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 158/364 (43%), Gaps = 47/364 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C S F S T++ + C Q
Sbjct: 93 YTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC-TWQ 151
Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATGN- 208
C N C + Y ST + L +D +S +++ P FGC TG+
Sbjct: 152 C----NCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCENDETGDI 207
Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
+ G++GLGRG LS++ Q + + FS C G++ LG I P +
Sbjct: 208 YNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCY--GGMGVGGGAMVLGGISPPADMV 265
Query: 266 YTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTT---GAGTIIDSGTVFTRL 321
+T +P RS Y ++L I V G+R L NP GT++DSGT + L
Sbjct: 266 FTH--SDPVRSPYYNIDLKEIHVAGKR--------LHLNPKVFDGKHGTVLDSGTTYAYL 315
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGF--DTCYSVPIVA--------PTITLMF-SGMN 370
A+ A + + S ++ D C+S + P + ++F +G
Sbjct: 316 PESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPVVEMVFGNGHK 375
Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
++L P++ L HS CL + + N N ++ + +N ++YD +S++G +
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFS---NGNDPTTLLGGIVVRNTLVMYDREHSKIGFWK 432
Query: 430 ELCT 433
C+
Sbjct: 433 TNCS 436
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 160/372 (43%), Gaps = 37/372 (9%)
Query: 89 RQITQSPT------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSA 139
+ I Q+P Y++ IGTP + +DT +D WV C C+GC + + F+
Sbjct: 50 QDIVQAPINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPL 109
Query: 140 QSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTIAAN-LSQDTISLATDI---- 193
+S+T+ N+ C + C + C C + Y S++ L+Q+T++L ++
Sbjct: 110 KSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPI 169
Query: 194 -VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLY-QSTFSYCL-PSFKALSF 249
+ G FGC TGN + GL+GLG G SL++Q L+ FS CL P ++
Sbjct: 170 SLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITI 229
Query: 250 SGSLRLGPIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
S + G + + + TPL++ + + YYV LL I V L N T
Sbjct: 230 SSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTY-------LPMNSTIE 282
Query: 308 AGT-IIDSGTVFTRLVAPAYTAVRDVFRRRVG-SNLTVTSLGGFDTCY--SVPIVAPTIT 363
G ++DSGT L Y V + +V +T G CY + PT+T
Sbjct: 283 KGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLKGPTLT 342
Query: 364 LMFSGMNVTLPQDNLLIHSTAGS--ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
F G N+ L I T + + CLA+ NS + N Q N+ I +D+
Sbjct: 343 YHFEGANLLLTPIQTFIPPTPETKGVFCLAITNC---ANSDPGIYGNFAQTNYLIGFDLD 399
Query: 422 NSRLGVARELCT 433
+ CT
Sbjct: 400 RQIVSFKPTDCT 411
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + + +A AP S++
Sbjct: 289 DLGRRGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 158/362 (43%), Gaps = 45/362 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F S+T++ + C
Sbjct: 112 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC-TID 170
Query: 154 CKQVPNPTCGGG--ACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATG 207
C C G C + Y ST + L +D IS +++ P FGC TG
Sbjct: 171 C------NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETG 224
Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
+ S G++GLGRG LS++ Q + + +FS C G++ LG I P
Sbjct: 225 DLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV--GGGAMVLGGISPPSD 282
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
+ + +P RS Y ++L + V G+R +P A F+ GT++DSGT + L
Sbjct: 283 MTFA--YSDPDRSPYYNIDLKEMHVAGKR---LPLNANVFD--GKHGTVLDSGTTYAYLP 335
Query: 323 APAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMNV 371
A+ A +D + + S ++ D C+S + P + ++F +G
Sbjct: 336 EAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKY 395
Query: 372 TL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+L P++ + HS CL + N N ++ + +N ++YD +++G +
Sbjct: 396 SLSPENYMFRHSKVRGAYCLGIF---QNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKT 452
Query: 431 LC 432
C
Sbjct: 453 NC 454
>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 441
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 166/367 (45%), Gaps = 40/367 (10%)
Query: 98 IVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC------SSTVFNSAQSTTFKNLGCQA 151
+V IGTP Q M +DT + +W+ C G +++ F+ + S++F L C
Sbjct: 70 VVTLPIGTPPQLQQMVLDTGSQVSWIHCDNKKGPQKKQPPTTSSFDPSLSSSFFALPCNH 129
Query: 152 AQCK-QVPN---PT-CGGGA-CAFNLTYGSSTI-AANLSQDTISLATDIV-PGYTFGCIQ 203
CK QVP+ PT C C ++ +Y T+ NL ++ I+L+ + P GC
Sbjct: 130 PLCKPQVPDISLPTDCDANRLCHYSFSYTDGTVVEGNLVRENIALSPSLTTPPIILGCAN 189
Query: 204 KATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
++ +G+LG+ G LS Q + + FSY +P + SGSL LG
Sbjct: 190 QSDD----ARGILGMNLGRLSFPNQAK---ITKFSYFVPVKQTQPGSGSLYLGNNPNSSC 242
Query: 264 IKYTPLLKNPRRSSLYYVNL---------LAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
+Y LL + S NL I +G + ++IPP + + T TIIDS
Sbjct: 243 FRYVKLLTFSKSQSQRMPNLDPLAFTLPMQGISIGGKKLNIPPSVFKPDTTGFGQTIIDS 302
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVP------IVAPTITLMF 366
G+ F+ +V AY +R+ ++VGS + + G D C+ +V +
Sbjct: 303 GSEFSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVADICFDGDATEIGRLVGDMVFEFE 362
Query: 367 SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G+ + +P++ +LI G + C + A + + N+I N QQN + +D+ R+G
Sbjct: 363 KGVEIVIPKERVLIE-VDGGVHCFGIGRA-EGLGGGGNIIGNFYQQNLWVEFDLAKHRVG 420
Query: 427 VARELCT 433
C+
Sbjct: 421 FRGANCS 427
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/424 (23%), Positives = 174/424 (41%), Gaps = 59/424 (13%)
Query: 46 FKPSKPLSWEESVLEMLAKDQARLQ--FLSSLAVARKSVVPIASGRQITQSPTYIVRAKI 103
FK + +E LE R L+S+ + P+ ++ Y + K+
Sbjct: 27 FKVQHKFAGKEKKLEHFKSHDTRRHSRMLASIDL------PLGGDSRVDSVGLYFTKIKL 80
Query: 104 GTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLGCQAAQCK 155
G+P + + +DT +D WV C C C S ++F+ S+T K +GC C
Sbjct: 81 GSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCS 140
Query: 156 QVP-----NPTCGGGACAFNLTYG-SSTIAANLSQDTISLAT---DIVPG-----YTFGC 201
+ P G C++++ Y ST N +D ++L D+ G FGC
Sbjct: 141 FISQSDSCQPAVG---CSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGC 197
Query: 202 IQKATG----NSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRL 255
+G + G++G G+ + S+L+Q + FS+CL + K G +
Sbjct: 198 GSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKG---GGIFAV 254
Query: 256 GPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSG 315
G + PK +K TP++ N Y V L+ + V +D+PP ++ GTI+DSG
Sbjct: 255 GVVDSPK-VKTTPMVPNQMH---YNVMLMGMDVDGTALDLPPSIMR-----NGGTIVDSG 305
Query: 316 TVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSG-MNVT 372
T Y ++ + R+ V ++ + F +V + P ++ F + +T
Sbjct: 306 TTLAYFPKVLYDSLIETILARQPVKLHIVEDTFQCFSFSENVDVAFPPVSFEFEDSVKLT 365
Query: 373 L-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI--ANMQQQNHRILYDVPNSRLGVAR 429
+ P D L + + C A VI ++ N ++YD+ N +G A
Sbjct: 366 VYPHDYLF--TLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWAD 423
Query: 430 ELCT 433
C+
Sbjct: 424 HNCS 427
>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
Length = 503
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 153/392 (39%), Gaps = 78/392 (19%)
Query: 114 MDTSNDAAWVPCTG-----CVG--------------------CSSTVFNSAQSTTFKNLG 148
+DT +D W PC C G C+S + ++A ++ +
Sbjct: 109 LDTGSDLVWFPCAPFTCMLCEGKPTPGRLGPLPPPPDSRRIPCASPLCSAAHASAPPSDL 168
Query: 149 CQAAQC--KQVPNPTCGGG-ACA-FNLTYGSSTIAANLSQDTISLATDI-------VPGY 197
C A+C + + +CG AC YG ++ A+L + ++L V +
Sbjct: 169 CAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAVAVDNF 228
Query: 198 TFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL--PSFKALSFSGSLRL 255
TF C A G P G+ G GRG LSL Q FSYCL SF+A +R
Sbjct: 229 TFACAHTALGE---PVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRL---IRP 282
Query: 256 GPI------------GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
P+ + YTPLL NP+ Y V L A+ VG + P + +
Sbjct: 283 SPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPELARVD 342
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-----VTSLGGFDTCYSVPIV 358
G ++DSGT FT L Y V + F R + + G CY
Sbjct: 343 RAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRYAAS 402
Query: 359 ---APTITLMFSG-MNVTLPQDNLLI-----HSTAGS----ITCLAMA----AAPDNVNS 401
P + L F G V LP+ N + + AG+ + CL + A+ + +
Sbjct: 403 DRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDG 462
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ N QQQ ++YDV R+G AR CT
Sbjct: 463 PAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 494
>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
Length = 490
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 113/404 (27%), Positives = 170/404 (42%), Gaps = 74/404 (18%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--VGCSS-------TVFNSAQSTTFKNL 147
Y +GTP Q L + ++T + +WVP T CSS VF+ S++ + +
Sbjct: 89 YAFTVSLGTPPQPLPVLLETGSHLSWVPSTSSYSANCSSLSAASPLHVFHPKNSSSSRLI 148
Query: 148 GCQAAQCKQVPNP----------TCGGGACA------------FNLTYGSSTIAANLSQD 185
GC+ C + +P +C G C + + YGS + A L D
Sbjct: 149 GCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLISD 208
Query: 186 TISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFK 245
T+ V + GC + PP GL G GRG+ S+ +Q L + FSYCL S +
Sbjct: 209 TLRTPGRAVRNFVIGCSLASVHQ--PPSGLAGFGRGAPSVPSQ---LGLTKFSYCLLSRR 263
Query: 246 ---ALSFSGSLRLGPIGQPKR---IKYTPLLKN----PRRSSLYYVNLLAIRVGRRVVDI 295
+ SG L LG G ++Y PL ++ P S YY+ L AI VG + V +
Sbjct: 264 FDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKSVQL 323
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTR----LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT 351
P A G G I+DSGT F+ + P AV R + V G
Sbjct: 324 PERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLSP 382
Query: 352 CYSVP-----IVAPTITLMFSGMNV-TLPQDNLLI---HSTAGSITCLAMAAAPDNVNSV 402
C+++P + P ++L F G +V LP +N + + +G +A A V+ V
Sbjct: 383 CFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSDV 442
Query: 403 LN--------------VIANMQQQNHRILYDVPNSRLGVARELC 432
++ + QQQN+ I YD+ RLG R+ C
Sbjct: 443 PTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 486
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/351 (26%), Positives = 145/351 (41%), Gaps = 59/351 (16%)
Query: 124 PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTC---GGGACAFNLTY-GSSTIA 179
PC C VFN S+++ + C + C Q+ C GAC + Y G
Sbjct: 5 PCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHGVTK 64
Query: 180 ANLSQDTISLATDIVPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFS 238
L+ D +++ D+ FGC + G + GL+GLGRG LSL++Q L F
Sbjct: 65 GTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQ---LSVHRFM 121
Query: 239 YCLPSFKALSFSGSLRLGPIGQP-----KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
YCLP + + SG L LG R+ T + + R S YY+NL + VG
Sbjct: 122 YCLPPPMSRT-SGKLVLGAGADAVRNMSDRVTVT-MSSSTRYPSYYYLNLDGLAVG---- 175
Query: 294 DIPPGALQFNPTT------------------------GAGTIIDSGTVFTRLVAPAYTAV 329
D PG + N T+ G I+D + + L Y +
Sbjct: 176 DQTPGTTR-NATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDEL 234
Query: 330 RDVFRRRVGSNLTVTSLG-GFDTCYSVP-------IVAPTITLMFSGMNVTLPQDNLLIH 381
D + SL G D C+ +P + PT++L F G + L +D L +
Sbjct: 235 ADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLFV- 293
Query: 382 STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
T G + CL + S ++++ N Q QN R+L+++ ++ A+ C
Sbjct: 294 -TDGRMMCLMIGR-----TSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSKGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 152/360 (42%), Gaps = 36/360 (10%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y++ IGTP + DT +D W VPC C + +F+ +ST+++N+ C +
Sbjct: 25 YLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSKL 84
Query: 154 CKQVPNPTCG-GGACAFNLTYGSSTIAAN-LSQDTISLAT---DIVP--GYTFGCIQKAT 206
C ++ C C + Y S+ I L+Q+TI+L++ + VP G FGC T
Sbjct: 85 CHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHNNT 144
Query: 207 GN-SVPPQGLLGLGRGSLSLLAQTQNLYQST-FSYCLPSFKA-LSFSGSLRLGPIGQ--P 261
G + G++GLG G +S ++Q + + FS CL F +S S + LG +
Sbjct: 145 GGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVSG 204
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG-----AGTIIDSGT 316
K + TPL+ ++ Y+V LL I VG L FN ++ +DSGT
Sbjct: 205 KGVVSTPLVAKQDKTP-YFVTLLGISVGNTY-------LHFNGSSSQSVEKGNVFLDSGT 256
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG-GFDTCYSVP--IVAPTITLMFSGMNVTL 373
T L Y + R V L G CY + P +T F G +V L
Sbjct: 257 PPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNLRGPVLTAHFEGGDVKL 316
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ G + CL N +S V N Q N+ I +D+ + CT
Sbjct: 317 LPTQTFVSPKDG-VFCLGFT----NTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDCT 371
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 159/370 (42%), Gaps = 41/370 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R K+G+P + + +DT +D WV C+ C GC S+ FN S+T +
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 149 CQAAQCK---QVPNPTC---GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT--- 198
C +C Q C C + TYG S + DT+ T + T
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 199 -----FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKAL 247
FGC +G+ G+ G G+ LS+++Q +L S FS+CL +
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG--SD 294
Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
+ G L LG I +P + YTPL+ + Y +NL +I V + +P + F +
Sbjct: 295 NGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIVVNGQ--KLPIDSSLFTTSNT 348
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTITL 364
GTI+DSGT L AY + V ++ ++ S G F T SV PT++L
Sbjct: 349 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSL 408
Query: 365 MF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
F G+ +T+ +N L+ + L N + ++ ++ ++ +YD+ N
Sbjct: 409 YFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANM 468
Query: 424 RLGVARELCT 433
R+G C+
Sbjct: 469 RMGWTDYDCS 478
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 159/370 (42%), Gaps = 41/370 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R K+G+P + + +DT +D WV C+ C GC S+ FN S+T +
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 149 CQAAQCK---QVPNPTC---GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGYT--- 198
C +C Q C C + TYG S + DT+ T + T
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210
Query: 199 -----FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKAL 247
FGC +G+ G+ G G+ LS+++Q +L S FS+CL +
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG--SD 268
Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
+ G L LG I +P + YTPL+ + Y +NL +I V + +P + F +
Sbjct: 269 NGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIVVNGQ--KLPIDSSLFTTSNT 322
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTITL 364
GTI+DSGT L AY + V ++ ++ S G F T SV PT++L
Sbjct: 323 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSL 382
Query: 365 MF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
F G+ +T+ +N L+ + L N + ++ ++ ++ +YD+ N
Sbjct: 383 YFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANM 442
Query: 424 RLGVARELCT 433
R+G C+
Sbjct: 443 RMGWTDYDCS 452
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 165/384 (42%), Gaps = 46/384 (11%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
+P+ + Y + K+G+P + + +DT +D WV C C +G +
Sbjct: 63 LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLS 122
Query: 135 VFNSAQSTTFKNLGCQAAQCKQV-PNPTCGGGA-CAFNLTYGS-STIAANLSQDTISLAT 191
+++S S+T KN+GC+ A C + + TCG C++++ YG ST + +D I+L
Sbjct: 123 LYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITL-- 180
Query: 192 DIVPG----------YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQS 235
D V G FGC + +G G++G G+ + S+++Q +
Sbjct: 181 DQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKR 240
Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
FS+CL + G +G + P +K TPL+ N Y V L + V +D+
Sbjct: 241 IFSHCLDNMNG---GGIFAIGEVESP-VVKTTPLVPNQVH---YNVILKGMDVDGEPIDL 293
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDTCY 353
PP N GTIIDSGT L Y ++ + +++V ++ + F
Sbjct: 294 PPSLASTNGD--GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFACFSFTS 351
Query: 354 SVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI--ANM 409
+ P + L F + +++ P D L S + C + +VI ++
Sbjct: 352 NTDKAFPVVNLHFEDSLKLSVYPHDYLF--SLREDMYCFGWQSGGMTTQDGADVILLGDL 409
Query: 410 QQQNHRILYDVPNSRLGVARELCT 433
N ++YD+ N +G A C+
Sbjct: 410 VLSNKLVVYDLENEVIGWADHNCS 433
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 159/372 (42%), Gaps = 49/372 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R K+GTP + + +DT +D WV C+ C C T F++ S+T + +
Sbjct: 81 YFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVP 140
Query: 149 CQAAQCK---QVPNPTC--GGGACAFNLTYGS-STIAANLSQDTI--------SLATDIV 194
C C Q C C++ YG S + DT SL +
Sbjct: 141 CSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSS 200
Query: 195 PGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALS 248
FGC +G+ G+ G G+G LS+++Q + + FS+CL S
Sbjct: 201 AAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGED--S 258
Query: 249 FSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G L LG I +P I Y+PL+ + Y ++L +I V +++ I P A F ++
Sbjct: 259 GGGILVLGEILEPG-IVYSPLVPSQPH---YNLDLQSIAVSGQLLPIDPAA--FATSSNR 312
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITL 364
GTIID+GT LV AY V S L ++ + CY SV V P ++
Sbjct: 313 GTIIDTGTTLAYLVEEAYDPFVSAITAAV-SQLATPTINKGNQCYLVSNSVSEVFPPVSF 371
Query: 365 MFSGMNVTL--PQDNL--LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDV 420
F+G L P++ L L + ++ C+ + + ++ ++ ++ +YD+
Sbjct: 372 NFAGGATMLLKPEEYLMYLTNYAGAALWCIGF----QKIQGGITILGDLVLKDKIFVYDL 427
Query: 421 PNSRLGVARELC 432
+ R+G A C
Sbjct: 428 AHQRIGWANYDC 439
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 106/414 (25%), Positives = 167/414 (40%), Gaps = 46/414 (11%)
Query: 51 PLSWEESVLEMLAKDQARLQFL-SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQT 109
P++ E+ + + AR ++L +S+ S Q ++ ++V +G P
Sbjct: 49 PITPEDHIKHLTDISSARFKYLQNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVP 108
Query: 110 LLMAMDTSNDAAWVPCTGCVGCSST-----VFNSAQSTTFKNLGCQAAQCKQVPNPTCG- 163
L MDT + W+ C C CSS VFN A S+TF C C+ PN CG
Sbjct: 109 QLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGS 168
Query: 164 GGACAFNLTYGSSTIAAN-LSQDTISLATD-----IVPGYTFGCIQKATGNSVPPQ--GL 215
C + Y S T + L+++ ++ T + FGC G + G+
Sbjct: 169 SNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGC-GYENGEQLESHFTGI 227
Query: 216 LGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-GSLRLGP----IGQPKRIKYTPLL 270
LGLG SL Q S FSYC+ ++ L LG +G P I++
Sbjct: 228 LGLGAKPTSLAVQL----GSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFE--- 280
Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN-PTTGAGTIIDSGTVFTRLVAPAYTAV 329
+S+YY+NL I VG ++I P + P TG I+DSGT++T L AY +
Sbjct: 281 ---TENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGV--ILDSGTLYTWLADIAYREL 335
Query: 330 RDVFRRRVGSNLTVTSLGGFDTCY----SVPIVA-PTITLMFSGMNVTLPQDNLLIH--S 382
+ + + L F CY S ++ P +T F+G + + + S
Sbjct: 336 YNEIKSILDPKLERFWFRDF-LCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYPLS 394
Query: 383 TAGSITCLAMAAAPDNVN----SVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ M+ P + I M QQ + I YD+ + + R C
Sbjct: 395 EPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDC 448
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 166/370 (44%), Gaps = 61/370 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFK 145
Y V A +GTP T L+A+DT +D WVPC C+ C+ V++ AQSTT +
Sbjct: 63 YAVVA-LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSR 120
Query: 146 NLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGY 197
+ C + C +C +++ Y S +++ L +D + L +D +
Sbjct: 121 KVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 180
Query: 198 TFGCIQKATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGS 252
FGC Q TG+ S P GLLGLG S S+ L ++ L ++FS C G
Sbjct: 181 MFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGR 236
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G G + K TPL N + + YY + + I VG + + +T I
Sbjct: 237 INFGDTGSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAI 282
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMF 366
+DSGT FT L P YT + F ++ S N+ +S+ F+ CYSV IV P ++L
Sbjct: 283 VDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTA 341
Query: 367 SGMNVTLPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
G ++ P ++ +I T + CLA+ + +N+I ++++D
Sbjct: 342 KGGSI-FPVNDPIITITDNAFNPVGYCLAIMKSEG-----VNLIGENFMSGLKVVFDRER 395
Query: 423 SRLGVARELC 432
LG C
Sbjct: 396 MVLGWKNFNC 405
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 153/363 (42%), Gaps = 55/363 (15%)
Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTFKNLGC 149
++GTP ++A+DT +D WVPC C C+ T +++ QS+T K + C
Sbjct: 106 ELGTPGMKFMVALDTGSDLFWVPC-DCSKCAPTQGVAYASDFELSIYDPKQSSTSKKVTC 164
Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGC 201
C +C + ++Y S+ + + L +D + L ++ I TFGC
Sbjct: 165 NNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQESIKAYVTFGC 224
Query: 202 IQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
Q +G N+ P GL GLG +S+ + + L +FS C G + G
Sbjct: 225 GQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCF----GHDGVGRISFG 280
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
G P + + TP NP S Y +++ +RVG +VD+ AL DSGT
Sbjct: 281 DKGSPDQ-EETPFNSNPSHPS-YNISVTQVRVGTTLVDVDFTAL-----------FDSGT 327
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG-GFDTCYSVPIVA-----PTITLMFSGMN 370
FT L+ P Y V + F + F+ CY + A P+++L G
Sbjct: 328 SFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSPGANSSLIPSMSLTMKGRG 387
Query: 371 VTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
D +++ +T + CLA+ + + LN+I +R+++D LG
Sbjct: 388 HFTVFDPIIVITTQNELVYCLAIVKSTE-----LNIIGQNFMTGYRVVFDREKLVLGWKE 442
Query: 430 ELC 432
C
Sbjct: 443 TDC 445
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 170/390 (43%), Gaps = 56/390 (14%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
+P+ T++ Y R IGTPA+ + +DT +D WV C C GC + T
Sbjct: 76 LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELT 135
Query: 143 TFKNLGCQAAQ---CKQ---VPN-----PTCGGGA-CAFNLTYGSSTIAAN------LSQ 184
+ G Q+ + C Q V N P+C + C ++++YG + A L
Sbjct: 136 MYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQY 195
Query: 185 DTISLATDIVPG---YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQS 235
+ +S P +FGC K G+ ++ G+LG G+ + S+L+Q +
Sbjct: 196 NQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRK 255
Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
F++CL + G +G + QPK +K TPL+ + Y V L I VG + +
Sbjct: 256 MFAHCLDTVNG---GGIFAIGNVVQPK-VKTTPLVSDMPH---YNVILKGIDVGGTALGL 308
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY- 353
P F+ GTIIDSGT + Y A+ VF + +++V +L F +C+
Sbjct: 309 PTNI--FDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKH--QDISVQTLQDF-SCFQ 363
Query: 354 ---SVPIVAPTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM----AAAPDNVNSVL 403
SV P +T F G +V+L P D L ++ C+ D + VL
Sbjct: 364 YSGSVDDGFPEVTFHFEG-DVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVL 420
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ ++ N +LYD+ N +G A C+
Sbjct: 421 --LGDLVLSNKLVLYDLENQAIGWADYNCS 448
>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
Group]
Length = 260
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/247 (34%), Positives = 119/247 (48%), Gaps = 24/247 (9%)
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA----LSFS 250
PG FGC ++ G GL+GLGRG LSL+ Q L F Y L S + +SF
Sbjct: 15 PGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQ---LNVEAFGYRLSSDLSAPSPISF- 70
Query: 251 GSLRLGPIGQPKRIKYTPLLKNPRRSSL--YYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
GSL G TPLL NP L YYV L I VG ++V IP G F+ +TGA
Sbjct: 71 GSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGA 130
Query: 309 GTII-DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD-TCY---SVPIVAPTIT 363
G +I DSGT T L PAYT VRD ++G + D C+ S P++
Sbjct: 131 GGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMV 190
Query: 364 LMFS-GMNVTLPQDNLL--IHSTAGSIT-CLAMAAAPDNVNSVLNVIANMQQQNHRILYD 419
L F G ++ L +N L + G C ++ + + L +I N+ Q + +++D
Sbjct: 191 LHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKS----SQALTIIGNIMQMDFHVVFD 246
Query: 420 VP-NSRL 425
+ N+R+
Sbjct: 247 LSGNARM 253
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 111/436 (25%), Positives = 180/436 (41%), Gaps = 62/436 (14%)
Query: 5 LVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVLEMLA 63
++F+ + F+ SLS LN + ++++ H S SP ++P++ +
Sbjct: 8 ILFYFSLCFIISLSHALN-------NGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRS 60
Query: 64 KDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV 123
++A + ++L +S V I Y++ +GTP L DT +D W+
Sbjct: 61 INRANHFYKTALTNTPQSTV-------IPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWL 113
Query: 124 ---PCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAA 180
PC C ++ F ++S+T+KN+ C + CK S
Sbjct: 114 QCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDLCK--------------------SGQQG 153
Query: 181 NLSQDTISLATDI-----VPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQTQN 231
NLS DT++L + P GC T N+V + G++GLG G SL+ Q +
Sbjct: 154 NLSVDTLTLESSTGHPISFPKTVIGC---GTDNTVSFEGASSGIVGLGGGPASLITQLGS 210
Query: 232 LYQSTFSYC-LPSFKALSFSGSLRLGP--IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRV 288
+ FSYC LP+ + + L G + + TP++K YY+ L A V
Sbjct: 211 SIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKK-DPIVFYYLTLEAFSV 269
Query: 289 GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
G + ++ + N IIDSGT T + Y + V
Sbjct: 270 GNKRIEFEGSS---NGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRL 326
Query: 349 FDTCYSVPIVA---PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNS-VLN 404
F+ CYSV P IT F G +V L + + A I CLA A + S V++
Sbjct: 327 FNLCYSVTSDGYDFPIITTHFKGADVKLHPISTFV-DVADGIVCLAFATTSAFIPSDVVS 385
Query: 405 VIANMQQQNHRILYDV 420
+ N+ QQN + YD+
Sbjct: 386 IFGNLAQQNLLVGYDL 401
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y+ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + + +A AP S++
Sbjct: 289 DLGRHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 163/364 (44%), Gaps = 60/364 (16%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFKNLGCQA 151
+GTP T L+A+DT +D WVPC C+ C+ V++ AQSTT + + C +
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPLQSPNYGSLKFDVYSPAQSTTSRKVPCSS 163
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQ 203
C +C +++ Y S +++ L +D + L +D + FGC Q
Sbjct: 164 NLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQ 223
Query: 204 KATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
TG+ S P GLLGLG S S+ L ++ L ++FS C G + G
Sbjct: 224 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGRINFGDT 279
Query: 259 GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
G + K TPL N + + YY + + I VG + + +T I+DSGT
Sbjct: 280 GSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAIVDSGTS 325
Query: 318 FTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVT 372
FT L P YT + F ++ S N+ +S+ F+ CYSV IV P ++L G ++
Sbjct: 326 FTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTAKGGSI- 383
Query: 373 LPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
P ++ +I T + CLA+ + +N+I ++++D LG
Sbjct: 384 FPVNDPIITITDNAFNPVGYCLAIMKSEG-----VNLIGENFMSGLKVVFDRERMVLGWK 438
Query: 429 RELC 432
C
Sbjct: 439 NFNC 442
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 140/324 (43%), Gaps = 45/324 (13%)
Query: 141 STTFKNLGCQAAQCKQVPNPTCGGGACAFN-------LTYGSSTI-AANLSQDTISLATD 192
S+TFK + C C+ P+ ACA +YG +I A ++ +DT + +
Sbjct: 2 SSTFKAVACPDPICR--PSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSP 59
Query: 193 -----IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKA 246
V FGC TG V + G+ G GRG SL +Q L FSYCL +
Sbjct: 60 NGVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQ---LKVGRFSYCL-TLVT 115
Query: 247 LSFSGSLRLGPIGQPKRIKY--------TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPG 298
S S + LG P ++ TP++ NP + YY++L I VG+ +
Sbjct: 116 ESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKS 175
Query: 299 ALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTV-----TSLGGFDTCY 353
GT+IDSGT T L AV ++ + + + + T G C+
Sbjct: 176 VFALKKDGSGGTVIDSGTSLTTLPE----AVFELLQEELVAQFPLPRYDNTPEVGDRLCF 231
Query: 354 SVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
P + P + L +G ++ LP+DN + + CL + A D + + +I N
Sbjct: 232 RRPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAED---TTMVLIGN 288
Query: 409 MQQQNHRILYDVPNSRLGVARELC 432
QQQN ++YDV N++L A C
Sbjct: 289 FQQQNMHVVYDVENNKLLFAPAQC 312
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 157/369 (42%), Gaps = 39/369 (10%)
Query: 91 ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNL 147
I+ +Y++ +GTP ++L DT +D W C C C V F+ +S T+K L
Sbjct: 88 ISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTL 147
Query: 148 GCQAAQCKQVPNP-TCGG-GACAFNLTYGS-STIAANLSQDTISLATDI-----VPGYTF 199
GC C+ + +CG C + +YG S +LS +T ++ + PG F
Sbjct: 148 GCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAF 207
Query: 200 GCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGP 257
GC G + GL+GLG G LSL+ Q + FSYCL P + S + G
Sbjct: 208 GCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGK 267
Query: 258 --IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI---------PPGALQFNPTT 306
+ TPL+K + YY+ L + +G V P A + N
Sbjct: 268 SAVVSGSGTVSTPLIKG-TPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESN--- 323
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS--VPIVAPTITL 364
IIDSGT T L YT + + +G T G F CYS + PTIT
Sbjct: 324 ---IIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEIPTITA 380
Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F G +V LP N + + + C +M + S L + N+ Q N + YD+ N++
Sbjct: 381 HFIGADVQLPPLNTFVQAQE-DLVCFSMIPS-----SNLAIFGNLSQMNFLVGYDLKNNK 434
Query: 425 LGVARELCT 433
+ CT
Sbjct: 435 VSFKPTDCT 443
>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
Length = 439
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 105/427 (24%), Positives = 165/427 (38%), Gaps = 107/427 (25%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-----CVGCSSTV-----FNSAQSTTFKN 146
Y++ +GTP Q + +DT +D WVPC C+ C S+V F ++ST+
Sbjct: 25 YLLSLNLGTPPQVFQVYLDTGSDLTWVPCGSSSSYQCLDCGSSVKPTPTFLPSESTSNTR 84
Query: 147 LGCQAAQCKQVPN---------------PTCGGGAC-----AFNLTYGSSTIA-ANLSQD 185
C + C V + P GG C F+ TYG + +LS+D
Sbjct: 85 DLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALVLGSLSRD 144
Query: 186 TISLATDI-------------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQN 231
+++L PG+ FGC+ G+S+ P G+ G GRG+LSL +Q
Sbjct: 145 SVTLHGSTHGSGAGAGPLPVAFPGFGFGCV----GSSIREPLGIAGFGRGALSLPSQLGF 200
Query: 232 LYQSTFSYCL--------PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNL 283
L + FS+C P+F + G L L +TP+L + + YYV L
Sbjct: 201 LGKG-FSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLTSATYPNFYYVGL 259
Query: 284 LAIRVGRR----VVDIPPGALQFNPTTGAGTIIDSGTVFTRL--------------VAPA 325
+ +G + PP + G ++D+GT +T+L AP
Sbjct: 260 EGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPDPFYASVLASLISAAPP 319
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA--------PTITLMFSG--------- 368
Y RD+ R GFD C+ VP P ITL +G
Sbjct: 320 YERSRDLEART-----------GFDLCFKVPCARAPCADDELPPITLHLAGGARLALPKL 368
Query: 369 ---MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
VT +D++++ + + V+ + Q QN ++YD+ R+
Sbjct: 369 SSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYDLAAGRV 428
Query: 426 GVARELC 432
G C
Sbjct: 429 GFRPRDC 435
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 107/415 (25%), Positives = 175/415 (42%), Gaps = 47/415 (11%)
Query: 56 ESVLEMLAKDQARLQFLSSLAVARKSVV--PIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
E + E AR + L A A VV P+ Y R K+G PA+ +
Sbjct: 46 EHLKERDGAHHARRRGLLGGAPAVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEYFVQ 105
Query: 114 MDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QVPNPTC 162
+DT +D WV C+ C GC ++ FN S+T + C +C Q C
Sbjct: 106 IDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIPCSDDRCTAALQTGEAVC 165
Query: 163 GGGA-----CAFNLTYGSST------IAANLSQDTI---SLATDIVPGYTFGCIQKATGN 208
C + TYG + ++ + DT+ + FGC +G+
Sbjct: 166 QSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTANSSASVVFGCSNSQSGD 225
Query: 209 SVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKALSFSGSLRLGPIGQPK 262
+ G+ G G+ LS+++Q +L S TFS+CL + + G L LG I +P
Sbjct: 226 LMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKG--SDNGGGILVLGEIVEPG 283
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
+ +TPL+ + Y +NL +I V + +P + F + GTI+DSGT LV
Sbjct: 284 LV-FTPLVPSQPH---YNLNLESIAVSGQ--KLPIDSSLFATSNTQGTIVDSGTTLVYLV 337
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVPIVAPTITLMFS-GMNVTLPQDNL 378
AY + V ++ G F T SV PT TL F G+++T+ +N
Sbjct: 338 DGAYDPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPTATLYFKGGVSMTVKPENY 397
Query: 379 LIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
L+ GS+ + + + ++ ++ ++ +YD+ N R+G A C+
Sbjct: 398 LLQQ--GSVDNNVLWCIGWQRSQGITILGDLVLKDKIFVYDLANMRMGWADYDCS 450
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 170/390 (43%), Gaps = 56/390 (14%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
+P+ T++ Y R IGTPA+ + +DT +D WV C C GC + T
Sbjct: 76 LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELT 135
Query: 143 TFKNLGCQAAQ---CKQ---VPN-----PTCGGGA-CAFNLTYGSSTIAAN------LSQ 184
+ G Q+ + C Q V N P+C + C ++++YG + A L
Sbjct: 136 MYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQY 195
Query: 185 DTISLATDIVPG---YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQS 235
+ +S P +FGC K G+ ++ G+LG G+ + S+L+Q +
Sbjct: 196 NQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRK 255
Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
F++CL + G +G + QPK +K TPL+ + Y V L I VG + +
Sbjct: 256 MFAHCLDTVNG---GGIFAIGNVVQPK-VKTTPLVPDMPH---YNVILKGIDVGGTALGL 308
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY- 353
P F+ GTIIDSGT + Y A+ VF + +++V +L F +C+
Sbjct: 309 PTNI--FDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKH--QDISVQTLQDF-SCFQ 363
Query: 354 ---SVPIVAPTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM----AAAPDNVNSVL 403
SV P +T F G +V+L P D L ++ C+ D + VL
Sbjct: 364 YSGSVDDGFPEVTFHFEG-DVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVL 420
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ ++ N +LYD+ N +G A C+
Sbjct: 421 --LGDLVLSNKLVLYDLENQAIGWADYNCS 448
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 159/372 (42%), Gaps = 45/372 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y R K+G+P + + +DT +D WV C+ C GC S+ FN S+T +
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 149 CQAAQCK---QVPNPTC---GGGACAFNLTYGS-STIAANLSQDTISLATDIVPG----- 196
C +C Q C C + TYG S + DT+ D V G
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYF--DSVMGNEQTA 208
Query: 197 -----YTFGCIQKATGNSVPP----QGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFK 245
FGC +G+ G+ G G+ LS+++Q +L S FS+CL
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG-- 266
Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
+ + G L LG I +P + YTPL+ + Y +NL +I V + +P + F +
Sbjct: 267 SDNGGGILVLGEIVEPGLV-YTPLVPSQPH---YNLNLESIVVNGQ--KLPIDSSLFTTS 320
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGG--FDTCYSVPIVAPTI 362
GTI+DSGT L AY + V ++ ++ S G F T SV PT+
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTV 380
Query: 363 TLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
+L F G+ +T+ +N L+ + L N + ++ ++ ++ +YD+
Sbjct: 381 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 440
Query: 422 NSRLGVARELCT 433
N R+G C+
Sbjct: 441 NMRMGWTDYDCS 452
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 163/364 (44%), Gaps = 60/364 (16%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFKNLGCQA 151
+GTP T L+A+DT +D WVPC C+ C+ V++ AQSTT + + C +
Sbjct: 82 LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCSS 140
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQ 203
C +C +++ Y S +++ L +D + L +D + FGC Q
Sbjct: 141 NLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQ 200
Query: 204 KATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
TG+ S P GLLGLG S S+ L ++ L ++FS C G + G
Sbjct: 201 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGRINFGDT 256
Query: 259 GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
G + K TPL N + + YY + + I VG + + +T I+DSGT
Sbjct: 257 GSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAIVDSGTS 302
Query: 318 FTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVT 372
FT L P YT + F ++ S N+ +S+ F+ CYSV IV P ++L G ++
Sbjct: 303 FTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTAKGGSI- 360
Query: 373 LPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
P ++ +I T + CLA+ + +N+I ++++D LG
Sbjct: 361 FPVNDPIITITDNAFNPVGYCLAIMKSEG-----VNLIGENFMSGLKVVFDRERMVLGWK 415
Query: 429 RELC 432
C
Sbjct: 416 NFNC 419
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 148/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTP++T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 147/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSRGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 164/393 (41%), Gaps = 49/393 (12%)
Query: 69 LQFLSSLAVARKSVVPIASGRQITQSPTY------IVRAKIGTPAQTLLMAMDTSNDAAW 122
L+ L L+ K++ P QSP Y ++ IGTP + DT +D W
Sbjct: 46 LRRLMELSAMEKTLTP--------QSPIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTW 97
Query: 123 ---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCG-GGACAFNLTYGSSTI 178
VPC C + +F+ +STT++N+ C + C ++ C C + Y S+ I
Sbjct: 98 TSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAI 157
Query: 179 AAN-LSQDTISLAT---DIVP--GYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQN 231
L+Q+TI+L++ VP G FGC TG + G++GLG G +SL++Q +
Sbjct: 158 TRGVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGS 217
Query: 232 LYQST-FSYCLPSFKA-LSFSGSLRLGPIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIR 287
+ FS CL F +S S + G + K + TPL+ ++ Y+V LL I
Sbjct: 218 SFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTP-YFVTLLGIS 276
Query: 288 VGRRVVDIPPGALQFNPTT----GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSN-LT 342
V L FN ++ +DSGT T L Y V R V +T
Sbjct: 277 VENTY-------LHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVT 329
Query: 343 VTSLGGFDTCYSVP--IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVN 400
G CY + P +T F G +V L I G + CL N +
Sbjct: 330 DDPDLGPQLCYRTKNNLRGPVLTAHFEGADVKLSPTQTFISPKDG-VFCLGFT----NTS 384
Query: 401 SVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
S V N Q N+ I +D+ + + CT
Sbjct: 385 SDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDCT 417
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 163/364 (44%), Gaps = 60/364 (16%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFKNLGCQA 151
+GTP T L+A+DT +D WVPC C+ C+ V++ AQSTT + + C +
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCSS 163
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQ 203
C +C +++ Y S +++ L +D + L +D + FGC Q
Sbjct: 164 NLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQ 223
Query: 204 KATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
TG+ S P GLLGLG S S+ L ++ L ++FS C G + G
Sbjct: 224 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGRINFGDT 279
Query: 259 GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
G + K TPL N + + YY + + I VG + + +T I+DSGT
Sbjct: 280 GSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAIVDSGTS 325
Query: 318 FTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVT 372
FT L P YT + F ++ S N+ +S+ F+ CYSV IV P ++L G ++
Sbjct: 326 FTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTAKGGSI- 383
Query: 373 LPQDNLLIHSTAGSIT----CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
P ++ +I T + CLA+ + +N+I ++++D LG
Sbjct: 384 FPVNDPIITITDNAFNPVGYCLAIMKS-----EGVNLIGENFMSGLKVVFDRERMVLGWK 438
Query: 429 RELC 432
C
Sbjct: 439 NFNC 442
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/362 (25%), Positives = 149/362 (41%), Gaps = 52/362 (14%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------------TVFNSAQSTTFKNLG 148
IGTP+ + L+A+D+ +D W+PC CV C+ F+ + STT K
Sbjct: 103 IGTPSVSFLVALDSGSDLLWIPCN-CVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFP 161
Query: 149 CQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLA------TDIVPGYTFG 200
C C+ P C + +TY S +++ L +D + LA + + G
Sbjct: 162 CSHKLCESAPACESPKEQCPYTVTYASENTSSSGLLVEDVLHLAYSANASSSVKARVVVG 221
Query: 201 CIQKATGN---SVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSGSLR 254
C +K +G + P G++GLG G +S+ LA+ L +++FS C SG +
Sbjct: 222 CGEKQSGEFLKGIAPDGVMGLGPGEISVPSFLAKA-GLMRNSFSMCFDEED----SGRIY 276
Query: 255 LGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
G +G P + R Y +A VG V + L+ + T T+IDS
Sbjct: 277 FGDVG--------PSTQQSTRFLPYKNEFVAYFVGVEVCCVGNSCLKQSSFT---TLIDS 325
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV--APTITLMFSGMNVT 372
G FT L Y V + + + G ++ CY P I L FS N
Sbjct: 326 GQSFTFLPEEIYREVALEIDSHINATVKKIEGGPWEYCYETSFEPKVPAIKLKFSSNNTF 385
Query: 373 LPQDNLLIHSTAGSIT--CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARE 430
+ L + + + CL ++A+ + VI +RI++D N +LG +
Sbjct: 386 VIHKPLFVLQRSEGLVQFCLPISASEEGTG---GVIGQNYMAGYRIVFDRENMKLGWSAS 442
Query: 431 LC 432
C
Sbjct: 443 KC 444
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 113/418 (27%), Positives = 174/418 (41%), Gaps = 51/418 (12%)
Query: 59 LEMLAKDQARLQ--FLSSLAVARKSV---------VPIASGRQITQSPTYIVRAKIGTPA 107
L A+D AR S LA R+ +P++SG T + Y VR ++GTPA
Sbjct: 57 LGERARDDARRHAYIRSQLASRRRRAADVGASAFAMPLSSG-AYTGTGQYFVRFRVGTPA 115
Query: 108 QTLLMAMDTSNDAAWVPCTGCVGCSST-----VFNSAQSTTFKNLGCQAAQCKQ-VP--- 158
Q ++ DT +D WV C G G ++ F +++S ++ L C + C VP
Sbjct: 116 QPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDTCTSYVPFSL 175
Query: 159 -NPTCGGGACAFNLTYGSSTIAANL---SQDTISLATDI-------------VPGYTFGC 201
N + CA++ Y + A + TI+L+ + G GC
Sbjct: 176 ANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRAKLQGVVLGC 235
Query: 202 IQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS---LRLGP 257
G S G+L LG ++S ++ + FSYCL A + S GP
Sbjct: 236 TATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNASSYLTFGPGP 295
Query: 258 IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
G TPL+ + R S Y V + A+ V +DIP A ++ G G I+DSGT
Sbjct: 296 EGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIP--ADVWDVGRGGGAILDSGTS 353
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTI---TLMFSGMNVTLP 374
T L PAY AV R+ + L ++ F+ CY+ AP I + F+G P
Sbjct: 354 LTVLATPAYRAVVAALGGRLAA-LPRVAMDPFEYCYNWTAGAPEIPKLEVSFAGSARLEP 412
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ A + C+ + + ++VI N+ QQ H +D+ + L C
Sbjct: 413 PAKSYVIDAAPGVKCIGVQ---EGAWPGVSVIGNILQQEHLWEFDLRDRWLRFKHTRC 467
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 166/385 (43%), Gaps = 50/385 (12%)
Query: 83 VPIASG----RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG-----CVGCSS 133
VP A G + IT+S Y++ +GTP +L DT +D WV C+ +
Sbjct: 82 VPEADGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGA 141
Query: 134 TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGA-CAFNLTY--GSSTIAANLSQDTISLA 190
VF+ ++STT+ L CQ+A C+ + +C + C + Y GS TI LS +T S A
Sbjct: 142 VVFHPSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGV-LSTETFSFA 200
Query: 191 TDI--------VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYC 240
VP +FGC + G S GL+GLG G+LSL++Q FSYC
Sbjct: 201 AAGGGGEGQVRVPRVSFGCSTGSAG-SFRSDGLVGLGAGALSLVSQLGAAARIARRFSYC 259
Query: 241 L-PSFKALSFSGSLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
L P + A + S +L G + P TPL+ + S Y V L ++ V + V
Sbjct: 260 LVPPYAAANSSSTLSFGARAVVSDPGAAS-TPLVPS-EVDSYYTVALESVAVAGQDV--- 314
Query: 297 PGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV- 355
+ I+DSGT T L + RR+ CY V
Sbjct: 315 ------ASANSSRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQ 368
Query: 356 ------PIVAPTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIAN 408
P +TL F G +VTL +N G++ CL + P + + ++++ N
Sbjct: 369 GKSQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEEGTL-CLVL--VPVSESQPVSILGN 425
Query: 409 MQQQNHRILYDVPNSRLGVARELCT 433
+ QQN + YD+ + A CT
Sbjct: 426 IAQQNFHVGYDLDARTVTFAAVDCT 450
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 157/375 (41%), Gaps = 62/375 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-------------TVFNSAQSTT 143
+ +GTP L+A+DT +D W+PC C+ C ++ +S+T
Sbjct: 105 HFANVSVGTPPLWFLVALDTGSDLFWLPCD-CISCVHGGLRTRTGKILKFNTYDLDKSST 163
Query: 144 FKNLGCQAAQ-CKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT------DIV 194
+ C + C+Q G C + + Y S+ ++ + +D + L T D
Sbjct: 164 SNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHLITDDDQTKDAD 223
Query: 195 PGYTFGCIQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSF 249
FGC Q TG N P GL GLG ++S+ + + L ++FS C S A
Sbjct: 224 TRIAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILAREGLISNSFSMCFGSDSA--- 280
Query: 250 SGSLRLGPIGQPKRIKYTPLLKNPRR-SSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
G + G G P + K TP N R+ Y + + I V V D L+F+
Sbjct: 281 -GRITFGDTGSPDQRK-TPF--NVRKLHPTYNITITKIIVEDSVAD-----LEFH----- 326
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRV----GSNLTVTSLGGFDTCYSVP----IVAP 360
I DSGT FT + PAYT + +++ +V S+ + S FD CY + I P
Sbjct: 327 -AIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQTIEVP 385
Query: 361 TITLMFSGMNVTLPQDNLLIHST--AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILY 418
+ L G + D ++ S+ G + CL + + +N+I ++I++
Sbjct: 386 FLNLTMKGGDDYYVMDPIIQVSSEEEGDLLCLGIQKS-----DSVNIIGQNFMTGYKIVF 440
Query: 419 DVPNSRLGVARELCT 433
D N LG C+
Sbjct: 441 DRDNMNLGWKETNCS 455
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 159/381 (41%), Gaps = 66/381 (17%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y + K+G PA+ + +DT +D WV C+ C GC + +F++ +S++ + L
Sbjct: 84 YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143
Query: 149 CQAAQCKQVPNPT----CGGGACAFNLTY---------------------GSSTIAANLS 183
C C V T C+++ Y G STIA S
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN--S 201
Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCL 241
TI I Y +G + +AT G+ G G+G S+++Q ++ + FS+CL
Sbjct: 202 SATIVFGCSI---YQYGDLTRATK---ALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
+ + G L LG I +P I Y+PL+ + Y + L +I + ++ P
Sbjct: 256 KGGE--NGGGILVLGEILEPS-IVYSPLIPSQPH---YTLKLQSIALSGQLF---PNPTM 306
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVPIV 358
F + TIIDSGT LV Y + V V + T T G F SV +
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVADI 366
Query: 359 APTITLMFSGMN--VTLPQDNLLIHSTA-----GSITCLAMAAAPDNVNSVLNVIANMQQ 411
P + F G+ V P++ L S S+ C+ A D LN++ ++
Sbjct: 367 FPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDG----LNILGDLVL 422
Query: 412 QNHRILYDVPNSRLGVARELC 432
++ I+YD+ R+G A C
Sbjct: 423 KDKIIVYDLAQQRIGWANYDC 443
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 146/332 (43%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTP++T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P ++FGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFSFGCNMDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP F +G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 50/387 (12%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
+P+ +++ Y + IGTP++ + +DT +D WV C GC +G T
Sbjct: 141 LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLT 200
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISL- 189
+++ STT +GC C P G G C +++ YG S+ QD +
Sbjct: 201 LYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYN 260
Query: 190 -------ATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQST 236
T FGC K +G S G+LG G+ + S+L+Q + +
Sbjct: 261 RISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKV 320
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
FS+CL + G +G + +PK + TPL++N + Y V + I VG +D+P
Sbjct: 321 FSHCLDNVDG---GGIFAIGEVVEPK-VNITPLVQN---QAHYNVVMKEIEVGGDPLDVP 373
Query: 297 PGALQFNPTTGAGTIIDSGT--------VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
A F GTIIDSGT V+ L+ + D+ V T
Sbjct: 374 SDA--FESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFTC----- 426
Query: 349 FDTCYSVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
FD +V PT+TL F +++T+ P + L + A L ++
Sbjct: 427 FDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLL 486
Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
++ N ++YD+ +G C+
Sbjct: 487 GDLVLSNKLVVYDLEKQGIGWVEYNCS 513
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 150/324 (46%), Gaps = 55/324 (16%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-----------TVFNSAQSTTFKNLGCQA 151
+GTP T L+A+DT +D WVPC C+ C+ V++ AQSTT + + C +
Sbjct: 41 LGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCSS 99
Query: 152 AQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQ 203
C +C +++ Y S +++ L +D + L +D + FGC Q
Sbjct: 100 NLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQ 159
Query: 204 KATGN---SVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
TG+ S P GLLGLG S S+ L ++ L ++FS C G + G
Sbjct: 160 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF----GDDGHGRINFGDT 215
Query: 259 GQPKRIKYTPLLKNPRRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV 317
G + K TPL N + + YY + + I VG + + +T I+DSGT
Sbjct: 216 GSSDQ-KETPL--NVYKQNPYYNITITGITVGSKSI-----------STEFSAIVDSGTS 261
Query: 318 FTRLVAPAYTAVRDVFRRRVGS--NLTVTSLGGFDTCYSVP---IVAPTITLMFSGMNVT 372
FT L P YT + F ++ S N+ +S+ F+ CYSV IV P ++L G ++
Sbjct: 262 FTALSDPMYTQITSSFDAQIRSSRNMLDSSM-PFEFCYSVSANGIVHPNVSLTAKGGSI- 319
Query: 373 LPQDNLLIHSTAGSIT----CLAM 392
P ++ +I T + CLA+
Sbjct: 320 FPVNDPIITITDNAFNPVGYCLAI 343
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 147/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y+ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSSGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 167/361 (46%), Gaps = 56/361 (15%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---------STVFNSAQSTTFKNLGCQAAQ 153
+GTP QT ++A+DT +D W+PC C GC+ ++ + + S+T + + C +
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQF 180
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT-DIVP-----GYTFGCIQKA 205
C ++ C + + Y S+ +++ L +D + L+T D +P FGC Q
Sbjct: 181 C-ELRKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCGQVQ 239
Query: 206 TG---NSVPPQGLLGLGRGSL---SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
TG ++ P GL GLG + S+LAQ + L ++F+ C + G + G G
Sbjct: 240 TGSFLDAAAPNGLFGLGIDMISIPSILAQ-KGLTSNSFAMCF----SRDGIGRISFGDQG 294
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
+ + TPL NP+ + Y +++ I VG + D L+F+ TI D+GT FT
Sbjct: 295 SSDQ-EETPLDVNPQHPT-YTISISEITVGNSLTD-----LEFS------TIFDTGTSFT 341
Query: 320 RLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNV-- 371
L PAYT + F +V +N S F+ CY + I P+I+L G +V
Sbjct: 342 YLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
+ + ++ + CLA+ + + LN+I R+++D LG +
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKS-----AKLNIIGQNFMTGLRVVFDRERKILGWKKFN 456
Query: 432 C 432
C
Sbjct: 457 C 457
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 167/361 (46%), Gaps = 56/361 (15%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---------STVFNSAQSTTFKNLGCQAAQ 153
+GTP QT ++A+DT +D W+PC C GC+ ++ + + S+T + + C +
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQF 180
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT-DIVP-----GYTFGCIQKA 205
C ++ C + + Y S+ +++ L +D + L+T D +P FGC Q
Sbjct: 181 C-ELRKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCGQVQ 239
Query: 206 TG---NSVPPQGLLGLGRGSL---SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
TG ++ P GL GLG + S+LAQ + L ++F+ C + G + G G
Sbjct: 240 TGSFLDAAAPNGLFGLGIDMISIPSILAQ-KGLTSNSFAMCF----SRDGIGRISFGDQG 294
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
+ + TPL NP+ + Y +++ I VG + D L+F+ TI D+GT FT
Sbjct: 295 SSDQ-EETPLDVNPQHPT-YTISISEITVGNSLTD-----LEFS------TIFDTGTSFT 341
Query: 320 RLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNV-- 371
L PAYT + F +V +N S F+ CY + I P+I+L G +V
Sbjct: 342 YLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
+ + ++ + CLA+ + + LN+I R+++D LG +
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKS-----AKLNIIGQNFMTGLRVVFDRERKILGWKKFN 456
Query: 432 C 432
C
Sbjct: 457 C 457
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 158/365 (43%), Gaps = 49/365 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y R IGTP+Q + +D+ + +VPC C C + F S+T+ + C
Sbjct: 91 YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCN-VD 149
Query: 154 CKQVPNPTCGG--GACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATG 207
C TC C + Y S+ + L +D +S +++ P FGC TG
Sbjct: 150 C------TCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETG 203
Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
+ S G++GLGRG LS++ Q + + +FS C G++ LG + P
Sbjct: 204 DLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV--GGGTMVLGGMPAPPD 261
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ ++ NP RS Y + L I V + + + P FN + GT++DSGT + L
Sbjct: 262 MVFS--HSNPVRSPYYNIELKEIHVAGKALRLDPKI--FN--SKHGTVLDSGTTYAYLPE 315
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------------VPIVAPTITLMF-SGM 369
A+ A +D +V S + + G D Y + V P + ++F +G
Sbjct: 316 QAFVAFKDAVTNKVNS---LKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 372
Query: 370 NVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
++L P++ L HS CL + + ++L I +N + YD N ++G
Sbjct: 373 KLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKIGFW 429
Query: 429 RELCT 433
+ C+
Sbjct: 430 KTNCS 434
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 162/392 (41%), Gaps = 78/392 (19%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSSTVFNSAQSTT 143
T + Y ++GTP + + +DT +D WV C C +G T+++ S+T
Sbjct: 83 TDTGLYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASST 142
Query: 144 FKNLGCQAAQCKQVPN---PTCGGGA-CAFNLTY--GSSTIAANLSQDTISLATDIVPG- 196
+ C C P C C +++TY GSST+ + ++ +L D V G
Sbjct: 143 GSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVND---ALQFDQVTGD 199
Query: 197 ---------YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCL 241
FGC + G+ S G+LG G + S+L+Q T + F++CL
Sbjct: 200 GQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCL 259
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
+ K G +G + QPK +K TPL+ + Y VNL I VG +++P A
Sbjct: 260 DTIKG---GGIFAIGDVVQPK-VKTTPLVADKPH---YNVNLKTIDVGGTTLELP--ADI 310
Query: 302 FNPTTGAGTIIDSGT--------VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY 353
F P GTIIDSGT VF +++ + +D+ V L G D +
Sbjct: 311 FKPGEKRGTIIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQDFLCFEYSGSVDDGF 370
Query: 354 SVPIVAPTITLMFSGMNVTLPQDNLLIH--------STAGSITCLAMA----AAPDNVNS 401
PT+T F +D+L +H + C+ + D +
Sbjct: 371 ------PTLTFHF--------EDDLALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDI 416
Query: 402 VLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
VL + ++ N ++YD+ N +G C+
Sbjct: 417 VL--MGDLVLSNKLVVYDLENRVIGWTDYNCS 446
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 158/387 (40%), Gaps = 50/387 (12%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
+P+ +++ Y + IGTP++ + +DT +D WV C GC +G T
Sbjct: 60 LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLT 119
Query: 135 VFNSAQSTTFKNLGCQAAQCKQVPNPTCG---GGACAFNLTYGS-STIAANLSQDTISL- 189
+++ STT +GC C P G G C +++ YG S+ QD +
Sbjct: 120 LYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYN 179
Query: 190 -------ATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQST 236
T FGC K +G S G+LG G+ + S+L+Q + +
Sbjct: 180 RISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKV 239
Query: 237 FSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIP 296
FS+CL + G +G + +PK + TPL++N Y V + I VG +D+P
Sbjct: 240 FSHCLDNVDG---GGIFAIGEVVEPK-VNITPLVQNQAH---YNVVMKEIEVGGDPLDVP 292
Query: 297 PGALQFNPTTGAGTIIDSGT--------VFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG 348
A F GTIIDSGT V+ L+ + D+ V T
Sbjct: 293 SDA--FESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFTC----- 345
Query: 349 FDTCYSVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI 406
FD +V PT+TL F +++T+ P + L + A L ++
Sbjct: 346 FDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLL 405
Query: 407 ANMQQQNHRILYDVPNSRLGVARELCT 433
++ N ++YD+ +G C+
Sbjct: 406 GDLVLSNKLVVYDLEKQGIGWVEYNCS 432
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 163/383 (42%), Gaps = 40/383 (10%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-----VFN 137
+P++SG T + Y VR ++GTPAQ ++ DT +D WV C G G ++ F
Sbjct: 1 MPLSSG-AYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFR 59
Query: 138 SAQSTTFKNLGCQAAQCKQ-VP----NPTCGGGACAFNLTYGSSTIAANL---SQDTISL 189
+++S ++ L C + C VP N + CA++ Y + A + TI+L
Sbjct: 60 ASESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIAL 119
Query: 190 ATDI-------------VPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQS 235
+ + G GC G S G+L LG ++S ++ +
Sbjct: 120 SGSGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGG 179
Query: 236 TFSYCLPSFKALSFSGS---LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRV 292
FSYCL A + S GP G TPL+ + R S Y V + A+ V
Sbjct: 180 RFSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEA 239
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTC 352
+DIP A ++ G G I+DSGT T L PAY AV R+ + L ++ F+ C
Sbjct: 240 LDIP--ADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAA-LPRVAMDPFEYC 296
Query: 353 YSVPIVAPTI---TLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANM 409
Y+ AP I + F+G P + A + C+ + + ++VI N+
Sbjct: 297 YNWTAGAPEIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQ---EGAWPGVSVIGNI 353
Query: 410 QQQNHRILYDVPNSRLGVARELC 432
QQ H +D+ + L C
Sbjct: 354 LQQEHLWEFDLRDRWLRFKHTRC 376
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 147/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y+ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGIHGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 56/169 (33%), Positives = 83/169 (49%), Gaps = 9/169 (5%)
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L +NP+ + YYV L+ I VG ++ IP + + + G I+DSGT TRL + Y
Sbjct: 1 LRRNPQLDTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNV 60
Query: 329 VRDVFRRRVGSNLTVTSLGGFDTCYSV----PIVAPTITLMF-SGMNVTLPQDNLLIHST 383
VRD F + L + FDTCY + + PT+ F G + LP N L+
Sbjct: 61 VRDAFVKGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVD 120
Query: 384 AGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ C A A S L++I N+QQQ R+ +D+ NS +G + C
Sbjct: 121 SVGTFCFAFAPTM----SSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 159/378 (42%), Gaps = 63/378 (16%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLG 148
Y + K+G PA+ + +DT +D WV C+ C GC + +F++ +S++ + L
Sbjct: 84 YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143
Query: 149 CQAAQCKQVPNPT----CGGGACAFNLTY---------------------GSSTIAANLS 183
C C V T C+++ Y G STIA S
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN--S 201
Query: 184 QDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCL 241
TI I Y +G + +AT G+ G G+G S+++Q ++ + FS+CL
Sbjct: 202 SATIVFGCSI---YQYGDLTRATK---ALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255
Query: 242 PSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQ 301
+ + G L LG I +P I Y+PL+ + Y + L +I + ++ P
Sbjct: 256 KGGE--NGGGILVLGEILEPS-IVYSPLIPSQPH---YTLKLQSIALSGQLF---PNPTM 306
Query: 302 FNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVPIV 358
F + TIIDSGT LV Y + V V + T T G F SV +
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVADI 366
Query: 359 APTITLMFSGMN--VTLPQDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNH 414
P + F G+ V P++ L S ++ C+ A D LN++ ++ ++
Sbjct: 367 FPVLRFNFEGIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDG----LNILGDLVLKDK 422
Query: 415 RILYDVPNSRLGVARELC 432
I+YD+ R+G A C
Sbjct: 423 IIVYDLARQRIGWANYDC 440
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 108/418 (25%), Positives = 169/418 (40%), Gaps = 48/418 (11%)
Query: 51 PLSWEESVLEMLAKDQARLQFLSSLAVARKSV-VPIASGRQITQSPTYIVRAKIGTPAQT 109
PLS E +++ DQ R +S + V + + SG + Y ++GTPA+
Sbjct: 45 PLSRIE---DIIGADQKRHSLISRKRKFKGGVKMDLGSGIDY-GTAQYFTEVRVGTPAKK 100
Query: 110 LLMAMDTSNDAAWVPC------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCK-------- 155
+ +DT ++ WV C G V + VF + +S +FK +GC CK
Sbjct: 101 FRVVVDTGSELTWVNCRYRGRGKGKVK-NRRVFRAEESKSFKTVGCFTQTCKVDLMNLFS 159
Query: 156 --QVPNPTCGGGACAFNLTYGSSTIAANL-SQDTISLA-----TDIVPGYTFGCIQKATG 207
P P+ C+++ Y + A + +++TI++ + G GC +G
Sbjct: 160 LSTCPTPST---PCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSSFSG 216
Query: 208 NSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFSGSLRLGPIGQPKRIK 265
S G+LGL S + +L+ + SYCL + S L G K
Sbjct: 217 QSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTK 276
Query: 266 YTPLLKNPRRSSL----YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
P P +L Y +N++ I +G ++DIP ++ TTG GTI+DSGT T L
Sbjct: 277 TAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQV--WDATTGGGTILDSGTSLTLL 334
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLG-GFDTCYSV-----PIVAPTITLMFSGMNVTLPQ 375
AY V R + V G + C+S P +T G P
Sbjct: 335 AEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNESKLPQLTFHLKGGARFEPH 394
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ A + CL +A NV+ N+ QQN+ +D+ S L A CT
Sbjct: 395 RKSYLVDAAPGVKCLGFMSAG---TPATNVVGNIMQQNYLWEFDLMASTLSFAPSTCT 449
>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 110/430 (25%), Positives = 168/430 (39%), Gaps = 93/430 (21%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCT--GCVGCSSTVFNSAQ 140
+PI+SG Y++ +G+ + + + +DT +D W PC C+ C S +
Sbjct: 75 LPISSGSD------YLISLSVGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSP 128
Query: 141 STTFKNLG----------------------CQAAQCKQVPNPTCGGGACA--------FN 170
++ + C + C P G C F
Sbjct: 129 PSSLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNC---PLDFIETGDCNTSSYPCPPFY 185
Query: 171 LTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ-- 228
YG ++ A L D++SL + V +TFGC P G+ G GRG LSL AQ
Sbjct: 186 YAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAE---PIGVAGFGRGRLSLPAQLA 242
Query: 229 -TQNLYQSTFSYCLPSFKALSFSGSLRLGPI-------GQPKRIK--------------- 265
++FSYCL S + R P+ + KR+
Sbjct: 243 VHSPHLGNSFSYCLVS-HSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKK 301
Query: 266 ----YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+T +L+NP+ Y V+L I +G+R + P + + G G ++DSGT FT L
Sbjct: 302 NEFVFTEMLENPKHPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTML 361
Query: 322 VAPAYTAVRDVFRRRVGSNLT----VTSLGGFDTCYSV--PIVAPTITLMFSG--MNVTL 373
A Y +V + F RVG V G CY + + P + L F+G +VTL
Sbjct: 362 PAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTL 421
Query: 374 PQDNLLIHSTAG--------SITCLAMAAAPDNVN---SVLNVIANMQQQNHRILYDVPN 422
P+ N G I CL + D ++ N QQQ ++YD+ N
Sbjct: 422 PRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLN 481
Query: 423 SRLGVARELC 432
R+G A+ C
Sbjct: 482 RRVGFAKRKC 491
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/390 (24%), Positives = 157/390 (40%), Gaps = 41/390 (10%)
Query: 68 RLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTG 127
R + L+ A S VPI R + + IGTP Q +D + + W C+
Sbjct: 42 RGRLLADATPAGGSAVPIHWSRHLYN----VANFTIGTPPQPASAIIDVAGELVWTQCSM 97
Query: 128 CVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANL-- 182
C C +F S+TF+ C CK +P C C + T S L
Sbjct: 98 CSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPTSNCSSNMCTYEGTINSKLGGHTLGI 157
Query: 183 -SQDTISLATDIVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYC 240
+ DT ++ T FGC+ + +++ P GL+GLGR SL++Q + + FSYC
Sbjct: 158 VATDTFAIGTATA-SLGFGCVVASGIDTMGGPSGLIGLGRAPSSLVSQ---MNITKFSYC 213
Query: 241 LPSFKA-----LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
L + L S +L G + S Y + L I+ G + +
Sbjct: 214 LTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIAL 273
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-- 353
PP +G ++ + + LV AY A++ + VG+ T T L FD C+
Sbjct: 274 PP--------SGNTVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPK 325
Query: 354 ------SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAP----DNVNSVL 403
S P + T + + V P+ + + G++ C+A+ + ++ L
Sbjct: 326 AGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTV-CMAILSTSWLNTTALDENL 384
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELCT 433
N++ ++QQ+N L D+ L C+
Sbjct: 385 NILGSLQQENTHFLLDLEKKTLSFEPADCS 414
>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
Length = 378
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 143/350 (40%), Gaps = 53/350 (15%)
Query: 131 CSSTVFNSAQSTTFKNLGCQAAQC--KQVPNPTCGGG-ACA-FNLTYGSSTIAANLSQDT 186
C+S + ++A ++ + C A+C + + +CG AC YG ++ A+L +
Sbjct: 26 CASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGR 85
Query: 187 ISLATDI-------VPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSY 239
++L V +TF C A G P G+ G GRG LSL Q FSY
Sbjct: 86 VALGAGARASVAVAVDNFTFACAHTALGE---PVGVAGFGRGPLSLPGQLSPQLSGRFSY 142
Query: 240 CL--PSFKALSFSGSLRLGPI------------GQPKRIKYTPLLKNPRRSSLYYVNLLA 285
CL SF+A +R P+ + YTPLL NP+ Y V L A
Sbjct: 143 CLVSHSFRADRL---IRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEA 199
Query: 286 IRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT--- 342
+ VG + P + + G ++DSGT FT L Y V + F R + +
Sbjct: 200 VSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARA 259
Query: 343 --VTSLGGFDTCYSVPIV---APTITLMFSG-MNVTLPQDNLLI-----HSTAGS----I 387
G CY P + L F G V LP+ N + + AG+ +
Sbjct: 260 ERAEEQTGLTPCYRYAASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDV 319
Query: 388 TCLAMA----AAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
CL + A+ + + + N QQQ ++YDV R+G AR CT
Sbjct: 320 GCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 369
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 164/380 (43%), Gaps = 51/380 (13%)
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTT 143
+Q Y + K+GTP + + +DT +D WV C C GC T F+ S+T
Sbjct: 72 SQVGLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSST 131
Query: 144 FKNLGCQAAQCK---QVPNPTCG--GGACAFNLTYGSSTIAANLSQDTI---------SL 189
+ C +C+ Q + +C C + YG + + + +L
Sbjct: 132 SSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTL 191
Query: 190 ATDIVPGYTFGCIQKATGNSVPPQ----GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPS 243
T+ FGC TG+ + G+ G G+ +S+++Q Q + FS+CL
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKG 251
Query: 244 FKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
S G L LG I +P I Y+PL+++ Y +NL +I V ++V I P F
Sbjct: 252 DN--SGGGVLVLGEIVEPN-IVYSPLVQSQPH---YNLNLQSISVNGQIVPIAPAV--FA 303
Query: 304 PTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNL-TVTSLGGFDTCYSVPI----- 357
+ GTI+DSGT L AY + V ++ +V S G + CY +
Sbjct: 304 TSNNRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRG--NQCYLITTSSNVD 361
Query: 358 VAPTITLMFSGMN--VTLPQDNLLIHST--AGSITCLAMAAAPDNVNSVLNVIANMQQQN 413
+ P ++L F+G V PQD L+ + GS+ C+ P + ++ ++ ++
Sbjct: 362 IFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIP---GQSITILGDLVLKD 418
Query: 414 HRILYDVPNSRLGVARELCT 433
+YD+ R+G A C+
Sbjct: 419 KIFVYDLAGQRIGWANYDCS 438
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/367 (26%), Positives = 155/367 (42%), Gaps = 53/367 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFK----NLGC 149
Y R IGTP Q + +DT + +VPC+ C C F S+T++ N+ C
Sbjct: 13 YTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCNIDC 72
Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKA 205
KQ C + Y ST + L +D IS + + P FGC
Sbjct: 73 NCDDEKQ---------QCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENME 123
Query: 206 TGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
TG+ S G++G+GRG LS++ + + +FS C G++ LG I P
Sbjct: 124 TGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCY--GGMGIGGGAMVLGGISPP 181
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT---GAGTIIDSGTVF 318
+ ++ +P RS Y ++L I V + L NPT GTI+DSGT +
Sbjct: 182 SNMVFSQ--SDPVRSPYYNIDLKEIHVAGK-------PLPLNPTVFDGKHGTILDSGTTY 232
Query: 319 TRLVAPAYTAVRDVFRRRVGSNLTVT--SLGGFDTCYS--------VPIVAPTITLMF-S 367
L A+ + +D + + S + D C+S + P + ++F +
Sbjct: 233 AYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGN 292
Query: 368 GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLG 426
G + L P++ L HS CL + + ++L I +N +LYD NS++G
Sbjct: 293 GQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIV---VRNTLVLYDRENSKIG 349
Query: 427 VARELCT 433
+ C+
Sbjct: 350 FWKTNCS 356
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 147/332 (44%), Gaps = 37/332 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y+ +GTP++T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +P +TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS--FS---GSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP K+ FS G LG +
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSLGKVAT 178
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT- 319
++YT ++ + + L++V+L AI V + + P + G + DSG+ +
Sbjct: 179 RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRKGVVFDSGSELSY 233
Query: 320 ---RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGMNV 371
R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 IPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGARF 288
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 DLGSRGVFVERSVQEQDVWCLAFAPTESVSII 320
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 108/429 (25%), Positives = 186/429 (43%), Gaps = 42/429 (9%)
Query: 29 DHSSTLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASG 88
D +L + H SP SP ++ + ++ +R+ + AV S
Sbjct: 31 DPGFSLNLIHRDSPLSPLYNPNHTDFDR-LRNAFSRSISRVNVFKTKAVDINSF----QN 85
Query: 89 RQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWV---PCTGCVGCSSTVFNSAQSTTFK 145
+ Y ++ IGTP +++ DT +D WV PC C S +F+ ++S++++
Sbjct: 86 DLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYR 145
Query: 146 NLGCQAAQCK--QVPNPTC--GGGACAFNLTYGSSTIA-ANLSQDTISL-ATDIVPGY-- 197
++ C + C V C C ++ +YG + NL+ + ++ +T P +
Sbjct: 146 HMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLS 205
Query: 198 --TFGCIQKATGNSVPPQGL----LGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKALSFS 250
FGC TGN L +GLG G+LSL++Q ++ + FSYCL P + + +
Sbjct: 206 PIVFGC---GTGNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVT 262
Query: 251 GSLRLGP---IGQPKRIKYTPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTT 306
++ G I P+ + TPL+ K P + YYV L AI VG + + G L N
Sbjct: 263 SKIKFGTDSVISGPQVVS-TPLVSKQP--DTYYYVTLEAISVGNKRLPYTNGLLNGNVEK 319
Query: 307 GAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY--SVPIVAPTITL 364
G IIDSGT T L + +T + V V + G F C+ + I P I +
Sbjct: 320 G-NVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVCFRSAGDIDLPVIAV 378
Query: 365 MFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
F+ +V L N + + + C M ++ + + + N+ Q + + YD+
Sbjct: 379 HFNDADVKLQPLNTFVKADE-DLLCFTMISS-----NQIGIFGNLAQMDFLVGYDLEKRT 432
Query: 425 LGVARELCT 433
+ CT
Sbjct: 433 VSFKPTDCT 441
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/334 (25%), Positives = 147/334 (44%), Gaps = 39/334 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTPA+T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +PG+TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCNMDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP F +G LG
Sbjct: 120 ANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGGKIA 178
Query: 261 PKR--IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
R ++YT ++ + + L++V+L AI V + + P + G + DSG+
Sbjct: 179 ATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSEL 233
Query: 319 T----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGM 369
+ R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 SYIPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGA 288
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 RFDLGSHGVFVERSVQEQDVWCLAFAPTESVSII 322
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 103/428 (24%), Positives = 174/428 (40%), Gaps = 53/428 (12%)
Query: 33 TLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
++ + H SP SP + PS E+ E L + R S +++ + P S
Sbjct: 36 SIDLIHRDSPKSPLYNPS------ETPAERLDRFFRRFMSFSEASISPNTPEPPVS---- 85
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLG 148
+ + Y+++ IGTP + DT +D W C C+ C + +F+ ++ST+FK +
Sbjct: 86 SNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVS 145
Query: 149 CQAAQCKQVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATD-----IVPGYTFG 200
C++ QC+ + +C C F+ YG ++A ++ +T++L ++ + FG
Sbjct: 146 CESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFG 205
Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKAL-SFSGSLRLG 256
C +G + GL G G LSL +Q + S FS CL F+ S + + G
Sbjct: 206 CGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFG 265
Query: 257 PIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
P + + TPL+ + Y+V L I VG ++ P + + +
Sbjct: 266 PEAEVSGSXVVSTPLVTK-DDPTYYFVTLDGISVGDKLF----------PFSSSSPMATK 314
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT-------CY--SVPIVAPTITLM 365
G VF P RD + R V + CY + I P +T
Sbjct: 315 GNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAH 374
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
F G +V L N I G + C AM ++ + N Q N I +D+ ++
Sbjct: 375 FDGADVQLKPLNTFISPKEG-VYCFAMQP----IDGDTGIFGNFVQMNFLIGFDLDGKKV 429
Query: 426 GVARELCT 433
CT
Sbjct: 430 SFKAVDCT 437
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 157/358 (43%), Gaps = 41/358 (11%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGC-VGCSST------VFNSAQSTTFKNLGCQAAQCK 155
+GTPA L+ +DT + +WV C C V C + FN++ S+T++ +GC A C
Sbjct: 29 LGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRVGCSAQVCH 88
Query: 156 QV---PNPTCG----GGACAFNLTYGSSTIAAN-LSQDTISLATDI-VPGYTFGC--IQK 204
+ N G +C ++L Y S +A LSQD ++LA + + FGC +
Sbjct: 89 DMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQDRLTLANSYSIQKFIFGCGSDNR 148
Query: 205 ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQ-STFSYCLPSFKALSFSGSLRLGP-IGQPK 262
G+S G++G G S S Q L S FSYC PS + G L +GP +
Sbjct: 149 YNGHSA---GIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQEN--EGFLSIGPYVRDSN 203
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
++ T L +Y + + V G R+ PP T T++DSGTV T +
Sbjct: 204 KLILTQLFDYGAHLPVYALQQFDMMVNGMRLQVDPP------VYTTRMTVVDSGTVETFV 257
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-----SVPIVA-PTITLMFSGMNVTLPQ 375
++P + A+ + + + V + C+ SV P + + FS + LP
Sbjct: 258 LSPVFRALDRALTKAMVAEGYVRGSDSKEICFHSNGDSVDWSKLPVVEIKFSRSILKLPA 317
Query: 376 DNLLIHSTAGSITCLAMAAAPDNVNS-VLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+N+ + T+ C PD+ + ++ N ++ R+++D+ G C
Sbjct: 318 ENVFYYETSDGSICSTF--QPDDAGVPGVQILGNRATRSFRVVFDIQQRNFGFEAGAC 373
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 158/365 (43%), Gaps = 39/365 (10%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLG 148
Y +G P Q L + +DT +D WV C+ C C S +++N + S+T
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSS 142
Query: 149 CQAAQC---KQVPNPTCGGGACAFNLTY--GSSTIAANLSQD---TISLATDIVPGYTFG 200
C C + V + + ACA+ ++Y S++I A + D + FG
Sbjct: 143 CSDPLCTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVLQGGNATTSHIFFG 202
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
C TG S P G++G G+ S ++ Q TQ FS+CL K G L G
Sbjct: 203 CAINITG-SWPADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEK--HGGGILEFGEE 259
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF--NPTTGAGTIIDSGT 316
+ +TPLL ++ Y V+LL+I V +V+ I + N T G IIDSGT
Sbjct: 260 PNTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVIIDSGT 316
Query: 317 VFTRLVAPA----YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMN-V 371
F L A ++ ++++ ++G L + +V P +TL FSG + +
Sbjct: 317 SFALLATKANRILFSEIKNLTTAKLGPKLEGLQCFYLKSGLTVETSFPNVTLTFSGGSTM 376
Query: 372 TLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
L DN L+ + C A ++A L + + ++ + YDV N R+G
Sbjct: 377 KLKPDNYLVMVELKKKRNGYCYAWSSADG-----LTIFGEIVLKDKLVFYDVENRRIGWK 431
Query: 429 RELCT 433
+ C+
Sbjct: 432 GQNCS 436
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 167/361 (46%), Gaps = 56/361 (15%)
Query: 103 IGTPAQTLLMAMDTSNDAAWVPCTGCVGCS---------STVFNSAQSTTFKNLGCQAAQ 153
+GTP QT ++A+DT +D W+PC C GC+ ++ + + S+T + + C +
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQF 180
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLAT-DIVP-----GYTFGCIQKA 205
C ++ C + + Y S+ +++ L +D + L+T D +P FGC Q
Sbjct: 181 C-ELRKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCGQVQ 239
Query: 206 TG---NSVPPQGLLGLGRGSL---SLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG 259
TG ++ P GL GLG + S+LAQ + L ++F+ C + G + G G
Sbjct: 240 TGSFLDAAAPNGLFGLGIDMISIPSILAQ-KGLTSNSFAMCF----SRDGIGRISFGDQG 294
Query: 260 QPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFT 319
+ + TPL NP+ + Y +++ + VG + D L+F+ TI D+GT FT
Sbjct: 295 SSDQ-EETPLDVNPQHPT-YTISISEMTVGNSLTD-----LEFS------TIFDTGTSFT 341
Query: 320 RLVAPAYTAVRDVFRRRVGSN-LTVTSLGGFDTCYSVP-----IVAPTITLMFSGMNV-- 371
L PAYT + F +V +N S F+ CY + I P+I+L G +V
Sbjct: 342 YLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401
Query: 372 TLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
+ + ++ + CLA+ + + LN+I R+++D LG +
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKS-----AKLNIIGQNFMTGLRVVFDRERKILGWKKFN 456
Query: 432 C 432
C
Sbjct: 457 C 457
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 114/465 (24%), Positives = 187/465 (40%), Gaps = 63/465 (13%)
Query: 9 LAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFS-PCSPFKPSK----PLSWEESVLEMLA 63
LA +F+ L C H T + H S P + S P EE +E A
Sbjct: 2 LASVFIIVSLLSLWECCQCHGHVYTFTMHHRHSEPVRKWSHSAAAGIPAPPEEGTVEYYA 61
Query: 64 KDQARLQFLSSLAVAR-KSVVPIASGRQITQSPT----YIVRAKIGTPAQTLLMAMDTSN 118
+ R + L +++ + + + G + + + +IGTP ++A+DT +
Sbjct: 62 ELADRDRLLRGRKLSQIDAGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDTGS 121
Query: 119 DAAWVP--CTGCVGCSST---------VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGAC 167
D WVP CT C ST V+N S+T K + C + C C
Sbjct: 122 DLFWVPCDCTRCAASDSTAFASDFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFSNC 181
Query: 168 AFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGCIQKATG---NSVPPQGLL 216
+ ++Y S+ + + L +D + L + + FGC Q +G + P GL
Sbjct: 182 PYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLF 241
Query: 217 GLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPR 274
GLG +S+ + + +FS C G + G G + + TP NP
Sbjct: 242 GLGMEKISVPSMLSREGFTADSFSMCF----GRDGIGRISFGDKGSFDQDE-TPFNLNPS 296
Query: 275 RSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
+ Y + + +RVG V+D+ AL DSGT FT LV P YT + + F
Sbjct: 297 HPT-YNITVTQVRVGTTVIDVEFTAL-----------FDSGTSFTYLVDPTYTRLTESFH 344
Query: 335 RRVGSNLTVT-SLGGFDTCYSVPIVA-----PTITLMFSGMNVTLPQDNLLIHSTAGS-I 387
+V + S F+ CY + A P+++L G + D ++I ST +
Sbjct: 345 SQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSELV 404
Query: 388 TCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
CLA+ + + LN+I +R+++D LG + C
Sbjct: 405 YCLAVVKSAE-----LNIIGQNFMTGYRVVFDREKLVLGWKKFDC 444
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 108/430 (25%), Positives = 185/430 (43%), Gaps = 71/430 (16%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQIT 92
T ++ H SP SPF + ++ + + + ++RL +L + ++ + ++
Sbjct: 9 TARLIHHDSPLSPFY-NHTMTDTARIEATVHRSRSRLNYLYYINKLSENALD----NDVS 63
Query: 93 QSPT-------YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-CS------STVFNS 138
SPT Y++ IG P+ ++ +DTSN WV C+ C C +T F S
Sbjct: 64 LSPTLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLS 123
Query: 139 AQSTTFKNLGCQAAQCKQVPN-PTCGGGA--CAFNLTYGSSTIAAN-LSQDTISLATD-- 192
++S T++ C + C + TC C + L YG + + LS D+ T
Sbjct: 124 SKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDG 183
Query: 193 --IVPGY-TFGCIQK-ATGNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALS 248
+ G+ FGC + TG+ G +GL + LSL++Q L FSYCL F L
Sbjct: 184 MLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQ---LGIKKFSYCLVPFNNLG 240
Query: 249 FSGSLRLGPI-----GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
+ + G + GQ TPLL S YYV +L I +G D P F+
Sbjct: 241 STSKMYFGSLPVTSGGQ------TPLLY--PNSDAYYVKVLGISIGN---DEPHFDGVFD 289
Query: 304 -PTTGAGTIIDSGTVFTRLVAPAYTA-------VRDVFRRRVGSNLTVTSLGGFDTCYSV 355
G IID+G ++ L A+ + ++D +R+ F+ C+ +
Sbjct: 290 VYEVRDGWIIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKER------FELCFEL 343
Query: 356 PIVA-----PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
P +T+ F G ++ L ++ + I CLA+ + S ++++ N Q
Sbjct: 344 QNANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRS----GSPVSILGNFQ 399
Query: 411 QQNHRILYDV 420
QN+ + YD+
Sbjct: 400 LQNYHVGYDL 409
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 139/316 (43%), Gaps = 45/316 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y R IGTP QT + +DT + +VPC+ C C F S+T++ + C
Sbjct: 90 YTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCNI-- 147
Query: 154 CKQVPNPTCGG--GACAFNLTYGS-STIAANLSQDTISLA--TDIVPGYT-FGCIQKATG 207
+ TC C + Y S+ + L +D IS +++VP FGC + TG
Sbjct: 148 -----DCTCDNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAIFGCENQETG 202
Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKR 263
+ S G++GLGRG LS++ Q + + +FS C G++ LG I P
Sbjct: 203 DLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDI--GGGAMILGGISPPSG 260
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ + +P RS Y ++L AI V + + + P GT++DSGT + L
Sbjct: 261 MVFAE--SDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGK----HGTVLDSGTTYAYLPE 314
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGG-----FDTCYS--------VPIVAPTITLMFS-GM 369
A+TA +D + + S + + G D C+S + P + ++FS G
Sbjct: 315 AAFTAFKDAMMKELTS---LKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVEMVFSNGQ 371
Query: 370 NVTLPQDNLLIHSTAG 385
++L +N L G
Sbjct: 372 KLSLSPENYLFQYYLG 387
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/334 (25%), Positives = 148/334 (44%), Gaps = 39/334 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTP++T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +PG+TFGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCNMDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP F +G LG
Sbjct: 120 ANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGGKIA 178
Query: 261 PKR--IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
R ++YT ++ + + L++V+L AI V + + P + G + DSG+
Sbjct: 179 ATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSEL 233
Query: 319 T----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGM 369
+ R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 SYIPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGA 288
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + + +A AP S++
Sbjct: 289 RFDLGRHGVFVERSVQEQDVWCLAFAPTESVSII 322
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 154/362 (42%), Gaps = 41/362 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y+ IGTP Q + +D + + W CT C C +F+ +S+TF+ L C +
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 154 CKQVPNPT--CGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGN 208
C+ +P + C C + + DT ++ FGC+ K
Sbjct: 117 CESIPESSRNCTSDVCIYEAPTKAGDTGGMAGTDTFAIGA-AKETLGFGCVVMTDKRLKT 175
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-------- 260
P G++GLGR SL+ Q + + FSYCL A SG+L LG +
Sbjct: 176 IGGPSGIVGLGRTPWSLVTQ---MNVTAFSYCL----AGKSSGALFLGATAKQLAGGKNS 228
Query: 261 --PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA-LQFNPTTGAGTIIDSGTV 317
P IK + + + Y V L I+ G GA LQ ++G+ ++D+ +
Sbjct: 229 STPFVIKTSAGSSDNGSNPYYMVKLAGIKAG--------GAPLQAASSSGSTVLLDTVSR 280
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFS---GMNVTLP 374
+ L AY A++ VG + +D C+S + L+F+ G +T+P
Sbjct: 281 ASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFSKAVAGDAPELVFTFDGGAALTVP 340
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVL---NVIANMQQQNHRILYDVPNSRLGVAREL 431
N L+ S G++ ++A N+ L +++ ++QQ+N +L+D+ L
Sbjct: 341 PANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPAD 400
Query: 432 CT 433
C+
Sbjct: 401 CS 402
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 84/162 (51%), Gaps = 11/162 (6%)
Query: 76 AVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV 135
A ARK+VV A + Y+V+ IGTP A+DT++D W C C GC V
Sbjct: 70 ASARKAVV--AETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQV 127
Query: 136 ---FNSAQSTTFKNLGCQAAQCKQVPNPTCGGG---ACAFNLTY-GSSTIAANLSQDTIS 188
FN S+T+ L C + C ++ CG +C + TY G++T L+ D +
Sbjct: 128 DPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLV 187
Query: 189 LATDIVPGYTFGCIQKATGNSVPPQ--GLLGLGRGSLSLLAQ 228
+ D G FGC +TG + PPQ G++GLGRG LSL++Q
Sbjct: 188 IGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQ 229
Score = 44.7 bits (104), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 52/131 (39%), Gaps = 10/131 (7%)
Query: 309 GTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVP-------IVAPT 361
G IID + T L A Y + + + S G D C+ +P + P
Sbjct: 236 GMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPA 295
Query: 362 ITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVP 421
+ L F G + L + L + CL + A S+L N QQQN ++LY++
Sbjct: 296 VALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSIL---GNFQQQNMQVLYNLR 352
Query: 422 NSRLGVARELC 432
R+ + C
Sbjct: 353 RGRVTFVQSPC 363
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 131/331 (39%), Gaps = 42/331 (12%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC------------TGCVG 130
+P+ S I Y+V + GTPA + +DT+ND W+ C T VG
Sbjct: 113 LPMRSALNIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVG 172
Query: 131 CSS-----------TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY------ 173
+ A+S++++ + C +C +P TC + A + +Y
Sbjct: 173 AGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQD 232
Query: 174 GSSTIAANLSQDTISLATD----IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQ 228
G+ T+ + +D +PG GC G SV G+L LG G +S
Sbjct: 233 GTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVH 292
Query: 229 TQNLYQSTFSYCLPSFKALSFSGS-LRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLL 284
+ FS+CL S + + S L GP + P ++ T ++ N Y +
Sbjct: 293 AAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME-TDIVYNVDVKPAYGPLVT 351
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
I VG +DIP G G I+D+ T T LV AY AV R + V
Sbjct: 352 GIFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVY 411
Query: 345 SLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
L GF+ CY + L NVT+P+
Sbjct: 412 ELDGFEYCYRWTFAGDGVDLAH---NVTVPR 439
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 105/402 (26%), Positives = 169/402 (42%), Gaps = 41/402 (10%)
Query: 55 EESVLEMLAKDQARLQFLSSLA-VARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMA 113
+ S +E +++++ L S+ AR S++P G ++V IG+P T L+
Sbjct: 67 QTSSIERFDFLESKIKELKSVGNEARSSLIPFNRG------SGFLVNLSIGSPPVTQLVV 120
Query: 114 MDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGG-GACAF 169
+DT + WV C C+ C S++ F+ +S +FK LGC + C +
Sbjct: 121 VDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEY 180
Query: 170 NLTY-GSSTIAANLSQDTISLATDIVPG------YTFGC--IQKATGNSVPPQGLLGLGR 220
L Y G + L+++++ T + G TFGC + T N G+ GLG
Sbjct: 181 KLRYLGGDSSQGILAKESLLFET-LDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGA 239
Query: 221 G-SLSLLAQTQNLYQSTFSYCLPSFKA-LSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSL 278
+++ Q N FSYC+ L L LG G TPL
Sbjct: 240 YPHITMATQLGN----KFSYCIGDINNPLYTHNHLVLGQ-GSYIEGDSTPL---QIHFGH 291
Query: 279 YYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA----YTAVRDVFR 334
YYV L +I VG + + I P A + + G +IDSG +T+L Y + D+ +
Sbjct: 292 YYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMK 351
Query: 335 RRVGSNLTVTSLGG--FDTCYSVPIVA-PTITLMFSGMNVTLPQDNLLIHSTAGSITCLA 391
+ T G F S +V P +T F+G + + L G CLA
Sbjct: 352 GLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLA 411
Query: 392 MAAAPDNVNSV-LNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ P N + L+VI + QQN+ + +D+ ++ R C
Sbjct: 412 I--LPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/330 (25%), Positives = 127/330 (38%), Gaps = 42/330 (12%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC----------------- 125
+P+ S I Y+V +IGTPA + +DT+ D W+ C
Sbjct: 111 LPMRSALNIAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQ 170
Query: 126 ------TGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY------ 173
G S + A+S++++ + C +C +P TC + A + +Y
Sbjct: 171 TMSMGGEGAKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQD 230
Query: 174 GSSTIAANLSQDTISLATD----IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQ 228
G+ TI + +D +PG GC G SV G+L LG G +S
Sbjct: 231 GTVTIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVH 290
Query: 229 TQNLYQSTFSYCLPSFKALSFSGS-LRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLL 284
+ FS+CL S + + S L GP + P ++ T +L N Y +
Sbjct: 291 AAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME-TDILYNVDVKPAYGAQVT 349
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
+ VG +DIP G G I+D+ T T LV AY V R + V
Sbjct: 350 GVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVY 409
Query: 345 SLGGFDTCYSVPIVAPTITLMFSGMNVTLP 374
L GF+ CY + NVT+P
Sbjct: 410 ELEGFEYCYKWTFTGDGVD---PAHNVTIP 436
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 97/207 (46%), Gaps = 12/207 (5%)
Query: 232 LYQSTFSYCLPSFKALSFSGSLRLGPIGQP-KRIKYTPLLKNPRRSSLYYVNLLAIRVGR 290
+ ++ FSYCL S S + L LG + + K TPLL NP + S YY++L I VG
Sbjct: 1 MKEAKFSYCLTSMDD-SKASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGG 59
Query: 291 RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFD 350
+ I + G IIDSGT T L + ++ F + L +S G D
Sbjct: 60 TQLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGLD 119
Query: 351 TCYSVP-----IVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV 405
C+S+P + P + F G ++ LP ++ +I + + CLAM A+ + +++
Sbjct: 120 VCFSLPSETTQVEVPKLVFHFKGGDLELPAESYMIADSKLGVACLAMGAS-----NGMSI 174
Query: 406 IANMQQQNHRILYDVPNSRLGVARELC 432
N+QQQN + +D+ + C
Sbjct: 175 FGNVQQQNILVNHDLEKETISFVPTQC 201
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 115/442 (26%), Positives = 182/442 (41%), Gaps = 56/442 (12%)
Query: 33 TLQVFHVFSPCSPFKPSKPLSWEESV-LEML-AKDQARLQFLSSLAVARKSVVPIASGRQ 90
T V H SP S + + V LE+L A+DQAR L V +
Sbjct: 20 TAAVVHCGSPASLLTLERAFPVNQRVELEVLRARDQARHGRLLRGVVGGVVDFTVYGTSD 79
Query: 91 ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQ 150
Y + K+G+P + + +DT +D WV C C C T + + F
Sbjct: 80 PYLVGLYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSS 139
Query: 151 AAQCKQVPNPTCG-------------GGACAFNLTYGSST------IAANLSQDTI---S 188
+P C C+++ YG + ++ L DT+ S
Sbjct: 140 TTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDS 199
Query: 189 LATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQTQNL--YQSTFSYCLP 242
L + FGC +G+ G+ G G+ LS+++Q +L FS+CL
Sbjct: 200 LIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLK 259
Query: 243 SFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
G L LG I +P I Y+PL+ P +S Y +NL +I V +++ I P F
Sbjct: 260 GEG--DGGGKLVLGEILEPN-IIYSPLV--PSQSH-YNLNLQSISVNGQLLPIDPAV--F 311
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLT-VTSLGGFDTCY----SVPI 357
+ GTI+DSGT T LV AY V S+ T V S G + CY SV
Sbjct: 312 ATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSSSTTPVLSKG--NQCYLVSTSVDE 369
Query: 358 VAPTITLMFS-GMNVTLPQDNLLIH---STAGSITCLAM--AAAPDNVNSVLNVIANMQQ 411
+ P ++L F+ G ++ L L+H S ++ C+ A P + ++ ++
Sbjct: 370 IFPPVSLNFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPG-----ITILGDLVL 424
Query: 412 QNHRILYDVPNSRLGVARELCT 433
++ +YD+ + R+G A C+
Sbjct: 425 KDKIFVYDLAHQRIGWANYDCS 446
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 163/368 (44%), Gaps = 62/368 (16%)
Query: 107 AQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ----CKQVPN 159
QT + +DT + +VPC GC C ++ +S F+ L C A C++
Sbjct: 48 GQTYDLIVDTGSARTYVPCKGCARCGEHAHGYYDYDRSMEFERLDCGEASDATLCEETMK 107
Query: 160 PTC-GGGACAFNLTYGS-STIAANLSQDTISLATDIVPGY-TFGCIQKATGNSVPPQ--- 213
TC G C++ ++Y S+ + +D + L + FGC ++A N++ Q
Sbjct: 108 GTCQSDGRCSYVVSYAEGSSSRGYVVRDRVRLGEGTLSAMLAFGC-EEAETNAIYEQKAD 166
Query: 214 GLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPI---GQPKRIKYTP 268
GL G GRG+ ++ AQ + L ++ FS+C+ F A G L LG + TP
Sbjct: 167 GLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGA--NGGVLTLGRFDFGADAPALARTP 224
Query: 269 LLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTA 328
L+ +P + + V + ++G +++ N T T +DSGT FT + + +
Sbjct: 225 LVADPANPAFHNVRTSSWKLGDSLIE------HLNSYT---TTLDSGTTFTFVPRSVWVS 275
Query: 329 VRDVFRRRVGSNLTVTSLGGF--------DTCYSVPIVAPTITLMFS------------- 367
F+ R+ + T L D CY V A +TL S
Sbjct: 276 ----FKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLTIAY 331
Query: 368 --GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
G+++TL P++ L H T + C+ + A P+ N +L + + ++ + +DV NSR
Sbjct: 332 EGGVSLTLGPENYLFAHETNSAAFCVGIFANPN--NQIL--LGQITMRDTLMEFDVANSR 387
Query: 425 LGVARELC 432
+G+A C
Sbjct: 388 VGMAPANC 395
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 157/376 (41%), Gaps = 55/376 (14%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQCKQ 156
Y + K+GTP + +DT +D WV C C GC + Q F ++
Sbjct: 79 YFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLVS 138
Query: 157 VPNPTCGGG-------------ACAFNLTY--GSSTIAANLSQDTISLATDIVPGYT--- 198
+P C C++ Y GS T +S+ S+ D+V G +
Sbjct: 139 CSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSE---SMYFDMVMGQSMIA 195
Query: 199 -------FGCIQKATGNSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFK 245
FGC +G+ G+ G G G LS+++Q + + FS+CL
Sbjct: 196 NSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGEG 255
Query: 246 ALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT 305
+ G L LG + +P I Y+PL+ + +LY L +I V + + I P F +
Sbjct: 256 --NGGGILVLGEVLEPG-IVYSPLVPSQPHYNLY---LQSISVNGQTLPIDPSV--FATS 307
Query: 306 TGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPT 361
GTIIDSGT LV AYT V ++T T G + CY SV + P
Sbjct: 308 INRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKG-NQCYLVSTSVGEIFPL 366
Query: 362 ITLMFSG-MNVTLPQDNLLIH---STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRIL 417
++L F+G ++ L + L+H ++ C+ + V ++ ++ ++ +
Sbjct: 367 VSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGV----TILGDLVMKDKIFV 422
Query: 418 YDVPNSRLGVARELCT 433
YD+ R+G A C+
Sbjct: 423 YDLARQRIGWASYDCS 438
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 103/428 (24%), Positives = 174/428 (40%), Gaps = 53/428 (12%)
Query: 33 TLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQI 91
++ + H SP SP + PS E+ E L + R S +++ + P S
Sbjct: 36 SIDLIHRDSPKSPLYNPS------ETPAERLDRFFRRFMSFSEASISPNTPEPPVS---- 85
Query: 92 TQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLG 148
+ + Y+++ IGTP + DT +D W C C+ C + +F+ ++ST+FK +
Sbjct: 86 SNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVS 145
Query: 149 CQAAQCKQVPNPTCG--GGACAFNLTYGSSTIAAN-LSQDTISLATD-----IVPGYTFG 200
C++ QC+ + +C C F+ YG ++A ++ +T++L ++ + FG
Sbjct: 146 CESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVFG 205
Query: 201 CIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQS--TFSYCLPSFKAL-SFSGSLRLG 256
C +G + GL G G LSL +Q + S FS CL F+ S + + G
Sbjct: 206 CGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFG 265
Query: 257 PIGQ--PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
P + + TPL+ + Y+V L I VG ++ P + + +
Sbjct: 266 PEAEVSGSDVVSTPLVTK-DDPTYYFVTLDGISVGDKLF----------PFSSSSPMATK 314
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDT-------CY--SVPIVAPTITLM 365
G VF P RD + R V + CY + I P +T
Sbjct: 315 GNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAH 374
Query: 366 FSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
F G +V L N I G + C AM ++ + N Q N I +D+ ++
Sbjct: 375 FDGADVQLKPLNTFISPKEG-VYCFAMQP----IDGDTGIFGNFVQMNFLIGFDLDGKKV 429
Query: 426 GVARELCT 433
CT
Sbjct: 430 SFKAVDCT 437
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 164/382 (42%), Gaps = 48/382 (12%)
Query: 84 PIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC-VGCSST------VF 136
P+ +I + + + +GTP L+ +DT + +WV C C + C +T VF
Sbjct: 63 PVVGNHEIHEG-KFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVF 121
Query: 137 NSAQSTTFKNLGCQAAQCKQVPNPTCG-------GGACAFNLTYGSS----TIAANLSQD 185
+ +STT++ +GC + C V C ++L YGS A L D
Sbjct: 122 DPDKSTTYELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTD 181
Query: 186 TISLA--TDIVPGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQT--QNLYQSTFSYCL 241
++LA + I+ G+ FGC + G++G G + S Q Q Y++ FSYC
Sbjct: 182 KLTLASSSSIIDGFIFGCSGDDSFKGY-ESGVIGFGGANFSFFNQVARQTNYRA-FSYCF 239
Query: 242 PSFKALSFSGSLRLGPIGQPK-RIKYTPLLKNPRRSSLYYVNLLAIRV-GRRVVDIPPGA 299
P G L +G PK + YT L+ + S+Y + + + V G R+
Sbjct: 240 PGDHTA--EGFLSIG--AYPKDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRL------Q 289
Query: 300 LQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY------ 353
+ + T ++DSGTV T L+ P + A + + ++ G +TC+
Sbjct: 290 VDQSEYTKRMMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTETCFRPNGGD 349
Query: 354 SVPIVA-PTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNV--IANMQ 410
SV PT+ + F G + LP +N+ H S + +A PD V V NV + N
Sbjct: 350 SVDSGDLPTVEMRFIGTTLKLPPENVF-HDLLPSHDKICLAFKPD-VAGVRNVQILGNKA 407
Query: 411 QQNHRILYDVPNSRLGVARELC 432
+ R++YD+ G C
Sbjct: 408 TXSFRVVYDLQAMYFGFQAGAC 429
>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like, partial [Brachypodium distachyon]
Length = 364
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 138/304 (45%), Gaps = 47/304 (15%)
Query: 167 CAFNLTY--GSSTIAANLSQDTISLATDIVPGY--TFGCIQKATGNS---VPPQGLLGLG 219
C +L+Y GSS+ A L+ D ++ + P FGC+ A +S V GLLG+
Sbjct: 59 CRVSLSYADGSSSDGA-LATDVFAVGS-ATPSLRAAFGCMASAFDSSPDGVASAGLLGMN 116
Query: 220 RGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPK--RIKYTPL----LKNP 273
RG+LS ++Q FSYC+ +G L LG P + YTPL L P
Sbjct: 117 RGALSFVSQAGT---RRFSYCISDRDD---AGVLLLGHSDLPNFLPLNYTPLYQPSLPLP 170
Query: 274 RRSSLYY-VNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDV 332
+ Y V LL I VG + + IP L + T T++DSGT FT L+ AY A++
Sbjct: 171 YFDRVAYSVQLLGILVGSKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYAALKAE 230
Query: 333 FRR------RVGSNLTVTSLGGFDTCYSVP--------IVAPTITLMFSGMNVTLPQDNL 378
F R R + G FDTC+ VP + P++TL F+G + + D L
Sbjct: 231 FYRQSTPFLRALDEPSFAFQGAFDTCFRVPRGMSPPPGRLLPSVTLRFNGAEMVVGGDRL 290
Query: 379 LIH----------STAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
L + ++ CL A D V + VI + Q N + YD+ R+G+A
Sbjct: 291 LYKVPGERRGGAGADDDAVWCLTFGNA-DMVPIMAYVIGHHHQMNLWVEYDLERGRVGLA 349
Query: 429 RELC 432
+ C
Sbjct: 350 QVRC 353
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 116/426 (27%), Positives = 172/426 (40%), Gaps = 67/426 (15%)
Query: 40 FSPCSPFKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK---SVVPIASGRQITQSPT 96
+ P K L E S LA QAR++ SL SV P +GR I
Sbjct: 50 YKPNETAKDRMELDIEHSAAR-LAYIQARIE--GSLVYNNDYTASVSPSLTGRTI----- 101
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNL-----G 148
+V IG P+ L+ MDT +D W+ C C C + +F+ + S+TF L G
Sbjct: 102 -LVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPLCKTPCG 160
Query: 149 CQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKATGN 208
+ +C +P + + T+G + + + S +D++ GC N
Sbjct: 161 FKGCKCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVI----IGCGHNIGFN 216
Query: 209 SVPP-QGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFS-GSLRLGPIGQPKRIKY 266
S P G+LGL G SL Q FSYC+ + ++ LRLG G
Sbjct: 217 SDPGYNGILGLNNGPNSLATQIGR----KFSYCIGNLADPYYNYNQLRLGE-GADLEGYS 271
Query: 267 TPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA- 325
TP YYV + I VG + +DI + G I+DSGT T LV A
Sbjct: 272 TPF---EVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSAH 328
Query: 326 ---YTAVRDV----FRRRVGSNLTVTSLGGFDTCY----SVPIVA-PTITLMF-SGMNVT 372
Y VR++ FR+ + N + CY S +V P +T F G ++
Sbjct: 329 KLLYNEVRNLLKWSFRQVIFEN------APWKLCYYGIISRDLVGFPVVTFHFVDGADLA 382
Query: 373 LPQDNLLIHSTAGSITCLAMAAAPDNVNSVLN------VIANMQQQNHRILYDVPNSRLG 426
L + S I C+ ++ A S+LN VI + QQ++ + YD+ N +
Sbjct: 383 LDTGSFF--SQRDDIFCMTVSPA-----SILNTTISPSVIGLLAQQSYNVGYDLVNQFVY 435
Query: 427 VARELC 432
R C
Sbjct: 436 FQRIDC 441
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 119/473 (25%), Positives = 197/473 (41%), Gaps = 69/473 (14%)
Query: 1 MKPQLVFFLAFLFLFSLSEGLNPICDTQDHSSTLQVFHVFSPCSP-FKPSKPLSWEESVL 59
M Q++ F +LS +P + ++++ H SP SP + P
Sbjct: 1 MATQILLCFFLFFSVTLSSSGHP------KNFSVELIHRDSPLSPIYNP----------- 43
Query: 60 EMLAKDQARLQFLSSLAVARK-----SVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
++ D+ FL S++ +R+ S + SG I + + IGTP +
Sbjct: 44 QITVTDRLNAAFLRSVSRSRRFNHQLSQTDLQSGL-IGADGEFFMSITIGTPPIKVFAIA 102
Query: 115 DTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCG----GGAC 167
DT +D WV C C C +F+ +S+T+K+ C + C+ + + G C
Sbjct: 103 DTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNIC 162
Query: 168 AFNLTYGSSTIA-ANLSQDTISLATD-----IVPGYTFGCIQKATGN-SVPPQGLLGLGR 220
+ +YG + + +++ +T+S+ + PG FGC G G++GLG
Sbjct: 163 KYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGG 222
Query: 221 GSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS--LRLGPIGQPKRIKY------TPLL-K 271
G LSL++Q + FSYCL S K+ + +G+ + LG P + TPL+ K
Sbjct: 223 GHLSLISQLGSSISKKFSYCL-SHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDK 281
Query: 272 NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPT-------TGAGTIIDSGTVFTRLVAP 324
P + YY+ L AI VG++ IP +NP T IIDSGT T L A
Sbjct: 282 EPL--TYYYLTLEAISVGKK--KIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAG 337
Query: 325 AYTAVRDVFRRRV-GSNLTVTSLGGFDTCY---SVPIVAPTITLMFSGMNVTLPQDNLLI 380
+ V G+ G C+ S I P IT+ F+G +V L N +
Sbjct: 338 FFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFV 397
Query: 381 HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELCT 433
+ + CL+M + + + N Q + + YD+ + C+
Sbjct: 398 K-LSEDMVCLSMVPTTE-----VAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 131/331 (39%), Gaps = 42/331 (12%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPC------------TGCVG 130
+P+ S I Y+V + GTPA + +DT+ND W+ C T VG
Sbjct: 113 LPMRSALNIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVG 172
Query: 131 CSS-----------TVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTY------ 173
+ A+S++++ + C +C +P TC + A + +Y
Sbjct: 173 AGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQD 232
Query: 174 GSSTIAANLSQDTISLATD----IVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLAQ 228
G+ T+ + +D +PG GC G SV G+L LG G +S
Sbjct: 233 GTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVH 292
Query: 229 TQNLYQSTFSYCLPSFKALSFSGS-LRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLL 284
+ FS+CL S + + S L GP + P ++ T ++ N Y +
Sbjct: 293 AAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME-TDIVYNVDVKPAYGPLVT 351
Query: 285 AIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT 344
I VG +DIP G G I+D+ T T LV AY AV R + V
Sbjct: 352 GIFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVY 411
Query: 345 SLGGFDTCYSVPIVAPTITLMFSGMNVTLPQ 375
L GF+ CY + L NVT+P+
Sbjct: 412 ELDGFEYCYRWTFAGDGVDLTH---NVTVPR 439
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 116/441 (26%), Positives = 185/441 (41%), Gaps = 63/441 (14%)
Query: 33 TLQVFHVFSPCSP-FKPSKPLSWEESVLEMLAKDQARLQFLSSLAVARK-----SVVPIA 86
++++ H SP SP + P ++ D+ FL S++ +R+ S +
Sbjct: 27 SVELIHRDSPLSPLYNPKNTVT-----------DRLNAAFLRSISRSRRLNNILSQTDLQ 75
Query: 87 SGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTT 143
SG I + + IGTP + DT +D WV C C C + +F+ +S+T
Sbjct: 76 SGL-IGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSST 134
Query: 144 FKNLGCQAAQCKQVPNPTCG----GGACAFNLTYGS-----STIAAN-LSQDTISLATDI 193
+K+ C + C + + G C + +YG +A +S D+ S +
Sbjct: 135 YKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVS 194
Query: 194 VPGYTFGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGS 252
PG FGC G G++GLG G LSL++Q + FSYCL S K+ + +G+
Sbjct: 195 FPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL-SHKSATTNGT 253
Query: 253 --LRLGPIGQPKRIKY------TPLL-KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFN 303
+ LG P + TPL+ K PR + YY+ L AI VG++ IP +N
Sbjct: 254 SVINLGTNSIPSSLSKDSGVISTPLVDKEPR--TYYYLTLEAISVGKK--KIPYTGSSYN 309
Query: 304 PTTG-------AGTIIDSGTVFTRLVAPAYTAVRDVFRRRV-GSNLTVTSLGGFDTCY-- 353
P G IIDSGT T L + + V G+ G C+
Sbjct: 310 PNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFKS 369
Query: 354 -SVPIVAPTITLMFSGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQ 412
S I P IT+ F+G +V L N + + + CL+M + + + N Q
Sbjct: 370 GSAEIGLPEITVHFTGADVRLSPINAFVK-VSEDMVCLSMVPTTE-----VAIYGNFAQM 423
Query: 413 NHRILYDVPNSRLGVARELCT 433
+ + YD+ + R C+
Sbjct: 424 DFLVGYDLETRTVSFQRMDCS 444
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 156/366 (42%), Gaps = 51/366 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +D+ + +VPC C C + F S+T+ + C +A
Sbjct: 85 YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC-SAD 143
Query: 154 CKQVPNPTCGGGA--CAFNLTYGS-STIAANLSQDTISLAT--DIVPGY-TFGCIQKATG 207
C TC C + Y S+ + L +D +S T ++ P FGC TG
Sbjct: 144 C------TCDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSETG 197
Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQPK 262
+ S G++GLGRG LS++ Q + + +FS C + + G++ LG + P
Sbjct: 198 DLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC---YGGMDIGGGAMVLGAMPAPP 254
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
+ ++ +P RS Y + L I V + + + P + GT++DSGT + L
Sbjct: 255 DMVFS--RSDPVRSPYYNIELKEIHVAGKALRLDPRIFD----SKHGTVLDSGTTYAYLP 308
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA-------------PTITLMF-SG 368
A+ A +D +V + + G D Y A P + ++F G
Sbjct: 309 EQAFVAFKDAVTSKV---RPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDG 365
Query: 369 MNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
++L P++ L HS CL + + ++L I +N + YD N ++G
Sbjct: 366 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKIGF 422
Query: 428 ARELCT 433
+ C+
Sbjct: 423 WKTNCS 428
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 153/359 (42%), Gaps = 39/359 (10%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGC---SSTVFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +DT + +VPC+ C C F S+T+ Q +
Sbjct: 81 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTY-----QPVK 135
Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATGN- 208
C N C + Y ST + L +D +S +++ P FGC TG+
Sbjct: 136 CTLDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENVETGDL 195
Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
S G++GLGRG LS++ Q +N+ +FS C G++ LG I P +
Sbjct: 196 YSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDV--GGGAMVLGGISPPSDMV 253
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
+ +P RS Y ++L I V + + + P G+++DSGT + L A
Sbjct: 254 FAQ--SDPVRSPYYNIDLKEIHVAGKRLPLNPSVFD----GKHGSVLDSGTTYAYLPEEA 307
Query: 326 YTAVRDVFRRRVG--SNLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMNVTL- 373
+ A ++ + + S ++ D C+S + P + ++F +G +L
Sbjct: 308 FLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLS 367
Query: 374 PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
P++ + HS CL + + ++L I +N +LYD +++G + C
Sbjct: 368 PENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIV---VRNTLVLYDREQTKIGFWKTNC 423
>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 404
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 82/244 (33%), Positives = 109/244 (44%), Gaps = 26/244 (10%)
Query: 199 FGCIQKATGN-SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSF-SGSLRLG 256
FGC G S G + LG G SL +QT + Y FSYC+P A F S +G
Sbjct: 177 FGCSHSVRGRFSGQTSGTMSLGGGRQSLRSQTASAYGDAFSYCVPQPSASGFLSLGGAIG 236
Query: 257 PIGQPKRIKYTPLLK--NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDS 314
G TPL+ NP + Y V L I V R +++PP AGT++DS
Sbjct: 237 SSGSGSGFASTPLVATANP---TFYVVRLQGIDVAGRRLNVPPAVFS------AGTLMDS 287
Query: 315 GTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG--FDTCYSVP----IVAPTITLMFSG 368
V T+L AY A+R FR + V + G DTCY + P ++L+FSG
Sbjct: 288 SAVVTQLPPTAYRALRRAFRNAMRRYRRVPAGGKQILDTCYDFEGLGNVTVPAVSLVFSG 347
Query: 369 MNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
V + ++ CLA P +S L I N+QQQ H +LYDV +G
Sbjct: 348 GAVVRLEPMAVMMEG-----CLAFVPTP--ADSDLGFIGNVQQQTHEVLYDVGARNVGFR 400
Query: 429 RELC 432
R C
Sbjct: 401 RGAC 404
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/362 (24%), Positives = 153/362 (42%), Gaps = 41/362 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y+ IGTP Q + +D + + W CT C C +F+ +S+TF+ L C +
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 154 CKQVPNPT--CGGGACAFNLTYGSSTIAANLSQDTISLATDIVPGYTFGCI---QKATGN 208
C+ +P + C C + + DT ++ FGC+ K
Sbjct: 117 CESIPESSRNCTSDVCIYEAPTKAGDTGGKAGTDTFAIGA-AKETLGFGCVVMTDKRLKT 175
Query: 209 SVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ-------- 260
P G++GLGR SL+ Q + + FSYCL A SG+L LG +
Sbjct: 176 IGGPSGIVGLGRTPWSLVTQ---MNVTAFSYCL----AGKSSGALFLGATAKQLAGGKNS 228
Query: 261 --PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGA-LQFNPTTGAGTIIDSGTV 317
P IK + + + Y V L I+ G GA LQ ++G+ ++D+ +
Sbjct: 229 STPFVIKTSAGSSDNGSNPYYMVKLAGIKTG--------GAPLQAASSSGSTVLLDTVSR 280
Query: 318 FTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFS---GMNVTLP 374
+ L AY A++ VG + +D C+ + L+F+ G +T+P
Sbjct: 281 ASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAPELVFTFDGGAALTVP 340
Query: 375 QDNLLIHSTAGSITCLAMAAAPDNVNSVL---NVIANMQQQNHRILYDVPNSRLGVAREL 431
N L+ S G++ ++A N+ L +++ ++QQ+N +L+D+ L
Sbjct: 341 PANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPAD 400
Query: 432 CT 433
C+
Sbjct: 401 CS 402
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 154/365 (42%), Gaps = 49/365 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAA- 152
Y R IGTP Q + +D+ + +VPC C C + F S+T+ + C
Sbjct: 88 YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147
Query: 153 QCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT--DIVPGYT-FGCIQKATGN 208
C N C + Y S+ + L +D +S T ++ P FGC TG+
Sbjct: 148 TCDSDKN------QCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSETGD 201
Query: 209 --SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQPKR 263
S G++GLGRG LS++ Q + + +FS C + + G++ LG + P
Sbjct: 202 LFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC---YGGMDIGGGAMVLGAMPAPPG 258
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ YT N RS Y + L + V + + + P GT++DSGT + L
Sbjct: 259 MIYT--HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGK----HGTVLDSGTTYAYLPE 312
Query: 324 PAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------------VPIVAPTITLMF-SGM 369
A+ A +D +V + + G D+ Y + V P + ++F +G
Sbjct: 313 QAFVAFKDAVSSQV---HPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQ 369
Query: 370 NVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
++L P++ L HS CL + + ++L I +N + YD N ++G
Sbjct: 370 KLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKIGFW 426
Query: 429 RELCT 433
+ C+
Sbjct: 427 KTNCS 431
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 168/388 (43%), Gaps = 52/388 (13%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNSAQST 142
+P+ T++ Y R IGTPA+ + +DT +D WV C C GC + T
Sbjct: 76 LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELT 135
Query: 143 TFKNLGCQAAQ---CKQ---VPN-----PTCGGGA-CAFNLTYGSSTIAAN------LSQ 184
+ G Q+ + C Q V N P+C + C ++++YG + A L
Sbjct: 136 MYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQY 195
Query: 185 DTISLATDIVPG---YTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLYQS 235
+ +S P +FGC K G+ ++ G+LG G+ + S+L+Q +
Sbjct: 196 NQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRK 255
Query: 236 TFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDI 295
F++CL + G +G + QPK +K TPL+ + Y V L I VG + +
Sbjct: 256 MFAHCLDTVNG---GGIFAIGNVVQPK-VKTTPLVPDMPH---YNVILKGIDVGGTALGL 308
Query: 296 PPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCY- 353
P F+ GTIIDSGT + Y A+ VF + +++V +L F +C+
Sbjct: 309 PTNI--FDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKH--QDISVQTLQDF-SCFQ 363
Query: 354 ---SVPIVAPTITLMFSGMNVTL---PQDNLLIHSTAGSITCLAM--AAAPDNVNSVLNV 405
SV P +T F G +V+L P D L ++ C+ L +
Sbjct: 364 YSGSVDDGFPEVTFHFEG-DVSLIVSPHDYLF--QNGKNLYCMGFQNGGGKTKDGKDLGL 420
Query: 406 IANMQQQNHRILYDVPNSRLGVARELCT 433
+ ++ N +LYD+ N +G A C+
Sbjct: 421 LGDLVLSNKLVLYDLENQAIGWADYNCS 448
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 162/370 (43%), Gaps = 58/370 (15%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS------------TVFNSAQSTTF 144
Y +GTP + ++A+DT +D WVPC C+ C+ ++ A+STT
Sbjct: 143 YYTWVDVGTPNTSFMVALDTGSDLFWVPCD-CIECAPLAGYRETLDRDLGIYKPAESTTS 201
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTY--GSSTIAANLSQDTISLATD-----IVPGY 197
++L C C + C ++ Y ++T + L +D + L + +
Sbjct: 202 RHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPVKASV 261
Query: 198 TFGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSG 251
GC +K +G + + P GLLGLG +S+ LA+ L +++FS C FK SG
Sbjct: 262 VIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARA-GLVRNSFSMC---FK--EDSG 315
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G G + + TP + + Y VN+ VG + + T +
Sbjct: 316 RIFFGDQGVSIQ-QSTPFVPLYGKYQTYAVNVDKSCVGHKCFE----------ATSFEAL 364
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-----VPIVAPTITLMF 366
+DSGT FT L Y AV F ++V + F+ CYS +P V PT+TL F
Sbjct: 365 VDSGTSFTALPLNVYKAVAVEFDKQVHAPRITQEDASFEYCYSASPLKMPDV-PTVTLTF 423
Query: 367 SGMNVTLPQDN--LLIHSTAGSIT--CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPN 422
+ N + N +++ GS+ CLA+ +P+ + +I + I++D N
Sbjct: 424 AA-NKSFQAVNPTIVLKDGEGSVAGFCLALQKSPEPI----GIIGQNFLTGYHIVFDKEN 478
Query: 423 SRLGVARELC 432
+LG R C
Sbjct: 479 MKLGWYRSEC 488
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 142/339 (41%), Gaps = 41/339 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y+++ ++GTP + +DT +D W +PCT C + +F+ + S+TFK
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFK-------- 112
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD-----IVPGYTFGCIQKATG 207
C G +C + + Y +T + L+ +T+++ + ++P T GC ++
Sbjct: 113 -----EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIGQPKRIK 265
G++GL G SL+ Q Y SYC S ++F + + G +
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDG----VV 223
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
T + + LYY+NL A+ VG V+ F+ G IIDSGT T
Sbjct: 224 STTMFLTTAKPGLYYLNLDAVSVGDTHVETM--GTTFHALEG-NIIIDSGTTLTYFPVSY 280
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA--PTITLMFS-GMNVTLPQDNLLIHS 382
VR+ V + T G CY + P IT+ FS G ++ L + N+ I +
Sbjct: 281 CNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNMYIET 340
Query: 383 TAGSITCLAMAAA--PDNVNSVLNVIANMQQQNHRILYD 419
CLA+ P + + N Q N + YD
Sbjct: 341 ITRGTFCLAIICNNPPQDA-----IFGNRAQNNFLVGYD 374
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 163/397 (41%), Gaps = 44/397 (11%)
Query: 63 AKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPA--QTLLMAMDTSNDA 120
AK + R + A A + I + S Y V +GT + + MD +
Sbjct: 67 AKQEVRCRIAHRFAGADITAASIRTYLCPPASMVYAVAVGVGTEHGYENYELEMDMAAGF 126
Query: 121 AWVPCTGCVGCS---STVFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSST 177
+W+ C C C + VF+ A+S TF+ + A + P G C F + Y +
Sbjct: 127 SWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHNAVLCRPPYHPLQDGRCGFGIAYRNGA 186
Query: 178 IAAN-LSQDTISLAT-----DIVPGYTFGCIQK----ATGNSVPPQGLLGLGRGSLS--L 225
AA L++DT S T +PG FGC + T ++ G+LG+G G+ L
Sbjct: 187 SAAGYLARDTFSFPTGDNNFQHLPGIVFGCANRIARFDTHGAL--AGVLGMGMGAEGKPL 244
Query: 226 LAQTQNLYQS---TFSYC--LPSFKALSFSGSLRLG---PIGQPKRIKYTPL--LKNPRR 275
+ LY + FSYC +P A SF LR G P P + + L
Sbjct: 245 TGFMRQLYHNGGGRFSYCPIVPGTTAYSF---LRFGNDIPSQPPAGVHRQSMAVLAPTTT 301
Query: 276 SSLYYVNLLAIRVGR-RVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFR 334
S YYV L I VG RV + P + + G ID GT T +V AY V R
Sbjct: 302 SEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGGCAIDIGTKMTAIVQTAYAHVEAAVR 361
Query: 335 RRVGSNLT--VTSLGGFDTCYSVPIVA---PTITLMFSG---MNVTLPQDNLLIHSTAGS 386
+ N V S G + P + P++TL F G + V L++ S G
Sbjct: 362 GHLQRNRARFVQSPGHHLCVHRTPAIEERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGG 421
Query: 387 ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNS 423
L + PD + + VI MQQ + R ++D+ N+
Sbjct: 422 GEYLCLGLVPD---AEMTVIGAMQQIDTRFIFDLHNN 455
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 142/339 (41%), Gaps = 41/339 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAW---VPCTGCVGCSSTVFNSAQSTTFKNLGCQAAQ 153
Y+++ ++GTP + +DT +D W +PCT C + +F+ + S+TFK
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFK-------- 112
Query: 154 CKQVPNPTCGGGACAFNLTYGSSTIA-ANLSQDTISLATD-----IVPGYTFGCIQKATG 207
C G +C + + Y +T + L+ +T+++ + ++P T GC ++
Sbjct: 113 -----EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167
Query: 208 NSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPS--FKALSFSGSLRLGPIGQPKRIK 265
G++GL G SL+ Q Y SYC S ++F + + G +
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDG----VV 223
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
T + + LYY+NL A+ VG V+ F+ G IIDSGT T
Sbjct: 224 STTMFLTTAKPGLYYLNLDAVSVGDTHVETM--GTTFHALEG-NIIIDSGTTLTYFPVSY 280
Query: 326 YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVA--PTITLMFS-GMNVTLPQDNLLIHS 382
VR+ V + T G CY + P IT+ FS G ++ L + N+ I +
Sbjct: 281 CNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNMYIET 340
Query: 383 TAGSITCLAMAAA--PDNVNSVLNVIANMQQQNHRILYD 419
CLA+ P + + N Q N + YD
Sbjct: 341 ITRGTFCLAIICNNPPQDA-----IFGNRAQNNFLVGYD 374
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 163/365 (44%), Gaps = 58/365 (15%)
Query: 91 ITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG-------CSSTVFNSAQSTT 143
IT+S Y++ +GTP LL DT +D WV C+ G + VF +S+T
Sbjct: 97 ITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSST 156
Query: 144 FKNLGCQAAQCKQVPNPTCGGGA-CAFNLTY--GSSTIAANLSQDTISLATD------IV 194
+ L CQ+ C+ + +C + C + +Y GS TI LS +T S V
Sbjct: 157 YSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGV-LSTETFSFVDGGGKGQVRV 215
Query: 195 PGYTFGCIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCL-PSFKALSFSG 251
P FGC A+ + GL+GLG G+ SL++Q SYCL PS+ A S S
Sbjct: 216 PRVNFGC-STASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANS-SS 273
Query: 252 SLRLGP---IGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGA 308
+L G + +P TPL+ + S Y V L ++ VG + V T +
Sbjct: 274 TLNFGSRAVVSEPGAAS-TPLVPSD-VDSYYTVALESVAVGGQEV----------ATHDS 321
Query: 309 GTIIDSGTVFT----RLVAPAYTAV-RDVFRRRVGSNLTVTSLGGFDTCYSVPIVA---- 359
I+DSGT T L+ P T + R + +RV + L CY V +
Sbjct: 322 RIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQL-----CYDVQGKSETDN 376
Query: 360 ---PTITLMF-SGMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHR 415
P +TL F G VTL +N G++ CL + P + + ++++ N+ QQN
Sbjct: 377 FGIPDVTLRFGGGAAVTLRPENTFSLLQEGTL-CLVL--VPVSESQPVSILGNIAQQNFH 433
Query: 416 ILYDV 420
+ YD+
Sbjct: 434 VGYDL 438
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 159/368 (43%), Gaps = 45/368 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----AQSTTFK-NLGCQ 150
Y R IGTP+Q + +D+ + +VPC C C + S A F+ +L
Sbjct: 92 YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 151
Query: 151 AAQCKQVPNPTCGG--GACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQK 204
+ K + TC C + Y S+ + L +D +S +++ P FGC
Sbjct: 152 YSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENT 211
Query: 205 ATGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
TG+ S G++GLGRG LS++ Q + + +FS C G++ LG +
Sbjct: 212 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV--GGGTMVLGGMPA 269
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
P + ++ NP RS Y + L I V + + + P FN + GT++DSGT +
Sbjct: 270 PPDMVFS--HSNPVRSPYYNIELKEIHVAGKALRLDPKI--FN--SKHGTVLDSGTTYAY 323
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------------VPIVAPTITLMF- 366
L A+ A +D +V S + + G D Y + V P + ++F
Sbjct: 324 LPEQAFVAFKDAVTNKVNS---LKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFG 380
Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+G ++L P++ L HS CL + + ++L I +N + YD N ++
Sbjct: 381 NGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKI 437
Query: 426 GVARELCT 433
G + C+
Sbjct: 438 GFWKTNCS 445
>gi|242086418|ref|XP_002443634.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
gi|241944327|gb|EES17472.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
Length = 486
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 160/354 (45%), Gaps = 46/354 (12%)
Query: 74 SLAVARKSVVPIASGRQITQSP---TYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVG 130
+L+ A +++P + P YIV G+P Q + + T+ + + C C
Sbjct: 125 ALSPAAATIIPANGSSDPSTLPGALDYIVLVSYGSPEQQFPVFLGTNVGTSLLRCKPCAS 184
Query: 131 CSST---VFNSAQSTTFKNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAANLSQDTI 187
S F++ QS+TF ++ C + C C C F YG T+ + D +
Sbjct: 185 GSDDCNPAFDTLQSSTFAHVPCSSPDCPV----NCSSSVCPFYDLYG--TVGGTFATDVL 238
Query: 188 SLATD--IVPGYTFGCIQ-KATGNSVPPQGLLGLGRGSLSLLAQTQNLY-----QSTFSY 239
+LA V + F C+ ++ +P G + L R SL +Q + ++FSY
Sbjct: 239 TLAPSSMAVHDFRFVCMDVESPSPDLPEAGSIDLSRHRNSLPSQLSSSSGIAPTAASFSY 298
Query: 240 CLPSFKALSFSGSLRLGP----IGQPKRIK-YTPLLKN--PRRSSLYYVNLLAIRVGRRV 292
CLP ++ + G L LG +G + + P++ N P +S+Y+++L+ + +G
Sbjct: 299 CLP--QSRNSQGFLSLGGDATVVGDDDNLTVHAPMVWNNDPDLASMYFIDLVGMSLGGED 356
Query: 293 VDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTS---LGGF 349
+ IP G A T +D G FT L AYT +RD FR+ + +S GF
Sbjct: 357 LPIPSGTFG-----NASTNLDVGATFTMLAPEAYTTLRDAFRKEMSQYNNRSSPAGFDGF 411
Query: 350 DTCYSV----PIVAPTITLMFS-GMNVTLPQDNLLIHS--TAGSIT--CLAMAA 394
DTC++ +V P + L FS G ++ + D +L + AG T CLA ++
Sbjct: 412 DTCFNFTGLNELVVPLVQLKFSNGESLMIDGDQMLYYHDPAAGPFTMACLAFSS 465
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 165/386 (42%), Gaps = 50/386 (12%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
+P+ + Y + K+G+P + + +DT +D WV C C +G +
Sbjct: 60 LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLS 119
Query: 135 VFNSAQSTTFKNLGCQAAQCKQV-PNPTCGGGA-CAFNLTYGS-STIAANLSQDTIS--- 188
+++S S+T KN+GC+ C + + TCG C++++ YG ST + +D I+
Sbjct: 120 LYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQ 179
Query: 189 ---------LATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLY 233
LA ++V FGC + +G G++G G+ + S+++Q
Sbjct: 180 VTGNLRTAPLAQEVV----FGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGST 235
Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
+ FS+CL + G +G + P +K TP++ N Y V L + V +
Sbjct: 236 KRIFSHCLDNMNG---GGIFAVGEVESP-VVKTTPIVPNQVH---YNVILKGMDVDGDPI 288
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDT 351
D+PP N GTIIDSGT L Y ++ + +++V ++ + F
Sbjct: 289 DLPPSLASTNGD--GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFACFSF 346
Query: 352 CYSVPIVAPTITLMFS-GMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI--A 407
+ P + L F + +++ P D L S + C + +VI
Sbjct: 347 TSNTDKAFPVVNLHFEDSLKLSVYPHDYLF--SLREDMYCFGWQSGGMTTQDGADVILLG 404
Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
++ N ++YD+ N +G A C+
Sbjct: 405 DLVLSNKLVVYDLENEVIGWADHNCS 430
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 162/389 (41%), Gaps = 81/389 (20%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST-----------VFNSAQSTTFK 145
+ +GTP+ + L+A+DT ++ W+PC C C + +++ S+T +
Sbjct: 62 HYANVSVGTPSVSFLVALDTGSNLLWLPCD-CSSCVHSLRSPSGTVDLNIYSPNTSSTSE 120
Query: 146 NLGCQAAQCKQVPNPTC--GGGACAFNLTY---GSSTIAANLSQDTISLATD------IV 194
+ C + C Q C C + + Y G+ST + QD + L +D +
Sbjct: 121 KVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGY-IVQDLLHLISDDSQSKAVD 179
Query: 195 PGYTFGCIQKATGNSV---PPQGLLGLGRGSLSLLAQ-TQNLYQS-TFSYCLP--SFKAL 247
TFGC + TG+ + P GL GLG ++S+ + N Y S +FS C +
Sbjct: 180 AKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPNGIGRI 239
Query: 248 SFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTG 307
SF G GQ + T + RSSLY +++ +G + D+ A
Sbjct: 240 SFGDK---GSTGQGE----TSFNQGQPRSSLYNISITQTSIGGQASDLVYSA-------- 284
Query: 308 AGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSV------------ 355
I DSGT FT L PAYT + + F + V ++ FD CY +
Sbjct: 285 ---IFDSGTSFTYLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCYDIRSFISAQILPFS 341
Query: 356 --------PIVAPTITLMFSG---MNVTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVL 403
P + P +TL+ SG NVT P +L+ GS + CL M + D +
Sbjct: 342 CAYANQTEPTI-PAVTLVMSGGDYFNVTDPI--VLVQLADGSAVYCLGMIKSGD-----V 393
Query: 404 NVIANMQQQNHRILYDVPNSRLGVARELC 432
N+I HRI++D LG C
Sbjct: 394 NIIGQNFMTGHRIVFDRERMILGWKPSNC 422
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 111/424 (26%), Positives = 171/424 (40%), Gaps = 74/424 (17%)
Query: 55 EESVLEMLAKDQARLQFLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAM 114
EE + + D RL L P+ T + Y + IGTP++ + +
Sbjct: 55 EEHLAALRKHDGRRLLTAVDL--------PLGGNGIPTDTGLYFTQIGIGTPSKGYYVQV 106
Query: 115 DTSNDAAWVPCTGC--------VGCSSTVFNSAQSTTFKNLGCQAAQCKQVPN----PTC 162
DT +D WV C C +G T+++ S + K + C C N P+C
Sbjct: 107 DTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTVTCGQEFCATATNGGVPPSC 166
Query: 163 GGGA-CAFNLTYGSST------IAANLSQDTIS------LATDIVPGYTFGCIQKATG-- 207
+ C +++TYG + +A L D +S LA V TFGC K G
Sbjct: 167 AANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLANASV---TFGCGAKIGGAL 223
Query: 208 --NSVPPQGLLGLGRGSLSLLAQTQNLYQST--FSYCLPSFKALSFSGSLRLGPIGQPKR 263
++V G+LG G+ + S+L+Q + + T FS+CL + G +G + QPK
Sbjct: 224 GSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNG---GGIFAIGNVVQPK- 279
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+K TPL+ Y V L I VG + +P + GTIIDSGT L
Sbjct: 280 VKTTPLVPGMPH---YNVVLKTIDVGGSTLQLPTNIFDIGGGS-RGTIIDSGTTLAYLPE 335
Query: 324 PAYTAVR--------DVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGM--NVTL 373
Y AV DV + V L G D + P +T F G V
Sbjct: 336 VVYKAVLSAVFSNHPDVTLKNVQDFLCFQYSGSVDNGF------PEVTFHFDGDLPLVVY 389
Query: 374 PQDNLLIHSTAGSITCLAMAA----APDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
P D L ++ + C+ + + D + VL + ++ N ++YD+ N +G
Sbjct: 390 PHDYLFQNTE--DVYCVGFQSGGVQSKDGKDMVL--LGDLALSNKLVVYDLENQVIGWTN 445
Query: 430 ELCT 433
C+
Sbjct: 446 YNCS 449
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 157/382 (41%), Gaps = 75/382 (19%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV------------FNSAQSTTF 144
Y +GTP+ L+A+DT +D W+PC C C + + ++ STT
Sbjct: 104 YYANVSVGTPSLDFLVALDTGSDLFWLPCE-CSSCFTYLNTSNGGKFMLNHYSPNDSTTS 162
Query: 145 KNLGCQAAQCKQVPNPTCGGGACAFNLTYGSSTIAA--NLSQDTISLATD------IVPG 196
+ C ++ C + T C + + Y S+ ++ L +D + LATD +
Sbjct: 163 STVPCTSSLCNRC---TSNQNVCPYEMRYLSANTSSIGYLVEDVLHLATDDSLLKPVEAK 219
Query: 197 YTFGCIQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSG 251
TFGC TG + P GL+GLG +S+ Q L ++FS C F A + G
Sbjct: 220 ITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMC---FGADGY-G 275
Query: 252 SLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTI 311
+ G G P K TP S Y V I VG D+P A I
Sbjct: 276 RIDFGDTG-PADQKQTPFNTMLEYQS-YNVTFNVINVGGEPNDVPFTA-----------I 322
Query: 312 IDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGG----FDTCYSVPIVAPT---ITL 364
DSGT FT L PAY+ + + G L SL G F+ CY +P A +TL
Sbjct: 323 FDSGTSFTYLTEPAYSTITK--QMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTL 380
Query: 365 MF----------SGMNVTLPQD----NLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQ 410
F + + V LP D N++ T + CLA+A + D +++I
Sbjct: 381 NFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETT-HVACLAIAKSTD-----IDLIGQNF 434
Query: 411 QQNHRILYDVPNSRLGVARELC 432
+RI ++ LG + C
Sbjct: 435 MTGYRITFNRDQMVLGWSSSDC 456
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 159/368 (43%), Gaps = 45/368 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTVFNS-----AQSTTFK-NLGCQ 150
Y R IGTP+Q + +D+ + +VPC C C + S A F+ +L
Sbjct: 91 YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 150
Query: 151 AAQCKQVPNPTCGG--GACAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQK 204
+ K + TC C + Y S+ + L +D +S +++ P FGC
Sbjct: 151 YSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENT 210
Query: 205 ATGN--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQ 260
TG+ S G++GLGRG LS++ Q + + +FS C G++ LG +
Sbjct: 211 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV--GGGTMVLGGMPA 268
Query: 261 PKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTR 320
P + ++ NP RS Y + L I V + + + P FN + GT++DSGT +
Sbjct: 269 PPDMVFS--HSNPVRSPYYNIELKEIHVAGKALRLDPKI--FN--SKHGTVLDSGTTYAY 322
Query: 321 LVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYS-------------VPIVAPTITLMF- 366
L A+ A +D +V S + + G D Y + V P + ++F
Sbjct: 323 LPEQAFVAFKDAVTNKVNS---LKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFG 379
Query: 367 SGMNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
+G ++L P++ L HS CL + + ++L I +N + YD N ++
Sbjct: 380 NGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKI 436
Query: 426 GVARELCT 433
G + C+
Sbjct: 437 GFWKTNCS 444
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 165/386 (42%), Gaps = 50/386 (12%)
Query: 83 VPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGC--------VGCSST 134
+P+ + Y + K+G+P + + +DT +D WV C C +G +
Sbjct: 64 LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLS 123
Query: 135 VFNSAQSTTFKNLGCQAAQCKQV-PNPTCGGGA-CAFNLTYGS-STIAANLSQDTIS--- 188
+++S S+T KN+GC+ C + + TCG C++++ YG ST + +D I+
Sbjct: 124 LYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQ 183
Query: 189 ---------LATDIVPGYTFGCIQKATGN----SVPPQGLLGLGRGSLSLLAQ--TQNLY 233
LA ++V FGC + +G G++G G+ + S+++Q
Sbjct: 184 VTGNLRTAPLAQEVV----FGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGST 239
Query: 234 QSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVV 293
+ FS+CL + G +G + P +K TP++ N Y V L + V +
Sbjct: 240 KRIFSHCLDNMNG---GGIFAVGEVESP-VVKTTPIVPNQVH---YNVILKGMDVDGDPI 292
Query: 294 DIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVF--RRRVGSNLTVTSLGGFDT 351
D+PP N GTIIDSGT L Y ++ + +++V ++ + F
Sbjct: 293 DLPPSLASTNGD--GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFACFSF 350
Query: 352 CYSVPIVAPTITLMFSG-MNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVI--A 407
+ P + L F + +++ P D L S + C + +VI
Sbjct: 351 TSNTDKAFPVVNLHFEDSLKLSVYPHDYLF--SLREDMYCFGWQSGGMTTQDGADVILLG 408
Query: 408 NMQQQNHRILYDVPNSRLGVARELCT 433
++ N ++YD+ N +G A C+
Sbjct: 409 DLVLSNKLVVYDLENEVIGWADHNCS 434
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 90/366 (24%), Positives = 158/366 (43%), Gaps = 51/366 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +D+ + +VPC C C + F S+++ + C
Sbjct: 89 YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCN-VD 147
Query: 154 CKQVPNPTCGGGA--CAFNLTYGS-STIAANLSQDTISLA--TDIVPGY-TFGCIQKATG 207
C TC C + Y S+ + L +D +S +++ P FGC TG
Sbjct: 148 C------TCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENSETG 201
Query: 208 N--SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQPK 262
+ S G++GLGRG LS++ Q + + +FS C + + G++ LG + P
Sbjct: 202 DLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLC---YGGMDIGGGAMVLGGVPAPS 258
Query: 263 RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLV 322
+ ++ +P RS Y + L I V + + + + FN + GT++DSGT + L
Sbjct: 259 DMVFS--HSDPLRSPYYNIELKEIHVAGKALRV--DSRVFN--SKHGTVLDSGTTYAYLP 312
Query: 323 APAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-------------SVPIVAPTITLMF-SG 368
A+ A +D +V S + + G D Y + V P + ++F +G
Sbjct: 313 EQAFVAFKDAVTSKVHS---LKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNG 369
Query: 369 MNVTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGV 427
++L P++ L HS CL + N ++ + +N + YD N ++G
Sbjct: 370 QKLSLTPENYLFRHSKVDGAYCLGVFQ---NGKDPTTLLGGIIVRNTLVTYDRHNEKIGF 426
Query: 428 ARELCT 433
+ C+
Sbjct: 427 WKTNCS 432
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 151/377 (40%), Gaps = 50/377 (13%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCT---GCVGCSS---------TVFNSAQSTTF 144
Y V K+GTP+Q ++ DT +D W+ C CS+ VF++ S++F
Sbjct: 83 YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142
Query: 145 KNLGCQAAQCK----------QVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATD- 192
K + C CK P P C ++ Y ST + +T+++
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLT---PCGYDYRYSDGSTALGFFANETVTVELKE 199
Query: 193 ----IVPGYTFGCIQKATGNSV-PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCL-PSFKA 246
+ GC + G S G++GLG S + + FSYCL
Sbjct: 200 GRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSH 259
Query: 247 LSFSGSLRLGPIGQPK----RIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF 302
+ S L G + + YT L+ +S Y VN++ I +G ++ IP +
Sbjct: 260 KNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEV--W 316
Query: 303 NPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVT-SLGGFDTCYS----VPI 357
+ GTI+DSG+ T L PAY V R + V +G + C++
Sbjct: 317 DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEES 376
Query: 358 VAPTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 416
+ P + F+ G P + +I S A + CL + S V+ N+ QQNH
Sbjct: 377 LVPRLVFHFADGAEFEPPVKSYVI-SAADGVRCLGFVSVAWPGTS---VVGNIMQQNHLW 432
Query: 417 LYDVPNSRLGVARELCT 433
+D+ +LG A CT
Sbjct: 433 EFDLGLKKLGFAPSSCT 449
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 156/365 (42%), Gaps = 39/365 (10%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS--------TVFNSAQSTTFKNLG 148
Y +G P Q L + +DT +D WV C+ C C S +++N + S+T
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSS 142
Query: 149 CQAAQC---KQVPNPTCGGGACAFNLTY--GSSTIAANLSQD---TISLATDIVPGYTFG 200
C C + V + + ACA+ +Y S+++ A + D + FG
Sbjct: 143 CSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATTSRIFFG 202
Query: 201 CIQKATGNSVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPI 258
C TG S P G++G G S ++ Q TQ FS+CL K G L G
Sbjct: 203 CATNITG-SWPVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEK--HGGGILEFGEA 259
Query: 259 GQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQF--NPTTGAGTIIDSGT 316
+ +TPLL ++ Y V+LL+I V +V+ I P + N T G IIDSGT
Sbjct: 260 PNTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDSGT 316
Query: 317 VFTRLVAPA----YTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIVAPTITLMFSGMN-V 371
F L A + ++ + ++G L + ++ P +TL FSG + +
Sbjct: 317 TFVLLTTKANRMLFQEIKSLTTAKLGPKLEGLECFYLKSGLTMETSFPNVTLTFSGGSTM 376
Query: 372 TLPQDNLLI---HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVA 428
L DN L+ + + C A ++A L + + ++ + YDV N R+G
Sbjct: 377 KLKPDNYLVMAEYKKKRNGYCYAWSSADG-----LTIFGEIVLKDKLVFYDVENRRIGWK 431
Query: 429 RELCT 433
+ C+
Sbjct: 432 GQNCS 436
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 153/362 (42%), Gaps = 43/362 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSSTV---FNSAQSTTFKNLGCQAA- 152
Y R IGTP Q + +D+ + +VPC C C + F S+T+ + C
Sbjct: 88 YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147
Query: 153 QCKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLAT--DIVPGYT-FGCIQKATGN 208
C N C + Y S+ + L +D +S T ++ P FGC TG+
Sbjct: 148 TCDSDKN------QCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSETGD 201
Query: 209 --SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSF-SGSLRLGPIGQPKR 263
S G++GLGRG LS++ Q + + +FS C + + G++ LG + P
Sbjct: 202 LFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC---YGGMDIGGGAMVLGAMPAPPG 258
Query: 264 IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVA 323
+ YT N RS Y + L + V + + + P GT++DSGT + L
Sbjct: 259 MIYT--HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGK----HGTVLDSGTTYAYLPE 312
Query: 324 PAYTAVRDVFRRRVG--SNLTVTSLGGFDTCYS--------VPIVAPTITLMF-SGMNVT 372
A+ A +D +V + D C++ + V P + ++F +G ++
Sbjct: 313 QAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKLS 372
Query: 373 L-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAREL 431
L P++ L HS CL + + ++L I +N + YD N ++G +
Sbjct: 373 LSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIV---VRNTLVTYDRHNEKIGFWKTN 429
Query: 432 CT 433
C+
Sbjct: 430 CS 431
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 95/367 (25%), Positives = 161/367 (43%), Gaps = 58/367 (15%)
Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSS-------------TVFNSAQSTTFKNLG 148
IGTP + ++A+D+ +D WVPC CV C+ + ++ +QS+T K L
Sbjct: 103 DIGTPHVSFMVALDSGSDLFWVPCD-CVQCAPLSASHYSSLDRDLSEYSPSQSSTSKQLS 161
Query: 149 CQAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLA--------TDIVPGYT 198
C C PN +C +++ Y + + +++ L +D I LA T +
Sbjct: 162 CSHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNTSVKAPVI 221
Query: 199 FGCIQKATG---NSVPPQGLLGLGRGSLSL---LAQTQNLYQSTFSYCLPSFKALSFSGS 252
GC K +G + V P GLLGLG +S+ LA+ L Q++FS C SG
Sbjct: 222 IGCGMKQSGGYLDGVAPDGLLGLGLQEISVPSFLAKA-GLIQNSFSMCFNEDD----SGR 276
Query: 253 LRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTII 312
+ G G P + P LK + Y V + VG + + ++
Sbjct: 277 IFFGDQG-PATQQSAPFLKLNGNYTTYIVGVEVCCVGTSCLK----------QSSFSALV 325
Query: 313 DSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY-----SVPIVAPTITLMFS 367
DSGT FT L + + + F +V ++ + + CY +P + P++ L+F
Sbjct: 326 DSGTSFTFLPDDVFEMIAEEFDTQVNASRSSFEGYSWKYCYKTSSQDLPKI-PSLRLIFP 384
Query: 368 GMNVTLPQDNL-LIHSTAGSIT-CLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRL 425
N + Q+ + +I+ G I CLA+ A ++ + I +R+++D N +L
Sbjct: 385 QNNSFMVQNPVFMIYGIQGVIGFCLAIQPADGDIGT----IGQNFMMGYRVVFDRENLKL 440
Query: 426 GVARELC 432
G +R C
Sbjct: 441 GWSRSNC 447
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 84/334 (25%), Positives = 147/334 (44%), Gaps = 39/334 (11%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQC 154
Y++ +GTP++T ++ +DT + +WV C C GC + F ++STT + C + C
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 155 -KQVPNPTCGGGA----CAFNLTYGSSTIAAN-LSQDTISLA-TDIVPGYTFGCIQKATG 207
+P C C F ++Y + + L QDT++ + +PG++FGC + G
Sbjct: 60 LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDSFG 119
Query: 208 NSV--PPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLP-SFKALSF----SGSLRLGPIGQ 260
+ GLLG+G G +S+L Q+ + FSYCLP F +G LG
Sbjct: 120 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSLGGKIA 178
Query: 261 PKR--IKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVF 318
R ++YT ++ + + L++V+L AI V + + P + G + DSG+
Sbjct: 179 ATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF-----SRKGVVFDSGSEL 233
Query: 319 T----RLVAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCYSVPIV----APTITLMF-SGM 369
+ R ++ +R++ RR + CY + V P I+L F G
Sbjct: 234 SYIPDRALSVLSQRIRELLLRRGAAEEESER-----NCYDMRSVDEGDMPAISLHFDDGA 288
Query: 370 NVTLPQDNLLIHSTAGSITCLAMAAAPDNVNSVL 403
L + + + +A AP S++
Sbjct: 289 RFDLGSHGVFVERSVQEQDVWCLAFAPTESVSII 322
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 100/394 (25%), Positives = 162/394 (41%), Gaps = 45/394 (11%)
Query: 62 LAKDQARLQ-FLSSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQTLLMAMDTSNDA 120
LA QAR++ L S + V P +GR I + IG P L+ MDT +D
Sbjct: 71 LANIQARIEGSLVSNNDYKARVSPSLTGRTI------MANISIGQPPIPQLVVMDTGSDI 124
Query: 121 AWVPCTGCVGCSSTV---FNSAQSTTFKNLG---CQAAQCKQVPNPTCGGGACAFNLTYG 174
WV CT C C + + F+ ++S+TF L C C+ P P F +TY
Sbjct: 125 LWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCKTPCDFEGCRCDPIP--------FTVTYA 176
Query: 175 -SSTIAANLSQDTISL-----ATDIVPGYTFGCIQKATGNSVPPQ-GLLGLGRGSLSLLA 227
+ST + +DT+ T + FGC ++ P G+LGL G SL+
Sbjct: 177 DNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVT 236
Query: 228 QTQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIKYTPLLKNPRRSSLYYVNLLAIR 287
+ FSYC+ + ++ + G TP + YYV + I
Sbjct: 237 K----LGQKFSYCIGNLADPYYNYHQLILGEGADLEGYSTPF---EVYNGFYYVTMEGIS 289
Query: 288 VGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVRDVFRRRVGSNLTVTSLG 347
VG + +DI P + G IID+G+ T LV + + R +G + ++
Sbjct: 290 VGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIE 349
Query: 348 G------FDTCYSVPIVA-PTITLMFS-GMNVTLPQDNLLIHSTAGSITCLAMAAAPD-N 398
F S +V P +T FS G ++ L + + ++ C+ + N
Sbjct: 350 KSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDSGSFF-NQLNDNVFCMTVGPVSSLN 408
Query: 399 VNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+ S ++I + QQ++ + YD+ N + R C
Sbjct: 409 IKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDC 442
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 157/364 (43%), Gaps = 46/364 (12%)
Query: 97 YIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQAAQ 153
Y R IGTP Q + +D+ + +VPC+ C C F S+T+ Q +
Sbjct: 93 YTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTY-----QPVK 147
Query: 154 CKQVPNPTCGGGACAFNLTYGS-STIAANLSQDTISLATD--IVPGY-TFGCIQKATGN- 208
C N C + Y S+ L +D IS + + P FGC TG+
Sbjct: 148 CNMDCNCDDDREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDL 207
Query: 209 -SVPPQGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQPKRIK 265
S G++GLG+G LSL+ Q + L ++F C GS+ LG P +
Sbjct: 208 YSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDV--GGGSMILGGFDYPSDMV 265
Query: 266 YTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPA 325
+T +P RS Y ++L IRV + + + + F+ GA ++DSGT + L A
Sbjct: 266 FTD--SDPDRSPYYNIDLTGIRVAGKQLSL--HSRVFDGEHGA--VLDSGTTYAYLPDAA 319
Query: 326 YTAVRDVFRRRVGSNLTVTSLGG-----FDTCYSVPI---------VAPTITLMF-SGMN 370
+ A + R V T+ + G DTC+ V + P++ ++F SG +
Sbjct: 320 FAAFEEAVMREVS---TLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQS 376
Query: 371 VTL-PQDNLLIHSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
L P++ + HS CL + + ++L I +N ++YD NS++G R
Sbjct: 377 WLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIV---VRNTLVVYDRENSKVGFWR 433
Query: 430 ELCT 433
C+
Sbjct: 434 TNCS 437
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 152/363 (41%), Gaps = 55/363 (15%)
Query: 102 KIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST------------VFNSAQSTTFKNLGC 149
+IGTP ++A+DT +D WVPC C C++T V+N S+T K + C
Sbjct: 101 QIGTPGVKFMVALDTGSDLFWVPC-DCTRCAATDSSAFASDFDLNVYNPNGSSTSKKVTC 159
Query: 150 QAAQCKQVPNPTCGGGACAFNLTYGSSTIAAN--LSQDTISLATD------IVPGYTFGC 201
+ C C + ++Y S+ + + L +D + L + + FGC
Sbjct: 160 NNSLCMHRSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGC 219
Query: 202 IQKATG---NSVPPQGLLGLGRGSLSL--LAQTQNLYQSTFSYCLPSFKALSFSGSLRLG 256
Q +G + P GL GLG +S+ + + +FS C G + G
Sbjct: 220 GQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF----GRDGIGRISFG 275
Query: 257 PIGQPKRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGT 316
G + + TP NP + Y + + +RVG ++D+ AL DSGT
Sbjct: 276 DKGSFDQDE-TPFNLNPSHPT-YNITVTQVRVGTTLIDVEFTAL-----------FDSGT 322
Query: 317 VFTRLVAPAYTAVRDVFRRRVGSNLTVT-SLGGFDTCYSVPIVA-----PTITLMFSGMN 370
FT LV P YT + + F +V + S F+ CY + A P+++L G +
Sbjct: 323 SFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSVSLTMGGGS 382
Query: 371 VTLPQDNLLIHSTAGS-ITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVAR 429
D ++I ST + CLA+ + LN+I +R+++D LG +
Sbjct: 383 HFAVYDPIIIISTQSELVYCLAVVKTAE-----LNIIGQNFMTGYRVVFDREKLVLGWKK 437
Query: 430 ELC 432
C
Sbjct: 438 FDC 440
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 151/369 (40%), Gaps = 48/369 (13%)
Query: 95 PTYIVRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST---VFNSAQSTTFKNLGCQA 151
P Y+ IGTP Q +D + + W C+ C C VF S+TFK C
Sbjct: 43 PYYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGT 102
Query: 152 AQCKQVPNPTCGGGACAFN-----LTYGSSTIAANLSQDTISLATDIVPGYTFGCIQKAT 206
A C+ +P +C G C++ L +S AA DT ++ T V FGC+ +
Sbjct: 103 AVCESIPTRSCSGDVCSYKGPPTQLRGNTSGFAAT---DTFAIGTATVR-LAFGCVVASD 158
Query: 207 GNSVP-PQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLG---PIGQPK 262
+++ P G +GLGR SL+AQ + + FSYCL S + S L LG + +
Sbjct: 159 IDTMDGPSGFIGLGRTPWSLVAQ---MKLTRFSYCL-SPRNTGKSSRLFLGSSAKLAGSE 214
Query: 263 RIKYTPLLK---NPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTV-- 317
P +K + S+ Y ++L AIR G + T +G I+ TV
Sbjct: 215 STSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI----------ATAQSGGILVMHTVSP 264
Query: 318 FTRLVAPAYTAVRDVFRRRVGS---NLTVTSLGGFDTCYSVP-----IVAPTITLMFSG- 368
F+ LV AY A + VG T FD C+ AP + F G
Sbjct: 265 FSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGA 324
Query: 369 MNVTLPQDNLLI----HSTAGSITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSR 424
+T+P LI L+MA ++V+ ++QQ++ LYD+
Sbjct: 325 AALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKET 384
Query: 425 LGVARELCT 433
L C+
Sbjct: 385 LSFEPADCS 393
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/418 (24%), Positives = 162/418 (38%), Gaps = 42/418 (10%)
Query: 41 SPCSPFKPSKPLSWEESVLEMLAKDQ--ARLQFLSSLAVARKSVVPIASGRQITQSPTYI 98
SP SPF + + S D R +S A +S + + G Y+
Sbjct: 46 SPNSPFYNALEAAATRSTNASQHYDAQIGRFNLMSDSYYASQSELNFSKGN-------YL 98
Query: 99 VRAKIGTPAQTLLMAMDTSNDAAWVPCTGCVGCSST--VFNSAQSTTFKNLGCQAAQCKQ 156
++ +GTP +L D + D W+PC C C+ F ++S+T+ + C++ QC+
Sbjct: 99 IKISVGTPPAEILALADITGDLTWLPCKTCQDCTKDGFTFFPSESSTYTSAACESYQCQI 158
Query: 157 VPNPTCGGGACAFNL-----TYGSSTIAANLSQDTISLATD-----IVPGYTFGCIQKAT 206
C C + S T ++ DTIS + P F C
Sbjct: 159 TNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQALSYPNTNFICGTFID 218
Query: 207 GNSVPPQGLLGLGRGSLSLLAQTQNLYQSTFSYCLPSFKALSFSGSLRLGPIG--QPKRI 264
G++GLGRG S+ +Q ++L TFS CL + + S + G G + +
Sbjct: 219 NWHYIGAGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSSKQ-SSKINFGLKGVVSGEGV 277
Query: 265 KYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAP 324
TP+ + S Y++ L A+ VG V A F + ID T FT L
Sbjct: 278 VSTPIADDG-ESGAYFLFLEAMSVGGNRV-----ANNFYSAPKSNIYIDWRTTFTSLPHD 331
Query: 325 AYTAVRDVFRRRVGSNLTVTSLGG---FDTCYSVP----IVAPTITLMFSGMNVTLPQDN 377
Y V R+ + NLT + CY AP IT+ F+ +V L N
Sbjct: 332 FYENVEAEVRKAI--NLTPINYNNERKLSLCYKSESDHDFDAPPITMHFTNADVQLSPLN 389
Query: 378 LLIHSTAGSITCLAMAAAPDNVNSVLN--VIANMQQQNHRILYDVPNSRLGVARELCT 433
+ ++ C A N + V + QQ N + YD+ +S + + CT
Sbjct: 390 TFVR-MDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNFIVGYDLKSSTVSFKQADCT 446
>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
Length = 191
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 77/167 (46%), Gaps = 8/167 (4%)
Query: 271 KNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRLVAPAYTAVR 330
K + YYV + ++ VG V++IP + GTIIDSGT + PAY ++
Sbjct: 25 KENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIK 84
Query: 331 DVFRRRVGSNLTVTSLGGFDTCYSVPIVA----PTITLMF-SGMNVTLPQDNLLIHSTAG 385
F +V + CY+V V P+ ++F G T P +N I
Sbjct: 85 QAFVNKVKRYPILDDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIKLEPE 144
Query: 386 SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
I CLA+ P +S +++I N QQQN ILYD SRLG A C
Sbjct: 145 DIVCLAILGTP---HSAMSIIGNYQQQNFHILYDTKRSRLGFAPRRC 188
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 106/421 (25%), Positives = 178/421 (42%), Gaps = 54/421 (12%)
Query: 51 PLSWEESVLEMLAKDQARL-QFL-SSLAVARKSVVPIASGRQITQSPTYIVRAKIGTPAQ 108
PL+ + E+ A+D+ R +FL SS+ V P+ + Y R +G+P +
Sbjct: 23 PLNQRVELDELKARDRVRHGRFLQSSVGVVD---FPVEGTYDPYRVGLYFTRVLLGSPPK 79
Query: 109 TLLMAMDTSNDAAWVPCTGCVGCSST--------VFNSAQSTTFKNLGCQAAQCK---QV 157
+ +DT +D WV C C GC + F+ S+T + C +C Q
Sbjct: 80 EFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQS 139
Query: 158 PNPTCG--GGACAFNLTYGSST------IAANLSQDTI--SLATDIVPGYTFGCIQKATG 207
+ C G C + YG + ++ L+ D I S T+ FGC TG
Sbjct: 140 SDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSASIVFGCSISQTG 199
Query: 208 NSVPP----QGLLGLGRGSLSLLAQ--TQNLYQSTFSYCLPSFKALSFSGSLRLGPIGQP 261
+ G+ G G+ +S+++Q +Q + FS+CL L I +
Sbjct: 200 DLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGILVLG--EIVE- 256
Query: 262 KRIKYTPLLKNPRRSSLYYVNLLAIRVGRRVVDIPPGALQFNPTTGAGTIIDSGTVFTRL 321
+ I Y+PL+ + Y +NL +I V + + I P F +T GTI+DSGT L
Sbjct: 257 EDIVYSPLVPSQPH---YNLNLQSISVNGKSLAIDPEV--FATSTNRGTIVDSGTTLAYL 311
Query: 322 VAPAYTAVRDVFRRRVGSNLTVTSLGGFDTCY----SVPIVAPTITLMFSG---MNVTLP 374
AY V ++ L CY SV + PT++L F+G MN+ P
Sbjct: 312 AEEAYDPFVSAITEAVSQSVRPL-LSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLK-P 369
Query: 375 QDNLLIHSTAG--SITCLAMAAAPDNVNSVLNVIANMQQQNHRILYDVPNSRLGVARELC 432
+D LL ++ G ++ C+ + ++ ++ ++ +YD+ R+G A C
Sbjct: 370 EDYLLQQNSIGDAAVWCIGFQKIQ---GQGITILGDLVLKDKIFVYDLAGQRIGWANYDC 426
Query: 433 T 433
+
Sbjct: 427 S 427
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.134 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,474,907,821
Number of Sequences: 23463169
Number of extensions: 259548537
Number of successful extensions: 638413
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 872
Number of HSP's successfully gapped in prelim test: 3138
Number of HSP's that attempted gapping in prelim test: 630590
Number of HSP's gapped (non-prelim): 4395
length of query: 433
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 288
effective length of database: 8,957,035,862
effective search space: 2579626328256
effective search space used: 2579626328256
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)